arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.06483 2026-06-05 math.ST math.OC math.SP stat.ME stat.TH

Statistically and Computationally Optimal Estimation and Inference of Common Subspaces

公共子空间的统计与计算最优估计与推断

Joshua Agterberg

AI总结 针对多个对称低秩矩阵共享公共子空间的问题,提出基于投影梯度下降和谱平方和初始化的估计器,在强估计信噪比下达到最优 sinΘ 误差率,并在强推断信噪比下实现渐近正态分布的自适应置信区间。

详情
AI中文摘要

给定多个数据矩阵,统计和数据科学中的许多问题依赖于估计一个捕获所有数据矩阵共享的某种结构的公共子空间。在本文中,我们研究了公共子空间模型的统计和计算极限,其中观测到一组由噪声扰动的对称低秩矩阵,每个低秩矩阵共享相同的公共子空间。我们的主要结果识别了信噪比(SNR)的几个区域,使得估计和推断在统计或计算上最优,我们将这些区域称为弱SNR、中等SNR、强估计SNR和强推断SNR。首先,我们提出了一种基于投影梯度下降的估计器,通过谱平方和初始化,并证明它在强估计SNR下达到了最优的$\sinΘ$误差率。这些结果由统计和计算下界补充,这些下界识别了弱和中等估计SNR区域。接下来,我们转向$\sinΘ$距离本身的统计推断,并证明我们的估计器在强推断SNR区域具有渐近高斯分布。基于这一极限结果,我们提出了置信区间,并证明它们在强推断SNR区域是自适应极小化最优的,其中自适应性以SNR衡量。最后,我们证明在强推断SNR区域以下,自适应置信区间在信息论上是不可能的。因此,我们的结果揭示了一个新现象:尽管SNR“高于”估计的计算极限,但自适应统计推断在信息论上可能仍然是不可能的。

英文摘要

Given multiple data matrices, many problems in statistics and data science rely on estimating a common subspace that captures certain structure shared by all the data matrices. In this paper we investigate the statistical and computational limits for the common subspace model in which one observes a collection of symmetric low-rank matrices perturbed by noise, where each low-rank matrix shares the same common subspace. Our main results identify several regimes of the signal-to-noise ratio (SNR) such that estimation and inference are statistically or computationally optimal, and we refer to these regimes as weak SNR, moderate SNR, strong estimation SNR, and strong inference SNR. First, we propose an estimator based on projected gradient descent initialized via spectral sum of squares and show that it achieves the optimal $\sinΘ$ error rate under strong estimation SNR. These results are complemented by both statistical and computational lower bounds identifying the weak and moderate estimation SNR regimes. Next, we turn to statistical inference for the $\sinΘ$ distance itself, and we show that our estimator has an asymptotically Gaussian distribution in the strong inference SNR regime. Based on this limiting result we propose confidence intervals and show that they are adaptively minimax optimal in the strong inference SNR regime, where adaptivity is measured in terms of the SNR. Finally, we show that adaptive confidence intervals are information-theoretically impossible below the strong inference SNR regime. Consequently, our results unveil a novel phenomenon: despite the SNR being ``above'' the computational limit for estimation, adaptive statistical inference may still be information-theoretically impossible.

2606.06469 2026-06-05 math.ST cs.LG math.PR stat.TH

How abundant are good interpolators?

好的插值器有多丰富?

August Y. Chen, Ahmed El Alaoui

AI总结 在高维比例下,通过大偏差原理研究随机均匀选择的线性插值分类器的泛化误差分布,发现几乎所有插值分类器具有相同的泛化性能,而高效算法(如梯度下降)优于大多数插值器。

详情
Comments
140 pages
AI中文摘要

设 $S$ 是单位范数线性分类器 $\theta\in \mathbb{R}^d$ 的集合,这些分类器以预先固定的可能负的间隔 $\kappa$ 正确分类标记数据集 $(X_i,y_i)_{i=1}^n$ 中的每个点,其中 $X_i \in \mathbb{R}^d$,$y_i \in \{-1,+1\}$。在两种自然的数据生成分布——高斯混合模型和具有高斯特征的逻辑模型——以及比例 $n/d \to \alpha$ 且 $\alpha$ 足够小的条件下,我们建立了关于事件(从 $S$ 中均匀随机选择的点 $\theta$ 达到给定泛化误差)的大偏差原理,且该事件以高概率依赖于数据的选择。相关的速率函数是确定性的,描述了在 $d$ 的指数尺度上具有给定期望性能的插值分类器的比例。作为推论,我们建立了以下集中现象:除了指数小的一部分外,所有插值分类器都具有大致相同的泛化性能,该性能由该速率函数的唯一最大值给出。我们将该最大值与通过梯度下降的经验风险最小化和自然线性规划的性能进行了数值比较,两者都找到了 $S$ 中的一个点,并推断出在 $\alpha$ 小的过参数化区域中,这些高效方法优于绝大多数插值器,指出了它们在此设置中非平凡的良性过拟合。

英文摘要

Let $S$ be the set of unit norm linear classifiers $θ\in \mathbb{R}^d$ which correctly classify every point of a labeled dataset $(X_i,y_i)_{i=1}^n$, $X_i \in \mathbb{R}^d$, $y_i \in \{-1,+1\}$, with a possibly negative margin $κ$ fixed in advance. Under two natural data-generating distributions of the $(X,y)$ pairs -- a Gaussian mixture model and a logistic model with Gaussian features -- and in the proportional regime $n/d \to α$ with small enough $α$, we establish a large deviation principle on the event that a point $θ$ chosen uniformly at random from $S$ achieves a given generalization error, with high probability over the choice of the data. The associated large deviation rate function is deterministic and describes the proportion, at the exponential scale in $d$, of interpolating classifiers having a given desired performance. As a consequence, we establish the following concentration phenomenon: all but an exponentially small fraction of interpolating classifiers have approximately the same generalization performance given by the unique maximizer of this rate function. We numerically compare this maximizer to the performance of empirical risk minimization by gradient descent and to the performance of a natural linear program, both finding a point in $S$, and deduce that in the overparametrized regime of small $α$, these efficient procedures outperform the vast majority of interpolators, pointing to their nontrivial benign overfitting in this setting.

2606.06441 2026-06-05 stat.ME stat.AP

Leveraging External Controls for Treatment Switching in Randomized Controlled Trials: A Weighted Causal Inference Framework for Overall Survival

利用外部对照处理随机对照试验中的治疗转换:一个针对总生存期的加权因果推断框架

Andy A. Shen, Chenqi Fu, Ray Lin

AI总结 针对随机对照试验中因治疗转换导致的生存期估计偏倚,提出一个利用外部对照的加权因果推断框架,通过合成对照法和时变权重校正偏倚,模拟和实例验证优于传统方法。

详情
AI中文摘要

在许多以总生存期为主要终点的肿瘤学临床试验中,患者被允许从对照组转换到实验治疗组或其他合适的疗法。转换可能因多种原因发生,包括疾病进展。这违反了随机治疗分配的因果保证,导致治疗效果估计偏倚。现有方法通常需要强假设、复杂的模型规范或两者兼有。在本文中,我们提出了一个通用框架,该框架整合外部对照以处理随机对照试验中的治疗转换。利用合成对照法和观察性因果推断中的平衡权重,我们提出了几种估计量,这些估计量使用多重插补和时变权重来调整治疗转换。我们还讨论了选择外部对照风险集以进行插补的方法。通过广泛的模拟研究,我们表明,与以朴素方式利用外部对照的标准调整方法或根本不利用外部对照的方法相比,我们提出的方法在统计上具有有意义的改进。然后,我们通过两个III期肿瘤学试验展示了基于外部对照的方法的实用性。

英文摘要

In many oncology clinical trials where overall survival is a key endpoint, patients are permitted to switch from the control arm to the experimental treatment arm or other suitable therapies. Switching can occur for various reasons, including disease progression. This violates the causal guarantees of randomized treatment assignment, resulting in biased treatment effect estimates. Existing methods often require strong assumptions, complicated model specifications, or both. In this paper, we propose a general framework that incorporates external controls to account for treatment switching in randomized controlled trials. Leveraging the synthetic control method and balancing weights from observational causal inference, we propose several estimators that use multiple imputation and time-varying weights to adjust for treatment switching. We also discuss approaches to selecting the risk set of external controls to impute from. Through extensive simulation studies, we show that our proposed methods lead to meaningful statistical improvements relative to standard adjustment methods that utilize external controls in naive ways or those that do not utilize external controls at all. We then demonstrate the utility of our external control-based approaches with two phase III oncology trials.

2606.06440 2026-06-05 cs.LG stat.ML

Causal Atlases from Entropic Inference: Bayesian Networks beyond Optimal DAGs

来自熵推理的因果图谱:超越最优DAG的贝叶斯网络

Hazhir Aliahmadi, Irina Babayan, Greg van Anders

AI总结 针对数据驱动因果识别中多因果链问题,提出基于熵推理的因果图谱方法,通过最大熵系综采样量化因果结构歧义性。

详情
Comments
18 pages, 2 figures
AI中文摘要

数据驱动的因果关系识别对于理解科学内外的复杂系统至关重要。贝叶斯网络通过有向无环图(DAG)为建模通用因果关系提供了一种概率方法。然而,构建贝叶斯网络的典型技术依赖于优化,这可能不适合学习因果关系,因为底层数据可能允许多条因果链。更忠实于数据的因果关系表示将提供构建多个因果图的框架,这些因果图与底层数据固有的变异性一致。在这里,我们展示了基于熵的推理生成了与底层数据一致的合理因果关系的图谱。在2节点和20节点线性结构方程模型的模拟噪声数据上,我们对图的最大熵系综进行采样,从而量化底层因果关系中固有的结构歧义性。我们的方法表明,“优化”的DAG可能包含在同等精确的拓扑中不一致的因果伪影。

英文摘要

Data-driven causal relationship identification is pertinent to advancing understanding of complex systems both within and beyond science. Bayesian networks offer a probabilistic method for modelling generic causal relationships via directed acyclic graphs (DAGs). However, typical techniques for constructing Bayesian networks rely on optimization, which can be ill-suited for learning causal relationships because the underlying data may admit multiple chains of causation. More data-faithful representations of causal relationships would provide frameworks for constructing multiple causal maps that are consistent with the variability that is inherent in underlying data. Here, we show that entropy-based inference generates atlases of plausible causal relationships that are consistent with underlying data. On simulated noisy data of 2- and 20-node linear structural equation models, we sample a maximum-entropy ensemble of graphs that allow us to quantify the inherent structural ambiguity in underlying causal relationships. Our method shows that "optimized" DAGs can contain causal artifacts are not consistent across equivalently accurate topologies.

2606.06426 2026-06-05 stat.ME

A Robust Framework for Model Order Selection in Correlated Large-Dimensional CES Noise

相关大维CES噪声下模型阶数选择的鲁棒框架

Eugénie Terreaux, Emmanuelle Jay, Frédéric Pascal, Jean-Philippe Ovarlez

AI总结 针对大维相关非高斯CES噪声中的模型阶数选择问题,提出一个两阶段鲁棒框架,包括基于Toeplitz修正M估计的白化步骤和基于大维随机矩阵理论的信号子空间秩推断,并证明了估计量的一致性和特征值上界。

详情
Comments
13 pages (Main Paper), 6 pages (Supplementary Material), 9 figures
AI中文摘要

本文研究了大维、相关、非高斯噪声下的模型阶数选择问题。假设源信号嵌入在具有未知Toeplitz结构散射矩阵的加性复椭圆对称(CES)噪声中。我们提出了一个两阶段鲁棒框架:(i)基于Toeplitz修正的散射矩阵M估计的白化步骤,以及(ii)通过大维随机矩阵理论(RMT)进行信号子空间秩推断。在观测维度m和样本量N成比例增长的条件下,建立了所提估计量的几乎必然一致性,并给出了区分信号与噪声分量的显式RMT特征值上界。基于样本协方差矩阵(SCM)、Maronna的M估计和无分布Tyler M估计,推导了三个估计分支用于白化。该方法在合成数据、真实高光谱图像、脑电图记录和金融数据上进行了验证,相比AIC和未白化方法有显著改进。

英文摘要

This paper addresses model order selection under large-dimensional, correlated, non-Gaussian noise. Sources are assumed to be embedded in additive Complex Elliptically Symmetric (CES) noise with an unknown Toeplitz-structured scatter matrix. We propose a two-stage robust framework: (i) a noise-whitening step based on a Toeplitz-rectified $M$-estimator of the scatter matrix, and (ii) signal subspace rank inference via large-dimensional Random Matrix Theory (RMT). Almost sure consistency of the proposed estimators is established, together with explicit RMT eigenvalue upper bounds separating signal from noise components, in the regime where the observation dimension $m$ and the sample size $N$ grow proportionally. Three estimation branches are derived, based respectively on the sample covariance matrix (SCM), Maronna's $M$-estimator, and the distribution-free Tyler $M$-estimator for whitening. The methodology is validated on synthetic data, real hyperspectral images, EEG recordings, and financial data, with significant gains over AIC and unwhitened methods.

2606.06412 2026-06-05 cond-mat.dis-nn quant-ph stat.ML

Nonreversible Gauge Fields in Fokker--Planck Dynamics: Supersymmetric Hamiltonians and Learned Finite Forces

福克-普朗克动力学中的不可逆规范场:超对称哈密顿量与学习到的有限力

Masayuki Ohzeki

AI总结 本文通过规范场形式化描述保持稳态密度的不可逆扰动,将福克-普朗克动力学与超对称哈密顿量联系起来,并引入有限时间正则化目标与演员-评论家程序学习最优规范场。

详情
Comments
33 pages, 3 figures
AI中文摘要

我们将保持稳态密度的福克-普朗克动力学不可逆扰动表述为规范场,这些规范场在保持不变状态固定的同时改变弛豫谱。当细致平衡成立时,相似变换将可逆福克-普朗克算子映射为Witten-Laplacian型超对称哈密顿量;不可逆规范则表现为非厄米扰动,保持零模但修改激发谱。这种算子观点为弛豫间隙、循环概率流、低协方差加速和有限控制成本提供了共同语言。我们用反对称张量场表示允许的规范流,并将违反细致平衡的Ohzeki-Ichiki力识别为常数辛示例,其无限强度极限为哈密顿动力学。连续时间谱间隙本身不选择有限规范强度,因此我们引入有限时间正则化目标和演员-评论家程序来学习规范。一个精确可解的各向异性高斯Ornstein-Uhlenbeck基准将谱跃迁与有限时间最优解分离,表明学习到的规范恢复了Lyapunov方程最优解。一个双阱基准随后说明了在非凸亚稳态景观中的相同约束选择。随机梯度方法作为物理相关的福克-普朗克系统进入该框架:小批量噪声充当有效扩散张量,而自适应方法如Adam对应于可能具有非平衡流的度量选择。

英文摘要

We formulate stationary-density-preserving nonreversible perturbations of Fokker--Planck dynamics as gauge fields that deform relaxation spectra while leaving the invariant state fixed. When detailed balance holds, a similarity transformation maps the reversible Fokker--Planck operator to a Witten-Laplacian-type supersymmetric Hamiltonian; nonreversible gauges then appear as non-Hermitian perturbations that preserve the zero mode but modify the excited spectrum. This operator viewpoint gives a common language for relaxation gaps, circulating probability currents, hypocoercive acceleration, and finite control costs. We represent admissible gauge currents by antisymmetric tensor fields and identify the detailed-balance-violating Ohzeki--Ichiki force as a constant symplectic example whose infinite-strength limit is Hamiltonian dynamics. The continuous-time spectral gap alone does not select a finite gauge strength, so we introduce a finite-time regularized objective and an actor--critic procedure for learning the gauge. An exactly solvable anisotropic Gaussian Ornstein--Uhlenbeck benchmark separates the spectral transition from the finite-time optimum and shows that the learned gauge recovers the Lyapunov-equation optimum. A double-well benchmark then illustrates the same constrained selection in a nonconvex metastable landscape. Stochastic gradient methods enter this framework as physically relevant Fokker--Planck systems: mini-batch noise acts as an effective diffusion tensor, and adaptive methods such as Adam correspond to metric choices with possible nonequilibrium currents.

2606.06411 2026-06-05 stat.ME

Smooth Concordance Metrics for Survival Models

生存模型的平滑一致性指标

Nicholas Hartman, Grace Richards

AI总结 针对传统一致性指标仅依赖排序且对模型改进不敏感的问题,提出将风险判别概率建模为预测风险得分差的连续函数,从而评估模型在整个风险得分差异范围内的性能。

详情
AI中文摘要

一致性指标广泛用于评估预测性生存模型区分潜在风险水平的能力。然而,这些统计量也因仅使用模型预测风险得分的排序,且对重要模型特征(如向模型添加强预测变量)不敏感而受到批评。在本文中,我们通过开发平滑一致性指标来解决这些局限性,该指标将潜在的风险判别概率建模为预测风险得分差的连续函数,这些函数的形状根据观测数据估计。因此,这些平滑一致性指标评估模型在可能的风险得分差异整个范围内的性能,从而能够识别候选模型表现特别好或优于其他模型的特定场景。模拟表明,所提出的平滑一致性指标提供了关于风险判别性能的更详细信息,并且对添加有意义的预测变量更为敏感。我们将这些方法应用于比较癌症复发的预测性生存模型。

英文摘要

Concordance indices are widely popular metrics for assessing the ability of predictive survival models to discriminate underlying risk levels. However, these statistics have also been criticized for using only the rank orderings of the model's predicted risk scores and being insensitive to important model features, such as the addition of strong predictor variables into the model. In this paper, we address these limitations by developing smooth concordance metrics that model the underlying risk discrimination probabilities as continuous functions of the predicted risk score differences, where the shapes of these functions are estimated from the observed data. As a result, these smooth concordance metrics assess model performance across the entire range of possible risk score differences, allowing one to identify specific scenarios where the candidate model performs especially well or better than other models. Simulations show that the proposed smooth concordance metrics provide more detailed information about risk discrimination performance and are much more sensitive to the addition of meaningful predictors. We apply these methods to compare predictive survival models for cancer recurrence.

2606.06391 2026-06-05 stat.ML cs.LG

Conformal Risk Sharing: Certified Cost Allocation with Participation Guarantees

共形风险分担:具有参与保证的认证成本分配

Ieva Kazlauskaite

AI总结 提出共形风险分担方法,通过可解释的分担策略与分裂共形校准相结合,从有限数据中无分布假设地分配罕见事件的财务影响,为每个参与者提供义务上限并验证无人受损。

详情
AI中文摘要

将罕见不利事件的财务影响在群体中分担可以减轻极端个人负担,但任何因该安排而变得更糟的参与者都有理由退出。因此,一个可信的机制必须为每个代理人提供其未来义务的可信上限,并且只有在参与者之间的总损害有界时才应部署。我们将此形式化为认证分配问题:从有限数据中,无需分布假设,找到一种再分配规则,为每个参与者产生义务上限,并验证没有参与者实质上变得更糟。我们提出共形风险分担,通过将可解释的分担策略与分裂共形校准相结合来解决这个问题。分担强度在训练数据上调整,而保留的校准数据产生无分布假设的每个代理保证(在可交换性下有效)。在合成和真实数据(包括降水和能源合作社数据)上的实验证实,该框架可以显著降低高风险代理的极端义务,同时控制对他人的损害。

英文摘要

Sharing the financial impact of rare adverse events across a group can soften extreme individual burdens, but any participant made worse off by the arrangement has reason to leave. A credible mechanism must therefore provide each agent with a trustworthy cap on their future obligation and should be deployed only if the aggregate harm across participants is bounded. We formalise this as the Certified Allocation Problem: from finite data and without distributional assumptions, find a redistribution rule, produce obligation caps for every participant, and verify that no participant is made materially worse off. We propose Conformal Risk Sharing, which solves this problem by pairing an interpretable sharing policy with split conformal calibration. The sharing intensity is tuned on training data, while held-out calibration data produces distribution-free per-agent guarantees (valid under exchangeability). Experiments on synthetic and real-world data, including precipitation and energy-cooperative data, confirm that the framework can substantially reduce extreme obligations for high-risk agents while controlling harm to others.

2606.06384 2026-06-05 math.ST stat.ME stat.ML stat.TH

Estimation of the sub-Gaussian parameter

次高斯参数的估计

Jason Liu, Min Xu, Jinchuan Xing

AI总结 针对均值为零的随机变量的次高斯参数(方差代理),提出基于经验加权累积量生成函数约束最大化的自然估计量,证明其一致性并给出收敛速度,在特定条件下达到根号n速率且极小化最优,并应用于基因本体富集研究中的置换检验p值构造。

详情
Comments
31 pages, 3 figures, and 1 table
AI中文摘要

均值为零的随机变量 $X$ 的次高斯参数(也称为方差代理)定义为 $ξ^2_* = \sup_{λ\in \mathbb{R}} L(λ)$,其中 $L(λ) = \frac{2}{λ^2} \log \mathbb{E} e^{λX}$ 是加权累积量生成函数。尽管次高斯随机变量无处不在,但 $ξ^2_*$ 的估计很少受到关注,且尚未被充分理解。在这项工作中,我们研究了一个基于 $L$ 的经验类比约束最大化的自然估计量。我们证明了该估计量是一致的,并在对 $L$ 的假设下给出了收敛速度:如果 $L$ 存在最大值点,则对于任意 $\varepsilon > 0$,我们的界为 $O_p(n^{-1/2 + \varepsilon})$;如果 $L$ 的最大值点也有界,则界改进为 $O_p(n^{-1/2})$。我们通过证明在所有次高斯分布上的极小化风险为 $Ω(1)$ 来表明对 $L$ 的假设是必要的;对 $L$ 的尾部增长施加越来越强的假设,会产生一个连续类,其极小化下界在 $Ω(1/\log n)$ 和 $Ω(1)$ 之间插值。如果限制在 $L$ 在有界区域内达到上确界的分布子类上,则根号n速率是可能的,此时我们的估计量是极小化最优的。如果基础分布不是次高斯的,我们证明我们的估计量趋向无穷大,其发散速率由分布的尾部控制。最后,我们将我们的估计量应用于基因本体(GO)富集研究中,以构建大规模置换检验的p值,表明它可以作为峰值超过阈值方法的可靠替代,特别是在峰值超过阈值方法有效性不确定的情况下。

英文摘要

The sub-Gaussian parameter (also called the variance proxy) of a mean-zero random variable $X$ is defined as $ξ^2_* = \sup_{λ\in \mathbb{R}} L(λ)$ where $L(λ) = \frac{2}{λ^2} \log \mathbb{E} e^{λX}$ is a weighted cumulant generating function. Despite the ubiquity of sub-Gaussian random variables, the estimation of $ξ^2_*$ has received little attention and is not yet well understood. In this work, we study a natural estimator of $ξ^2_*$ based on constrained maximization of the empirical analogue of $L$. We prove that the estimator is consistent bound the rates of convergence under assumptions on $L$: if $L$ has an maximizer, then our bound is $O_p(n^{-1/2 + \varepsilon})$ for any $\varepsilon > 0$; if the argmax of $L$ is also bounded, then the bound improves to $O_p(n^{-1/2})$. We show that our assumptions on $L$ are necessary by proving that the minimax risk over all sub-Gaussian distributions is $Ω(1)$; imposing increasingly strong assumptions on the tail growth of $L$ yields a continuum of classes whose minimax lower bound interpolates between $Ω(1/\log n)$ and $Ω(1)$. Root-n rate is possible if we restrict to a subclass of distributions where $L$ attains its supremum in a bounded region, in which case our estimator is minimax optimal. If the underlying distribution is not sub-Gaussian, we show that our estimator goes to infinity with a divergence rate controlled by the tail of the distribution. Finally, we apply our estimator in a Gene Ontology (GO) enrichment study to construct p-values for a large-scale permutation test, showing that it can serve as a reliable alternative to the peaks-over-threshold approach, particularly in regimes where the peaks-over-threshold method is of uncertain validity.

2606.06368 2026-06-05 math.ST stat.ME stat.ML stat.TH

Optimally taming biases in black-box models for efficient semiparametric estimation

最优地驯服黑箱模型中的偏差以实现高效半参数估计

Yihong Gu, Qishuo Yin, Tianxi Cai, Jianqing Fan

AI总结 针对半参数估计中黑箱模型估计干扰函数时误差传播问题,提出新估计器达到更优收敛速率并证明最优性,扩展至平均处理效应等线性泛函估计。

详情
Comments
25 pages, 3 figures; comments welcome
AI中文摘要

现代半参数估计通常依赖灵活的黑箱机器学习方法估计干扰函数,这引发了一个基本问题:干扰估计误差如何传播到低维目标参数的推断中?以双重机器学习(DML)为代表的范式给出了误差界,其中干扰估计误差以乘法方式进入。尽管被广泛采用,但对于黑箱模型而言,这种乘法率依赖是否最优仍不清楚。本文首先在结构无关设定下重新审视部分线性模型 $Y = μ_0(X)+T\cdotβ_0+\varepsilon$,其中干扰函数 $μ_0$ 使用通用机器学习模型估计,具有逼近误差 $δ^a_μ$ 和随机误差 $δ_μ^s$。我们证明,在辅助函数 $\mathbb{E}[T|X=x]$ 无法一致估计的情况下,标准 DML 速率并非最优。我们提出 $β_0$ 的新估计器,达到更优速率 $n^{-1/2}+δ^a_μ+(δ_μ^s)^2$,并建立匹配的下界证明其最优性。我们的结果揭示了一个新原理:无需施加任何额外假设即可消除干扰估计的一阶随机误差。这也导致了修正的调参策略,偏好欠平滑,即 $δ^a_μ\asymp(δ_μ^s)^2$,而非经典的偏差-方差权衡 $δ^a_μ\asymp δ_μ^s$。在温和的附加条件下,该估计量渐近正态且具有最小渐近方差。所提方法扩展到一类广泛的半参数线性泛函估计问题,包括平均处理效应估计。我们的结果表明,在使用黑箱干扰学习器的半参数估计中,流行的正交得分方法可以得到显著改进。

英文摘要

Modern semiparametric estimation often relies on flexible black-box machine learning methods to estimate nuisance functions, raising a fundamental question: how do nuisance estimation errors propagate into inference for low-dimensional target parameters? The dominant paradigm, exemplified by double machine learning (DML), yields error bounds in which nuisance estimation errors enter multiplicatively. While widely adopted, it remains unclear whether this multiplicative-rate dependence is optimal for black-box models. In this paper, we start by revisiting the partial linear model $Y = μ_0(X)+T\cdotβ_0+\varepsilon$ under a structure-agnostic setting, where the nuisance function $μ_0$ is estimated using a generic machine learning model, with approximation error $δ^a_μ$ and stochastic error $δ_μ^s$. We show that the standard DML rate is not optimal in the regime where the auxiliary function $\mathbb{E}[T|X=x]$ cannot be consistently estimated. We propose a new estimator for $β_0$ that achieves a sharper rate of $n^{-1/2}+δ^a_μ+(δ_μ^s)^2$ and establish a matching lower bound demonstrating its optimality. Our results reveal a new principle: the first-order stochastic error of nuisance estimation can be eliminated without imposing any additional assumptions. This also leads to a revised tuning strategy favoring under-smoothing, where $δ^a_μ\asymp(δ_μ^s)^2$, rather than the classical bias-variance trade-off $δ^a_μ\asymp δ_μ^s$. Under mild additional conditions, the estimator is asymptotically normal with minimal asymptotic variance. The proposed method extends to a broad class of semi-parametric linear functional estimation problems, including average treatment effect estimation. Our results imply that popular orthogonal score methods in semiparametric estimation with black-box nuisance learners can be substantially improved.

2606.06364 2026-06-05 cs.LG stat.ML

End-to-End Subgraph Detection with GraphDETR

端到端子图检测与GraphDETR

Dexiong Chen, Till Hendrik Schulz, Karsten Borgwardt

AI总结 提出GraphDETR框架,将子图检测视为集合预测问题,通过图神经网络编码目标图、Transformer解码器联合预测所有模式实例,并采用二分匹配端到端训练,支持精确和近似匹配,在多达1000节点的图中检测50节点模式,并在ChEMBL数据集上实现AP100=91.2。

详情
AI中文摘要

子图检测旨在识别查询模式实例是否出现在更大图中及其位置。该问题在科学领域至关重要,且与子图同构密切相关,后者是NP完全的,限制了组合方法只能处理小模式或中等规模图。我们提出GraphDETR,一个深度学习框架,将子图检测公式化为集合预测问题,类似于目标检测中的DETR。GraphDETR使用图神经网络编码目标图,并采用一组固定的可学习查询向量,通过Transformer解码器解码,在单次前向传播中联合预测所有模式实例。这通过端到端训练和二分匹配实现。与传统仅解决精确结构匹配的组合方法不同,GraphDETR自然扩展到近似匹配,使得能够检测超出精确模式对应的实例。实验表明,GraphDETR能够在多达1000个节点的目标图中检测多达50个节点的多样化模式,如分子结构、环、团和模糊模式。我们进一步在ChEMBL数据集上评估分子官能团检测,GraphDETR预测每个分子的完整官能团集合,实现了$ ext{AP}_{100} = 91.2$的强性能。

英文摘要

Subgraph detection seeks to identify whether and where instances of query patterns occur within a larger graph. This problem is fundamental across scientific domains and is closely related to subgraph isomorphism, which is NP-complete, limiting combinatorial approaches to small patterns or moderately sized graphs. We introduce GraphDETR, a deep learning framework that formulates subgraph detection as a set prediction problem, analogous to DETR in object detection. GraphDETR encodes the target graph with a graph neural network, and employs a fixed set of learnable query vectors, decoded via a transformer decoder, to predict all pattern occurrences jointly in a single forward pass. This is enabled by training the model end-to-end with bipartite matching. Unlike traditional combinatorial methods that only solve exact structural matching, GraphDETR naturally extends to approximate matching, enabling detection beyond exact pattern correspondence. Empirically, we show that GraphDETR can detect diverse patterns, such as molecular structures, cycles, cliques, and fuzzy patterns of up to 50 nodes, in target graphs with up to 1000 nodes. We further evaluate on molecular functional group detection over the ChEMBL dataset, where GraphDETR predicts the complete set of functional groups per molecule, achieving a strong performance of $\text{AP}_{100} = 91.2$.

2606.06351 2026-06-05 stat.ML cs.LG

Function-Space Priors for Bayesian Neural ODEs with Application to Vessel Trajectory Prediction

贝叶斯神经常微分方程的函数空间先验及其在船舶轨迹预测中的应用

Jaeyeong Lee, Wonmo Koo, Heeyoung Kim

AI总结 针对船舶轨迹预测中不规则采样、缺失报告和复杂动力学挑战,提出一种在向量场上施加高斯过程核先验的正则化方法,并结合概率多重打靶实现长序列的不确定性量化。

详情
AI中文摘要

从自动识别系统(AIS)数据预测船舶轨迹对于海上态势感知至关重要,但由于不规则采样、缺失报告和复杂动力学,这仍然具有挑战性。除了准确的点预测外,海事应用还需要良好校准的不确定性估计以支持可靠决策。贝叶斯神经常微分方程(ODE)通过在神经向量场参数上放置先验,为具有不确定性量化的连续时间轨迹建模提供了原则性框架。然而,常用的各向同性高斯权重先验无法编码船舶动力学的信息性结构特性,如平滑性和局部性。现有的函数空间贝叶斯神经网络方法解决了静态映射的这一限制,但不能直接转移到神经常微分方程,因为其主要关注量是轨迹而非向量场本身。原则上,可以直接在ODE解上放置高斯过程(GP)先验,但这需要将分布通过非线性ODE求解器传播,这在分析上是棘手的。为了解决这一挑战,我们采用了一种实用方法,直接在有限测量点集上评估的向量场上施加基于GP核的先验。具体来说,我们用基于核的正则化器增强标准权重空间变分目标,该正则化器惩罚向量场偏离GP先验所隐含的结构。为了处理长且不规则的AIS轨迹,我们进一步将这种函数空间正则化与概率多重打靶相结合,该打靶方法在保持全局一致性的同时解耦跨时间段的推理。

英文摘要

Vessel trajectory prediction from Automatic Identification System (AIS) data is essential for maritime situational awareness, yet it remains challenging due to irregular sampling, missing reports, and complex dynamics. Beyond accurate point forecasts, maritime applications also demand well-calibrated uncertainty estimates for reliable decision-making. Bayesian Neural Ordinary Differential Equations (ODEs) offer a principled framework for continuous-time trajectory modeling with uncertainty quantification by placing a prior over the neural vector field parameters. However, the commonly used isotropic Gaussian weight prior fails to encode informative structural properties of vessel dynamics, such as smoothness and locality. Existing function-space Bayesian neural network methods address this limitation for static mappings, but do not transfer directly to Neural ODEs, where the primary quantity of interest is the trajectory rather than the vector field itself. In principle, one could place a Gaussian process (GP) prior directly over ODE solutions, but this requires propagating distributions through a nonlinear ODE solver, which is analytically intractable. To address this challenge, we adopt a practical approach that imposes a GP-kernel-based prior directly on the vector field evaluated at a finite set of measurement points. Specifically, we augment the standard weight-space variational objective with a kernel-based regularizer that penalizes deviations of the vector field from the structure implied by a GP prior. To handle long and irregular AIS trajectories, we further combine this function-space regularization with probabilistic multiple shooting, which decouples inference across temporal segments while maintaining global consistency.

2606.06346 2026-06-05 math.ST stat.ME stat.TH

Unified formulas for conditional quantities and transportation functionals

条件量和输运泛函的统一公式

Roberto Vila, Eduardo Nakano, Chang C. Y. Dorea

AI总结 本文基于分布导数和Dirac delta表示,建立了一个统一的概率框架,用于推导条件期望、条件分布、危险函数等经典概念的统一公式,并利用Fréchet-Hoeffding界和Δ-反调函数推导了绝对差矩的尖锐界,进而得到Wasserstein距离的分位数表示等结果。

详情
Comments
23 pages, 1 figure
AI中文摘要

本文基于分布导数和Dirac delta表示,为条件量和输运相关量的分析建立了一个统一的概率框架。对于任意随机变量(包括绝对连续、离散和混合分布),建立了通用恒等式。所提出的方法为条件期望、条件分布、危险函数和不恰当分布提供了统一公式,揭示了这些经典概念背后的共同局部化机制。该框架进一步与copula方法结合,通过依赖结构研究输运和散度泛函。利用Fréchet-Hoeffding界的极值性质以及由Δ-反调函数诱导的期望不等式,推导了在固定边缘分布下绝对差矩的尖锐界。这些结果导致了Wasserstein距离和相应上输运泛函的分位数表示的简洁推导,以及广义绝对差矩的生存函数表示和界。作为一个特例,得到了二元Gini平均差和关联的二元Gini指数的新表示。给出了在标准化计数分布(包括泊松、二项和负二项模型)的正态逼近中出现的Wasserstein型泛函的应用,并推导了显式的分位数表示。总体而言,这些结果建立了条件结构、依赖建模、散度度量、正态逼近和最优输运之间的显式联系,为概率论和数理统计中的几个基本构造提供了统一视角。

英文摘要

This paper develops a unified probabilistic framework based on distributional derivatives and Dirac delta representations for the analysis of conditional and transportation-related quantities. General identities are established for arbitrary random variables, encompassing absolutely continuous, discrete, and mixed distributions. The proposed approach yields unified formulas for conditional expectations, conditional distributions, hazard functions, and improper distributions, revealing a common localization mechanism underlying these classical concepts. The framework is further combined with copula methods to investigate transportation and dispersion functionals through dependence structures. Exploiting the extremal properties of the Fréchet--Hoeffding bounds together with expectation inequalities induced by $Δ$-antitonic functions, sharp bounds are derived for absolute difference moments under fixed marginals. These results lead to concise derivations of quantile representations for the Wasserstein distance and a corresponding upper transportation functional, as well as survival-function representations and bounds for generalized absolute difference moments. As a particular case, new representations are obtained for the bivariate Gini mean difference and the associated bivariate Gini index. Applications are given to Wasserstein-type functionals arising in the normal approximation of standardized counting distributions, including Poisson, Binomial, and Negative Binomial models, for which explicit quantile representations are derived. Overall, the results establish explicit links among conditional structures, dependence modeling, dispersion measures, normal approximation, and optimal transport, providing a unified perspective on several fundamental constructions in probability and mathematical statistics.

2606.06342 2026-06-05 stat.ML cs.LG

Symmetric Divergence and Normalized Similarity: A Unified Topological Framework for Representation Analysis

对称散度与归一化相似性:表示分析的统一拓扑框架

Yan Wang, Tianyang Hu

AI总结 提出对称表示拓扑散度(SRTD)和归一化拓扑相似性(NTS),分别解决现有拓扑散度的非对称性和无界性问题,实现细粒度结构诊断与跨场景标准化评估。

详情
Comments
Accepted by TMLR
AI中文摘要

拓扑数据分析(TDA)为比较神经表示提供了一种原则性的、内在的视角。然而,现有的配对拓扑散度(如RTD)受到启发式非对称性以及更关键的无界分数(依赖于样本量)的限制,阻碍了可靠的跨场景基准测试。为了解决这些挑战,我们开发了一个统一的拓扑工具包,服务于两个互补的需求:细粒度结构诊断和鲁棒的标准化评估。首先,我们通过引入对称表示拓扑散度(SRTD)及其高效变体SRTD-lite来完善RTD框架。除了解决先前变体的理论非对称性外,SRTD将诊断信息整合到一个单一的、全面的交叉条码签名中。这使得能够精确定位结构差异,并作为有效的优化目标,无需双方向计算的开销。其次,为了在异构设置中实现可靠的基准测试,我们提出了归一化拓扑相似性(NTS)。通过测量层次合并顺序的秩相关性,NTS产生一个介于-1和1之间的尺度不变度量,有效克服了未归一化散度的尺度和样本依赖性。在合成和真实深度学习设置中的实验表明,我们的工具包捕捉到了几何度量无法发现的CNN中的功能变化,并且即使在距离饱和情况下也能鲁棒地映射LLM谱系,提供了一种严格的、拓扑感知的视角,补充了CKA等度量。

英文摘要

Topological Data Analysis (TDA) offers a principled, intrinsic lens for comparing neural representations. However, existing paired topological divergences (e.g., RTD) are limited by heuristic asymmetry and, more critically, unbounded scores that depend on sample size, hindering reliable cross-scenario benchmarking. To address these challenges, we develop a unified topological toolkit serving two complementary needs: fine-grained structural diagnosis and robust, standardized evaluation. First, we complete the RTD framework by introducing Symmetric Representation Topology Divergence (SRTD) and its efficient variant SRTD-lite. Beyond resolving the theoretical asymmetry of prior variants, SRTD consolidates diagnostic information into a single, comprehensive cross-barcode signature. This allows for precise localization of structural discrepancies and serves as an effective optimization objective without the overhead of dual directional computations. Second, to enable reliable benchmarking across heterogeneous settings, we propose Normalized Topological Similarity (NTS). By measuring the rank correlation of hierarchical merge orders, NTS yields a scale-invariant metric bounded between -1 and 1, effectively overcoming the scale and sample-dependence of unnormalized divergences. Experiments across synthetic and real-world deep learning settings demonstrate that our toolkit captures functional shifts in CNNs missed by geometric measures and robustly maps LLM genealogy even under distance saturation, offering a rigorous, topology-aware perspective that complements measures like CKA.

2606.06332 2026-06-05 math.ST stat.ME stat.ML stat.TH

Bentkus-type asymptotic e-values

Bentkus型渐近e值

Diego Martinez-Taboada, Ben Chugg, Aaditya Ramdas

AI总结 针对现有渐近e值存在“缺失因子”导致推断保守的问题,基于Bentkus近最优集中不等式,提出Bentkus型渐近e值并证明其消除缺失因子,理论和实证表明其推断更锐利。

详情
AI中文摘要

渐近e值正在成为渐近p值的有力替代,特别是在事后推断和多重检验中,其中显著性水平可能依赖于数据。然而,现有的渐近e值存在“缺失因子”,这是一种缩放效率低下,导致过于保守的推断。借鉴Bentkus在2000年代发展的近最优集中不等式框架,我们引入了Bentkus型渐近e值,并证明它们成功消除了缺失因子。我们还从理论和实证上证明,Bentkus型e值始终比现有替代方案提供更锐利的推断,从而在事后置信区间和多重检验程序中实现更紧的置信区间和更高的拒绝率。

英文摘要

Asymptotic e-values are emerging as a powerful alternative to asymptotic p-values, particularly in post-hoc inference and multiple testing, where significance levels may be data-dependent. Existing asymptotic e-values, however, suffer from the ``missing factor,'' a scaling inefficiency resulting in overly conservative inference. Drawing on the framework of near-optimal concentration inequalities developed by Bentkus in the 2000s, we introduce Bentkus-type asymptotic e-values and prove that they successfully eliminate the missing factor. We also demonstrate both theoretically and empirically that Bentkus-type e-values consistently deliver sharper inference than existing alternatives, leading to tighter post-hoc confidence intervals and higher rejection rates in multiple testing procedures.

2606.06329 2026-06-05 cs.LG cs.CG cs.CV stat.ML

Efficient Mean Curvature Computation on High-Dimensional Data Manifolds

高维数据流形上的高效平均曲率计算

Alexandre L. M. Levada

AI总结 针对高维数据集局部平均曲率计算中原始方法O(m^4)每点成本过高的问题,提出基于代数恒等式和截断SVD的快速估计器,将成本降至O(k^2 m + k m p^2),在真实数据集上实现50-300倍加速且精度损失可忽略。

详情
Comments
31 pages, 2 figures and 5 tables
AI中文摘要

估计高维数据集中每个点的局部平均曲率是几何感知机器学习算法(如平均曲率边界点(MCBP)方法)的关键组成部分。该计算的朴素实现基于从k近邻块近似的局部形状算子,涉及显式构造矩阵$H$,其迹形式导致每点成本为$O(m^4)$,使得该方法对于具有超过几十个特征的数据集变得难以处理。本文提出了两个互补的贡献,共同将这一成本降低了几个数量级。第一个贡献是一个精确的代数恒等式。该恒等式源自协方差矩阵特征向量的正交性和迹算子的循环性,完全消除了$H$,并将特征分解后的每点成本降低到$O(m^2)$。第二个贡献解决了完整特征分解中剩余的$O(m^3)$瓶颈。由于局部协方差矩阵的秩最多为$k-1 \ll m$,我们将其替换为$k imes m$中心数据矩阵的截断SVD,这是一个$O(k^2 m)$操作,并基于Haar测度下零空间特征向量外积的期望值,推导出其贡献的解析近似。得到的估计器总成本为$O(k^2 m + k m p^2)$,其中$p = k-1$。在真实数据集上的实验证实,相对于原始实现,加速比为50到300倍,当使用快速估计器替换原始版本时,精度损失可忽略。通过提供可扩展且数据驱动的局部曲率估计,所提出的方法将曲率确立为从经典到现代深度学习流水线的广泛机器学习任务中的实用几何特征。

英文摘要

Estimating local mean curvature at each point of a high-dimensional dataset is a key ingredient of geometry-aware machine learning algorithms, such as the Mean Curvature Boundary Points (MCBP) method. The naive implementation of this computation, based on a local shape operator approximated from k-nearest neighbor patches, involves an explicit construction of a matrix $H$ whose trace form yields an $O(m^4)$ cost per point, rendering the approach intractable for datasets with more than a few dozen features. This paper introduces two complementary contributions that together reduce this cost by several orders of magnitude. The first contribution is an exact algebraic identity. This identity, derived from the orthogonality of the eigenvectors of the covariance matrix and the cyclicity of the trace operator, eliminates $H$ entirely and reduces the per-point cost to $O(m^2)$ after the eigendecomposition. The second contribution addresses the remaining $O(m^3)$ bottleneck of the full eigendecomposition. Since the local covariance matrix has rank at most $k-1 \ll m$, we replace it with a truncated SVD of the $k \times m$ centered data matrix, an $O(k^2 m)$ operation, and derive an analytical approximation for the contribution of the null-space eigenvectors based on the expected value of their outer product under the Haar measure. The resulting estimator has total cost $O(k^2 m + k m p^2)$, where $p = k-1$. Experiments on real-world datasets confirm speedups of 50 to 300 times relative to the original implementation, with negligible loss when the fast estimator is used to replace the original version. By providing a scalable and data-driven estimate of local curvature, the proposed method establishes curvature as a practical geometric feature for a broad range of machine learning tasks, from classical to modern deep learning pipelines.

2606.06314 2026-06-05 math.NA cs.LG cs.NA stat.ML

DAS-PINNs for high-dimensional partial differential equations: extending deep adaptive sampling to spacetime domains

DAS-PINNs 用于高维偏微分方程:将深度自适应采样扩展到时空域

Anshima Singh, David J. Silvester

AI总结 提出一种基于归一化流的深度自适应采样框架,将时空视为统一域,通过残差分布自动识别高残差区域并生成采样点,有效求解具有局部动态特征的高维时变PDE。

详情
AI中文摘要

具有空间局部和动态演化解的时变高维偏微分方程对物理信息神经网络(PINNs)构成根本性挑战,因为在高维时空域中均匀配点采样越来越无效。本文将深度自适应采样框架扩展到时变设置,将空间和时间视为统一域,无需任何显式时间推进。归一化流神经网络模型有效学习由PDE残差诱导的分布,并生成集中在解最难学习区域的新配点。与需要显式时间步进或移动网格的传统自适应策略不同,高残差区域由PDE残差分布驱动,在空间和时间上自动识别和跟踪。通过从二维空间中的尖锐移动特征到高达八维空间中的局部结构等一系列基准问题,评估了所提策略的有效性。

英文摘要

Time-dependent high-dimensional partial differential equations (PDEs) with spatially localised and dynamically evolving solutions pose a fundamental challenge for physics-informed neural networks (PINNs), as uniform collocation sampling becomes increasingly ineffective in high-dimensional spatiotemporal domains. In this work, a deep adaptive sampling framework for PINNs is extended to the time-dependent setting by treating space and time as a unified domain without any explicit time marching. A normalising flow neural network model effectively learns the distribution induced by the PDE residual and generates new collocation points concentrated in regions where the solution is most difficult to learn. Unlike conventional adaptive strategies that require explicit time stepping or moving meshes, high-residual regions are automatically identified and tracked across both space and time, driven purely by the PDE residual distribution. The effectiveness of the proposed strategy is assessed on a range of benchmark problems, from sharp and moving features in two spatial dimensions to localised structures in up to eight spatial dimensions.

2606.06293 2026-06-05 cs.LG stat.ML

PAC-Bayesian Adversarially Robust Generalization for Message Passing Graph Neural Networks: A Sensitivity Analysis

消息传递图神经网络的PAC-Bayesian对抗鲁棒泛化:敏感性分析

Ziling Liang, Xinping Yi, Qingsong Wen, Shi Jin

AI总结 通过敏感性感知的PAC-Bayesian框架,利用输出雅可比矩阵的秩约束和异向高斯后验,为消息传递图神经网络导出更紧的对抗鲁棒泛化界。

详情
AI中文摘要

尽管图神经网络(GNNs)对对抗攻击的脆弱性对图表示学习构成了严重威胁,但在对抗环境下对鲁棒泛化行为的理解仍然是一个基本挑战。最近,基于PAC-Bayesian边际的泛化分析通过提供灵活且数据依赖的分析框架,显著推动了这一研究方向。然而,现有的鲁棒分析通常依赖于各向同性高斯后验,并在全参数空间中控制权重扰动,这限制了捕捉异质参数敏感性的能力,且依赖于隐藏宽度相关的复杂度项,导致泛化界不够紧。在本文中,我们将最近提出的敏感性感知PAC-Bayesian框架从深度神经网络扩展到消息传递图神经网络(MPGNNs),并在对抗环境下导出了更紧的鲁棒泛化界。具体地,我们首先通过推导关于权重参数的输出雅可比矩阵,量化不同参数块的扰动对网络输出的敏感性。利用这些雅可比矩阵在$K$类图分类中秩最多为$K$的事实,我们构建了雅可比对齐的敏感性矩阵,并使用具有优化协方差的异向高斯后验来紧上界KL散度。值得注意的是,通过细化学习权重的谱范数依赖性,并将主导维度因子从隐藏宽度相关项减少到类别数$K$,我们的分析为MPGNNs提供了更紧的鲁棒泛化保证,从而指导其设计以增强对抗鲁棒性。

英文摘要

Whilst the vulnerability of graph neural networks (GNNs) to adversarial attacks poses a critical threat to graph representation learning, the understanding of the robust generalization behavior remains a fundamental challenge in the adversarial setting. Recently, PAC-Bayesian margin-based generalization analysis substantially advances this line of research by providing a flexible and data-dependent analytical framework. However, existing robust analyses often rely on isotropic Gaussian posteriors and control weight perturbations in the full parameter space, which limits the ability to capture heterogeneous parameter sensitivity yet hinges on hidden-width-dependent complexity terms, resulting in not-tight-enough generalization bounds. In this paper, we extend a recently proposed sensitivity-aware PAC-Bayesian framework from deep neural networks to message passing GNNs (MPGNNs) and derive a tighter robust generalization bound in the adversarial setting. Specifically, we first quantify how sensitive the perturbations across different parameter blocks are to the network outputs by deriving the output Jacobians with respect to the weight parameters. Exploiting the fact that these Jacobian matrices have rank at most $K$ in $K$-class graph classification, we then construct Jacobian-aligned sensitivity matrices and use anisotropic Gaussian posteriors with optimized covariances to upper bound the KL divergence in a tight way. Notably, by refining the spectral-norm dependence on the learned weights and reducing the leading dimension factor from hidden-width-dependent terms to the number of classes $K$, our analysis yields much tighter robust generalization guarantees for MPGNNs, thereby guiding their designs to enhance adversarial robustness.

2606.06288 2026-06-05 stat.ML cs.LG

Discrete Causal Representations from Heterogeneous Domains: A Bayesian Approach with Social Survey Applications

来自异构域的离散因果表示:一种贝叶斯方法及其在社会调查中的应用

Ankur Garg, Michael Stettler, Aaron Schein, Julius von Kügelgen

AI总结 提出一种贝叶斯方法,从多环境数据中学习离散因果概念,通过序贯蒙特卡洛采样近似多模态后验,并在社会调查数据中验证了其推断有意义的高层概念和因果关系的有效性。

详情
AI中文摘要

因果表示学习旨在推断产生观测到的低层测量的高层潜在因果概念。这对于来自不同环境或领域的异构数据尤其相关,因为分布偏移通常通过某些底层因果机制中的稀疏局部变化而发生,而生成过程的其他部分保持不变。尽管因果表示的可识别性已被广泛研究,但实用的不确定性感知方法和真实世界用例仍较少探索。在这项工作中,我们提出了一种从多环境数据中学习因果表示的贝叶斯方法,重点关注离散因果概念和未知的多节点软干预的情况。为此,我们将因果假设和可解释性需求转化为层次模型中的适当先验和参数选择。然后,我们设计了一种基于序贯蒙特卡洛采样的推理方案来近似得到的多模态后验。我们通过社会调查数据的案例研究展示了我们的方法,其中潜在因果概念对应于文化价值观或政治观点,测量对应于调查响应,环境对应于不同的国家或州。我们的模型推断出有意义的高层概念以及它们之间合理的因果关系,展示了其在学习复杂真实世界数据的因果表示方面的实用性。

英文摘要

Causal representation learning aims to infer the high-level latent causal concepts that give rise to observed low-level measurements. This is particularly relevant for heterogeneous data from different environments or domains since distribution shifts often arise through sparse, localized changes in some of the underlying causal mechanisms, while other parts of the generative process remain unchanged. Whereas identifiability of causal representations has been studied extensively, practical uncertainty-aware methods and real-world use cases remain less explored. In this work, we propose a Bayesian approach to learning causal representations from multi-environment data, focusing on the case of discrete causal concepts and unknown multi-node soft interventions. To this end, we translate causal assumptions and interpretability desiderata into suitable priors and parametric choices within a hierarchical model. We then devise an inference scheme based on sequential Monte Carlo sampling to approximate the resulting multimodal posterior. We showcase our approach through case studies on social survey data, where latent causal concepts correspond to cultural values or political opinions, measurements to survey responses, and environments to different countries or states. Our model infers meaningful high-level concepts and plausible causal relations among them, demonstrating its utility for learning causal representations of complex real-world data.

2606.06246 2026-06-05 quant-ph cs.IT math-ph math.FA math.IT math.MP math.ST stat.TH

Multiple Quantum Hypothesis Testing: One-Shot Pairwise Bounds and Sharp Asymptotics

多重量子假设检验:单次配对界与尖锐渐近性

Hao-Chung Cheng, Po-Chieh Liu

AI总结 针对多重量子态贝叶斯判别问题,提出基于配对误差之和的无维单次上界,证明多重量子Chernoff距离在任意可分希尔伯特空间中的可达性,并给出最优错误概率的常数因子尖锐渐近性。

详情
Comments
arXiv:1401.7658, arXiv:1508.06624, arXiv:quant-ph/0607216. We are sorry if the accented character "ł" does not display properly through arXiv's TeX encoding of Metadata
AI中文摘要

我们考虑多重量子态之间的贝叶斯判别,并建立了基于配对误差之和的最小错误概率的无维单次上界。这解决了Audenaert和Mosonyi [J. Math. Phys. 55 (2014)]的一个猜想,并通过去除维度相关的前因子改进了Li [Ann. Statist. 44 (2016)]的多重量子Chernoff界。在渐近多副本情形下,我们的界证明了多重量子Chernoff距离在任意可分希尔伯特空间中的可达性,从而解决了先前未解决的无穷维情况,并进一步给出了最优错误概率的常数因子尖锐渐近性。 在二元量子假设检验中,我们证明了最小错误概率(在通用常数范围内)由迹调和平均量刻画。因此,最优二元量子错误概率与相关Nussbaum-Szkoła分布的最优经典错误概率相差不超过两倍,补充了Nussbaum和Szkoła [Ann. Statist. 37 (2009)]的下界。

英文摘要

We consider Bayesian discrimination among multiple quantum states and establish a dimension-free one-shot upper bound on the minimum probability of error in terms of the sum of pairwise errors. This resolves a conjecture of Audenaert and Mosonyi [J. Math. Phys. 55 (2014)] and improves the multiple quantum Chernoff bound of Li [Ann. Statist. 44 (2016)] by removing its dimension-dependent prefactor. In the asymptotic many-copy regime, our bound proves the achievability of the multiple quantum Chernoff distance for arbitrary separable Hilbert spaces, thereby settling the previously open infinite-dimensional case, and further yields constant-factor sharp asymptotics for the optimal error probability. In binary quantum hypothesis testing, we prove that the minimum error probability is characterized, up to universal constants, by a trace harmonic-mean quantity. Consequently, the optimal binary quantum error probability is within a factor of two of the optimal classical error probability for the associated Nussbaum-Szkoła distributions, complementing the lower bound of Nussbaum and Szkoła [Ann. Statist. 37 (2009)].

2606.06233 2026-06-05 stat.ML cs.LG stat.ME

Anchor PCA

Anchor PCA

Benedikt Seiter, Anya Fries, Julius von Kügelgen, Jonas Peters

AI总结 针对多领域数据,提出Anchor PCA方法,通过修改目标矩阵进行主成分分析,在保留共享变异方向的同时权衡整体解释方差,实现鲁棒的降维。

详情
AI中文摘要

主成分分析(PCA)是最广泛使用的无监督降维技术之一。我们研究来自多个相关领域的数据的PCA。由于主成分在不同领域通常不同,获得共享低秩嵌入的一种方法是对合并数据进行PCA。然而,这种方法可能关注仅在少数领域中表现出高变异的虚假方向。为了找到在未见但相似领域中仍能解释大部分方差的鲁棒嵌入,我们提出关注共享变异方向。为此,我们引入了Anchor PCA,它在整体解释方差与共享和领域特定低秩嵌入之间的一致性之间进行权衡。Anchor PCA相当于对修改后的目标矩阵进行PCA,因此可以高效求解。此外,我们证明Anchor PCA恢复最大不变子空间,并在有界领域特定协方差膨胀下允许极小极大重构解释。在具有时间漂移的模拟和真实气体传感器数据上,我们分别证明Anchor PCA恢复了最大不变子空间,并产生了比合并基线和最坏情况替代方法在未见领域上解释更多方差的嵌入。综合来看,这些发现确立了Anchor PCA作为从多领域数据进行鲁棒无监督降维的有前途的方法。

英文摘要

Principal component analysis (PCA) is one of the most widely used unsupervised dimension reduction techniques. We study PCA for data from multiple related domains. Since principal components generally differ across domains, one way to obtain a shared low-rank embedding is to perform PCA on the pooled data. However, this approach can focus on spurious directions that exhibit high variation in only a few domains. To find a robust embedding that still explains most variance in unseen but similar domains, we propose instead to focus on shared directions of variation. To this end, we introduce Anchor PCA which trades off overall explained variance with agreement between the shared and domain-specific low-rank embeddings. Anchor PCA amounts to PCA on a modified target matrix and thus can be solved efficiently. Moreover, we show that Anchor PCA recovers a maximal invariant subspace and admits a minimax reconstruction interpretation under bounded domain-specific covariance inflations. On simulated and real-world gas sensor data with temporal drift, we demonstrate, respectively, that Anchor PCA recovers the maximally invariant subspace and yields embeddings that explain more variance on unseen domains than the pooling baseline and a worst-case alternative. Taken together, these findings establish Anchor PCA as a promising approach to robust unsupervised dimension reduction from multi-domain data.

2606.06179 2026-06-05 stat.ML cs.LG

Diffusion Models Observe Only Gradients: A Geometric Perspective on Score Matching Errors

扩散模型仅观察梯度:分数匹配误差的几何视角

Naïl B. Khelifa, Richard E. Turner, Ramji Venkataramanan

AI总结 本文从几何角度揭示L2分数误差不是衡量边缘分布质量的正确指标,通过Helmholtz-Hodge分解将分数误差分解为梯度分量和螺线管分量,证明只有梯度分量影响Fokker-Planck动力学,并给出仅依赖于梯度分量的KL散度上界及可计算的梯度分量估计器。

详情
AI中文摘要

基于分数的扩散模型通常通过最小化$L^2$分数匹配误差来训练,标准理论分析依赖该量来约束学习分布与目标分布之间的采样差异。我们证明$L^2$分数误差不是边缘分布质量的正确内在度量:一个学习的扩散模型可能在完美匹配目标分布的同时产生任意大的$L^2$分数误差。通过将分数误差分解为梯度分量和螺线管分量(Helmholtz-Hodge分解),我们识别出背后的几何原因:只有梯度分量进入边缘Fokker-Planck动力学,而螺线管分量在结构上不可见。我们在三个结果中精确阐述了这一点。首先,基于修正的几何,我们证明了一个不可能结果:没有$L^2$分数误差的单调函数能够一致地给出学习分布与目标分布之间任何散度的下界。其次,我们推导出Kullback-Leibler散度的上界,该上界仅依赖于误差的可观测梯度分量,从而收紧标准Girsanov界,并指出其宽松性源于在路径空间而非边缘空间动力学上操作的成本。第三,我们通过对偶Sobolev恒等式给出了梯度分量的可处理估计器,实验表明该估计器与样本质量的相关性显著优于完整的$L^2$误差。

英文摘要

Score-based diffusion models are typically trained by minimizing the $L^2$ score matching error, and standard theoretical analyses rely on this quantity to bound the sampling discrepancy between the learned and target distributions. We show the $L^2$ score error is not the right intrinsic measure of marginal distributional quality: a learned diffusion model can incur arbitrarily large $L^2$ score error while perfectly matching the target distribution. By decomposing score errors into a gradient and a solenoidal component (a Helmholtz-Hodge decomposition), we identify the geometric reason behind this: only the gradient component enters the marginal Fokker-Planck dynamics, while the solenoidal component is structurally invisible. We make this precise in three results. First, building on the corrected geometry, we prove an impossibility result: no monotone function of the $L^2$ score error can uniformly lower bound any divergence between the learned and target distributions. Second, we derive an upper bound on the Kullback-Leibler divergence that depends only on the observable gradient component of the error, tightening the standard Girsanov bound and identifying its looseness as the cost of operating on path-space rather than marginal-space dynamics. Third, we give a tractable estimator of the gradient component via a dual Sobolev identity, which is shown to empirically correlate substantially better with sample quality than the full $L^2$ error.

2606.06174 2026-06-05 cs.LG stat.AP

Learning to model pediatric asthma exacerbation from multiple risk factors: a case study in coastal Virginia

学习从多风险因素建模儿童哮喘加重:弗吉尼亚沿海地区案例研究

Jonathan Colen, Eric Werner, Maryam Golbazi, Heather Richter, Diana McSpadden, Amy Quinn, Jocel Santos, Mary Jane Darling, Mary Margaret Gleason

AI总结 本研究通过比较广义线性模型、神经网络和稀疏字典学习框架,建模弗吉尼亚沿海地区儿童哮喘加重与空气污染、气象及社区社会经济因素的关系,并识别关键风险因素。

详情
Comments
22 pages, 6 figures (5 supplemental)
AI中文摘要

儿童哮喘是一种常见疾病,受空气污染、气象和社区级社会经济因素加剧。在大型时空数据集中建模哮喘加重(AE)需要厘清多个贡献因素的影响。在本案例研究中,我们比较了三种平衡预测能力与可解释性的技术,以预测汉普顿路(弗吉尼亚沿海地区,包括7个城市,人口超过150万)的AE。在整理环境空气污染测量值、天气数据和社区机会指标后,我们建模了2018-2023年区域儿童医院及附属机构的邮政编码级急性AE就诊情况。广义线性模型(GLM)提供基线,神经网络(NN)作为最大预测目标。为了桥接统计模型和深度学习,我们开发了一个基于稀疏字典学习的框架,以识别和解释简约的非线性交互方程。在比较每个模型的预测性能后,我们估计了输入暴露变量导致的AE相对风险,并发现各框架间的一致性。我们的工作将统计模型与可解释机器学习模型联系起来,突出了可能影响AE的协同交互作用,并可能为未来研究指导弗吉尼亚沿海地区的公共卫生干预措施。

英文摘要

Childhood asthma is a common illness exacerbated by air pollution as well as meteorological and neighborhood-level socioeconomic factors. Modeling asthma exacerbation (AE) in large spatiotemporal datasets requires disentangling impacts from multiple contributors. In this case study, we compared three techniques that balance predictive power with interpretability to predict AE in Hampton Roads, a coastal Virginia region comprising 7 cities and over 1.5 million people. After collating ambient air pollution measurements, weather data, and measures of neighborhood opportunity, we modeled zip code-level acute AE visits to a regional children's hospital and affiliated providers from 2018-2023. Generalized linear models (GLM) provided a baseline while neural networks (NN) served as a maximally predictive target. To bridge between statistical models and deep learning, we developed a framework based on sparse dictionary learning to identify and interpret parsimonious nonlinear interacting equations. After comparing each model's predictive performance, we estimated relative risks for AE due to input exposure variables and found consensus across frameworks. Our work links statistical and interpretable machine learning models to highlight possible synergistic interactions influencing AE, and may enable future studies to guide public health interventions in coastal Virginia.

2606.06173 2026-06-05 eess.SY cs.SY stat.AP

From data to decisions: Bayesian modelling and global sensitivity analysis for flotation control

从数据到决策:浮选控制的贝叶斯建模与全局灵敏度分析

Paulina Quintanilla, Agustin Fuenzalida, Daniel Navia, Pablo Brito-Parada

AI总结 本文提出一个数据驱动框架,集成高斯过程回归与基于Sobol指数的全局灵敏度分析及SHAP局部可解释性,用于浮选系统的可解释建模和决策支持,通过实验室数据建立静态GP代理模型,识别影响空气回收率的关键变量及其交互作用。

详情
AI中文摘要

本工作提出了一个数据驱动框架,用于浮选系统的可解释建模和决策支持,集成了高斯过程回归与基于Sobol指数的全局灵敏度分析以及使用SHapley Additive exPlanations (SHAP)的局部可解释性。基于实验室规模的实验数据,开发了一个静态GP代理模型,以捕捉表观气速、溢流泡沫速度、唇口上泡沫高度、矿浆高度、气泡尺寸和尾矿流量如何影响测量的空气回收率。训练好的GP能够计算Sobol指数,以量化每个变量及其交互作用对空气回收率总体方差的贡献。贝叶斯推断与基于Sobol的灵敏度度量的结合提供了一种系统的方法来识别控制空气回收率的主导变量和交互变量。本研究将贝叶斯学习、灵敏度量化和可解释性联系起来,为浮选过程的数据驱动控制和优化提供了基础。

英文摘要

This work presents a data-driven framework for interpretable modelling and decision support in flotation systems, integrating Gaussian Process (GP) regression with Global Sensitivity Analysis (GSA) via Sobol indices and local interpretability using SHapley Additive exPlanations (SHAP). Based on laboratory-scale experimental data, a static GP surrogate model is developed to capture how superficial air velocity, overflowing froth velocity, froth height over the lip, pulp height, bubble size, and tailings flowrate influence the measured air recovery. The trained GP enables the computation of Sobol indices to quantify the contribution of each variable and their interactions to the overall variance in air recovery. The combination of Bayesian inference and Sobol-based sensitivity metrics provides a systematic approach to identify the dominant and interacting variables governing air recovery. This study links Bayesian learning, sensitivity quantification, and explainability to provide a foundation for data-driven control and optimisation of flotation processes.

2606.06171 2026-06-05 stat.ML cs.LG cs.NA math.NA physics.comp-ph

Effective Dimensionality as an Operator Invariant for Physics-Preserving Constraint Adaptation in Physics-Informed Neural Networks

有效维度作为物理信息神经网络中保物理约束适配的算子不变量

Cornelius Otchere, Michael Shields

AI总结 利用Fisher信息矩阵定义物理约束模型的有效自由度d_eff,证明其收敛于微分算子核维数,并基于此提出子空间投影策略实现边界条件适配。

详情
AI中文摘要

物理信息神经网络固有地遭受任务干扰,因为它们依赖共享参数空间来同时满足控制微分方程和边界条件。我们使用Fisher信息矩阵分析这种结构冲突,量化物理约束模型中的有效自由度($d_{eff}$)。与经典的$d_{eff}$(衡量相对于统计先验由数据提供信息的参数方向数量)不同,我们的$d_{eff}$衡量不受微分算子约束的参数方向维度。对于具有有限维核的算子,我们证明$d_{eff}$精确收敛于核维数,与网络宽度、深度或激活函数无关,将其从拟合诊断重新解释为底层连续算子的结构不变量。对于具有无限维核的算子,$d_{eff}$则衡量网络对该核的有限维表示带宽,而非恢复整数不变量。重要的是,$d_{eff}$还作为先验结构诊断。将适定问题的$d_{eff}$驱动到零,证明物理和边界约束已吸收网络的自由方向。基于这一表征,我们引入了用于边界适配的子空间投影策略。无需从头重新训练,我们将参数更新投影到预训练物理算子的零空间,使得新边界条件得到满足而不干扰已学习的物理。基于梯度的微调可以达到或超过此效果,但需要更多的挂钟时间和调参,而子空间投影在几秒到几分钟内提供近乎等效的质量。我们在线性和非线性算子上验证,展示了对初始和边界偏移以及未遇到约束类型的准确适配。

英文摘要

Physics-Informed Neural Networks inherently suffer from task interference because they rely on a shared parameter space to satisfy both governing differential equations and boundary conditions. We analyze this structural conflict using the Fisher Information Matrix to quantify the effective degrees of freedom ($d_{eff}$) in a physics-constrained model. Unlike the classical $d_{eff}$ which measures how many parameter directions are informed by data against a statistical prior, our $d_{eff}$ measures the dimension of the parameter directions unconstrained by the differential operator. For operators with finite-dimensional kernel, we show that $d_{eff}$ converges to the kernel dimension exactly, independent of network width, depth, or activation function, recasting it from a fit diagnostic into a structural invariant of the underlying continuous operator. For operators with infinite-dimensional kernel, $d_{eff}$ instead measures the network's finite-dimensional representational bandwidth for that kernel rather than recovering an integer invariant. Importantly, $d_{eff}$ also serves as an a priori structural diagnostic. Driving $d_{eff}$ of a well-posed problem to zero certifies that the physics and boundary constraints have absorbed the network's free directions. Building on this characterization, we introduce subspace projection strategies for boundary adaptation. Rather than retraining from scratch, we project parameter updates into the null space of the pre-trained physics operator so that new boundary conditions are satisfied without disturbing the learned physics. Gradient-based fine-tuning can match or exceed this but needs more wall-clock time and tuning, whereas subspace projection delivers near-equivalent quality in seconds to minutes. We validate on linear and nonlinear operators, demonstrating accurate adaptation to initial and boundary shifts and unencountered constraint types.

2606.06161 2026-06-05 math.ST stat.TH

Monitoring the Ratio of two Normal Variables using EWMA Type Control Charts in Short Production Runs

短生产运行中使用EWMA型控制图监测两个正态变量的比率

Thi Hien Nguyen, Jean-Michel Masereel, Guillaume Tartare, Kim Duc Tran

AI总结 针对短生产运行中两个正态变量比率的监测,提出一种基于马尔可夫链校准控制限的EWMA控制图,在有限检测次数内显著提升对小偏移的检测灵敏度。

详情
AI中文摘要

在许多工业和工程应用中,过程性能由两个正态分布质量特性的比率表征。在短生产运行中监测此类比率尤其具有挑战性,因为传统控制图由于可用检测次数少而灵敏度有限。本文提出了一种指数加权移动平均(EWMA)控制图,用于在短生产运行(SPR)条件下监测两个正态分布随机变量的比率。首先回顾了比率的统计分布,采用Nadarajah(2020)修正的闭式密度而非早期研究中使用的近似。所提控制图的控制限通过EWMA递归的马尔可夫链表示,校准到有限检测次数$I$内预设的受控截断平均运行长度(TARL$_0$)。然后通过一个包含平滑常数$λ$、受控相关系数$ρ_0$、变异系数$(γ_X, γ_Y)$、样本量$n$和偏移幅度$τ$的大规模因子研究评估图表的检测性能。数值结果表明,所提EWMA-RZ图对中小偏移的检测性能显著优于Tran等人(2021)最近开发的Shewhart型短运行比率图(ShRZ),尤其是当$|τ-1| \le 0.05$时。包含一个基于饮料灌装过程的说明性示例以展示该方法的实际实施。

英文摘要

In many industrial and engineering applications, process performance is characterized by the ratio of two normally distributed quality characteristics. Monitoring such ratios is particularly challenging in short production runs, where conventional control charts often suffer from limited sensitivity due to the small number of available inspections. This paper proposes an exponentially weighted moving average (EWMA) control chart for monitoring the ratio of two normally distributed random variables under short production run (SPR) conditions. The statistical distribution of the ratio is first reviewed, adopting the corrected closed-form density of Nadarajah (2020) rather than the approximation used in earlier studies. The control limit of the proposed chart is calibrated to a prescribed in-control truncated average run length (TARL$ _0 $) over a finite horizon $ I $ of inspections, using a Markov-chain representation of the EWMA recursion. The detection performance of the chart is then assessed through a large factorial study covering the smoothing constant $ λ$, the in-control correlation $ ρ_0 $, the coefficients of variation $ (γ_X, γ_Y) $, the sample size $ n $, and the magnitude of the shift $ τ$. Numerical results show that the proposed EWMA-RZ chart provides substantially better detection of small and moderate shifts than the recently developed Shewhart-type short-run ratio chart (ShRZ) of Tran et al. (2021), especially for $ |τ- 1| \le 0.05 $. An illustrative example based on a beverage filling process is included to demonstrate the practical implementation of the method.

2606.06149 2026-06-05 math.ST stat.TH

Effect of the measurement errors on one-sided Synthetic-RZ control charts for monitoring the ratio of two normal variables

测量误差对监测两个正态变量比值的单侧Synthetic-RZ控制图的影响

Kim Duc Tran, Thi Hien Nguyen, Kim Phuc Tran

AI总结 本文采用线性协变量误差模型,通过马尔可夫链分析测量误差对单侧Synthetic-RZ控制图在零状态和稳态平均运行长度(ARL)上的影响,发现测量误差会削弱检测能力,且对每个检测单元多次测量并非有效补救措施。

详情
AI中文摘要

在许多工业生产环境中,监测由两个正态分布随机变量形成的比值是一项具有相当实际意义的任务。本文研究了测量误差如何影响一对用于监测此类比值的单侧Synthetic控制图(此处称为Synthetic-RZ图)的行为,分析涵盖了零状态和稳态平均运行长度($ARL$)。为了将测量误差纳入这些图的运行中,我们采用线性协变量误差模型。我们逐步描述了当过程从受控状态转移到失控状态时,基础模型参数如何演变,并且我们有意避免了观察到的偏移幅度与测量误差无关的限制性前提。通过马尔可夫链公式获得了图的运行长度特征。一系列数值实验清楚地表明,测量误差削弱了图的检测能力。研究的一个特别有用的结果是,对每个检测单元收集多次测量并不能有效补救测量误差对Synthetic-RZ图性能的不利影响。

英文摘要

In numerous industrial production settings, keeping track of the ratio formed by two normally distributed random variables is a task of considerable practical interest. The present work examines how measurement errors influence the behaviour of a pair of one-sided Synthetic control charts designed to monitor such a ratio (referred to here as Synthetic-RZ charts), with the analysis covering both the zero-state and the steady-state average run length ($ARL$). To incorporate measurement error into the operation of these charts, we adopt a linear covariate error model. We describe, step by step, how the parameters of the underlying model evolve as the process moves from an in-control to an out-of-control state, and we deliberately avoid the restrictive premise that the observed shift magnitude is unrelated to the measurement errors. The run length characteristics of the charts are obtained by means of a Markov chain formulation. A series of numerical experiments makes clear that measurement error erodes the detection capability of the charts. A particularly useful outcome of the investigation is that collecting several measurements on each inspected unit does not constitute an efficient remedy for the adverse influence of measurement error on the performance of the Synthetic-RZ charts.

2606.06137 2026-06-05 math.ST stat.TH

An Adaptive Upper One-Sided Cumulative Sum Control Chart with Joint Parameter Optimization for Monitoring the Ratio of Two Normal Variables in Short Production Runs

短生产运行中监控两个正态变量比率的自适应上单侧累积和控制图及其联合参数优化

Kim Duc Tran

AI总结 针对短生产运行中两个相关正态变量比率的监控,提出一种具有自适应联合优化参数k和h的上单侧累积和控制图CUSUM-RZ^+,通过双层优化实现最优检测性能。

详情
AI中文摘要

监控两个相关正态变量的比率在统计过程控制中日益重要,因为许多质量特性以相对形式而非绝对形式表达。记忆型比率图大多针对长生产运行开发,而其有限时域对应方法依赖于从指定偏移导出的固定参考值$k$。这种固定$k$设计在给定的失控幅度下并非最优,并且在低变异性情况下会产生边界解,使得受控截断平均运行长度(TARL$_0$)无法达到。本文针对短生产运行中的比率$Z = X/Y$提出了一种上单侧累积和控制图,记为CUSUM-RZ$^+$(RZ代表比率$Z$),具有完全自适应的$k$和决策区间$h$的联合优化。给定目标TARL$_0 = I$和目标偏移$τ$,一个双层问题通过内部求根校准$h(k)$以满足TARL$_0$约束,并通过外部线搜索选择$k^*$以最小化失控TARL$_1$。两者均使用具有精确比率近似的有限状态马尔可夫链框架;内部步骤恢复了固定$k$设计无法处理的边界情况。通过与Shewhart-RZ、指数加权移动平均(EWMA-RZ)和固定$k$的CUSUM-RZ$^+$图进行匹配时域基准比较、蒙特卡洛稳健性研究以及第一阶段估计分析来评估该图。所有记忆型图均优于Shewhart-RZ基线;自适应设计在稳定相关性下与之匹配,并在相关性从第一阶段上升到第二阶段时显著改善。它对对称重尾不敏感,但在污染下略微反保守,且$m \geq 100$个子组使TARL$_0$相对偏差保持在1%附近。

英文摘要

Monitoring the ratio of two correlated normal variables is increasingly important in statistical process control, since many quality characteristics are expressed in relative rather than absolute form. Memory-type ratio charts have mostly been developed for long production runs, while their finite-horizon counterparts rely on a fixed reference value $ k $ derived from a specified shift. Such fixed-$ k $ designs are not optimal at a given out-of-control magnitude and, in low-variability regimes, yield boundary solutions for which the in-control truncated average run length (TARL$ _0 $) is unattainable. This paper proposes an upper one-sided cumulative sum (CUSUM) control chart for the ratio $ Z = X/Y $ in short production runs, denoted CUSUM-RZ$ ^+ $ (RZ standing for the ratio $ Z $), with fully adaptive joint optimization of $ k $ and the decision interval $ h $. Given a target TARL$ _0 = I $ and a target shift $ τ$, a bilevel problem calibrates $ h(k) $ by inner root-finding to satisfy the TARL$ _0 $ constraint and selects $ k^* $ by outer line search to minimize the out-of-control TARL$ _1 $. Both use a finite-state Markov-chain framework with an accurate ratio approximation; the inner step recovers boundary cases that fixed-$ k $ designs cannot. The chart is assessed through matched-horizon benchmarks against Shewhart-RZ, exponentially weighted moving average (EWMA-RZ), and fixed-$ k $ CUSUM-RZ$ ^+ $ charts, Monte Carlo robustness studies, and a Phase I estimation analysis. All memory-type charts outperform the Shewhart-RZ baseline; the adaptive design matches them under stable correlation and improves appreciably when correlation rises from Phase I to Phase II. It is insensitive to symmetric heavy tails yet mildly anti-conservative under contamination, and $ m \geq 100 $ subgroups keep the TARL$ _0 $ relative bias near 1%.

2606.06123 2026-06-05 cs.LG stat.ML

Adaptive state-action abstractions via rate-distortion

基于率失真的自适应状态-动作抽象

Fernando E. Rosas

AI总结 提出通过率失真原理构建软状态-动作抽象,并利用性能证书动态调整抽象粒度,以在压缩状态和动作信息时实现近似最优性能。

详情
Comments
28 pages, 2 figures
AI中文摘要

在学习走路时,婴儿似乎首先处理问题的粗略版本——保持直立、到达看护者——并且只有当在该分辨率下的进一步练习不再有回报时才会细化它。强化学习提供了多种构建复杂任务简单版本的技术,但缺乏关于如何在学习过程中动态调整这些抽象粒度的通用原则。本文提出了这样一个原则:一旦抽象内的学习误差变得与抽象本身引起的误差相当,就细化抽象。在这里,我们通过一个性能证书来研究这一原则的一种形式化方式,该证书将值误差分解为两项:由贝尔曼残差捕获的学习误差界,和由双模拟度量给出的抽象误差界。由此产生的切换策略通过基于率失真原理构建的软状态-动作抽象来实现,其沿状态和动作轴的分辨率可以连续调整。我们在各种表格设置中验证了这种构造,表明在状态和动作信息的大量有损压缩下可以实现近似最优性能。

英文摘要

When learning to walk, infants seem to address a coarse version of the problem first - stay upright, reach the caregiver - and refine it only when further practice at that resolution stops paying off. Reinforcement learning offers multiple techniques for building simple versions of complex tasks, but lacks general principles for how to dynamically adjust the granularity of these abstractions during learning. This paper proposes one such principle: refine the abstraction as soon as the learning error within it becomes comparable to the error induced by the abstraction itself. Here, we investigate one way of formalising this principle via a performance certificate that decomposes value error into two terms: a learning error bound captured by a Bellman residual, and an abstraction error bound given by a bisimulation metric. The resulting switching strategy is implemented by soft state-action abstractions built from rate-distortion principles, whose resolution along state and action axes can be continuously adjusted. We validate this construction in a range of tabular settings, showing that near-optimal performance can be achieved under substantial lossy compression of state and action information.

2606.06043 2026-06-05 stat.ML cs.LG

Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader

基于替代概率的自适应学习率用于跟随扰动领导者

Jongyeong Lee, Junya Honda, Shinji Ito, Chansoo Kim

AI总结 针对FTPL算法因无优化特性难以设计自适应概率依赖学习率的问题,提出基于替代概率函数的自适应学习率,实现了任意形状参数α>1的Pareto扰动下的最佳双世界保证,并扩展到专家建议的赌博机问题。

详情
Comments
TBA COLT2026
AI中文摘要

跟随正则化领导者框架在在线学习问题中显示出有效性和灵活性,其中学习率的选择至关重要。最近,通过求解凸优化获得的、基于臂选择概率定义的自适应学习率,在各种赌博机问题中实现了改进的最佳双世界(BOBW)保证。相比之下,其计算高效替代方案——跟随扰动领导者(FTPL)的BOBW保证仍然相对有限,因为其无优化特性讽刺地使得设计自适应的、概率依赖的学习率变得非平凡。为了解决这一挑战,我们通过引入替代概率函数为FTPL提出了一种自适应学习率,该函数仅从可用量计算,无需精确概率。基于这些具有替代函数的学习率,我们为具有任意形状参数$α>1$的Pareto扰动的FTPL提供了BOBW保证,推广了先前仅限于特定选择$α=2$的结果。我们进一步展示了在具有专家建议的赌博机问题中,具有自适应学习率的FTPL的BOBW保证。我们的方法保留了FTPL的计算简单性,同时实现了概率依赖的自适应性,并且基于替代的方法论可能在其他算法框架(超越FTPL和学习率设计)中具有独立意义。

英文摘要

Follow-the-regularized-leader framework has shown effectiveness and flexibility in online learning problems, where the choice of learning rates are known to be crucial. Recently, adaptive learning rates defined in terms of the arm-selection probabilities, obtained by solving convex optimization, have achieved improved best-of-both-worlds (BOBW) guarantees in various bandit problems. In contrast, BOBW guarantees for its computationally efficient alternative, follow-the-perturbed-leader (FTPL), remain relatively limited since its optimization-free nature ironically makes the design of adaptive, probability-dependent learning rates non-trivial. To address this challenge, we propose an adaptive learning rate for FTPL by introducing surrogate probability functions that can be computed only from the available quantities, without requiring the exact probabilities. Based on these learning rates with surrogate functions, we provide the BOBW guarantee for FTPL with Pareto perturbations for any shape parameter $α>1$, generalizing prior results restricted to specific choices of $α=2$. We further show the BOBW guarantees for FTPL with adaptive learning rates in the bandit problem with expert advices. Our approach preserves the computational simplicity of FTPL while enabling probability-dependent adaptivity, and the surrogate-based methodology may be of independent interest in other algorithmic frameworks beyond FTPL and learning rate designs.

2606.06018 2026-06-05 math.ST math.AP math.DS stat.TH

On statistical inference for non-linear dynamical systems evolving in their global attractor

关于在其全局吸引子上演化的非线性动力系统的统计推断

Dimitri Konen, Richard Nickl

AI总结 针对反应函数自然条件下带初始条件的二维周期反应扩散系统,证明全局吸引子上反向庞加莱不等式成立,进而得到L2-Lipschitz稳定性估计,并实现初始条件统计恢复和状态预测的快速近参数收敛率。

详情
AI中文摘要

我们考虑一个二维周期反应扩散系统,反应函数满足自然条件,初始条件为$θ$。我们证明,在所得动力系统$(u_θ(t):t>0)$的全局吸引子$\mathcal A$上,反向庞加莱不等式成立,并且作为推论,对于任意固定的$t>0$,映射$θ\mapsto u_θ(t)$在$\mathcal A$上满足$L^2$-Lipschitz稳定性估计。然后我们证明,从系统的离散测量中,可以在‘快速’近参数收敛率下统计恢复吸引子$\mathcal A$中的初始条件$θ$,并预测状态$u_θ$。

英文摘要

We consider a two-dimensional periodic reaction-diffusion system under natural conditions on the reaction function and with initial condition $θ$. We show that on the global attractor $\mathcal A$ of the resulting dynamical system $(u_θ(t):t>0)$, a reverse Poincaré inequality holds true, and that as a consequence the map $θ\mapsto u_θ(t)$ satisfies a $L^2$-Lipschitz stability estimate on $\mathcal A$ for any $t>0$ fixed. We then show that statistical recovery of an initial condition $θ$ in the attractor $\mathcal A$, as well as prediction of the states $u_θ$, is possible from discrete measurements of the system at `fast' near parametric convergence rates.

2606.05957 2026-06-05 cs.LG stat.ML

Dead Directions: Geometric Singular Learning

死方向:几何奇异学习

Tejas Pradeep Shirodkar

AI总结 本文通过引入“死方向”概念,桥接奇异学习理论与信息几何,提出在原始参数坐标下从Fisher曲率衰减率恢复KL阶数的方法,并扩展到深度网络,实现无需后验采样的Watanabe三元组(λ, m, ν)轨迹率读出。

详情
Comments
139 pages, 13 figures, 13 tables
AI中文摘要

奇异学习理论和信息几何研究了相同的参数空间,但使用了大体不同的词汇:前者在解决坐标中计算贝叶斯不变量,后者在非退化假设下使用原始坐标,而过参数化模型经常违反该假设。我们通过一个原始概念——死方向——将它们桥接起来:死方向是沿着Fisher度量退化的单位向量,等价于具有确定KL阶数的解析奇异集的切向量,KL阶数由KL散度消失的速度决定。两种解读命名同一向量;我们的核心操作表明,其KL阶数可作为方向Fisher曲率趋近奇异点的衰减率恢复,在原始参数坐标中无需Hironaka分解。光滑纤维上的选择规则将该速率转化为Watanabe的单方向对实对数规范阈值的贡献,我们将恢复扩展到多分量交叉、重数m、奇异波动ν(在一维方向中KL阶数通用)、先验RLCT偏移以及温度后验。然后我们将该速率提升到深度网络:多层K-FAC分解将每个Fisher块写为激活侧和梯度侧速率的乘积,两者之间存在对偶性,并在现代网络原语(残差流、层归一化、注意力)中实例化。商定理将该速率传递到在G不变度量下梯度流的规范商Θ/G;SGD符合条件,标准Adam不符合,我们构造了一个G等变Adam族预条件器(DDCAdam)使其符合。该桥接提供了对奇异几何的参数坐标处理、每个架构的闭式预测,以及从一个检查点的前向和后向传播中无需后验采样的Watanabe三元组(λ, m, ν)轨迹率读出。

英文摘要

Singular learning theory and information geometry have studied the same parameter spaces in mostly separate vocabularies: the former computes Bayesian invariants in resolved coordinates, the latter works in original coordinates under a non-degeneracy assumption that overparameterised models routinely violate. We bridge them through one primitive, the dead direction: a unit vector along which the Fisher metric degenerates, equivalently a tangent to the analytic singular set with a definite KL order, set by how fast the KL divergence vanishes. The two readings name the same vector; our central move shows its KL order is recoverable as the decay rate of the directional Fisher curvature approaching the singularity, in original parameter coordinates and without a Hironaka resolution. A selection rule on smooth fibres translates this rate into Watanabe's single-direction contribution to the real log canonical threshold, and we extend the recovery to multi-component crossings, multiplicity $m$, the singular fluctuation $ν$ (universal in the KL order for 1D directions), prior-RLCT shifts, and tempered posteriors. We then lift this rate to a deep network: a multi-layer K-FAC factorisation writes each Fisher block as a product of activation- and gradient-side rates with a duality between them, instantiated at modern-network primitives (residual streams, layer normalisation, attention). A quotient theorem carries the rate to the gauge quotient $Θ/G$ under gradient flow on a $G$-invariant metric; SGD qualifies, standard Adam does not, and we construct a $G$-equivariant Adam-family preconditioner (DDCAdam) that does. The bridge yields a parameter-coordinate handle on singular geometry, closed-form per-architecture predictions, and a trajectory-rate readout of Watanabe's triple $(λ, m, ν)$ from one checkpoint's forward and backward passes, without posterior sampling.

2606.05954 2026-06-05 physics.soc-ph cs.SI nlin.AO stat.ME

Network model selection: A review of methods

网络模型选择:方法综述

Zoran Levnajić

AI总结 本文系统综述了网络模型选择的方法,将方法按核心原理分为四类,并探讨了未来方向,旨在为开发统一最优方法奠定基础。

详情
Comments
This is an Accepted Manuscript version of the book: Zoran Levnajic, Network model selection: A review of methods, 2026, Springer. This version has been accepted for publication, but is not the Version of Record and does not reflect post-acceptance improvements (such as copyediting or typesetting), or any corrections. The final authenticated version is available online at ISBN 978-3-032-30448-3
AI中文摘要

理解复杂网络演化背后的过程是网络科学的一个关键目标。应对这一挑战的有效框架是网络模型选择,即从一组候选模型中找到最能解释给定网络的模型。本书是对此目的方法的系统综述。每种方法分三部分概述:其核心原理(用于将方法分为四类)、其他相关细节(包括我自己的观察)以及软件可用性。本书全面概述了网络模型选择的最新进展,并探讨了未来方向。一种统一的最优方法可能比任何现有方法更精确地识别塑造现实网络的机制。这项工作代表了朝着开发这种最优方法迈出的第一步。它将成为网络科学学生和研究人员的宝贵资源。

英文摘要

Understanding the processes behind the evolution of complex networks is a key objective in network science. An effective framework for tackling this challenge is network model selection, which involves finding the model from a set of candidates that best explains a given network. This book is a systematic review of methods for this purpose. Each method is outlined in three parts: its core principle (used to organize methods into four categories), other relevant details including my own observations, and software availability. The book provides a comprehensive overview of the state-of-the-art in network model selection and concludes by exploring future directions. A unified, optimal method could identify the mechanisms that shape real-world networks more precisely than any current approach. This work represents the first step toward developing such an optimal method. It will be a valuable resource for students and researchers in network science.

2606.05942 2026-06-05 stat.ML cs.LG

EML-CD: Causal Mechanism Recovery via EML Symbolic Trees in Structure Learning

EML-CD:通过结构学习中的EML符号树进行因果机制恢复

Sota Asanuma

AI总结 提出EML-CD框架,利用EML算子构建可解释的因果机制符号树,从数据中自动发现闭式因果方程,在真实和合成数据上实现了机制恢复与结构学习的平衡。

详情
AI中文摘要

基于神经网络(NN)的非线性因果发现方法能够恢复DAG结构,但将每个因果机制视为黑箱。Waxman等人认为从NN权重中提取因果机制是不适定的。我们提出EML-CD,一个将EML算子(能够从单个二元运算符组合初等函数)集成到因果结构学习中的框架,以可解释的机制恢复为主要目标。EML-CD将每条边机制表示为门控EML二叉树,并自动发现闭式因果方程。解析雅可比矩阵可直接从输出方程计算,从而定量理解因果效应。在真实数据(Sachs蛋白信号,d=11)上,EML-CD达到SHD=11.2±0.4(5次种子均值;基线为单次确定性运行),与PC/GES在种子方差内相当且低于CAM,同时为每条检测到的边附加闭式方程(精确率0.756,召回率0.365)。在已知机制的受控双变量测试中,EML-CD忠实恢复了11个初等函数族中的10个(留出形状相关性≥0.96;仅高频正弦部分恢复)。在符号合成基准上,EML-CD的留出机制f-MSE远低于固定SINDy字典且更稳定(均值3.67对比7644,后者因一次种子的灾难性外推而膨胀),尽管其结构恢复(SHD 14.0)仅与字典相当且低于专用优化器;在Causal Chambers光隧道子集上,深度2模型将F1分数从线性OLS-BIC的0.273提升至0.444。

英文摘要

Neural network (NN)-based nonlinear causal discovery methods recover DAG structure but leave each causal mechanism as a black box. Waxman et al. argued that extracting causal mechanisms from NN weights is ill-posed. We propose EML-CD, a framework that integrates the EML operator (capable of composing elementary functions from a single binary operator) into causal structure learning, with interpretable mechanism recovery as the primary objective. EML-CD represents each edge mechanism as a gated EML binary tree and automatically discovers closed-form causal equations. Analytical Jacobians can be directly computed from the output equations, enabling quantitative understanding of causal effects. On real data (Sachs protein signaling, d=11), EML-CD achieves SHD=11.2 +/- 0.4 (5-seed mean; baselines are single deterministic runs), on par with PC/GES within seed variance and below CAM, while attaching closed-form equations to each detected edge (precision 0.756, recall 0.365). In a controlled bivariate test with known mechanisms, EML-CD recovers 10 of 11 elementary function families faithfully (held-out shape correlation >= 0.96; only high-frequency sine is partial). On a symbolic synthetic benchmark, EML-CD attains a substantially lower and more stable held-out mechanism f-MSE than a fixed SINDy dictionary (mean 3.67 vs. 7644, the latter inflated by catastrophic extrapolation on one seed), although its structure recovery (SHD 14.0) only matches the dictionary and stays below specialized optimizers; on the Causal Chambers light-tunnel subset, a depth-2 model improves F1 over linear OLS-BIC (0.444 vs. 0.273).

2606.05935 2026-06-05 stat.CO stat.ME

Hessian-informed, Coordinate Friendly Hamiltonian Monte Carlo in Linear Time

Hessian感知的、坐标友好的线性时间哈密顿蒙特卡洛

Son Luu, Nikola Surjanovic, Zuheng Xu, Trevor Campbell, Alexandre Bouchard-Côté

AI总结 提出一种将Riemannian哈密顿蒙特卡洛(RHMC)固定点迭代的计算复杂度从O(d^2)降至O(d)的方法,适用于具有“坐标友好”结构的目标分布,包括广义线性模型及稠密/稀疏图模型。

详情
AI中文摘要

Riemannian哈密顿蒙特卡洛(RHMC)是一种有前景的MCMC方法,因为它能够适应位置相关的预处理和多步提议。虽然RHMC在低维表现良好,但在高维中由于每次固定点迭代需要$O(d^3)$的成本而变得不可行,其中$d$是目标密度的维数。即使位置相关的预处理器基于Hessian的对角线,每次固定点迭代的成本仍然是$O(d^2)$。在本文中,我们提出一种计算方法,对于具有“坐标友好”结构的目标,将对角预处理器的RHMC固定点迭代的计算复杂度从$O(d^2)$降低到$O(d)$。这类分布包括广义线性模型以及其他稠密和稀疏图模型。该方法表示为操作计算图,因此可以自动化地处理黑盒目标。最后,我们通过实验证明,与使用位置无关和位置相关预处理器的先进HMC NUTS算法相比,我们的RHMC实现在各种目标分布上每单位计算时间产生了更好的样本质量。

英文摘要

Riemannian Hamiltonian Monte Carlo (RHMC) is a promising MCMC methodology thanks to its ability to accommodate position-dependent preconditioning and multi-step proposals. While RHMC performs well in low dimensions, it becomes infeasible in high dimensions due to its $O(d^3)$ cost per fixed-point iteration, where $d$ is the dimension of the target density. Even when the position-dependent preconditioner is based on the diagonal of the Hessian, the cost is still $O(d^2)$ per fixed-point iteration. In this paper, we propose a computational method to reduce the computational complexity of RHMC fixed-point iterations with diagonal preconditioners from $O(d^2)$ to $O(d)$ for targets with ``coordinate friendly'' structures. This distribution class includes generalized linear models as well as other dense and sparse graphical models. The method is expressed as manipulating the compute graph and can therefore be automated to work on black box targets. Finally, we show empirically that our implementation of RHMC results in better sample quality per unit of compute time for various target distributions compared to state-of-the-art HMC NUTS algorithms with both position-independent and position-dependent preconditioners.

2606.05898 2026-06-05 stat.CO math.ST nlin.AO stat.TH

Designing Zero-Mean Feature Functions for Multimodal Distributions

为多峰分布设计零均值特征函数

Hiroshi Yamashita, Hideyuki Suzuki

AI总结 针对多峰分布下控制变量法估计期望方差大的问题,提出基于分布近似和密度比的零均值特征函数构造方法,有效降低双峰分布的估计方差。

详情
Comments
6 pages, 4 figures, 7 subfigures, submitted to the 2026 International Symposium on Nonlinear Theory and Its Applications (NOLTA2026)
AI中文摘要

为了提高期望的蒙特卡洛估计的准确性,可以使用一组零均值特征函数,称为控制变量。它们可以作为目标函数线性回归的特征函数,我们可以利用其残差获得无偏且方差减小的估计。一种已知的构造此类函数的方法是使用称为Stein等式的方法,但这些函数对于目标分布是多峰的情况并不足够。我们提出了一种不同的方法,基于分布近似和密度比来构造这些零均值函数。我们证明,结合这两种策略构造的函数可以有效降低双峰分布的估计方差。

英文摘要

To improve the accuracy of Monte Carlo estimation of expectations, a set of zero-mean feature functions, known as control variates, can be used. They can be used as feature functions for linear regression of the target function, and we can obtain an unbiased and variance-reduced estimate using its residual. One known way to construct such functions is a method using an equality called Stein's identity, but these functions are not sufficient for the case where the target distribution is multimodal. We propose a different approach to constructing these zero-mean functions based on distribution approximation and the density ratio. We demonstrate that combining the functions constructed by these two strategies can effectively reduce the estimation variance for a bimodal distribution.

2606.05871 2026-06-05 cs.IT cs.AI math.IT stat.ME

Compositional Boundaries for Density Fusion

密度融合的组合边界

Ratan Bahadur Thapa, Ali Darijani, Jürgen Beyerer, Steffen Staab

AI总结 研究分布式不确定性管理系统中加权概率密度的层次融合顺序不变性,证明在连续二元规则下,顺序不变的层次融合等价于归一化加权线性池化,并揭示了端点-候选f-散度平衡的局部几何障碍。

详情
AI中文摘要

分布式不确定性管理系统通常沿着由通信、隐私或调度约束选择的聚合树组合局部概率模型。最终密度应取决于加权源,而不是中间节点组合它们的特定顺序。我们将这一要求研究为加权概率密度的二元融合的代数组合性问题。核心问题是局部融合规则何时可以层次化执行同时保持顺序不变。我们为局部段值融合规则建立了一个组合边界。在具有加性输出权重和仅权重系数的连续二元规则类中,顺序不变的层次执行刻画了归一化加权线性池化;范数诱导的段平衡实现了相应的系数。平滑端点-候选$f$-散度平衡具有不同的局部几何:其二次展开引入了平方根有效权重,表明仅凭成对可解性不足以实现调度无关的融合。我们证明这一障碍是端点-候选二元平衡所特有的,而全局散度重心保留了加性权重的局部极限。最后,高斯混合展示了相同问题如何在有限模型类中出现:精确融合是组合的,而逐步压缩仅在未归一化分量测度的同余条件下才是组合的。这些结果区分了精确的调度无关融合与全局聚合目标及局部近似启发式。

英文摘要

Distributed uncertainty-management systems often combine local probabilistic models along aggregation trees chosen by communication, privacy, or scheduling constraints. The final density should depend on the weighted sources, not on the particular order in which intermediate nodes combine them. We study this requirement as an algebraic compositionality problem for binary fusion of weighted probability densities. The central question is when a local fusion rule can be executed hierarchically while remaining order-invariant. We establish a compositional boundary for local segment-valued fusion rules. Within the class of continuous binary rules with additive output weights and weight-only coefficients, order-invariant hierarchical execution characterizes normalized weighted linear pooling; norm-induced segment balancing realizes the corresponding coefficient. Smooth endpoint-to-candidate $f$-divergence balancing has a different local geometry: its quadratic expansion induces square-root effective weights, showing why pairwise solvability alone is insufficient for schedule-independent fusion. We show that this obstruction is local to endpoint-to-candidate binary balancing, whereas global divergence barycenters retain additive-weight local limits. Finally, Gaussian mixtures show how the same issue appears in finite model classes: exact fusion is compositional, whereas stepwise compression is compositional only under a congruence condition on unnormalized component measures. These results distinguish exact schedule-independent fusion from global aggregation objectives and local approximation heuristics.

2606.05779 2026-06-05 cs.CR cs.AI stat.ML

TinyML-Driven Cybersecurity for Autonomous Spacecraft: Latency-Accuracy Analysis for SPARTA RF and Cyber Threat Detection

TinyML驱动的自主航天器网络安全:SPARTA射频与网络威胁检测的延迟-精度分析

Van Le, Trevor Tran, Tan Le

AI总结 针对自主航天器,基于SPARTA攻击模型分析TinyML兼容经典模型(随机森林、逻辑回归、SVM、MLP)在检测多种网络射频威胁时的延迟-精度权衡,发现逻辑回归在微秒级推理下仅比随机森林精度低1%,适合作为机载自主基线。

详情
Comments
Twenty Fifth International Conference on Security & Management (SAM'26)
AI中文摘要

自主航天器需要快速、轻量且可靠的在轨检测网络射频威胁。利用SPARTA攻击模型,我们分析了TinyML兼容的经典模型——随机森林、逻辑回归、支持向量机和多层感知机——在检测上行链路干扰、Fake-NR欺骗、有效载荷操纵、地面段妥协和未授权命令注入时的延迟-精度权衡。我们对每个模型的计算复杂度、VC维、Lipschitz连续性和延迟缩放进行了基于物理的理论分析,并通过在通过BandErasure、FakeNR和NoiseBurst损坏模式生成的对抗性射频频谱图上的经验测量加以支持。结果表明,逻辑回归实现了微秒级推理,且相对于随机森林仅下降1%的精度,使其成为机载自主的有效TinyML基线。该研究还指出了通过更丰富的特征编码器和多时间尺度学习架构来推进航天器网络安全的机会,这建立在边缘智能和可信AI的最新进展之上。

英文摘要

Autonomous spacecraft require rapid, lightweight, and reliable onboard detection of cyber-RF threats. Using the SPARTA attack model, we analyze the latency-accuracy trade-offs of TinyML-compatible classical models -- Random Forest, Logistic Regression, SVM, and MLP -- for detecting uplink jamming, Fake-NR spoofing, payload manipulation, ground-segment compromise, and unauthorized command injection. We present a physics-informed theoretical analysis of each model's computational complexity, VC dimension, Lipschitz continuity, and latency scaling, supported by empirical measurements on adversarial RF spectrograms generated via BandErasure, FakeNR, and NoiseBurst corruption modes. Results show that Logistic Regression achieves microsecond-level inference with only a 1\% accuracy drop relative to Random Forest, making it an effective TinyML baseline for onboard autonomy. The study also identifies opportunities for advancing spacecraft cybersecurity through richer feature encoders and multi-timescale learning architectures, building on recent progress in edge intelligence and trustworthy AI.

2606.05733 2026-06-05 cs.LG cs.CE q-fin.CP stat.ML

Zero-Copy Semantic Contagion: An In-Memory Streaming Architecture for Evolving Attention Graphs

零拷贝语义传染:一种用于演化注意力图的内存流式架构

Kabir Murjani

AI总结 提出一种基于Rust-Python的异构流式架构,通过零拷贝解析和神经霍克斯过程实现跨公司注意力图的实时构建与推理,在FNSPID语料库上相比随机基线提升1.70倍精度。

详情
Comments
Accepted to the 2026 ACM SIGMOD Workshop on Data Management for the Modern Financial Systems (FinDS). 10 pages, 4 figures
AI中文摘要

按代码预测模型主导金融时间序列工作,但仍无法捕捉跨公司传播:台湾的晶圆厂中断在单资产模型中不会显现,直到苹果自己的价格已经变动。为解决这一局限,我们引入一种异构的Rust-Python流式架构,将跨公司注意力映射为直接由文本驱动的连续时间图。我们表明,在摄取端,零拷贝Rust边缘解析新闻记录约需100纳秒,并在约1.2微秒内扫描目标股票宇宙。在推理端,一个多变量神经霍克斯过程,具有每节点连续时间LSTM状态和双线性潜在投影,传播定向激发,而自适应剪枝规则限制了动态邻域更新的计算成本。结合这些阶段,我们展示了在单个商用CPU上,每条传入新闻记录的端到端处理延迟约为13毫秒。在FNSPID语料库(47个代码的638篇文章)的一个月时间保持集上评估,该系统在90百分位次日回报阈值下,相比随机基线精度提升1.70倍,相比同行业基线提升3.36倍。关键的是,移除图拓扑结构会使精度降至零,证实动态注意力网络是该架构中跨公司信号的唯一驱动因素。

英文摘要

Per-ticker forecasting models dominate financial time-series work yet remain blind to cross-company propagation: a foundry disruption in Taiwan does not register in a single-asset model until Apple's own price has already moved. To address this limitation, we introduce a heterogeneous Rust-Python streaming architecture that maps cross-company attention as a continuous-time graph driven directly from text. We show that on the ingestion side, a zero-copy Rust edge parses news records in $\sim$100 ns and scans the target equity universe in $\sim$1.2 $μ$s. On the inference end, a multivariate Neural Hawkes Process featuring per-node continuous-time LSTM states and a bilinear latent projection propagates directed excitation, while an adaptive pruning rule bounds the computational cost of dynamic neighborhood updates. Combining these stages, we demonstrate an end-to-end processing latency of $\sim$13 ms per incoming news record on a single commodity CPU. Evaluated on a one-month temporal holdout of the FNSPID corpus (638 articles across 47 tickers), the system delivers a $1.70\times$ precision lift over random at the 90th-percentile next-day return threshold, and $3.36\times$ over a same-sector baseline. Crucially, removing the graph topology collapses precision to zero, confirming that the dynamic attention network is the sole driver of cross-company signal in this architecture.

2606.05681 2026-06-05 math.ST stat.TH

Local increment inference for time-inhomogeneous drift in Gaussian processes

高斯过程中时变漂移的局部增量推断

Yasutaka Shimizu

AI总结 针对高频观测下高斯过程模型中的确定性漂移结构,提出基于一阶增量的最小二乘对比估计方法,证明其相合性和渐近正态性,并揭示收敛速度依赖于高斯噪声局部粗糙度和漂移长期信息积累结构。

详情
AI中文摘要

我们研究高频观测下高斯过程模型中确定性漂移结构的统计推断。观测过程由一个中心平稳高斯分量与一类广泛的时变确定性漂移组成。为了估计漂移参数,我们引入基于一阶增量的最小二乘型对比函数。我们在高斯分量的弱依赖条件下建立了相合性和渐近正态性。该框架的一个核心特征是估计量的收敛速度联合依赖于高斯噪声的局部粗糙度和漂移产生的长期信息积累结构。该理论适用于广泛的漂移族,包括可积、多项式型和周期结构。特别地,不同的漂移密度产生性质不同的统计机制,包括非标准收敛速度以及持久或增长确定性结构的加速收敛速度。

英文摘要

We study statistical inference for deterministic drift structures in Gaussian process models under high-frequency observations.The observed process consists of a centered stationary Gaussian component combined with a broad class of time-inhomogeneous deterministic drifts. To estimate the drift parameter, we introduce a least squares-type contrast based on first-order increments. We establish consistency and asymptotic normality under weak dependence conditions on the Gaussian component. A central feature of the framework is that the rate of convergence of the estimator depends jointly on the local roughness of the Gaussian noise and the long-time information accumulation structure generated by the drift. The theory accommodates a wide range of drift families, including integrable, polynomial-type, and periodic structures. In particular, different drift densities produce qualitatively different statistical regimes, including non-standard rates of convergence and accelerated rates for persistent or growing deterministic structures.

2606.05676 2026-06-05 stat.ME stat.CO

regcorr: An R Package for Regression Models of Pearson Correlation Coefficients

regcorr:用于皮尔逊相关系数回归模型的R包

Ze Lin, Bo Li, Jinyao Shen

AI总结 提出regcorr R包,实现皮尔逊相关系数与协变量线性预测器的回归模型,支持双变量正态和伯努利响应,提供牛顿-拉夫森估计、模拟数据生成和自助法子程序。

详情
Comments
8 pages. R package available on CRAN
AI中文摘要

皮尔逊相关系数通常用作两个响应之间关联的单一数值总结。然而,在许多应用中,关联强度本身是异质的,可能随人口统计学、生物学、实验或环境协变量而变化。regcorr包实现了回归模型,其中皮尔逊相关系数与协变量的线性预测器相关联。该包支持双变量正态响应和双变量伯努利响应,提供牛顿-拉夫森估计例程,包含用于模拟研究的数据生成器,并提供了一个基于自助法的子程序,用于评估协变量效应的显著性和功效。实现遵循Dufera、Liu和Xu(2023)的基于似然的框架,并通过轻量级R接口暴露,无需编译代码且依赖最小。本文描述了统计模型、regcorr的计算设计、可重复的使用示例以及解释协变量依赖相关系数的实用指南。该包可从综合R档案网络(https://CRAN.R-project.org/package=regcorr)获取,采用MIT许可证。

英文摘要

Pearson's correlation coefficient is commonly used as a single-number summary of association between two responses. In many applications, however, the strength of association is itself heterogeneous and may vary with demographic, biological, experimental, or environmental covariates. The regcorr package implements regression models in which a Pearson correlation coefficient is linked to a linear predictor of covariates. The package supports bivariate normal responses and bivariate Bernoulli responses, provides Newton-Raphson estimation routines, includes data generators for simulation studies, and supplies a bootstrap-based subroutine for assessing the significance and power of covariate effects. The implementation follows the likelihood-based framework of Dufera, Liu, and Xu (2023) and exposes it through a lightweight R interface with no compiled code and minimal dependencies. This paper describes the statistical model, the computational design of regcorr, reproducible usage examples, and practical guidance for interpreting covariate-dependent correlations. The package is available from the Comprehensive R Archive Network at https://CRAN.R-project.org/package=regcorr under the MIT license.

2606.05672 2026-06-05 math.ST stat.TH

Trace-Class Results for MCMC Algorithms for Student-t Regression Models

Student-t 回归模型的 MCMC 算法的迹类结果

Yasuyuki Hamura

AI总结 本文研究 Student-t 回归模型的 MCMC 算法,通过分析迹类性质来评估马尔可夫链的效率,发现标准数据增广算法在无信息先验下不是迹类,而折叠 Gibbs 算法是迹类;在正态-逆伽马先验下标准算法是迹类。

详情
Comments
14 pages
AI中文摘要

本文考虑 Student-$t$ 回归模型的 MCMC 算法。我们根据迹类结果是否成立来研究基于这些算法的马尔可夫链的效率。我们首先考虑回归系数和误差方差服从不变的不恰当先验分布的情况。与标准数据增广算法相关的马尔可夫算子不是迹类的,但与折叠 Gibbs 算法相关的算子是迹类的。接下来我们考虑参数服从正态-逆伽马分布的情况。在这种情况下,标准马尔可夫算子是迹类的。

英文摘要

In this paper, we consider MCMC algorithms for Student-$t$ regression models. We investigate the efficiency of Markov chains based on the algorithms in terms of whether trace-class results hold or not. We first consider the case where the regression coefficients and error variance follow the invariant improper prior distributions. The Markov operator associated with a standard data augmentation algorithm is not trace-class but that associated with a collpased Gibbs algorithm is trace-class. We next consider the case where the parameters follow a normal-inverse gamma distribution. In this case, the standard Markov operator is trace-class.

2606.05666 2026-06-05 stat.ME

Weighting a Census as a Non-Probability Sample: A Doubly Robust Framework for Correcting Differential Undercoverage in Uruguay's 2023 Census

将人口普查视为非概率样本进行加权:纠正乌拉圭2023年人口普查差异覆盖不足的双重稳健框架

Ferreira Juan Pablo, Goyeneche Juan Jose

AI总结 针对乌拉圭2023年人口普查中非随机覆盖不足问题,提出将有效普查住户视为非概率样本,采用双重稳健估计器结合响应倾向模型与校准方法,以纠正选择偏差并稳健估计社会指标。

详情
AI中文摘要

2023年乌拉圭人口普查记录了3,444,451人口,估计覆盖不足10.3%。普查后证据显示,遗漏是非随机的,集中在脆弱地区、农村地区和年轻成年人中。整合行政记录恢复了汇总计数,但未能解决结果变量中的选择偏差,因为行政记录缺乏核心普查变量,存在城市性和机构可见性偏差,且不重建家庭。基于枚举微观数据的估计仍然有偏。我们将有效普查住户视为具有未知选择机制的非概率样本,并使用双重稳健估计器构建权重。该框架结合了使用网络链接率作为联系代理的分段级响应倾向模型,以及校准到联合普查人口总数(性别、年龄、省份)。由于双重稳健估计器在任一模型正确指定时都是一致的,因此它提供了对覆盖不足错误指定的稳健性。我们描述了在300万记录规模上的应用,记录了其对社交指标的影响,并提出了基于等效分层聚类设计的方差近似。最后,我们建立了一个方法论框架,以指导国家统计机构根据其可用的登记册和辅助数据优化无响应调整。

英文摘要

The 2023 Uruguayan Census recorded a population of 3,444,451 with an estimated undercoverage of 10.3%. Post-enumeration evidence shows that omission was non-random, concentrated in vulnerable areas, rural territories, and among young adults. Integrating administrative records (AR) recovered aggregate counts but did not resolve selection bias in outcome variables, as AR lack core census variables, exhibit urbanicity and institutional-visibility biases, and do not reconstruct households. Estimates derived from enumerated microdata remain biased. We treat effectively enumerated households as a non-probability sample with an unknown selection mechanism and construct weights using a doubly robust (DR) estimator. This framework combines a segment-level response-propensity model, using the web linkage rate as a contact proxy, with calibration to combined-census demographic totals (sex, age, department). Because the DR estimator is consistent when either model is correctly specified, it provides robustness against undercoverage misspecification. We describe the application at a scale of three million records, document its effect on social indicators, and present a variance approximation based on an equivalent stratified cluster design. Finally, we establish a methodological framework to guide national statistical offices on optimizing non-response adjustments based on their available registers and paradata.

2606.05649 2026-06-05 stat.CO cs.LG

Diff2SP: Diffusion Models for Correlated Scenario Generation in Stochastic Programming

Diff2SP:随机规划中相关场景生成的扩散模型

Haixiang Sun, Andrew Liu

AI总结 提出Diff2SP扩散生成框架,将下游优化目标嵌入场景生成过程,通过理论证明和经验验证实现统计一致性与决策感知的平衡。

详情
AI中文摘要

场景生成是随机规划(SP)中的关键组成部分,直接影响不确定性下决策的质量。现有方法主要依赖于基于采样的技术或使用神经网络的监督学习。基于采样的方法通常难以捕捉复杂依赖关系和罕见但可能的事件,而监督学习需要固定的输入-输出对进行训练,且生成不受预定义模式或规则限制的多样化现实场景的能力有限。为了解决这些局限性,我们引入了Diff2SP,一种基于扩散的生成框架,将下游优化目标直接融入场景生成中。与将场景生成和决策制定视为独立步骤的传统方法不同,Diff2SP将随机优化嵌入训练过程,从而生成既统计一致又具有决策感知的场景。为了正式证明这种优化感知设计的合理性,我们建立了将分布精度与决策质量联系起来的遗憾界,并建立了样本复杂度保证,显示出比传统生成模型(如GAN)更快的收敛速度。在合成数据集和电力系统数据集上的实证结果验证了这些理论见解,表明Diff2SP在统计保真度和下游优化结果上均有一致提升。

英文摘要

Scenario generation is a critical component in stochastic programming (SP), as it directly influences the quality of decision-making under uncertainty. Existing approaches predominantly rely on either sampling-based techniques or supervised learning using neural networks. Sampling-based techniques often struggle to capture complex dependencies and rare but plausible events, while supervised learning requires fixed input-output pairs for training and is limited in its ability to generate a wide variety of realistic scenarios that are not restricted by predefined patterns or rules. To address these limitations, we introduce Diff2SP, a diffusion-based generative framework that incorporates downstream optimization objectives directly into scenario generation. Unlike conventional methods that treat scenario generation and decision-making as separate steps, Diff2SP embeds stochastic optimization into the training process, enabling the generation of scenarios that are both statistically coherent and decision-aware. To formally justify this optimization-aware design, we establish a regret bounds that link distributional accuracy to decision quality, and establish sample complexity guarantees showing faster convergence than traditional generative models such as GANs. Empirical results on both synthetic and power-system datasets validate these theoretical insights, demonstrating that Diff2SP consistently improves both statistical fidelity and downstream optimization outcomes.

2606.05623 2026-06-05 q-fin.RM stat.AP

Bankruptcy Prediction from 10-K Narratives: Evidence from Interpretable Text Scores and Accounting Baselines

基于10-K叙述的破产预测:来自可解释文本分数与会计基线的证据

Zhen Zhang, Moxuan Zheng, Tongchen Zhang, Luyun Lin, Yiqing Wang, Lixing Lin

AI总结 本文通过构建可解释的破产前压力分数,验证了10-K叙述文本在传统会计变量之外对破产预测的增量信息,显著提升了AUC和顶部十分位破产捕获率。

详情
Comments
23 pages
AI中文摘要

破产是一种低频率但高影响的企业事件,因此早期风险识别对债权人、投资者、监管者和风险管理者至关重要。传统的破产预测模型主要依赖会计比率,但这些指标可能仅在财务恶化出现在已报告的财务报表中时才反映出来。因此,年度10-K文件中的叙述性披露可能提供关于新兴困境的增量预警信号。本研究考察了10-K叙述是否能在传统会计变量之外改进破产预测。使用与10-K文本、SEC财务报表数据以及来自佛罗里达-加州大学洛杉矶分校-LoPucki破产研究数据库的破产事件匹配的公司年度观测值,分析评估了10-K提交日期后一年内的破产风险。本文开发了一个透明的破产前压力分数,这是一种基于词典的度量,旨在捕捉与流动性和资金压力、债务契约和再融资压力、经营恶化、重组和法律困境以及业务脆弱性相关的困境特定语言。该分数与一个五变量会计基线和Loughran-McDonald词典基准进行了评估。在主要的一年期保留样本测试中,添加破产前压力分数使AUC从0.8323提高到0.9019,并将顶部十分位破产捕获率从44.12%提高到64.71%。正向增量模式在bootstrap推断、替代会计基准、替代结果定义和时段外验证中仍然可见。研究结果表明,困境特定的10-K叙述为破产风险监测提供了超越传统会计比率的可解释增量信息。

英文摘要

Bankruptcy is a low-frequency but high-impact corporate event, making early risk identification important for creditors, investors, regulators, and risk managers. Traditional bankruptcy-prediction models rely primarily on accounting ratios, but these measures may reflect financial deterioration only after it appears in reported financial statements. Narrative disclosures in annual 10-K filings may therefore provide incremental warning signals about emerging distress. This study examines whether 10-K narratives improve bankruptcy prediction beyond conventional accounting variables. Using firm-year observations matched to 10-K text, SEC financial statement data, and bankruptcy events from the Florida-UCLA-LoPucki Bankruptcy Research Database, the analysis evaluates bankruptcy risk over the year following the 10-K filing date. The paper develops a transparent Pre-Bankruptcy Stress (PB Stress) Score, a dictionary-based measure designed to capture distress-specific language related to liquidity and funding stress, debt covenant and refinancing stress, operating deterioration, restructuring and legal distress, and business fragility. The score is evaluated against a five-variable accounting baseline and a Loughran-McDonald dictionary benchmark. In the primary one-year holdout test, adding the PB Stress Score increases AUC from 0.8323 to 0.9019 and raises top-decile bankruptcy capture from 44.12% to 64.71%. The positive incremental pattern remains visible across bootstrap inference, alternative accounting benchmarks, alternative outcome definitions, and out-of-time validation. The findings indicate that distress-specific 10-K narratives provide interpretable incremental information for bankruptcy-risk monitoring beyond conventional accounting ratios.

2606.05599 2026-06-05 cs.LG math.ST stat.ME stat.ML stat.TH

Mitigating the Curse of Dimensionality in Uniform Convergence of Deep Neural Networks via Smooth Activations

通过平滑激活函数缓解深度神经网络一致收敛中的维度灾难

Yizhe Ding, Runze Li, Jia Liu, Lingzhou Xue

AI总结 本文通过分析平滑激活深度神经网络,建立了统一收敛的理论框架,证明其能够通过自适应利用目标函数的低维层次组合结构来缓解维度灾难。

详情
Comments
30 pages, 5 figures
AI中文摘要

本文为平滑激活深度神经网络(DNN)估计量的一致收敛建立了理论框架。虽然标准ReLU网络在各种非参数回归任务中,在$L^2(P)$范数下达到了极小化最优速率,但我们建立了一个理论下界,表明最小二乘ReLU估计量在其一致收敛行为中可能遭受维度灾难。受下游任务中对最坏情况可靠性的需求驱动,我们通过分析平滑激活DNN(平滑DNN),包括前馈和残差结构,来解决这一局限性。我们为这些模型的逼近器建立了新的伪维数界、非渐近逼近保证和Hölder范数界。利用这些结果,我们推导了平滑DNN估计量在多种统计上下文(包括Huber回归、最小二乘回归、分位数回归和逻辑回归)中的非渐近一致收敛速率。我们证明,平滑DNN可以通过自适应利用目标函数的低维层次组合结构来缓解一致收敛中的维度灾难。通过模拟研究和实际应用的支持,我们的结果将平滑DNN定位为在需要一致保证的统计学习任务中,理论上合理且实践上可行的ReLU网络替代方案。

英文摘要

This paper establishes a theoretical framework for the uniform convergence of smoothly activated deep neural network (DNN) estimators. While standard ReLU networks achieve minimax-optimal rates in the $L^2(P)$ norm for various nonparametric regression tasks, we establish a theoretical lower bound demonstrating that least-squares ReLU estimators can suffer from the curse of dimensionality in their uniform convergence behavior. Motivated by the need for reliable uniform guarantees in downstream tasks requiring worst-case reliability, we address this limitation by analyzing smoothly activated DNNs (smooth DNNs), encompassing both feedforward and residual structures. We establish novel pseudo-dimension bounds, non-asymptotic approximation guarantees, and Hölder-norm bounds for the approximators of these models. Leveraging these results, we derive non-asymptotic uniform convergence rates for smooth DNN estimators across multiple statistical contexts, including Huber, least-squares, quantile, and logistic regression. We prove that smooth DNNs can mitigate the {curse of dimensionality} in uniform convergence by adaptively exploiting the low-dimensional hierarchical composition structure of the target function. Supported by both simulation studies and a real-world application, our results position smooth DNNs as a theoretically grounded and practically viable alternative to ReLU networks for statistical learning tasks requiring uniform guarantees.

2606.05560 2026-06-05 stat.ME math.ST stat.ML stat.TH

Wasserstein Exponential Smoothing

Wasserstein 指数平滑

Takuo Matsubara, Peiwen Jiang, Minh-Ngoc Tran, Wilson Ye Chen

AI总结 本文提出 Wasserstein 空间中的指数平滑方法,用于分布时间序列预测,并证明通过最小化 Wasserstein 距离可一致估计平滑参数。

详情
AI中文摘要

指数平滑(ES)在多种数据生成过程的时间序列预测中通常优于其他技术。虽然 ES 传统上应用于 $\mathbb{R}$ 中的时间序列,但本文将方法扩展到分布时间序列,其中每个观测值是 $\mathbb{R}$ 上的概率分布。本研究的主要贡献有两方面。首先,我们在 Wasserstein 空间中提出了一种有原则且直观的 ES 推广,保留了经典 ES 的卓越简洁性。其次,我们从理论和实证上证明,通过最小化 Wasserstein 距离可以一致地估计平滑参数。在高频金融收益和家庭电力需求的分布时间序列上的应用证实了我们 Wasserstein ES 模型的实际有效性。

英文摘要

Exponential smoothing (ES) often outperforms other techniques in time series forecasting across a wide range of data-generating processes. While ES has traditionally been applied to time series in $\mathbb{R}$, this paper extends the methodology to distributional time series, where each observation is a probability distribution on $\mathbb{R}$. The primary contribution of this work is twofold. First, we propose a principled and intuitive generalization of ES within the Wasserstein space, which retains the exceptional parsimony of classical ES. Second, we theoretically and empirically demonstrate that the smoothing parameter can be consistently estimated by minimizing a Wasserstein distance. Applications to distributional time series of high-frequency financial returns and household electricity demands confirm the practical effectiveness of our Wasserstein ES model.

2606.05488 2026-06-05 stat.ML cs.LG stat.ME

Sparse Functional Singular Value Decomposition for Biclustering and Triclustering Longitudinal Data

纵向数据的稀疏函数奇异值分解用于双聚类和三聚类

Yue Zhao, Thierry Chekouo, Sandra Safo

AI总结 提出Tri-SfSVD框架,通过稀疏惩罚同时进行连续轨迹估计与对象、特征和时间选择,实现纵向数据中的双聚类和三聚类,优于现有方法。

详情
AI中文摘要

识别复杂疾病(如炎症性肠病,IBD)的亚型通常需要捕捉纵向组学数据中的潜在模式。然而,这些数据通常是高维、稀疏采样且时间上不规则观测的,对传统的(双)聚类和函数数据分析方法构成了重大挑战。我们提出Tri-SfSVD,一个统一的稀疏函数奇异值分解框架,用于发现纵向数据中的双聚类和三聚类。与现有的依赖于临时插值或强制限制性形状同质性假设的函数双聚类方法不同,Tri-SfSVD在单个优化框架中集成了连续轨迹估计与同时的对象、特征和时间选择。通过在对象、变量和时间子区域上施加稀疏惩罚,所提出的方法直接对观测数据操作,以发现对象级、对象-特征级和对象-特征-时间级的局部结构。大量模拟表明,Tri-SfSVD在高维设置下优于现有方法。应用于IBD多组学数据,该方法识别了三个双聚类,将样本聚类与不同的IBD相关临床特征以及特定细菌类群相关的微生物通路组联系起来,提供了可解释的对象-通路关联以表征疾病异质性。应用于多通道脑电图数据,该方法识别了三个三聚类,将样本聚类与不同的酒精相关表型以及局部脑活动模式联系起来,包括同一空间区域内由时间子区域分隔的亚组差异。

英文摘要

Identifying subtypes of complex conditions, such as Inflammatory Bowel Disease (IBD), often requires capturing latent patterns in longitudinal omics data. However, these data are typically high-dimensional, sparsely sampled, and irregularly observed over time, posing substantial challenges for conventional (bi)clustering and functional data analysis methods. We propose Tri-SfSVD, a unified sparse functional Singular Value Decomposition framework for discovering biclusters and triclusters in longitudinal data. Unlike existing functional biclustering methods that rely on ad hoc imputation or enforce restrictive shape-homogeneity assumptions, Tri-SfSVD integrates continuous trajectory estimation with simultaneous subject, feature, and temporal selection within a single optimization framework. By imposing sparse penalties across subjects, variables, and temporal subregions, the proposed method works directly on observed data to uncover localized structures at the subject, subject-feature, and subject-feature-time levels. Extensive simulations demonstrate that Tri-SfSVD outperforms existing approaches in high-dimensional settings. Applied to IBD multi-omics data, the method identified three biclusters linking sample clusters with distinct IBD-related clinical characteristics to microbial pathway groups associated with specific bacterial taxa, providing interpretable subject-pathway associations for characterizing disease heterogeneity. Applied to multi-channel EEG data, the method identified three triclusters linking sample clusters with distinct alcohol-related phenotypes to localized brain activity patterns, including subgroup differences separated by temporal subregions within the same spatial region.

2606.05462 2026-06-05 math.DS cs.NA math.NA stat.CO stat.ML

A Two-Channel F-Transform Representation for Early Trajectory Characterization in Iterated Correlation Dynamics

迭代相关动力学中早期轨迹表征的双通道F-变换表示

Ishrak Alhajj Hassan

AI总结 针对迭代相关动力学中早期轨迹难以直接比较的问题,提出一种基于双通道F-变换的紧凑、可解释的模糊坐标表示,用于保留后续收敛和轨迹几何信息。

详情
AI中文摘要

许多非线性迭代过程产生高维轨迹,其早期行为信息丰富但难以直接比较。本文研究一个软计算表示问题:如何将短的早期轨迹段转换为紧凑、可解释、固定维度的模糊坐标,以保留后续收敛和轨迹几何信息。该问题针对迭代Pearson相关矩阵进行研究,这是一种历史上与CONCOR型分块建模和重复相关方法相关的非线性矩阵迭代。所提出的描述符使用早期瞬态后阶段的两个对数信号:步长信号(测量收缩幅度)和收缩比信号(测量局部收缩演化)。每个信号通过零度F-变换系数和一个中心一阶系数投影到三节点三角模糊划分上。这产生了一个八维双通道表示,将局部水平与局部趋势分离,并将收缩幅度与收缩演化分离。在22个矩阵维度上,每个维度有1000条轨迹,使用随机森林回归进行收敛长度近似,将描述符与原始轨迹样本、统计摘要和PCA压缩的原始特征进行比较。其平均R^2=0.6480,接近原始轨迹(0.6518)和统计摘要(0.6528),优于仅步长信号的F-变换描述符(0.5001)。重复随机分割和滑动窗口实验证实了稳定性。PCA和聚类进一步显示出可重复的低维组织,前两个主成分解释了84.26%的方差,且平均轮廓准则支持k=3。

英文摘要

Many nonlinear iterative procedures generate high-dimensional trajectories whose early behavior is informative but difficult to compare directly. This paper studies a soft-computing representation problem: how to convert a short early trajectory segment into compact, interpretable, fixed-dimensional fuzzy coordinates that preserve information about subsequent convergence and trajectory geometry. The problem is investigated for iterated Pearson correlation matrices, a nonlinear matrix iteration historically connected with CONCOR-type blockmodeling and repeated-correlation methods. The proposed descriptor uses two logarithmic signals from the early post-transient regime: a step-size signal, measuring contraction magnitude, and a contraction-ratio signal, measuring local contraction evolution. Each signal is projected onto a three-node triangular fuzzy partition using zero-degree F-transform coefficients and one centered first-degree coefficient. This yields an eight-dimensional two-channel representation separating local level from local trend and contraction magnitude from contraction evolution. Across 22 matrix dimensions with 1000 trajectories per dimension, the descriptor is compared with raw trajectory samples, statistical summaries, and PCA-compressed raw features using Random Forest regression for convergence-length approximation. It achieves mean R^2 = 0.6480, close to raw trajectories (0.6518) and statistical summaries (0.6528), while improving over the step-size-only F-transform descriptor (0.5001). Repeated random-split and shifted-window experiments confirm stability. PCA and clustering further show reproducible low-dimensional organization, with the first two principal components explaining 84.26% of variance and k = 3 favored by the mean silhouette criterion.

2606.05420 2026-06-05 cs.AI stat.AP

Assessing the Carbon Emissions and Energy Consumption of U.S. Hyperscale Data Centers

评估美国超大规模数据中心的碳排放与能源消耗

Gianluca Guidi, Francesca Dominici, Tiziano Squartini, Callaway Sprinkle, Jonathan Gilmour, Kevin Butler, Eric Bell, Scott Delaney, Falco J. Bargagli-Stoffi

AI总结 本研究通过收集403个美国超大规模数据中心设施级数据,估算其电力消耗、电力来源及二氧化碳排放,发现其电力需求约占美国总用电量的1.8%,且碳强度高于全国平均水平48%。

详情
AI中文摘要

美国超大规模数据中心(HDCs)的快速扩张,主要由人工智能的采用驱动,引发了人们对该行业环境足迹的担忧。我们汇编了2024年5月至2025年4月期间运营的403个美国超大规模数据中心的设施级信息,并估算了它们的电力消耗、电力来源及可归因的二氧化碳排放。在不同的设施负载情景下,这些HDC消耗了约68-99太瓦时的电力,并产生了约3700-5400万吨二氧化碳。在中心情景下,HDC电力需求约占美国总用电量的1.8%,其中约54%的归因发电由化石燃料来源提供。HDC电力加权平均碳强度约为545克二氧化碳/千瓦时,比同期美国国家电网平均碳强度370克二氧化碳/千瓦时高出约48%。我们的方法提供了一种归因工具,利用最新的EPA eGRID电厂级数据评估超大规模数据中心的环境足迹。

英文摘要

The rapid proliferation of hyperscale data centers (HDCs) in the US, mainly driven by the adoption of artificial intelligence, has raised concerns about this industry's environmental footprint. We compiled facility-level information on 403 US hyperscale data centers operating between May 2024 and April 2025 and estimated their electricity consumption, electricity sources, and attributable CO2 emissions. Across different facility-load scenarios, these HDCs consumed approximately 68-99 TWh of electricity and were associated with about 37-54 million metric tons of CO2. Under the central scenario, HDC electricity demand corresponded to approximately 1.8% of total US electricity consumption, with roughly 54% of attributed generation supplied by fossil-fuel sources. The HDC electricity-weighted average carbon intensity was approximately 545 gCO2/kWh, about 48% above the contemporaneous US national grid-average carbon intensity of 370 gCO2/kWh. Our approach provides an attributional tool for assessing the environmental footprint of hyperscale data centers using the most recent EPA eGRID plant-level data.

2606.05374 2026-06-05 stat.ME

Analyzing spatial point processes degraded by displacement and imperfect detection

分析受位移和不完全检测退化的空间点过程

Kevin M. Collins, Erin M. Schliep, Alan E. Gelfand, Tina M. Yack, Christopher W. Clark, Robert S. Schick

AI总结 针对空间泊松过程,提出一种同时学习位移尺度、参数化稀疏和非参数强度函数的方法,并通过模拟和鲸鱼叫声数据验证其有效性。

详情
AI中文摘要

空间点过程是用于解释位置数据的概率建模的有价值工具。然而,数据本身往往被不完全观测到。为了进行准确的推断,必须考虑这些不完美之处,我们称之为退化。我们考虑空间泊松过程的两种退化形式:稀疏和位移。首先,我们提供了关于模型可识别性的一些理论结果,表明在弱条件下,可以联合学习位移的尺度、稀疏的参数形式以及非参数强度函数。通过模拟研究,实证展示了学习所有这些组成部分的能力以及由此带来的推断改进,相比于概念上未退化但错误指定的模型。最后,我们将该方法应用于来自科德角湾的北大西洋露脊鲸叫声数据。

英文摘要

Spatial point processes are a valuable tool for probabilistic modeling to explain location data. However, the data themselves are often observed imperfectly. In order to perform accurate inference, one must account for these imperfections, which we refer to as degradation. We consider two forms of degradation for spatial Poisson processes: thinning and displacement. First, we provide some theoretical results on model identifiability, showing that, under weak conditions, one can jointly learn the scale of the displacement, a parametric form of thinning, and a nonparametric intensity function. The ability to learn all of these components and the resulting improvements for inference compared to the conceptual non-degraded but misspecified model are shown empirically via simulation study. Finally, we apply this approach to North Atlantic right whale call data from Cape Cod Bay.

2606.05371 2026-06-05 cs.LG cs.NA math.NA stat.ML

Mamba-Assisted Non-Markovian Closure for Reduced-Order Modeling

Mamba辅助的非马尔可夫闭合用于降阶建模

Zhi-Feng Wei, Saad Qadeer, Panos Stinis

AI总结 针对高维动力系统降阶建模中的非马尔可夫闭合项问题,提出Mamba辅助闭合框架,利用Mamba序列模型从已解析轨迹预测闭合项,并通过数值积分器耦合降阶方程,在粘性Burgers方程和混沌双尺度Lorenz '96系统上优于马尔可夫模型、GRU序列模型和Wilks方法。

详情
Comments
Code will be released upon acceptance
AI中文摘要

高维动力系统的降阶建模常常受到非马尔可夫闭合项的阻碍,该闭合项表示未解析变量对解析动力学的影响。受Mori--Zwanzig形式论的启发,其中闭合项采取解析轨迹的记忆泛函形式,我们将闭合建模重新表述为序列建模问题,并提出Mamba辅助闭合(MAC)框架:一个基于Mamba的序列模型,经过训练从解析轨迹预测闭合项,通过数值积分器与降阶控制方程耦合,以在时间上推进解析变量。该框架的一个关键特性是利用状态空间模型的双重表示——模型通过卷积形式以序列到序列的方式进行训练,并通过循环形式进行逐步自回归部署,从而实现高效的长轨迹训练和恒定的每步推理成本。在粘性Burgers方程和混沌双尺度Lorenz '96系统上,MAC模型在预测准确性和长时间展开稳定性方面显著优于马尔可夫降阶模型、基于GRU的序列模型和Wilks方法。

英文摘要

Reduced-order modeling of high-dimensional dynamical systems is often hindered by the non-Markovian closure term that represents the effect of unresolved variables on the resolved dynamics. Inspired by the Mori--Zwanzig formalism, in which the closure takes the form of a memory functional of the resolved trajectory, we recast closure modeling as a sequence modeling problem and propose the Mamba-Assisted Closure (MAC) framework: a Mamba-based sequence model, trained to predict the closure from the resolved trajectory, is coupled with the reduced-order governing equations through a numerical integrator to advance the resolved variables in time. A key feature of the framework is its exploitation of the dual representation of state-space models -- the model is trained in a sequence-to-sequence fashion via the convolutional form, and deployed for step-by-step autoregressive rollout via the recurrent form, yielding both efficient long-trajectory training and constant per-step inference cost. On the viscous Burgers' equation and the chaotic two-scale Lorenz '96 system, the MAC model substantially outperforms the Markovian reduced-order model, the GRU-based sequence model, and the Wilks method in predictive accuracy and long-time rollout stability.

2606.05365 2026-06-05 stat.ML cs.LG

Environment-Robust Representation Learning with Empirical Bayes

基于经验贝叶斯的环境鲁棒表示学习

Yuli Slavutsky, Matthew Shen, Bohan Wu, David M. Blei

AI总结 提出一种经验贝叶斯变分方法,通过跨环境平衡项学习不变潜在变量,实现对新环境的鲁棒预测,在天文、微生物和ICU数据上优于现有方法。

详情
AI中文摘要

我们考虑多环境预测问题。假设环境改变潜在变量的分布,而生成观测协变量和目标的机制在给定该变量条件下保持稳定。例如,医院或临床队列可能在潜在患者状态的流行率上有所不同,尽管这些状态、生理测量和结果之间的关系保持不变。给定来自多个环境的数据集,我们为这类问题构建了一个贝叶斯模型,并推导出相应的变分目标。我们证明该目标分解为每个环境项和由模型结构引起的额外跨环境平衡项。我们使用经验贝叶斯方法设置先验并将其纳入目标。基于该目标,我们开发了一种用于后验近似的摊销变分算法,并利用学习到的潜在变量在新环境中形成预测。我们通过模拟以及天文源识别、基于微生物组的疾病检测和ICU脓毒症预测的实际研究来研究我们的方法。在这些设置中,我们的方法在新环境预测方面优于先前的方法。

英文摘要

We consider multi-environment prediction problems. We assume the environments change the distribution of a latent variable, while the mechanisms generating observed covariates and targets remain stable conditional on that variable. For example, hospitals or clinical cohorts may differ in the prevalence of latent patient states, even though the relationships between those states, physiological measurements, and outcomes remain unchanged. Given a dataset from multiple environments, we formulate a Bayesian model for such problems and derive the corresponding variational objective. We show that this objective decomposes into per-environment terms and an additional cross-environment balancing term induced by the model's structure. We use an empirical Bayes method to set the prior and incorporate it into the objective. Based on this objective, we develop an amortized variational algorithm for posterior approximation, and use the resulting learned latent variables to form predictions in new environments.We study our approach through simulations and real-world studies of astronomical source identification, microbiome-based disease detection, and ICU sepsis prediction. Across these settings, our method outperforms previous approaches for prediction in new environments.

2606.05361 2026-06-05 stat.ML cs.LG

TabSODA: Tabular Diffusion based Imputation with Skip Pattern Detection and Ordinal Awareness

TabSODA: 基于表格扩散的插补方法,结合跳跃模式检测与序数感知

Yuyu Chen, Taehyo Kim, Hai Shu, Yang Feng

AI总结 提出TabSODA方法,通过EM框架下的扩散模型处理大规模调查中的结构跳跃和序数变量,在PATH和NSDUH数据集上显著提升插补精度。

详情
AI中文摘要

大规模调查中的缺失数据插补面临两个当前表格扩散方法未能很好处理的挑战。首先,\emph{结构跳跃}(因问卷设计而不可回答的单元格)不应被插补,但常与项目无回答混为一谈。其次,\emph{序数}响应编码了有序类别,但大多数流程通过独热或模拟位编码将其视为名义水平。我们提出 extbf{TabSODA}(具有跳跃模式检测和序数感知的表格扩散),一种基于期望最大化(EM)的扩散插补器,建立在阐明扩散模型(EDM)框架上。TabSODA通过去噪损失和逆时采样器传播结构跳跃,并用累积概率标量潜变量表示序数变量,同时保留名义变量的模拟位编码。当码本跳跃掩码可用时,TabSODA直接使用;否则,TabSODA+SKIP变体通过基于CART的跳跃模式挖掘器从原始响应和问卷顺序估计掩码。在烟草与健康人口评估(PATH)研究和全国药物使用与健康调查(NSDUH)这两个美国全国代表性调查中,TabSODA在MCAR、MAR和MNAR掩码下将序数MACE降低高达23.7%,并将分类准确率提高高达9%(相对于最强基线)。跳跃挖掘器在两个数据集上实现了近乎完美的精确度,使得TabSODA+SKIP能够紧密跟踪码本掩码变体。

英文摘要

Missing data imputation in large-scale surveys faces two challenges that are not well handled by current tabular diffusion methods. First, \emph{structural skips}, cells made inapplicable by questionnaire design, should not be imputed but are often conflated with item nonresponse. Second, \emph{ordinal} responses encode ordered categories, yet most pipelines treat them as nominal levels through one-hot or analog-bit encodings. We introduce \textbf{TabSODA} (\textbf{Tab}ular diffusion with \textbf{S}kip pattern detection and \textbf{O}r\textbf{d}inal \textbf{A}wareness), an Expectation-Maximization (EM)-based diffusion imputer built on the Elucidated Diffusion Model (EDM) framework. TabSODA propagates structural skips through the denoising loss and reverse-time sampler, and represents ordinal variables with cumulative-probit scalar latents while retaining analog-bit encodings for nominal variables. When a codebook skip mask is available, TabSODA uses it directly; otherwise, the TabSODA+SKIP variant estimates the mask from raw responses and questionnaire order using a CART-based skip-pattern miner. On Population Assessment of Tobacco and Health (PATH) study and the National Survey on Drug Use and Health (NSDUH), two nationally representative U.S.\ surveys, TabSODA reduces ordinal MACE by up to $23.7\%$ and improves categorical accuracy by up to $9\%$ over the strongest baseline across MCAR, MAR, and MNAR masking. The skip miner achieves near-perfect precision on both datasets, allowing TabSODA+SKIP to closely track the codebook-mask variant.

2606.05348 2026-06-05 cs.PL cs.LO stat.CO

Incremental Computation for Efficient Programmable Inference in Probabilistic Programs

概率程序中高效可编程推理的增量计算

Fabian Zaiser, Jack Czenszak, Martin C. Rinard, Vikash K. Mansinghka, Alexander K. Lew

AI总结 提出基于增量计算的方法,将概率程序编译为确定性密度函数,并通过增量λ演算实现函数式程序的增量计算,以加速蒙特卡洛推理,支持非参数模型,并保证正确性。

详情
Comments
Full version of the PLDI 2026 article, including proofs and other supplementary material
AI中文摘要

概率程序中的推理通常需要评估许多可能的程序执行,以找到后验密度高的那些。为了将推理扩展到大型数据集,关键是要在这些多次评估之间共享昂贵的中间结果,而不是从头开始重新计算。本文提出了一种实现这种共享的新方法,基于 extit{增量计算},这是一种在程序输入变化时高效重新计算(确定性)程序输出的技术。首先,我们展示了如何将表达性概率程序编译为计算其密度函数的确定性程序。然后,基于增量λ演算,我们开发了一种通用技术,用于组合式地增量计算表达性函数式程序,并将其应用于这些密度。得到的增量密度可用于加速广泛的蒙特卡洛推理算法,包括现有系统不支持的参数模型。此外,我们将增量密度计算分解为独立的密度和增量步骤,允许对正确性进行模块化推理——这是现有系统中的一个关键痛点,其中临时增量功能是已知的健全性错误来源。我们为每一步独立开发了指称逻辑关系论证,并在Julia原型中实现了该方法,发现在一系列模型和推理算法上,它导致数据集大小方面的渐近运行时间改进。

英文摘要

Inference in probabilistic programs generally requires evaluating many possible program executions to find those of high posterior density. To scale inference to large datasets, it is crucial that expensive intermediate results are shared across these many evaluations, rather than recomputed from scratch. This paper presents a new approach to realizing this sharing, based on \textit{incremental computation}, a technique for efficiently recomputing (deterministic) program outputs when program inputs change. First, we show how expressive probabilistic programs can be compiled to deterministic ones that compute their density functions. Then, building on the incremental $λ$-calculus, we develop a general technique for compositionally incrementalizing expressive functional programs, and apply it to these densities. The resulting incremental densities can be used to accelerate a broad range of Monte Carlo inference algorithms, including for nonparametric models not well supported by existing systems. Furthermore, our decomposition of incremental density computation into separate density and incrementalization steps allows for modular reasoning about correctness -- a key pain point in existing systems, where ad-hoc incrementalization features are a known source of soundness bugs. We develop denotational logical relations arguments for the correctness of each step independently, and implement the approach in a Julia prototype, finding that it leads to asymptotic runtime improvements in the size of the dataset on a range of models and inference algorithms.

2606.05335 2026-06-05 cs.LG stat.ML

A prism hierarchy of learning regimes in large linear autoencoders

大型线性自编码器中学习机制的三棱柱层次结构

Eugene Golikov, Yaroslav Gusev, Dmitry Yarotsky

AI总结 本文通过形式损失展开层次结构,将大型权重绑定线性自编码器的极端学习机制与三棱柱的面相关联,推导出五种基本极端机制下的训练和总体损失演化显式表达式。

详情
Comments
33 pages, under review for NeurIPS'2026
AI中文摘要

机器学习模型的理论研究通常考虑不同的极限机制,在这些机制下梯度下降的学习动态在理论上变得可处理。然而,对于特定类型的模型,系统地获得所有定性不同的极端学习机制的图景是可取的。在本文中,我们为大型权重绑定线性自编码器提出了这样一个图景,其特征由输入和潜在维度、初始化幅度以及训练集大小决定。该模型在权重上非线性,其梯度流没有一般的理论解。我们表明,在形式损失展开层次结构层面,其极端机制自然地与三棱柱的面相关联。特别地,存在与棱柱的2-面相关的五种基本极端机制:(1) 大数据,(2) 小数据,(3) 平均场,(4) 窄潜在,以及 (5) 自由。对于机制 (1,2,3,4),我们推导了梯度流下训练和总体极限损失演化的显式表达式,与实验结果非常吻合。

英文摘要

Theoretical studies of machine learning models commonly consider different limiting regimes in which the learning dynamics of gradient descent becomes theoretically tractable. It is, however, desirable to have a systematically obtained picture of all qualitatively different extreme learning regimes for a particular type of models. In this paper we propose such a picture for large weight-tied linear autoencoders characterized by input and latent dimensions, initialization magnitude, and training set size. This model is nonlinear in the weights and its gradient flow does not have a general theoretical solution. We show that at the level of the formal loss-expansion hierarchy, its extreme regimes are naturally associated with faces of a triangular prism. In particular, there are five basic extreme regimes associated with the 2-faces of the prism: (1) large-data, (2) small-data, (3) mean-field, (4) narrow-latent, and (5) free. For regimes (1,2,3,4), we derive explicit expressions for both train and population limiting loss evolutions under gradient flow, obtaining very good agreement with experimental results.

2606.05327 2026-06-05 cs.LG q-bio.QM stat.ML

Multimarginal flow matching with optimal transport potentials

基于最优传输势的多边缘流匹配

Raghav Kansal, David Crair, Nghia Nguyen, Scott Pope, Bradley Parry

AI总结 提出一种利用动态最优传输势引导流匹配学习中间边缘分布的方法,实现高效无模拟的多边缘流匹配,在单细胞RNA测序、海洋学和气象数据集上取得最优性能。

详情
Comments
9 pages, 3 figures, 4 tables, and a 27 page appendix. Accepted to the Forty-Third International Conference on Machine Learning
AI中文摘要

流匹配(FM)已成为学习两个经验分布之间动态传输映射的强大框架。然而,对于存在中间观测边缘分布的情况,这些边缘分布有助于约束端点之间的流,这方面的研究较少。这种“多边缘”设置对于许多科学领域中动态系统的时间演化建模至关重要,这些领域可以对序列分布进行采样。我们通过一种新颖的方法解决了这个问题,该方法利用了FM与动态最优传输(OT)之间的联系,通过动态OT作用中的势项将流柔和地引导向中间边缘分布。通过扩展条件FM学习目标以包含这些势,我们推导出一种高效、无模拟的多边缘FM算法,该算法在学习流的时空动力学方面提供了相当大的灵活性。我们在不同的单细胞RNA测序、海洋学和气象数据集上展示了OT势FM(OTP-FM)的最先进性能和训练效率。我们的代码可在https://github.com/Bexorg-Inc/OTP-FM获取。

英文摘要

Flow matching (FM) has emerged as a powerful framework for learning dynamic transport maps between two empirical distributions. However, less explored is the setting with intermediate observed marginals that can help constrain the flows between the endpoints. This "multimarginal" regime is central to modeling temporal evolution in dynamical systems in many scientific domains that can sample sequential distributions. We tackle this problem with a novel approach that leverages the connection between FM and dynamic optimal transport (OT), softly steering the flow towards the intermediate marginals through potential terms in the dynamic OT action. By extending the conditional FM learning target to incorporate these potentials, we derive an efficient, simulation-free algorithm for multimarginal FM that offers considerable flexibility in the spatiotemporal dynamics of the learned flows. We demonstrate state-of-the-art performance and training efficiency of OT-potential FM (OTP-FM) on diverse single-cell RNA sequencing, oceanographic, and meteorological datasets. Our code is available at https://github.com/Bexorg-Inc/OTP-FM.

2606.05324 2026-06-05 math.NA cs.NA math.PR stat.AP stat.CO stat.ME

Optimizing Irreversible Perturbations of the Unadjusted Langevin Algorithm

优化未调整Langevin算法的不可逆扰动

Qianyu Julie Zhu, Youssef Marzouk, Konstantinos Spiliopoulos, Benjamin Zhang

AI总结 本文针对未调整Langevin算法,提出一个同时考虑混合效率和离散化偏差的框架,并显式刻画了最优位置无关不可逆扰动,数值实验表明该设计在控制偏差的同时加速收敛。

详情
Comments
60 pages, 30 figures, 1 algorithm, 1 table
AI中文摘要

不可逆扰动加速了Langevin动力学的收敛,打破了细致平衡同时保持不变测度。在连续时间高斯设置中已经研究了最优不可逆扰动的设计,但扩展到非高斯目标分布以及时间离散化对最优扰动设计的影响尚未被充分理解。Langevin动力学的数值离散化引入了偏差,而不可逆扰动通常会加剧这种偏差;处理这种相互作用需要联合考虑加速和精度。本文开发了一个系统框架,用于优化未调整Langevin算法(ULA)的位置无关不可逆扰动。我们提出了一个约束优化问题,同时考虑混合效率和离散化偏差,其中前者通过谱间隙类比来表征,后者通过加权期望平方跳跃距离来量化。在该框架内,我们推导了最优位置无关不可逆扰动的显式刻画。广泛的数值实验表明,我们的设计在控制偏差的情况下实现了更快的收敛,并且与其他不可逆扰动选择相比,提高了均方估计误差。

英文摘要

Irreversible perturbations accelerate the convergence of Langevin dynamics, breaking detailed balance while preserving the invariant measure. The design of optimal irreversible perturbations has been studied in the continuous-time Gaussian setting, but extensions to non-Gaussian target distributions, and the impact of time discretization on the design of optimal perturbations, have not been well understood. Numerical discretizations of Langevin dynamics introduce bias, which is typically exacerbated by irreversible perturbations; handling this interaction demands a joint treatment of acceleration and accuracy. This paper develops a systematic framework for optimizing position-independent irreversible perturbations of the unadjusted Langevin algorithm (ULA). We formulate a constrained optimization problem that simultaneously accounts for mixing efficiency and discretization bias, where the former is characterized by a spectral gap analogue and the latter is quantified via a weighted expected squared jump distance. Within this framework, we derive an explicit characterization of the optimal position-independent irreversible perturbation. Extensive numerical experiments demonstrate that our design yields faster convergence with controlled bias, and improves mean squared estimation errors compared to other choices of irreversible perturbation.

2606.05317 2026-06-05 stat.ME

A Family of Quantile Functions Useful in Clinical Studies

临床研究中有用的一族分位数函数

Sankaran P. G., Prasanth V. P., Midhu N. N

AI总结 本文提出并研究了一类基于分位数的有效性持久函数,通过有理(Möbius)规范导出两参数非负分布族,并推导了其统计性质、L-矩和可靠性概念,通过最大似然估计参数,并用真实生存数据验证。

详情
AI中文摘要

受上尾分位数域摘要的启发,我们研究了基于分位数的有效性持久函数,定义为尾部均值与分位数函数之比。我们推导了该度量的统计性质,并考虑了基于分位数的有效性持久函数的有理(Möbius)规范。在自然边界条件下,该规范简化为规范形式。由此得到的规范族通过其分位数函数定义了一个两参数非负分布类。我们推导了该类的各种性质,包括描述性度量、L-矩和基于分位数的可靠性概念。还开发了使用最大似然估计模型参数的方法。通过一个真实的生存数据集说明了所提出的族。

英文摘要

Motivated by upper-tail quantile-domain summaries, we study the quantile-based effectiveness persistence function defined as the ratio between the tail mean and the quantile function. We derive statistical properties of this measure and consider a rational (Möbius) specification of the quantilebased effectiveness persistence function. Under natural boundary conditions, this specification reduces to a canonical form. The resulting canonical family defines a two-parameter class of nonnegative distributions through its quantile function. Various properties, including descriptive measures, L-moments, and quantile-based reliability concepts, are derived for this class. Estimation of the model parameters using maximum likelihood is also developed. The proposed family is illustrated using a real survival dataset.

2606.05308 2026-06-05 cs.LG cs.AI cs.CL cs.IR stat.AP

Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference

基于预测驱动推断的统计可靠LLM排序评估

Abhishek Divekar

AI总结 提出PRECISE框架,将预测驱动推断扩展到排序评估指标,通过结合少量人工标注和大量LLM判断实现无偏估计,并在ESCI基准和实际系统中验证了有效性。

详情
Comments
Accepted at ACL 2026 - GEM Workshop
AI中文摘要

通过PRECISE,我们将预测驱动推断扩展到排序评估指标,通过结合少量人工标注集和大量LLM判断集,产生偏差校正的估计。PPI无论LLM判断器的错误分布如何,都是可证明无偏的。我们通过将输出空间计算从O(2^|C|)减少到O(2^K),使其适用于像Precision@K这样的分层指标,其中标注是按文档的,但指标是按查询的。在ESCI基准上,用Claude 3 Sonnet判断增强30个人工标注,将Precision@4估计的标准误差从4.45降低到3.50(相对减少21%)。在一个生产系统中,我们的框架从100个人工标签和2小时的领域专家标注中正确识别了三个系统变体中最好的一个;A/B测试确认了这一排序,日销售额增加了407个基点。

英文摘要

With PRECISE, we extended Prediction-Powered Inference to produce bias-corrected estimates of ranking evaluation metrics by combining a small human-labeled set with a large LLM-judged set. PPI is provably unbiased regardless of the LLM judge's error profile. We make it applicable to hierarchical metrics like Precision@K, where annotations are per-document but the metric is per-query, by reducing the output-space computation from O(2^|C|) to O(2^K). On the ESCI benchmark, augmenting 30 human annotations with Claude 3 Sonnet judgments reduces the standard error of Precision@4 estimates from 4.45 to 3.50 (a 21% relative reduction). In a production system, our framework correctly identified the best of three system variants from 100 human labels and 2 hours of domain-expert annotation; A/B testing confirmed this ranking with +407 bps in daily sales.

2606.05266 2026-06-05 cs.LG cs.CC cs.DS math.CO math.PR math.ST stat.TH

Sharp Low-Degree Thresholds for Planted-vs-Planted Testing

植入vs植入测试的尖锐低度阈值

Anda Skeja, Daniel Gutiérrez Espinoza, Fiona Skerman, Alexander S. Wein

AI总结 针对植入vs植入设置,建立了低度多项式测试的首个尖锐阈值,并证明在植入子矩阵和植入稠密子图模型中计数社区的匹配上下界,测试阈值与已知低度恢复阈值精确一致。

详情
AI中文摘要

我们在植入vs植入设置中建立了低度多项式测试的首个尖锐阈值,其中目标是以渐近消失的错误率确定两个结构化植入机制中的哪一个生成了观测数据。我们证明了在植入子矩阵和植入稠密子图模型中计数社区的匹配低度上下界。所得的测试阈值与已知的低度恢复阈值精确一致。相比之下,弱测试(即目标优于随机猜测)没有尖锐阈值,而是存在一个我们识别的平滑过渡。为了证明我们的结果,我们开发了一个基于低度恢复中潜在变量展开的植入vs植入测试框架,并采用新方法来识别和修剪非信号贡献。

英文摘要

We establish the first sharp thresholds for low-degree polynomial tests in planted-vs-planted settings, where the goal is to determine with vanishing error which of two structured planted mechanisms generated the observed data. We prove matching low-degree upper and lower bounds for counting communities in the planted submatrix and planted dense subgraph models. The resulting testing threshold coincides, down to the sharp constant, with the known low-degree recovery threshold. In contrast, the task of weak testing, where the goal is to outperform random guessing, does not have a sharp threshold but rather a smooth transition, which we identify. To prove our results, we develop a framework for planted-vs-planted testing that builds on a latent-variable expansion originating in low-degree recovery and employs new methods to identify and prune non-signal contributions.

2606.05258 2026-06-05 stat.ML cs.LG stat.AP

Harnessing Source Heterogeneity for Cluster-Structured Transfer Learning

利用源异质性进行聚类结构迁移学习

Xiaohui Yin, Jun Jin, Shane J. Sacco, Robert H. Aseltine, Kun Chen

AI总结 针对迁移学习中源异质性问题,提出Trans-GLMC方法,通过聚类结构自适应融合目标与源数据,提升预测性能并识别可解释的源聚类。

详情
AI中文摘要

当目标群体数据有限但存在多个相关辅助源时,迁移学习是一种自然策略。一个核心难点是源异质性:辅助源可能并非同等有用,且其有用性可能以结构化的聚类方式变化。现有的迁移学习方法通常将源选择简化为二元的信息性/非信息性决策,忽略了具有不同可迁移性的源子组。受一项使用康涅狄格医院信息管理交换(CHIME)数据的自杀风险研究(涵盖27家医院的636,758名患者)的启发,我们提出了Trans-GLMC,一种针对广义线性模型的聚类结构迁移学习程序。CHIME设置说明了核心挑战:由于自杀尝试在任何单一设施中罕见,医院特定的风险模型不稳定,而不加区分地合并所有医院会模糊设施层面在患者构成和风险特征上的差异。Trans-GLMC首先在目标和候选源之间构建基于系数的距离,以恢复潜在源聚类。然后,它结合全局融合、聚类内细化和目标去偏,产生一个适应检测到的结构的估计量。我们建立了一个非渐近误差界,当存在有意义的目标聚类时,该误差界优于其非聚类对应物,否则在常数范围内匹配非聚类速率。在模拟和CHIME研究中,Trans-GLMC改进了设施特定的预测,识别了具有相互可迁移性的可解释医院社区,并恢复了临床一致的自杀风险因素。

英文摘要

Transfer learning is a natural strategy when a target population has limited data but multiple related auxiliary sources are available. A central difficulty is source heterogeneity: auxiliary sources may not be equally useful, and their usefulness may vary in a structured, cluster-like fashion. Existing transfer-learning methods often reduce source selection to a binary informative/non-informative decision, overlooking subgroups of sources with differential transferability. Motivated by a suicide-risk study using data from the Connecticut Hospital Information Management Exchange (CHIME), comprising 636,758 patients across 27 hospitals, we propose Trans-GLMC, a cluster-structured transfer-learning procedure for generalized linear models. The CHIME setting illustrates the core challenge: hospital-specific risk models are unstable because suicide attempts are rare at any single facility, whereas indiscriminate pooling across hospitals can obscure facility-level differences in patient mix and risk profiles. Trans-GLMC first constructs a coefficient-based distance among the target and candidate sources to recover latent source clusters. It then combines global fusion, within-cluster refinement, and target debiasing to produce an estimator that adapts to the detected structure. We establish a non-asymptotic error bound that improves over its unclustered counterpart whenever a meaningful target cluster exists and matches the unclustered rate up to constants otherwise. In simulations and in the CHIME study, Trans-GLMC improves facility-specific prediction, identifies interpretable communities of hospitals with mutual transferability, and recovers clinically coherent suicide-risk factors.

2606.05247 2026-06-05 cs.LG stat.ML

DiffSlack: Learning under Nonlinear Inequality Constraints via Learnable Slack Variables

DiffSlack: 通过可学习松弛变量在非线性不等式约束下学习

Ziqian Wang, Chenxi Fang, Zhen Zhang

AI总结 提出DiffSlack,一种可微投影层,通过可学习松弛变量将非线性不等式约束转化为等式,结合阻尼高斯-牛顿投影实现端到端约束满足,在车辆路径规划中取得更高成功率和几何约束满足度。

详情
AI中文摘要

在神经网络中强制执行非线性不等式约束仍然具有挑战性,尤其是当输出受到许多耦合约束时。现有的硬约束方法通常对约束集施加结构限制,或者为大规模非线性问题引入大量计算开销。在此,我们提出DiffSlack,一种用于非线性不等式约束神经预测的可微投影层。DiffSlack将不等式重新表述为带有可学习松弛变量的等式,这些松弛变量作为增强网络输出的一部分被预测,并为阻尼高斯-牛顿投影提供数据驱动的热启动。投影层将原始预测映射到增强可行流形上,同时保持端到端可微性。两阶段课程进一步稳定训练并改善约束满足。我们在具有200个来自碰撞避免、曲率限制和航点间距的非线性不等式约束的车辆路径规划上评估DiffSlack。与现有的基于学习的基线相比,DiffSlack在相当的推理预算下实现了更高的规划成功率和更强的几何约束满足。消融研究进一步表明,硬投影层降低了对监督质量的敏感性。CARLA中的闭环跟踪和真实车辆实验证实了生成轨迹的可执行性。这些结果表明,DiffSlack为工程应用中将硬不等式约束嵌入神经网络提供了一种实用且可扩展的方法。

英文摘要

Enforcing nonlinear inequality constraints in neural networks remains challenging, especially when the output is subject to many coupled constraints. Existing hard constraint methods often impose structural restrictions on the constraint set or introduce substantial computational overhead for large-scale nonlinear problems. Here, we propose DiffSlack, a differentiable projection layer for nonlinear inequality-constrained neural prediction. DiffSlack reformulates inequalities as equalities with learnable slack variables, which are predicted as part of the augmented network output and provide a data-driven warm start for damped Gauss-Newton projection. The projection layer maps raw predictions onto the augmented feasible manifold while preserving end-to-end differentiability. A two-stage curriculum further stabilizes training and improves constraint satisfaction. We evaluate DiffSlack on vehicle path planning with 200 nonlinear inequality constraints from collision avoidance, curvature limits, and waypoint spacing. Compared with existing learning-based baselines, DiffSlack achieves a higher planning success rate and stronger geometric constraint satisfaction under a comparable inference budget. Ablation studies further show that the hard projection layer reduces sensitivity to supervision quality. Closed-loop tracking in CARLA and real-world vehicle experiments confirms the executability of the generated trajectories. These results demonstrate that DiffSlack provides a practical and scalable approach to embedding hard inequality constraints into neural networks for engineering applications.

2606.05242 2026-06-05 stat.ML cs.LG cs.NA math.NA math.PR

Deterministic Envelopes for Tamed SGLD: Decoupling Stochastic-Gradient Noise and Localizing Taming

驯化SGLD的确定性包络:解耦随机梯度噪声与局部化驯化

Yiwei Zhou, Ziheng Chen

AI总结 针对随机梯度Langevin算法中驯化分母引入的稳态偏差问题,提出一种结构保持的确定性包络框架,通过解耦梯度噪声与驯化步骤来消除偏差,并引入混合包络设计以兼顾稳定性和偏差减少。

详情
Comments
40 pages, 11 tables, 2 figures
AI中文摘要

随机梯度Langevin算法常使用驯化分母来稳定非全局Lipschitz漂移。本文表明,当分母与分子依赖于相同的随机梯度实现时,驯化步骤会改变随机预言本身,即使原始随机梯度无偏,也可能产生稳态偏差。我们提出了一种结构保持的框架来设计驯化分母。它在采样预言噪声之前固定分母,并使用局部确定性包络来避免典型区域中的不必要驯化。这些核保留了驯化的稳定效果,同时避免了由梯度依赖分母引入的偏差。我们的理论解释了稳态误差如何分解为预言依赖驯化引起的偏差和确定性稳定引入的剩余误差。在这个确定性包络族中,分析识别出一个远尾条件,解释了局部软包络的局限性,并激发了一个混合成员:在典型区域使用软包络,但在罕见游荡时通过硬尾控制提供保护。实验证实了随机分母的预测稳态失真、确定性包络设计的偏差减少以及混合结构的稳定效果。

英文摘要

Stochastic-gradient Langevin algorithms often use tamed denominators to stabilize non-globally Lipschitz drifts. This paper shows that when the denominator depends on the same stochastic-gradient realization as the numerator, the taming step changes the stochastic oracle itself and can create a stationary bias even if the original stochastic gradient is unbiased. We propose a structure-preserving framework for designing tamed denominators. It fixes the denominator before the oracle noise is sampled and uses localized deterministic envelopes to avoid unnecessary taming in typical regions. These kernels keep the stabilizing effect of taming while avoiding the bias introduced by a gradient-dependent denominator. Our theory explains how the stationary error splits into the bias caused by oracle-dependent taming and the remaining error introduced by deterministic stabilization. Within this deterministic-envelope family, the analysis identifies a far-tail condition that explains the limitation of local soft envelopes and motivates a hybrid member: soft in the typical region, but protected by hard-tail control on rare excursions. Experiments confirm the predicted stationary distortions of random denominators, the bias reduction of deterministic-envelope designs, and the stabilizing effect of the hybrid construction.

2606.05239 2026-06-05 stat.ML cs.LG

HyFAD: Hybrid Time-Frequency Diffusion with Frequency-Aware Embedding for Time Series Imputation

HyFAD: 用于时间序列插值的混合时频扩散与频率感知嵌入

Hongfan Gao, Wangmeng Shen, Bin Yang, Jilin Hu

AI总结 提出HyFAD模型,通过耦合时频扩散框架和频率感知步嵌入,实现从时域到频域的渐进式去噪,有效解决频率敏感去噪和高频重建问题,在多个基准数据集上达到最先进性能。

详情
AI中文摘要

扩散模型通过迭代去噪逐步捕捉复杂数据分布,在时间序列建模中表现出强大性能。然而,现有方法在处理频率敏感去噪、高频重建以及平衡全局趋势与局部动态方面存在困难。为解决这些限制,我们提出 extbf{HyFAD},一种用于时间序列插值的 extbf{混合}时频 extbf{扩散}模型,带有 extbf{频率感知}嵌入。基于DDPM范式,HyFAD采用耦合的时频扩散框架,其中反向去噪从时域到频域顺序进行,实现从粗到细的生成。具体地,时域扩散过程捕捉低频全局趋势,而频域扩散过程细化高频频谱分量。我们进一步引入频率感知步嵌入,利用扩散步与频谱分量之间的关系,提供步依赖的频谱引导,促进更准确的频带重建。在多个基准数据集上的大量实验表明,HyFAD达到了最先进的性能。我们的源代码可在https://github.com/hongfangao/HyFAD获取。

英文摘要

Diffusion models have demonstrated strong performance in time series modeling due to their ability to progressively capture complex data distributions through iterative denoising. However, existing approaches struggle with frequency-sensitive denoising, high-frequency reconstruction and balancing global trends with local dynamics. To address these limitations, we propose \textbf{HyFAD}, a \textbf{Hy}brid time-frequency \textbf{D}iffusion model with \textbf{F}requency-\textbf{A}ware embedding for time series imputation. Built upon the DDPM paradigm, HyFAD adopts a coupled time-frequency diffusion framework, in which the reverse denoising proceeds sequentially from the time domain to the frequency domain, enabling coarse-to-fine generation. Specifically, the time-domain diffusion process captures low-frequency global trends, while the frequency-domain diffusion process refines high-frequency spectral components. We further introduce a frequency-aware step embedding that exploits the relationship between diffusion steps and spectral components, providing step-dependent spectral guidance and facilitates more accurate band-wise reconstruction. Extensive experiments on multiple benchmark datasets demonstrate that HyFAD achieves state-of-the-art performance. Our source code is available at https://github.com/hongfangao/HyFAD.

2606.05230 2026-06-05 stat.ML cs.LG eess.SP

Central Description Length (CDL) Clustering Validation Index

中心描述长度(CDL)聚类验证指标

Mahdi Shamsi, Soosan Beheshti

AI总结 提出中心描述长度(CDL)聚类验证指标,通过计算不可观测真实聚类中心描述长度的概率上界来评估聚类质量,无需标签且适用于非凸和不规则形状数据。

详情
AI中文摘要

在工程机器学习管道中,无标签情况下选择聚类算法及其超参数是一个常见难题,这些管道通常用于传感器、图像或过程数据的无监督分析。聚类验证指标(CVI)提供内部评分来对候选聚类进行排序,但大多数流行的CVI基于欧几里得紧致性和分离项构建,因此倾向于紧凑的凸分区。已知它们在非凸、不规则或变密度数据上的性能会下降,通常需要使用核变换或替代距离度量,但代价是额外的调优和计算。本文介绍了中心描述长度(CDL)聚类验证指标。CDL利用观测到的簇内紧致性、估计的聚类中心和估计的聚类协方差,计算与不可观测的真实聚类中心相关的描述长度的概率上界。该界限将簇内紧致性和质心位移压缩为一个可计算的量,并在任何聚类算法产生的分区上进行评估。实现仅使用可观测的量(数据、分区、估计中心和估计协方差),不使用真实标签。在具有非凸和任意形状簇的合成基准测试中,CDL-CVI比我们测试的传统CVI更频繁地选择参考聚类数,并达到更高的调整兰德指数(ARI)值,且无需额外的核预处理阶段。在从冻结的无监督嵌入聚类的图像基准测试(MNIST、CIFAR-10、STL-10)中,CDL-CVI在报告的试验中,针对K-means、DBSCAN和谱聚类返回的聚类数接近参考类别数。

英文摘要

Selecting a clustering algorithm and its hyperparameters without labels is a common difficulty in engineering machine learning pipelines that work with unsupervised analysis of sensor, image, or process data. Clustering validation indices (CVIs) provide internal scores for ranking candidate clusterings, but most popular CVIs are built from Euclidean compactness and separation terms and so tend to favour compact, convex partitions. Their performance is known to degrade on non convex, irregular, or variable density data, where kernel transformations or alternative distance measures are typically used at the cost of additional tuning and computation. This paper introduces the Central Description Length (CDL) clustering validation index. CDL uses the observed within cluster compactness, the estimated cluster centers, and the estimated cluster covariances to compute a probabilistic upper bound on the description length associated with the unobservable true cluster centers. The bound condenses intra cluster compactness and centroid displacement into a single computable quantity and is evaluated on the partition produced by any clustering algorithm. The implementation uses only observable quantities (the data, the partition, the estimated centers, and the estimated covariances) and does not use ground truth labels. On synthetic benchmarks with non convex and arbitrary shape clusters, CDL-CVI selected the reference number of clusters more often and reached higher Adjusted Rand Index (ARI) values than the conventional CVIs we tested, without an additional kernel preprocessing stage. On image benchmarks (MNIST, CIFAR-10, STL-10) clustered from frozen unsupervised embeddings, CDL-CVI returned cluster numbers close to the reference class counts across K-means, DBSCAN, and spectral clustering in the reported trials.

2606.05206 2026-06-05 q-bio.NC cs.AI stat.AP

Ontology-constrained multi-LLM scoring of hypothesis support in the predictive processing literature

本体约束的多LLM评分在预测处理文献中假设支持度的应用

Hamed Nejat, Alexander Maier, Jesse Spencer-Smith, André M. Bastos

AI总结 本文提出一个本地多LLM流水线,通过本体约束对预测编码文献中的研究进行评分,将异构文献映射到定量证据空间,并揭示假设间的结构化分歧。

详情
Comments
33 pages, 5 tables and 9 figures
AI中文摘要

跨学科领域由于方法多样和理论承诺不同,常常存在碎片化问题。预测编码神经科学是一个典型例子:其文献涵盖计算理论、电生理学、影像学、行为学和建模,造成了传统荟萃分析难以解决的综合问题。本文描述了一个用于本体约束文献综合的本地多LLM流水线。该流水线读取论文、提取证据、整合图表描述、组装约束提示,并根据专家词汇表验证输出。我们手动定义了一个预测编码词汇表,包含36个概念,分为三个假设:预测抑制、前向误差传播和普遍性。由十个本地语言模型组成的委员会根据每个词汇因子在局部和全局oddball情境下的一致性或不一致性,对31项研究进行评分。这使得可以进行成对研究一致性分析、跨模型比较和三维假设空间映射。某些假设的一致性较高,而其他假设则较弱,揭示了结构化分歧,特别是在局部与全局oddball范式之间。我们进一步定义了假设空间温度,这是一种几何离散度度量,用于衡量研究在假设空间中的紧凑程度。局部oddball情境的温度较低,而全局oddball情境的温度较高,表明后者离散度更大。评分几何还允许我们估计实验情境之间的变化向量。这些结果表明,本地多LLM委员会可以产生可审计的不一致性测量,将异构文献映射到定量证据空间。该框架可能推广到传统荟萃分析缺乏共同比较空间的跨研究假设映射。

英文摘要

Fragmentation is common in interdisciplinary fields with diverse methods and theoretical commitments. Predictive coding neuroscience is a clear example: its literature spans computational theory, electrophysiology, imaging, behavior, and modeling, creating a synthesis problem that conventional meta-analysis cannot easily resolve. Here, we describe a local multi-LLM pipeline for ontology-constrained literature synthesis. The pipeline reads papers, extracts evidence, incorporates figure descriptions, assembles constrained prompts, and validates outputs against an expert glossary. We manually defined a predictive-coding glossary of thirty-six concepts grouped into three hypotheses: predictive suppression, feedforward error propagation, and ubiquity. A council of ten local language models scored 31 studies according to their agreement or disagreement with each glossary factor across local and global oddball contexts. This enabled pairwise study-agreement analysis, cross-model comparison, and three-dimensional hypothesis-space mapping. Agreement was high for some hypotheses but weaker for others, revealing structured disagreement, particularly across local versus global oddball paradigms. We further define hypothesis-space temperature, a geometric dispersion metric measuring how compactly studies occupy the hypothesis space. Temperature was lower for local oddball contexts and higher for global oddball contexts, indicating greater dispersion in the latter. The scoring geometry also allowed us to estimate vectors of change between experimental contexts. These results demonstrate that local multi-LLM councils can produce auditable disagreement measurements that map heterogeneous literatures into quantitative evidence spaces. This framework may generalize to cross-study hypothesis mapping where conventional meta-analysis lacks a common comparison space.

2606.05120 2026-06-05 stat.ME

Stochastic Sensitivity Analysis for Matched Observational Studies

匹配观察性研究的随机敏感性分析

Mengqi Lin, Colin B. Fogarty, Gongjun Xu

AI总结 针对匹配观察性研究,提出一种随机敏感性分析方法,通过将未测量混杂因素视为随机变量并寻找最坏条件分布,在允许隐藏偏差与潜在结果不完全对齐的情况下评估研究对未测量混杂的稳健性。

详情
AI中文摘要

敏感性分析探讨需要多强的未测量混杂才能解释观察性研究的结论。匹配研究中的传统方法在给定潜在结果以及观测和未观测混杂因素的条件下进行推断,然后针对未观测混杂因素的所有可能实现,找到条件处理分配的最坏情况分布。由此产生的最坏情况分配设想潜在结果与隐藏偏差之间存在强、近乎完美的相关性。我们提出一种随机敏感性分析,其目标是在给定潜在结果和观测混杂因素的条件下进行推断,同时将隐藏混杂因素视为具有未知条件分布的随机变量。我们不是寻找隐藏混杂因素的最坏情况实现,而是确定一个广泛分布类别上的最坏条件分布。这保留了敏感性分析的对抗性精神,同时允许隐藏偏差与潜在结果之间存在不完全对齐,其程度由标量敏感性参数控制。我们考虑对无参数假设的可解释类别和伯努利条件分布类别施加限制。设计敏感性计算和真实数据演示表明,与传统方法相比,即使允许很小程度的随机性,也能显著提高报告的对隐藏偏差的稳健性。

英文摘要

Sensitivity analysis asks how strong unmeasured confounding needs to be to explain away an observational study's conclusion. The conventional approach in matched studies conducts inference conditional upon the potential outcomes as well as both observed and unobserved confounders, and then finds the worst-case distribution for the conditional treatment assignments across all possible realizations of the unobserved confounder. The resulting worst-case allocation imagines strong, near perfect, correlations between the potential outcomes and hidden bias. We propose a stochastic sensitivity analysis that instead targets inference conditional upon potential outcomes and observed confounders while treating the hidden confounders as random with unknown conditional laws. Rather than finding the worst-case realizations for the hidden confounders, we instead determine the worst-case conditional law over a broad class of distributions. This preserves the adversarial spirit of sensitivity analysis while allowing for imperfect alignment between hidden bias and potential outcomes to a degree controlled by a scalar sensitivity parameter. We consider restrictions to both an interpretable class with no parametric assumptions and a Bernoulli class of conditional laws. Design sensitivity calculations and real-data demonstrations illustrate that allowing for even a small degree of stochasticity can materially increase reported robustness to hidden bias relative to the conventional approach.

2606.03067 2026-06-05 stat.ML cs.LG

Trajectory-Aware Node Contributions and the Limits of Static Controllability

轨迹感知的节点贡献与静态可控性的极限

Valentina Kuskova, Dmitry Zaytsev, Michael Coppedge

AI总结 本文提出“涌现贡献”(EC)作为节点动态杠杆的有限时域度量,通过可微模型的雅可比矩阵计算,在线性时不变极限下退化为平均可控性,并构建相图刻画两者一致与分歧的条件。

详情
Comments
11 pages, 1 figure
AI中文摘要

复杂网络中的一个常见数据挖掘任务是确定单个节点如何影响系统行为。现有方法依赖于静态图中心性或控制理论量(如可控性格拉姆矩阵),这些方法假设线性时不变动力学。然而,实际估计的系统通常是非线性和时变的。我们定义了“涌现贡献(EC)”,这是一种节点动态杠杆的有限时域度量:其脉冲响应的度量加权能量沿系统轨迹累积。EC 通过任何可微模型的雅可比矩阵计算,与估计器无关,并在线性时不变极限下精确地退化为平均可控性。我们的贡献是刻画了这两种度量一致与分歧的条件。使用一个具有已知真实贡献的受控合成族,我们构建了一个跨越非线性、机制结构、持续性和扰动幅度的相图。EC 和平均可控性在静态或平滑漂移动力学下一致,并且两者都跟踪真实值。分歧在持续机制切换下出现,在持续符号反转下最强,并在移除符号反转时消失。在极端扰动幅度下,两种度量都会退化,这揭示了局部线性化的极限。我们将来自多个领域的五个估计真实系统置于该相空间中。它们的位置可作为 EC 何时提供超出静态可控性信息的诊断,从而证明其额外计算成本的合理性。在一个深入检查的面板上,一个二十种子重训练集成揭示了稳健的方差-杠杆分离:节点的扰动广泛传播,尽管其系统内方差较低,这既未被静态中心性恢复,也未被基于方差的摘要恢复。

英文摘要

A recurring data mining task in complex networks is to determine how individual nodes contribute to system behavior. Existing approaches rely on either static-graph centralities or control-theoretic quantities such as controllability Gramians, which assume linear, time-invariant dynamics. Estimated systems, however, are typically nonlinear and time-varying. We define "emergent contribution (EC)," a finite-horizon measure of a node's dynamical leverage: the metric-weighted energy of its impulse response accumulated along the system trajectory. Computed from the Jacobians of any differentiable model, EC is estimator-agnostic and reduces exactly to average controllability in the linear, time-invariant limit. Our contribution is a characterization of when the two measures agree and diverge. Using a controlled synthetic family with known ground-truth contribution, we construct a phase diagram spanning nonlinearity, regime structure, persistence, and perturbation amplitude. EC and average controllability agree under static or smoothly drifting dynamics and both track ground truth. Divergence emerges under persistent regime switching, is strongest under persistent sign reversal, and disappears when the sign reversal is removed. At extreme perturbation amplitudes, both measures degrade, identifying the limits of local linearization. We place five estimated real systems from several domains within this phase space. Their placement serves as a diagnostic of when EC provides information beyond static controllability and therefore justifies its additional computational cost. On one panel examined in depth, a twenty-seed retraining ensemble reveals a robust variance--leverage dissociation: nodes whose perturbations propagate widely despite low within-system variance, which is not recovered by static centralities nor variance-based summaries.

2606.02772 2026-06-05 stat.OT

Closing the Gap: Can Novice Statistics and Data Science Students Collaborate as Effectively as an Expert?

缩小差距:统计学和数据科学新手学生能否像专家一样有效协作?

Jessica L. Alzen, Ilana M. Trumble, Kimberly J. Cho, Eric A. Vance

AI总结 本研究评估了新手学生在与领域专家初次协作会议中的表现,并与专家进行对比,发现新手在ASCCR框架的态度、结构和关系维度上表现接近专家,并能通过项目缩小内容与沟通方面的差距,最终获得更高的专家反馈评分。

详情
Comments
62 pages, 6 figures, revised and resubmitted to Journal of Statistics and Data Science Education
AI中文摘要

ASCCR(态度-结构-内容-沟通-关系)框架最近被开发用于教授统计学家和数据科学家的协作技能。然而,其在现实环境中的有效性尚未得到系统评估。为此,我们评估了新手本科生和研究生在与真实领域专家的初次协作会议中的表现,并将其与一位专家协作者进行比较。通过视频记录、评分量表和领域专家反馈调查,我们发现新手与专家相比表现得出奇地好。具体而言,新手在ASCCR框架的态度、结构和关系组件上得分几乎与专家一样高。尽管新手在内容或沟通方面最初表现不佳,但他们能够缩小差距。到协作项目结束时,新手的总体领域专家反馈分数高于专家。我们研究的主要启示是,新手可以在非常短的时间内成为有效的协作者。我们讨论了研究结果的实践意义,并提出了将ASCCR框架整合到统计学和数据科学协作、咨询及顶点课程中的建议。

英文摘要

The ASCCR (Attitude-Structure-Content-Communication-Relationship) framework was recently developed to teach collaboration skills to statisticians and data scientists. However, its effectiveness in real-world settings has not yet been systematically evaluated. To assess this, we evaluated novice undergraduate and graduate students' performances in initial collaboration meetings with real domain experts and compared them to an expert collaborator. Using video recordings, rubric scores, and domain expert feedback surveys, we found that novices performed surprisingly well compared to the expert. Specifically, novices scored nearly as well as the expert on the Attitude, Structure, and Relationship components of the ASCCR framework. Although novices did not initially perform as well on the Content or Communication aspects, they were able to close the gap. By the end of the collaboration projects, the novices had higher overall domain expert feedback scores than the expert. The primary implication of our study is that novices can become effective collaborators in a very short time. We discuss our findings' practical implications and provide recommendations for integrating the ASCCR framework into statistics and data science collaboration, consulting, and capstone courses.

2605.31278 2026-06-05 cs.AI cs.LG stat.ME

Industrializing Prediction-Powered Inference: The GLIDE Library for Reliable GenAI and Agentic Systems Evaluation

工业化预测驱动推断:用于可靠生成式AI与智能体系统评估的GLIDE库

Grégoire Martinon, Ibrahim Merad, Mohammed Raki

AI总结 提出GLIDE开源库,统一多种预测驱动推断方法,提供无偏估计与有效置信区间,显著降低人工标注成本。

详情
Comments
8 pages, Accepted to the ICML 2026 Workshop on Statistical Frameworks for Uncertainty in Agentic Systems, Seoul, South Korea, 2026
AI中文摘要

智能体系统的可靠评估需要具有有效不确定性的无偏估计,但标准实践在昂贵的人工标注和有偏的LLM-as-judge代理之间权衡。预测驱动推断(PPI)将两者结合为具有有效置信区间的去偏估计,然而其各种方法仍分散在不同论文的部分实现中。我们介绍GLIDE,一个开源Python库,它在专用于均值估计的scipy风格API下统一了最先进的PPI估计器(PPI++、分层PPI、先预测后去偏及其分层变体、主动统计推断)和采样器(均匀、分层、主动、成本最优)。GLIDE附带一个可复现的蒙特卡洛验证套件、一个基于经验的决策树用于方法选择,以及一个智能体评估案例研究,显示在同等精度下显著节省标注成本。GLIDE包可通过此URL获取:https://github.com/EmertonData/glide

英文摘要

Reliable evaluation of agentic systems requires unbiased estimates with valid uncertainty, but standard practice navigates between costly human annotation and biased LLM-as-judge proxies. Prediction-powered inference (PPI) combines both into debiased estimates with valid confidence intervals, yet its various methods remain scattered across papers under partial implementations. We introduce GLIDE, an open-source Python library that unifies state-of-the-art PPI estimators (PPI++, Stratified PPI, Predict-Then-Debias and its stratified variants, Active Statistical Inference) and samplers (uniform, stratified, active, cost-optimal) under a scipy-style API specialized to mean estimation. GLIDE ships with a reproducible Monte Carlo validation suite, an empirically grounded decision tree for method selection, and an agentic evaluation case study showing substantial annotation savings at equivalent precision. The GLIDE package is available at this URL: https://github.com/EmertonData/glide

2605.27991 2026-06-05 stat.ML cs.LG

Gradient-Flow Optimization as Dynamic Random-Effects Inference: Testing and Early Stopping with Applications to Deep Learning

深度神经网络训练作为随机效应:优化-推断对偶性

Minhao Yao, Ruoyu Wang, Xihong Lin, Lin Liu, Zhonghua Liu

AI总结 本文提出深度神经网络训练与经典随机效应模型等价,揭示了优化-推断对偶性,并利用限制最大似然估计实现基于似然的早停规则。

详情
AI中文摘要

深度神经网络(DNN)取得了显著的实证成功,但其训练动态主要从优化而非统计原理的角度被理解。本文通过证明连续时间神经正切核(NTK)梯度流产生的预测与经典随机效应模型的预测完全等价,为过参数化机制下的DNN训练建立了一个统计框架。在该框架中,训练时间充当方差分量,或等价地作为经验贝叶斯协方差超参数,控制噪声到结构化信号的变异分配。这种等价性揭示了一种优化-推断对偶性:梯度流路径既是优化轨迹,也是经验贝叶斯随机效应推断路径。以训练时间为条件,网络输出是潜在信号的后验均值,通过限制最大似然估计(REML)估计训练时间,将早停转化为基于似然的经验贝叶斯推断,而非外部调参。这一视角产生了一个两阶段推断程序。首先,方差分量检验确定DNN训练是否捕捉到初始化之外的统计显著结构。其次,以训练合理为条件,REML提供基于似然的早停规则。由此产生的停止时间在NTK特征基下具有谱解释,其中训练持续到谱损失去相关实现。我们进一步证明,对于固定设计下的样本内预测,REML引导的早停实现了渐近最优预测误差,并且在额外的随机设计正则条件下,对于样本外预测也成立。这项工作将DNN训练重新定义为统计推断,并为决定是否以及训练深度神经网络多长时间提供了原则性基础。

英文摘要

Gradient-flow optimization is usually viewed as an algorithmic procedure for minimizing empirical loss, with training duration selected by validation or heuristic early-stopping rules. We develop a statistical inference framework for the gradient-flow training trajectory itself. The central object is fixed-operator squared-error gradient flow: whenever the fitted value evolves through a time-invariant positive semidefinite training operator, the trained model output at each training time is exactly equivalent to the best linear unbiased predictor, or empirical-Bayes posterior mean, under a corresponding random-effects model. Under this representation, training time becomes a variance-component parameter governing how variance is reallocated from residual noise to structured signal. This turns two basic training decisions into inferential problems. First, whether training is needed is formulated as a variance-component test for signal beyond initialization. Second, how long to train is formulated as restricted maximum likelihood (REML) estimation of the training-time variance component. The resulting REML-guided early stopping rule has a spectral interpretation: it selects the training time at which optimized spectral losses become empirically decorrelated from the eigenvalues of the training operator, yielding an effective degrees-of-freedom measure for the evolving trained model. We establish asymptotic prediction optimality for fixed-design in-sample risk and, under additional kernel regularity conditions, random-design out-of-sample risk. Deep learning models in fixed-kernel gradient regimes provide canonical modern-AI instantiations of the theory. Numerical experiments and a UK Biobank proteomics application show that the proposed inferential approach attains competitive prediction accuracy while reducing the reliance on validation splits and repeated checkpoint evaluation.

2605.27292 2026-06-05 cs.LG stat.ML

Detectability in Diversity: Improved Canary Crafting for Privacy Auditing in One Run

多样性中的可检测性:单次运行中用于隐私审计的改进金丝雀构造

Mathieu Dagréou, Aurélien Bellet

AI总结 针对单次运行隐私审计中金丝雀点相互干扰导致隐私泄露估计偏弱的问题,提出结合影响函数贪婪初始化与双层优化的方法,最大化金丝雀可检测性并促进嵌入空间多样性,以较低计算成本获得更强的隐私泄露估计。

详情
AI中文摘要

隐私审计旨在利用成员推断攻击(MIAs)实证评估机器学习模型中的隐私泄露,并推导差分隐私(DP)参数的下界。最近的单次运行审计方法通过依赖单个训练运行和多个“金丝雀”点来降低标准方法的高成本,审计者需要检测这些点是否被包含或排除。在这项工作中,我们研究了为单次运行隐私审计高效构造金丝雀的问题。受最近理论见解的启发,即金丝雀之间的干扰导致与多次运行方法相比更弱的泄露估计,我们提出优化金丝雀使其既高度可检测又最小化干扰。我们的方法结合了基于影响函数的贪婪初始化与双层优化过程,该过程最大化可区分性同时促进嵌入空间中的多样性,从而能够使用计算高效的双层算法。实验表明,与现有的金丝雀构造方法相比,我们的方法以更低的计算成本实现了更强的隐私泄露估计。

英文摘要

Privacy auditing aims to empirically assess privacy leakage in machine learning models using membership inference attacks (MIAs), and to derive lower bounds on differential privacy (DP) parameters. Recent one-run auditing methods address the high cost of standard approaches by relying on a single training run with multiple "canary" points whose inclusion or exclusion must be detected by the auditor. In this work, we study the problem of efficiently crafting canaries for one-run privacy auditing. Motivated by recent theoretical insights suggesting that interference between canaries contributes to weaker leakage estimates compared to multi-run methods, we propose to optimize canaries to be both highly detectable and minimally interfering. Our approach combines a greedy initialization based on influence functions with a bilevel optimization procedure that maximizes distinguishability while promoting diversity in embedding space, enabling the use of computationally efficient bilevel algorithms. Experiments show that our method achieves stronger privacy leakage estimates at a lower computational cost than existing canary crafting approaches.

2605.30278 2026-06-05 stat.CO

modelimportance: An R package for evaluating model importance within a multi-model ensemble

modelimportance: 一个用于评估多模型集合中模型重要性的R包

Minsu Kim, Li Shandross, Evan L. Ray, Nicholas G. Reich

AI总结 本文介绍R包modelimportance,通过多种集成方法和重要性指标量化每个组成模型对点预测和概率预测集合性能的贡献,支持缺失值处理,遵循hubverse框架。

详情
AI中文摘要

集合预报通常用于支持各个领域的决策和政策规划,因为它们相比单个模型通常能提供更高的准确性和稳定性。由于每个模型都有其独特的特性,理解和衡量每个组成模型的价值可以支持构建有效的集合。R包modelimportance提供了量化每个组成模型如何对点预测和概率预测的集合性能准确性做出贡献的工具。该包支持多种集成方法和多种模型重要性指标。此外,该软件提供了处理缺失值的可定制选项。这些特性使得该包能够作为研究人员和实践者的多功能工具。它不仅有助于在广泛的预测任务中构建有效的集成模型,还有助于理解每个模型在集合中的作用,并深入了解单个模型本身。该包遵循'hubverse'框架,这是一个开源软件、工具和数据标准的集合,旨在促进协作建模中心工作并简化其设置和操作。这样做可以实现与其他预测工具和系统的无缝集成和灵活性,允许在现有中心上进行许多分析。

英文摘要

Ensemble forecasts are commonly used to support decision-making and policy planning across various fields because they often offer improved accuracy and stability compared to individual models. As each model has its own unique characteristics, understanding and measuring the value of each constituent model can support the construction of effective ensembles. The R package modelimportance provides tools to quantify how each component model contributes to the accuracy of ensemble performance for both point and probabilistic forecasts. The package supports multiple ensemble methods and multiple model importance metrics. Additionally, the software offers customizable options for handling missing values. These features enable the package to serve as a versatile tool for researchers and practitioners. It helps not only in constructing an effective ensemble model across a wide range of forecasting tasks, but also in understanding the role of each model within the ensemble and gaining insights into individual models themselves. This package follows the 'hubverse' framework, which is a collection of open-source software, tools and data standards developed to promote collaborative modeling hub efforts and simplify their setup and operation. Doing so enables seamless integration and flexibility with other forecasting tools and systems, allowing many analyses to be performed on existing hubs.

2605.29972 2026-06-05 stat.ME math.ST stat.TH

Identification-Robust Testing in Endogenous Functional Linear Regression with Weak or Irrelevant Auxiliary Variables

内生函数线性回归中弱或无关辅助变量下的识别稳健检验

Won-Ki Seo

AI总结 针对函数线性回归中斜率函数的检验问题,提出基于辅助变量函数矩条件的无降维检验方法,在弱甚至无关辅助变量下仍保持渐近有效性,并建立了渐近分布理论。

详情
AI中文摘要

我们开发了函数线性回归中斜率函数的无降维检验方法,当函数回归变量可能存在内生性或测量误差时。这些检验基于由辅助函数变量诱导的函数矩条件,且不需要估计斜率函数。这一特性在无穷维设定中尤其有用,因为一致估计所需的识别和正则化条件通常较强且难以验证。所提出的程序在辅助变量弱相关甚至完全无关的情况下仍保持渐近有效,并且对通过矩算子可检测的固定备择假设具有一致性。我们建立了渐近零分布、对可检测备择假设的一致性以及漂移备择假设下的局部功效。我们还推导了一类加权检验统计量中的局部最优检验。用于实施检验的可行临界值从数据中获得。模拟显示可靠的尺寸控制和有竞争力的功效,包括在弱相关情况下。我们通过韩国住宅电力需求与温度分布的函数回归分析说明了该方法。

英文摘要

We develop dimension-reduction-free tests for the slope function in functional linear regression when the functional regressor may be endogenous or measured with error. The tests are based on a functional moment condition induced by an auxiliary functional variable and do not require estimation of the slope function. This feature is particularly useful in infinite-dimensional settings, where the identification and regularization conditions needed for consistent estimation are often strong and difficult to verify. The proposed procedures remain asymptotically valid under weak or even failed relevance of the auxiliary variable, and they are consistent against fixed alternatives that are detectable through the moment operator. We establish the asymptotic null distribution, consistency against detectable alternatives, and local power under drifting alternatives. We also derive the locally optimal test within a class of weighted test statistics. Feasible critical values for implementation of the tests are obtained from data. Simulations show reliable size control and competitive power, including under weak relevance. We illustrate the method using a functional regression analysis of residential electricity demand and temperature distributions in South Korea.

2605.29732 2026-06-05 quant-ph hep-th math-ph math.MP stat.AP

Exact Geometric Typicality and Bipartite Entanglement from the Projected Central Limit Theorem on Hyperspheres

超球面上投影中心极限定理的精确几何典型性与二分纠缠

Zhi-Wei Wang, Pei-Wen Li, Samuel L. Braunstein

AI总结 基于超球面上精确投影中心极限定理,推导了子系统占据概率的Beta分布和Lubkin纯度公式,并给出了二分量子互信息在Haar随机纯态下的完整渐近展开及其非微扰闭式。

详情
Comments
11 pages, 1 figure. This is a companion paper to our simultaneous submission with a title "Non-Perturbative Closed Form for the Typical Bipartite Mutual Information of Haar-Random States"
AI中文摘要

从超球面上的精确投影中心极限定理出发,我们通过初等超球面矩重新推导了子系统占据概率的Beta分布和Lubkin纯度公式,量化了相对于标准本征态热化与典型性处理中使用的高斯近似的有限尺寸“峰度”尾部抑制。我们的主要新结果涉及Haar随机纯态下的二分量子互信息$\langle I(A{:}B) angle$。我们证明,其关于$1/N$的完整渐近展开具有伯努利分解形式,其中每个阶数$k \ge 1$携带对称因子$(d_A^{2k}-1)(d_B^{2k}-1)$,且所有更高奇数阶修正恒为零。通过对Page公式(在文献~\cite{Page1993}中猜想并随后被证明~\cite{Foong1994, SanchezRuiz1995, Sen1996})的精确代数重组,我们建立了主导有限尺寸修正可分解为占优的$\mathfrak{su}(d_A) \otimes \mathfrak{su}(d_B)$二分量子相干贡献$(d_A^2 - 1)(d_B^2 - 1)/(2N)$和减去的经典概率(Cartan $\otimes$ Cartan)贡献$(d_A - 1)(d_B - 1)/(2N)$,并通过Schur优超定理将这一分解追溯到对角熵与特征值熵之间的差异,其中维度计数$(d-1)$和$(d^2-1)$通过广义Bloch分解的Cartan结构获得意义。这些结果可统一为一个非微扰闭式:精确典型互信息因子化为$\langle I(A{:}B) angle = (d_A^2-1)(d_B^2-1)\,\mathcal{G}(d_A,d_B,d_E)$,其中$\mathcal{G}$由显式的Bose-Einstein积分给出,其关于$1/N$的渐近展开重现伯努利级数。

英文摘要

Starting from the exact Projected Central Limit Theorem on hyperspheres, we rederive the Beta distribution for subsystem occupation probabilities and Lubkin's purity formula from elementary hyperspherical moments, quantifying the finite-size ``platykurtic'' suppression of tails relative to the Gaussian approximation used in standard eigenstate-thermalization and typicality treatments. Our main new result concerns the bipartite quantum mutual information $\langle I(A{:}B)\rangle$ for Haar-random pure states. We show that its full asymptotic expansion in $1/N$ admits a Bernoulli-factorized form in which every order $k \ge 1$ carries the symmetric factor $(d_A^{2k}-1)(d_B^{2k}-1)$ and all higher odd-order corrections vanish identically. Through an exact algebraic reorganization of Page's formula (conjectured in Ref.~\cite{Page1993} and subsequently proven~\cite{Foong1994, SanchezRuiz1995, Sen1996}), we establish that the leading finite-size correction separates into a dominant $\mathfrak{su}(d_A) \otimes \mathfrak{su}(d_B)$ bipartite quantum coherence contribution $(d_A^2 - 1)(d_B^2 - 1)/(2N)$ and a subtracted classical-probability (Cartan $\otimes$ Cartan) contribution $(d_A - 1)(d_B - 1)/(2N)$, and we trace this separation to the difference between diagonal and eigenvalue entropies via Schur's majorisation theorem, with the dimensional counts $(d-1)$ and $(d^2-1)$ acquiring meaning through the Cartan structure of the generalised Bloch decomposition. These results admit a single non-perturbative closed form: the exact typical mutual information factors as $\langle I(A{:}B)\rangle = (d_A^2-1)(d_B^2-1)\,\mathcal{G}(d_A,d_B,d_E)$, with $\mathcal{G}$ given by an explicit Bose--Einstein integral whose asymptotic expansion in $1/N$ reproduces the Bernoulli series.

2605.29725 2026-06-05 quant-ph hep-th math-ph math.MP stat.AP

Non-Perturbative Closed Form for the Typical Bipartite Mutual Information of Haar-Random States

Haar随机态典型二分互信息的非微扰闭式

Zhi-Wei Wang, Pei-Wen Li, Samuel L. Braunstein

AI总结 通过Bose-Einstein核的显式收敛积分,给出了Haar随机纯态平均二分量子互信息的非微扰闭式,并揭示了其与su(d_A)和su(d_B)维度的精确关系。

详情
Comments
5 pages. This is a companion paper to our simultaneous submission with a title "Exact Geometric Typicality and Bipartite Entanglement from the Projected Central Limit Theorem on Hyperspheres"
AI中文摘要

Haar随机纯态的平均二分量子互信息$\langle I(A{:}B)\rangle$可以通过Page公式用digamma函数精确表示。我们证明该量具有单一的非微扰闭式:$\langle I(A{:}B)\rangle = (d_A^2-1)(d_B^2-1)\,\mathcal{G}(d_A,d_B,d_E)$,其中$\mathcal{G}$由Bose-Einstein核上的显式收敛积分给出。整体因子$(d_A^2-1)(d_B^2-1)=\dim[\mathfrak{su}(d_A)]\cdot\dim[\mathfrak{su}(d_B)]$是精确的,而不仅仅是渐近的。$\mathcal{G}$在$1/N$中的渐近展开产生一个伯努利因子化级数,其系数涉及$\zeta(1{-}2k)$;该级数发散,而我们的积分是其精确的Borel和。积分表示还通过核的尺度反演对称性使得$\langle I\rangle < (d_A^2{-}1)(d_B^2{-}1)/(2N)$显式成立。我们的推导将互信息的结构追溯到Page熵的精确分解,分为对角(Dirichlet)贡献和Schur优超特征值修正,这些组合成互信息时清晰地将经典关联与量子关联分开。

英文摘要

The average bipartite quantum mutual information $\langle I(A{:}B)\rangle$ of Haar-random pure states can be expressed exactly through Page's formula in terms of digamma functions. We show that this quantity admits a single non-perturbative closed form: $\langle I(A{:}B)\rangle = (d_A^2-1)(d_B^2-1)\,\mathcal{G}(d_A,d_B,d_E)$, where $\mathcal{G}$ is given by an explicit convergent integral over a Bose--Einstein kernel. The overall factor $(d_A^2-1)(d_B^2-1)=\dim[\mathfrak{su}(d_A)]\cdot\dim[\mathfrak{su}(d_B)]$ is exact, not merely asymptotic. The asymptotic expansion of $\mathcal{G}$ in $1/N$ yields a Bernoulli-factorised series whose coefficients involve $ζ(1{-}2k)$; this series diverges, and our integral is its exact Borel sum. The integral representation also makes $\langle I\rangle < (d_A^2{-}1)(d_B^2{-}1)/(2N)$ manifest via a scale-inversion symmetry of the kernel. Our derivation traces the mutual information's structure to an exact decomposition of Page's entropy into a diagonal (Dirichlet) contribution and a Schur-majorisation eigenvalue correction, whose assembly into the mutual information cleanly separates classical from quantum correlations.

2605.16846 2026-06-05 stat.ME

Efficient frequentist fractional polynomials for skewed dose-response and survival data: a variance-reducing alternative to OLS-FP

分数多项式基下的多项式最大化方法:通向贝叶斯分数多项式的频率学派桥梁

Serhii Zabolotnii

AI总结 提出PMM-FP方法,将多项式最大化方法扩展到分数多项式基,推导出相对于OLS-FP的闭式方差缩减系数,并验证其有效性。

详情
Comments
Revised and retitled version prepared for journal submission; applied biostatistical framing strengthened, primary-biliary-cirrhosis confirmation added, and supplementary theory separated. 25 pages, 2 figures, 5 tables
AI中文摘要

分数多项式广泛用于剂量反应建模,最近的贝叶斯分数多项式工作重新激发了对此有限模型类的兴趣。我们提出PMM-FP,将Kunchenko的多项式最大化方法频率学派扩展到分数多项式基,在适当的矩条件下,针对正幂集和全幂集并行开发。主要结果是相对于OLS-FP,在非对称非高斯误差下闭式方差缩减系数g_2=1-gamma_3^2/(2+gamma_4),在Lean 4中形式化并通过蒙特卡洛验证。在GBSG残差上,gamma_3=-1.74,gamma_4=4.91,g_2约0.56:预期标准误增益。PMM-FP是一种计算廉价的通向贝叶斯FP建模的频率学派桥梁。

英文摘要

Fractional polynomials (FP) are a standard tool for modelling nonlinear dose-response and covariate effects, implemented in the widely used mfp package. The conventional FP fit estimates its coefficients by ordinary least squares (OLS-FP), which is statistically inefficient when the regression errors are skewed or heavy-tailed, a common situation for survival times, concentrations and biomarkers. We present a drop-in replacement that keeps the identical FP model and design but estimates the coefficients with a moment-based score tuned to the residual skewness and kurtosis, giving a closed-form efficiency factor g2 = 1 - gamma3^2/(2+gamma4) relative to OLS-FP. Across skewed error laws the method reduces slope-coefficient variance by 10-20% for mildly skewed errors and up to roughly 60% for heavy-tailed log-normal errors, at realistic sample sizes, while keeping confidence-interval coverage close to nominal, and it reverts exactly to OLS-FP under symmetry, so it is never harmful when no gain is available. On the German Breast Cancer Study Group cohort it narrows the tumour-size confidence interval by 26% (bootstrap variance ratio 0.53 against the predicted 0.56), and a primary-biliary-cirrhosis cohort reproduces the gain. The estimator is closed-form, runs in milliseconds, and is released as a reproducible R package (pmm_fp in EstemPMM) with a one-command replication bundle; its core variance identity is machine-checked in Lean 4.

2605.21557 2026-06-05 stat.ML cs.AI cs.LG

Scalable Reinforcement Learning via Adaptive Batch Scaling

通过自适应批处理缩放实现可扩展的在线强化学习

Jongchan Park

AI总结 本文提出自适应批处理缩放方法,通过动态调整有效批处理大小来平衡强化学习早期的可塑性需求和晚期的稳定收敛,发现增大网络和批处理大小的组合在强化学习中取得最佳性能。

详情
AI中文摘要

传统观点认为大批次训练与强化学习(RL)本质上不兼容,超过一定阈值后增大批次大小通常会导致回报减少或性能下降,由于数据分布的固有非平稳性。我们通过观察非平稳性并非RL的固定属性,而是随着训练过程演变:早期阶段表现出快速的行为转变,需要小批次以保持可塑性,而晚期阶段接近准平稳状态,大批次可实现精确收敛。受此启发,我们提出自适应批处理缩放(ABS),根据学习策略的稳定性动态调整有效批次大小。ABS的核心是行为分歧,一种新的度量指标,通过测量连续更新之间的动作级转变来量化策略非平稳性,用于将批次大小反向缩放至策略波动性。与并行化Q网络(PQN)算法结合并在ALE基准上评估,ABS无缝地平衡了早期阶段的可塑性和晚期阶段的稳定收敛。令人惊讶的是,与传统观点相反,我们的结果表明,较大的网络和较大的批次大小的组合实现了最佳性能——一种之前被认为在强化学习中无法实现的扩展行为,现在通过自适应批处理控制得以解锁。

英文摘要

Conventional wisdom holds that large-batch training is fundamentally incompatible with Reinforcement Learning (RL) - beyond a modest threshold, increasing batch sizes typically yields diminishing returns or performance degradation due to the inherent non-stationarity of the data distribution. We challenge this view by observing that non-stationarity is not a fixed property of RL, but evolves throughout training: early stages exhibit rapid behavioral shifts that demand small batches for plasticity, whereas late stages approach a quasi-stationary regime where large batches enable precise convergence. Motivated by this observation, we propose Adaptive Batch Scaling (ABS), that dynamically adjusts the effective batch size according to the stability of the learning policy. Central to ABS is Behavioral Divergence, a novel metric that quantifies policy non-stationarity by measuring action-level shifts between consecutive updates, which we use to scale batch size inversely to policy volatility. Integrated with the Parallelised Q-Network (PQN) algorithm and evaluated on the ALE benchmark, ABS seamlessly reconciles early-stage plasticity with late-stage stable convergence. Strikingly, contrary to conventional wisdom, our results reveal that the combination of larger networks and larger batch sizes achieves the best performance - a scaling behavior previously thought to be unattainable in RL, now unlocked through adaptive batch control.

2405.05097 2026-06-05 cs.LG stat.ML

Biology-inspired joint distribution neurons based on Hierarchical Correlation Reconstruction allowing for multidirectional propagation of values and densities

受生物学启发的联合分布神经元:基于层次相关性重建的多向传播神经元

Jarek Duda

AI总结 本文提出了一种受生物学启发的联合分布神经元,通过层次相关性重建实现多向值和密度传播,改进了传统人工神经元在学习、灵活性和鲁棒性方面的不足。

详情
Comments
12 pages, 17 figures
AI中文摘要

最近,一百万生物神经元(BNN)在现代强化学习(RL)方法中表现出色,尤其在Pong游戏中表现优异,表明它们在学习、灵活性和鲁棒性方面仍具有显著优势,提示需要改进当前人工神经元(如MLP/KAN)以更好地与生物神经元一致。本文提出了一种扩展KAN方法的神经元,包含局部联合分布模型:ρ(x)=∑_{j∈B} a_j f_j(x)对于x∈[0,1]^d,增加了对KAN的解释和信息流控制,并允许逐步补充生物神经元的三个基本特性:1)生物轴突可以双向传播,而当前人工神经元仅单向传播,联合分布神经元可通过替换变量获得条件值/分布;2)动物表现出风险规避,需要处理方差,现实世界更需要概率模型,所提方法可预测和传播分布作为矩向量(期望值、方差等);3)生物神经元需要局部训练,除了反向传播外,所提方法还允许其他训练方式,如直接训练、张量分解或最终的局部和有前景的信息瓶颈。所提方法非常通用,也可用于扩展softmax在transformer或JEPA嵌入中的应用,暗示特征是现实世界属性联合密度的混合矩。

英文摘要

Recently a million of biological neurons (BNN) has turned out better from modern RL methods in playing Pong~\cite{RL}, reminding they are still qualitatively superior e.g. in learning, flexibility and robustness - suggesting to try to improve current artificial e.g. MLP/KAN for better agreement with biological. There is proposed extension of KAN approach to neurons containing model of local joint distribution: $ρ(\mathbf{x})=\sum_{\mathbf{j}\in B} a_\mathbf{j} f_\mathbf{j}(\mathbf{x})$ for $\mathbf{x} \in [0,1]^d$, adding interpretation and information flow control to KAN, and allowing to gradually add missing 3 basic properties of biological: 1) biological axons propagate in both directions~\cite{axon}, while current artificial are focused on unidirectional propagation - joint distribution neurons can repair by substituting some variables to get conditional values/distributions for the remaining. 2) Animals show risk avoidance~\cite{risk} requiring to process variance, and generally real world rather needs probabilistic models - the proposed can predict and propagate also distributions as vectors of moments: (expected value, variance) or higher. 3) biological neurons require local training, and beside backpropagation, the proposed allows many additional ways, like direct training, through tensor decomposition, or finally local and promising: information bottleneck. Proposed approach is very general, can be also used as extension of softmax in embeddings of e.g. transformer, JEPA, Mamba, suggesting interpretation that features are mixed moments of joint density of real-world properties.

2605.13587 2026-06-05 stat.ML cs.LG eess.SP

Reframing preprocessing selection as model-internal calibration in near-infrared spectroscopy: A large-scale benchmark of operator-adaptive PLS and Ridge models

将预处理选择重新定义为近红外光谱学中的模型内部校准:一种大规模的运算符自适应PLS和岭模型基准测试

Gregory Beurier, Robin Reiter, Camille Noûs, Lauriane Rouan, Denis Cornet

AI总结 本文研究了在近红外光谱学中,将预处理选择重新定义为模型内部校准的方法,通过大规模基准测试比较运算符自适应PLS和岭模型的性能和效率。

详情
Comments
17 pages, 8 figures; supplementary material (39 pages, 4 figures) included. Extended preprint version of a companion study prepared as a concise journal article (same results, different framing and scope). Code and artifacts: https://github.com/GBeurier/nirs4all-aom
AI中文摘要

预处理筛选通常是近红外光谱学校准工作流程中最昂贵的部分。其有效性在于平滑、导数、去趋势及相关滤波器会改变PLS或岭回归所看到的光谱方向,但完整的外部搜索会反复拟合几乎相同的线性模型。本文研究了将该搜索折叠成一个校准步骤的情况。对于严格的线性预处理运算符,变换后的PLS交叉协方差满足(XA^T)^T Y = AX^T Y,而岭回归依赖于运算符诱导的核X A^T A X^T。这些恒等式允许在模型内部筛选有限的运算符银行,同时保留原始波长系数。样本自适应或拟合的校正如SNV、MSC、EMSC和ASLS仍保持为折叠局部分支,而不是被吸收进代数中。本研究使用AOM基准队列:在显式中包含61个回归行和17个分类行。在主回归分母(N=32)上,普通的紧凑银行AOM-PLS记录了与PLS默认值相比的中位RMSEP比为0.991,与PLS-HPO相比为0.990;所选的ASLS-AOM-compact-cv5分支在相同的两个参考上记录为0.985和1.002。普通的AOMRidge-global-compact-none基线记录了与Ridge默认值相比的0.974,与Ridge-HPO相比为0.984,而所选的AOMRidge-Blender-headline-spxy3记录为0.918和0.966。所选分类器AOM-PLS-DA-global-simpls-covariance在13个数据集上将平衡精度提高了0.159,其中12/13胜出。运行时间差距是实际结果:PLS-HPO每次运行的中位总时间是710.81秒,而所选的AOM-PLS分支仅为1.63秒。线性运算符自适应校准因此在预测质量上与彻底的预处理筛选相当,对于PLS来说,拟合时间减少了多个数量级。

英文摘要

Preprocessing screening is often the most expensive part of a near-infrared spectroscopy calibration workflow. It works because smoothing, derivatives, detrending and related filters change the spectral directions seen by partial least squares (PLS) or Ridge regression, but a full external search repeatedly refits nearly the same linear model. This paper studies the case where that search can be collapsed into one calibration step. For a strict linear preprocessing operator A acting on row spectra as XA^T, the transformed PLS cross-covariance satisfies (XA^T)^T Y = A X^T Y, and Ridge regression depends on the operator-induced kernel X A^T A X^T. These identities let a finite operator bank be screened inside the model while retaining original-wavelength coefficients, and the same identity extends to cheaply evaluated linear operator chains. Sample-adaptive or fitted corrections such as SNV, MSC, EMSC and ASLS are not strict linear; we prove the boundary and keep them as fold-local branches. The cohort has 61 regression and 17 classification rows, with a strict paired regression denominator of N=32 for the eight paper variants. There, AOM-PLS reaches median RMSEP ratios of 0.991/0.990 (simple) and 0.985/1.002 (best) against PLS-default/PLS-HPO, and AOM-Ridge reaches 0.974/0.984 (simple) and 0.918/0.966 (best) against Ridge-default/Ridge-HPO. The operator-adaptive classifier AOM-PLS-DA improves balanced accuracy by a median 0.159 on N=13 datasets (12/13 wins). The practical result is the runtime gap: PLS-HPO takes a median 710.81 s per run, whereas AOM-PLS takes 1.18-1.63 s -- 436 to 602 times less PLS fitting time. Linear operator-adaptive calibration thus gives prediction quality comparable to exhaustive preprocessing screening, with orders-of-magnitude less fitting time for PLS.

2509.10825 2026-06-05 cs.LG cs.AI stat.ML

CUBE: Contrastive Understanding by Balanced Experiments

CUBE: 通过平衡实验实现对比理解

Dongseok Kim, Hyoungsun Choi, Mohamed Jismy Aashik Rasool, Gisung Oh

AI总结 本文提出CUBE框架,通过平衡低-高探针解释已训练的预测模型,揭示模型的主要效应和交互作用,验证了其在合成和现实表格任务中的有效性。

详情
Comments
The core framework and main claims remain unchanged; the manuscript has been revised for clarity, presentation, and consistency
AI中文摘要

事后解释依赖于模型查询是如何组织的。我们提出了CUBE,一种基于设计的框架,通过平衡的低-高探针来解释已训练的预测模型。所选变量定义了因素,设计的特征级组合定义了查询条件,模型预测被总结为因子对比。CUBE报告主效应和成对交互作用作为受控阅读的平均和条件响应变化的总结。在合成和现实表格任务中的实验表明,CUBE恢复了主导的学习效应结构,澄清了查询高效的可识别性,并支持筛查-后续细化。

英文摘要

Post-hoc explanation depends on how model queries are organized. We propose CUBE, a design-based framework that explains a trained predictive model through balanced low--high probes. Selected variables define factors, designed feature-level combinations define query conditions, and model predictions are summarized as factorial contrasts. CUBE reports main effects and pairwise interactions as controlled readings of average and conditional response changes over a declared design space. Experiments on synthetic and real tabular tasks show that CUBE recovers dominant learned effect structure, clarifies query-efficient identifiability, and supports screening--follow-up refinement.

2605.15454 2026-06-05 cs.CL cs.LG stat.ML

Reasoning Models Don't Just Think Longer, They Move Differently

推理模型不只思考更久,它们的移动方式不同

Anders Gjølbye, Lars Kai Hansen, Sanmi Koyejo

AI总结 本文研究了推理训练模型在生成链式思维时的轨迹差异,发现通过长度校正后,不同领域中难度与轨迹几何的耦合关系存在显著差异,尤其是在代码领域中,推理训练模型表现出更直接的轨迹和更一致的局部曲率。

详情
Comments
Preprint
AI中文摘要

经过训练的推理语言模型通常在更难的问题上消耗更多标记,但更长的思维链并不表明模型只是计算更多步骤或遵循不同的内部轨迹。我们通过在编程、数学和布尔可满足性问题中研究链式思维生成过程中的隐藏状态轨迹来区分这一区别。原始轨迹几何强烈受到生成长度的影响:更长的生成会机械地改变路径统计,因此在没有调整的情况下,基于难度的比较是误导的。在残差化轨迹统计后,难度在所有研究的领域中系统地与修正后的轨迹几何相关联。在代码领域中,最清晰的推理特定分离出现在更难的问题中,推理训练模型显示出更直接的修正轨迹和更一致的局部曲率,而与匹配的指令训练基线相比,这种差异更小。在数学和布尔可满足性问题中,修正后的难度-几何耦合较弱,但仍存在。提示阶段的线性探测不反映代码领域的分离,行为注释显示更强的修正耦合与策略转变和不确定性监控同时出现。这些发现确立了长度校正作为生成时间轨迹分析的先决条件,并表明推理训练可以与不同的修正轨迹几何相关联,这种效果的强度取决于领域。

英文摘要

Reasoning-trained language models often spend more tokens on harder problems, but longer chains of thought do not show whether a model is merely computing for more steps or following a different internal trajectory. We study this distinction through hidden-state trajectories during chain-of-thought generation across competitive programming, mathematics, and Boolean satisfiability. Raw trajectory geometry is strongly shaped by generation length: longer generations mechanically alter path statistics, so difficulty-dependent comparisons are misleading without adjustment. After residualizing trajectory statistics on length, difficulty remains systematically coupled to corrected trajectory geometry across all domains studied. The clearest reasoning-specific separation appears in the code domain, where harder problems show more direct corrected trajectories and less heterogeneous local curvature in reasoning-trained models than in matched instruction-tuned baselines. Corrected difficulty-geometry coupling is weaker, but still present, in mathematics and Boolean satisfiability. Prompt-stage linear probes do not mirror the code-domain separation, and behavioral annotations show that stronger corrected coupling co-occurs with strategy shifts and uncertainty monitoring. Together, these findings establish length correction as a prerequisite for generation-time trajectory analysis and show that reasoning training can be associated with distinct corrected trajectory geometry, with the strength of the effect depending on the domain.

2605.12951 2026-06-05 stat.ML cs.LG

Coreset-Induced Conditional Velocity Flow Matching

由Coreset诱导的条件速度流匹配

Xiao Wang, Zihua She, Jianxi Su

AI总结 本文提出了一种生成模型CCVFM,通过数据驱动的源分布增强层次化修正流,利用Coreset压缩目标数据并生成高斯混合分布,从而在无需学习神经采样器的情况下实现条件速度律的闭式表达,并通过轻量级修正流进一步优化生成效果。

详情
AI中文摘要

我们提出了Coreset-Induced Conditional Velocity Flow Matching (CCVFM),一种生成模型,通过数据驱动的源分布增强层次化修正流。层次化流匹配在速度空间中建模完整的条件速度定律,但其内部流被要求从头开始将各向同性高斯噪声传输到多模态目标速度分布。我们的关键观察是,此内部源可以被一个闭式近似替代,该近似基于目标的Coreset。CCVFM首先利用熵Sinkhorn Coreset将目标压缩为加权原子,并将它们提升为高斯混合分布。由此诱导的条件速度定律是一个闭式高斯混合分布,可在不学习神经采样器的情况下进行采样。一个轻量级修正流,从该精确近似源训练而来,然后优化剩余的近似到目标残差,而不是学习整个噪声到数据映射。我们证明,在显式压缩假设下,近似传输成本等于目标-近似Wasserstein差距,而噪声-源的类比具有维度尺度下界。我们进一步刻画了直接近似源训练目标的条件二次矩,并表明当近似条件律接近真实条件速度律在均值和协方差时,其源依赖的超额是小的。实验证明,在MNIST、CIFAR-10、ImageNet-32和CelebA-HQ上,所提方法在匹配架构下实现了具有竞争力的少步生成。

英文摘要

We propose Coreset-Induced Conditional Velocity Flow Matching (CCVFM), a generative model that augments hierarchical rectified flow with a data-informed source distribution. Hierarchical flow matching models the full conditional velocity law in velocity space, but its inner flow is asked to transport isotropic Gaussian noise to a multimodal target velocity distribution from scratch. Our key observation is that this inner source can be replaced by a closed-form surrogate built from a coreset of the target. CCVFM first compresses the target into weighted atoms using an entropic Sinkhorn coreset and lifts them to a Gaussian mixture. The induced conditional velocity law is then a closed-form Gaussian mixture that can be sampled without a learned neural sampler. A lightweight correction flow, trained from this exact surrogate source, then refines the remaining surrogate-to-target residual rather than learning an entire noise-to-data map. We prove that the surrogate transport cost equals the target--surrogate Wasserstein gap under an explicit compression assumption, whereas the noise-source analogue has a dimension-scale lower bound. We further characterize the conditional second moment of the direct surrogate-source training target and show that its source-dependent excess is small when the surrogate conditional law is close to the true conditional velocity law in mean and covariance. Empirically, on MNIST, CIFAR-10, ImageNet-32, and CelebA-HQ, the proposed method reaches competitive few-step generation under matched architectures.

2510.12663 2026-06-05 stat.ME

The $α$--regression for compositional data: a unified framework for standard, temporal and spatial regression models including compositional predictors

α-回归用于组成数据:一个统一的框架,用于标准、时间序列和空间回归模型,包括组成预测变量

Michail Tsagris, Nader Alharbi, Abdulaziz Alenazi, Yannis Pantazis

AI总结 本文提出了一种统一的α-回归框架,用于处理组成数据,涵盖了标准、时间序列和空间回归模型,并探讨了其在不同场景下的应用和改进。

详情
AI中文摘要

我们重新审视了用于组成数据的α-回归框架。我们将α-回归公式化为非线性最小二乘问题,研究其渐近性质,并通过Levenberg-Marquardt算法提供高效的估计。然后,我们提出基于排列的假设检验程序,推导边际效应以进行解释,并提供对每个预测变量影响的视觉检查。我们进一步讨论了鲁棒版本、自然样条的引入以及组成预测变量的纳入,这进一步促进了简单时间序列模型的构建。通过四个模型将框架扩展到空间设置中:(a) α-空间滞后X回归模型,通过空间滞后协变量纳入空间溢出效应,并分解为直接和间接效应;(b) α-空间自回归模型,允许空间自相关;(c) 地理加权α-回归,允许系数在空间上变化以捕捉局部关系;(d) α-特征向量空间过滤器,计算上高效且通过核化距离矩阵的特征向量捕捉空间依赖性。对四个真实数据集的应用表明,这些模型在文献中与现有模型相当或表现更优。示例展示了α-回归在不同场景下可以超越各种竞争回归模型,其空间扩展能够捕捉依赖性并提高预测性能。总体而言,示例提供了证据表明,对数比方法并不总是导致最优结果。

英文摘要

We revisit the $α$--regression framework for compositional data. We formulate $α$--regression as a non--linear least squares problem, study its asymptotic properties, and provide efficient estimation via the Levenberg--Marquardt algorithm. We then propose a permutation--based hypothesis testing procedure, derive marginal effects for interpretation, and provide a visual inspection of the effect of each predictor. We further discuss robustified versions, the inclusion of natural splines, and the incorporation of compositional predictors, which further facilitate the formulation of a simple time series model. The framework is extended to spatial settings through four models. (a) The $α$--spatially--lagged X regression model, which incorporates spatial spillover effects via spatially--lagged covariates, with decomposition into direct and indirect effects. (b) The $α$--spatial autoregressive model that allows for spatial autocorrelation. (c) The geographically--weighted $α$--regression, which allows coefficients to vary spatially for capturing local relationships. (d) The $α$--eigenvector spatial filtering that is computationally efficient and captures spatial dependence via the eigenvectors of the kernelized distance matrix. Applications to four real datasets illustrate that the models perform on par with or outperform existing models in the literature. The examples showcase that $α$--regression can outperform various competing regression models under different scenarios and its spatial extensions capture the dependence and improve the predictive performance. Overall, the examples provide evidence that the log--ratio methodology does not always lead to the optimal results.

2602.07165 2026-06-05 stat.CO physics.data-an stat.ME

PoissonRatioUQ: An R package for band ratio uncertainty quantification

PoissonRatioUQ: 一个用于带比不确定性量化 的 R 包

Matthew LeDuc, Tomoko Matsuo

AI总结 该研究提出一个 R 包,用于处理计数比的贝叶斯建模和不确定性量化,核心方法基于假设感兴趣量是泊松均值的比值而非计数比值,并提供了多种获取该量的方法,包括有空间信息和无空间信息的情况,同时增加了对形式为 $Z=(mT+z_0)^{p}$ 的问题的不确定性量化能力。

详情
Comments
Description of the R package in https://github.com/mfleduc/PoissonRatioUQ. New release available on Zenodo at https://doi.org/10.5281/zenodo.20492078
AI中文摘要

我们介绍了一个 R 包,用于处理涉及计数比的贝叶斯建模和不确定性量化。建模基于假设感兴趣量是泊松均值的比值而非计数比值。我们为有和无空间信息的问题提供了多种获取该量的选项。此外,我们还增加了对形式为 $Z=(mT+z_0)^{p}$ 的问题的不确定性量化能力,其中 $Z$ 是强度比,$T$ 是感兴趣量。

英文摘要

We introduce an R package for Bayesian modeling and uncertainty quantification for problems involving count ratios. The modeling relies on the assumption that the quantity of interest is the ratio of Poisson means rather than the ratio of counts. We provide multiple different options for retrieval of this quantity for problems with and without spatial information included. Some added capability for uncertainty quantification for problems of the form $Z=(mT+z_0)^{p}$, where $Z$ is the intensity ratio and $T$ the quantity of interest, is included.

2605.09726 2026-06-05 math.ST stat.ME stat.TH

On the Impossibility of Specification Testing of Interference Models Based on Exposure Mappings

关于基于暴露映射的干扰模型规范检验不可能性的研究

Chao Gao, Christopher Harshaw, Fredrik Sävje, Yitan Wang

AI总结 本文研究了基于暴露映射的干扰模型规范检验问题,证明了现有检验方法的低效力是不可避免的,并指出任何检验方法的最坏情况I型和II型错误率之和必须为一,因此不存在统一一致的检验方法。

详情
AI中文摘要

研究人员使用基于暴露映射的干扰模型来促进随机实验中因果效应的估计。为了检验此类模型的真实性,研究人员可以使用规范检验来检测模型偏离。然而,现有检验方法的效力较差,通常无法检测到重要的模型偏离。本文的主要结果是证明基于暴露映射模型的规范检验问题本质上是困难的,现有检验方法的低效力是不可避免的。特别是,对于任何此类模型的规范检验,最坏情况下的I型和II型错误率之和必须为一,这排除了存在统一一致检验的可能性。这是由一种简单检验方法所达到的最坏情况总体错误率,该方法丢弃所有数据并随机拒绝原假设。因此,从这个意义上说,检验问题是不可能的。这一负面结果适用于所有暴露映射、所有样本量、对于有界结果以及对于与原假设最大分离的替代假设。尽管某些检验可以检测某些类型的偏离,但总会有相关的偏离无法被检测到。因此,信息性规范检验必须在暴露映射本身所施加的限制之外,对所寻求的功率的替代模型进行限制。我们通过提供一种统一一致的检验方法来区分无干扰模型与网络线性均值模型来说明这一点。

英文摘要

Researchers use interference models based on exposure mappings to facilitate estimation of causal effects in randomized experiments with interference. To test the veracity of such models, researchers can use specification tests that aim to detect departures from the stipulated model. However, existing tests suffer from poor power and are often unable to detect important model violations. The main result in this paper is to show that the specification testing problem for exposure mapping models is inherently difficult, and the poor power of existing tests is inescapable. In particular, the worst-case Type I and Type II error rates must sum to one for any specification test of such models, ruling out the existence of a uniformly consistent test. This is the worst-case overall error rate achieved by a naive test that discards all data and arbitrarily rejects the null at random; the testing problem is in this sense impossible. This negative result holds true for all exposure mappings, all sample sizes, for uniformly bounded outcomes, and for alternatives that are maximally separated from the null. While some tests can detect some type of departures from the null model, there will always be relevant departures from the null that are undetectable. Informative specification tests must therefore restrict the alternative model against which they seek to attain power for, beyond the restrictions imposed by the exposure mappings alone. We illustrate this by providing a uniformly consistent test for differentiating no-interference from a network-linear-in-means model.

2605.08318 2026-06-05 cs.LG cs.AI cs.NA math.NA physics.comp-ph stat.ML

When Attention Beats Fourier: Multi-Scale Transformers for PDE Solving on Irregular Domains

当注意力胜过傅里叶:用于不规则域上的PDE求解的多尺度变换器

Brandon Yee, Pairie Koh, Jack Rodriguez, Mihir Tekal

AI总结 本文研究了深度学习模型在求解偏微分方程(PDE)时的架构选择问题,探讨了基于学习注意力的变换器架构在何时优于傅里叶域神经算子。引入了多尺度注意力变换器(MSAT),该架构将时空解的历史编码为令牌序列,并通过复合监督目标进行端到端训练。在五个基准问题上,与九种基线方法(包括物理信息神经网络、神经算子和状态空间模型)进行了全面的实证评估,展示了在复杂几何问题上的最佳泛化能力。

详情
Comments
Substantial Revision Required
AI中文摘要

我们研究了深度学习模型在求解偏微分方程(PDE)时的架构选择问题,探讨了基于学习注意力的变换器架构在何时优于傅里叶域神经算子。我们介绍了多尺度注意力变换器(MSAT),一种深度学习架构,将时空解的历史编码为令牌序列,并通过复合监督目标进行端到端训练。我们对九种基线方法(包括物理信息神经网络、神经算子和状态空间模型)进行了全面的实证评估,覆盖了PINNacle套件中的五个基准问题,使用相同的训练/测试分割和参考数据。MSAT在复杂几何问题上实现了最先进的泛化能力(Heat2D-CG的L²相对误差为0.0101,比FNO提高了3.7倍),在34秒的总推理时间下,比Mamba-NO的120,812秒快得多。对物理正则化组件的消融研究揭示了精确的归纳偏置权衡:物理先验减少了扩散主导问题的测试误差,但会退化混沌和回流流动制度的泛化能力,直接刻画了先验规格错误的边界。近似误差界作为域边界复杂性κ的函数,为这些实证发现提供了理论基础,并为架构选择提供了一个原则性的规则。

英文摘要

We study the problem of \emph{architecture selection} for deep learning models trained to solve partial differential equations (PDEs), asking when transformer-based architectures with learned attention outperform Fourier-domain neural operators. We introduce the \textbf{Multi-Scale Attention Transformer} (\msat{}), a deep learning architecture that encodes spatiotemporal solution histories as token sequences and trains end-to-end via a composite supervised objective with optional physics-informed regularization terms. We conduct a comprehensive empirical evaluation against nine baselines -- including physics-informed neural networks (PINNs), neural operators (FNO, DeepONet, GNOT), and state-space models (Mamba-NO) -- across five benchmark problems from the PINNacle suite, using identical train/test splits and reference data for all methods. \msat{} achieves state-of-the-art generalization on complex geometry problems ($L^2_\mathrm{rel} = 0.0101$ on Heat2D-CG, a $3.7\times$ improvement over FNO) at $34\,\mathrm{s}$ total inference vs.\ $120{,}812\,\mathrm{s}$ for Mamba-NO. Ablation studies over the physics regularization component reveal a precise inductive bias tradeoff: physics priors reduce test error on diffusion-dominated problems but degrade generalization on chaotic and recirculating-flow regimes, directly characterizing the prior misspecification boundary. Approximation error bounds as a function of domain boundary complexity $κ$ provide a theoretical basis for these empirical findings and a principled rule for architecture selection.

2605.07096 2026-06-05 cs.LG cs.AI stat.ME

Query-efficient model evaluation using cached responses

通过缓存响应实现高效的模型评估

Hayden Helm, Ben Johnson, Carey Priebe

AI总结 本文提出了一种基于数据核视角空间(DKPS)的方法,利用已缓存的模型响应来预测基准性能,从而减少评估新模型所需的查询数量,提高了模型评估的效率。

详情
AI中文摘要

在部署新模型之前,评估其在现有基准上的表现通常是必要的。对于现代评估框架来说,生成并评估所有查询的响应可能成本过高。实际上,先前评估模型的响应往往被缓存——这为利用此额外信息来减少准确评估新模型所需查询数量提供了潜在机会。在本文中,我们介绍了一种预测基准性能的方法,该方法利用缓存的模型响应,基于数据核视角空间(DKPS),一种在黑箱设置下量化模型之间关系的方法。理论上,我们证明了基于DKPS的方法在某些条件下是查询高效的。实证上,我们展示了基于DKPS的方法在查询预算大幅减少的情况下,能够达到与基线相同的平均绝对误差。最后,我们提出了一种离线方法,用于选择一组查询,以最大化参考模型上的拟合质量,从而在随机查询选择的基础上提高预测准确性。

英文摘要

Evaluating a new model on an existing benchmark is often necessary to understand its behavior before deployment. For modern evaluation frameworks, generating and evaluating a response for all queries can be prohibitively expensive. In practice, responses from previously-evaluated models are often cached -- creating a potential opportunity to use this additional information to decrease the number of queries required to accurately evaluate a new model. In this paper, we introduce an approach for predicting benchmark performance that leverages cached model responses based on the Data Kernel Perspective Space (DKPS), a method for quantifying the relationship between models in the black-box setting. Theoretically, we show that DKPS-based methods are query-efficient under certain conditions. Empirically, we demonstrate that DKPS-based methods achieve the same mean absolute error as baselines with a substantially decreased query budget. We conclude by proposing an offline method for selecting a set of queries that maximizes the goodness-of-fit on reference models, improving prediction accuracy over random query selection.

2604.26634 2026-06-05 cs.LG econ.GN q-fin.EC stat.AP

Electricity price forecasting across Norway's five bidding zones in the post-crisis era

在危机后时代跨挪威五个竞价区的电力价格预测

My Thi Diem Phan, Trung Tuyen Truong, Hoai Phuong Ha, Dat Thanh Nguyen

AI总结 本文研究了挪威五个竞价区在能源危机后电力价格预测的问题,通过构建多模态数据集并评估了八种预测模型,发现LightGBM在所有区域表现最佳,同时强调了外部特征在不同市场状况下的重要性。

详情
Comments
This version removes variables unavailable at prediction time to eliminate look-ahead leakage, clarifies the forecasting task definition, and updates the results and discussion accordingly. All tables and figures have been recomputed
AI中文摘要

挪威的电力市场长期以来由水电主导,但2021-2022年的能源危机和与欧洲大陆的更强整合已从根本上改变了价格形成机制,降低了基于历史数据校准的预测模型的可靠性。尽管需要更新的模型,但缺乏一个统一的基准来评估所有结构各异的挪威竞价区的特征贡献。本文提出了对Nord Pool市场在所有五个挪威竞价区的一步预测的全面评估。我们构建了一个覆盖2019-2025年的多模态小时数据集,并使用严格因果测试集评估了八种预测模型家族,包括Light Gradient Boosting Machine(LightGBM)、带有外生变量的自回归模型和先进的深度学习架构。我们实现了稳健的滚动起源回测、留一组法特征消融和条件制度分析来分解模型性能和特征效用。我们的结果表明,LightGBM在每个区域都表现最佳,平均绝对误差范围为1.60至5.58欧元每兆瓦时,而一个带有外生变量的岭正则化自回归模型在北部区域仍然是一个高度有竞争力的线性基准。特征消融揭示了仅依赖滞后价格和日历变量的模型能够获得高精度,通常与完整的多模态模型的性能相匹配或接近。然而,条件制度分析显示,外部特征如水库水位和天然气价格在分层预测误差方面至关重要,这些误差在压力市场制度下持续增加。这突显了模型可解释性和制度意识在决策者面对市场动态结构性变化时的实用价值。

英文摘要

Norway's electricity market is heavily dominated by hydropower, but the 2021-2022 energy crisis and stronger integration with Continental Europe have fundamentally altered price formation, reducing the reliability of forecasting models calibrated on historical data. Despite the critical need for updated models, a unified benchmark evaluating feature contributions across all structurally diverse Norwegian bidding zones remains lacking. Here we present a comprehensive evaluation of one-step-ahead forecasting of the Nord Pool market across all five Norwegian bidding zones. We constructed a multimodal hourly dataset spanning 2019-2025 and evaluated eight forecasting model families, including Light Gradient Boosting Machine (LightGBM), autoregressive models with exogenous variables, and advanced deep learning architectures, using a strictly causal test set. We implemented robust rolling-origin backtesting, leave-one-group-out feature ablation, and conditional regime analysis to dissect model performance and feature utility. Our results show that LightGBM achieves the best performance in every zone, with mean absolute error ranging from 1.60 to 5.58 euros per megawatt-hour, while a ridge-regularized autoregressive model with exogenous variables remains a highly competitive linear benchmark in northern zones. Feature ablation reveals that models relying solely on lagged prices and calendar variables achieve high accuracy and often match or closely approach the performance of the full multimodal model. However, conditional regime analysis demonstrates that external features like reservoir levels and gas prices remain crucial to stratify forecast errors, which consistently increase under stressed market regimes. This highlights the practical value of model interpretability and regime awareness for decision makers facing structural changes in market dynamics.

2601.13150 2026-06-05 stat.ME

Propensity Score Propagation: A General Framework for Design-Based Inference with Unknown Propensity Scores

倾向分数传播:一种用于未知倾向分数的设计基推断一般框架

Siyu Heng, Yanxin Shen, Zijian Guo

AI总结 本文提出了一种新的设计基推断框架,即倾向分数传播,用于处理未知倾向分数的情况。该框架通过再生与联合过程将倾向分数估计的不确定性传播到下游推断中,无需假设超总体结果分布。该方法兼容参数和非参数倾向分数模型,并能与现有基于已知倾向分数的设计基方法无缝集成,适用于广泛的设计基推断问题。理论结果和模拟研究显示,该框架在名义覆盖方面表现良好,即使现有方法存在显著的覆盖不足。

详情
AI中文摘要

设计基推断,也称为随机化基或有限总体推断,提供了一个原理性的框架,通过将随机性仅归因于设计机制(例如处理分配、调查抽样或缺失性),而无需对结果数据施加超总体分布或建模假设,从而实现可信的统计推断。从Fisher和Neyman的奠基性工作到最近设计基推断的复兴,这一观点在因果推断、调查抽样和缺失数据分析中发挥了核心作用。然而,一个基本障碍限制了其在许多现代应用中的使用:现有的设计基推断理论通常依赖于已知的倾向分数(即已知的设计概率),而倾向分数在观察性研究、现实世界调查设置和缺失数据问题中通常是未知的。本文提出倾向分数传播,一种用于未知倾向分数的有效设计基推断一般框架。该框架引入了再生与联合过程,将倾向分数估计的不确定性传播到下游的设计基推断中,而无需假设超总体结果假设。它兼容参数和非参数倾向分数模型,无缝集成现有基于已知倾向分数的设计基方法,并广泛应用于各种设计基推断问题。理论结果和模拟研究显示,所提出的框架实现了名义覆盖,即使现有方法表现出显著的覆盖不足。

英文摘要

Design-based inference, also known as randomization-based or finite-population inference, provides a principled framework for trustworthy statistical inference by attributing randomness solely to the design mechanism (e.g., treatment assignment, survey sampling, or missingness), without imposing super-population distributional or modeling assumptions on outcome data. From Fisher's and Neyman's seminal work to the recent resurgence of design-based inference, this perspective has played a central role in causal inference, survey sampling, and missing data analysis. However, a fundamental obstacle has limited its use in many modern applications: existing design-based inference theory typically relies on known propensity scores (i.e., known design probabilities), whereas propensity scores are usually unknown in observational studies, real-world survey settings, and missing data problems. We propose propensity score propagation, a general framework for valid design-based inference with unknown propensity scores. The framework introduces a regeneration-and-union procedure that propagates uncertainty from propensity score estimation into downstream design-based inference without imposing super-population outcome assumptions. It accommodates both parametric and nonparametric propensity score models, integrates seamlessly with existing design-based methods developed under known propensity scores, and applies broadly across design-based inference problems. Theoretical results and simulation studies show that the proposed framework achieves nominal coverage, even when existing approaches exhibit substantial under-coverage.

2601.16821 2026-06-05 stat.ME q-fin.ST stat.AP

Directional-Shift Dirichlet ARMA Models for Compositional Time Series with Structural Break Intervention

方向性位移狄利克雷ARMA模型用于具有结构性断裂干预的组成时间序列

Harrison Katz

AI总结 本文提出了一种基于方向性位移干预机制的贝叶斯狄利克雷ARMA模型,用于处理具有结构性断裂的组成时间序列,通过三个可解释参数捕捉结构性断裂,并在不同场景下验证了模型的鲁棒性和预测性能。

详情
AI中文摘要

组成时间序列经常由于外部冲击、政策变化或市场中断而出现结构性断裂。标准方法要么忽略这些断裂,要么通过固定效应或阶梯函数哑变量来处理,但这些方法无法超出样本范围进行外推或强制即时调整。我们开发了一种贝叶斯狄利克雷ARMA模型,结合了方向性位移干预机制,通过三个可解释参数捕捉结构性断裂:方向向量指定哪些成分增减份额,幅度控制再分配幅度,逻辑门控制转换时间和速度。该模型通过构造保持组成约束,维持DARMA动态以捕捉短期依赖性,并通过结构性断裂前后产生一致的概率预测。干预轨迹对应于简单形上的测地运动,并且不依赖于ILR基底的选择。通过400次拟合和8种场景的模拟研究,当位移方向正确识别时,近零幅度偏差和名义80%可信区间覆盖率(77.5%的案例)得到验证。补充研究证实了在极端转换速度和非单调DGPs下的鲁棒性。两个实证应用分析了新冠时期Airbnb数据的表现,与更简单的替代方法相比,当断裂是单调且持续时,干预模型达到近名义校准(79.6%),而固定效应显著低估(66.1%)。当断裂后动态是非单调时,两种模型都可接受校准,但固定效应在点准确性上表现更好。因此,干预模型的优势特定于具有大致单调结构性过渡的设置。

英文摘要

Compositional time series frequently exhibit structural breaks due to external shocks, policy changes, or market disruptions. Standard methods either ignore such breaks or handle them through fixed effects that cannot extrapolate beyond the sample, or step-function dummies that impose instantaneous adjustment. We develop a Bayesian Dirichlet ARMA model augmented with a directional-shift intervention mechanism that captures structural breaks through three interpretable parameters: a direction vector specifying which components gain or lose share, an amplitude controlling redistribution magnitude, and a logistic gate governing transition timing and speed. The model preserves compositional constraints by construction, maintains DARMA dynamics for short-run dependence, and produces coherent probabilistic forecasts through and after structural breaks. The intervention trajectory corresponds to geodesic motion on the simplex and is invariant to the choice of ILR basis. A simulation study with 400 fits across 8 scenarios shows near-zero amplitude bias and nominal 80\% credible interval coverage when the shift direction is correctly identified (77.5\% of cases); supplementary studies confirm robustness across extreme transition speeds and non-monotone DGPs. Two empirical applications to COVID-era Airbnb data characterize performance relative to simpler alternatives. Where the break is monotone and ongoing, the intervention model achieves near-nominal calibration (79.6\%) while the fixed effect substantially under-covers (66.1\%). Where post-break dynamics are non-monotone, both models are acceptably calibrated and the fixed effect outperforms on point accuracy. The intervention model's advantages are thus specific to settings with roughly monotone structural transitions.

2410.06326 2026-06-05 stat.ME stat.ML

Convex Estimation of Gaussian Graphical Regression Models with Covariates

具有协变量的高斯图回归模型的凸估计

Ruobin Liu, Guo Yu

AI总结 本文提出了一种凸框架,用于同时估计协变量调整后的均值和精度矩阵,通过多元高斯似然的自然参数化,改进了高维情况下理论保证。

详情
AI中文摘要

高斯图模型(GGMs)被广泛用于恢复随机变量之间的条件独立结构。最近的研究试图将辅助协变量纳入其中以提高估计精度,特别是在共表达数量性状位点(eQTL)研究中,基因表达水平及其条件依赖结构可能受遗传变异影响。现有方法在调整协变量的GGMs中要么限制协变量效应到均值结构,要么在联合估计均值和精度矩阵时导致非凸形式。本文提出了一种凸框架,通过多元高斯似然的自然参数化同时估计协变量调整后的均值和精度矩阵。所得形式允许联合凸优化,并在高维缩放下提供改进的理论保证,其中稀疏性和协变量的维度随样本量增长。我们通过数值模拟支持理论发现,并通过胶质瘤多形体eQTL研究的重新分析和饮食对人类肠道微生物群分析,展示了所提方法的实用价值。

英文摘要

Gaussian graphical models (GGMs) are widely used to recover the conditional independence structure among random variables. Recent work has sought to incorporate auxiliary covariates to improve estimation, particularly in applications such as co-expression quantitative trait locus (eQTL) studies, where both gene expression levels and their conditional dependence structure may be influenced by genetic variants. Existing approaches to covariate-adjusted GGMs either restrict covariate effects to the mean structure or lead to nonconvex formulations when jointly estimating the mean and precision matrix. In this paper, we propose a convex framework that simultaneously estimates the covariate-adjusted mean and precision matrix via a natural parametrization of the multivariate Gaussian likelihood. The resulting formulation enables joint convex optimization and yields improved theoretical guarantees under high-dimensional scaling, where the sparsity and dimension of covariates grow with the sample size. We support our theoretical findings with numerical simulations and demonstrate the practical utility of the proposed method through a reanalysis of an eQTL study of glioblastoma multiforme and an analysis of diet on the human gut microbiome.

2601.16195 2026-06-05 physics.chem-ph stat.ML

Pushing the limits of unconstrained machine-learned interatomic potentials

突破无约束机器学习原子势能的极限

Filippo Bigi, Paolo Pegolo, Arslan Mazitov, Jonathan Schmidt, Michele Ceriotti

AI总结 本文研究了无约束机器学习原子势能(MLIPs)在大规模参数和训练样本下的行为,发现当训练于大数据集时,无约束模型在准确性和速度上优于物理约束模型,并在实际应用中表现出色。

详情
Comments
21 pages, 8 figures
AI中文摘要

机器学习原子势能(MLIPs)越来越多地被用来替代计算强度大的电子结构计算,以在原子尺度上建模物质。最常用的模型架构被限制以精确满足一些物理定律,从几何对称性到能量守恒。越来越多的证据表明,放松一些这些约束可以提高MLIPs的效率和(有些令人惊讶的)准确性,尽管必须小心避免因破坏物理对称性而引起的定性失败。鉴于最近的趋势是将模型扩展到更大的参数数量和训练样本数量,一个非常重要的问题是无约束MLIPs在这种极限下如何表现。在这里,我们研究了这个问题,显示当在大数据集上训练时,无约束模型在准确性和速度上均优于物理约束模型。我们从基准准确性和实际应用中的可用性两个方面评估了这些模型,重点是静态模拟工作流,如几何优化和晶格动力学。我们得出结论,准确的无约束模型可以放心使用,特别是因为简单的推断时间修改可以用来恢复与相关物理对称性一致的可观测量。

英文摘要

Machine-learned interatomic potentials (MLIPs) are increasingly used to replace computationally demanding electronic-structure calculations to model matter at the atomic scale. The most commonly used model architectures are constrained to fulfill a number of physical laws exactly, from geometric symmetries to energy conservation. Evidence is mounting that relaxing some of these constraints can be beneficial to the efficiency and (somewhat surprisingly) accuracy of MLIPs, even though care should be taken to avoid qualitative failures associated with the breaking of physical symmetries. Given the recent trend of scaling up models to larger numbers of parameters and training samples, a very important question is how unconstrained MLIPs behave in this limit. Here we investigate this issue, showing that -- when trained on large datasets -- unconstrained models can be superior in accuracy and speed when compared to physically constrained models. We assess these models both in terms of benchmark accuracy and in terms of usability in practical scenarios, focusing on static simulation workflows such as geometry optimization and lattice dynamics. We conclude that accurate unconstrained models can be applied with confidence, especially since simple inference-time modifications can be used to recover observables that are consistent with the relevant physical symmetries.

2505.11006 2026-06-05 stat.ML cs.LG

Is Supervised Learning Really That Different from Unsupervised?

监督学习真的和无监督学习有那么大的区别吗?

Oskar Allerbo, Thomas B. Schön

AI总结 该研究通过将监督学习分解为两阶段过程,证明在不访问标签数据的情况下选择模型参数和添加输出,可以实现与传统监督学习相似的性能,表明监督与无监督学习的区别可能不如表面看起来那么根本。

详情
Comments
Paper accepted at AISTATS 2026
AI中文摘要

我们展示了监督学习如何分解为一个两阶段过程,其中(1)所有模型参数以无监督的方式选择,(2)输出y被添加到模型中,而无需改变参数值。这通过一种新的模型选择标准实现,与交叉验证不同,该标准可以在不访问y的情况下使用。对于线性岭回归,我们界定了我们方法的渐近外样本风险,以最优渐近风险为基准。我们还证明了在不访问y的情况下训练的线性和核岭回归、平滑样条、k近邻、随机森林和神经网络,其性能与基于y的传统方法相似。因此,我们的结果表明,监督学习和无监督学习之间的区别可能不如表面看起来那么根本。

英文摘要

We demonstrate how supervised learning can be decomposed into a two-stage procedure, where (1) all model parameters are selected in an unsupervised manner, and (2) the outputs y are added to the model, without changing the parameter values. This is achieved by a new model selection criterion that - in contrast to cross-validation - can be used also without access to y. For linear ridge regression, we bound the asymptotic out-of-sample risk of our method in terms of the optimal asymptotic risk. We also demonstrate that versions of linear and kernel ridge regression, smoothing splines, k-nearest neighbors, random forests, and neural networks, trained without access to y, perform similarly to their standard y-based counterparts. Hence, our results suggest that the difference between supervised and unsupervised learning is less fundamental than it may appear.

2511.15427 2026-06-05 econ.EM stat.ME

Tractable Estimation of Nonlinear Panels with Interactive Fixed Effects

可计算的非线性面板模型交互固定效应估计

Andrei Zeleneev, Weisheng Zhang

AI总结 本文提出了一种计算高效的方法,用于估计非线性面板模型中的交互固定效应,该方法在理论上等价于Chen等人(2021)提出的估计器,避免了高维非凸优化问题,适用于大规模非线性面板数据。

详情
AI中文摘要

在线性面板模型中,交互固定效应通常被控制。尽管文献中已存在非线性模型的类似固定效应(FE)估计器(Chen, Fernandez-Val和Weidner, 2021),但其在应用研究中使用有限,因为其实施涉及解决高维非凸问题。本文通过提供一个计算效率高的新估计器来补充Chen等人(2021)的理论分析,该估计器在渐近上等价于他们的估计器。与之前提出的FE估计器不同,我们的估计器避免了解高维非凸优化问题,并且可以在大规模非线性面板中可行计算。我们提出的方法包括两个步骤。第一步是使用核范数正则化(NNR)凸化优化问题,获得参数的初步估计,包括固定效应。然后,我们使用标准梯度下降法在这些初步估计上找到原始优化问题的全局解。为了使我们的方法在实践中易于应用,我们还提出了特定的数值算法来解决涉及的优化问题,建立了其收敛性,并在我们的R包NNRPanel中提供了高效的实现。

英文摘要

Interactive fixed effects are routinely controlled for in linear panel models. While an analogous fixed effects (FE) estimator for nonlinear models has been available in the literature (Chen, Fernandez-Val and Weidner, 2021), it sees much more limited use in applied research because its implementation involves solving a high-dimensional non-convex problem. In this paper, we complement the theoretical analysis of Chen, Fernandez-Val and Weidner (2021) by providing a new computationally efficient estimator that is asymptotically equivalent to their estimator. Unlike the previously proposed FE estimator, our estimator avoids solving a high-dimensional non-convex optimization problem and can be feasibly computed in large nonlinear panels. Our proposed method involves two steps. In the first step, we convexify the optimization problem using nuclear norm regularization (NNR) and obtain preliminary NNR estimators of the parameters, including the fixed effects. Then, we find the global solution of the original optimization problem using a standard gradient descent method initialized at these preliminary estimates. To make our method readily applicable in practice, we also propose specific numerical algorithms for solving the involved optimization problems, establish their convergence, and provide their efficient implementation in our R package NNRPanel.

2603.22950 2026-06-05 stat.AP

Higher-Order Multivariate Environmental Influences in Structural Health Monitoring

结构健康监测中的高阶多变量环境影响

Lizzie Neumann, Philipp Wittenberg, Jan Gertheiss

AI总结 本文研究了环境条件对结构健康监测输出统计量的影响,提出两种方法(随机森林和核方法)来识别和量化多变量干扰效应,发现核方法更准确但随机森林更稳健且易于解释。

详情
AI中文摘要

系统输出如本征频率或应变数据,常用于结构健康监测(SHM),不仅对损伤反应,还依赖于环境条件。在尝试纠正这些混杂效应时,通常(至少隐含地)假设只有预期值,即均值输出受环境条件影响。然而,对实际SHM数据的评估表明,环境条件可能不仅影响均值输出,还影响更高阶的统计矩,特别是输出量的方差以及它们之间的协方差和相关性,如不同模式的本征频率或不同位置的应变传感器。为了解决这些问题,我们讨论了两种方法来识别和量化对输出协方差和相关性的多变量混杂效应:随机森林和非参数、基于核的方法。我们在人工和实际SHM数据上比较了这两种竞争方法,发现基于核的方法具有更高的准确性,但随机森林产生的估计值更稳健,有时更容易解释。

英文摘要

System outputs such as eigenfrequencies or strain data, often used in structural health monitoring (SHM), not only react to damage but also depend on environmental conditions. When trying to correct for these confounding effects, it is often (at least implicitly) assumed that only the expected, i.e., mean, output values are affected by environmental conditions. However, the evaluation of real-world SHM data indicates that environmental conditions may influence not only the mean output but also higher-order statistical moments, particularly the variances of and the covariances and correlations between the output quantities, such as eigenfrequencies of different modes or strain sensors at different locations. To address these issues, we discuss two approaches for identifying and quantifying multivariate confounding effects on output covariances and correlations: a random forest and a nonparametric, kernel-based approach. We compare the two competing methods on both artificial and real-world SHM data, finding that the kernel-based approach achieves higher accuracy, but the random forest produces estimates that are more robust and sometimes easier to interpret.

2603.20980 2026-06-05 cs.LG cs.AI stat.AP stat.ML

From Causal Discovery to Dynamic Causal Inference in Neural Time Series

从因果发现到神经时间序列中的动态因果推断

Dmitry Zaytsev, Valentina Kuskova, Michael Coppedge

AI总结 提出动态因果网络自回归(DCNAR)两阶段框架,通过神经自回归因果发现学习稀疏有向因果网络,并将其作为结构先验用于时变神经网络自回归,实现无需预设网络结构的动态因果推断。

详情
Comments
11 pages, 2 figures
AI中文摘要

时变因果模型为研究动态科学系统提供了强大框架,然而大多数现有方法假设潜在因果网络是先验已知的——这一假设在现实领域中很少成立,因为在这些领域中因果结构是不确定的、演变的或仅能间接观测。这限制了动态因果推断在许多科学场景中的适用性。我们提出动态因果网络自回归(DCNAR),一个两阶段神经因果建模框架,将数据驱动的因果发现与时变因果推断相结合。在第一阶段,神经自回归因果发现模型从多变量时间序列中学习稀疏有向因果网络。在第二阶段,该学习到的结构被用作时变神经网络自回归的结构先验,从而无需预先指定网络结构即可实现因果影响的动态估计。我们使用评估因果必要性、时间稳定性和对结构变化敏感性的行为诊断来验证DCNAR的科学有效性,而不仅仅是预测准确性。在多国面板时间序列数据上的实验表明,即使预测性能相当,学习到的因果网络也比基于系数或无结构替代方法产生更稳定且行为上有意义的动态因果推断。这些结果将DCNAR定位为一个通用框架,用于在结构不确定性下将AI作为动态因果推理的科学工具。

英文摘要

Time-varying causal models provide a powerful framework for studying dynamic scientific systems, yet most existing approaches assume that the underlying causal network is known a priori - an assumption rarely satisfied in real-world domains where causal structure is uncertain, evolving, or only indirectly observable. This limits the applicability of dynamic causal inference in many scientific settings. We propose Dynamic Causal Network Autoregression (DCNAR), a two-stage neural causal modeling framework that integrates data-driven causal discovery with time-varying causal inference. In the first stage, a neural autoregressive causal discovery model learns a sparse directed causal network from multivariate time series. In the second stage, this learned structure is used as a structural prior for a time-varying neural network autoregression, enabling dynamic estimation of causal influence without requiring pre-specified network structure. We evaluate the scientific validity of DCNAR using behavioral diagnostics that assess causal necessity, temporal stability, and sensitivity to structural change, rather than predictive accuracy alone. Experiments on multi-country panel time-series data demonstrate that learned causal networks yield more stable and behaviorally meaningful dynamic causal inferences than coefficient-based or structure-free alternatives, even when forecasting performance is comparable. These results position DCNAR as a general framework for using AI as a scientific instrument for dynamic causal reasoning under structural uncertainty.

2603.17925 2026-06-05 stat.ME cs.LG math.ST stat.TH

Multi-Armed Sequential Hypothesis Testing by Betting

通过赌注进行多臂顺序假设检验

Ricardo J. Sandoval, Ian Waudby-Smith, Michael I. Jordan

AI总结 本文研究了通过赌注进行多臂顺序检验的问题,提出了一种在多个数据源(臂)中选择以获取数据的统计学家的变体,旨在拒绝全局空假设P(所有臂在某种意义上无效)并支持复合替代假设Q(至少有一个臂非空)。通过推广对数最优性和期望拒绝时间最优性的概念,得到了匹配的上下界,并提出了一个修改的上置信界算法来处理不可观测但足够可估计的奖励。

详情
AI中文摘要

我们考虑了一种通过赌注进行的顺序检验变体,其中在每个时间步,统计学家会面对多个数据源(臂)并选择其中一个以获取数据。我们考虑了一个复合全局空假设P,即所有臂在某种意义上(例如所有治疗剂量无效)都是空假设,并希望拒绝P以支持一个复合替代假设Q,其中至少有一个臂是非空的(例如存在有效的治疗剂量)。我们提出了一种最优性要求,即即使多个臂是非空的,我们寻求e-过程和顺序检验,其性能尽可能强,如同拥有 oracle 知识关于哪个臂生成最多反对P的证据。形式上,我们将对数最优性和期望拒绝时间最优性的概念推广到多个臂,得到两者匹配的上下界。在最优性分析中,一个关键技术设备是一个修改的上置信界算法,用于不可观测但足够“可估计”的奖励。在设计此算法时,我们推导了非渐近的集中不等式,用于最优财富增长率,即凯利[1956]的意义。这些可能具有独立的兴趣。

英文摘要

We consider a variant of sequential testing by betting where, at each time step, the statistician is presented with multiple data sources (arms) and obtains data by choosing one of the arms. We consider the composite global null hypothesis $\mathscr{P}$ that all arms are null in a certain sense (e.g. all dosages of a treatment are ineffective) and we are interested in rejecting $\mathscr{P}$ in favor of a composite alternative $\mathscr{Q}$ where at least one arm is non-null (e.g. there exists an effective treatment dosage). We posit an optimality desideratum that we describe informally as follows: even if several arms are non-null, we seek $e$-processes and sequential tests whose performance are as strong as the ones that have oracle knowledge about which arm generates the most evidence against $\mathscr{P}$. Formally, we generalize notions of log-optimality and expected rejection time optimality to more than one arm, obtaining matching lower and upper bounds for both. A key technical device in this optimality analysis is a modified upper-confidence-bound-like algorithm for unobservable but sufficiently "estimable" rewards. In the design of this algorithm, we derive nonasymptotic concentration inequalities for optimal wealth growth rates in the sense of Kelly [1956]. These may be of independent interest.

2509.11381 2026-06-05 math.ST econ.EM stat.ME stat.ML stat.TH

Accuracy Limits of Causal Trees for Individualized Treatment Effects

因果树在个体化治疗效果中的精度极限

Matias D. Cattaneo, Jason M. Klusowski, Ruiqi Rae Yu

AI总结 本文研究了基于自适应递归划分的因果树估计器,推导了其估计精度的下界,指出即使在随机分配的常数效应基准下,标准CART型划分规则构建的因果树在样本空间上的一致误差仍可能以比样本规模的任何幂更慢的速度下降,且样本划分无法消除这一限制。

详情
AI中文摘要

递归决策树被广泛用于估计实验和观察研究中的异质因果治疗效应。这些方法通常使用CART型递归划分实现,其划分标准旨在识别在协变量定义的子组中治疗效应的变化。我们研究了基于自适应递归划分的因果树估计器,并建立了其估计精度的下界。我们分析的类别包括基于常见治疗效应和平方误差划分标准的版本,有和无样本划分。即使在常数效应基准下,使用标准CART型划分规则构建的因果树的一致误差仍可能比样本规模的任何幂更慢地下降。其根本机制是贪心递归划分选择具有非消失概率的高度不平衡划分,产生包含非常少观测的终端节点,导致估计方差较大。我们进一步表明,样本划分(通常称为“诚实”)无法消除这一限制。因此,因果树估计器可能在协变量空间上以任意慢的速度收敛。同时,这些估计器可以具有小的积分均方误差,表明平均准确性可以掩盖局部不准确性。我们的结果也澄清了现有因果森林及相关集成方法的理论保证中平衡划分假设的作用。

英文摘要

Recursive decision trees are widely used to estimate heterogeneous causal treatment effects in experimental and observational studies. These methods are typically implemented using CART-type recursive partitioning, with splitting criteria designed to identify variation in treatment effects across covariate-defined subgroups. We study causal tree estimators based on adaptive recursive partitioning and establish lower bounds on their estimation accuracy. The class we analyze includes versions with and without sample splitting, based on common treatment effect and squared-error splitting criteria. Even in a constant-effect benchmark with randomized treatment assignment, causal trees constructed via standard CART-type splitting rules can have uniform-norm errors that decrease more slowly than any power of the sample size. The underlying mechanism is that greedy recursive partitioning selects highly imbalanced splits with nonvanishing probability, producing terminal nodes containing very few observations and leading to large estimation variance. We further show that sample splitting, often called ``honesty,'' does not remove this limitation. As a consequence, causal tree estimators may converge arbitrarily slowly uniformly over the covariate space. At the same time, these estimators can have small integrated mean squared error, showing that average accuracy can mask local inaccuracy. Our results also clarify the role of balanced partition assumptions in existing theoretical guarantees for causal forests and related ensemble methods.

2506.23213 2026-06-05 math.ST eess.SP stat.TH

Nuisance parameters and elliptically symmetric distributions: a geometric approach to parametric and semiparametric efficiency

干扰参数与椭圆对称分布:一种几何方法用于参数和半参数效率

Stefano Fortunati, Jean-Pierre Delmas, Esa Ollila

AI总结 本文研究了在存在有限维和无限维干扰参数的情况下,参数估计的统计效率之间的深刻且反直觉的联系,通过几何方法提出了一种新的计算投影算子的方法,适用于椭圆分布模型,并扩展到圆和非圆复椭圆对称分布。

详情
AI中文摘要

椭圆对称分布是半参数模型的经典例子,其中位置向量和散射矩阵(或其参数化)是两个有限维参数,而密度生成器代表一个无限维的干扰项。这种基本的椭圆模型可以通过考虑额外的有限维干扰参数使其更加准确、丰富和灵活。我们的目标是研究在存在有限和无限维干扰参数的情况下,参数估计的统计效率之间的深刻且反直觉的联系。先前的开创性工作通过利用一个一般结果来解决这个问题:如果统计模型具有特定的群不变性,则投影算子可以近似表示为最大不变子σ代数上的条件期望。在本文中,我们表明,对于椭圆分布的统计模型,投影算子可以显式计算,而无需依赖上述近似。这使我们能够获得原始结果,即使在位置向量和散射矩阵由一个有限维向量参数化的情况下,该向量可以分为两个子向量:一个包含感兴趣的参数,另一个包含干扰参数。作为示例,我们展示了如何将所得结果应用于著名的低秩参数化。此外,虽然理论分析将针对实椭圆对称(RES)分布进行开发,但我们将展示如何将我们的结果扩展到圆和非圆复椭圆对称(C-CES和NC-CES)分布的情况。

英文摘要

Elliptically symmetric distributions are a classic example of a semiparametric model where the location vector and the scatter matrix (or a parameterization of them) are the two finite-dimensional parameters of interest, while the density generator represents an \textit{infinite-dimensional nuisance} term. This basic representation of the elliptic model can be made more accurate, rich, and flexible by considering additional \textit{finite-dimensional nuisance} parameters. Our aim is therefore to investigate the deep and counter-intuitive links between statistical efficiency in estimating the parameters of interest in the presence of both finite and infinite-dimensional nuisance parameters. Previous seminal works have addressed this problem by leveraging a general result: if the statistical model has a specific group invariance, then the projection operator onto the semiparametric nuisance tangent space can be asymptotically expressed as a conditional expectation with respect to the maximal invariant sub-$σ$ algebra. In this article, we show that, for the statistical model of elliptical distributions, the projection operator can be explicitly computed without relying on the above-mentioned asymptotic approximation. This allows us to obtain original results also for the case in which the location vector and the scatter matrix are parameterized by a finite-dimensional vector that can be partitioned in two sub-vectors: one containing the parameters of interest and the other containing the nuisance parameters. As an example, we illustrate how the obtained results can be applied to the well-known \virg{low-rank} parameterization. Furthermore, while the theoretical analysis will be developed for Real Elliptically Symmetric (RES) distributions, we show how to extend our results to the case of Circular and Non-Circular Complex Elliptically Symmetric (C-CES and NC-CES) distributions.

2603.14169 2026-06-05 stat.ME cs.AI

Beyond Means: Topological Causal Effects under Persistent-Homology Ignorability

超越均值:基于持久同调的因果效应

Amir Saki, Usef Faghihi

AI总结 本文提出基于持久同调的因果框架,以解决均值基于因果估计在处理结局分布形状变化时的局限性,通过定义拓扑学的CATE和ATE,并证明其在近似拓扑可忽略性下的可识别性。

详情
AI中文摘要

平均处理效应(ATE)和条件平均处理效应(CATE)是因果估计的核心,但它们仅关注预期结果的变化,可能忽略处理引起的结局分布形状变化。当对照组结果单峰,处理组结果双峰且均值相同,均值基于的因果估计会失效。本文基于持久同调发展了因果框架,提出了持久同调可忽略性条件,定义了拓扑学的CATE和ATE,并证明这些估计量在近似拓扑可忽略性下可识别。同时指出,边际持久图效应不能仅通过条件拓扑可忽略性确定,因为持久同调通常不与协变量混合交换。为保持原意并确保科学正确性,本文保留边际效应作为动机量,但将数学上稳健的条件估计量置于理论中心。合成实验显示,均值基于的因果估计仍接近零,而所提拓扑效应显著增加并在调整混杂后可恢复。

英文摘要

Average treatment effects (ATE) and conditional average treatment effects (CATE) are foundational causal estimands, but they target changes in expected outcomes and can miss treatment-induced changes in the shape of outcome distributions. A canonical failure mode occurs when control outcomes are unimodal, treated outcomes become bimodal, and both distributions have the same mean. In such cases mean-based causal estimands are zero even though the geometry and topology of the outcome law change substantially. This paper develops a topological causal framework based on persistent homology. We formalize a persistent-homology ignorability condition, define topological analogues of CATE and ATE, and prove that these estimands are identifiable up to an explicit error bound under approximate topological ignorability. We also clarify a subtle but important point: a marginal persistence-diagram effect is not identified from conditional topological ignorability alone because persistent homology does not in general commute with mixtures over covariates. To preserve the original intuition while ensuring scientific correctness, we retain the marginal effect as a motivating quantity, but place the mathematically sound conditional estimands at the center of the theory. A synthetic experiment with mean-preserving topology change shows that mean-based causal estimands remain near zero while the proposed topological effect increases sharply and remains recoverable after adjustment for confounding.

2603.12427 2026-06-05 stat.ME

Variational Bayes and Truncation approximations for Enriched Dirichlet process mixtures

变分贝叶斯与截断近似方法用于增强狄利克雷过程混合模型

Somnath Bhadra, Michael J. Daniels

AI总结 本文提出了一种用于增强狄利克雷过程混合模型(EDPM)的变分贝叶斯估计器,通过改进的截断近似方法提高效率,并展示了其在Nimble中实现阻塞吉布斯采样器的应用,通过模拟和实际数据验证了方法的有效性。

详情
AI中文摘要

在进行贝叶斯非参数模型推断时,复杂MCMC算法和大数据集的计算时间是一个常见障碍。本文针对增强狄利克雷过程混合模型(EDPM)提出了解决方案。我们基于之前开发的EDPM截断近似方法推导出变分贝叶斯估计器。该估计器可以以两种方式使用:1)开发更高效的截断近似方法;2)作为基于该更高效截断近似方法的阻塞吉布斯采样器或Polya尿 sampler的初始值。我们推导了该更高效截断近似的准确性,并展示了如何通过简单的实现方式在Nimble中实现EDPM的阻塞吉布斯采样器。我们通过模拟验证了近似的有效性,并在实际数据集上进行了说明。

英文摘要

A common impediment in conducting inference for Bayesian nonparametric models is either the need for complex MCMC algorithms and/or computational run-time for large datasets. We propose solutions here for Enriched Dirichlet process mixtures (EDPM). We derive a variational Bayes estimator based on a previously developed truncation approximation for EDPMs. The variational Bayes estimator can be used in two ways: 1) to develop a more efficient truncation approximation; 2) as good initial values for a blocked Gibbs sampler based on this more efficient truncation approximation or for a polya urn sampler. We derive the accuracy of this more efficient truncation approximation and demonstrate how this allows for simple implementation of a blocked Gibbs Sampler EDPMs in Nimble. We confirm the validity of the approximations by simulations and illustrate on a real data set.

2603.11319 2026-06-05 cs.LG stat.ML

On the Robustness of Langevin Dynamics to Score Function Error

关于对数动力学对分数函数误差的鲁棒性

Daniel Yiming Cao, August Y. Chen, Karthik Sridharan, Yuchen Wu

AI总结 本文研究了基于分数函数的生成模型对分数函数估计误差的鲁棒性,发现对数动力学在L2误差(更一般地Lp误差)下并不鲁棒,即使在高维简单分布中,即使分数函数估计误差非常小,对数动力学在多项式时间内运行也会导致与目标分布的总变差距离很大,这进一步支持了扩散模型优于对数动力学。

详情
Comments
ICML 2026
AI中文摘要

我们考虑了基于分数函数的生成模型对分数函数估计误差的鲁棒性。特别是,我们证明了对数动力学对分数函数估计的L2误差(更一般地Lp误差)不具有鲁棒性。已知在分数函数估计的L2误差较小的情况下,扩散模型可以在多项式时间内忠实采样目标分布,只要满足一定的正则性假设。相比之下,我们的工作表明,即使对于高维简单分布,对数动力学在任何多项式时间内运行都会产生与目标分布在总变差(TV)距离远的分布,即使分数函数估计的L2误差(更一般地Lp误差)可以任意小。考虑到在实践中从数据学习分数函数时,分数函数估计误差是不可避免的,我们的结果进一步支持扩散模型优于对数动力学,并警示不要使用估计的分数函数进行对数动力学采样。

英文摘要

We consider the robustness of score-based generative modeling to errors in the estimate of the score function. In particular, we show that Langevin dynamics is not robust to the $L^2$ errors (more generally $L^p$ errors) in the estimate of the score function. It is well-established that with small $L^2$ errors in the estimate of the score function, diffusion models can sample faithfully from the target distribution under fairly mild regularity assumptions in a polynomial time horizon. In contrast, our work shows that even for simple distributions in high dimensions, Langevin dynamics run for any polynomial time horizon will produce a distribution far from the target distribution in Total Variation (TV) distance, even when the $L^2$ error (more generally $L^p$) of the estimate of the score function is arbitrarily small. Considering such an error in the estimate of the score function is unavoidable in practice when learning the score function from data, our results provide further justification for diffusion models over Langevin dynamics and serve to caution against the use of Langevin dynamics with estimated scores.

2603.02714 2026-06-05 math.PR cs.IT math.IT math.ST stat.TH

Gaussian Width of Convex Sets via Integral Decompositions, Projections, and the Distribution of Intrinsic Volumes

通过积分分解、投影和内在体积分布的凸集高斯宽度

Reese Pathak, Nikita Zhivotovskiy

AI总结 本文研究了通过积分分解、投影和内在体积分布来确定凸集的高斯宽度,并开发了两种基于索引集几何的分解方法,利用最近的几何分析和高斯过程工作,将高斯宽度与局部度量结构和内在体积联系起来,最终证明宽度由内在体积的'峰值指数'控制。

详情
Comments
54 pages
AI中文摘要

我们重新审视了通过一个凸集$T \subset \mathbf{R}^d$索引的典型高斯过程的期望上确界界限问题。我们开发了两种高斯宽度的分解方法,基于索引集的几何结构。第一种分解涉及高斯向量在缩放后的$T$副本上的度量投影。第二种分解涉及从二次惩罚的局部宽度变体中出现的固定点。这两种分解都不直接调用通用链构造。我们的结果利用了最近在几何分析和高斯过程中的工作。Chatterjee [Ann. Statist., 2014] 的工作通过涉及局部高斯宽度的变分问题,表征了高斯随机向量在缩放后的$T$副本上的度量投影行为。我们使用这些界限来开发利用$T$的局部度量结构的高斯宽度分解。其次,我们利用Vitale [Ann. Probab., 1996] 的工作,将Wills函数(以及$T$的内在体积)与我们分解中出现的第一项联系起来。最后,引用Mourtada [J. Eur. Math. Soc., 2025] 关于Wills函数对数的最新工作,我们证明宽度由内在体积的'峰值指数'控制。在最坏情况下,我们的界限恢复了经典的Dudley积分的局部形式。

英文摘要

We revisit the problem of bounding the expected supremum of a canonical Gaussian process indexed by a convex set $T \subset \mathbf{R}^d$. We develop two decompositions for the Gaussian width, based on the geometry of the index set. The first decomposition involves metric projections of Gaussians onto rescaled copies of $T$. The second involves fixed points arising from a quadratically penalized variant of the local width. Neither decomposition directly invokes generic chaining constructions. Our results make use of recent work in geometric analysis and Gaussian processes. The work of Chatterjee [Ann. Statist., 2014] characterizes the behavior of the metric projection of a Gaussian random vector onto rescaled copies of $T$ with a variational problem involving localized Gaussian widths. We use these bounds to develop decompositions of the Gaussian width using the local metric structure of $T$. Second, we leverage the work of Vitale [Ann. Probab., 1996] to form a connection between the Wills functional (and hence the intrinsic volumes of $T$) and the first terms that appear in our decompositions. Finally, invoking recent work by Mourtada [J. Eur. Math. Soc., 2025] on the logarithm of the Wills functional, we show that the width is controlled by a single, ''peak index'' of the intrinsic volumes. In the worst case, our bound recovers a local form of the classical Dudley integral.

2602.24207 2026-06-05 cs.LG cs.CY cs.GT stat.ML

The Stability of Online Algorithms in Performative Prediction

在线算法在表现性预测中的稳定性

Gabriele Farina, Juan Carlos Perdomo

AI总结 本文研究了在线算法在表现性预测中的稳定性问题,证明了任何在表现性设置中使用的无遗憾算法都会收敛到一种表现性稳定的均衡状态,该状态中模型主动塑造数据分布,使得其预测在事后看来是最优的。该研究避免了对模型如何影响分布的假设,并揭示了常见算法如梯度下降为何能自然稳定化并防止 runaway 反馈循环。

详情
AI中文摘要

使用算法预测进行决策会导致反馈循环,其中我们部署的模型主动影响我们看到的数据分布,以及后来用于重新训练的数据分布。这种动态由Perdomo等人在表现性预测工作中正式化。我们的主要结果是一个无条件的减少,表明任何在表现性设置中使用的无遗憾算法都会收敛到一个(混合)表现性稳定的均衡:一种解决方案,其中模型以使它们的预测在事后看来最优的方式塑造数据分布。在我们之前的工作之前,该领域所有积极结果都对模型如何影响分布施加了强限制。通过使用鞅论据并允许随机化,我们避免了对人口如何响应预测的任何假设,并绕过了最近的硬度结果,表明确定性稳定的模型通常在PPAD难度上是难以计算的。最后,从概念上讲,我们的连接揭示了常见算法如梯度下降为何自然稳定化并防止 runaway 反馈循环。我们希望我们的工作能促进未来在线优化和表现性之间的技术转移。

英文摘要

The use of algorithmic predictions in decision-making leads to a feedback loop where the models we deploy actively influence the data distributions we see, and later use to retrain on. This dynamic was formalized by Perdomo et al. 2020 in their work on performative prediction. Our main result is an unconditional reduction showing that any no-regret algorithm deployed in performative settings converges to a (mixed) performatively stable equilibrium: a solution in which models actively shape data distributions in ways that their own predictions look optimal in hindsight. Prior to our work, all positive results in this area imposed strong restrictions on how models influenced distributions. By using a martingale argument and allowing randomization, we avoid any assumption on how populations respond to predictions and sidestep recent hardness results showing that deterministic stable models are in general PPAD-hard to compute. Lastly, on a more conceptual note, our connection sheds light on why common algorithms, like gradient descent, are naturally stabilizing and prevent runaway feedback loops. We hope our work enables future technical transfer of ideas between online optimization and performativity.

2509.24882 2026-06-05 cs.LG cond-mat.dis-nn cs.AI stat.ML

Scaling Laws and Spectra of Shallow Neural Networks in the Feature Learning Regime

浅层神经网络在特征学习 regime 中的缩放定律与谱特性

Leonardo Defilippis, Yizhou Xu, Julius Girardin, Emanuele Troiani, Vittorio Erba, Lenka Zdeborová, Bruno Loureiro, Florent Krzakala

AI总结 本文研究了浅层神经网络在特征学习 regime 中的缩放定律与谱特性,通过分析二次和对角神经网络的缩放规律,揭示了样本复杂度和权重衰减对过剩风险缩放指数的影响,并建立了这些 regime 与训练网络权重谱性质的精确联系。

详情
Journal ref
ICLR 2026
AI中文摘要

神经缩放定律是深度学习近期许多进展的基础,但其理论理解仍然主要局限于线性模型。在本文中,我们系统分析了二次和对角神经网络在特征学习 regime 中的缩放定律。利用与矩阵压缩感知和LASSO的联系,我们推导了过剩风险缩放指数作为样本复杂度和权重衰减函数的详细相图。这种分析揭示了不同缩放 regime 之间的交叉和平台行为,与经验神经缩放文献中广泛报告的现象相呼应。此外,我们建立了这些 regime 与训练网络权重谱性质的精确联系,我们对其进行了详细刻画。作为结果,我们提供了最近经验观察的理论验证,这些观察将权重谱中幂律尾部的出现与网络泛化性能联系起来,从而给出了从基本原理出发的解释。

英文摘要

Neural scaling laws underlie many of the recent advances in deep learning, yet their theoretical understanding remains largely confined to linear models. In this work, we present a systematic analysis of scaling laws for quadratic and diagonal neural networks in the feature learning regime. Leveraging connections with matrix compressed sensing and LASSO, we derive a detailed phase diagram for the scaling exponents of the excess risk as a function of sample complexity and weight decay. This analysis uncovers crossovers between distinct scaling regimes and plateau behaviors, mirroring phenomena widely reported in the empirical neural scaling literature. Furthermore, we establish a precise link between these regimes and the spectral properties of the trained network weights, which we characterize in detail. As a consequence, we provide a theoretical validation of recent empirical observations connecting the emergence of power-law tails in the weight spectrum with network generalization performance, yielding an interpretation from first principles.

2509.20345 2026-06-05 stat.ME cs.LG stat.ML

General Synthetic-Powered Inference

通用合成数据驱动推断

Meshi Bashari, Yonghoon Lee, Roy Maor Lotan, Edgar Dobriban, Yaniv Romano

AI总结 本文提出了一种通用合成数据驱动推断框架,通过结合高质量合成数据和真实数据来提高样本效率,同时在合成数据质量低时自动回退到传统方法,无需分布假设即可保持误差率在用户指定范围内。

详情
AI中文摘要

高质量合成数据的快速普及——由先进的人工智能模型生成或从相关任务中收集——为统计推断带来了机遇和挑战。本文介绍了一种通用合成数据驱动推断(GESPI)框架,该框架围绕广义的统计推断程序包裹,通过结合合成和真实数据安全地提高样本效率。我们的框架利用高质量合成数据提高统计效力,但能自适应回退到仅使用真实数据的传统方法,当合成数据质量较低时。在不假设合成数据分布的情况下,该方法的误差率始终低于用户指定的界限,且随着合成数据质量的提高而降低。这种灵活性使该框架能够无缝集成到符合性预测、风险控制、假设检验和多重检验程序中,而无需修改基础推断方法。我们在有限标注数据的挑战性任务上展示了该方法的优势,包括AlphaFold蛋白质结构预测,以及在复杂数学问题上比较大型推理模型。

英文摘要

The rapid proliferation of high-quality synthetic data -- generated by advanced AI models or collected as auxiliary data from related tasks -- presents both opportunities and challenges for statistical inference. This paper introduces a GEneral Synthetic-Powered Inference (GESPI) framework that wraps around a broad class of statistical inference procedures to safely enhance sample efficiency by combining synthetic and real data. Our framework leverages high-quality synthetic data to boost statistical power, yet adaptively defaults to the standard method using only real data when synthetic data are of low quality. The error rate of our method remains below a user-specified bound without any distributional assumptions on the synthetic data, and decreases as the quality of the synthetic data improves. This flexibility enables seamless integration with conformal prediction, risk control, hypothesis testing, and multiple testing procedures, all without modifying the base inference method. We demonstrate the benefits of our method on challenging tasks with limited labeled data, including AlphaFold protein structure prediction, and comparing large reasoning models on complex math problems.

2507.12257 2026-06-05 cs.LG physics.data-an stat.ML stat.OT

Robust Causal Discovery in Real-World Time Series with Power-Laws

在现实时间序列中使用幂律实现鲁棒因果发现

Matteo Tusoni, Giuseppe Masi, Andrea Coletta, Aldo Glielmo, Viviana Arrigoni, Novella Bartolini

AI总结 本文提出了一种基于幂律谱特征提取的鲁棒因果发现方法,以提高在现实时间序列中因果关系发现的鲁棒性,该方法在合成数据集和真实数据集上均优于现有方法。

详情
AI中文摘要

在随机时间序列中探索因果关系是一项具有广泛应用(包括金融、经济、神经科学和气候科学)的挑战性但至关重要的任务。许多因果发现(CD)算法已被提出;然而,它们通常对噪声高度敏感,在真实数据中导致虚假的因果推断。在本文中,我们观察到许多现实时间序列的频率谱遵循幂律分布,这主要是由于内在的自组织行为。利用这一见解,我们构建了一种基于提取幂律谱特征的鲁棒CD方法,以放大真实的因果信号。我们的方法在合成基准和具有已知因果结构的真实数据集上均优于最先进的替代方法,证明了其鲁棒性和实际相关性。

英文摘要

Exploring causal relationships in stochastic time series is a challenging yet crucial task with a vast range of applications, including finance, economics, neuroscience, and climate science. Many algorithms for Causal Discovery (CD) have been proposed; however, they often exhibit a high sensitivity to noise, resulting in spurious causal inferences in real data. In this paper, we observe that the frequency spectra of many real-world time series follow a power-law distribution, notably due to an inherent self-organizing behavior. Leveraging this insight, we build a robust CD method based on the extraction of power-law spectral features that amplify genuine causal signals. Our method consistently outperforms state-of-the-art alternatives on both synthetic benchmarks and real-world datasets with known causal structures, demonstrating its robustness and practical relevance.

2602.10103 2026-06-05 math.ST math.PR stat.TH

Minimax properties of gamma kernel density estimators under $L^p$ loss and $β$-Hölder smoothness of the target

gamma核密度估计在$L^p$损失和β-Holder光滑目标下的极小大性质

Frédéric Ouimet

AI总结 本文研究了在β-Hölder空间中,非修改的gamma核密度估计在L^p损失下的渐近行为,特别是在目标可能有有限有效或真实上端点但估计器本身未截断并将其支持视为[0,∞)的情况下。通过将有限端点作为函数类和风险定义中的分析工具,而非作为估计器的信息,选择的功能类使得目标密度在上端点处平滑地趋于零,从而隔离了原点处的行为并避免了额外的上端点泄漏偏差。证明了当(p,β)∈[1,3)×(0,2]或当3≤p<4且(p-3)/(p-2)<β≤2时,该估计器可以实现极小大率,而当p∈[4,∞)或β∈(2,∞)时则无法实现极小大。剩余区域{(p,β):3<p<4,0<β≤(p-3)/(p-2)}仍是一个开放问题。

详情
Comments
37 pages, 3 figures
AI中文摘要

本文考虑了在β-Hölder空间中,非修改的gamma核密度估计在L^p损失下的渐近行为,特别是在目标可能有有限有效或真实上端点但估计器本身未截断并将其支持视为[0,∞)的情况下。有限端点被用作函数类和风险定义中的分析工具,而非作为估计器的信息。功能类被选择使得目标密度在上端点处平滑地趋于零,从而隔离了原点处的行为并避免了额外的上端点泄漏偏差。证明了当(p,β)∈[1,3)×(0,2]或当3≤p<4且(p-3)/(p-2)<β≤2时,该估计器可以实现极小大率,而当p∈[4,∞)或β∈(2,∞)时则无法实现极小大。剩余区域{(p,β):3<p<4,0<β≤(p-3)/(p-2)}仍是一个开放问题。

英文摘要

This paper considers the asymptotic behavior in $β$-Hölder spaces, and under $L^p$ loss, of the non-modified gamma kernel density estimator introduced by Chen [Ann. Inst. Statist. Math. 52 (2000), 471-480] for the analysis of nonnegative data, in the situation where the target may have a finite effective or true upper endpoint but the estimator itself is left untruncated and treats the support as $[0,\infty)$. The finite endpoint is used as an analytical device in the definition of the function class and the risk, not as information supplied to the estimator. The functional classes are chosen so that the target density matches smoothly to zero at the upper endpoint, which isolates the behavior at the origin and avoids an additional upper-endpoint leakage bias. It is shown that this estimator can achieve the minimax rate asymptotically for a suitable choice of bandwidth whenever $(p,β)\in [1,3)\times(0,2]$, or whenever $3 \leq p < 4$ and $(p-3)/(p-2) < β\leq 2$. It is also shown that this estimator cannot be minimax when either $p\in [4,\infty)$ or $β\in (2,\infty)$. The remaining region $\left\{(p,β): 3 < p < 4,\ 0 < β\leq (p-3)/(p-2)\right\}$ is an open case.

2508.04409 2026-06-05 stat.ML cs.LG

The Relative Instability of Model Comparison with Cross-validation

模型比较与交叉验证的相对不稳定性

Alexandre Bayle, Lucas Janson, Lester Mackey

AI总结 研究指出即使个体稳定的模型在比较时也可能产生相对不稳定的结果,挑战了交叉验证推断的有效性,特别指出Lasso和软阈值化在最有利的学习条件下仍会导致无效的交叉验证推断。

详情
AI中文摘要

交叉验证(CV)已知能提供渐近精确的模型改进测试和置信区间,但仅在模型比较相对稳定时才成立。令人惊讶的是,我们证明了即使简单且个体稳定的模型也能产生相对不稳定的比较,从而质疑CV推断的有效性。具体来说,我们展示了Lasso及其近亲软阈值化在最有利的学习条件下,即使两个模型本身都稳定,也会产生相对不稳定的比较和无效的CV推断。这些发现强调在部署CV进行模型比较前验证相对稳定性的重要性。

英文摘要

Cross-validation (CV) is known to provide asymptotically exact tests and confidence intervals for model improvement but only when the model comparison is relatively stable. Surprisingly, we prove that even simple, individually stable models can generate relatively unstable comparisons, calling into question the validity of CV inference. Specifically, we show that the Lasso and its close cousin, soft-thresholding, generate relatively unstable comparisons and invalid CV inferences, even in the most favorable of learning settings and when both models are individually stable. These findings highlight the importance of verifying relative stability before deploying CV for model comparison.

2602.06773 2026-06-05 cs.LG stat.ML

On the Convergence of Multicalibration Gradient Boosting

多校准梯度提升的收敛性研究

Daniel Haimovich, Fridolin Linder, Lorenzo Perini, Niek Tax, Milan Vojnovic

AI总结 本文研究了多校准梯度提升的收敛性,证明了预测更新的幅度以O(1/√T)衰减,并在额外的平滑假设下实现线性收敛,实验验证了理论结果和方法的快速收敛性。

详情
Comments
Under submission
AI中文摘要

多校准梯度提升最近作为一种可扩展的方法出现,实证上能够产生近似多校准的预测器,并在大规模网络上部署。尽管这种实证成功,其收敛性质尚未得到充分理解。在本文中,我们为多校准梯度提升算法提供了计算保证。我们证明了连续预测更新的幅度以O(1/√T)衰减,这表明在轮次中经验多校准误差的相同收敛率界限。在额外的弱学习器平滑性假设下,该速率提高到线性收敛。我们进一步建立了自适应变体的收敛性。在真实世界数据集上的实验支持我们的理论,并澄清了该方法在何种情况下实现快速收敛性。

英文摘要

Multicalibration gradient boosting has recently emerged as a scalable method that empirically produces approximately multicalibrated predictors and has been deployed at web scale. Despite this empirical success, its convergence properties are not well understood. In this paper, we provide computational guarantees for multicalibration gradient boosting algorithms. We show that the magnitude of successive prediction updates decays at $O(1/\sqrt{T})$, which implies the same convergence rate bound for the empirical multicalibration error over rounds. Under additional smoothness assumptions on the weak learners, this rate improves to linear convergence. We further establish convergence for adaptive variants. Experiments on real-world datasets support our theory and clarify the regimes in which the method achieves fast convergence.

2602.01607 2026-06-05 math.ST cs.IT cs.LG math.IT stat.ML stat.TH

Minimax optimal differentially private synthetic data for smooth queries

最小最大最优差分隐私合成数据用于平滑查询

Rundong Ding, Yiyun He, Yizhe Zhu

AI总结 本文研究了如何生成具有(ε,δ)差分隐私的合成数据,以在保证个体隐私的同时,为有意义的下游分析提供强效用保证。提出了一种多项式时间算法,实现了最小最大误差率O_{k,d}(n^{-min{1, k/d}}),并建立了针对k-平滑查询的首个最小最大下界。

详情
Comments
COLT 2026 arXiv version. 34 pages
AI中文摘要

差分隐私合成数据使敏感数据集的共享和分析成为可能,同时为个体贡献者提供严格的隐私保证。一个核心挑战是为有意义的下游分析提供强效用保证。许多现有方法确保在广泛的查询类上具有均匀的准确性,如所有Lipschitz函数,但这种通用性往往导致对实际感兴趣的统计量的次优速率。由于许多常见数据分析查询的平滑性超出了最坏情况Lipschitz界所捕捉的范围,我们询问是否可以利用这种额外的结构来提高效用。我们研究了从大小为n的数据集生成(ε,δ)差分隐私合成数据的问题,该数据集支持在超立方体[-1,1]^d上,具有对所有具有受界导数的平滑查询的均匀效用保证。我们提出了一种多项式时间算法,实现了最小最大误差率O_{k,d}(n^{-min{1, k/d}}),除了一个log(n)因子。这一特征揭示了k=d处的相变。我们的结果推广了Chebyshev矩匹配框架(Musco等,2025;Wang等,2016),并且严格改进了在\citep{wang2016differentially}中为k-平滑查询建立的误差率。此外,我们建立了针对k-平滑查询的首个最小最大下界,扩展了Boedihardjo等(2024)中关于ε-差分隐私的Wasserstein下界。

英文摘要

Differentially private synthetic data enables the sharing and analysis of sensitive datasets while providing rigorous privacy guarantees for individual contributors. A central challenge is to achieve strong utility guarantees for meaningful downstream analysis. Many existing methods ensure uniform accuracy over broad query classes, such as all Lipschitz functions, but this level of generality often leads to suboptimal rates for statistics of practical interest. Since many common data analysis queries exhibit smoothness beyond what worst-case Lipschitz bounds capture, we ask whether exploiting this additional structure can yield improved utility. We study the problem of generating $(\varepsilon,δ)$-differentially private synthetic data from a dataset of size $n$ supported on the hypercube $[-1,1]^d$, with utility guarantees uniformly for all smooth queries having bounded derivatives up to order $k$. We propose a polynomial-time algorithm that achieves a minimax error rate of $O_{k,d}(n^{-\min \{1, \frac{k}{d}\}})$, up to a $\log(n)$ factor. This characterization uncovers a phase transition at $k=d$. Our results generalize the Chebyshev moment matching framework of (Musco et al., 2025; Wang et al., 2016) and strictly improve the error rates for $k$-smooth queries established in \citep{wang2016differentially}. Moreover, we establish the first minimax lower bound for the utility of $(\varepsilon,δ)$-differentially private synthetic data with respect to $k$-smooth queries, extending the Wasserstein lower bound for $\varepsilon$-differential privacy in (Boedihardjo et al., 2024).

2601.06655 2026-06-05 cs.CE physics.comp-ph stat.ML

Physics-constrained Gaussian Processes for Predicting Shockwave Hugoniot Curves

具有物理约束的高斯过程用于预测冲击波赫荣尼特曲线

George D. Pasparakis, Himanshu Sharma, Rushik Desai, Chunyu Li, Alejandro Strachan, Lori Graham-Brady, Michael D. Shields

AI总结 本文提出了一种基于物理约束的高斯过程回归框架,利用少量冲击波模拟数据预测受冲击材料状态及其不确定性沿赫荣尼特曲线的演变,通过约束Rankine-Hugoniot跃变条件构建热力学一致的协方差函数,从而在少量可解释超参数上进行优化,识别从领先弹性波到滞后塑性及相变波的转变。

详情
AI中文摘要

本文开发了一种受物理约束的高斯过程回归框架,用于利用少量冲击波模拟数据预测受冲击材料状态及其相关不确定性沿赫荣尼特曲线。所提出的高斯过程通过约束不同受冲击材料状态之间的Rankine-Hugoniot跃变条件来构建热力学一致的协方差函数。这导致在少量可解释超参数上进行优化的问题,从而能够识别从领先弹性波到滞后塑性及相变波的转变。赫荣尼特曲线是理解材料在极端条件下行为的重要指标,包括用于开发状态方程和确定材料性质如赫荣尼特弹性极限,但通过大规模分子动力学模拟或冲击实验生成这些曲线成本很高。在这些约束下,所提出的方法利用有限数量的分子动力学模拟生成赫荣尼特曲线。我们考虑碳化硅作为代表性材料,并使用反向弹道方法进行分子动力学模拟。该框架能够以令人满意的精度重现赫荣尼特曲线,同时利用高斯过程后验量化预测的不确定性。这些不确定的赫荣尼特预测可用于校准状态方程模型、估计材料性质或指导未来的实验和/或模拟活动。

英文摘要

A physics-constrained Gaussian Process regression framework is developed for predicting shocked material states and their associated uncertainties along the Hugoniot curve using data from a small number of shockwave simulations. The proposed Gaussian process is constrained by the Rankine-Hugoniot jump conditions between the various shocked material states to construct a thermodynamically consistent covariance function. This leads to the formulation of an optimization problem over a small number of interpretable hyperparameters and enables the identification of regime transitions, from a leading elastic wave to trailing plastic and phase transformation waves. Shock Hugoniots are an important measure for understanding material behavior under extreme conditions, including for the development of equations of state and determining material properties such as the Hugoniot Elastic Limit, but they are costly to generate through large-scale molecular dynamics simulations or shock experiments. Under these constraints, the proposed methodology establishes Hugoniot curves from a limited number of molecular dynamics simulations. We consider silicon carbide as a representative material and Molecular Dynamics simulations are performed using a reverse ballistic approach. The framework reproduces the Hugoniot curve with satisfactory accuracy while also quantifying the uncertainty in the predictions using the Gaussian Process posterior. These uncertain Hugoniot predictions can then be used to calibrate equation of state models, estimate material properties, or inform future experimental and/or simulation campaigns.

2510.10968 2026-06-05 cs.LG stat.ML

Blade: A Derivative-free Bayesian Inversion Method using Diffusion Priors

Blade:一种使用扩散先验的无导数贝叶斯反演方法

Hongkai Zheng, Austin Wang, Zihui Wu, Zhengyu Huang, Ricardo Baptista, Yisong Yue

AI总结 本文提出Blade方法,通过使用扩散模型作为数据驱动的先验,解决无导数贝叶斯反演中高维非线性问题的后验估计问题,实现了准确且校准良好的后验分布。

详情
AI中文摘要

无导数贝叶斯反演在科学和工程应用中出现,特别是在正向模型成本高或无法通过导数进行微分时。现有的无导数方法将后验缩减为点估计或在高维非线性问题中返回严重过自信的不确定性。我们介绍了Blade,它使用相互作用粒子的集合产生准确且校准良好的后验。Blade利用扩散模型作为数据驱动的先验,并且只通过正向评估(即无导数)查询正向模型。理论上,我们证明了在正向模型近似和先验分数估计误差下,Blade的收敛性和稳定性。经验上,在非线性流体动力学中,Blade产生校准良好的后验样本,这些样本现有无导数方法无法产生,通过CRPS、扩展-技能比和等级直方图进行测量。其准确性和校准随着迭代次数和粒子数的增加而持续提高,这得到了我们的收敛性和稳定性分析以及经验实验的支持。

英文摘要

Derivative-free Bayesian inversion arises in science and engineering applications, particularly when forward model is costly or infeasible to differentiate through. Existing derivative-free methods collapse the posterior to a point estimate or return severely over-confident uncertainty on high-dimensional, nonlinear problems. We introduce Blade, which produces accurate and well-calibrated posteriors using an ensemble of interacting particles. Blade leverages diffusion models as data-driven priors, and only queries the forward model through forward evaluations (i.e., derivative-free). Theoretically, we show the convergence and stability of Blade under forward model approximation and prior score estimation error. Empirically, on nonlinear fluid dynamics, Blade produces well-calibrated posterior samples that existing derivative-free methods cannot, as measured by CRPS, the spread-skill ratio, and the rank histogram. Its accuracy and calibration improve consistently with more iterations and particles, backed by our convergence and stability analysis and empirical experiments.

2511.09890 2026-06-05 stat.ME stat.AP

A Clustering Approach for Basket Trials Based on Treatment Response Trajectories

基于治疗反应轨迹的篮子试验聚类方法

Masahiro Kojima, Keisuke Hanada, Atsuya Sato

AI总结 本文提出了一种基于治疗反应轨迹转移概率的无模型聚类方法,用于对篮子试验中的篮子进行分组,以提高疗效终点估计的精度和统计功效,同时保持I类错误率在名义水平。

详情
AI中文摘要

在篮子试验中,有时会观察到疗效的异质性。在本研究中,我们提出了一种无模型的聚类框架,该框架根据治疗反应轨迹导出的转移概率对篮子进行分组,而不是仅依赖单一疗效终点,如客观反应率。簇的数量不是预设的,而是通过数据驱动的方式根据篮子间的相似性结构自动确定。聚类后,同一簇内的篮子使用分层贝叶斯模型进行分析。该框架旨在提高疗效终点的估计精度,增强统计功效,同时保持I类错误率在名义水平。通过模拟研究评估了所提出方法的性能。结果表明,所提出的方法在异质设置中能够准确识别簇结构,并在这样的条件下保持I类错误率在名义水平,同时提高统计功效。

英文摘要

Heterogeneity in efficacy is sometimes observed across baskets in basket trials. In this study, we propose a model-free clustering framework that groups baskets based on transition probabilities derived from the trajectories of treatment response, rather than relying solely on a single efficacy endpoint such as the objective response rate. The number of clusters is not predetermined but is automatically determined in a data-driven manner based on the similarity structure among baskets. After clustering, baskets within the same cluster are analyzed using a hierarchical Bayesian model. This framework aims to improve the estimation precision of efficacy endpoints and enhance statistical power while maintaining the type~I error rate at the nominal level. The performance of the proposed method was evaluated through simulation studies. The results demonstrated that the proposed method can accurately identify cluster structures in heterogeneous settings and, even under such conditions, maintain the type~I error rate at the nominal level while improving statistical power.

2412.11800 2026-06-05 cs.LG stat.ML

Scalable Temporal Anomaly Causality Discovery in Large Systems: Achieving Computational Efficiency with Binary Anomaly Flag Data

在大规模系统中实现可扩展的时间异常因果发现:通过二进制异常标志数据实现计算效率

Mulugeta Weldezgina Asres, Christian Walter Omlin, The CMS-HCAL Collaboration

AI总结 本文提出了一种异常因果发现方法(AnomalyCD),旨在解决从时间二进制标志数据集生成图形因果模型(GCMs)的准确性和计算挑战,通过异常数据感知的因果测试、稀疏数据和先验链接压缩以及边修剪调整等策略,提高了计算效率和准确性。

详情
Journal ref
Eur. Phys. J. C, 86, 585 (2026)
Comments
26 pages, 17 figures, 8 tables, published version at EPJ-C: Computing, Software and Data Science
AI中文摘要

提取异常因果关系有助于在监控系统检测系统故障时进行诊断。在大规模系统中识别异常原因涉及在多个子系统中调查更广泛的监控变量。然而,学习图形因果模型(GCMs)带来了显著的计算负担,限制了现有方法在实时和大规模部署中的应用。此外,现代大规模系统的监控应用通常生成大量二进制警报标志,二进制异常数据的特征——状态转换的意义和数据稀疏性——挑战了现有的因果学习机制。本文提出了一种异常因果发现方法(AnomalyCD),以解决从时间二进制标志数据集生成GCMs的准确性和计算挑战。AnomalyCD提出了几种策略,例如异常数据感知的因果测试、稀疏数据和先验链接压缩,以及边修剪调整方法。我们在两个数据集上验证了该方法的性能:来自欧洲核子研究中心紧凑缪子对撞机实验读出盒系统的传感器数据,以及一个来自信息技术监控系统的公开数据集。在时间GCMs上的结果表明,计算开销显著减少,且在二进制异常数据集上准确性有所提高。代码:https://github.com/muleina/AnomalyCD

英文摘要

Extracting anomaly causality facilitates diagnostics once monitoring systems detect system faults. Identifying anomaly causes in large systems involves investigating a broader set of monitoring variables across multiple subsystems. However, learning graphical causal models (GCMs) comes with a significant computational burden that restrains the applicability of most existing methods in real-time and large-scale deployments. In addition, modern monitoring applications for large systems often generate large amounts of binary alarm flags, and the distinct characteristics of binary anomaly data -- the meaning of state transition and data sparsity -- challenge existing causality learning mechanisms. This study proposes an anomaly causal discovery approach (AnomalyCD), addressing the accuracy and computational challenges of generating GCMs from temporal binary flag datasets. The AnomalyCD presents several strategies, such as anomaly data-aware causality testing, sparse data and prior link compression, and edge pruning adjustment approaches. We validate the performance of the approach on two datasets: monitoring sensor data from the readout-box system of the Compact Muon Solenoid experiment at CERN, and a public dataset from an information technology monitoring system. The results on temporal GCMs demonstrate a considerable reduction of computation overhead and a moderate enhancement of accuracy on the binary anomaly datasets. Code: https://github.com/muleina/AnomalyCD .

2512.19510 2026-06-05 cs.LG stat.ML

Toward Scalable and Valid Conditional Independence Testing with Spectral Representations

迈向基于谱表示的可扩展且有效的条件独立性检验

Alek Fröhlich, Vladimir R. Kostic, Karim Lounici, Daniel Perazzo, Daniel Tiezzi, Massimiliano Pontil

AI总结 本文提出了一种基于谱表示的学习方法,用于解决传统条件独立性检验在适应性和可扩展性方面的不足,通过构造简单的检验统计量和双层对比算法,建立了表示学习误差与检验性能之间的理论联系,并在实际和合成数据上验证了其有效性。

详情
Comments
Accepted at ICML 2026. Revised to match the accepted version; updated experiments and exposition
AI中文摘要

条件独立性(CI)在因果推断、特征选择和图模型中至关重要,然而在许多情况下,没有额外假设的情况下无法进行检验。现有的CI检验通常依赖于限制性的结构条件,限制了其有效性。核方法使用偏协方差算子提供了一种更系统的方法,但存在有限的适应性和可扩展性。在本工作中,我们探讨了表示学习是否能帮助解决这些限制。具体而言,我们关注由偏协方差算子的奇异值分解得到的表示,并利用这些表示构造一个简单的检验统计量。我们还引入了一个双层对比算法来学习这些表示。我们的理论将表示学习误差与检验性能联系起来,并建立了渐近有效性和功效保证。在实际和合成数据上的实验表明,这种方法提供了一条系统且统计上站得住脚的路径,以实现可扩展的CI检验,将基于核的理论与现代表示学习相结合。

英文摘要

Conditional independence (CI) is central to causal inference, feature selection, and graphical modeling, yet it is untestable in many settings without additional assumptions. Existing CI tests often rely on restrictive structural conditions, limiting their validity. Kernel methods using partial covariance operators offer a more principled approach but suffer from limited adaptivity and scalability. In this work, we explore whether representation learning can help address these limitations. Specifically, we focus on representations derived from the singular value decomposition of partial covariance operators and use them to construct a simple test statistic. We also introduce a bi-level contrastive algorithm to learn these representations. Our theory links representation learning error to test performance and establishes asymptotic validity and power guarantees. Experiments on real and synthetic data suggest that this approach offers a principled and statistically grounded path toward scalable CI testing, bridging kernel-based theory with modern representation learning.

2512.05013 2026-06-05 cs.AI cs.MA stat.ME

Detecting Perspective Shifts in Multi-agent Systems

在多智能体系统中检测视角变化

Eric Bridgeford, Hayden Helm

AI总结 本文提出了一种名为TDKPS的框架,用于检测多智能体系统中智能体和群体层面的行为变化,通过模拟和自然实验验证了其在检测真实外部事件变化方面的有效性。

详情
AI中文摘要

增强型生成模型结合外部工具和更新机制(或称为智能体)已展现出超越基础模型智能提示的能力。随着智能体的广泛应用,动态多智能体系统自然地出现了。最近的研究探讨了基于单时间点查询响应的低维表示的理论和经验属性。本文引入了时间数据核视角空间(TDKPS),该方法跨时间联合嵌入智能体,并提出了几种新的假设检验方法,用于检测多智能体系统中智能体和群体层面的行为变化。我们通过受演进数字身份多智能体系统启发的模拟,表征了所提出检验的实证属性,包括其对关键超参数的敏感性。最后,我们通过自然实验证明,所提出检验能够检测出与真实外生事件相关、敏感且显著变化。据我们所知,TDKPS是首个系统性的框架,用于监控多智能体系统中的行为动态——随着生成智能体部署的持续扩展,这一能力至关重要。

英文摘要

Generative models augmented with external tools and update mechanisms (or \textit{agents}) have demonstrated capabilities beyond intelligent prompting of base models. As agent use proliferates, dynamic multi-agent systems have naturally emerged. Recent work has investigated the theoretical and empirical properties of low-dimensional representations of agents based on query responses at a single time point. This paper introduces the Temporal Data Kernel Perspective Space (TDKPS), which jointly embeds agents across time, and proposes several novel hypothesis tests for detecting behavioral change at the agent- and group-level in black-box multi-agent systems. We characterize the empirical properties of our proposed tests, including their sensitivity to key hyperparameters, in simulations motivated by a multi-agent system of evolving digital personas. Finally, we demonstrate via natural experiment that our proposed tests detect changes that correlate sensitively, specifically, and significantly with a real exogenous event. As far as we are aware, TDKPS is the first principled framework for monitoring behavioral dynamics in black-box multi-agent systems -- a critical capability as generative agent deployment continues to scale.

2511.15841 2026-06-05 math.ST stat.TH

New Empirical Process Tools and Their Applications to Robust Deep ReLU Networks and Phase Transitions for Nonparametric Regression

新的经验过程工具及其在鲁棒深度ReLU网络和非参数回归相变中的应用

Yizhe Ding, Runze Li, Lingzhou Xue

AI总结 本文提出新的经验过程工具,用于在重尾噪声和复杂函数类下分析广泛的统计学习模型。主要贡献是推导出两种Dudley型最大不等式,这些不等式消除了如轻尾和函数类均匀有界等限制性假设。这些不等式扩展了经验过程理论在统计学习和非参数估计中的应用范围。利用新的界限,我们建立了深度ReLU网络估计器在Huber和分位数回归中的鲁棒性保证。特别是,我们证明了一个统一的非渐近子高斯集中界,即使在无限方差噪声下仍然有效,并对深度Huber估计器在所有噪声情况下的非渐近鲁棒性进行了全面分析。对于深度分位数回归,我们提供了第一个无渐近子高斯界,无需假设矩条件。此外,我们的框架还为非参数最小二乘估计器提供了误差界,同时容纳无限方差噪声、非Donsker函数类和近似误差。此外,与以往基于专门乘数过程的方法不同,我们的框架扩展到更广泛的经验风险最小化问题,包括非参数广义线性模型和``集合结构``模型。

详情
AI中文摘要

本文介绍了新的经验过程工具,用于分析在重尾噪声和复杂函数类下的广泛统计学习模型。我们的主要贡献是推导出两种Dudley型最大不等式,用于预期经验过程,这些不等式消除了如轻尾和函数类均匀有界等限制性假设。这些不等式扩展了经验过程理论在统计学习和非参数估计中的应用范围。利用新的界限,我们建立了深度ReLU网络估计器在Huber和分位数回归中的鲁棒性保证。特别是,我们证明了一个统一的非渐近子高斯集中界,即使在无限方差噪声下仍然有效,并对深度Huber估计器在所有噪声情况下的非渐近鲁棒性进行了全面分析。对于深度分位数回归,我们提供了第一个非渐近子高斯界,无需要求矩假设。此外,我们的框架还为非参数最小二乘估计器提供了误差界,同时容纳无限方差噪声、非Donsker函数类和近似误差。此外,与以往基于专门乘数过程的方法不同,我们的框架扩展到更广泛的经验风险最小化问题,包括非参数广义线性模型和``集合结构``模型。

英文摘要

This paper introduces new empirical process tools for analyzing a broad class of statistical learning models under heavy-tailed noise and complex function classes. Our primary contribution is the derivation of two Dudley-type maximal inequalities for expected empirical processes that remove restrictive assumptions such as light tails and uniform boundedness of the function class. These inequalities enlarge the scope of empirical process theory available for statistical learning and nonparametric estimation. Exploiting the new bounds, we establish robustness guarantees for deep ReLU network estimators in Huber and quantile regression. In particular, we prove a unified non-asymptotic sub-Gaussian concentration bound that remains valid even under infinite-variance noise and provide a comprehensive analysis of non-asymptotic robustness for deep Huber estimators across all noise regimes. For deep quantile regression, we provide the first non-asymptotic sub-Gaussian bounds without requiring moment assumptions. As an additional application, our framework yields estimation error bounds for nonparametric least-squares estimators that simultaneously accommodate infinite-variance noise, non-Donsker function classes, and approximation error. Moreover, unlike prior approaches based on specialized multiplier processes, our framework extends to broader empirical risk minimization problems, including the nonparametric generalized linear models and the ``set-structured'' models.

2511.16111 2026-06-05 stat.ML cs.LG math.SP

Rotation-Parameterized Graph Fractional Fourier Transform: Definition, Properties, and Optimal Filtering

旋转参数化图分数阶傅里叶变换:定义、性质和最优滤波

Feiyue Zhao, Mingzhi Wang, Yangfan He, Zhichao Zhang

AI总结 本文提出旋转参数化图分数阶傅里叶变换(RP-GFRFT),通过统一分数阶和旋转参数化的谱分析,解决现有方法在旋转基控制和零角度退化方面的不足,提升图信号处理的去噪、重建和特征保留性能。

详情
AI中文摘要

图谱表示在图信号处理中是基础,为分析图结构数据提供严谨的框架。图分数阶傅里叶变换(GFRFT)通过分数阶参数扩展图傅里叶变换(GFT),实现灵活的谱分析并保持数学一致性。角图傅里叶变换(AGFT)通过旋转GFT特征向量引入角度控制;然而现有构造可能无法在零角度时精确还原为GFT,削弱理论一致性和可解释性。为解决这些互补的局限性,即GFRFT缺乏基于旋转的基控制和AGFT的零角度退化问题,本文提出旋转参数化图分数阶傅里叶变换(RP-GFRFT),统一分数阶和旋转参数化的谱分析。构造了一个保持退化的旋转矩阵族以保证在零角度时精确还原为GFT。然后提出了两种RP-GFRFT变体,I-RP-GFRFT和II-RP-GFRFT,并通过理论分析确认其幺正性、可逆性、还原行为和光滑参数依赖性。将分数阶和旋转角度联合优化用于自适应图谱滤波。在真实世界信号、图像和点云上的实验表明,RP-GFRFT在去噪精度、重建质量和特征保留方面优于GFRFT、AGFT和代表性滤波基线。

英文摘要

Graph spectral representations are fundamental in graph signal processing, providing a rigorous frameworkforanalyzing graph-structured data. The graph fractional Fourier transform (GFRFT) extends the graph Fourier transform (GFT) through a fractional-order parameter, enabling flexible spectral analysis with mathematical consistency. The angular graph Fourier transform (AGFT) further introduces angular control by rotating GFT eigenvectors; however, existing constructions may fail to reduce exactly to the GFT at zero angle, weakening theoretical consistency and interpretability. To address these complementary limitations, namely the lack of rotation-based basis control in GFRFT and the defective zero-angle degeneracy of AGFT, this paper proposes the rotation-parameterized graph fractional Fourier transform (RP-GFRFT), which unifies fractional order and rotation-parameterized spectral analysis. A degeneracy preserving rotation matrix family is constructed to guarantee exact GFT reduction at zero angle. TwoRP-GFRFTvariants,I-RP-GFRFTandII-RP-GFRFT,arethenformulated, with theoretical analyses confirming their unitarity, invertibility, reduction behavior, and smooth parameter dependence. The fractional order and rotation angle are jointly optimized for adaptive graph spectral filtering. Experiments on real-world signals, images, and point clouds demonstrate that RP-GFRFT improves denoising accuracy, reconstruction quality, and feature preservation over GFRFT, AGFT, and representative filtering baselines.

2511.04058 2026-06-05 math.ST math.PR stat.TH

Finding Planted Cycles in a Random Graph

在随机图中寻找植入的环

Julia Gaudio, Colin Sandon, Jiaming Xu, Dana Yang

AI总结 该研究探讨了在随机图中寻找植入环的问题,证明了当λ满足特定条件时,几乎精确恢复是信息论上可能的,而设计了一个多项式时间算法实现这一目标,与植入聚类问题形成对比。

详情
AI中文摘要

在本文中,我们研究了在ER随机图G∼G(n,λ/n)中寻找一组植入环的问题,类似于著名的植入聚类问题。当环被植入在均匀随机选择的δn个顶点上时,我们证明,当λ<1/(√(2δ)+√(1−δ))²时,几乎精确恢复(即随着n→∞,恢复所有但极小比例的植入环边)在信息论上是可能的,而当λ>1/(√(2δ)+√(1−δ))²时是不可能的。此外,尽管寻找长环在最坏情况下计算上具有难度,我们设计了一个多项式时间算法,在λ<1/(√(2δ)+√(1−δ))²时实现几乎精确恢复。这与植入聚类问题形成鲜明对比,后者广泛认为存在显著的计算-统计差距。

英文摘要

In this paper, we study the problem of finding a collection of planted cycles in an \ER random graph $G \sim \mathcal{G}(n, λ/n)$, in analogy to the famous Planted Clique Problem. When the cycles are planted on a uniformly random subset of $δn$ vertices, we show that almost-exact recovery (that is, recovering all but a vanishing fraction of planted-cycle edges as $n \to \infty$) is information-theoretically possible if $λ< \frac{1}{(\sqrt{2 δ} + \sqrt{1-δ})^2}$ and impossible if $λ> \frac{1}{(\sqrt{2 δ} + \sqrt{1-δ})^2}$. Moreover, despite the worst-case computational hardness of finding long cycles, we design a polynomial-time algorithm that attains almost exact recovery when $λ< \frac{1}{(\sqrt{2 δ} + \sqrt{1-δ})^2}$. This stands in stark contrast to the Planted Clique Problem, where a significant computational-statistical gap is widely conjectured.

2410.23587 2026-06-05 econ.EM q-fin.CP stat.CO

Moments by Integrating the Moment-Generating Function

通过积分动差生成函数计算动差

Peter Reinhard Hansen, Chen Tong

AI总结 本文提出了一种通用积分框架,用于在满足显式正则条件的情况下,从动差生成函数计算分数、复数、绝对和对数动差。通过沿垂直轮廓评估复数扩展的动差生成函数,获得精确的积分表达式,从而避免了显式概率密度和高阶导数的需要。

详情
AI中文摘要

我们介绍了一种通用的积分框架,用于在满足显式正则条件的情况下,从动差生成函数(MGF)计算分数、复数、绝对和对数动差。通过沿垂直轮廓评估复数扩展的MGF,我们获得了精确的积分表达式,从而避免了显式概率密度和高阶导数的需要。我们通过对称柯西主值建立了负分数动差的条件,包括分布在中心点处没有点质量的要求。我们通过正态-逆高斯分布和半连续复合泊松-伽马分布的应用,展示了该框架的理论范围和计算实用性。在后者情况下,该框架通过评估条件分数动差来处理边界处的点质量。

英文摘要

We introduce a general integral framework for computing fractional, complex, absolute, and logarithmic moments from the moment-generating function (MGF) under explicit regularity conditions. By evaluating a complex extension of the MGF along a vertical contour, we obtain exact integral expressions that bypass the need for explicit probability densities and high-order derivatives. We establish conditions for negative fractional moments using the symmetric Cauchy principal value, including the requirement that the distribution have no point mass at the centering point. We demonstrate the theoretical scope and computational practicality of the framework through applications to the normal-inverse Gaussian distribution and a semicontinuous compound Poisson-Gamma distribution. In the latter case, the framework handles point masses at the boundary by evaluating conditional fractional moments.

2510.15814 2026-06-05 stat.ML cs.LG

On Universality of Deep Equivariant Networks

关于深度等变网络的通用性

Marco Pacini, Mircea Petrache, Bruno Lepri, Shubhendu Trivedi, Robin Walters

AI总结 本文研究了等变神经网络的通用性问题,提出在分离约束下,通过全连接读出层可实现连续函数的近似,并引入了更严格的逐元素分离性准则,证明了足够深度或适当读出层可使等变网络在逐元素分离性范围内实现通用性。

详情
Journal ref
International Conference on Learning Representations (ICLR), 2026
Comments
Published as a conference paper at ICLR 2026
AI中文摘要

对于等变神经网络的通用性结果仍然很少。已有的结果通常仅在受限的设置中成立:要么依赖于常规或高阶张量表示,导致隐藏空间维度过高,要么针对专门的架构,通常局限于不变设置。本文提出了一种更一般性的结论。对于不变网络,我们在分离约束下建立了通用性定理,证明添加全连接读出层可使连续函数的近似在分离约束下实现。对于等变网络,其中结果更为稀少,我们证明标准分离性概念不足,并引入更严格的逐元素分离性准则。我们证明在足够深度或添加适当读出层的情况下,等变网络可在逐元素分离性范围内实现通用性。结合先前结果表明浅层模型无法实现通用性,我们的发现将深度和读出层识别为通用性的关键机制,同时提供了一个统一的视角,涵盖了并扩展了先前专门的结果。

英文摘要

Universality results for equivariant neural networks remain rare. Those that do exist typically hold only in restrictive settings: either they rely on regular or higher-order tensor representations, leading to impractically high-dimensional hidden spaces, or they target specialized architectures, often confined to the invariant setting. This work develops a more general account. For invariant networks, we establish a universality theorem under separation constraints, showing that the addition of a fully connected readout layer secures approximation within the class of separation-constrained continuous functions. For equivariant networks, where results are even scarcer, we demonstrate that standard separability notions are inadequate and introduce the sharper criterion of $\textit{entry-wise separability}$. We show that with sufficient depth or with the addition of appropriate readout layers, equivariant networks attain universality within the entry-wise separable regime. Together with prior results showing the failure of universality for shallow models, our findings identify depth and readout layers as a decisive mechanism for universality, additionally offering a unified perspective that subsumes and extends earlier specialized results.

2510.06789 2026-06-05 stat.ME

Model-free Rank Aggregation in the Presence of Rater Heterogeneity: A Maximum Score Approach

无模型的排名聚合在评分者异质性存在下的研究:最大分数方法

Haoran Zhang, Yunxiao Chen

AI总结 本文研究了在评分者异质性存在下的排名聚合问题,提出了一种无模型方法,能够处理评分者之间高度异质的偏好分布,并涵盖弱随机传递性作为特殊情况。通过证明估计器的一致性,展示了不一致对比例(Kendall tau)在评分者数量趋于无穷时收敛于零。进一步推导了基于Kendall tau的性能指标的上下界,并在某些渐进行为下,这些界限在对数因子范围内一致,因此估计器几乎是最小最大最优的。这些结果通过分析U经验过程的收敛行为获得,所开发的新技术结果可能具有独立的理论兴趣。通过广泛模拟和体育运动员排名及调查偏好聚合的应用验证了该方法的实用性。

详情
AI中文摘要

本文通过从评分者评分中获得的多向比较数据的视角,研究了排名聚合问题。与传统的参数框架,如Bradley-Terry和Plackett-Luce模型不同,我们提出了一种无模型方法,能够处理评分者之间高度异质的偏好分布,并涵盖弱随机传递性作为特殊情况。我们通过证明估计器的一致性建立了所提出估计器的理论基础,证明了不一致对比例(Kendall tau)在评分者数量趋于无穷时以概率收敛于零。进一步,我们推导了基于Kendall's tau的性能指标的上下界。在某些渐进行为下,这些界限在对数因子范围内一致,因此估计器几乎是最小最大最优的。这些结果通过分析U-经验过程的收敛行为获得;为此分析开发的新技术结果可能具有独立的理论兴趣。通过广泛的模拟和体育运动员排名及调查偏好聚合的应用验证了该方法的实用性。

英文摘要

This paper investigates the rank aggregation problem through the lens of multi-way comparison data derived from rater scores. Departing from traditional parametric frameworks, such as the Bradley-Terry and Plackett-Luce models, we propose a model-free method that accommodates highly heterogeneous preference distributions across raters and encompasses weak stochastic transitivity in pairwise comparisons as a special case. We establish the theoretical foundations of the proposed estimator by proving its consistency, demonstrating that the proportion of discordant pairs (Kendall tau) converges to zero in probability as the number of raters diverges. Furthermore, we derive upper and lower bounds for a performance metric based on Kendall's tau. In certain asymptotic regimes, these bounds coincide up to logarithmic factors, so the estimator is nearly minimax optimal. These results are obtained by analyzing the convergence behavior of a U-empirical process; the novel technical results developed for this analysis may be of independent theoretical interest. The practical utility of our method is validated through extensive simulations and applications to sports player rankings and survey preference aggregation.

2505.01318 2026-06-05 stat.ME

Modeling Large Nonstationary Spatial Data with the Full-Scale Basis Graphical Lasso

用全尺度基图拉索方法建模大非平稳空间数据

Matthew LeDuc, William Kleiber, Tomoko Matsuo

AI总结 本文提出了一种结合隐含低秩过程和稀疏协方差模型的新方法,用于建模大非平稳空间数据,通过灵活的图高斯马尔可夫随机场模型对低秩组件系数进行建模,并结合全尺度近似和基图拉索方法,提出全尺度基图拉索方法(FSBGL),采用图拉索惩罚似然进行估计,通过差异凸方案优化,通过合成场和热层高分辨率模拟数据集验证,与现有空间模型相比,在有限训练数据下更能捕捉热层温度场的显著特征。

详情
AI中文摘要

我们提出了一种新的方法,用于建模大非平稳空间过程的数据集,该方法结合了隐含的低秩过程和稀疏协方差模型。低秩组件的系数被赋予了灵活的图高斯马尔可夫随机场模型。利用低秩和紧支撑协方差结构结合了全尺度近似和基图拉索;我们称这种新方法为全尺度基图拉索(FSBGL)。估计采用图拉索惩罚似然,通过差异凸方案进行优化。我们在合成场以及具有挑战性的高分辨率热层模拟数据集上展示了所提出的方法。在与现有空间模型的比较中,即使在可用训练数据有限的情况下,FSBGL在捕捉热层温度场的显著特征方面表现更好。

英文摘要

We propose a new approach for the modeling large datasets of nonstationary spatial processes that combines a latent low rank process and a sparse covariance model. The low rank component coefficients are endowed with a flexible graphical Gaussian Markov random field model. The utilization of a low rank and compactly-supported covariance structure combines the full-scale approximation and the basis graphical lasso; we term this new approach the full-scale basis graphical lasso (FSBGL). Estimation employs a graphical lasso-penalized likelihood, which is optimized using a difference-of-convex scheme. We illustrate the proposed approach on synthetic fields as well as with a challenging high-resolution simulation dataset of the thermosphere. In a comparison against state-of-the-art spatial models, the FSBGL performs better at capturing salient features of the thermospheric temperature fields, even with limited available training data.

2510.05085 2026-06-05 stat.ME

WOW: WAIC-Optimized Gating of Mixture Priors for External Data Borrowing

WOW: 用于外部数据借用的WAIC优化混合先验门控

Shouhao Zhou, Qiuxin Gao, Chenqi Fu, Yanxun Xu

AI总结 本文提出WOW方法,通过WAIC优化的门控策略在混合先验框架中评估外部与同期数据的兼容性,以减少不恰当的数据借用,提高临床试验的估计准确性与决策可靠性。

详情
AI中文摘要

利用贝叶斯混合先验整合外部数据已成为临床试验中的强大方法,具有显著提高试验效率的潜力。尽管现有方法如稳健元分析-预测(rMAP)和自适应混合(SAM)在分析可操作性和实际灵活性方面表现优异,但它们通常假定数据借用而未严格评估外部信息是否适合纳入。当外部和同期数据不一致时,过度借用会导致估计偏差并得出误导性结论。为了解决这一问题,我们引入WOW,一种基于Kullback-Leibler的门控策略,由广泛应用信息准则(WAIC)引导。在混合先验框架内,WAIC优化加权(WOW)对外部和同期试验数据进行初步兼容性评估,以确定是否适合借用。只有在门控标准满足的情况下,才能进行后续的混合先验过程,使用用户指定的固定或自适应权重确定借用量。模拟研究证明,在贝叶斯混合先验借用方法之前整合WOW策略可有效减少过度借用并提高估计准确性。一个真实数据示例进一步突显了所提出门控后再借用策略的可行性和可解释性。通过提供一种实用的防止不适当借用的保障,WOW增强了混合先验方法的可靠性,并支持临床试验中的更好决策。

英文摘要

The integration of external data using Bayesian mixture priors has become a powerful approach in clinical trials, offering significant potential to improve trial efficiency. Despite their strengths in analytical tractability and practical flexibility, existing methods such as the robust meta-analytic-predictive (rMAP) and self-adapting mixture (SAM) often presume borrowing without rigorously assessing whether external information is appropriate to incorporate. When external and concurrent data are discordant, excessive borrowing can bias estimation and lead to misleading conclusions. To address this, we introduce WOW, a Kullback-Leibler-based gating strategy guided by the widely applicable information criterion (WAIC). Within the mixture-prior framework, WAIC-Optimized Weighting (WOW) conducts a preliminary compatibility assessment between external and concurrent trial data to determine eligibility for borrowing. Only if this gating criterion is satisfied does borrowing proceed; a downstream mixture prior procedure, using user-specified fixed or adaptive weights, can then be applied to determine the amount of borrowing. Simulation studies demonstrate that incorporating the WOW strategy before Bayesian mixture prior borrowing methods effectively mitigates excessive borrowing and improves estimation accuracy. A real-data illustration further highlights the feasibility and interpretability of the proposed gate-then-borrow strategy. By providing a practical safeguard against inappropriate borrowing, WOW strengthens the reliability of mixture-prior methods and supports better decision-making in clinical trials.

2508.01164 2026-06-05 math.ST math.PR stat.TH

M-estimation for Gaussian processes with time-inhomogeneous drifts from high-frequency data

具有时间非齐次漂移的高频率数据的高斯过程M估计

Yasutaka Shimizu

AI总结 本文提出了一种基于对比的估计方法,用于估计具有时间非齐次漂移的高斯过程,该方法在高频采样下具有高效计算能力,并证明了估计量的一致性和渐近正态性,同时展示了非标准收敛速度和对协方差核参数的修正方法。

详情
AI中文摘要

我们提出了一种用于具有时间非齐次漂移的高斯过程的对比估计方法,该过程在高频采样下被观测。该过程被建模为确定性漂移函数和具有参数核的平稳高斯成分之和。我们的方法从相邻增量构造了一个局部对比函数,这避免了大协方差矩阵的求逆,从而允许高效计算。我们在一般遍历条件下证明了所得到估计量的一致性和渐近正态性。我们方法的一个显著特点是漂移估计器具有非标准收敛速度,这源于漂移密度的直接黎曼可积性。这突显了与标准估计框架的根本区别。此外,当局部对比无法识别协方差核中的所有参数时,可以纳入矩修正以恢复可识别性。所提出的框架简单、灵活,并特别适合具有时间非齐次结构的高频推断。

英文摘要

We propose a contrast-based estimation method for Gaussian processes with time-inhomogeneous drifts, observed under high-frequency sampling. The process is modeled as the sum of a deterministic drift function and a stationary Gaussian component with a parametric kernel. Our method constructs a local contrast function from adjacent increments, which avoids inversion of large covariance matrices and allows for efficient computation. We prove consistency and asymptotic normality of the resulting estimators under general ergodicity conditions. A distinctive feature of our approach is that the drift estimator attains a nonstandard convergence rate, stemming from the direct Riemann integrability of the drift density. This highlights a fundamental difference from standard estimation regimes. Furthermore, when the local contrast fails to identify all parameters in the covariance kernel, moment-based corrections can be incorporated to recover identifiability. The proposed framework is simple, flexible, and particularly well suited for high-frequency inference with time-inhomogeneous structure.

2509.02971 2026-06-05 stat.ML cs.LG cs.NA math.NA math.PR

Scale-Adaptive Generative Flows for Multiscale Scientific Data

多尺度科学数据的自适应生成流

Yifan Chen, Eric Vanden-Eijnden

AI总结 本文提出了一种多尺度科学数据生成模型,通过设计噪声分布和插值计划,解决多尺度傅里叶谱数据中的数值挑战,提高了生成样本的质量和效率。

详情
AI中文摘要

基于流的生成模型在处理具有多尺度傅里叶谱的科学数据时常常面临数值挑战,通常在细尺度上产生较大的误差。我们通过在流匹配和随机插值框架内,通过噪声分布和插值计划的原理性设计来解决这个问题。在函数空间中工作可以确保生成模型在分辨率细化时仍然定义良好;漂移的Lipschitz正则性对这种函数空间的良定义性和固定分辨率下的积分成本都很重要。核心观察是噪声应至少与目标分布一样粗糙——通过傅里叶谱衰减来衡量——以保持Lipschitz常数有限。对于已知细尺度结构的高斯和近高斯目标,匹配谱噪声比标准白噪声选择更有效。对于更复杂的非高斯目标,匹配谱噪声可能不足以应对噪声比数据粗糙时出现的终端时间刚性问题,我们提出自适应插值计划来缓解这种情况。在合成高斯随机场和随机Allen-Cahn和Navier-Stokes方程不变测度上的数值实验展示了该方法,并证明了其在传统方法基础上以更低计算成本生成高质量样本的能力。

英文摘要

Flow-based generative models can face numerical challenges on scientific data with multiscale Fourier spectra, often producing large errors at fine scales. We approach this problem within the flow matching and stochastic interpolants framework, through the principled design of noise distributions and interpolation schedules. Working in function space ensures that the generative model remains well defined as the resolution is refined; the Lipschitz regularity of the drift is important to both this function-space well-posedness and the integration cost at fixed resolution. The central observation is that the noise should be at least as rough as the target distribution -- measured by Fourier-spectrum decay -- in order to keep the Lipschitz constant finite. For Gaussian and near-Gaussian targets whose fine-scale structure is known, matched-spectrum noise improves numerical efficiency over standard white-noise choices. For more complex non-Gaussian targets, matched-spectrum noise may not be sufficient, and we propose scale-adaptive interpolation schedules to mitigate the terminal-time stiffness that arises when the noise is rougher than the data. Numerical experiments on synthetic Gaussian random fields and on invariant measures of the stochastic Allen--Cahn and Navier--Stokes equations illustrate the approach and demonstrate its ability to generate high-fidelity samples at lower computational cost than traditional approaches.

2508.11861 2026-06-05 stat.ME

A novel approach to generate distributions with applications to regression modeling

生成分布的新方法及其在回归建模中的应用

Subhankar Dutta, Roberto Vila, Terezinha K. A. Ribeiro

AI总结 本文提出了一种生成分布的新方法,通过引入额外参数提高适应性,从而构建了一类支持正实数的分布族。该方法的主要贡献是通过重新参数化尾部行为来调节尾部厚重度,并基于该分布族提出了两个新的回归模型,用于正连续数据的建模。

详情
Comments
35 pages, 7 figures
AI中文摘要

本文提出了一种向分布族中添加额外参数以提高适应性的新方法。该方法产生了一类支持正实数的分布族。所提出分布族的重要优势在于,额外参数在尾部行为方面具有明确的解释,提供了一种简单机制来调节尾部厚重度。我们进一步分析了其数学特性,如临界点、模态性、随机表示、可识别性、分位数、矩和截断矩。我们基于新提出的分布族的子模型,提出了两个新的回归模型,用于正连续数据,其中响应变量的分布以中位数重新参数化。我们使用最大似然方法估计参数,通过R中的gamlss包实现。所提出的回归模型应用于实际数据集,并通过分位数残差分析和信息准则展示了其相对于常用替代回归模型的优势。

英文摘要

A novel approach to adding an additional parameter to a family of distributions for better adaptability has been put forth. This approach yields a versatile class of distributions supported on the positive real line. An important advantage of the proposed family is that the additional parameter admits a clear interpretation in terms of tail behavior, providing a simple mechanism for modulating tail heaviness. We proceed to analyze its mathematical characteristics, such as critical points, modality, stochastic representation, identifiability, quantiles, moments, and truncated moments. We present two new regression models for positive continuous data based on submodels of the newly proposed family of distributions, in which the distribution of the response variable is reparameterized in terms of the median. We use the maximum likelihood method to estimate the parameters, which was implemented through the gamlss package in R. The proposed regression models were applied to a real dataset, and their advantages over common alternative regression models were demonstrated through quantile residual analysis and information criteria.

2508.06126 2026-06-05 stat.ME

IOCC: Aligning Semantic and Cluster Centers for Few-shot Short Text Clustering

IOCC: 语义与聚类中心对齐的少样本短文本聚类

Zhihao Yao

AI总结 本文提出IOCC方法,通过交互增强最优传输和中心感知对比学习模块,实现语义中心与聚类中心对齐,提升短文本聚类性能。

详情
Comments
arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission
AI中文摘要

在聚类任务中,构建清晰、分离良好的特征空间至关重要。然而,由于短文本表示的表达能力有限,传统方法难以识别真正反映每个类别底层语义的聚类中心,导致表示在次优方向上优化。为解决此问题,我们提出IOCC,一种新颖的少样本对比学习方法,实现聚类中心与语义中心的对齐。IOCC包含两个关键模块:交互增强最优传输(IEOT)和中心感知对比学习(CACL)。具体而言,IEOT将单个样本间的语义交互纳入传统最优传输问题,生成伪标签。基于这些伪标签,我们聚合高置信度样本以构建伪中心,近似语义中心。随后,CACL优化文本表示以接近对应的伪中心。随着训练进行,两个模块的协作逐渐缩小聚类中心与语义中心之间的差距。因此,模型将学习高质量分布,提升聚类性能。在八个基准数据集上的广泛实验表明,IOCC优于先前方法,在具有挑战性的生物医学数据集上实现了高达7.34%的提升,并在聚类稳定性和效率方面也表现出色。代码可在:https://anonymous.4open.science/r/IOCC-C438获取。

英文摘要

In clustering tasks, it is essential to structure the feature space into clear, well-separated distributions. However, because short text representations have limited expressiveness, conventional methods struggle to identify cluster centers that truly capture each category's underlying semantics, causing the representations to be optimized in suboptimal directions. To address this issue, we propose IOCC, a novel few-shot contrastive learning method that achieves alignment between the cluster centers and the semantic centers. IOCC consists of two key modules: Interaction-enhanced Optimal Transport (IEOT) and Center-aware Contrastive Learning (CACL). Specifically, IEOT incorporates semantic interactions between individual samples into the conventional optimal transport problem, and generate pseudo-labels. Based on these pseudo-labels, we aggregate high-confidence samples to construct pseudo-centers that approximate the semantic centers. Next, CACL optimizes text representations toward their corresponding pseudo-centers. As training progresses, the collaboration between the two modules gradually reduces the gap between cluster centers and semantic centers. Therefore, the model will learn a high-quality distribution, improving clustering performance. Extensive experiments on eight benchmark datasets show that IOCC outperforms previous methods, achieving up to 7.34\% improvement on challenging Biomedical dataset and also excelling in clustering stability and efficiency. The code is available at: https://anonymous.4open.science/r/IOCC-C438.

2503.08984 2026-06-05 math.PR math.ST stat.TH

Phase Transitions in Planted k-Factor Recovery

在植根k-因子恢复中的相变

Julia Gaudio, Colin Sandon, Jiaming Xu, Dana Yang

AI总结 本文研究了在Erdos-Renyi随机图G(n,λ/n)中推断一个k-因子(即一个 spanning k-正则图)的问题,发现当平均度λ超过临界阈值1/k时,推断问题从几乎精确恢复过渡到部分恢复。此外,当λ趋向于无穷大时,恢复的准确性趋于零。我们还刻画了线性时间迭代修剪算法的恢复精度,并证明当λ<1/k时该算法可以实现几乎精确恢复。分析中的关键组件是两步循环构造:首先通过局部邻域探索构建树,然后通过保留边进行喷洒连接。有趣的是,在证明几乎精确恢复不可能时,我们构造了Θ(n)多个大小为Θ(1)的小树,而在建立算法下界时,一个大小为Θ(√(n log n))的大树就足够。

详情
Comments
36 pages, 5 figures. Extended abstract appeared at the 2025 Conference on Learning Theory, Lyon, France; accepted for publication in the Annals of Applied Probability
AI中文摘要

本文研究了在Erdos-Renyi随机图G(n,λ/n)中推断一个k-因子(即一个 spanning k-正则图)的问题。我们证明当平均度λ超过临界阈值1/k时,推断问题从几乎精确恢复过渡到部分恢复。此外,当λ趋向于无穷大时,恢复的准确性趋于零。此外,我们刻画了线性时间迭代修剪算法的恢复精度,并证明当λ<1/k时该算法可以实现几乎精确恢复。分析中的关键组件是两步循环构造:首先通过局部邻域探索构建树,然后通过保留边进行喷洒连接。有趣的是,在证明几乎精确恢复不可能时,我们构造了Θ(n)多个大小为Θ(1)的小树,而在建立算法下界时,一个大小为Θ(√(n log n))的大树就足够。

英文摘要

This paper studies the problem of inferring a $k$-factor, specifically a spanning $k$-regular graph, planted within an Erdos-Renyi random graph $G(n,λ/n)$. We show that as the average degree $λ$ surpasses the critical threshold of $1/k$, the inference problem undergoes a transition from almost exact recovery to partial recovery. Moreover, as $λ$ tends to infinity, the accuracy of recovery diminishes to zero. In addition, we characterize the recovery accuracy of a linear-time iterative pruning algorithm and show that it achieves almost exact recovery when $λ< 1/k$. A key component of our analysis is a two-step cycle construction: we first build trees through local neighborhood exploration and then connect them by sprinkling using reserved edges. Interestingly, for proving impossibility of almost exact recovery, we construct $Θ(n)$ many small trees of size $Θ(1)$, whereas for establishing the algorithmic lower bound, a single large tree of size $Θ(\sqrt{n\log n})$ suffices.

2501.14291 2026-06-05 cs.LG stat.ML

Advances in Temporal Point Processes: Bayesian, Neural, and LLM Approaches

时间点过程的进展:贝叶斯、神经网络和大语言模型方法

Feng Zhou, Quyu Kong, Jie Qiao, Cheng Wan, Yixuan Zhang, Ruichu Cai

AI总结 本文综述了时间点过程的最新研究,从贝叶斯、深度学习和大语言模型三个角度探讨了模型设计、参数估计以及经典应用领域,并展望了未来的研究挑战和方向。

详情
AI中文摘要

时间点过程(TPPs)是用于表征连续时间中事件序列的随机过程模型。传统统计TPPs已有长久的历史,众多模型已被提出并在不同领域中成功应用。近年来,深度学习的进步推动了神经TPPs的发展,使捕捉复杂时间动态变得更加灵活和表达性更强。大语言模型(LLMs)的出现进一步引发了关注,通过利用其丰富的上下文理解能力,为事件序列建模和分析提供了新的可能性。本文从贝叶斯、深度学习和LLM三个视角全面回顾了最近关于TPPs的研究。我们首先回顾了TPPs的基本概念,随后深入讨论了这三种框架中的模型设计和参数估计技术。我们还回顾了TPPs的经典应用领域,以突出其实际相关性。最后,我们概述了TPPs面临的挑战和未来研究的有前景方向。

英文摘要

Temporal point processes (TPPs) are stochastic process models used to characterize event sequences occurring in continuous time. Traditional statistical TPPs have a long-standing history, with numerous models proposed and successfully applied across diverse domains. In recent years, advances in deep learning have spurred the development of neural TPPs, enabling greater flexibility and expressiveness in capturing complex temporal dynamics. The emergence of large language models (LLMs) has further sparked excitement, offering new possibilities for modeling and analyzing event sequences by leveraging their rich contextual understanding. This survey presents a comprehensive review of recent research on TPPs from three perspectives: Bayesian, deep learning, and LLM approaches. We begin with a review of the fundamental concepts of TPPs, followed by an in-depth discussion of model design and parameter estimation techniques in these three frameworks. We also revisit classic application areas of TPPs to highlight their practical relevance. Finally, we outline challenges and promising directions for future research.

2404.15253 2026-06-05 stat.CO math.ST stat.ML stat.TH

GIST: Gibbs self-tuning for locally adaptive Hamiltonian Monte Carlo

GIST:基于Gibbs自调制的局部自适应Hamilton-Monte Carlo

Nawaf Bou-Rabee, Bob Carpenter, Milo Marsden

AI总结 本文提出了一种新的灵活框架,通过基于每一步的位置和动量进行Gibbs采样来构建局部自适应Hamilton-Monte Carlo(HMC)采样器。GIST方法能够适应路径长度的采样,涵盖了随机HMC、多项式HMC、无U-turn采样器(NUTS)和Apogee-to-Apogee路径采样器等特殊情况。

详情
Journal ref
Statist. Surv. 20 135 - 179, 2026
Comments
for companion code, see https://github.com/bob-carpenter/adaptive-hmc
AI中文摘要

我们介绍了一种新的灵活框架,用于通过基于每一步的位置和动量进行Gibbs采样来构建局部自适应Hamilton-Monte Carlo(HMC)采样器。为了适应路径长度的采样,我们的Gibbs自调制(GIST)方法涵盖了随机HMC、多项式HMC、无U-turn采样器(NUTS)以及Apogee-to-Apogee路径采样器等特殊情况。我们通过一种新的NUTS替代方法来展示GIST框架,该方法在高维、病态高斯分布的精确哈密顿量以及各种不同模型的leapfrog积分器上进行了评估。

英文摘要

We introduce a novel and flexible framework for constructing locally adaptive Hamiltonian Monte Carlo (HMC) samplers by Gibbs sampling the algorithm's tuning parameters conditionally based on the position and momentum at each step. For adaptively sampling path lengths, our Gibbs self-tuning (GIST) approach encompasses randomized HMC, multinomial HMC, the No-U-Turn Sampler (NUTS), and the Apogee-to-Apogee Path Sampler as special cases. We exemplify the GIST framework with a novel alternative to NUTS for locally adapting path lengths, evaluated with an exact Hamiltonian for a high-dimensional, ill-conditioned Gaussian measure and with the leapfrog integrator for a suite of diverse models.

2412.04605 2026-06-05 econ.EM stat.ML

Semiparametric Bayesian Difference-in-Differences

半参数贝叶斯差分-in-差分

Christoph Breunig, Ruixuan Liu, Zhengfei Yu

AI总结 本文研究了在差分-in-差分研究设计中半参数贝叶斯推断平均处理效应(ATT)的方法,提出了两种新的贝叶斯方法并证明了其频率有效性。

详情
AI中文摘要

本文研究了在差分-in-差分研究设计中半参数贝叶斯推断平均处理效应(ATT)的方法。我们提出了两种新的贝叶斯方法,具有频率有效性。第一种是半参数贝叶斯结果回归,其中我们对控制组的条件均值函数放置高斯过程先验。第二种方法是一种双重鲁棒的贝叶斯程序,调整条件均值函数的先验分布,并随后修正由此产生的ATT的后验分布。我们为这两种方法证明了新的半参数伯恩斯坦-冯·米泽斯(BvM)定理。蒙特卡罗模拟和实证应用显示,所提出的贝叶斯差分-in-差分方法在有限样本性能方面表现良好。我们还提出了经典差分-in-差分方法的扩展,纳入了聚类数据和多时期 staggered entry。

英文摘要

This paper studies semiparametric Bayesian inference for the average treatment effect on the treated (ATT) within the difference-in-differences (DiD) research design. We propose two new Bayesian methods with frequentist validity. The first one is the semiparametric Bayesian outcome regression, where we place a Gaussian process prior on the conditional mean function of the control group. The second method is a doubly robust Bayesian procedure that adjusts the prior distribution of the conditional mean function and subsequently corrects the posterior distribution of the resulting ATT. We prove new semiparametric Bernstein-von Mises (BvM) theorems for both proposals. Monte Carlo simulations and an empirical application demonstrate that the proposed Bayesian DiD methods exhibit strong finite-sample performance. We also present extensions of the canonical DiD approach, incorporating clustered data and staggered entry with multiple periods.

2506.03232 2026-06-05 stat.OT cs.CY

Pivoting the paradigm: the role of spreadsheets in K-12 data science

转变范式:电子表格在K-12数据科学中的作用

Oren Tirschwell, Nicholas Jon Horton

AI总结 本文探讨了电子表格在K-12教育中作为数据科学工具的潜力,回顾了相关框架和标准,并提出了数据驱动的数据技能以培养学生的数据素养和计算能力。

详情
AI中文摘要

电子表格工具被广泛用于K-12学生和教师中。尽管电子表格并非适用于所有类型的统计分析,但它们在数据收集和组织中起着重要作用。从教育角度看,电子表格使数据可视化并易于交互,促进学生在数据探索、分析和计算中的参与。尽管不适用于所有任务,电子表格可以帮助K-12学生学习和练习数据和计算技能。本文1)展示了电子表格在K-12中的潜在用途;2)回顾了与K-12数据工具相关的重要框架和标准;3)提出了数据驱动的数据技能以帮助发展数据素养和计算能力。我们提供了一些示例活动,识别了采用中的挑战和障碍,建议了教学方法以帮助教师和学生克服学习曲线,并讨论了专业发展的重要性,以促进更深入地使用电子表格进行数据科学和STEM学科的学习。

英文摘要

Spreadsheet tools are widely accessible to and commonly used by K-12 students and teachers. While spreadsheets are not ideal for many types of statistical analysis, they have an important role in data collection and organization. From a pedagogical standpoint, spreadsheets make data visible and easy to interact with, facilitating student engagement in data exploration, analysis, and computation. Though not suitable for all tasks, spreadsheets can facilitate learning and practicing data and computing skills for K-12 students. This paper 1) demonstrates the potential utility of spreadsheets in K-12; 2) reviews prior frameworks and standards that are relevant for K-12 data tools; and 3) proposes data-driven data skills to help develop data acumen and computational fluency. We provide some example activities, identify challenges and barriers to adoption, suggest pedagogical approaches to ease the learning curve for instructors and students, and discuss the need for professional development to facilitate deeper use of spreadsheets for data science and STEM disciplines.

2506.00666 2026-06-05 stat.ME

Unbiased estimation in new Gini index extensions under gamma distributions, with application to real income data

在伽马分布下新吉尼指数扩展的无偏估计,及其在实际收入数据中的应用

Roberto Vila, Helton Saulo

AI总结 本文提出两种灵活的吉尼指数扩展方法,用于对分布的下尾和上尾进行位置导向的不平等评估,并通过蒙特卡洛模拟和南美国家2023年人均GDP数据展示了其在实际应用中的有效性。

详情
Comments
18 pages, 3 figures
AI中文摘要

在本文中,我们介绍了两种灵活的经典吉尼指数扩展方法,称为扩展下吉尼指数和上吉尼指数。所提出的度量方法基于样本大小m≥2中的观测值与最小和最大顺序统计量之间的差异,并在m=2时退化为经典吉尼系数。与传统吉尼型度量不同,它们提供了相对于分布下尾和上尾的位置导向的不平等评估。我们在温和的正则条件下建立了所提出估计量的一致性和渐近正态性。对于伽马分布总体,我们推导了其期望的精确表达式,并证明了其无偏性,从而扩展了Deltas (2003)和Baydil等人(2025)之前的结果。通过蒙特卡洛模拟研究了估计量的有限样本性能,并通过2023年南美国家人均GDP数据的应用展示了所提出度量的实际实用性。结果表明,扩展的下和上吉尼指数比传统吉尼型度量提供了更丰富和更有信息的不平等刻画。

英文摘要

In this paper, we introduce two flexible extensions of the classical Gini index, referred to as the extended lower and upper Gini indices. The proposed measures are based on the differences between an observation and the minimum and maximum order statistics in samples of size $m\geqslant 2$ and reduce to the classical Gini coefficient when $m=2$. Unlike conventional Gini-type measures, they provide a position-oriented assessment of inequality relative to the lower and upper tails of the distribution. We establish the consistency and asymptotic normality of the proposed estimators under mild regularity conditions. For gamma-distributed populations, we derive exact expressions for their expectations and prove their unbiasedness, thereby extending previous results of [Deltas, G. 2003. The small-sample bias of the gini coefficient: Results and implications for empirical research. Review of Economics and Statistics 85:226-234] and [Baydil, B., de la Peña, V. H., Zou, H., and Yao, H. 2025. Unbiased estimation of the gini coefficient. Statistics & Probability Letters 222:110376]. The finite-sample performance of the estimators is investigated through Monte Carlo simulations, and an application to 2023 GDP per capita data from South American countries illustrates the practical usefulness of the proposed measures. The results show that the extended lower and upper Gini indices provide a richer and more informative characterization of inequality than traditional Gini-type measures.

2506.00188 2026-06-05 cs.LG stat.ML

Cluster-Aware Causal Mixer for Online Anomaly Detection in Multivariate Time Series

基于聚类的因果混合器用于多变量时间序列的在线异常检测

Md Mahmuddun Nabi Murad, Yasin Yilmaz

AI总结 本文提出了一种基于聚类的因果混合器,用于多变量时间序列的在线异常检测,通过聚类处理通道间的相关性,结合因果混合器保持时间因果性,并开发了序列异常评分方法以提高检测准确性。

详情
AI中文摘要

在时间序列数据中早期和准确地检测异常至关重要,因为假阳性和漏检带来的风险很大。虽然基于MLP的混合模型在时间序列分析中显示出潜力,但它们在数据处理过程中不维护时间因果性。此外,现实中的多变量时间序列通常包含众多通道,具有多样的通道间相关性。重构时间序列中的虚假相关性导致表示噪声,从而导致检测不准确。此外,忽略时间连续性的异常评分方法可能会误导连续检测。为了解决这些挑战,我们提出了一种多变量时间序列异常检测的基于聚类的因果混合器。根据相关性将通道分组为集群,并通过专用嵌入层对每个集群进行嵌入。引入因果混合器以在保持时间因果性的同时整合信息。我们进一步开发了一种序列异常评分方法,该方法在时间上累积证据并细化异常边界。我们提出的模型以在线方式运行,使其适合实时时间序列异常检测。在六个公开基准数据集上的实验评估表明,所提出的方法在性能上始终优于其他方法。

英文摘要

Early and accurate detection of anomalies in time-series data is critical due to the substantial risks associated with false or missed detections. While MLP-based mixer models have shown promise in time-series analysis, they do not maintain temporal causality during data processing. Moreover, real-world multivariate time series often contain numerous channels with diverse inter-channel correlations. Spurious correlations in the reconstructed time series lead to noisy representations, resulting in inaccurate anomaly detection. In addition, anomaly scoring methods that ignore temporal continuity can mislead sequential detection. To address these challenges, we propose a cluster-aware causal mixer for multivariate time-series anomaly detection. Channels are grouped into clusters based on their correlations, and each cluster is embedded through a dedicated embedding layer. A causal mixer is introduced to integrate information while maintaining temporal causality. We further develop a sequential anomaly-scoring method that accumulates evidence over time and refines anomaly boundaries. Our proposed model operates in an online fashion, making it suitable for real-time time-series anomaly detection. Experimental evaluations across six public benchmark datasets demonstrate that the proposed approach consistently achieves superior performance.

2505.16311 2026-06-05 stat.ML cs.LG stat.ME

Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions

生成器介导的老虎机:面向生成式人工智能的自适应干预的汤普森采样

Marc Brooks, Gabriel Durham, Kihyuk Hong, Ambuj Tewari

AI总结 本文提出了一种生成器介导的老虎机算法(GAMBITTS),用于解决生成式人工智能(GenAI)驱动的自适应干预问题。该算法通过建模治疗和奖励生成过程,利用观察到的治疗信息加速策略学习,并在模拟研究中优于传统算法。

详情
Journal ref
Advances in Neural Information Processing Systems 38 (NeurIPS 2025)
Comments
39 pages, 12 figures
AI中文摘要

近期生成式人工智能(GenAI)模型的进步使生成个性化内容成为可能,该内容能够适应最新的用户情境。尽管个性化决策系统通常采用老虎机建模,但GenAI的引入为经典序列学习问题带来了新的结构。在GenAI驱动的干预中,智能体选择查询,但环境会经历由生成模型产生的随机响应。标准老虎机方法并未显式考虑这种结构,其中动作仅通过随机、观察到的治疗影响奖励。我们引入生成器介导的老虎机-汤普森采样(GAMBITTS),一种针对这种动作/治疗分割设计的老虎机方法,以移动健康干预中的大型语言模型生成文本作为动机案例。GAMBITTS显式建模治疗和奖励生成过程,利用所交付的治疗信息,相对于标准方法加速策略学习。我们通过分解治疗和奖励中的不确定性来源,建立了GAMBITTS的遗憾界,并识别了其在某些条件下优于标准老虎机方法的保证条件。在模拟研究中,GAMBITTS通过利用观察到的治疗更准确地估计预期奖励,始终优于传统算法。

英文摘要

Recent advances in generative artificial intelligence (GenAI) models have enabled the generation of personalized content that adapts to up-to-date user context. While personalized decision systems are often modeled using bandit formulations, the integration of GenAI introduces new structure into otherwise classical sequential learning problems. In GenAI-powered interventions, the agent selects a query, but the environment experiences a stochastic response drawn from the generative model. Standard bandit methods do not explicitly account for this structure, where actions influence rewards only through stochastic, observed treatments. We introduce generator-mediated bandit-Thompson sampling (GAMBITTS), a bandit approach designed for this action/treatment split, using mobile health interventions with large language model-generated text as a motivating case study. GAMBITTS explicitly models both the treatment and reward generation processes, using information in the delivered treatment to accelerate policy learning relative to standard methods. We establish regret bounds for GAMBITTS by decomposing sources of uncertainty in treatment and reward, identifying conditions where it achieves stronger guarantees than standard bandit approaches. In simulation studies, GAMBITTS consistently outperforms conventional algorithms by leveraging observed treatments to more accurately estimate expected rewards.

2411.02123 2026-06-05 stat.ME

Uncertainty quantification and multi-stage variable selection for personalized treatment regimes

不确定性量化与多阶段变量选择用于个性化治疗方案

Jiefeng Bi, Matteo Borrotti, Bernardo Nipoti

AI总结 本文提出了一种贝叶斯模型,用于优化动态治疗方案,通过处理反事实变量来处理最优决策序列的不确定性,并引入新的尖峰- slab先验来实现多阶段变量选择,以提高高维协变量的处理能力。

详情
AI中文摘要

动态治疗方案是一系列医疗决策,能够适应患者随时间变化的临床状况。为了促进个性化护理,必须评估每个可用治疗方案对特定患者最优的概率,同时识别决定最佳治疗序列的关键预后因素。由于通常可用的个体预后因素数量不断增加,这一任务变得更加具有挑战性。为应对这些挑战,我们提出了一种贝叶斯模型,用于优化动态治疗方案,该模型解决了识别最优决策序列的不确定性,并通过降维技术来处理高维个体协变量。第一个任务通过适当扩展模型来处理反事实变量实现。对于第二个任务,我们引入了一种新的尖峰- slab先验类,用于多阶段选择显著因素,以促进阶段间的信息共享。所提出方法的有效性通过广泛的模拟研究和严重急性动脉高血压临床试验数据得到验证。

英文摘要

A dynamic treatment regime is a sequence of medical decisions that adapts to the evolving clinical status of a patient over time. To facilitate personalized care, it is crucial to assess the probability of each available treatment option being optimal for a specific patient, while also identifying the key prognostic factors that determine the optimal sequence of treatments. This task has become increasingly challenging due to the growing number of individual prognostic factors typically available. In response to these challenges, we propose a Bayesian model for optimizing dynamic treatment regimes that addresses the uncertainty in identifying optimal decision sequences and incorporates dimensionality reduction to manage high-dimensional individual covariates. The first task is achieved through a suitable augmentation of the model to handle counterfactual variables. For the second, we introduce a novel class of spike-and-slab priors for the multi-stage selection of significant factors, to favor the sharing of information across stages. The effectiveness of the proposed approach is demonstrated through extensive simulation studies and illustrated using clinical trial data on severe acute arterial hypertension.

2402.12825 2026-06-05 stat.ME

Quasi-maximum likelihood estimation for scalable ARMA models

可扩展ARMA模型的准最大似然估计

Yuchang Lin, Wenyu Li, Qianqian Zhu

AI总结 本文提出了一种用于可扩展ARMA模型的准最大似然估计方法,解决了传统VARMA模型在高维设置下的计算效率问题,并提供了统计上更高效的估计方法及模型选择工具。

详情
Comments
65 pages, 2 figures, 5 tables
AI中文摘要

最近提出的可扩展ARMA模型在保持传统VARMA模型简洁性的同时实现了更高的计算可行性。然而,现有研究仅限于高维设置下的正则化最小二乘估计(LSE),这不仅在统计上效率较低,而且需要子高斯假设以保证理论保证。此外,它仍然缺乏实际应用中的推断工具。为了填补这一空白,我们为可扩展ARMA模型开发了准最大似然估计(QMLE)框架。其渐近正态性在有限四阶矩条件下得以建立,并且我们正式证明了其在统计效率上优于LSE。我们还引入了高效的块坐标下降算法用于计算,并引入了一种一致的贝叶斯信息准则用于模型选择。模拟研究验证了我们方法的有限样本性能,而对六个宏观经济指标的实证应用则展示了其实际用途。

英文摘要

The recently proposed scalable ARMA model preserves the parsimony of traditional VARMA models while achieving greater computational tractability. However, existing studies are limited to regularized least squares estimation (LSE) for high-dimensional settings, which is not only statistically less efficient but also requires the sub-Gaussian assumption for its theoretical guarantees. Moreover, it still lacks inference tool for real applications. To fill this gap, we develop a quasi-maximum likelihood estimation (QMLE) framework for scalable ARMA models. Its asymptotic normality is established under a finite fourth order moment condition, and we formally prove its asymptotic efficiency gain over LSE. We also introduce an efficient block coordinate descent algorithm for computation and a consistent Bayesian information criterion for model selection. Simulation studies validate the finite-sample performance of our methodology, and an empirical application to six macroeconomic indicators demonstrates its practical utility.

2311.07565 2026-06-05 cs.LG stat.ML

Exploration via linearly perturbed loss minimisation

通过线性扰动损失最小化进行探索

David Janz, Shuai Liu, Alex Ayoub, Csaba Szepesvári

AI总结 本文提出了一种基于线性扰动损失的探索方法EVILL,通过求解线性扰动的正则化负对数似然函数的最小化问题,解释了随机奖励扰动为何能产生有效的多臂老虎机算法,并展示了数据依赖扰动如何使EVILL在理论和实践中达到与Thompson采样类参数扰动方法相当的性能。

详情
Comments
Updated with erratum note: Appendix I contains a gap in the proof; all main-paper claims remain valid via the corrected argument of Perneczky, Abeille & Janz (2026, arXiv:2606.00431)
AI中文摘要

我们引入了通过线性损失扰动进行探索(EVILL),一种用于结构化随机老虎机问题的随机探索方法,其通过求解线性扰动的正则化负对数似然函数的最小化问题来工作。我们证明,在一般线性老虎机的情况下,EVILL简化为扰动历史探索(PHE),一种通过在随机扰动的奖励上进行训练来实现探索的方法。通过这样做,我们提供了一个简单清晰的解释,说明何时以及为什么随机奖励扰动会产生有效的老虎机算法。我们提出了之前PHE类型方法中未出现的数据依赖扰动,使EVILL能够匹配Thompson-sampling风格的参数扰动方法的性能,理论和实践中均如此。此外,我们展示了在一般线性老虎机之外的一个例子,其中PHE导致不一致的估计,从而产生线性遗憾,而EVILL仍然表现良好。与PHE一样,EVILL可以通过几行代码实现。

英文摘要

We introduce exploration via linear loss perturbations (EVILL), a randomised exploration method for structured stochastic bandit problems that works by solving for the minimiser of a linearly perturbed regularised negative log-likelihood function. We show that, for the case of generalised linear bandits, EVILL reduces to perturbed history exploration (PHE), a method where exploration is done by training on randomly perturbed rewards. In doing so, we provide a simple and clean explanation of when and why random reward perturbations give rise to good bandit algorithms. We propose data-dependent perturbations not present in previous PHE-type methods that allow EVILL to match the performance of Thompson-sampling-style parameter-perturbation methods, both in theory and in practice. Moreover, we show an example outside generalised linear bandits where PHE leads to inconsistent estimates, and thus linear regret, while EVILL remains performant. Like PHE, EVILL can be implemented in just a few lines of code.

2403.00965 2026-06-05 stat.AP cs.AI cs.LG

Binary Gaussian Copula Synthesis: an LLM-powered data augmentation framework for early dialysis prediction in chronic kidney disease

二元高斯卷积合成:一种基于LLM的数据增强框架,用于慢性肾病早期透析预测

Hamed Khosravi, Milad Khanchi, Mobina Noori, Srinjoy Das, Abdullah Al-Mamun, Imtiaz Ahmed

AI总结 本文提出Binary Gaussian Copula Synthesis (BGCS),一种专为二元临床数据设计的两阶段数据增强方法,通过生成合成少数类样本并过滤不合理的样本,提高了早期透析预测的性能。

详情
AI中文摘要

只有极少数慢性肾病(CKD)患者会进展到透析,这导致了严重的类别不平衡,限制了机器学习模型在早期透析预测中的性能。这一挑战进一步加剧了电子健康记录(EHR)数据的二元结构,而现有的大多数增强方法并未为此设计。我们提出了Binary Gaussian Copula Synthesis (BGCS),一种专为二元临床数据设计的两阶段数据增强方法。BGCS首先使用高斯卷积框架生成合成少数类样本,该框架明确建模二元特征之间的成对依赖关系,然后应用微调的GPT-2分类器过滤出临床上不合理的样本后再进行训练。我们在一个包含15,169名CKD患者的真实世界EHR数据集中评估了BGCS,该数据集来自西弗吉尼亚州,收集时间从2008年到2022年。我们将其与SMOTE、CTGAN和标准高斯卷积在四个机器学习分类器上进行了基准测试,共进行了25次独立运行。BGCS在所有比较方法中表现一致,实现了90天透析预测的最高少数类召回率,不同分类器的中位数值范围从0.78到0.87,且在真实数据上的分布忠实度最强,特征的均值p值为0.68。表现最好的BGCS增强模型被集成到一个可解释的决策树基于的临床决策支持系统中,用于透析风险分层,其中电解质失衡、心血管合并症和肾脏监测指标成为最显著的预测特征。这些发现表明,为二元EHR数据的结构特性设计的增强方法可以显著提高早期透析风险预测,并支持开发可解释的临床决策支持工具用于CKD护理。

英文摘要

Only a small fraction of patients with chronic kidney disease (CKD) progress to dialysis, creating severe class imbalance that limits the performance of machine learning models for early dialysis prediction. This challenge is compounded by the binary structure of electronic health record (EHR) data, for which most existing augmentation methods were not designed. We propose Binary Gaussian Copula Synthesis (BGCS), a two-stage data augmentation method tailored to binary clinical data. BGCS first generates synthetic minority-class samples using a Gaussian copula framework that explicitly models pairwise dependencies among binary features, then applies a fine-tuned GPT-2 classifier to filter out clinically implausible samples before training. We evaluated BGCS on a real-world EHR dataset of 15,169 patients with CKD from West Virginia collected between 2008 and 2022, benchmarking it against SMOTE, CTGAN, and standard Gaussian Copula across four machine learning classifiers over 25 independent runs. BGCS consistently outperformed all comparison methods, achieving the highest minority-class recall for 90-day dialysis prediction, with median values ranging from 0.78 to 0.87 across classifiers, and the strongest distributional fidelity to real data, with a mean p-value of 0.68 across features. The best-performing BGCS-augmented model was integrated into an interpretable decision tree-based clinical decision support system for dialysis risk stratification, with electrolyte imbalances, cardiovascular comorbidities, and renal monitoring indicators emerging as the most influential predictive features. These findings suggest that augmentation methods designed for the structural properties of binary EHR data can meaningfully improve early dialysis risk prediction and support the development of interpretable clinical decision-support tools for CKD care.

2312.01265 2026-06-05 math.PR stat.ME

The optimal sub-Gaussian normalisation for randomised monotone functions

随机单调函数的最优亚高斯归一化

Thomas Anton, Rabee Tourky

AI总结 本文研究了随机单调函数的最优亚高斯归一化问题,通过分析有限样本大小下的概率不等式,推导出归一化尺度与对数函数之间的紧密关系。

详情
Comments
The updated paper refines mathematical results to be significantly sharper. Methods are unchanged, specific corrections are implemented. All empirical applications and field experiment results have been removed. Sutanuka Roy is no longer an author as they are writing a separate empirical paper. Thomas Anton is now at Columbia; Rabee Tourky remains at ANU
AI中文摘要

令$\mathcal{M}$表示从$\mathbb{R}$到$[0,1]$的随机单调函数类,令$U_{\mathcal{M}}\colon \mathbb{R}_+ o \mathbb{R}_+$为最小函数,使得对于每一个成员$f_Z$具有有限有效样本大小$η_f$和任意正数$\varepsilon$,有$$\mathbb{P}\left\{ \sqrt{η_f}\, \sup_{t\in\mathbb{R}} \left| f_Z(t) - \Exf{f_Z(t)} ight| \ge \varepsilon\sqrt{U_{\mathcal{M}}(η_f)} ight\} \le 2\mathrm{e}^{-2\varepsilon^2}$$成立。我们证明对于每个$x> 1$,$$\left| \sqrt{U_{\mathcal{M}}(x)} - \sqrt{\log_4 x} ight| \le 2 \min\!\left\{ 1,\, rac{2 \ln(\mathrm{e} + \ln x)}{\sqrt{\ln x}} ight\}\,.$$最优尺度$\sqrt{U_{\mathcal{M}}(x)}$在有限样本大小上与$ rac{1}{\sqrt{2\ln 2}}\sqrt{\ln x}$紧密相关。

英文摘要

Let $\mathcal{M}$ denote the class of randomised monotone functions on $\mathbb{R}$ with values in $[0,1]$, and let $U_{\mathcal{M}}\colon \mathbb{R}_+\to \mathbb{R}_+$ be the minimal function for which \[ \mathbb{P}\left\{ \sqrt{η_f}\, \sup_{t\in\mathbb{R}} \left| f_Z(t) - \Exf{f_Z(t)} \right| \ge \varepsilon\sqrt{U_{\mathcal{M}}(η_f)} \right\} \le 2\mathrm{e}^{-2\varepsilon^2} \] holds for every member $f_Z$ of $\mathcal{M}$ of finite effective sample size $η_f$ and every positive $\varepsilon$. We prove that for every $x> 1$, \[ \left| \sqrt{U_{\mathcal{M}}(x)} - \sqrt{\log_4 x} \right| \le 2 \min\!\left\{ 1,\, \frac{2 \ln(\mathrm{e} + \ln x)}{\sqrt{\ln x}} \right\}\,. \] The optimal scale $\sqrt{U_{\mathcal{M}}(x)}$ is sharply tied, uniformly at finite sample sizes, to $\frac{1}{\sqrt{2\ln 2}}\sqrt{\ln x}$.

2209.10975 2026-06-05 stat.CO

Liesel: A Python Framework for Graph-Based Bayesian Modeling and Customizable MCMC with Support for Generalized Additive Models

Liesel:一个用于基于图的贝叶斯建模和可定制MCMC的Python框架,支持广义加性模型

Hannes Riebl, Johannes Brachem, Thomas Kneib, Gianmarco Callegher, Paul F. V. Wiemann

AI总结 Liesel是一个用于贝叶斯建模和后验计算的Python框架,特别支持广义加性回归模型,旨在减少方法学工作的摩擦。该框架包含三个组件:Liesel-Model用于模型构建和可视化,Liesel-Goose提供模块化的MCMC框架,Liesel-GAM提供广义加性回归模型的高级构建块,共同使Liesel成为快速开发、测试和应用贝叶斯模型和MCMC算法的有效工具。

详情
Comments
38 pages, 9 figures, restructured, updated examples and applications, uses Liesel-GAM, updated for compatibility with Liesel v0.5
AI中文摘要

Liesel是一个用于贝叶斯模型构建和后验计算的Python框架,特别支持广义加性回归模型,旨在减少方法学工作的摩擦。该框架由三个组件组成。第一个组件,Liesel-Model,将模型表示为有向无环图,并支持交互式模型构建、修改、可视化、预测以及先验或后验预测模拟。第二个组件,Liesel-Goose,提供基于可重用内核的模块化MCMC框架,支持阻塞组件wise采样以及用户定义的Gibbs和Metropolis--Hastings更新,同时利用JAX进行自动微分、即时编译和硬件加速。第三个组件,Liesel-GAM,提供广义加性回归模型的高级构建块,并提供基于公式的模型规范、摘要、诊断和效应可视化功能。这些部分共同使Liesel成为快速开发、测试和应用贝叶斯模型和MCMC算法的有效工具。Liesel的模块化架构允许用户扩展软件以支持超出广义加性模型和MCMC的模型和推断算法,为广泛的统计研究提供了显著的灵活性。

英文摘要

Liesel is a Python framework for Bayesian model building and posterior computation with dedicated support for generalized additive regression models that is designed to reduce friction in methodological work. The framework consists of three components. The first component, Liesel-Model, represents models as directed acyclic graphs and supports interactive model construction, modification, visualization, prediction, and prior or posterior predictive simulation. The second component, Liesel-Goose, provides a modular MCMC framework based on reusable kernels and supports blocked componentwise sampling as well as user-defined Gibbs and Metropolis--Hastings updates, while leveraging JAX for automatic differentiation, just-in-time compilation, and hardware acceleration. The third component, Liesel-GAM, supplies high-level building blocks for generalized additive regression models and provides functionality for formula-based model specification, summaries, diagnostics, and effect visualization. Together, these parts make Liesel an effective tool for the rapid development, testing, and application of Bayesian models and MCMC algorithms. Liesel's modular architecture allows users to extend the software to models and inference algorithms beyond generalized additive models and MCMC, offering considerable flexibility for a wide spectrum of statistical research.

2305.12640 2026-06-05 cs.AI cs.LG stat.ML

Limited Resource Allocation in a Non-Markovian World: The Case of Maternal and Child Healthcare

在非马尔可夫世界中的有限资源分配:产科与儿童保健的案例

Panayiotis Danassis, Shresth Verma, Jackson A. Killian, Aparna Taneja, Milind Tambe

AI总结 本文研究了在非马尔可夫环境下如何通过时间序列方法优化资源分配,提出了一种新的时间序列臂排名指数(TARI)策略,以提高产科和儿童保健项目的参与度和依从性。

详情
Comments
Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)
AI中文摘要

许多医疗项目成功的关键在于参与者的依从性。我们考虑在资源有限的环境中(例如健康工作者及时拨打电话)安排干预措施,以提高依从性和/或参与度。以往的工作已经成功开发了几种基于活跃多臂老虎机(RMAB)的解决方案。然而,所有以往的RMAB方法都假设参与者的行为遵循马尔可夫性质。我们展示了在我们合作伙伴NGO ARMMAN的产科健康意识项目上的真实数据中,存在显著偏离马尔可夫假设的现象。此外,我们扩展RMAB到连续状态空间,这是之前研究较少的领域。为解决一般的非马尔可夫RMAB环境,我们(i)将每个参与者的时间轨迹建模为时间序列,(ii)利用时间序列预测模型的力量来学习复杂模式和动态以预测未来状态,(iii)提出时间序列臂排名指数(TARI)策略,这是一种新的算法,选择最能从干预中受益的RMAB臂,基于我们的未来状态预测。我们在合成数据和ARMMAN的真实数据二次分析上评估了我们的方法,并证明了与部署的Whittle指数解决方案相比,参与度显著增加。这相当于额外16.3小时的内容被聆听,90.8%更多的脱节风险被防止,并覆盖了超过两倍的高脱节风险受益人。

英文摘要

The success of many healthcare programs depends on participants' adherence. We consider the problem of scheduling interventions in low resource settings (e.g., placing timely support calls from health workers) to increase adherence and/or engagement. Past works have successfully developed several classes of Restless Multi-armed Bandit (RMAB) based solutions for this problem. Nevertheless, all past RMAB approaches assume that the participants' behaviour follows the Markov property. We demonstrate significant deviations from the Markov assumption on real-world data on a maternal health awareness program from our partner NGO, ARMMAN. Moreover, we extend RMABs to continuous state spaces, a previously understudied area. To tackle the generalised non-Markovian RMAB setting we (i) model each participant's trajectory as a time-series, (ii) leverage the power of time-series forecasting models to learn complex patterns and dynamics to predict future states, and (iii) propose the Time-series Arm Ranking Index (TARI) policy, a novel algorithm that selects the RMAB arms that will benefit the most from an intervention, given our future state predictions. We evaluate our approach on both synthetic data, and a secondary analysis on real data from ARMMAN, and demonstrate significant increase in engagement compared to the SOTA, deployed Whittle index solution. This translates to 16.3 hours of additional content listened, 90.8% more engagement drops prevented, and reaching more than twice as many high dropout-risk beneficiaries.

0911.2381 2026-06-05 physics.data-an cond-mat.stat-mech cs.LG nlin.CD stat.ME

Analytical Determination of Fractal Structure in Stochastic Time Series

随机时间序列中分形结构的解析确定

Fermín Moscoso del Prado Martín

AI总结 本文提出了一种基于贝叶斯评估的分析框架,用于客观准确地推断时间序列的分形结构,同时推导出一种优于现有方法的Hurst指数最大似然估计器。

详情
Journal ref
Psychological Methods (2013) 18(4):514-34
Comments
9 pages, 4 figures
AI中文摘要

当前确定时间序列是否具有分形结构(FS)的方法依赖于对Hurst指数估计值的主观评估。本文引入了贝叶斯评估的标度性,一种用于对时间序列的FS进行客观和准确推断的分析框架。该技术利用了时间序列相关扩散的标度性质。所得标准易于计算,并代表了支持时间序列标度域不同假设的证据的准确表征。此外,从该标准导出了H的闭式最大似然估计器,该估计器优于目前最好的估计器。

英文摘要

Current methods for determining whether a time series exhibits fractal structure (FS) rely on subjective assessments on estimators of the Hurst exponent (H). Here, I introduce the Bayesian Assessment of Scaling, an analytical framework for drawing objective and accurate inferences on the FS of time series. The technique exploits the scaling property of the diffusion associated to a time series. The resulting criterion is simple to compute and represents an accurate characterization of the evidence supporting different hypotheses on the scaling regime of a time series. Additionally, a closed-form Maximum Likelihood estimator of H is derived from the criterion, and this estimator outperforms the best available estimators.