arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2412.17470 2026-06-19 math.ST econ.EM stat.ME stat.TH 版本更新

A Necessary and Sufficient Condition for Size Controllability of Heteroskedasticity Robust Test Statistics

异方差稳健检验统计量尺寸可控性的一个充要条件

Benedikt M. Pötscher, David Preinerstorfer

AI总结针对回归模型中单个约束检验，给出了异方差稳健检验统计量尺寸可控性的充要条件，改进了现有仅充分条件的结果。

Comments Clarification in Footnote 15 added

2512.19187 2026-06-19 stat.ME 版本更新

Interpolated Quantile Estimation: A Unified Framework Bridging Quantiles and the Mean

插值分位数估计：桥接分位数与均值的统一框架

Saïd Maanan, Azzouz Dermoune, Ahmed El Ghini

AI总结提出三类在经典分位数与样本均值之间连续插值的估计量，基于平滑L1损失构建统一M估计框架，证明一致性和渐近正态性，并揭示轻尾和重尾分布下的不同效率特性。

详情

AI中文摘要

本文开发并分析了三类估计量，它们在经典分位数与样本均值之间连续插值。构造从$L_1$损失的平滑版本开始，由位置参数$z$和平滑参数$h \ge 0$索引，其最小化器$\hat q(z,h)$产生一个统一的$M$估计框架。根据$(z, h)$的指定方式，该框架生成三类不同的估计量：固定参数平滑分位数估计量、固定分位数的插入估计量，以及一个新的均值估计程序连续统。对于所有三个族，我们通过一致渐近等连续性论证建立了一致性和渐近正态性。极限方差具有封闭形式，允许跨族和平滑水平的效率透明比较。参数空间的几何分解表明，对于固定分位数水平$\tau$，可接受的$(z, h)$对位于直线上，沿该线估计量针对相同的总体分位数，而其渐近方差发生变化。理论分析揭示了两种效率机制。在轻尾分布（例如高斯分布）下，平滑产生单调方差减少。在重尾分布（例如拉普拉斯分布）下，有限平滑参数$h^{*}(\tau) > 0$严格提高了分位数估计的效率。基于模拟数据和真实金融收益的数值实验验证了这些结论，并表明，在渐近和有限样本中，均值估计族并未改进样本均值。

英文摘要

This paper develops and analyzes three families of estimators that continuously interpolate between classical quantiles and the sample mean. The construction begins with a smoothed version of the $L_1$ loss, indexed by a location parameter $z$ and a smoothing parameter $h \ge 0$, whose minimizer $\hat q(z,h)$ yields a unified $M$-estimation framework. Depending on how $(z, h)$ is specified, this framework generates three distinct classes of estimators: fixed-parameter smoothed quantile estimators, plug -- in estimators of fixed quantiles, and a new continuum of mean -- estimating procedures. For all three families we establish consistency and asymptotic normality via a uniform asymptotic equicontinuity argument. The limiting variances admit closed forms, allowing a transparent comparison of efficiency across families and smoothing levels. A geometric decomposition of the parameter space shows that, for fixed quantile level $τ$, admissible pairs $(z, h)$ lie on straight lines along which the estimator targets the same population quantile while its asymptotic variance evolves. The theoretical analysis reveals two efficiency regimes. Under light-tailed distributions (e.g., Gaussian), smoothing yields a monotone variance reduction. Under heavy-tailed distributions (e.g., Laplace), a finite smoothing parameter $h^{*}(τ) > 0$ strictly improves efficiency for quantile estimation. Numerical experiments -- based on simulated data and real financial returns -- validate these conclusions and show that, both asymptotically and in finite samples, the mean-estimating family does not improve upon the sample mean.

URL PDF HTML ☆

赞 0 踩 0

2309.15769 2026-06-19 math.ST cs.LG stat.ME stat.TH 版本更新

Benign overfitting beyond prediction: The ordinary least squares interpolator

超越预测的良性过拟合：普通最小二乘插值器

Dennis Shen, Dogyoon Song, Peng Ding, Jasjeet S. Sekhon

发表机构 * Department of Data Sciences & Operations, University of Southern California（数据科学与运营系，南加州大学）； Department of Statistics, University of California, Davis（统计学系，加州大学戴维斯分校）； Department of Statistics, University of California, Berkeley（统计学系，加州大学伯克利分校）； Google DeepMind（谷歌DeepMind）

AI总结本文研究过参数化线性模型中最小ℓ2范数OLS插值器的参数估计与推断性质，推导了留k法、遗漏变量偏误公式和Frisch-Waugh-Lovell定理的过参数化版本，并扩展了高斯-马尔可夫定理。

Comments This work is accepted for publication in Biometrika

详情

AI中文摘要

深度学习的最新进展突显了过参数化统计模型中良性过拟合的现象，引发了对其基础理解的浓厚兴趣。由于其简单性和实际相关性，普通最小二乘（OLS）插值器已成为从理论上理解这一现象的关键研究对象。虽然OLS在经典欠参数化设置下的性质已得到充分理解，但其在过参数化区域中的行为——与岭回归或lasso不同——仍相对较少被探索。我们通过为最小$\ell_2$范数OLS插值器推导新的代数和统计结果，为这一不断增长的文献做出贡献。与现有大部分关注预测风险的工作不同，我们的分析集中于参数估计和推断，这对于许多统计学和因果推断应用至关重要。具体地，我们建立了以下内容的过参数化类比：(i) 留$k$法公式，(ii) 遗漏变量偏误公式，以及(iii) Frisch-Waugh-Lovell定理。在高斯-马尔可夫模型下，我们进一步扩展了高斯-马尔可夫定理，并分析了过参数化设置下同方差性时的方差估计。这些结果共同为研究过参数化线性模型中的参数估计和推断提供了一个系统框架，为超越预测含义的良性过拟合提供了新视角。

英文摘要

Recent advances in deep learning have highlighted the phenomenon of benign overfitting in overparameterized statistical models, sparking significant interest in understanding its foundations. Owing to its simplicity and practical relevance, the ordinary least squares (OLS) interpolator has become a key object of study for gaining theoretical insight into this phenomenon. While the properties of OLS are well understood in classical underparameterized settings, its behavior in the overparameterized regime -- unlike that of ridge regression or the lasso -- remains comparatively less explored. We contribute to this growing literature by deriving new algebraic and statistical results for the minimum $\ell_2$-norm OLS interpolator. In contrast to much of the existing work, which focuses on prediction risk, we center our analysis on parameter estimation and inference, which are fundamental for many statistics and causal inference applications. Specifically, we establish overparameterized analogues of (i) the leave-$k$-out formulas, (ii) the omitted variable bias formula, and (iii) the Frisch-Waugh-Lovell theorem. Under the Gauss-Markov model, we further extend the Gauss-Markov theorem and analyze variance estimation under homoskedasticity in the overparameterized setting. Collectively, these results provide a systematic framework for studying parameter estimation and inference in overparameterized linear models, offering a novel perspective on benign overfitting beyond its implications for prediction.

URL PDF HTML ☆

赞 0 踩 0

2604.06464 2026-06-19 cs.LG physics.app-ph stat.ML 版本更新

Weighted Bayesian Conformal Prediction

加权贝叶斯共形预测

Xiayin Lou, Peng Luo

发表机构 * Technical University of Munich（慕尼黑技术大学）； Massachusetts Institute of Technology（麻省理工学院）

AI总结提出加权贝叶斯共形预测（WBCP），通过加权Dirichlet先验推广贝叶斯共形预测到重要性加权设置，理论证明有效样本量决定后验方差，并提供更丰富的条件覆盖不确定性。

详情

AI中文摘要

共形预测提供具有有限样本覆盖保证的分布自由预测区间，Snell & Griffiths 最近的工作将其重新解释为贝叶斯求积（BQ-CP），通过阈值上的 Dirichlet 后验产生强大的数据条件保证。然而，BQ-CP 根本上要求 i.i.d. 假设。同时，加权共形预测通过重要性权重处理分布偏移，但仍然是频率学派方法，仅产生点估计阈值。我们提出 \textbf{加权贝叶斯共形预测（WBCP）}，它将 BQ-CP 推广到任意重要性加权设置，用加权 Dirichlet $\Dir(\neff \cdot \tilde{w}_1, \ldots, \neff \cdot \tilde{w}_n)$ 替换均匀 Dirichlet $\Dir(1,\ldots,1)$，其中 $\neff$ 是 Kish 有效样本量。我们证明了四个理论结果：(1)~$\neff$ 是匹配频率学派和贝叶斯方差的唯一集中参数；(2)~后验标准差以 $O(1/\sqrt{\neff})$ 衰减；(3)~BQ-CP 的随机占优保证扩展到每个权重轮廓的数据条件保证；(4)~HPD 阈值在条件覆盖上提供 $O(1/\sqrt{\neff})$ 的改进。我们将 WBCP 实例化为 \emph{地理贝叶斯共形预测}，其中基于核的空间权重产生每个位置的后验，并具有可解释的诊断。在合成和真实空间数据集上的实验表明，WBCP 在保持覆盖保证的同时提供了更丰富的不确定性信息。

英文摘要

Conformal prediction provides distribution-free prediction intervals with finite-sample coverage guarantees, and recent work by Snell \& Griffiths reframes it as Bayesian Quadrature (BQ-CP), yielding powerful data-conditional guarantees via Dirichlet posteriors over thresholds. However, BQ-CP fundamentally requires the i.i.d. assumption. Meanwhile, weighted conformal prediction handles distribution shift via importance weights but remains frequentist, producing only point-estimate thresholds. We propose \textbf{Weighted Bayesian Conformal Prediction (WBCP)}, which generalizes BQ-CP to arbitrary importance-weighted settings by replacing the uniform Dirichlet $\Dir(1,\ldots,1)$ with a weighted Dirichlet $\Dir(\neff \cdot \tilde{w}_1, \ldots, \neff \cdot \tilde{w}_n)$, where $\neff$ is Kish's effective sample size. We prove four theoretical results: (1)~$\neff$ is the unique concentration parameter matching frequentist and Bayesian variances; (2)~posterior standard deviation decays as $O(1/\sqrt{\neff})$; (3)~BQ-CP's stochastic dominance guarantee extends to per-weight-profile data-conditional guarantees; (4)~the HPD threshold provides $O(1/\sqrt{\neff})$ improvement in conditional coverage. We instantiate WBCP for spatial prediction as \emph{Geographical BQ-CP}, where kernel-based spatial weights yield per-location posteriors with interpretable diagnostics. Experiments on synthetic and real-world spatial datasets demonstrate that WBCP maintains coverage guarantees while providing substantially richer uncertainty information.

URL PDF HTML ☆

赞 0 踩 0

2603.19745 2026-06-19 stat.ME 版本更新

Invariant quantile regression for heterogeneous environments

异质环境下的不变分位数回归

Bo Fu, Dandan Jiang

AI总结针对多环境数据集提出不变分位数回归框架，通过核平滑估计器利用环境间不变性实现因果发现和内生性克服。

Comments 25 pages, 4 figures

详情

AI中文摘要

在本文中，我们提出了一个专门针对多环境数据集的不变分位数回归（IQR）框架，该框架捕捉了不同环境之间的不变性。该框架与迁移学习、因果推断和公平机器学习密切相关，其动机源于响应变量在给定协变量下的条件概率发生变化，而某些关键变量保持不变的场景。这一视角与以往仅关注条件均值的工作显著不同，后者通常不足以捕捉异质环境中协变量与响应变量之间的完整因果关系。相比之下，基于分位数的不变性自然地适应异质性，并且与结构因果模型更加一致，其中在一个或多个分位数水平上跨环境不变的变量直接指示潜在且稳定的因果变量。此外，我们表明，与条件均值框架相比，IQR 可能产生更大的内生变量集，从而更有效地排除虚假（非因果）变量。为此，我们引入了一种核平滑不变分位数回归（KS-IQR）估计器，该估计器利用潜在的不变结构和环境间的异质性，确保在多个环境中稳定估计。我们在非渐近框架下建立了我们方法的因果发现性质，展示了其克服“内生性诅咒”的能力，并推导了估计器的 $\ell_2$ 误差界。我们将我们的方法应用于真实数据的因果发现，获得了具有生物学意义的关系，恢复了已知的信号通路并揭示了额外的分位数特定效应。

英文摘要

In this paper, we propose an invariant quantile regression (IQR) framework specifically designed for multi-environment datasets, which captures the invariance across different environments. This framework is closely related to transfer learning, causal inference, and fair machine learning, and is motivated by scenarios in which the conditional probability of the response given covariates varies, while certain key variables remain invariant. This perspective differs notably from previous works that restrict attention to the conditional mean, which is often insufficient to capture the full causal relationships between covariates and the response in heterogeneous environments. In contrast, quantile-based invariance naturally accommodates heterogeneity, and aligns more closely with structural causal models, in which variables invariant across environments at one or multiple quantile levels directly indicate potential and stable causal variables. Moreover, we show that IQR may yield a larger set of endogenous variables compared to the conditional mean framework, which in turn promotes more effective exclusion of spurious (non-causal) variables. To achieve this, we introduce a Kernel-Smoothed Invariant Quantile Regression (KS-IQR) estimator, which leverages the underlying invariance structure and heterogeneity among environments, ensuring stable estimation across multiple environments. We establish the causal discovery properties of our method, demonstrate its ability to overcome the ``curse of endogeneity'', and derive an $\ell_2$ error bound for our estimator, all in a non-asymptotic framework. We apply our method to real data for causal discovery and obtain biologically meaningful relationships, recovering known signaling pathways and revealing additional quantile-specific effects.

URL PDF HTML ☆

赞 0 踩 0

2506.06267 2026-06-19 stat.ME 版本更新

A causal framework for evaluating the total effect of strategies aiming to expand screening and to improve outcomes

评估旨在扩大筛查和改善结果的策略总效应的因果框架

Joy Zora Nakato, Janice Litunya, Brian Beesiga, Jane Kabami, James Ayieko, Moses R. Kamya, Gabriel Chamie, Laura B. Balzer

AI总结针对集群随机试验中多层次、缺失数据和中介效应问题，提出反事实分层效应定义总效应，并扩展两阶段目标最小损失估计（TMLE）进行识别和估计。

Comments 20 pages, 3 figures, accepted at "Statistics in Medicine"

详情

AI中文摘要

对于许多健康状况，存在高效的治疗和预防产品。最大化其影响需要改善健康筛查覆盖的策略，以确定谁可能受益。例如，HIV预防策略旨在扩大风险筛查并提高风险人群对暴露前预防（PrEP）的接受度。这些策略通常引起群体层面（如卫生诊所或社区）的变化，并通过集群随机试验进行评估。这种情况产生了复杂的多层次-中介-缺失数据问题，原因如下：首先，策略在集群层面实施，而健康筛查和结果在个体层面；其次，策略通过改善健康筛查直接和间接改善健康结果；第三，每个人都有一个潜在状态，仅在接受筛查者中观察到。为正式定义此类环境中的总效应，我们使用反事实分层效应：因果估计量，其中结果仅与某个群体相关，该群体的成员资格受缺失和/或感兴趣暴露的影响。为识别和估计相应的统计估计量，我们提出了一种新颖的两阶段目标最小损失估计（TMLE）扩展。模拟展示了我们方法的实际性能以及现有方法的局限性。

英文摘要

For many health conditions, there are highly efficacious treatment and prevention products. Maximizing their impact requires strategies that improve the reach of health screening in order to establish who could benefit. For example, HIV prevention strategies aim to expand risk screening and to improve uptake of pre-exposure prophylaxis (PrEP) among those experiencing risk. Often, these strategies induce changes at the group-level (e.g., health clinics or communities) and are evaluated through cluster randomized trials. This scenario creates a complex, multilevel-mediation-missing data problem for the following reasons. First, the strategy is delivered at the cluster-level, while health screening and outcomes are at the individual-level. Second, the strategy improves health outcomes directly and indirectly through improved health screening. Third, everyone has an underlying status, which is only observed among those screened. To formally define the total effect in such settings, we use Counterfactual Strata Effects: causal estimands where the outcome is only relevant for a group whose membership is subject to missingness and/ or impacted by the exposure of interest. To identify and estimate the corresponding statistical estimand, we propose a novel extension of Two-Stage targeted minimum loss-based estimation (TMLE). Simulations demonstrate the practical performance of our approach as well as the limitations of existing approaches.

URL PDF HTML ☆

赞 0 踩 0

2506.18808 2026-06-19 stat.AP 版本更新

A Practical Introduction to Regression-based Causal Inference in Meteorology (I): All confounders measured

气象学中基于回归的因果推断实用入门（I）：所有混杂因素可测

Caren Marzban, Yikun Zhang, Nicholas Bond, Michael Richman

AI总结介绍在非时间序列场景下，利用匹配方法进行因果推断，提供气象学应用实例和R代码。

详情

AI中文摘要

一个变量是否是另一个变量的原因，或者仅仅与之相关，通常是一个重要的科学问题。因果推断是在统计背景下解决该问题的技术体系。尽管在存在时间信息时评估因果关系相对直接，但在非时间序列场景（本文考虑的情况）下，评估因果效应更为困难。因果推断领域的发展涉及广泛的主题概念，从而限制了其在包括气象学在内的一些领域的应用。然而，其核心所需的因果推断知识仅涉及基本概率论和回归，这是大多数气象学家熟悉的主题。通过聚焦这些核心领域，本文及其姊妹篇为气象学界进入（非时间序列）因果推断领域提供了垫脚石。尽管介绍了一些理论基础，但主要目标是将一种称为匹配的特定方法应用于气象学问题。应用数据为公开数据，并提供了R代码，为气象学学生和研究人员进入该领域铺平了道路。

英文摘要

Whether a variable is the cause of another, or simply associated with it, is often an important scientific question. Causal Inference is the name associated with the body of techniques for addressing that question in a statistical setting. Although assessing causality is relatively straightforward in the presence of temporal information, outside of that setting - the situation considered here - it is more difficult to assess causal effects. The development of the field of causal inference has involved concepts from a wide range of topics, thereby limiting its adoption across some fields, including meteorology. However, at its core, the requisite knowledge for causal inference involves little more than basic probability theory and regression, topics familiar to most meteorologists. By focusing on these core areas, this and a companion article provide a steppingstone for the meteorology community into the field of (non-temporal) causal inference. Although some theoretical foundations are presented, the main goal is the application of a specific method, called matching, to a problem in meteorology. The data for the application are in public domain, and R code is provided as well, forming an easy path for meteorology students and researchers to enter the field.

URL PDF HTML ☆

赞 0 踩 0

2506.18652 2026-06-19 stat.AP 版本更新

A Practical Introduction to Regression-based Causal Inference in Meteorology (II): Unmeasured confounders

气象学中基于回归的因果推断实用入门（二）：未测量的混杂因素

Caren Marzban, Yikun Zhang, Nicholas Bond, Michael Richman

AI总结介绍在未测量混杂因素存在时，利用工具变量法通过回归估计因果效应，并以气象数据为例说明工具变量选择的重要性。

详情

AI中文摘要

将相关性“提升”为因果关系的障碍之一是混杂现象，即两个变量之间的相关性实际上是由第三个变量（称为混杂因素）引起的。在先前的一篇配套文章中，我们考察了混杂因素被测量的情况。本文表明，即使混杂变量未被测量，在某些条件下，仍然可以通过一种基于回归的方法（利用工具变量的概念）来估计因果效应。使用与姊妹篇类似的气象数据集，比较和对比了因果效应的几种不同估计。结果表明，工具变量估计的因果效应依赖于工具变量的选择，而气象学考虑对于解决这种不确定性至关重要。提供了用于生成所有结果的R代码，并概述了未来工作的许多方向。

英文摘要

One obstacle to ``elevating'' correlation to causation is the phenomenon of confounding, i.e., when a correlation between two variables exists because both variables are in fact caused by a third variable, called a confounder. The situation where the confounders are measured is examined in an earlier, accompanying article. Here, it is shown that even when the confounding variables are not measured, under certain conditions it is still possible to estimate the causal effect via a regression-based method that uses the notion of instrumental variables. Using a meteorological data set, similar to that in the sister article, a number of different estimates of the causal effect are compared and contrasted. It is shown that the instrumental-variable estimates of causal effect depend on the choice of the instrumental variable, and that meteorological considerations are important in resolving the ambiguity. R code is provided for generating all of the results, and numerous directions for future work are outlined.

URL PDF HTML ☆

赞 0 踩 0

2405.00118 2026-06-19 math.ST stat.ME stat.TH 版本更新

Causal Inference with High-dimensional Discrete Covariates

高维离散协变量下的因果推断

Zhenghao Zeng, Sivaraman Balakrishnan, Yanjun Han, Edward H. Kennedy

AI总结研究高维离散协变量下因果效应的估计问题，证明常用估计量的均方误差界为d²/n²+1/n，并给出极小化下界，提出利用效应同质性和先验知识的新估计量以加速收敛。

Comments 74 pages, 9 figures

详情

AI中文摘要

在从观察性研究估计因果效应时，研究人员通常需要调整许多协变量以消除暴露与结果之间的非因果关系，其中许多协变量是离散的。常用估计量在存在许多离散协变量时的行为尚不明确，因为它们的性质通常是在稀疏性和平滑性等结构假设下分析的，而这些假设不适用于离散设置。在这项工作中，我们研究了一个模型中因果效应的估计，其中用于混杂调整的协变量是离散但高维的，意味着类别数量$d$与样本量$n$相当甚至更大。具体来说，我们证明了常用回归、加权和双稳健估计量的均方误差以$\frac{d^2}{n^2}+\frac{1}{n}$为界。然后，我们证明了平均处理效应的极小化下界为$\frac{d^2}{n^2 \log^2 n}+\frac{1}{n}$量级，这刻画了高维离散设置下因果效应估计的基本难度，并表明上述估计量在忽略对数因子时是速率最优的。我们进一步考虑了可以利用的额外结构，即效应同质性和协变量分布的先验知识，并提出了新的估计量，这些估计量具有更快的收敛速率$\frac{d}{n^2} + \frac{1}{n}$，从而在更广泛的范围内实现一致性。通过模拟研究对结果进行了实证说明。

英文摘要

When estimating causal effects from observational studies, researchers often need to adjust for many covariates to deconfound the non-causal relationship between exposure and outcome, among which many covariates are discrete. The behavior of commonly used estimators in the presence of many discrete covariates is not well understood since their properties are often analyzed under structural assumptions including sparsity and smoothness, which do not apply in discrete settings. In this work, we study the estimation of causal effects in a model where the covariates required for confounding adjustment are discrete but high-dimensional, meaning the number of categories $d$ is comparable with or even larger than sample size $n$. Specifically, we show the mean squared error of commonly used regression, weighting and doubly robust estimators is bounded by $\frac{d^2}{n^2}+\frac{1}{n}$. We then prove the minimax lower bound for the average treatment effect is of order $\frac{d^2}{n^2 \log^2 n}+\frac{1}{n}$, which characterizes the fundamental difficulty of causal effect estimation in the high-dimensional discrete setting, and shows the estimators mentioned above are rate-optimal up to log-factors. We further consider additional structures that can be exploited, namely effect homogeneity and prior knowledge of the covariate distribution, and propose new estimators that enjoy faster convergence rates of order $\frac{d}{n^2} + \frac{1}{n}$, which achieve consistency in a broader regime. The results are illustrated empirically via simulation studies.

URL PDF HTML ☆

赞 0 踩 0

2202.03332 2026-06-19 stat.ME econ.EM stat.AP 版本更新

Practical Forecasting of Environmental Maps: A Functional Data Approach

环境地图的实用预测：一种函数型数据方法

Alexander Gleim, Nazarii Salish

AI总结提出一种基于函数型数据分析的统计方法，用于预测随时间变化的地理区域环境数据，通过整合时空依赖关系生成预测表面，并以德国地面臭氧浓度预测为例验证其有效性。

详情

AI中文摘要

环境问题在社会经济和健康研究中日益受到关注，推动了相关现实过程记录和数据收集的进展。然而，传统数据处理工具往往过于局限，无法考虑此类数据集的丰富特性。本文提出了一种简单的统计视角，用于预测随时间在预定义地理区域上顺序收集的环境数据。我们将此类数据集视为具有可能复杂地理区域的表面（或函数型）时间序列。利用函数型数据分析技术，我们开发了一种预测方法，能够同时考虑地理和时间依赖性。该方法允许整合传统多元技术以提供预测表面。我们通过德国地面臭氧浓度的预测示例展示了我们方法的实用价值，证明了其有效性和广泛应用的潜力。

英文摘要

Environmental problems are receiving increasing attention in socio-economic and health studies, fostering advances in recording and data collection of related real-life processes. However, traditional tools for data processing are often found too restrictive as they do not account for the rich nature of such data sets. In this paper, we propose a simple statistical perspective on forecasting environmental data collected sequentially over time across some predefined geographic region. We treat such data set as a surface (or functional) time series with a possibly complicated geographical domain. Using techniques from functional data analysis, we develop a forecasting methodology that allows to account for both geographic and temporal dependencies. This methodology allows integration of traditional multivariate techniques to provide forecasts surfaces. We demonstrate the practical value of our approach with a forecasting example of ground-level ozone concentration across Germany, showcasing its effectiveness and potential for broad application.

URL PDF HTML ☆

赞 0 踩 0

2606.04307 2026-06-19 cs.LG stat.CO stat.ME 版本更新

Folded Transport MCMC: Eliminating Label Switching by Sampling on a Fundamental Domain

折叠传输MCMC：对称贝叶斯模型的可认证商后验计算

Jun Hu

发表机构 * Wuhan University of Technology（武汉理工大学）

AI总结针对对称贝叶斯模型中的冗余多峰性导致MCMC收敛诊断退化的问题，提出Folded Transport MCMC方法，通过在对称群的基本域上构建独立采样器直接对商后验进行推断，并利用LCNF振荡认证框架在商度量下提供可证明的认证下界。

Comments 50 pages (including supplementary material), 5 figures, 6 tables. Submitted to Journal of Computational and Graphical Statistics

详情

AI中文摘要

具有有限对称性的贝叶斯模型——如可交换分量的混合模型、具有紧密间隔模态的结构识别——定义的后验在标签置换群下不变，产生冗余的多峰性，从而降低MCMC收敛诊断的质量。我们引入折叠传输MCMC（FolT-MCMC），该方法通过在对称群的基本域上构建独立采样器，直接对商后验进行推断。商提议分布通过对群轨道上学习的归一化流进行对称化得到。我们证明了基于LCNF振荡的认证框架可以迁移到商度量，并具有稳定子修正的球质量界和改进的覆盖半径，并且当未折叠流表现出跨模态提议缺陷时，分位数核心认证下界会得到改善。在高斯混合（d=2-20）、标签切换目标（最多24个等价模态）以及标准贝叶斯三分量混合后验上，分位数核心认证改进比从2倍到145倍不等，且折叠认证经验上几乎与维度无关。在台风山竹期间超高层建筑的真实加速度计数据上，FolT-MCMC产生了非平凡的分位数核心认证，而未折叠认证是平凡的。

英文摘要

In Bayesian mixture models and other exchangeable-component models, the posterior is invariant under permutation of component labels, creating m! equivalent modes-the label-switching problem. Standard MCMC methods either mix poorly across these modes or rely on post-hoc relabelling that cannot guarantee the sampler has converged. We propose Folded Transport MCMC (FolT-MCMC), which eliminates label switching before sampling by restricting the Markov chain to a fundamental domain-a sorted or reflected subspace containing exactly one representative from each symmetric mode. The proposal is a learned normalising flow whose density is symmetrised over the group orbits, ensuring correct targeting on the reduced space. We show that this construction preserves a computable convergence diagnostic based on the oscillation of the log-density ratio, and that the diagnostic becomes sharper on the fundamental domain whenever the original-space flow under-covers one or more symmetric modes. Experiments on Gaussian mixtures (d=2-20), label-switching targets (up to 24 equivalent modes), a standard Bayesian three-component mixture posterior, and real accelerometer data from a supertall building show improvement ratios of 2x to 145x, with the folded diagnostic stable across dimensions while the unfolded diagnostic collapses.

URL PDF HTML ☆

赞 0 踩 0

2603.20022 2026-06-19 stat.ME 版本更新

Q-approximation of operating characteristics of clinical trial designs

临床试验设计操作特性的Q-近似

Susanna Gentile, Daniel E. Schwartz, Riddhiman Saha, Lorenzo Trippa

AI总结提出Q-近似方法，通过二次近似似然函数替代完整数据模拟，快速评估临床试验的操作特性，计算效率比蒙特卡罗模拟高150-1900倍。

详情

AI中文摘要

设计临床试验需要评估多个操作特性（OCs），例如早期停止决策的可能性、检测治疗效应的概率以及I类错误率。在大多数情况下，这些评估基于计算密集型的蒙特卡罗模拟。随着临床试验复杂性和适应性设计使用的增加，计算负担可能迅速变得难以承受。我们引入了一种快速近似OCs的策略，称为Q-近似。我们的方法基于对数似然的二次近似和渐近论证。主要思想是用模拟决定试验中期和最终决策的近似似然函数来替代完整试验数据集的模拟。Q-近似方法可应用于任何使用与似然原理一致的数据分析方法的试验设计，包括具有早期停止的多阶段设计、自适应随机化设计以及利用外部数据的设计。我们通过几个例子说明了该方法，并表明它在减少计算时间的同时提供了重要OCs的准确近似。特别是，在我们的实验中，要达到相当的精度水平，标准蒙特卡罗近似OCs所需的计算预算比Q-近似高150到1900倍。通过实现快速的OC评估，Q-近似可以支持在应用试验规划和方法学开发中更广泛地使用创新试验设计。

英文摘要

Designing clinical trials requires evaluating multiple operating characteristics (OCs), such as the likelihood of an early stopping decision, the probability of detecting a treatment effect, and the Type I error rate. In most cases, these evaluations are based on computationally intensive Monte Carlo simulations. As the complexity of clinical trials and the use of adaptive designs increase, the computational burden can quickly become prohibitive. We introduce a strategy for rapidly approximating OCs, called the Q-approximation. Our approach is based on quadratic approximations of the log-likelihood and asymptotic arguments. The main idea is to replace simulation of full trial datasets with simulation of the approximate likelihood functions that determine the trial's interim and final decisions. The Q-approximation approach can be applied to any trial design that uses data analysis methods coherent with the likelihood principle, including multistage designs with early stopping, adaptively randomized designs, and designs that leverage external data. We illustrate the approach with several examples and show that it provides an accurate approximation of important OCs while reducing the computation time compared to Monte Carlo simulations. In particular, in our experiments, the standard Monte Carlo approximation of OCs requires 150 to 1,900 times greater computing budget than Q-approximations to achieve comparable levels of accuracy. By enabling fast OC evaluations, Q-approximations can support the broader use of innovative trial designs in both applied trial planning and methodological development.

URL PDF HTML ☆

赞 0 踩 0

2602.01929 2026-06-19 math.DS stat.CO stat.ML 版本更新

Probabilistic function-on-function nonlinear autoregressive model for emulation and reliability analysis of stochastic dynamical systems

概率函数对函数非线性自回归模型用于随机动力系统的仿真与可靠性分析

Zhouzhou Song, Marcos A. Valdebenito, Styfen Schär, Stefano Marelli, Bruno Sudret, Matthias G. R. Faes

AI总结提出F2NARX模型，从函数对函数回归角度改进NARX方法，结合PCA和高斯过程回归实现概率预测，并通过主动学习高效估计首次穿越失效概率。

详情

AI中文摘要

在许多工程领域，构建准确且计算高效的代理模型（或仿真器）用于预测动力系统响应至关重要，但由于外部激励和系统参数到系统响应的强非线性和高维映射，这仍然具有挑战性。本文引入了一种新颖的函数对函数非线性自回归外生输入模型（F2NARX），该模型从函数对函数回归的角度重新表述了最近提出的$\mathcal{F}$-NARX方法。所提出的框架在保持高精度的同时显著提高了预测效率。通过将主成分分析与高斯过程回归相结合，F2NARX进一步通过无迹变换以自回归方式实现动力响应的概率预测。这种概率预测能力进一步促进了首次穿越概率评估的主动学习。通过不同复杂度的案例研究证明了该方法的有效性。结果表明，F2NARX在效率上比最先进的NARX模型高出几个数量级，同时通常达到更高的精度。此外，主动学习方法能够仅使用少量训练时间历程准确估计动力系统的首次穿越失效概率。

英文摘要

Constructing accurate and computationally efficient surrogate models (or emulators) for predicting dynamical system responses is critical in many engineering domains, yet remains challenging due to the strongly nonlinear and high-dimensional mapping from external excitations and system parameters to system responses. This work introduces a novel Function-on-Function Nonlinear AutoRegressive model with eXogenous inputs (F2NARX), which reformulates the recently proposed $\mathcal{F}$-NARX method from a function-on-function regression perspective. The proposed framework substantially improves predictive efficiency while maintaining high accuracy. By combining principal component analysis with Gaussian process regression, F2NARX further enables probabilistic predictions of dynamical responses via the unscented transform in an autoregressive manner. Such probabilistic prediction capabilities further facilitate active learning for first-passage probability evaluation. The effectiveness of the method is demonstrated through case studies of varying complexity. Results show that F2NARX outperforms state-of-the-art NARX model by orders of magnitude in efficiency while achieving higher accuracy in general. Meanwhile, the active learning approach enables accurate estimation of first-passage failure probabilities for dynamical systems using only a small number of training time histories.

URL PDF HTML ☆

赞 0 踩 0

2601.23173 2026-06-19 stat.ME 版本更新

Robust, partially alive particle Metropolis-Hastings via the Frankenfilter

鲁棒的、部分存活的粒子Metropolis-Hastings算法：基于Frankenfilter

Chris Sherlock, Andrew Golightly, Anthony Lee

AI总结针对隐马尔可夫模型中条件似然为零导致粒子滤波失效的问题，提出Frankenfilter，通过固定模拟次数上下限并设定成功目标，实现鲁棒且高效的似然估计，在伪边际Metropolis-Hastings中比标准粒子滤波效率提高2-3倍。

详情

AI中文摘要

当隐马尔可夫模型允许给定隐藏过程的观测条件似然为零时，从一个观测时间到下一个观测时间的所有粒子模拟可能产生零值。如果是这样，滤波分布无法估计，且估计的参数似然为零。存活粒子滤波器通过为每个观测间隔模拟随机数量的粒子来解决这个问题，在达到目标数量的非零条件似然后停止。对于异常观测或较差的参数值，非零结果可能极不可能发生，计算成本过高。我们引入了Frankenfilter，一种有原则的、部分存活的粒子滤波器，它在固定模拟次数上下限的同时，针对用户定义的成功量。Frankenfilter产生似然的无偏估计，适用于伪边际Metropolis-Hastings（PMMH）。我们证明，与使用标准粒子滤波器的PMMH相比，使用Frankenfilter的PMMH对异常值和错误指定的初始参数值更加鲁棒，并且通常效率至少提高2-3倍。我们还提供了选择成功量的建议。在n个精确观测的情况下，这特别简单：目标为n次成功。

英文摘要

When a hidden Markov model permits the conditional likelihood of an observation given the hidden process to be zero, all particle simulations from one observation time to the next could produce zeros. If so, the filtering distribution cannot be estimated and the estimated parameter likelihood is zero. The alive particle filter addresses this by simulating a random number of particles for each inter-observation interval, stopping after a target number of non-zero conditional likelihoods. For outlying observations or poor parameter values, a non-zero result can be extremely unlikely, and computational costs prohibitive. We introduce the Frankenfilter, a principled, partially alive particle filter that targets a user-defined amount of success whilst fixing lower and upper bounds on the number of simulations. The Frankenfilter produces unbiased estimators of the likelihood, suitable for pseudo-marginal Metropolis--Hastings (PMMH). We demonstrate that PMMH with the Frankenfilter is more robust to outliers and mis-specified initial parameter values than PMMH using standard particle filters, and is typically at least 2-3 times more efficient. We also provide advice for choosing the amount of success. In the case of n exact observations, this is particularly simple: target n successes.

URL PDF HTML ☆

赞 0 踩 0

2512.17473 2026-06-19 eess.SP cs.LG math.OC stat.ML 版本更新

Alternating Direction Method of Multipliers for Nonlinear Matrix Decompositions

非线性矩阵分解的交替方向乘子法

Atharva Awari, Nicolas Gillis, Arnaud Vandaele

发表机构 * University of Mons（蒙斯大学）

AI总结提出基于交替方向乘子法（ADMM）的算法求解非线性矩阵分解（NMD），支持多种非线性函数和损失函数，在真实数据集上验证了适用性和效率。

Comments 16 pages, 7 figures. v3: Revised version: added new experiments and comparisons. Code available from https://gitlab.com/Atharva05/admm-for-nmd

详情

AI中文摘要

我们提出了一种基于交替方向乘子法（ADMM）的算法，用于求解非线性矩阵分解（NMD）。给定输入矩阵 $X \in \mathbb{R}^{m \times n}$ 和分解秩 $r \ll \min(m, n)$，NMD 寻求矩阵 $W \in \mathbb{R}^{m \times r}$ 和 $H \in \mathbb{R}^{r \times n}$，使得 $X \approx f(WH)$，其中 $f$ 是逐元素非线性函数。我们在几个代表性非线性模型上评估了我们的方法：适用于非负稀疏数据近似的修正线性单元激活 $f(x) = \max(0, x)$，适用于概率电路表示的逐分量平方 $f(x) = x^2$，以及适用于推荐系统的 MinMax 变换 $f(x) = \min(b, \max(a, x))$。所提出的框架灵活支持多种损失函数，包括最小二乘、$\ell_1$ 范数和 Kullback-Leibler 散度，并且可以轻松扩展到其他非线性和度量。我们在真实世界数据集上展示了该方法的适用性、效率和适应性，突出了其在广泛应用中的潜力。

英文摘要

We present an algorithm based on the alternating direction method of multipliers (ADMM) for solving nonlinear matrix decompositions (NMD). Given an input matrix $X \in \mathbb{R}^{m \times n}$ and a factorization rank $r \ll \min(m, n)$, NMD seeks matrices $W \in \mathbb{R}^{m \times r}$ and $H \in \mathbb{R}^{r \times n}$ such that $X \approx f(WH)$, where $f$ is an element-wise nonlinear function. We evaluate our method on several representative nonlinear models: the rectified linear unit activation $f(x) = \max(0, x)$, suitable for nonnegative sparse data approximation, the component-wise square $f(x) = x^2$, applicable to probabilistic circuit representation, and the MinMax transform $f(x) = \min(b, \max(a, x))$, relevant for recommender systems. The proposed framework flexibly supports diverse loss functions, including least squares, $\ell_1$ norm, and the Kullback-Leibler divergence, and can be readily extended to other nonlinearities and metrics. We illustrate the applicability, efficiency, and adaptability of the approach on real-world datasets, highlighting its potential for a broad range of applications.

URL PDF HTML ☆

赞 0 踩 0

2508.13313 2026-06-19 stat.ML cs.LG math.OC 版本更新

Flow Matching for Efficient and Scalable Data Assimilation

用于高效可扩展数据同化的流匹配

Taos Transue, Bohan Chen, So Takao, Bao Wang

发表机构 * The Computing and Mathematical Sciences Department, California Institute of Technology（加州理工学院计算与数学科学系）； Department of Mathematics and Scientific Computing and Imaging Institute, University of Utah（犹他大学数学与科学计算系和成像研究所）

AI总结提出基于流匹配的无训练集成流滤波器（EnFF），通过蒙特卡洛估计和局部化引导加速高维非线性数据同化，在成本-精度权衡和可扩展性上优于现有方法。

Comments revamp presentation, add experiments

2503.11479 2026-06-19 stat.CO math.PR math.ST stat.ME stat.TH 版本更新

Towards practical PDMP sampling: Metropolis adjustments, locally adaptive step-sizes, and NUTS-based time lengths

走向实用的PDMP采样：Metropolis调整、局部自适应步长和基于NUTS的时间长度

Augustin Chevallier, Sam Power, Matthew Sutton

AI总结针对PDMP采样需要计算模型特定界限的难题，提出Metropolis调整近似、自适应步长机制和NUTS启发的路径长度选择，集成得到双重自适应PDMP采样器，提升鲁棒性和效率。

详情

AI中文摘要

分段确定性马尔可夫过程（PDMP）在从复杂概率分布中采样方面具有重要前景。然而，其实践应用受到需要计算模型特定界限的限制。相反，虽然哈密顿蒙特卡洛（HMC）提供了一种普遍有效的采样方法，但其无法自适应调整步长，导致在采样漏斗形等复杂分布时性能受损。为解决这些限制，我们引入了三个创新概念：(a) 一种Metropolis调整的PDMP模拟近似，无需显式界限且不破坏不变测度；(b) 一种与Metropolis校正兼容的自适应步长机制；(c) 一种受无U型转弯采样器（NUTS）启发的方案，用于动态选择PDMP中的路径长度。这三个想法可以无缝集成到一个单一的“双重自适应”PDMP采样器中，具有良好的鲁棒性和效率特性。

英文摘要

Piecewise-Deterministic Markov Processes (PDMPs) hold significant promise for sampling from complex probability distributions. However, their practical implementation is hindered by the need to compute model-specific bounds. Conversely, while Hamiltonian Monte Carlo (HMC) offers a generally efficient approach to sampling, its inability to adaptively tune step sizes impedes its performance when sampling complex distributions like funnels. To address these limitations, we introduce three innovative concepts: (a) a Metropolis-adjusted approximation for PDMP simulation that eliminates the need for explicit bounds without compromising the invariant measure, (b) an adaptive step size mechanism compatible with the Metropolis correction, and (c) a No U-Turn Sampler (NUTS)-inspired scheme for dynamically selecting path lengths in PDMPs. These three ideas can be seamlessly integrated into a single, `doubly-adaptive' PDMP sampler with favourable robustness and efficiency properties.

URL PDF HTML ☆

赞 0 踩 0

2605.02989 2026-06-19 cs.IT eess.SP math.IT stat.ML 版本更新

Information Theory and Statistical Learning

信息论与统计学习

Abbas El Gamal

AI总结本文是Cover & Thomas《信息论基础》第三版的章节预印本，系统介绍了散度度量在模型训练中的作用，涵盖线性回归、生成扩散模型等，并给出了扩散模型更系统的推导。

详情

AI中文摘要

本手稿包含即将出版的《Cover and Thomas信息论基础》第三版中一章的预印本，经Wiley许可发布。新版的目录EIT-3 ToC可在此https URL找到。反馈请联系abbas@ee. this http URL。学习与信息论在模型训练和基本性能极限的表征中均有交叉。本手稿对第一个交叉点进行了简洁易懂的处理，仅需高年级本科生或一年级研究生水平的信息论和统计学基础知识。章末习题使材料既适合课堂使用也适合自学。本章重点讨论散度度量在模型训练中的作用，示例涵盖从线性回归、逻辑回归到自回归模型、变分自编码器、扩散模型、生成对抗网络和基于分数的模型。介绍了证据下界（ELBO）、f-散度和Fisher散度。特别是，对生成扩散模型的处理提供了比文献中更系统、更明确的推导。

英文摘要

This manuscript contains preprint of a chapter under consideration for inclusion in the forthcoming third edition of {\em Cover and Thomas's Elements of Information Theory}, posted with permission from Wiley. The table of contents EIT-3 ToC of the new edition can be found at: https://docs.google.com/document/d/1L-m4oQEJw1PJhoxBeMwrrBD8S_HmvzMEkPbYvS24980/edit?usp=sharing . For feedback, please contact abbas@ee.stanford.edu Learning and information theory intersect in both model training and the characterization of fundamental performance limits. This manuscript provides a concise and accessible treatment of the first intersection, requiring only basic background in information theory and statistics at the senior undergraduate or first-year graduate level. End-of-chapter exercises make the material well suited for classroom use as well as self-study. The chapter focuses on the role of divergence measures in model training, with examples ranging from linear and logistic regression to autoregressive models, variational autoencoders, diffusion models, generative adversarial networks, and score-based models. It introduces the evidence lower bound (ELBO), f-divergences, and the Fisher divergence. In particular, the treatment of the generative diffusion model provides a more systematic and explicit derivation than is typical in the literature.

URL PDF HTML ☆

赞 0 踩 0

2605.18315 2026-06-19 math.OC stat.ML 版本更新

Attention-based PCA

基于注意力的PCA

Rodrigo Maulen-Soto, Claire Boyer

AI总结本文研究了注意力机制在无监督问题PCA中的表现，证明在高斯数据上训练时，softmax和线性注意力层学习的参数与协方差矩阵的主特征向量对齐，建立了与PCA的直接联系，并扩展到上下文设置中。

详情

AI中文摘要

我们通过一个经典无监督问题——主成分分析（PCA）的视角研究注意力机制。我们证明，当在高斯数据上训练时，softmax和线性注意力层学习的参数与协方差矩阵的主特征向量对齐，从而建立了与PCA的直接且明确的联系。我们的分析涵盖了有限和无限提示范围。在无限提示极限下，我们证明收敛到与主谱方向对齐的全局最优解；而在有限提示设置中，我们显示相同的行为在采样效应范围内出现。我们进一步将分析扩展到具有突出Wishart协方差的上下文设置中，其中注意力成功地恢复了底层信号方向。这些结果表明，在无监督目标下，注意力本质上执行类似于PCA的计算，为其实现表示学习能力提供了理论基础。

英文摘要

We study attention mechanisms through the lens of a canonical unsupervised problem: principal component analysis (PCA). We show that, when trained on Gaussian data, both softmax and linear attention layers learn parameters that align with the principal eigenvectors of the covariance matrix, thereby establishing a direct and explicit connection with PCA. Our analysis covers both finite and infinite prompt regimes. In the infinite-prompt limit, we prove convergence to globally optimal solutions aligned with the leading spectral direction, while in the finiteprompt setting we show that the same behavior emerges up to sampling effects. We further extend the analysis to an in-context setting with spiked Wishart covariances, where attention successfully recovers the underlying signal direction. These results demonstrate that attention inherently performs PCA-like computations under unsupervised objectives, providing a theoretical foundation for its representation-learning capabilities.

URL PDF HTML ☆

赞 0 踩 0

2604.21097 2026-06-19 stat.ML cs.LG 版本更新

Learning to Emulate Chaos: Adversarial Optimal Transport Regularization

学习模拟混沌：对抗最优传输正则化

Gabriel Melo, Leonardo Santiago, Peter Y. Lu

发表机构 * Department of Mechanical and Aerospace Engineering, North Carolina State University, Raleigh, NC（北卡罗来纳州立大学机械与航空航天工程系）； Department of Electrical and Computer Engineering, Tufts University, Medford, MA（塔夫茨大学电气与计算机工程系）； Work performed while at the University of Campinas（在坎皮纳斯大学工作期间）

AI总结针对混沌动力学模拟中长程统计保真度低的问题，提出基于对抗最优传输的目标函数，联合学习高质量汇总统计量和物理一致的模拟器，理论分析与实验验证了Sinkhorn散度和WGAN对偶形式的有效性。

详情

AI中文摘要

混沌出现在许多复杂动力系统中，从天气到电网，但使用机器学习模拟器等数据驱动方法难以准确建模。虽然模拟器是加速模拟和解决逆问题的有前途的工具，但它们仍然难以学习混沌动力学，其中对初始条件的敏感性使得精确的长期预测不可行，尤其是在给定噪声数据的情况下。最近的工作转而训练模拟器以匹配混沌吸引子的统计特性，但这些方法通常依赖于手工制作的汇总统计量或大型、多样的多环境数据集。在这项工作中，我们提出了一类对抗最优传输目标，可以从单个噪声轨迹中联合学习高质量的汇总统计量和物理一致的模拟器。我们从理论上分析并实验验证了我们的方法的Sinkhorn散度公式（2-Wasserstein）和WGAN风格的对偶公式（1-Wasserstein）。在各种混沌系统（包括具有高维时空混沌的系统）上的数值实验表明，使用我们提出的目标训练的模拟器具有显著改善的长期统计保真度。

英文摘要

Chaos arises in many complex dynamical systems, from weather to power grids, but is difficult to accurately model with data-driven methods such as machine learning emulators. While emulators are promising tools for accelerating simulations and solving inverse problems, they still struggle to learn chaotic dynamics, where sensitivity to initial conditions renders exact long-term forecasts infeasible, especially given noisy data. Recent work instead trains emulators to match the statistical properties of chaotic attractors, but these approaches often rely on handcrafted summary statistics or large, diverse multi-environment datasets. In this work, we propose a family of adversarial optimal transport objectives that can jointly learn high-quality summary statistics and a physically consistent emulator from a single noisy trajectory. We theoretically analyze and experimentally validate a Sinkhorn divergence formulation (2-Wasserstein) and a WGAN-style dual formulation (1-Wasserstein) of our approach. Numerical experiments across a variety of chaotic systems, including ones with high-dimensional spatiotemporal chaos, show that emulators trained using our proposed objectives have significantly improved long-term statistical fidelity.

URL PDF HTML ☆

赞 0 踩 0

2604.03146 2026-06-19 stat.ML cs.LG 版本更新

Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization

高维经验风险最小化中高斯普适性破坏的表征

Chiheb Yaakoubi, Cosme Louart, Malik Tiomoko, Zhenyu Liao

发表机构 * School of Data Science, The Chinese University of Hong Kong, Shenzhen, China ； Huawei Noah's Ark Lab, Huawei Technologies, Paris, France ； School of Electronic Information ； Communications, Huazhong University of Science \& Technology, China

AI总结通过将凸高斯极小极大定理推广到非高斯数据，刻画了高维经验风险最小化估计量的渐近分布，揭示了高斯普适性的适用范围与局限。

Comments 28 pages, 5 figures, 1 table

Journal ref ICML 2026

详情

AI中文摘要

我们研究了一般非高斯数据设计下的高维凸经验风险最小化（ERM）。通过启发式地将凸高斯极小极大定理（CGMT）扩展到非高斯设置，我们推导出关键统计量的渐近极小极大表征，从而能够近似ERM估计量 $\hat{\theta}$ 的均值 $\mu_{\hat{\theta}}$ 和协方差 $C_{\hat{\theta}}$。具体地，在数据矩阵的集中假设以及损失和正则化子的标准正则性条件下，我们证明：对于独立于训练数据的测试协变量 $x$，投影 $\hat{\theta}^\top x$ 近似遵循 $\mu_{\hat{\theta}}^\top x$ 的一般非高斯分布与一个独立中心高斯变量（方差为 $\mathrm{tr}(C_{\hat{\theta}} \mathbb{E}[xx^\top])$）的卷积。这一结果阐明了ERM高斯普适性的范围和局限。此外，我们证明任何 $\mathcal{C}^2$ 正则化子渐近等价于一个由其零点的Hessian矩阵和 $\mu_{\hat{\theta}}$ 处的梯度唯一确定的二次型。我们提供了跨不同损失和模型的数值模拟，以验证我们的理论预测和定性见解。

英文摘要

We study high-dimensional convex empirical risk minimization (ERM) under general non-Gaussian data designs. By heuristically extending the Convex Gaussian Min-Max Theorem (CGMT) to non-Gaussian settings, we derive an asymptotic min-max characterization of key statistics, enabling approximation of the mean $μ_{\hatθ}$ and covariance $C_{\hatθ}$ of the ERM estimator $\hatθ$. Specifically, under a concentration assumption on the data matrix and standard regularity conditions on the loss and regularizer, we show that for a test covariate $x$ independent of the training data, the projection $\hatθ^\top x$ approximately follows the convolution of the generally non-Gaussian distribution of $μ_{\hatθ}^\top x$ with an independent centered Gaussian variable of variance $\mathrm{tr}(C_{\hatθ} \mathbb{E}[xx^\top])$. This result clarifies the scope and limits of Gaussian universality for ERMs. Additionally, we prove that any $\mathcal{C}^2$ regularizer is asymptotically equivalent to a quadratic form determined solely by its Hessian at zero and gradient at $μ_{\hatθ}$. Numerical simulations across diverse losses and models are provided to validate our theoretical predictions and qualitative insights.

URL PDF HTML ☆

赞 0 踩 0

2603.10184 2026-06-19 stat.ML cs.LG 版本更新

Stabilizing Bandits using Regularization: Precise Regret and A Quantitative Central Limit Theorem

使用正则化稳定赌博机：精确遗憾与定量中心极限定理

Budhaditya Halder, Ishan Sengupta, Koustav Chowdhury, Samya Praharaj, Koulik Khamaru

发表机构 * Department of Statistics, Rutgers University（罗切斯特大学统计系）； Indian Statistical Institute, Kolkata（加尔各答印度统计研究所）

AI总结本文提出一种精细的稳定性条件，证明正则化随机镜像下降算法满足该条件，并推导出自适应采样下经验奖励估计的非渐近Berry-Esseen界、匹配的遗憾上下界，以及抗腐败下的渐近正态性，同时揭示正则化是有效推断的必要代价。

Comments Updated rate of convergence and precise regret in version 2

详情

AI中文摘要

由于自适应采样违反了经典渐近理论中的独立性假设，使用赌博机数据进行统计推断面临根本性挑战。近期工作将稳定性~\citep{laiwei82} 确定为自适应下有效推断的充分条件。本文首先提出一个精细的稳定性条件，以在线算法的迭代形式表述，并证明一大类正则化随机镜像下降算法满足该条件。这一精细条件使我们能够在多个方面加强~\citet{laiwei82} 的渐近结果。首先，我们推导出自适应采样下经验奖励估计的非渐近Berry-Esseen界。其次，我们推导出所提算法遗憾的匹配非渐近上下界，从而精确刻画其遗憾。第三，我们证明这些正则化算法在给定水平的对抗性腐败下保持渐近正态性和有效推断。最后，我们表明正则化是必要的而非偶然的：Lai-Wei稳定性与最优的$O(\sqrt{T})$遗憾率（如EXP3等非正则化算法所达到的）不相容，因此受控的多对数级遗憾膨胀是有效推断的代价。

英文摘要

Statistical inference with bandit data presents fundamental challenges owing to adaptive sampling, which violates the independence assumptions underlying classical asymptotic theory. Recent work has identified stability~\citep{laiwei82} as a sufficient condition for valid inference under adaptivity. This paper first provides a refined stability condition, stated in terms of the iterates of an online algorithm, and shows that a large class of regularized stochastic-mirror-descent-style algorithms satisfy it. This refined condition allows us to strengthen the asymptotic results of~\citet{laiwei82} in several ways. First, we derive a non-asymptotic Berry--Esseen bound for the empirical reward estimates under adaptive sampling. Second, we derive matching non-asymptotic upper and lower bounds on the regret of the proposed algorithm, yielding a precise characterization of its regret. Third, we show that these regularized algorithms preserve asymptotic normality and valid inference under a prescribed level of adversarial corruption. Finally, we show that regularization is necessary rather than incidental: Lai--Wei stability is incompatible with the optimal $O(\sqrt{T})$ regret rate -- the rate attained by unregularized algorithms such as EXP3 -- so that a controlled, polylogarithmic inflation in regret is the price of valid inference.

URL PDF HTML ☆

赞 0 踩 0

2601.14430 2026-06-19 stat.ML cs.LG 版本更新

Meta Flow Maps enable scalable reward alignment

元流映射实现可扩展的奖励对齐

Peter Potaptchik, Adhi Saravanan, Abbas Mammadov, Alvaro Prat, Michael S. Albergo, Yee Whye Teh

发表机构 * University of Oxford（牛津大学）； Harvard University（哈佛大学）； Kempner Institute（凯普纳研究所）

AI总结提出元流映射（MFMs）框架，通过可微分的单步后验采样实现高效价值函数估计，从而无需轨迹模拟即可进行推理时引导和离策略微调，显著降低计算成本。

详情

AI中文摘要

控制生成模型在计算上是昂贵的。这是因为与奖励函数的最优对齐——无论是通过推理时引导还是微调——都需要估计价值函数。这一任务需要访问条件后验 $p_{1|t}(x_1|x_t)$，即与中间状态 $x_t$ 一致的干净数据 $x_1$ 的分布，这一要求通常迫使方法诉诸昂贵的轨迹模拟。为了解决这一瓶颈，我们引入了元流映射（MFMs），这是一个将一致性模型和流映射扩展到随机机制的框架。MFMs 被训练为执行随机单步后验采样，从任意中间状态生成任意多个独立同分布的干净数据 $x_1$ 样本。关键在于，这些样本提供了一个可微分的重参数化，从而解锁了高效的价值函数估计。我们利用这一能力解决了两种范式中的瓶颈：实现无需内部展开的推理时引导，并促进对一般奖励的无偏、离策略微调。实验上，我们的单粒子引导 MFM 采样器在 ImageNet 上以极少的计算量在多个奖励上优于 Best-of-1000 基线。

英文摘要

Controlling generative models is computationally expensive. This is because optimal alignment with a reward function--whether via inference-time steering or fine-tuning--requires estimating the value function. This task demands access to the conditional posterior $p_{1|t}(x_1|x_t)$, the distribution of clean data $x_1$ consistent with an intermediate state $x_t$, a requirement that typically compels methods to resort to costly trajectory simulations. To address this bottleneck, we introduce Meta Flow Maps (MFMs), a framework extending consistency models and flow maps into the stochastic regime. MFMs are trained to perform stochastic one-step posterior sampling, generating arbitrarily many i.i.d. draws of clean data $x_1$ from any intermediate state. Crucially, these samples provide a differentiable reparametrization that unlocks efficient value function estimation. We leverage this capability to solve bottlenecks in both paradigms: enabling inference-time steering without inner rollouts, and facilitating unbiased, off-policy fine-tuning to general rewards. Empirically, our single-particle steered-MFM sampler outperforms a Best-of-1000 baseline on ImageNet across multiple rewards at a fraction of the compute.

URL PDF HTML ☆

赞 0 踩 0

2509.15822 2026-06-19 stat.ML cs.LG math.PR math.ST stat.TH 版本更新

Phase Transition for Stochastic Block Model with more than $\sqrt{n}$ Communities

具有多于 $\sqrt{n}$ 个社区的随机块模型的相变

Alexandra Carpentier, Christophe Giraud, Nicolas Verzelen

发表机构 * Institut für Mathematik – Universität Potsdam, Potsdam, Germany（波恩大学数学研究所，德国波恩）； Laboratoire de Mathématiques d’Orsay, Université Paris-Saclay, CNRS, France（奥赛数学实验室，巴黎-萨克雷大学，法国 CNRS）； INRAE, Institut Agro, MISTEA, Univ. Montpellier, France（国家农业研究院，蒙彼利埃大学，法国）

AI总结本文证明在随机块模型中，当社区数 $K\geq \sqrt{n}$ 时，低度多项式在 Chin 等人提出的阈值以下无法恢复社区，而通过计数特定子图可在多项式时间内实现恢复，支持了新相变阈值的猜想。

详情

AI中文摘要

统计物理的预测表明，在随机块模型（SBM）中，当社区数 $K$ 固定时，社区恢复在 Kesten-Stigum (KS) 阈值以上（且仅在其以上）可以在多项式时间内实现。这一猜想催生了丰富的文献，证明在 KS 阈值以上的 SBM 中，非平凡社区恢复确实是可能的。只要 $K\ll \sqrt{n}$（其中 $n$ 是观测图中的节点数），KS 阈值以下低度多项式（LDP）的失败也被证明。当 $K\geq \sqrt{n}$ 时，Chin 等人（2025）最近证明，在稀疏机制中，通过计数非回溯路径，可以在 KS 阈值以下的多项式时间内实现社区恢复。这一突破使他们提出了多社区机制 $K\geq \sqrt{n}$ 的新阈值。在这项工作中，我们为他们的猜想提供了证据：\n1- 我们证明，对于任意图密度，LDP 无法在 Chin 等人（2025）提出的阈值以下恢复社区；\n2- 我们证明，在所提出的阈值以上，不仅是在 Chin 等人（2025）考虑的稀疏机制中，而且在适度稀疏机制中，通过计数受 LDP 分析启发的某些特定子图，可以在多项式时间内实现社区恢复。\n特别地，计数长度为 $\log(n)$ 的自避路径（这与基于非回溯算子的谱算法密切相关）仅在稀疏机制中是最优的。在更密集的机制中，必须考虑基于循环放大的更复杂子图。

英文摘要

Predictions from statistical physics postulate that recovery of the communities in the Stochastic Block Model (SBM) with a fixed number $K$ of communities is possible in polynomial time above, and only above, the Kesten-Stigum (KS) threshold. This conjecture has given rise to a rich literature, proving that non-trivial community recovery is indeed possible in SBM above the KS threshold. Failure of low-degree polynomials (LDP) below the KS threshold was also proven, as long as $K\ll \sqrt{n}$, where $n$ is the number of nodes in the observed graph. When $K\geq \sqrt{n}$, Chin et al.(2025) recently proved that, in a \emph{sparse regime}, community recovery in polynomial time is possible below the KS threshold by counting non-backtracking paths. This breakthrough led them to postulate a new threshold for the many-communities regime $K\geq \sqrt{n}$. In this work, we provide evidence supporting their conjecture:\\ 1- We prove that, for \emph{any graph density}, LDP fail to recover communities below the threshold postulated by Chin et al.(2025) ;\\ 2- We prove that community recovery is possible in polynomial time above the postulated threshold, not only in the \emph{sparse regime} considered in Chin et al.~(2025), but also in \emph{moderately sparse regimes}, by counting occurrences of some specific motifs inspired by the LDP analysis.\\ In particular, counting self-avoiding paths of length $\log(n)$, which is closely related to spectral algorithms based on the Non-Backtracking operator, is optimal only in the sparse regime. More complex motifs based on the blow-up of a cycle must be considered in denser regimes.

URL PDF HTML ☆

赞 0 踩 0

2104.08928 2026-06-19 stat.ML cs.CL cs.LG 版本更新

Group-Sparse Matrix Factorization for Transfer Learning of Word Embeddings

面向词嵌入迁移学习的组稀疏矩阵分解

Kan Xu, Xuanyi Zhao, Hamsa Bastani, Osbert Bastani

发表机构 * W. P. Carey School of Business, Arizona State University（亚利桑那州立大学韦伯商学院）； University of Pennsylvania（宾夕法尼亚大学）； Wharton School, University of Pennsylvania（宾夕法尼亚大学沃顿商学院）

AI总结提出一种基于组稀疏惩罚的两阶段估计器，通过结合大规模语料和少量领域数据高效迁移学习领域特定的词嵌入，并证明了其泛化误差界和非凸目标函数的局部最优与全局最优统计等价。

详情

AI中文摘要

非结构化文本为许多领域的决策者提供了丰富的数据源，从零售中的产品评论到医疗保健中的护理记录。为了利用这些信息，单词通常通过无监督学习算法（如矩阵分解）转化为词嵌入——编码单词之间语义关系的向量。然而，从训练数据有限的新领域学习词嵌入可能具有挑战性，因为在新领域中含义/用法可能不同，例如，单词“positive”通常具有积极情感，但在医疗记录中通常具有消极情感，因为它可能意味着患者检测出疾病阳性。在实践中，我们预计只有少数领域特定的单词可能具有新含义。我们提出了一种直观的两阶段估计器，通过组稀疏惩罚利用这种结构，通过结合大规模文本语料库（如维基百科）和有限的领域特定文本数据，高效地迁移学习领域特定的词嵌入。我们限定了迁移学习估计器的泛化误差，证明当只有少量嵌入在领域间改变时，它可以用显著更少的领域特定数据实现高精度。此外，我们证明了在标准正则化条件下，由非凸目标函数识别的所有局部最小值与全局最小值在统计上不可区分，这意味着我们的估计器可以高效计算。我们的结果首次给出了组稀疏矩阵分解的界限，这可能具有独立意义。我们通过与自然语言处理中最先进的微调启发式方法进行实证比较来评估我们的方法。

英文摘要

Unstructured text provides decision-makers with a rich data source in many domains, ranging from product reviews in retail to nursing notes in healthcare. To leverage this information, words are typically translated into word embeddings -- vectors that encode the semantic relationships between words -- through unsupervised learning algorithms such as matrix factorization. However, learning word embeddings from new domains with limited training data can be challenging, because the meaning/usage may be different in the new domain, e.g., the word ``positive'' typically has positive sentiment, but often has negative sentiment in medical notes since it may imply that a patient tested positive for a disease. In practice, we expect that only a small number of domain-specific words may have new meanings. We propose an intuitive two-stage estimator that exploits this structure via a group-sparse penalty to efficiently transfer learn domain-specific word embeddings by combining large-scale text corpora (such as Wikipedia) with limited domain-specific text data. We bound the generalization error of our transfer learning estimator, proving that it can achieve high accuracy with substantially less domain-specific data when only a small number of embeddings are altered between domains. Furthermore, we prove that all local minima identified by our nonconvex objective function are statistically indistinguishable from the global minimum under standard regularization conditions, implying that our estimator can be computed efficiently. Our results provide the first bounds on group-sparse matrix factorization, which may be of independent interest. We empirically evaluate our approach compared to state-of-the-art fine-tuning heuristics from natural language processing.

URL PDF HTML ☆

赞 0 踩 0

2406.01557 2026-06-19 stat.ME stat.AP 版本更新

Flexible aggregation of compositional predictors with shared effects for microbiome association analysis

共享效应组合预测因子的灵活聚合用于微生物组关联分析

Satabdi Saha, Liangliang Zhang, Michele Guindani, Kim-Anh Do, Christine B. Peterson

AI总结提出BRACE方法，通过尖峰-聚类先验和投影约束高斯先验，实现微生物组数据的自适应聚类和变量选择，识别与结果共享效应的关键特征。

详情

AI中文摘要

微生物组分析的最新进展为微生物群落的分子动态提供了前所未有的见解，激发了揭示微生物组在人类健康中关键作用的兴趣。然而，由于微生物组数据的高维、稀疏和组成性，识别与临床结果相关的微生物特征仍然具有挑战性。此外，许多微生物分类群虽然被分类为不同的，但可能共享功能角色，使传统的变量选择方法复杂化。为了克服这些障碍，我们引入了具有聚合组成效应的贝叶斯回归（BRACE），这是一种新方法，使用结合伯努利活动指标的尖峰-聚类先验、有限活动集上的Ewens可交换分割先验以及聚类效应上的投影约束高斯先验，进行数据自适应聚类和变量选择。我们工作的方法论创新在于如何将Ewens分割先验与聚类原子上的投影约束高斯相结合，以强制执行总和为零的约束。BRACE将具有相似效应的微生物分类群分组，产生更可解释的模型，同时实现有效的降维。通过综合模拟和一项检查口腔微生物组组成对胰岛素抵抗影响的真实应用，我们证明了BRACE在识别具有共享效应的关键特征方面优于现有方法。

英文摘要

Ongoing advancements in microbiome profiling have provided unprecedented insights into the molecular dynamics of microbial communities, sparking a surge of interest in uncovering the microbiome's critical role in human health. Identifying microbial features linked to clinical outcomes, however, remains challenging due to the high-dimensional, sparse, and compositional nature of microbiome data. Additionally, many microbial taxa, although classified as distinct, may share functional roles, complicating traditional variable selection methods. To overcome these obstacles, we introduce Bayesian Regression with Agglomerated Compositional Effects (BRACE), a novel approach using a spike-and-cluster prior combining Bernoulli activity indicators, an Ewens exchangeable partition prior on the finite active set, and a projection-based constrained Gaussian prior on cluster effects to perform data-adaptive clustering and variable selection. The methodological innovation of our work lies in how we combine the Ewens partition prior with a projection-based constrained Gaussian on the cluster atoms to enforce the sum-to-zero constraint. BRACE groups microbial taxa with similar effects on the outcome, yielding more interpretable models while enabling effective dimension reduction. Through comprehensive simulations and a real-world application examining the influence of oral microbiome composition on insulin resistance, we demonstrate BRACE's superior performance over existing methods, particularly in identifying key features with shared effects on outcomes.

URL PDF HTML ☆

赞 0 踩 0

2605.15896 2026-06-19 stat.ME stat.AP 版本更新

A Model-Agnostic Bootstrap for Macro-Level Claims Reserving Under the Conditioning Principle

基于条件原理的宏观层面赔款准备金模型无关自助法

Robin Van Oirbeek, Tim Verdonck

AI总结本文提出一种满足条件原理的自助法，用于宏观层面赔款准备金估计，通过Dirichlet-Gamma层次结构实现精确校准，改进了现有自助法的覆盖误差问题。

Comments 23 pages, v2: correction of the interpretation of the $κ$ parameter

详情

AI中文摘要

正确的推断对象是条件预测分布p(R|D,θ̂)，其中D是观察到的三角形保持固定。我们称之为条件原理。所有现有自助法违反这一原理，通过在预测循环中对D的函数进行重采样，产生O(1)的覆盖误差，随着三角形增大不消失。Dirichlet-Gamma层次结构允许一种满足该原理的自助法：S^{IBNP}_i = X^{obs}_i (1-W_i)/W_i，其中W_i ~ Beta(cF_{I-i}, c(1-F_{I-i}))直接从其预测分布中采样。仅模拟分配比例W_i；观察到的三角形保持固定。因此继承了任何开发比例方法（链式梯度、Bornhuetter-Ferguson、Cape Cod或其他）的校准，使其模型无关。覆盖缺陷为O(I^{-1/2})，与开发时期数量无关。在复合泊松数据生成过程中，该自助法对于每个F_{I-i} ∈ (0,1)是保守的：预测标准差分析上超过真实值的因子为1/√F_{I-i}。ODP自助法通过两种相反方向的机制违反该原理：重新估计在ODP DGP下膨胀自助方差，而缺失事故年脆弱性在脆弱性DGP下缩小它。结果覆盖差异为Ω(1)，无论I如何，为Meyers(2015)文档的跨投资组合误校准异质性提供了结构解释。链式梯度、Bornhuetter-Ferguson和Cape Cod在稀疏、信息丰富和池化先验下分别作为可信度估计量，计数和金额具有相同结构。集中程度c作为诊断：ĉ < 30表明开发非平稳。

英文摘要

The correct inferential object in claims reserving is the conditional predictive distribution $p(R \mid \mathcal{D}, \hatθ)$, where $\mathcal{D}$ is the observed triangle held fixed. We refer to this as the conditioning principle. All existing bootstraps violate it by resampling functions of $\mathcal{D}$ inside the predictive loop, producing an $O(1)$ coverage error that does not vanish as the triangle grows. The Dirichlet-Gamma hierarchy admits a bootstrap that satisfies the principle exactly: $S^{IBNP}_i = X^{obs}_i (1-W_i)/W_i$ with $W_i \sim \mathrm{Beta}(c\hat{F}_{I-i}, c(1-\hat{F}_{I-i}))$ sampled directly from its predictive distribution. Only the allocation proportion $W_i$ is simulated; the observed triangle is held fixed. It thus inherits calibration from any development-proportion method (Chain-Ladder, Bornhuetter-Ferguson, Cape Cod, or other), making it model-agnostic. The coverage deficit is $O(I^{-1/2})$, independent of the number of development periods. Under compound Poisson data-generating processes the bootstrap is conservative for every $F_{I-i} \in (0,1)$: the predictive standard deviation analytically exceeds the true value by the factor $1/\sqrt{F_{I-i}}$. The ODP bootstrap violates the principle through two mechanisms in opposite directions: re-estimation inflates bootstrap variance under the ODP DGP, while missing accident-year frailty deflates it under frailty DGPs. The resulting coverage discrepancy is $Ω(1)$ regardless of $I$, providing a structural explanation for the cross-portfolio miscalibration heterogeneity documented by Meyers (2015). Chain-Ladder, Bornhuetter-Ferguson and Cape Cod emerge as credibility estimators under diffuse, informative and pooling priors respectively, with identical structure for counts and amounts. The concentration $c$ serves as a diagnostic: $\hat{c} < 30$ signals non-stationary development.

URL PDF HTML ☆

赞 0 踩 0

2605.15811 2026-06-19 stat.ME stat.AP 版本更新

The Negative Binomial Chain-Ladder: A Full Likelihood Model for Claim Count Reserving

负二项链梯法：一种完整的似然模型用于赔款准备

Robin Van Oirbeek

AI总结本文提出负二项链梯模型，通过泊松-伽马构造自然产生负二项分布，提供更清晰的生成解释，统一了链梯方法家族，并通过模拟验证了模型的稳健性。

Comments 35 pages, 3 figures, v2: correction of the interpretation of the $κ$ parameter

详情

AI中文摘要

链梯法仍是非寿险赔款准备的主要宏观技术，但其经典形式缺乏一致的概率基础。现有随机扩展，包括马科模型和过分散泊松（ODP）框架，提供不确定性度量但依赖二阶矩假设或准似然方差结构。本文开发了一种负二项链梯（NB-CL）模型，将链梯方法嵌入完整的似然框架中。关键贡献是微观层面推导，显示负二项分布自然源于泊松-伽马构造：索赔按具有伽马分布年度异质性的泊松过程到达，聚合产生负二项增量计数。此推导赋予分散参数κ结构解释，即年度异质性，而非随意的过分散调整。NB-CL模型在κ→∞极限下推广泊松链梯模型，与ODP模型共享点估计但方差函数不同（二次vs线性），并在单个概率层级内统一链梯家族。开发了参数Bootstrap程序以纳入过程和参数不确定性。模拟研究证实，在正确规范下，当分散参数经过偏差校正后，覆盖率接近名义水平；在模型不规范情况下表现出受控退化。对索赔计数数据（澳大利亚机动车身体伤害）和已付金额（泰勒-阿什）的实证研究证实了κ的结构解读以及在金额情况下的工作近似状态。

英文摘要

The Chain-Ladder (CL) method remains the dominant macro-level technique for claims reserving in non-life insurance, yet its classical formulation lacks a coherent probabilistic foundation. Existing stochastic extensions-including the Mack model and the Over-Dispersed Poisson (ODP) framework-provide measures of uncertainty but rely on second-moment assumptions or quasi-likelihood variance structures without clear generative interpretations. This paper develops a Negative Binomial Chain-Ladder (NB-CL) model that embeds the CL method within a full likelihood-based framework. The key contribution is a micro-level derivation showing that the negative binomial distribution arises naturally from a Poisson-Gamma construction: claims arrive according to a Poisson process with Gamma-distributed accident-year heterogeneity, and aggregation yields negative binomial incremental counts. This derivation gives the dispersion parameter $κ$ a structural interpretation as accident-year heterogeneity, rather than an ad-hoc overdispersion adjustment. The NB-CL model generalises the Poisson Chain-Ladder model in the limit $κ\to \infty$, shares the point estimates of the ODP model while differing in its variance function (quadratic vs. linear), and unifies the Chain-Ladder family within a single probabilistic hierarchy. A parametric bootstrap procedure is developed to incorporate both process and parameter uncertainty. Simulation studies confirm near-nominal coverage under correct specification once the dispersion parameter is bias-corrected, and a controlled degradation under model misspecification. Empirical illustrations on claim count data (Australian motor bodily injury) and paid amounts (Taylor-Ashe) document both the structural reading of $κ$ and the working-approximation status of the model in the amounts case.

URL PDF HTML ☆

赞 0 踩 0

2604.03076 2026-06-19 stat.AP 版本更新

Carbon cost pass-through rate in power system: evidence from Italy under the EU ETS

电力系统中碳成本传导率：来自欧盟排放交易体系下意大利的证据

Pierdomenico Duttilo, Francesco Lisi

AI总结研究欧盟排放交易体系下碳成本在意大利电力市场的传导率，基于2016-2024年数据，采用自回归线性回归模型，发现全国平均传导率约32%，且各市场区域存在显著异质性。

详情

AI中文摘要

本文研究了欧盟排放交易体系（EU ETS）下碳定价对意大利电力市场的影响，重点关注第三和第四阶段（2016-2024年）各市场区域的碳成本传导率（CPTR）。利用日度数据，研究采用基于自回归动态线性回归模型的计量经济学框架，估计碳成本在批发电力价格中的反映程度。进一步通过稳健性检验和分位数回归，评估CPTR在不同燃料价差水平下的变化。结果表明，碳成本正向且显著地传导至电力价格，证实了碳定价作为关键市场驱动因素的相关性。然而，传导不完全，CPTR值始终低于100%。在国家层面，传导率估计约为32%，第三阶段和第四阶段之间无统计显著变化。各市场区域出现显著异质性：在北部、中北部和撒丁岛，第四阶段传导率上升，而在中南部和西西里岛则下降，反映了发电结构、碳强度和市场条件的差异。总体而言，研究结果强调了市场区域因素在塑造电力市场碳定价有效性中的重要性。

英文摘要

This paper investigates the impact of carbon pricing under the EU Emissions Trading System (EU ETS) on the Italian electricity market, focusing on the carbon cost pass-through rate (CPTR) across market zones during Phases 3 and 4 (2016-2024). Using daily data, the study applies an econometric framework based on a linear regression model with autoregressive dynamics to estimate the extent to which carbon costs are reflected in wholesale electricity prices. It further incorporates robustness checks and quantile regression to assess how the CPTR varies across different fuel spread levels. The results show that carbon costs are positively and significantly transmitted to electricity prices, confirming the relevance of carbon pricing as a key market driver. However, pass-through is incomplete, with CPTR values consistently below 100%. At the national level, the pass-through estimate is around 32%, with no statistically significant change between Phase 3 and Phase 4. Substantial heterogeneity emerges across market zones: pass-through increases in the North, Centre-North, and Sardinia during Phase 4, while it declines in the Centre-South and Sicily, reflecting differences in generation mix, carbon intensity, and market conditions. Overall, the findings highlight the importance of market zones factors in shaping the effectiveness of carbon pricing in electricity markets.

URL PDF HTML ☆

赞 0 踩 0

2603.06820 2026-06-19 econ.EM stat.OT 版本更新

Hippocratic Utility and Status Quo Bias

希波克拉底效用与现状偏见

Tomasz Strzalecki

AI总结本文通过简单例子揭示一种重视失去生命多于拯救生命的效用函数，其适用范围比最初看起来有限得多。

2410.19333 2026-06-19 econ.GN physics.soc-ph q-fin.EC stat.AP 版本更新

Swiss-system chess tournaments and unfairness

瑞士制国际象棋锦标赛与不公平性

László Csató, Alex Krumer

AI总结研究瑞士制国际象棋锦标赛中轮次奇偶性导致的不公平性，发现多执白一局的选手得分显著更高，建议采用偶数轮次和平衡颜色分配机制。

Comments 13 pages, 4 tables

2512.02203 2026-06-19 econ.EM stat.AP 版本更新

Statistical Inference in Large Multi-way Networks

大规模多路网络中的统计推断

Lucas Resende, Guillaume Lecué, Lionel Wilner, Philippe Choné

AI总结提出一种基于分类任务的多路网络结构参数估计方法，无需固定效应数量与结构假设，避免 incidental parameter 问题，在稀疏网络中比 PPML 更快且置信区间更可靠，应用于法国医疗政策因果效应分析。

Comments Working paper

详情

AI中文摘要

我们提出了一种新方法，用于在多路网络中估计结构参数，同时控制丰富的固定效应结构。该方法基于一系列分类任务，对固定效应的数量和结构均不敏感。与完全最大似然方法相比，我们的估计量不会受到 incidental parameter 问题的影响。对于稀疏连接的网络，它在计算上也比 PPML 更快。我们提供的经验证据表明，我们的估计量比 PPML 及其偏差修正策略产生更可靠的置信区间。即使在模型误设下，这些改进仍然成立，并且在稀疏设置中更为显著。虽然 PPML 在密集、低维数据中仍具有竞争力，但我们的方法为多路模型提供了一种稳健的替代方案，能够随稀疏性高效扩展。该方法被应用于研究政策改革对法国医疗空间可达性的因果效应。

英文摘要

We propose a new method to estimate structural parameters in multi-way networks while controlling for rich structures of fixed effects. The method is based on a series of classification tasks and is agnostic to both the number and structure of fixed effects. In contrast to full maximum likelihood approaches, our estimator does not suffer from the incidental parameter problem. For sparsely connected networks, it is also computationally faster than PPML. We provide empirical evidence that our estimator yields more reliable confidence intervals than PPML and its bias-correction strategies. These improvements hold even under model misspecification and are more pronounced in sparse settings. While PPML remains competitive in dense, low-dimensional data, our approach offers a robust alternative for multi-way models that scales efficiently with sparsity. The method is applied to study the causal effect of a policy reform on spatial accessibility to health care in France.

URL PDF HTML ☆

赞 0 踩 0

2601.02322 2026-06-19 stat.ME cs.LG 版本更新

Environment-Adaptive Covariate Selection: Learning When to Use Spurious Correlations for Out-of-Distribution Prediction

环境自适应协变量选择：学习何时利用虚假相关进行分布外预测

Shuozhi Zuo, Yixin Wang

发表机构 * Department of Statistics, University of Michigan, Ann Arbor（统计系，密歇根大学，安阿伯分校）

AI总结针对分布外预测中协变量选择问题，提出环境自适应算法，根据环境特征动态选择协变量集，在模拟和实际数据中优于静态方法。

详情

AI中文摘要

一种常见的分布外预测方法将模型限制为因果或不变协变量，以避免可能随环境变化的虚假关联。尽管具有理论吸引力，但当仅观察到结果的部分因果父节点时，该策略可能不如经验风险最小化。在这种情况下，非因果协变量可以作为未观察到的因果父节点的代理，当代理关系稳定时改善预测，但当变化破坏这种关系时则有害。因此，最优协变量集可能取决于所遇到的具体变化。由于不同的变化会在未标记的协变量分布中留下特征，我们提出了一种环境自适应协变量选择算法，该算法将环境级摘要映射到特定于环境的协变量集。这些摘要可以是手工制作的，也可以从多环境数据中学习，并且先验因果知识可以作为约束条件纳入。在模拟和应用数据集中，所提出的方法在各种变化下优于静态因果、不变和其他非自适应规则。

英文摘要

A common approach to out-of-distribution prediction restricts models to causal or invariant covariates to avoid spurious associations that may change across environments. Despite its theoretical appeal, this strategy can underperform empirical risk minimization when only a subset of the causal parents of the outcome is observed. In such settings, non-causal covariates can serve as proxies for unobserved causal parents and improve prediction when the proxy relationship is stable, but they can hurt when shifts disrupt that relationship. Thus, the optimal covariate set can depend on the specific shift encountered. Because different shifts leave signatures in the unlabeled covariate distribution, we propose an environment-adaptive covariate selection algorithm that maps environment-level summaries to environment-specific covariate sets. These summaries may be hand-crafted or learned from multi-environment data, and prior causal knowledge can be incorporated as constraints. Across simulations and applied datasets, the proposed method improves over static causal, invariant, and other non-adaptive rules under diverse shifts.

URL PDF HTML ☆

赞 0 踩 0

2508.14009 2026-06-19 stat.OT 版本更新

Understanding Pedagogical Content Knowledge of Introductory Data Science Instructors: An Inaugural Framework

理解入门数据科学教师的教学内容知识：一个初步框架

Sinem Demirci, Mine Doğucu, Andrew Zieffler, Joshua M. Rosenberg

AI总结通过访谈14名入门数据科学教师并分析教学大纲，探索其教学内容知识（PCK）的关键组成部分，为教师发展提供见解，并建立IDS领域的PCK初步框架。

Comments 67 pages, 4 tables

2502.06866 2026-06-19 cs.LG cs.AI econ.EM stat.AP stat.ML 版本更新

Global Ease of Living Index: a machine learning framework for longitudinal analysis of major economies

全球生活便利指数：面向主要经济体纵向分析的机器学习框架

Arun Kumar Selvaraj, Tanay Panat, Rohitash Chandra

发表机构 * Transitional Artificial Intelligence Research Group, School of Mathematics and Statistics（过渡人工智能研究组，数学与统计学学院）； Centre for Artificial Intelligence and Innovation（人工智能与创新中心）； Pingla Institute（Pingla研究所）

AI总结提出全球生活便利指数，结合社会经济和基础设施因素，利用机器学习处理缺失数据，并通过主成分分析和因子分析降维，为政策制定者提供改善生活质量的可操作工具。

详情

AI中文摘要

全球经济、地缘政治条件以及COVID-19疫情等破坏性事件对生活成本和生活质量产生了巨大影响。理解主要经济体中生活成本和生活质量的长期影响至关重要。一个透明且全面的生活指数必须包含生活条件的多个维度。在本研究中，我们提出了一种通过全球生活便利指数量化生活质量的方法，该指数将各种社会经济和基础设施因素整合为一个单一综合得分。我们的指数利用定义生活水平的经济指标，这有助于针对特定领域进行干预改进。我们提出了一个机器学习框架来处理特定国家某些经济指标的数据缺失问题。然后，我们整理并更新数据，并使用降维方法（主成分分析和因子分析）创建自1970年以来主要经济体的生活便利指数。我们的工作通过为政策制定者提供识别需要改进领域（如医疗系统、就业机会和公共安全）的实用工具，显著丰富了相关文献。我们的方法使用开放数据和代码，易于复现并适用于各种情境，为生活质量评估的持续研究和政策制定提供了透明度和可访问性。

英文摘要

The drastic changes in the global economy, geopolitical conditions, and disruptions such as the COVID-19 pandemic have impacted the cost of living and quality of life. It is essential to comprehend the long-term implications of the cost of living and quality of life in major economies. A transparent and comprehensive living index must include multiple dimensions of living conditions. In this study, we present an approach to quantifying the quality of life through the Global Ease of Living Index that combines various socio-economic and infrastructural factors into a single composite score. Our index utilises economic indicators that define living standards, which could help in targeted interventions to improve specific areas. We present a machine learning framework to address missing data for certain economic indicators in specific countries. We then curate and update the data and use a dimensionality reduction approach (Principal Component Analysis and Factor Analysis) to create the Ease of Living Index for major economies since 1970. Our work significantly adds to the literature by offering a practical tool for policymakers to identify areas needing improvement, such as healthcare systems, employment opportunities, and public safety. Our approach with open data and code can be easily reproduced and applied to various contexts, providing transparency and accessibility for ongoing research and policy development in quality-of-life assessment.

URL PDF HTML ☆

赞 0 踩 0

2605.20541 2026-06-19 math.ST math.PR stat.TH 版本更新

Finite-Sample Bounds for Expected Signature Estimation under Weak Dependence

有限样本下弱依赖条件下期望签名估计的界限

Bryson Schenck

AI总结本文研究了在弱依赖条件下，从单一长依赖轨迹估计期望签名的有限样本界限，通过块平均估计器证明了非渐近的均方误差界，并探讨了在不同Hurst指数下的收敛性。

Comments 59 pages, 1 figure

详情

AI中文摘要

期望签名在满足矩增长条件时唯一确定随机粗糙路径的分布，但此前缺乏从单一长依赖轨迹估计其有限样本界限。本文研究了一个平稳随机过程，其样本路径可解释为几何粗糙路径，被划分为等间距观测的块，并证明了块平均估计器的非渐近均方误差界。当路径的Hölder正则性至多为1/2时，需要粗糙路径理论来定义估计量，因为Young积分和Riemann-Stieltjes积分无法定义签名的迭代积分。在矩、平稳性和块签名协方差衰减条件（严格弱于α-混合且适用于长程依赖驱动器）下，误差分为离散化项和波动项，其速率分别由路径正则性和依赖强度决定。通过逐层粗糙因子方差分析，保持有限截断常数显式，并在固定观测预算下获得最优分配规则。本文验证了分数奥本海姆-乌伦贝克过程在三个制度下的假设，即粗糙（Hurst H<1/2）、半鞅（H=1/2）和长程（H>1/2）。蒙特卡罗实验显示经验收敛速率快于理论上界。

英文摘要

The expected signature uniquely determines the law of a random rough path under a moment-growth condition, yet finite-sample bounds for estimating its truncations from a single long dependent trajectory remain unavailable. We study a strictly stationary stochastic process equipped with a geometric rough-path lift, observed in non-overlapping blocks of equally-spaced samples, and prove a non-asymptotic mean-squared error (MSE) bound for the block-averaging estimator of its truncated expected signature. Under moment and stationarity assumptions together with a direct covariance-decay condition on block signatures -- strictly weaker than $α$-mixing and applicable to long-range-dependent processes -- the error separates into a discretization term and a fluctuation term, with rates determined respectively by path regularity and dependence strength. A levelwise rough-factorial variance analysis keeps finite-truncation constants explicit and yields an optimal allocation rule under a fixed observation budget. We verify the assumptions for independent-coordinate fractional Ornstein--Uhlenbeck processes in three regimes: short-range (Hurst $1/4<H<1/2$), semimartingale ($H=1/2$), and long-range ($H>1/2$); in all three, the block-signature covariance is summable, so the fluctuation term decays at the same rate as in the independent-block case, even under long memory at $H>1/2$. Monte Carlo experiments show empirical slopes steeper than the guaranteed upper-bound rates.

URL PDF HTML ☆

赞 0 踩 0

2604.02336 2026-06-19 math.FA math.ST stat.TH 版本更新

The Shift Operator Calculus for Stationary Time Series Analysis

平稳时间序列分析的移位算子演算

Anand Ganesh, Babhrubahan Bose, Anand Rajagopalan

AI总结本文为平稳时间序列建模建立了严格的移位算子演算，证明了不同函数族下转移函数算子的存在性和等距性，并统一了平稳过程可逆性与转移函数算子可逆性的概念。

Comments 7 pages

2602.04550 2026-06-19 quant-ph math.ST stat.TH 版本更新

Locally Gentle State Certification for High Dimensional Quantum Systems

高维量子系统的局部温和态认证

Cristina Butucea, Jan Johannes, Henning Stein

AI总结研究局部温和量子态认证中非破坏性测量的信息代价，推导出样本复杂度为Θ(d³/(ε²α²))，揭示了α-温和性惩罚与希尔伯特空间维度d的线性关系。

详情

AI中文摘要

量子统计推断的标准方法依赖于引起波函数坍缩的测量，从而消耗量子态以提取信息。在本工作中，我们研究了\emph{局部温和}量子态认证的基本极限，其中学习算法被限制在迹范数下最多扰动态$\alpha$，从而允许样本重用。我们分析了区分未知态$\rho$等于参考态$\rho_0$还是与其$\epsilon$-远的问题。我们推导了该问题的极小极大样本复杂度，量化了非破坏性测量的信息代价。具体地，通过构造显式测量算子，我们证明了$\alpha$-温和性约束施加了$\frac{d}{\alpha^2}$的样本量惩罚，导致总样本复杂度为$n = \Theta(\frac{d^3}{\epsilon^2 \alpha^2})$。我们的结果阐明了信息提取与态扰动之间的权衡，并突出了量子学习中物理测量约束与隐私机制之间的深层联系。关键地，我们发现施加$\alpha$-温和性所导致的样本量惩罚与希尔伯特空间维度$d$呈线性关系，而非高维私有估计中典型的参数数量$d^2-1$。

英文摘要

Standard approaches to quantum statistical inference rely on measurements that induce a collapse of the wave function, effectively consuming the quantum state to extract information. In this work, we investigate the fundamental limits of \emph{locally-gentle} quantum state certification, where the learning algorithm is constrained to perturb the state by at most $α$ in trace norm, thereby allowing for the reuse of samples. We analyze the hypothesis testing problem of distinguishing whether an unknown state $ρ$ is equal to a reference $ρ_0$ or $ε$-far from it. We derive the minimax sample complexity for this problem, quantifying the information-theoretic price of non-destructive measurements. Specifically, by constructing explicit measurement operators, we show that the constraint of $α$-gentleness imposes a sample size penalty of $\frac{d}{α^2}$, yielding a total sample complexity of $n = Θ(\frac{d^3}{ε^2 α^2})$. Our results clarify the trade-off between information extraction and state disturbance, and highlight deep connections between physical measurement constraints and privacy mechanisms in quantum learning. Crucially, we find that the sample size penalty incurred by enforcing $α$-gentleness scales linearly with the Hilbert-space dimension $d$ rather than the number of parameters $d^2-1$ typical for high-dimensional private estimation.

URL PDF HTML ☆

赞 0 踩 0

2504.09564 2026-06-19 math.ST stat.TH 版本更新

The weak-feature-impact effect on the NPMLE in monotone binary regression

单调二元回归中弱特征影响对NPMLE的影响

Dario Kieffer, Angelika Rohde

AI总结研究单调二元回归中非参数最大似然估计在弱特征关系下的极限分布，发现一种新的分布连续插值于两个极端情况，并改进了小样本近似。

Comments Added Theorem 3.3 and several visualizations

详情

AI中文摘要

统计文献提供了单调二元回归中非参数最大似然估计（NPMLE）在两种极端情况下的逐点极限分布：如果特征-标签关系严格单调且足够光滑，则以立方根$n$速率收敛，具有缩放Chernoff型极限分布；如果底层关系平坦，则以参数$\sqrt{n}$速率收敛。本文提供了NPMLE分布演变的完整图景，揭示了一种新的极限分布，在弱特征-标签关系的情况下，为小样本提供了显著更好的分布近似。该分布被证明连续插值于两个极端情况之间。确定该分布的创新方法是将其作为新引入的弱特征影响三角阵列中NPMLE的极限，针对特定的参数-样本量配置。此外，在适当缩放的$L^{1}$误差中同样观察到弱特征影响场景下的相变。作为副产品，获得了平坦回归函数下的极限分布，这是先前未知的。证明开发了一种全新的策略，特别是不基于开关关系。伴随这些结果的新型局部极小极大下界。

英文摘要

Statistical literature provides pointwise limiting distributions of the nonparametric maximum likelihood estimator (NPMLE) in monotone binary regression for the two extremal cases: If the feature-label relation is strictly monotone and sufficiently smooth, it converges at a cube-root-$n$ rate with scaled Chernoff-type limiting distribution, and it converges at the parametric $\sqrt{n}$-rate if the underlying relation is flat. In this article, we provide the complete picture of the distributional metamorphosis of the NPMLE, revealing a new limiting distribution which provides a significantly better distributional approximation for small samples in case of a weak feature-label relationship. It is shown to continuously interpolate between the two extremal cases. The innovative way to determine this distribution is to generate it as a limit of the NPMLE in the newly introduced weak-feature-impact triangular array for a particular parameter-sample-size constellation. Moreover, the phase transition is likewise observed for the suitably rescaled $L^{1}$-error in this weak-feature-impact scenario. As a by-product, its limiting distribution for flat regression functions is obtained, which was unknown before. The proof develops a completely new strategy, notably not based on the switch relation. A novel type of local minimax lower bounds accompanies these results.

URL PDF HTML ☆

赞 0 踩 0

1. 统计理论与方法 3 篇

A Necessary and Sufficient Condition for Size Controllability of Heteroskedasticity Robust Test Statistics

Interpolated Quantile Estimation: A Unified Framework Bridging Quantiles and the Mean

Benign overfitting beyond prediction: The ordinary least squares interpolator

2. 贝叶斯统计与概率建模 1 篇

Weighted Bayesian Conformal Prediction

3. 因果推断与实验设计 5 篇

Invariant quantile regression for heterogeneous environments

A causal framework for evaluating the total effect of strategies aiming to expand screening and to improve outcomes

A Practical Introduction to Regression-based Causal Inference in Meteorology (I): All confounders measured

A Practical Introduction to Regression-based Causal Inference in Meteorology (II): Unmeasured confounders

Causal Inference with High-dimensional Discrete Covariates

4. 时间序列与空间统计 1 篇

Practical Forecasting of Environmental Maps: A Functional Data Approach

5. 计算统计与MCMC 7 篇

Folded Transport MCMC: Eliminating Label Switching by Sampling on a Fundamental Domain

Q-approximation of operating characteristics of clinical trial designs

Probabilistic function-on-function nonlinear autoregressive model for emulation and reliability analysis of stochastic dynamical systems

Robust, partially alive particle Metropolis-Hastings via the Frankenfilter

Alternating Direction Method of Multipliers for Nonlinear Matrix Decompositions

Flow Matching for Efficient and Scalable Data Assimilation

Towards practical PDMP sampling: Metropolis adjustments, locally adaptive step-sizes, and NUTS-based time lengths

6. 机器学习统计基础 8 篇

Information Theory and Statistical Learning

Attention-based PCA

Learning to Emulate Chaos: Adversarial Optimal Transport Regularization

Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization

Stabilizing Bandits using Regularization: Precise Regret and A Quantitative Central Limit Theorem

Meta Flow Maps enable scalable reward alignment

Phase Transition for Stochastic Block Model with more than $\sqrt{n}$ Communities

Group-Sparse Matrix Factorization for Transfer Learning of Word Embeddings

7. 生物统计与医学统计 1 篇

Flexible aggregation of compositional predictors with shared effects for microbiome association analysis

8. 经济金融与社会科学统计 6 篇

A Model-Agnostic Bootstrap for Macro-Level Claims Reserving Under the Conditioning Principle

The Negative Binomial Chain-Ladder: A Full Likelihood Model for Claim Count Reserving

Carbon cost pass-through rate in power system: evidence from Italy under the EU ETS

Hippocratic Utility and Status Quo Bias

Swiss-system chess tournaments and unfairness

Statistical Inference in Large Multi-way Networks

9. 数据隐私、稳健性与公平性 1 篇

Environment-Adaptive Covariate Selection: Learning When to Use Spurious Correlations for Out-of-Distribution Prediction

10. 数据集、软件与应用 2 篇

Understanding Pedagogical Content Knowledge of Introductory Data Science Instructors: An Inaugural Framework

Global Ease of Living Index: a machine learning framework for longitudinal analysis of major economies

11. 其他/综合统计 4 篇

Finite-Sample Bounds for Expected Signature Estimation under Weak Dependence

The Shift Operator Calculus for Stationary Time Series Analysis

Locally Gentle State Certification for High Dimensional Quantum Systems

The weak-feature-impact effect on the NPMLE in monotone binary regression