URL PDF HTML ☆

赞 0 踩 0

2605.16219 2026-05-18 cs.LG stat.ML

The Privacy Price of Tail-Risk Learning: Effective Tail Sample Size in Differentially Private CVaR Optimization

尾风险学习的隐私代价：差分隐私CVaR优化中的有效尾样本量

El Mustapha Mansouri

AI总结研究揭示差分隐私对CVaR学习有效样本量的影响，提出隐私代价分解方法，推导出标量估计和有限类别的学习速率，并指出隐私学习在有效尾样本量上的核心挑战。

Comments 34 pages, 3 figures, 2 tables

详情

AI中文摘要

差分隐私改变了CVaR学习的有效样本量。对于尾质量τ，隐私相关的样本量不是n，而是nτ；等价地，有效的隐私尾样本量是εnτ。私有CVaR超额风险分解为普通的尾风险统计误差和隐私代价。这种分解在标量估计和有限类别的情况下是完整的：标量估计的速率是Θ(B min{1,(nτ)^{-1/2}+(εnτ)^{-1}})，有限类别的大小为M时的速率是Θ(B min{1,√(log(2M)/(nτ))+log(2M)/(εnτ)} )。这些完整的速率在纯DP下成立，其下界可扩展到近似DP的 stated small-δ 范围内。对于凸Lipschitz学习，模块化上界和下界减少显示，CVaR特定的隐私项必然以1/(εnτ)的比例增长，其维度依赖性继承自私有随机凸优化。这些结果识别出在私有CVaR学习中，普通私有学习在Θ(nτ)信息量的尾记录上的核心挑战。

英文摘要

Differential privacy changes the effective sample size governing CVaR learning. For tail mass $τ$, the privacy-relevant sample size is not $n$, but $nτ$; equivalently, the effective private tail sample size is $εnτ$. Private CVaR excess risk decomposes into ordinary tail-risk statistical error and a privacy price. This decomposition is complete for scalar estimation and finite classes: scalar estimation has rate $Θ(B \min\{1,(nτ)^{-1/2}+(εnτ)^{-1}\})$, and finite classes of size $M$ have rate $Θ(B \min\{1,\sqrt{\log(2M)/(nτ)}+\log(2M)/(εnτ)\})$. These complete rates hold under pure DP, and their lower bounds extend to approximate DP in the stated small-$δ$ regimes. For convex Lipschitz learning, modular upper and lower reductions show that the CVaR-specific privacy term necessarily scales as $1/(εnτ)$, with dimension dependence inherited from private stochastic convex optimization. Together, these results identify ordinary private learning on $Θ(nτ)$ informative tail records as the canonical hard subproblem inside private CVaR learning.

URL PDF HTML ☆

赞 0 踩 0

2605.16208 2026-05-18 stat.ML cs.LG

熵跨桥梁：用于流和薛定谔采样的条件-边缘离散化

Bruno Trentini, Dejan Stancevic, Michael M. Bronstein, Alexander Tong, Luca Ambrogioni

AI总结本文提出一种基于熵率的目标，用于桥-aware的离散化，通过分离端点条件桥几何和边缘流演变，提升低预算下的高维桥和流采样性能。

详情

AI中文摘要

对于固定流基生成模型，在有限的推断预算下，样本质量强烈依赖于采样器在有限函数评估上的分配。流匹配和薛定谔桥梁定义了概率路径，但其推断网格通常为启发式或继承自一端扩散。本文推导出一种条件-边缘熵率目标用于桥-aware离散化，分离端点条件桥几何与边缘流演变，并以此构建无训练的熵推断时间调度器。对于高斯布朗桥，该速率具有闭式解且呈U型，推动边界密集的非均匀网格。在训练的二维桥/流模型上，估计的轮廓恢复预测形状，并在10步ODE-Heun MMD中比线性提升18.1%，在相同低NFE扫描中，SDE-Heun改进22.7%。在EDM/CIFAR-10上，熵时间离散化在五步FID测试中表现最佳（186.3±4.0 vs 200.5±2.9线性和238.0±5.3余弦）。在AlphaFlow蛋白质生成中，熵条件-边缘调度在CAMEO22和ATLAS基准上低NFE情况下表现优势。这些结果支持熵率调度作为高维桥和流采样的实用低预算分配信号。

英文摘要

For a fixed flow-based generative model under a small inference budget, sample quality can depend strongly on where the sampler spends its few function evaluations. Flow matching and Schrödinger bridges define probability paths, yet their inference grids are usually heuristic or inherited from one-endpoint diffusion. We derive a conditional-marginal entropy-rate objective for bridge-aware discretization, separating endpoint-conditioned bridge geometry from marginal flow evolution, and use it to build a training-free entropic inference-time scheduler from first principles. For Gaussian Brownian bridges this rate is closed-form and U-shaped, motivating boundary-heavy nonuniform grids. On trained two-dimensional bridge/flow models, the estimated profile recovers the predicted shape and improves 10-step ODE-Heun MMD over linear by 18.1%, with a paired 22.7% SDE-Heun improvement in the same low-NFE sweep. On EDM/CIFAR-10, the entropic time-discretization gives the best tested five-step FID (186.3 \pm 4.0 versus 200.5 \pm 2.9 for linear and 238.0 \pm 5.3 for cosine). On AlphaFlow protein generation, entropic conditional-marginal (cond-marg) scheduling shows advantage in low-NFE regimes on both CAMEO22 and ATLAS benchmarks. These results support entropy-rate scheduling as a practical low-budget allocation signal for high-dimensional bridge and flow samplers.

URL PDF HTML ☆

赞 0 踩 0

2605.16078 2026-05-18 stat.ML cs.LG

A numerical study into neural network surrogate model performance for uncertainty propagation

基于神经网络代理模型的不确定性传播性能数值研究

Noah Wade, Kirubel Teferra

AI总结本文研究神经网络代理模型在捕捉整个概率空间中解场完整分布的能力，尤其关注分布尾部表现，通过热传导方程对比了全连接网络与深度算子网络的性能。

详情

DOI: 10.1061/JENMDT/EMENG-8978

AI中文摘要

神经网络代理模型已发展为一种有前景的方法，用于建模物理建模中遇到的各种边界值问题的解场。随机问题特别受到关注，因为传统数值求解器在参数分析中可以显著减少昂贵的正向模型重复评估。然而，文献中的许多研究主要关注神经网络代理模型表示确定性样本或均值场解的能力，而忽视了代理模型在分布尾部的性能。本文详细研究了神经网络代理模型捕捉整个概率空间中解场完整分布的能力，尤其强调分布尾部的表现。作为典型问题，热传导方程具有高度随机的源项，导致热解场出现极端变化。通过比较经典前馈全连接网络和深度算子网络架构，使用数据驱动和物理指导的损失函数进行比较。结果表明，最坏情况预测误差比均值场误差大一个数量级，突显了异常样本的重要性。与极端样本相关的较大误差源于网络必须超出训练数据范围进行外推。本文提出了一种识别这些样本的方法，并讨论了处理其误差的潜在方法。在考虑的模型中，使用弱形式残差损失训练的全连接神经网络在处理这些外推输入方面表现最佳，实现了对数值生成数据集的最高预测精度。

英文摘要

Neural network surrogate models have emerged as a promising approach to model solution fields for a wide variety of boundary value problems encountered in physical modeling. Stochastic problems represent an area of particularly high interest because of the potential to significantly reduce the repeated evaluation of expensive forward models via traditional numerical solvers when conducting parametric analysis. However, many studies found in the literature primarily focus on the ability of neural network surrogate models to represent deterministic samples or mean field solutions and largely overlook surrogate model performance at the tails of the distribution. The present study examines in detail the ability of neural network surrogate models to capture the full distribution of solution fields over the entire probability space, while emphasis is placed at the tails of the distribution. Serving as a canonical problem is the heat conduction equation with a highly stochastic source term, inducing extremely large variation in the thermal solution field. Comparisons are made between a classic feed-forward fully connected network and a Deep Operator Network architecture, using both data-driven and physics-informed loss functions. Results show that the worst-case prediction errors are an order of magnitude larger than the mean field error, highlighting the importance of the outlier samples. The large errors associated with extreme samples result from the networks having to extrapolate beyond the bounds of the training data. A method for identifying these samples is presented along with a discussion of potential approaches to account of their errors. Among the models considered, the fully connected neural network trained using a weak form residual loss performs best in handling these extrapolated inputs, achieving the highest prediction accuracy for the numerically produced datasets.

URL PDF HTML ☆

赞 0 踩 0

2605.16075 2026-05-18 stat.ME stat.CO

可解释AI还不够！重新思考算法可争议性

Timo Freiesleben, Kristof Meding, Gunnar König

AI总结本文探讨了算法可争议性的重要性，提出了一种新的定义，指出传统XAI方法不足以挑战算法决策，提出了三种证据类型以支持决策逆转。

详情

AI中文摘要

机器学习系统日益影响个人生活决策，如贷款审批、招聘和作弊检测，引发如何应对这些系统不利决定的问题。尽管可解释AI（XAI）主要关注算法可逆性，但算法可争议性问题却较少受到关注。本文提出可争议性作为算法问题的正式定义，强调决策可能错误，并识别三种证据类型以挑战和推翻决策。

英文摘要

Machine learning systems increasingly make life-changing decisions about individuals, such as loan approvals, hiring, and cheating detection, raising a pressing question: how can individuals respond to negative decisions made by these opaque systems? While explainable artificial intelligence (XAI) has largely focused on algorithmic recourse -- helping individuals change their features to obtain a desired outcome -- the parallel problem of algorithmic contestability -- helping individuals review and correct erroneous algorithmic decisions -- has received far less attention, despite its central ethical and legal importance. We trace this neglect to the absence of clear formal definitions and a systematic operationalization of contestability as an algorithmic problem. To address it, we propose an operational definition of contestability as a natural complement to recourse: contestability starts from the presumption that a decision may be incorrect and focuses on identifying evidence to challenge and potentially overturn it, whereas recourse assumes the decision is valid and instead provides pathways for changing it. We show that standard XAI explanations, such as counterfactuals, LIME, or Anchors, even when combined with human intuitions about decision continuity or monotonicity, reveal only errors in the neighborhood of the individual, but provide insufficient grounds for overturning the decision at hand. Going thus beyond traditional XAI, we identify three types of evidence warranting reversal according to the decision maker's own ethical standards: predictive multiplicity, incorrect feature values, and neglected overruling evidence. We argue that these render decisions normatively indefensible and thus successfully contestable. Finally, we analyze how existing EU legislation connects to our framework and argue that individuals already hold some legal rights to these forms of evidence.

URL PDF HTML ☆

赞 0 踩 0

2605.16033 2026-05-18 math.ST stat.TH

Tests for the mean of high-dimensional data

高维数据均值检验

Dietmar Ferger

AI总结本文提出基于V_n统计量的高维数据均值检验方法，无需协方差矩阵求逆，通过嵌入Hilbert空间l2推导渐进行为，并证明Bootstrap近似在无稀疏性假设下具有渐近有效性。

Comments 16 pages

2605.16027 2026-05-18 math.ST stat.TH

Nearest-Neighbour Matching on Unbounded Supports and Covariate Shift Transfer

无界支持上的最近邻匹配与协变量转移

Simon Viel

AI总结本文研究了在无界支持上最近邻匹配的收敛性，提出无需假设协变量支持集的紧凑性，而是通过源与目标分布之间的转移性度量来保证估计效率。

详情

AI中文摘要

多变量函数在缺失标签下的期望在迁移学习和平均治疗效应等领域中经常出现。尽管基于最近邻匹配的非参数估计器在此背景下被广泛使用，但现有文献通常假设协变量生活在$\R^d$的某些良好形状的紧致子集内，且密度远离零。本文证明在最小的协变量支持集假设下也能实现通常的收敛速率。这些假设被替换为对源和目标分布的条件，其中包括衡量两个概率测度之间转移性的度量。我们证明这些条件是通用的，可以应用于支持在流形上的分布，并允许目标分布具有比源分布更重的尾部。我们还证明这种对转移性的控制对于任何估计器实现良好的收敛速率都是必需的。最后，将我们的结果应用于治疗效应的估计，我们能够放松赋值概率必须远离零和一的假设。

英文摘要

Expectations of multivariate functions with missing labels occur in various fields such as transfer learning and average treatment effects. Although non-parametric estimators based on nearest-neighbour matching are frequently used in this context, the existing literature assumes that the covariates live in some well-shaped compact subset of $\R^d$, with densities that are bounded away from zero. In this paper, we show that the usual rates of convergence can be achieved with minimal assumptions on the covariate supports. These assumptions are replaced with conditions on the source and target distributions, among which a measure of the tranferability between the two probability measures. We show that these conditions are general, can be applied to distributions supported on manifolds, and allow the target distribution to have a heavier tail than the source distribution. We also show that this control of the transferability is needed for any estimator to achieve good rates of convergence. Finally, applying our results to the estimation of treatment effects, we could relax the assumption that the assignment probabilities had to be bounded away from zero and one.

URL PDF HTML ☆

赞 0 踩 0

2605.15996 2026-05-18 stat.ML cs.LG math.ST stat.TH

Testing properties of trees in graphical models with covariance queries

利用协方差查询测试图模型中树的性质

Sofiya Burova, Francisco Calvillo, Gábor Lugosi, Piotr Zwiernik

AI总结本文研究高维图模型下树结构的性质测试，设计了基于子二次查询数量的随机测试方法，针对叶子数、最大度、典型距离和直径等属性提出显式查询复杂度界限。

2605.15966 2026-05-18 econ.EM stat.ME

Quasi-Bayesian Local Projection Instrumental-Variables Method: Application to Renewable Energy and Electricity Prices

准贝叶斯局部投影工具变量方法：应用于可再生能源和电力价格

Masahiro Tanaka

AI总结本文提出一种准贝叶斯方法用于局部投影工具变量估计，通过广义矩方法构建准后验，并采用粗糙度惩罚先验平滑不同时间跨度的冲击响应。方法保留传统LP-IV方法的一阶特性，增强有限样本稳定性，并允许联合推断。仿真显示该正则化方法在中长期预测中降低均方误差。

Comments This paper supersedes a working paper circulated under the title "Quasi-Bayesian Local Projections: Simultaneous Inference and Extension to the Instrumental Variable Method" (arXiv:2503.20249)

2605.15943 2026-05-18 math.ST stat.ML stat.TH

Node-private community estimation in stochastic block models: Tractable algorithms and lower bounds

节点私有社区估计在随机块模型中：可计算算法和下界

Laurentiu Marchis, Ethan D'souza, Tomáš Flídr, Po-Ling Loh

AI总结本文研究了在固定社区数的随机块模型中社区恢复问题，提出在节点差分隐私约束下基于谱聚类的可计算算法及下界，通过隐私保护的PCA、凸优化等方法提升社区估计一致性。

Comments 78 pages

详情

AI中文摘要

我们研究了在固定社区数的随机块模型中社区恢复问题，具有一个 twists：我们寻找在图结构节点层面变化下稳定的算法，正式定义为差分隐私约束。我们开发的算法基于谱聚类，在社区恢复流程中引入隐私保护的邻接矩阵、私有PCA、私有凸优化、私有低秩矩阵估计和私有近似子空间估计。现有隐私算法的直接应用导致隐私参数ε迅速增加以确保在节点差分隐私下的估计一致性，与边隐私更简单的设置形成对比。为缓解这些问题，我们开发了基于（1）指数机制采样与Lipschitz扩展和（2）构建从无向图空间到有限度图空间的光滑投影的一般框架的新型算法。重要的是，我们开发的所有方法在多项式时间内可计算。我们还开发了在节点隐私下实现一致社区估计所需的ε增长速率的新型下界。技术上，本文突显了在非标准缩放ε→∞下分析隐私算法的复杂性，并提出了一些解决方案。我们还提供了一个新的HGR最大相关性在PAC学习准确性放大中的应用，这可能具有独立兴趣。

英文摘要

We study the classical problem of community recovery in stochastic block models with a fixed number of communities, with a twist: We seek algorithms that are stable with respect to node-wise changes in the graph structure, formally defined as a differential privacy constraint. The algorithms we develop are based on spectral clustering, where we introduce privacy to the community recovery pipeline in the form of directly privatizing the adjacency matrix; private PCA; private convex optimization; private low-rank matrix estimation; and private approximate subspace estimation. Straightforward applications of existing private algorithms lead to a rapid increase in the privacy parameter $ε$ in order to ensure consistent estimation under node differential privacy, in contrast with the simpler setting of edge privacy. To alleviate these issues, we develop novel algorithms based on (1) sampling from an exponential mechanism with a Lipschitz extension and (2) a general framework for constructing smooth projections from the space of undirected graphs to the space of bounded-degree graphs, which can then be combined with various edge-private algorithms. Importantly, the methods we develop are all computable in polynomial-time as a function of the number of nodes in the graph. We also develop novel lower bounds on the growth rate of $ε$ required in order to achieve consistent community estimation under node privacy. On a technical note, our paper highlights the complications that arise when analyzing private algorithms under the non-standard scaling $ε\rightarrow \infty$ and proposes some solutions. We also provide a novel application of the HGR maximal correlation from information theory in the context of accuracy amplification in PAC learning, which may be of independent interest.

URL PDF HTML ☆

赞 0 踩 0

2605.15920 2026-05-18 stat.ML cs.LG

Unsupervised Domain Shift Detection with Interpretable Subspace Attribution

无监督领域偏移检测与可解释子空间归因

Sebastian Springer, Alessandro Laio

AI总结本文提出一种无监督领域偏移检测工具，通过高维特征空间中的局部密度异常检测，识别偏移特征子空间，从而可解释偏移来源，并提供补偿协议。

详情

AI中文摘要

我们开发了一种检测领域偏移的工具，即数据集概率分布的细微差异。我们通过检测高维特征空间中的局部密度异常来识别这些偏移。如果存在异常，则确定异常最显著的特征子空间。这使我们能够追溯偏移到一小部分特征，使其可解释。此外，我们提供了一种补偿领域偏移的协议，通过从两个未标记数据集中提取无明显残余分布差异的样本子集。我们在受控的20维基准上验证了该框架，恢复了广义和局部偏移及其支持的特征子空间。然后将其应用于由782个特征表示的健康心电图（ECG）记录。在年龄和性别匹配的队列比较中，方法检测到设备引起的偏移，提取了富含不平衡设备组件的代表性子集，并识别了与获取对比相关的ECG特征。这些结果表明，密度偏移检测和子空间归因提供了一种实用框架，可在下游建模之前揭示隐藏的队列偏见。

英文摘要

We developed a tool for detecting domain shifts, namely subtle differences in the probability distributions of datasets. We identify these shifts using an algorithm designed to detect localised density anomalies in high-dimensional feature spaces. If an anomaly is present, we then identify the feature subspace in which the anomaly is most pronounced. This allows us to trace the domain shift to a small set of features, making the shift interpretable. Moreover, we provide a protocol for compensating domain shifts by extracting, from two unlabelled datasets, subsets of samples with no detectable residual distributional difference. We validate the framework on controlled 20-dimensional benchmarks with known ground truth, recovering both broad and localized shifts together with their supporting feature subspaces. We then apply it to healthy electrocardiogram (ECG) recordings represented by 782 features. In age- and sex-matched cohort comparisons differing in measurement-device composition, the method detects device-induced shifts, extracts representative subsets enriched in the imbalanced device components, and identifies ECG features associated with the acquisition contrast. These results suggest that density-shift detection and subspace attribution provide a practical framework for uncovering hidden cohort biases before downstream modelling.

URL PDF HTML ☆

赞 0 踩 0

2605.15911 2026-05-18 stat.ME

Statistical Inference for Smoothed Support Vector Machines in High Dimensions: From Offline to Online Data

高维环境下平滑支持向量机的统计推断：从离线到在线数据

Shuya Zhou, Junwen Xia, Jingxiao Zhang

AI总结本文提出一种统一的推断框架，通过离线和在线设置中的平滑技术消除偏差，实现有效的统计推断和计算效率提升。

详情

AI中文摘要

高维分类问题常依赖于Lasso惩罚的线性支持向量机(SVMs)。然而，该模型中hinge损失和Lasso惩罚的双重非光滑性使统计推断变得困难，并阻碍了计算效率。本文提出了一种统一的推断框架，适用于离线和在线设置。在离线情况下，通过将hinge损失进行卷积平滑，我们构建了一个去偏差估计器，从而建立有效的置信区间。对于在线流数据，我们开发了一个实时估计器和推断程序，仅依赖于历史数据的汇总统计量。理论上，我们为离线和在线去偏差估计器的渐近正态性提供了严格的证明。模拟研究和实际数据应用表明，我们的方法实现了有效的统计推断和计算效率的提升。

英文摘要

High-dimensional classification problems often rely on the Lasso-penalized linear Support Vector Machines (SVMs). However, the double non-smoothness induced by the hinge loss and Lasso penalty in this model makes statistical inference challenging and impedes computational efficiency. In this paper, we propose a unified inference framework in both offline and online settings. In the offline case, by applying a convolution smoothing technique to the hinge loss, we construct a debiased estimator that eliminates the shrinkage bias, thereby building a valid confidence interval. For online streaming data, we develop a real-time estimator and inference procedure that relies only on summary statistics of historical data. Theoretically, we provide rigorous proofs for the asymptotic normality of our offline and online debiased estimators. Simulation studies and real data applications demonstrate that our methods achieve valid statistical inference and improved computational efficiency.

URL PDF HTML ☆

赞 0 踩 0

2605.15907 2026-05-18 math.ST stat.TH

Edge-indexed network time series with graph Ornstein-Uhlenbeck dynamics

基于图奥尔内-乌尔岑动态的边索引网络时间序列

Jiaming Chen, Almut E. D. Veraart

AI总结本文提出了一种基于图奥尔内-乌尔岑动态的边索引网络时间序列模型，通过最大似然框架估计参数并分析其渐近性质，展示了在高频金融数据中的应用价值。

详情

AI中文摘要

我们引入了一类由莱维驱动的图奥尔内-乌尔岑（grOU）模型，用于边索引网络时间序列。所提出的框架将通用网络自回归（GNAR）过程扩展到连续时间，并将最初为节点索引过程设计的图奥尔内-乌尔岑动态适应到边索引设置。该模型能够容纳一般的莱维噪声，因此能够捕捉布朗运动和跳跃行为。我们证明模型参数可通过最大似然框架估计，并推导了估计量的渐近性质。通过模拟研究检验了该方法的有限样本性能，并通过实际应用到高频金融数据中展示了其实用性。结果表明，相对于标准基准，grOU模型在边索引网络时间序列中提高了预测精度并减少了计算时间，同时通过基于网络的参数化保持了鲁棒性。

英文摘要

We introduce a class of Lévy-driven graph Ornstein-Uhlenbeck (grOU) models for edge-indexed network time series. The proposed framework extends generalized network autoregressive (GNAR) processes for edge-indexed network time series to continuous time and adapts graph Ornstein-Uhlenbeck dynamics, originally developed for node-indexed processes, to the edge-indexed setting. The model accommodates general Lévy noise and therefore captures both Brownian and jump behavior. We show that the model parameters can be estimated via a maximum-likelihood framework and derive the asymptotic properties of the estimator. We examine the finite-sample performance of the methodology through simulation studies and illustrate its practical relevance in an empirical application to high-frequency financial data. The results indicate that grOU models for edge-indexed network time series improve forecasting accuracy and reduce computational time relative to standard benchmarks while maintaining robustness through their network-based parametrization.

URL PDF HTML ☆

赞 0 踩 0

2605.15896 2026-05-18 stat.ME stat.AP

A Model-Agnostic Bootstrap for Macro-Level Claims Reserving Under the Conditioning Principle

基于条件原理的宏观层面赔款准备金模型无关自助法

Robin Van Oirbeek, Tim Verdonck

AI总结本文提出一种满足条件原理的自助法，用于宏观层面赔款准备金估计，通过Dirichlet-Gamma层次结构实现精确校准，改进了现有自助法的覆盖误差问题。

Comments 23 pages

详情

AI中文摘要

正确的推断对象是条件预测分布p(R|D,θ̂)，其中D是观察到的三角形保持固定。我们称之为条件原理。所有现有自助法违反这一原理，通过在预测循环中对D的函数进行重采样，产生O(1)的覆盖误差，随着三角形增大不消失。Dirichlet-Gamma层次结构允许一种满足该原理的自助法：S^{IBNP}_i = X^{obs}_i (1-W_i)/W_i，其中W_i ~ Beta(cF_{I-i}, c(1-F_{I-i}))直接从其预测分布中采样。仅模拟分配比例W_i；观察到的三角形保持固定。因此继承了任何开发比例方法（链式梯度、Bornhuetter-Ferguson、Cape Cod或其他）的校准，使其模型无关。覆盖缺陷为O(I^{-1/2})，与开发时期数量无关。在复合泊松数据生成过程中，该自助法对于每个F_{I-i} ∈ (0,1)是保守的：预测标准差分析上超过真实值的因子为1/√F_{I-i}。ODP自助法通过两种相反方向的机制违反该原理：重新估计在ODP DGP下膨胀自助方差，而缺失事故年脆弱性在脆弱性DGP下缩小它。结果覆盖差异为Ω(1)，无论I如何，为Meyers(2015)文档的跨投资组合误校准异质性提供了结构解释。链式梯度、Bornhuetter-Ferguson和Cape Cod在稀疏、信息丰富和池化先验下分别作为可信度估计量，计数和金额具有相同结构。集中程度c作为诊断：ĉ < 30表明开发非平稳。

英文摘要

The correct inferential object in claims reserving is the conditional predictive distribution $p(R \mid \mathcal{D}, \hatθ)$, where $\mathcal{D}$ is the observed triangle held fixed. We refer to this as the conditioning principle. All existing bootstraps violate it by resampling functions of $\mathcal{D}$ inside the predictive loop, producing an $O(1)$ coverage error that does not vanish as the triangle grows. The Dirichlet-Gamma hierarchy admits a bootstrap that satisfies the principle exactly: $S^{IBNP}_i = X^{obs}_i (1-W_i)/W_i$ with $W_i \sim \mathrm{Beta}(c\hat{F}_{I-i}, c(1-\hat{F}_{I-i}))$ sampled directly from its predictive distribution. Only the allocation proportion $W_i$ is simulated; the observed triangle is held fixed. It thus inherits calibration from any development-proportion method (Chain-Ladder, Bornhuetter-Ferguson, Cape Cod, or other), making it model-agnostic. The coverage deficit is $O(I^{-1/2})$, independent of the number of development periods. Under compound Poisson data-generating processes the bootstrap is conservative for every $F_{I-i} \in (0,1)$: the predictive standard deviation analytically exceeds the true value by the factor $1/\sqrt{F_{I-i}}$. The ODP bootstrap violates the principle through two mechanisms in opposite directions: re-estimation inflates bootstrap variance under the ODP DGP, while missing accident-year frailty deflates it under frailty DGPs. The resulting coverage discrepancy is $Ω(1)$ regardless of $I$, providing a structural explanation for the cross-portfolio miscalibration heterogeneity documented by Meyers (2015). Chain-Ladder, Bornhuetter-Ferguson and Cape Cod emerge as credibility estimators under diffuse, informative and pooling priors respectively, with identical structure for counts and amounts. The concentration $c$ serves as a diagnostic: $\hat{c} < 30$ signals non-stationary development.

URL PDF HTML ☆

赞 0 踩 0

2605.15859 2026-05-18 cs.DS cs.LG math.ST stat.ML stat.TH

组件和系统层面的主动冗余分配策略

Bidhan Modok, Shovan Chowdhury, Amarjit Kundu

AI总结本文研究了在可能依赖和相同组件构成的相干系统中，非匹配主动冗余（备用）的分配策略，以提高系统可靠性。通过皮尔逊函数建模组件依赖性，并推导出两种异质主动冗余在组件或系统层面的最优分配条件。

2605.15822 2026-05-18 cs.LG stat.ML

Intrinsic Wasserstein Rates for Score-Based Generative Models on Smooth Manifolds

基于光滑流形的分数布朗运动生成模型的内在Wasserstein速率

Guoji Fu, Taiji Suzuki, Wee Sun Lee, Atsushi Nitanda

AI总结本文研究了在光滑流形上基于分数布朗运动的生成模型的内在Wasserstein速率，证明了在满足一定条件的流形上，变差保持的SGM估计器能达到特定的样本指数，且分析了分数近似在不同噪声 regime 下的表现。

2605.15814 2026-05-18 math.ST stat.TH

Goodness-of-Fit Testing for Point Processes in Large Populations

大规模群体中点过程的拟合优度检验

Sami Umut Can, Estate V. Khmaladze, Roger J. A. Laeven

AI总结本文提出一种新方法，通过构造单位变换使自然参数检验过程弱收敛于标准目标过程，实现大规模群体中点过程参数族的渐近分布自由拟合优度检验。

详情

AI中文摘要

假设我们有一个观察到的路径，该路径来自一个大规模群体中计数事件发生的点过程。基于观察到的路径，我们希望检验原假设，即点过程的条件强度属于特定的参数族。我们提出了一种新的进行此类拟合优度检验的方法。想法是构造一个自然参数检验过程的单位变换，使其弱收敛于一个"标准"目标过程，该过程与原假设下假设的特定参数形式无关。这种变换因此为参数点过程的渐近分布自由拟合优度检验铺平了道路。我们通过Aalen型生存过程的蒙特卡洛模拟（无和有删失）、混合治愈模型以及软件可靠性模型，展示了我们方法在有限样本性能上的良好表现，并通过观察到的人类寿命以及真实软件故障示例，展示了其适用性。

英文摘要

Suppose we have an observed path from a point process counting event occurrences in a large population. Based on the observed path, we would like to test the null hypothesis that the conditional intensity of the point process belongs to a particular parametric family. We propose a novel approach to conducting such goodness-of-fit tests. The idea is to construct a unitary transformation of a natural parametric testing process such that it converges weakly to a ``standard'' target process, independent of the particular parametric form assumed under the null hypothesis. This transformation therefore paves the way for asymptotically distribution-free goodness-of-fit testing of parametric point processes. We demonstrate the good finite-sample performance of our approach through Monte Carlo simulations of Aalen-type survival processes, without and with censoring, mixture cure models, and software reliability models, and we illustrate its applicability with observed human lifetimes as well as real software failures.

URL PDF HTML ☆

赞 0 踩 0

2605.15811 2026-05-18 stat.ME stat.AP

The Negative Binomial Chain-Ladder: A Full Likelihood Model for Claim Count Reserving

负二项链梯法：一种完整的似然模型用于赔款准备

Robin Van Oirbeek

AI总结本文提出负二项链梯模型，通过泊松-伽马构造自然产生负二项分布，提供更清晰的生成解释，统一了链梯方法家族，并通过模拟验证了模型的稳健性。

Comments 35 pages, 3 figures

详情

AI中文摘要

链梯法仍是非寿险赔款准备的主要宏观技术，但其经典形式缺乏一致的概率基础。现有随机扩展，包括马科模型和过分散泊松（ODP）框架，提供不确定性度量但依赖二阶矩假设或准似然方差结构。本文开发了一种负二项链梯（NB-CL）模型，将链梯方法嵌入完整的似然框架中。关键贡献是微观层面推导，显示负二项分布自然源于泊松-伽马构造：索赔按具有伽马分布年度异质性的泊松过程到达，聚合产生负二项增量计数。此推导赋予分散参数κ结构解释，即年度异质性，而非随意的过分散调整。NB-CL模型在κ→∞极限下推广泊松链梯模型，与ODP模型共享点估计但方差函数不同（二次vs线性），并在单个概率层级内统一链梯家族。开发了参数Bootstrap程序以纳入过程和参数不确定性。模拟研究证实，在正确规范下，当分散参数经过偏差校正后，覆盖率接近名义水平；在模型不规范情况下表现出受控退化。对索赔计数数据（澳大利亚机动车身体伤害）和已付金额（泰勒-阿什）的实证研究证实了κ的结构解读以及在金额情况下的工作近似状态。

英文摘要

The Chain-Ladder (CL) method remains the dominant macro-level technique for claims reserving in non-life insurance, yet its classical formulation lacks a coherent probabilistic foundation. Existing stochastic extensions-including the Mack model and the Over-Dispersed Poisson (ODP) framework-provide measures of uncertainty but rely on second-moment assumptions or quasi-likelihood variance structures without clear generative interpretations. This paper develops a Negative Binomial Chain-Ladder (NB-CL) model that embeds the CL method within a full likelihood-based framework. The key contribution is a micro-level derivation showing that the negative binomial distribution arises naturally from a Poisson-Gamma construction: claims arrive according to a Poisson process with Gamma-distributed accident-year heterogeneity, and aggregation yields negative binomial incremental counts. This derivation gives the dispersion parameter $κ$ a structural interpretation as accident-year heterogeneity, rather than an ad-hoc overdispersion adjustment. The NB-CL model generalises the Poisson Chain-Ladder model in the limit $κ\to \infty$, shares the point estimates of the ODP model while differing in its variance function (quadratic vs. linear), and unifies the Chain-Ladder family within a single probabilistic hierarchy. A parametric bootstrap procedure is developed to incorporate both process and parameter uncertainty. Simulation studies confirm near-nominal coverage under correct specification once the dispersion parameter is bias-corrected, and a controlled degradation under model misspecification. Empirical illustrations on claim count data (Australian motor bodily injury) and paid amounts (Taylor-Ashe) document both the structural reading of $κ$ and the working-approximation status of the model in the amounts case.

URL PDF HTML ☆

赞 0 踩 0

2605.15802 2026-05-18 stat.ME

Generalized raking and stabilized weights for regression modeling in two-phase samples

双重抽样回归建模中的广义校正与稳定权重

Tong Chen, Joshua Slone, Gustavo Amorim, Pamela A. Shaw, Bryan E. Shepherd, Thomas Lumley

AI总结本文提出结合广义校正与稳定权重的方法，用于双重抽样回归建模，通过减少权重变异提升效率，利用辅助变量信息提高精度。

详情

AI中文摘要

在复杂调查设计数据拟合的回归模型中，采样权重常包含非必要的变异，导致方差估计膨胀。稳定权重通过调整采样权重以考虑协变量解释的变异来缓解这一问题。在双重抽样背景下，我们评估了最优稳定权重的表现，并提出将稳定权重估计器与广义校正结合，这是一种高效的基于设计的估计器。这种结合通过减少不必要的权重变异并利用辅助变量信息来提高效率。我们展示了这种结合可以使用标准统计软件实现，该软件处理双重抽样和广义校正。模拟研究显示，所提出的估计器在现实中的双重抽样设计下提高了精度，尽管在高度信息性设计中效率提升可能有限。所开发的方法应用于一项大规模的多国双重抽样研究，研究Kaposi肉芽肿在人类免疫缺陷病毒感染者中的情况。

英文摘要

In regression models fitted to data from complex survey designs, sampling weights often incorporate non-essential variation, inflating variance estimates. Stabilized weights mitigate this issue by adjusting sampling weights to account for variation explained by covariates. In the context of two-phase sampling, we evaluate the performance of optimal stabilized weights and propose combining the stabilized weight estimator with generalized raking, a class of efficient design-based estimators. This combination improves efficiency by reducing unnecessary weight variation and leveraging information from auxiliary variables. We show this combination can be implemented using the standard statistical package that handles two-phase samples and generalized raking. Simulation studies demonstrate that the proposed estimator enhances precision under realistic two-phase designs, though efficiency gains may be limited in highly informative designs. The developed methods were applied to a large multinational two-phase study of Kaposi sarcoma among people living with HIV.

URL PDF HTML ☆

赞 0 踩 0

2605.15789 2026-05-18 cs.LG eess.SP stat.ML

Learning Context-conditioned Gaussian Overbounds for Convolution-Based Uncertainty Propagation

基于卷积的不确定性传播的上下文条件高斯上界学习

Ruirui Liu, Xuejie Hou, Yiping Jiang, Hui Ren

AI总结本文提出一种统一的学习框架，通过训练神经网络生成上下文感知的高斯上界，确保在有限分位数网格上具有可证明的保守性，并在满足三个显式正则性假设时在认证区间内保持连续尾保守性。

详情

AI中文摘要

不确定性量化在安全关键领域至关重要——从自动驾驶到航空、金融和健康——其中决策必须依赖保守的界限而非点估计。预测层面的区间（如分位数回归、符合预测、方差网络或贝叶斯模型）通常不具有可组合性：将两个变量的区间相加不一定得到其和的合法区间或保持覆盖率。在航空领域，高斯上界用复杂的误差分布替换为保守的高斯分布，其尾部支配真实分布，因此保守性通过线性操作传播。然而，经典上界是全局的，通常过于保守，且难以适应特征条件误差。我们提出了一种统一的学习框架，训练神经网络生成上下文感知的高斯上界——均值和尺度——在有限分位数网格上具有可证明的保守性，并在满足三个显式正则性假设时在认证区间内保持连续尾保守性。我们的上界损失在选定的分位数上强制保守性，同时用一种类似瓦瑟斯坦的项惩罚分布距离。所学习的界限支持在强制网格上进行保守的线性组合和卷积分析，并在假设成立时在认证区间内进行保守性分析，同时比传统方法更不冗余。我们提供了离散到连续保守性的范围分析和紧域目标正则性的分析，并在合成数据和真实世界数据集上进行了验证，包括多路径、电离层和对流层残差误差。在这些设置中，该方法在保持强制网格上的保守性的同时，提供了更紧的界限。该框架是模态无关的，并适用于需要在动态环境中进行保守、特征条件不确定性估计的学习系统。

英文摘要

Uncertainty quantification is essential in safety-critical settings--from autonomous driving to aviation, finance, and health--where decisions must rely on conservative bounds rather than point estimates. Predictor-level intervals (e.g., from quantile regression, conformal prediction, variance networks, or Bayesian models) generally do not compose: adding two per-variable intervals need not yield a valid interval for their sum or preserve coverage. In aviation, Gaussian overbounding replaces complex error distributions with a conservative Gaussian whose tails dominate the truth, so conservatism propagates through linear operations. Yet classical overbounds are global, often overly conservative, and hard to adapt to feature-conditioned errors. We propose a unified learning framework that trains neural networks to produce context-aware Gaussian overbounds--mean and scale--with provable conservatism on a finite quantile grid and, under three explicit regularity assumptions, continuous-tail conservatism on a certified interval. Our overbounding loss enforces conservativeness at selected quantiles while penalizing distributional distance with a Wasserstein-style term. The learned bounds support conservative linear-combination and convolution analysis on the enforced grid, and on the certified interval when assumptions hold, while being less redundant than traditional methods. We provide a scoped analysis of discrete-to-continuous conservatism and compact-domain objective regularity, and validate on synthetic data and real-world datasets, including multipath, ionospheric, and tropospheric residual errors. Across these settings, the method yields tighter bounds while maintaining conservatism on the enforced grid and in experiments. The framework is modality-agnostic and applicable to learning systems that require conservative, feature-conditioned uncertainty estimates in dynamic environments.

URL PDF HTML ☆

赞 0 踩 0

2605.15108 2026-05-18 stat.ML cs.AI cs.IR cs.LG stat.ME

Logging Policy Design for Off-Policy Evaluation

为离线策略评估设计日志策略

Connor Douglas, Joel Persson, Foster Provost

AI总结本文研究如何设计日志策略以最小化OPE误差，探讨了奖励与覆盖之间的根本权衡，并在不同信息场景下提出了最优策略。

详情

AI中文摘要

离线策略评估（OPE）利用不同日志策略收集的数据来估计目标策略（如推荐系统）的价值。它使高风险实验无需实时部署，但实际准确性严重依赖于用于计算估计值的数据收集日志策略。我们研究如何设计日志策略以最小化OPE误差。我们刻画了一个根本的奖励-覆盖权衡：将概率质量集中在高奖励动作上会减少方差，但可能错过目标策略可能采取的动作的信号。我们提出了一种统一的日志策略设计框架，并在目标策略和奖励分布已知、未知或部分通过先验或噪声估计可知的信息场景中推导出最优策略。我们的结果为公司选择多个候选推荐系统提供了可行指导。我们展示了在收集OPE数据时治疗选择的重要性，并在该目标是公司主要目标时描述了理论上最优的方法。我们还提炼了在操作约束防止实施理论最优的情况下选择日志策略的实用设计原则。

英文摘要

Off-policy evaluation (OPE) estimates the value of a target treatment policy (e.g., a recommender system) using data collected by a different logging policy. It enables high-stakes experimentation without live deployment, yet in practice accuracy depends heavily on the logging policy used to collect data for computing the estimate. We study how to design logging policies that minimize OPE error for given target policies. We characterize a fundamental reward-coverage tradeoff: concentrating probability mass on high-reward actions reduces variance but risks missing signal on actions the target policy may take. We propose a unifying framework for logging policy design and derive optimal policies in canonical informational regimes where the target policy and reward distribution are (i) known, (ii) unknown, and (iii) partially known through priors or noisy estimates at logging time. Our results provide actionable guidance for firms choosing among multiple candidate recommendation systems. We demonstrate the importance of treatment selection when gathering data for OPE, and describe theoretically optimal approaches when this is a firm's primary objective. We also distill practical design principles for selecting logging policies when operational constraints prevent implementing the theoretical optimum.

URL PDF HTML ☆

赞 0 踩 0

2605.14260 2026-05-18 stat.ML cs.LG

一种通过混合整数线性规划进行最优分组序贯检验的一般框架

Dae Woong Ham, Stefanus Jasin, Xuejun Zhao

AI总结本文提出了一种基于混合整数线性规划的优化方法，用于在控制I型和II型错误的前提下，改进分组序贯检验的拒绝准则，展示了其在急性肾损伤干预研究中的应用效果。

详情

AI中文摘要

迈向Tsallis完全概率设计

Vyacheslav Kungurtsev, Giovanni Russo

AI总结本文提出基于Tsallis散度的完全概率设计框架，用于处理非高斯尾部行为的随机过程，通过双迭代方案证明了其收敛性与最优性。

2601.21765 2026-05-18 stat.CO stat.ME stat.ML

Mean-field Variational Bayes for Sparse Probit Regression

稀疏Probit回归的均场变分贝叶斯方法

Augusto Fasano, Giovanni Rebaudo

AI总结本文提出基于均场变分贝叶斯的方法，用于二元结果的稀疏变量选择，通过闭式更新实现高效推断，相比MCMC方法速度快数十倍且保持准确性。

2601.21636 2026-05-18 cs.LG cs.CR stat.ML

Sampling-Free Privacy Accounting for Matrix Mechanisms under Random Allocation

无需采样矩阵机制下的随机分配隐私计费

Jan Schuchardt, Nikita Kalinin

AI总结本文提出基于Rényi散度和条件组合的无采样界限，用于矩阵分解下随机分配的差分隐私放大，解决了采样方法的高概率保证和随机放弃问题，适用于任意带状和非带状矩阵。

详情

AI中文摘要

我们研究了在随机分配（也称为球入箱模型）下矩阵分解中差分隐私模型训练的隐私放大。Choquette-Choo等人（2025）提出了一种基于采样的蒙特卡洛方法来计算放大参数，但其保证要么仅在高概率下成立，要么需要机制的随机放弃。此外，确保(ε,δ)-DP所需的样本数与δ成反比。相反，我们开发了基于Rényi散度和条件组合的无采样界限。前者通过动态规划公式高效计算界限，后者通过提供更强的隐私保证来补充，特别是在小ε的情况下，Rényi散度界限本质上导致过估计。我们的框架适用于任意带状和非带状矩阵。通过数值比较，我们展示了我们的方法在广泛使用的矩阵机制中的有效性。

英文摘要

We study privacy amplification for differentially private model training with matrix factorization under random allocation (also known as the balls-in-bins model). Recent work by Choquette-Choo et al. (2025) proposes a sampling-based Monte Carlo approach to compute amplification parameters in this setting. However, their guarantees either only hold with some high probability or require random abstention by the mechanism. Furthermore, the required number of samples for ensuring $(ε,δ)$-DP is inversely proportional to $δ$. In contrast, we develop sampling-free bounds based on Rényi divergence and conditional composition. The former is facilitated by a dynamic programming formulation to efficiently compute the bounds. The latter complements it by offering stronger privacy guarantees for small $ε$, where Rényi divergence bounds inherently lead to an over-approximation. Our framework applies to arbitrary banded and non-banded matrices. Through numerical comparisons, we demonstrate the efficacy of our approach across a broad range of matrix mechanisms used in research and practice.

URL PDF HTML ☆

赞 0 踩 0

2601.20761 2026-05-18 cs.IT math.IT math.ST stat.TH

Anytime-Valid Quantum State Tomography via Confidence Sequences

基于置信序列的 anytime 量子态重构

Aldo Cumitini, Luca Barletta, Osvaldo Simeone

AI总结本文提出基于置信序列的 anytime 量子态重构方法，通过在每次测量后提供具有用户定义概率的置信集，实现对量子态估计不确定性的严格量化。

Comments Paper submitted to an IEEE journal

2512.14473 2026-05-18 math.ST stat.TH

Sharp convergence rates for Spectral methods via the feature space decomposition method

通过特征空间分解方法获得谱方法的精确收敛率

Guillaume Lecué, Zhifan Li, Zong Shang

AI总结本文通过特征空间分解方法，在一般条件下获得谱方法在线性回归中的总体超额风险的匹配上界和下界，从而定义谱方法的收敛率优先级，并推广反问题中的饱和效应，提供其发生条件。

2511.18225 2026-05-18 cs.LG stat.ML stat.OT

Adaptive Conformal Prediction for Quantum Machine Learning

适应性符合预测用于量子机器学习

Douglas Spencer, Samual Nicholls, Michele Caprio

AI总结本文提出适应性量子符合预测算法，解决量子处理器时间变化噪声对符合保证的影响，通过重复校准保持有效性，实验证明其在IBM量子处理器上的稳定性和覆盖率。

Comments Accepted at TMLR 05/2026. 27 pages, 5 figures

详情

Journal ref: Transactions on Machine Learning Research, May 2026, ISSN 2835-8856

AI中文摘要

量子机器学习旨在利用量子计算机改进经典机器学习算法。目前，量子领域仍缺乏稳健的不确定性量化方法，尽管需要可靠和可信的预测。最近的工作引入了量子符合预测框架，该框架能产生保证包含真实结果的概率预测集。本文正式阐述了量子处理器中固有的时间变化噪声如何即使在校准和测试数据可交换的情况下也会破坏符合保证。为解决这一挑战，我们借鉴了适应性符合推断方法，该方法通过重复校准在时间上保持有效性。我们引入了适应性量子符合预测（AQCP）算法，该算法在任意硬件噪声条件下提供渐近平均覆盖率保证。在IBM量子处理器上的实验证明，AQCP实现了目标覆盖率并表现出比量子符合预测更大的稳定性。

英文摘要

Quantum machine learning seeks to leverage quantum computers to improve upon classical machine learning algorithms. Currently, robust uncertainty quantification methods remain underdeveloped in the quantum domain, despite the critical need for reliable and trustworthy predictions. Recent work has introduced quantum conformal prediction, a framework that produces prediction sets that are guaranteed to contain the true outcome with a user-specified probability. In this work, we formalise how the time-varying noise inherent in quantum processors can undermine conformal guarantees, even when calibration and test data are exchangeable. To address this challenge, we draw on Adaptive Conformal Inference, a method which maintains validity over time via repeated recalibration. We introduce Adaptive Quantum Conformal Prediction (AQCP), an algorithm which provides asymptotic average coverage guarantees under arbitrary hardware noise conditions. Empirical studies on an IBM quantum processor demonstrate that AQCP achieves the target coverage level and exhibits greater stability than quantum conformal prediction.

URL PDF HTML ☆

赞 0 踩 0

2510.24539 2026-05-18 stat.ME

Unbiased likelihood estimation of the Langevin diffusion for animal movement modelling

兰格vin扩散在动物运动建模中的无偏似然估计

Ron R. Togunov, S. Knutsen Furset, Martin E. Pettersen, Robert B. O'Hara

AI总结本文提出利用布朗桥进行重要性采样，改进兰格vin扩散模型的似然估计，以解决 telemetry 数据中自相关和时间不规则性的问题，提升生态栖息地选择研究的准确性。

详情

AI中文摘要

动物生态学中持续存在的挑战是开发能够考虑测距数据中自相关性和时间不规则性的运动模型。连续时间兰格vin扩散模型已被提出用于建模时间自相关和不规则采样数据。然而，当前的估计技术在观测间隔增加时会获得越来越偏的参数估计。本文提出利用布朗桥在重要性采样方案中改进兰格vin扩散模型的似然估计。在一系列模拟研究中，我们展示了我们的方法在各种场景下有效去除了偏倚。我们发现，数据跨度更长但采样频率较低时，估计的栖息地系数的精度提高。这表明该模型可能更适合用于采样分辨率较低的数据集，这在使用旧一代动物标签收集的数据集时很常见。我们利用斯特勒海狮（Eumetopias jubatus）的跟踪数据展示了本模型的应用。我们发现系数估计值收敛到显著不同于以前研究估计值的值，表明传统估计方法中的偏倚可能对栖息地偏好结论产生重大影响。这些改进拓宽了兰格vin扩散模型的应用范围，从而提高了对栖息地选择的生态见解。

英文摘要

An ongoing challenge in animal ecology is developing movement models that account for the autocorrelation, and often temporal irregularity, in telemetry data. Continuous-time Langevin diffusion models have been proposed to model temporally autocorrelated and irregularly sampled data. However, current estimation techniques obtain increasingly biased parameter estimates as the time between observations increases. In this paper, we propose using Brownian bridges in an importance sampling scheme to improve the likelihood approximation of the Langevin diffusion model. In a series of simulation studies, we showed that our approach effectively removed the bias under various scenarios. We found that the precision of the estimated habitat coefficients increased for data spanning a longer duration at a lower frequency than for shorter, more frequently sampled tracks. This suggests that the model may be well suited for modelling tracking data sampled at a coarser resolution, as is common in datasets collected with older generations of animal tags. We illustrated the application of our model using tracking data from Steller sea lions, \textit{Eumetopias jubatus}. We found that the coefficient estimates converged to values significantly different than those estimated in previous studies, suggesting that bias in conventional estimation methods may meaningfully affect ecological conclusions about habitat preference. Together, these improvements broaden the applicability of Langevin diffusion models, thereby improving ecological insight into habitat selection.

URL PDF HTML ☆

赞 0 踩 0

2510.20163 2026-05-18 math.PR math.ST stat.TH

Topics in Probability, Parametric Estimation and Stochastic Calculus

概率、参数估计与随机分析中的专题

Levi Lopes de Lima

AI总结本文系统发展参数估计的核心工具，结合几何视角探讨概率理论，涵盖集中不等式、极限定理等，并介绍布朗运动与伊藤公式及其应用。

Comments 201 pages; 2 figures; substantially rewritten in several parts to improve clarity and exposition, with new examples and contextual remarks added throughout; lots of typos fixed

详情

AI中文摘要

我们从概率论的基础开始，回顾其在现实问题中的重要应用：参数估计。文中系统发展这一主题，介绍集中不等式、极限定理、置信区间、最大似然估计、最小二乘和假设检验等核心工具，强调理论基础与实际相关性。通过几何视角探讨概率的不变性性质，特别是正态分布随机向量。附录介绍布朗运动和随机分析，最终得出伊藤公式。文章还展示了高斯集中不等式、费曼-科茨公式以及金融中的黑-索斯策略等应用。

英文摘要

We begin our journey by recalling the fundamentals of Probability Theory that underlie one of its most significant applications to real-world problems: Parametric Estimation. Throughout the text, we systematically develop this theme by presenting and discussing the main tools it encompasses (concentration inequalities, limit theorems, confidence intervals, maximum likelihood, least squares, and hypothesis testing) always with an eye toward both their theoretical underpinnings and practical relevance. While our approach follows the broad contours of conventional expositions, we depart from tradition by consistently exploring the geometric aspects of probability, particularly the invariance properties of normally distributed random vectors. This geometric perspective is taken further in an extended appendix, where we introduce the rudiments of Brownian motion and the corresponding stochastic calculus, culminating in Itô's celebrated change-of-variables formula. To highlight its scope and elegance, we present some of its most striking applications: the sharp Gaussian concentration inequality (a central example of the "concentration of measure phenomenon"), the Feynman-Kac formula (used to derive a path integral representation for the Laplacian heat kernel), and, as a concluding delicacy, the Black-Scholes strategy in Finance.

URL PDF HTML ☆

赞 0 踩 0

2510.18903 2026-05-18 stat.ME math.ST q-fin.ST stat.TH

Centered-Innovation MA for Bayesian Dirichlet ARMA: Theoretical Equivalence and an Application to Bank-Asset Shares

基于贝叶斯狄利克雷ARMA的中心创新MA：理论等价性及对银行资产份额的应用

Harrison Katz

AI总结本文研究了对组合时间序列的贝叶斯狄利克雷ARMA进行最小修改：用中心创新替代原始加性对数比残差。证明了在固定参数下，中心化规格与digamma链接DARMA在1/ϕ阶上的等价性，并通过银行资产份额数据验证了其预测性能。

详情

AI中文摘要

我们研究了对组合时间序列的贝叶斯狄利克雷ARMA（B--DARMA）进行最小修改：将移动平均块中的原始加性对数比（ALR）残差替换为一个中心创新，该创新减去狄利克雷条件ALR均值，可通过digamma恒等式得到闭合形式。我们证明了在固定参数下，中心化规格与digamma链接DARMA在1/ϕ阶上的等价性，前提是显式的内部和滞后稳定性条件成立。结果澄清了为何在高精度 regime 中两个规格应预测上不可区分，但本身并不控制重新估计产生的贝叶斯后验的几何结构。在每周联邦储备委员会H.8银行资产份额（2015年10月至2025年10月，T=522周）上，预测性能在104个滚动周起始点上在所有精度指标上统计上不可区分，而原始规格下的哈密顿蒙特卡罗发散转换在孤立的滚动拟合中大约更频繁一个数量级，这由局部的滚动拟合引起，原始后验表现出局部病态。四参考敏感性分析证实了预测等价性是参考不变的，并且中心化在不同参考下保持几何优势，但随原始病态拟合的普遍性而变化，从贷款参考的显著减少到现金参考的平局。实际意义是操作而非预测：中心化避免了在孤立滚动起始点出现的原始MA发散尖峰，这对生产流程中后验模拟用于下游压力测试至关重要。该调整是分析且插件式的，只需对MA创新计算进行局部修改。

英文摘要

We study a minimal change to an observation-driven Bayesian Dirichlet ARMA (B--DARMA) for compositional time series: replace the raw additive log-ratio (ALR) residual in the moving-average block with a centered innovation that subtracts the Dirichlet conditional ALR mean, available in closed form via digamma identities. We prove a recursion-level first-order equivalence (in $1/ϕ$) between the centered specification and a digamma-link DARMA at fixed parameters, under explicit interior and lag-stability conditions. The result clarifies why the two specifications should be predictively indistinguishable in the high-precision regime but does not by itself govern the geometry of the Bayesian posteriors that re-estimation produces. On weekly Federal Reserve H.8 bank-asset shares (October~2015 through October~2025, $T=522$ weeks), predictive performance is statistically indistinguishable across $104$ rolling weekly origins on every accuracy metric examined, while Hamiltonian Monte Carlo divergent transitions are approximately an order of magnitude more frequent under the raw specification, driven by isolated rolling fits at which the raw posterior exhibits localized pathologies. A four-reference sensitivity analysis confirms that predictive equivalence is reference-invariant and that the geometric advantage of centering is preserved across references but varies with the prevalence of pathological raw fits, from a substantial reduction at the loans reference to parity at the cash reference. The practical implication is operational rather than predictive: centering avoids the catastrophic raw-MA divergence spikes that occur at isolated rolling origins, which matters for production workflows in which posterior simulation feeds downstream stress tests. The adjustment is analytic and plug-in, and requires only a local change to the MA innovation calculation.

URL PDF HTML ☆

赞 0 踩 0

2509.22739 2026-05-18 cs.CL cs.AI cs.LG stat.ML

Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models

无痛激活导向：一种自动化、轻量级的微调大型语言模型方法

Sasha Cui, Zhongren Chen

AI总结本文提出Painless Activation Steering，一种自动化方法，无需人工干预即可利用标注数据提升模型性能，尤其在行为任务中表现优异，但对智能任务效果有限。

详情

AI中文摘要

语言模型通常通过权重或提示导向进行微调，但前者耗时昂贵，后者控制不精确且需手动试错。激活导向（AS）提供了一种更经济、快速且可控的替代方法，但现有技术需人工构造提示对或进行大量特征标注，不如RL和SFT等方法方便。本文引入Painless Activation Steering（PAS），一种完全自动的方法，可利用任何标注数据集进行AS，无需提示构造、特征标注或人工干预。在三个开源模型和18个任务上评估PAS，发现其在行为任务中性能可靠，但对智能任务效果有限。 introspective variant（iPAS）在偏差、道德和对齐任务上分别提升了10.1%、5.2%和34.8%。此外，PAS在上下文学习（ICL）和SFT基础上还提供了额外增益。PAS构建了一个快速、轻量的激活向量，可低成本训练、存储和激活。实验结果为AS的应用提供了明确的指导，展示了其作为实用自动化微调方法的潜力。

英文摘要

Language models (LMs) are typically post-trained for desired capabilities and behaviors via weight-based or prompt-based steering, but the former is time-consuming and expensive, and the latter is not precisely controllable and often requires manual trial-and-error. While activation steering (AS) promises a cheap, fast, and controllable alternative to the two existing post-training methods, current AS techniques require hand-crafted prompt pairs or labor-intensive feature annotation, making them more inconvenient than the plug-and-play methods such as Reinforcement Learning (RL) and Supervised Fine-Tuning (SFT). We introduce Painless Activation Steering (PAS), a family of fully automated methods that make AS readily usable with any given labeled dataset, with no need for prompt construction, feature labeling, or human intervention. We evaluate PAS on three open-weight models (Llama3.1-8B-Instruct, DeepSeek-R1-Distill-8B, and Nous-Hermes-2) and 18 tasks; we find that PAS reliably improves performance for behavior tasks, but not for intelligence-oriented tasks. The introspective variant (iPAS) delivers the strongest causal steering effects (10.1% on Bias, 5.2% on Morality, and 34.8% on Alignment). We also show PAS delivers additional gains on top of In-Context Learning (ICL) and SFT. PAS constructs a fast, lightweight activation vector that can be cheaply trained, easily stored, and activated at will. Our results provide a characterization of where AS helps, where it fails, and how to deploy it as a practical, automated LM post-training option.

URL PDF HTML ☆

赞 0 踩 0

2508.14690 2026-05-18 stat.ME

Nesting a Target Study within a Target Trial: A Framework for Evaluating Intervention Effects on Disparities

将目标研究嵌入目标试验：评估干预对不平等影响的框架

Xinyi Sun, Theodore J. Iwashyna, Emmanuel F. Drabo, Deidra C. Crews, Kadija Ferryman, John W. Jackson

AI总结本文提出TS+TT框架，通过伦理假设测量不平等，结合分层抽样和随机化策略，评估干预对不平等的影响，并扩展G-computation处理连续干预。

Comments Main text: 23 pages, 4 tables; Appendix: 45 pages

详情

AI中文摘要

我们提出了一种新颖的框架（TS+TT），用于将目标研究（TS）嵌入目标试验（TT）中，以评估干预对不平等的影响。TS部分基于允许性概念，将不平等的测量根植于伦理假设，并将其锚定在特定时间内的明确人群。它指定了分层抽样计划，以获得在允许的协变量上社会群体分布相似的样本。在该样本中，TT部分在每个社会群体内随机化干预策略。由于社会群体在基线时在允许的协变量上处于相似位置，并且在社会群体内分配的干预组是可交换的，TS+TT反映了评估干预如何影响不平等的有意义的因果估计量。我们描述了该框架的关键组成部分、其模拟以及其在评估假设干预对脉搏血氧仪偏见影响治疗获取不平等的临床护理中的应用。我们还扩展了半参数G计算法，以适应连续随机干预，并估计时间到事件结果的因果不平等。TS+TT框架提供了一种灵活且政策相关的方法，用于生成具有伦理意识的因果证据，以减少不平等并避免加剧不平等。

英文摘要

We present a novel framework (TS+TT) to nest a Target Study (TS) within a Target Trial (TT) for evaluating the effects of interventions on disparities. The TS component grounds the measurement of disparity in ethical assumptions, based on the concept of allowability, and anchors it to an explicit population within calendar time. It specifies an enrollment plan of stratified sampling of eligible persons to yield a sample where social groups are distributionally similar on covariates deemed allowable for measuring disparity. Within this enrolled sample, the TT component specifies randomization of intervention strategies within each social group. Because social groups are similarly situated on allowable covariates at baseline, and because assigned intervention arms are exchangeable within social groups, TS+TT reflects a meaningful causal estimand for evaluating how interventions impact disparity. We describe the framework's key components, its emulation, and demonstrate its application to evaluate how hypothetical interventions on pulse oximeter bias affect disparities in treatment receipt in clinical care. We also extend semiparametric G-computation to accommodate continuous stochastic interventions and estimate counterfactual disparities in time-to-event outcomes. The TS+TT framework offers a versatile and policy-relevant approach for generating ethically informed causal evidence to reduce disparities and avoid exacerbating disparities.

URL PDF HTML ☆

赞 0 踩 0

2507.15475 2026-05-18 eess.SP math.PR stat.AP

On the Distribution of a Two-Dimensional Random Walk with Restricted Angles

二维受限角度随机游走的分布

Karl-Ludwig Besser

AI总结研究受限角度二维随机游走的分布，推导两步联合与边缘分布，提供一般步数的数值解及大步数近似，明确支持集的精确描述。

Comments 14 pages, 14 figures

2507.02032 2026-05-18 hep-ph hep-ex physics.data-an stat.ML

Neural simulation-based inference of the Higgs trilinear self-coupling via off-shell Higgs production

基于神经模拟的Higgs三线性自耦合推断：通过非壳Higgs生产

Aishik Ghosh, Maximilian Griese, Ulrich Haisch, Tae Hyoun Park

AI总结本文提出一种混合神经模拟推断方法，用于推断Higgs三线性自耦合，结合标准模型有效场论和背景过程，实现高亮度大型强子对撞机的约束。

Comments 27 pages, 17 figures, 2 tables; v2: revised and improved version of the manuscript as accepted for publication in EPJC

详情

AI中文摘要

粒子物理中的一项重大挑战是实验确定Higgs三线性自耦合。尽管研究主要集中在质子-质子碰撞中的壳内双Higgs和单Higgs生产，非壳Higgs生产也被提出作为有价值的补充探测手段。本文设计了一种混合神经模拟基于推断（NSBI）方法，以构建包含标准模型有效场论（SMEFT）修改、相关背景过程和量子干涉效应的Higgs信号似然性。该方法利用矩阵元增强技术的训练效率，对于稳健的SMEFT应用至关重要，同时结合基于分类方法的实用优势以获得有效的背景估计。我们证明了NSBI方法的灵敏度接近理论最优，并提供了预期的高亮度升级大型强子对撞机的约束。虽然我们主要关注Higgs三线性自耦合，但也考虑了影响非壳Higgs生产其他SMEFT算符的约束。

英文摘要

One of the forthcoming major challenges in particle physics is the experimental determination of the Higgs trilinear self-coupling. While efforts have largely focused on on-shell double- and single-Higgs production in proton-proton collisions, off-shell Higgs production has also been proposed as a valuable complementary probe. In this article, we design a hybrid neural simulation-based inference (NSBI) approach to construct a likelihood of the Higgs signal incorporating modifications from the Standard Model effective field theory (SMEFT), relevant background processes, and quantum interference effects. It leverages the training efficiency of matrix-element-enhanced techniques, which are vital for robust SMEFT applications, while also incorporating the practical advantages of classification-based methods for effective background estimates. We demonstrate that our NSBI approach achieves sensitivity close to the theoretical optimum and provide expected constraints for the high-luminosity upgrade of the Large Hadron Collider. While we primarily concentrate on the Higgs trilinear self-coupling, we also consider constraints on other SMEFT operators that affect off-shell Higgs production.

URL PDF HTML ☆

赞 0 踩 0

2504.20268 2026-05-18 stat.AP

图神经网络的可解释性：评估全球变化驱动因素对生态网络的影响

Emre Anakok, Pierre Barbillon, Colin Fontaine, Elisa Thebault

AI总结研究通过图神经网络分析全球变化驱动因素对传粉网络连接性的影响，探讨环境变量与植物属的交互作用，并验证去偏技术对估计效果的影响。

详情

DOI: 10.1016/j.ecolmodel.2025.111472

AI中文摘要

传粉者在植物繁殖中起关键作用，无论是自然生态系统还是人类修改的景观。全球变化驱动因素，如气候变化或土地利用修改，会改变植物-传粉者相互作用。为了评估全球变化驱动因素对传粉的影响，需要大规模的相互作用、气候和土地利用数据。尽管最近的机器学习方法，如图神经网络（GNNs），允许分析此类数据集，但解释其结果具有挑战性。我们探索现有的GNN解释方法，以突出各种环境协变量对传粉网络连接性的影响。进行了广泛的模拟研究，以确认这些方法能否检测协变量与植物属之间的交互作用，以及去偏技术的应用是否影响这些效果的估计。对Spipoll数据集的应用，包括和不包括考虑采样效应，突显了土地利用对网络连接性潜在影响，并显示考虑采样效应部分改变了这些效果的估计。

英文摘要

Pollinators play a crucial role for plant reproduction, either in natural ecosystem or in human-modified landscape. Global change drivers,including climate change or land use modifications, can alter the plant-pollinator interactions. To assess the potential influence of global change drivers on pollination, large-scale interactions, climate and land use data are required. While recent machine learning methods, such as graph neural networks (GNNs), allow the analysis of such datasets, interpreting their results can be challenging. We explore existing methods for interpreting GNNs in order to highlight the effects of various environmental covariates on pollination network connectivity. An extensive simulation study is performed to confirm whether these methods can detect the interactive effect between a covariate and a genus of plant on connectivity, and whether the application of debiasing techniques influences the estimation of these effects. An application on the Spipoll dataset, with and without accounting for sampling effects, highlights the potential impact of land use on network connectivity and shows that accounting for sampling effects partially alters the estimation of these effects.

URL PDF HTML ☆

赞 0 踩 0

2503.14311 2026-05-18 math.ST stat.ME stat.TH

Asymptotic properties of the MLE in distributional regression under random censoring

分布回归中随机截断下MLE的渐进行为

Gitte Kremling, Gerhard Dikta

AI总结研究在随机右截断下分布回归中MLE的渐近性质，证明其几乎处处一致性和渐近正态性，并通过模拟和实际数据验证。

2412.11308 2026-05-18 stat.ML cs.LG

From XAI to MLOps: Explainable Concept Drift Detection with Profile Drift Detection

从XAI到MLOps：基于轮廓漂移检测的可解释概念漂移检测

Ugur Dar, Mustafa Cavus

AI总结本文提出轮廓漂移检测方法，利用可解释AI工具部分依赖性轮廓图，通过新的漂移度量标准检测概念漂移并理解其原因，实验表明其在保持预测性能的同时有效平衡了漂移信号的敏感性和稳定性。

Comments 15 pages, 6 figures

详情

DOI: 10.1016/j.future.2026.108586
Journal ref: Future Generation Computer Systems (2026)

AI中文摘要

预测模型的性能往往因数据分布的变化而下降，这种现象称为数据漂移。其中，概念漂移（解释变量与响应变量之间的关系变化）尤其难以检测和适应。传统漂移检测方法通常依赖准确率或边缘变量分布等指标，可能无法捕捉到微妙但重要的概念变化。本文提出了一种新方法，轮廓漂移检测（PDD），通过利用可解释AI工具部分依赖性轮廓图（PDPs），实现了对概念漂移的检测和对其潜在原因的深入理解。PDD通过新的漂移度量标准量化PDPs的变化，这些度量标准对数据流中的变化敏感，同时保持计算效率。该方法与MLOps实践一致，强调在动态环境中持续的模型监控和适应性重训练。在合成和实际数据集上的实验表明，PDD在保持高预测性能的同时，有效平衡了漂移信号的敏感性和稳定性。结果突显了其在实时应用中的适用性，本文最后讨论了该方法的优势、限制以及向更广泛应用场景扩展的潜力。

英文摘要

Predictive models often degrade in performance due to evolving data distributions, a phenomenon known as data drift. Among its forms, concept drift, where the relationship between explanatory variables and the response variable changes, is particularly challenging to detect and adapt to. Traditional drift detection methods often rely on metrics such as accuracy or marginal variable distributions, which may fail to capture subtle but important conceptual changes. This paper proposes a novel method, Profile Drift Detection (PDD), which enables both the detection of concept drift and an enhanced understanding of its underlying causes by leveraging an explainable AI tool: Partial Dependence Profiles (PDPs). PDD quantifies changes in PDPs through new drift metrics that are sensitive to shifts in the data stream while remaining computationally efficient. This approach is aligned with MLOps practices, emphasizing continuous model monitoring and adaptive retraining in dynamic environments. Experiments on synthetic and real-world datasets demonstrate that PDD outperforms existing methods by maintaining high predictive performance while effectively balancing sensitivity and stability in drift signals. The results highlight its suitability for real-time applications, and the paper concludes by discussing the method's advantages, limitations, and potential extensions to broader use cases.

URL PDF HTML ☆

赞 0 踩 0

2406.02834 2026-05-18 stat.ME

Asymptotic inference with flexible covariate adjustment under rerandomization and stratified rerandomization

基于重新随机化和分层重新随机化的灵活协变量调整的渐近推断

Bingkai Wang, Fan Li

AI总结本文研究了在重新随机化和分层重新随机化下，更广泛的协变量调整估计量的渐近理论，证明了M估计量的渐近线性及影响函数在简单随机化与重新随机化下保持相同，但重新随机化可能导致非高斯渐近分布，并探讨了基于数据自适应机器学习的高效估计量的效率最优性。

详情

AI中文摘要

重新随机化是一种有效的治疗分配程序，用于控制基线协变量不平衡。对于估计平均治疗效应，重新随机化已被证明可以提高未调整和线性调整估计量的精度，而不会影响一致性。然而，重新随机化是否适用于更广泛的M估计量类，包括广义线性回归的g计算公式和双重鲁棒方法，以及更广泛的数据自适应机器学习的高效估计量，仍不清楚。本文发展了在重新随机化及其分层扩展下更广泛的协变量调整估计量的渐近理论。证明了在简单随机化和重新随机化下，任何M估计量的渐近线性及影响函数保持相同，但重新随机化可能导致非高斯渐近分布。我们进一步通过几个常见M估计量的例子解释，如果在最终估计量中适当调整重新随机化变量，则可以实现渐近正态性。这些结果扩展到分层重新随机化。最后，我们研究了基于数据自适应机器学习的高效估计量的渐近理论，并证明其在重新随机化和分层重新随机化下的效率最优性。我们的结果通过模拟和重新分析一个使用分层重新随机化的集群随机化实验得到验证。

英文摘要

Rerandomization is an effective treatment allocation procedure to control for baseline covariate imbalance. For estimating the average treatment effect, rerandomization has been previously shown to improve the precision of the unadjusted and the linearly-adjusted estimators over simple randomization without compromising consistency. However, it remains unclear whether such results apply more generally to the class of M-estimators, including the g-computation formula with generalized linear regression and doubly-robust methods, and more broadly, to efficient estimators with data-adaptive machine learners. In this paper, we develop the asymptotic theory for a more general class of covariate-adjusted estimators under rerandomization and its stratified extension. We prove that the asymptotic linearity and the influence function remain identical for any M-estimator under simple randomization and rerandomization, but rerandomization may lead to a non-Gaussian asymptotic distribution. We further explain, drawing examples from several common M-estimators, that asymptotic normality can be achieved if rerandomization variables are appropriately adjusted for in the final estimator. These results are extended to stratified rerandomization. Finally, we study the asymptotic theory for efficient estimators based on data-adaptive machine learners, and prove their efficiency optimality under rerandomization and stratified rerandomization. Our results are demonstrated via simulations and re-analyses of a cluster-randomized experiment that used stratified rerandomization.

URL PDF HTML ☆

赞 0 踩 0

2312.13992 2026-05-18 stat.ME

Bayesian nonparametric boundary detection for multiple areal data

基于多重区域数据的贝叶斯非参数边界检测

Matteo Gianella, Mario Beraha, Alessandra Guglielmi

AI总结本文提出一种贝叶斯非参数混合模型，用于多重区域数据的边界检测，通过空间依赖权重和随机组件数量，无需外部信息即可识别不同密度区域的边界，应用于洛杉矶地区收入不平等分析。

详情

AI中文摘要

我们考虑了区域数据的边界检测问题，重点在于每个区域单元有多个观测值的情况。我们提出了一种贝叶斯非参数混合模型，用于区域特定的人口密度，具有空间依赖权重和随机组件数量。与之前的方法不同，我们不需要外部信息如区域特定协变量或相似性度量。通过利用每个区域的多个样本信息，能够识别出密度不同的区域边界。关键在于混合组件数量需要从数据中学习以获得有意义的边界检测，因为过度拟合的混合模型存在非识别性。因此，我们假设该数量随机，并在其上放置先验。动机应用是分析大洛杉矶地区的经济不平等，通常导致社会不平等和动荡。通过利用最近引入的最优辅助先验，高效的后验计算由一种转维马尔可夫链蒙特卡洛采样器实现。该方法通过广泛模拟验证，并应用于大洛杉矶地区的收入数据。我们识别出收入分布中的几个边界，这些边界可以事后解释为无健康保险人口的百分比，但不能解释为总犯罪数，显示了这种分析对政策制定者的有用性。

英文摘要

We consider the problem of boundary detection for areal data, focusing on situations where for each areal unit multiple observations are available. We propose a Bayesian nonparametric mixture model for the area-specific population densities, with spatially dependent weights and a random number of components. Contrary to previously proposed methods for boundary detection, which consider one observation per areal unit, ours does not require external information such as area-specific covariates or dissimilarity metrics. Instead, by exploiting information from multiple samples per area, it is able to identify boundaries between areas that exhibit different densities. Crucially, the number of mixture components needs to be learned from data to obtain meaningful boundary detection, due to the non-identifiability of overfitted mixtures. Therefore, we assume it random by placing a prior on it. The motivating application is the analysis of economic inequality in the greater Los Angeles region, which typically yields social inequality and unrest. Efficient posterior computation is facilitated by a transdimensional Markov Chain Monte Carlo sampler which exploits the recently introduced optimal auxiliary priors to improve the mixing. The methodology is validated via extensive simulations and applied to the income data in the greater Los Angeles region. We identify several boundaries in the income distributions, which can be explained ex-post in terms of the percentage of the population without health insurance, though not in terms of the total number of crimes, showing the usefulness of such an analysis to policymakers.

URL PDF HTML ☆

赞 0 踩 0

2605.15756 2026-05-18 cs.HC stat.AP

Separating Acute Psychological Stress from Physical Exertion in Biometric Signals

在生物信号中区分急性心理压力与体力消耗

Esther Bosch

AI总结研究通过分析五种生理信号在认知压力与体力活动下的反应，发现 tonic electrodermal activity 是区分心理压力与体力消耗最有效的指标，其他信号则受体力活动影响更大。

详情

AI中文摘要

急性心理压力在日常情境中广泛出现，包括交通、职业环境和体力活动，其可靠检测可实现自适应系统响应并支持人类福祉。自动压力识别的持续挑战是区分急性心理压力的生物信号与同时发生的体力消耗信号。本研究考察了五种生理信号（ tonic electrodermal activity、trapezius electromyography、心率、心率变异性、呼吸率）在认知压力和体力活动下的反应，单独和组合情况下。十九名参与者在2x3组内设计中完成了n-back算术任务结合社交压力和金钱奖励，三个活动条件：静坐、行走和静力骑行。多级线性混合模型和重复测量方差分析用于分解每种传感器的主效应和交互作用。tonic electrodermal activity 对认知压力（r=0.48）和体力消耗（r=0.67）有稳健的加法反应，无交互作用，使其成为体力活动期间压力检测最有前途的候选者。心率和trapezius electromyography几乎完全由体力消耗驱动，无可靠敏感性。RMSSD被体力活动强烈抑制，对认知负荷只有微弱敏感性。呼吸率受体力活动主导，主分析中无可靠压力效应。这些发现提供了现实世界压力检测的传感器特异性层级，并突显tonic electrodermal activity为在体力活动人群中识别认知压力时最信息丰富的通道。

英文摘要

Acute psychological stress occurs in a wide range of everyday contexts, including transportation, occupational settings, and physical activity, where its reliable detection could enable adaptive system responses and support human well-being. A persistent challenge in automated stress recognition is disentangling the biometric signatures of acute psychological stress from those of concurrent physical exertion. This study examined how five physiological signals (tonic electrodermal activity, trapezius electromyography, heart rate, heart rate variability, and respiration rate) respond to cognitive stress and physical activity, independently and in combination. Nineteen participants completed a 2x3 within-subjects design in which acute psychological stress was induced via an n-back arithmetic task combined with social pressure and financial reward, across three activity conditions: idle sitting, walking, and stationary cycling. Multilevel linear mixed models and repeated-measures ANOVA were used to decompose main effects and interactions for each sensor. Tonic electrodermal activity showed a robust, additive response to both cognitive stress (r=0.48) and physical exertion (r=0.67), with no interaction, making it the most promising candidate for stress detection during physical activity. Heart rate and trapezius electromyography were driven almost exclusively by physical exertion, with no reliable sensitivity to the stress task. RMSSD was strongly suppressed by physical activity and showed only marginal sensitivity to cognitive load. Respiration rate was dominated by physical activity, with no reliable stress effect in the primary analysis. These findings provide a sensor-specific hierarchy for real-world stress detection and highlight tonic electrodermal activity as the most informative channel when cognitive stress must be identified in physically active populations.

URL PDF HTML ☆

赞 0 踩 0

2605.15702 2026-05-18 stat.ME

Re-examining and calibrating weighted survival analysis for causal inference

重新审视和校准加权生存分析用于因果推断

Wenfu Xu, Yi Zhang, Tobias Gerhard, Zhiqiang Tan

AI总结本文重新审视加权Kaplan-Meier方法，并开发新的校准方法以改进生存分析的统计特性，通过模拟和实证研究验证了其有效性。

详情

AI中文摘要

基于时间到事件结局的因果推断在各种科学研究中至关重要。在静态设置中使用拟合的倾向分数时，加权Kaplan-Meier估计生存概率和加权Breslow-Peto估计危险比已被广泛应用，但其统计特性往往被忽视或仅有限研究。我们通过正式将其与增强逆概率加权估计的一般框架联系起来，重新审视加权Kaplan-Meier方法，并通过校准估计在低维和高维设置中开发新的方法和相关理论。我们展示了模拟研究和精神分裂症患者辅助抗精神病治疗效果的实证应用。校准方法在模拟研究中产生更接近目标的覆盖率比例，并在模拟和实证研究中产生更短的置信区间。

英文摘要

Causal inference with time-to-event outcomes is fundamental in various scientific studies. In a static setup with fitted propensity scores, weighted Kaplan-Meier estimation for survival probabilities and weighted Breslow-Peto estimation for hazard ratios have been widely used, but their statistical properties have been overlooked or studied only to a limited extent. We re-examine the weighted Kaplan-Meier method by formally linking it with the general framework of augmented inverse probability weighted estimation including both point and variance estimation. Furthermore, to address limitations of existing weighted methods for survival analysis, we develop new methods and associated theory through calibrated estimation in both low-dimensional and high-dimensional settings. We present a simulation study and an empirical application on the effectiveness of adjunctive psychotropic treatments for patients with schizophrenia. The calibrated methods yield coverage proportions closer to target ones in the simulation study, and produce shorter confidence intervals in both simulation and empirical studies.

URL PDF HTML ☆

赞 0 踩 0

2605.15692 2026-05-18 cs.LG stat.ML

Tighter Regret Bounds for Contextual Action-Set Reinforcement Learning

更紧的基于上下文动作集强化学习的遗憾界

Zijun Chen, Zihan Zhang

AI总结本文研究了具有固定奖励和转移函数的回合制强化学习，但每个回合的动作集依赖于回合。通过MVP算法，建立了对抗性和随机性情境下的更紧遗憾界，并推导了样本复杂度和间隙依赖的遗憾界。

详情

AI中文摘要

我们研究了具有固定奖励和转移函数的回合制强化学习，但每个回合的动作集依赖于回合。性能通过累积遗憾衡量，即$\sum_{k=1}^K [V^{*,M^k} - V^{π^k,M^k}]$，其中$M^k$表示第$k$个回合的动作上下文。我们证明MVP算法可以自然扩展到此框架并享有强理论保证。特别是，我们建立了对抗性情境下的最小最大遗憾界$\widetilde{O}(\sqrt{SAH^3K\log L})$，其中$L$表示可能的上下文数量。此结果意味着在随机性情境下的遗憾界为$\widetilde{O}(\sqrt{SAH^3K})$。我们进一步将随机性遗憾保证转换为固定上下文分布的样本复杂度界$\widetilde{O}(SAH^3/ε^2)$。此外，我们推导了一个依赖间隙的遗憾界$\widetilde O\left( \inf_{p\in [0,1)} \left( \frac{1}{Δ_{\min}^{p}} + pKΔ_{\min}^{p} \right)\log K \cdot \mathrm{poly}(S,A,H) \right)$，其中$Δ_{\min}^{p}$是子最优$(h,s,a)$三元组的全局$p$-修剪正间隙底。此界在相关子最优间隙较大的情况下可以显著改进最小最大速率。

英文摘要

We study episodic reinforcement learning with fixed reward and transition functions, but with episode-dependent admissible action sets that are observed at the start of each episode. Performance is measured by cumulative regret against the episode-wise optimal value, $\sum_{k=1}^K [V^{*,M^k} - V^{π^k,M^k}]$, where $M^k$ represents the action context in the $k$-th episode. We show that the MVP algorithm naturally extends to this framework and enjoys strong theoretical guarantees. In particular, we establish a minimax regret bound of $\widetilde{O}(\sqrt{SAH^3K\log L})$ for adversarial contexts, where $L$ denotes the number of possible contexts. This result implies a regret bound of $\widetilde{O}(\sqrt{SAH^3K})$ for stochastic contexts. We further translate the stochastic regret guarantee into a sample complexity bound of $\widetilde{O}(SAH^3/ε^2)$ for a fixed context distribution. In addition, we derive a gap-dependent regret bound of \[ \widetilde O\left( \inf_{p\in [0,1)} \left( \frac{1}{Δ_{\min}^{p}} + pKΔ_{\min}^{p} \right)\log K \cdot \mathrm{poly}(S,A,H) \right), \] where $Δ_{\min}^{p}$ is the global $p$-trimmed positive-gap floor over suboptimal $(h,s,a)$ triples. This bound can substantially improve upon the minimax rate when the relevant suboptimality gaps are large.

URL PDF HTML ☆

赞 0 踩 0

2605.15688 2026-05-18 stat.ML cs.AI cs.LG math.PR

$α$-TCAV: A Unified Framework for Testing with Concept Activation Vectors

$α$-TCAV：基于概念激活向量的测试统一框架

Ekkehard Schnoor, Jawher Said, Malik Tiomoko, Wojciech Samek, Alexander Jung

AI总结本文提出$α$-TCAV框架，解决传统TCAV方法中因指示函数不连续导致的方差问题，通过参数化平滑函数统一概率表述，并提供参数调优指导，挑战现有实践惯例。

Comments 44 pages, 12 figures

详情

AI中文摘要

统计两轮搜索寻找一个优秀元素

Nagananda K G, Jong Sung Kim

AI总结本文研究了统计意义上的两轮搜索问题，旨在以最小的测试次数找到至少一个优秀元素，证明在稀疏泊松条件下，测试次数随人口规模对数增长。

Comments 17 pages

详情

AI中文摘要

本文研究了统计意义上的两轮搜索问题，旨在以最小的测试次数找到至少一个优秀元素。考虑一个包含n个元素的总体，其中每个元素独立地以概率λ/n成为优秀元素，λ>0。一个子集测试是无噪声的：当查询的子集包含至少一个优秀元素时，它会返回阳性。目标是在保证以至少1-α的概率找到一个优秀元素的前提下，最小化预期测试次数，其中0<α<1。与传统的群体测试不同，目标不是恢复所有优秀元素的集合，而是仅识别其中一个。我们首先证明成功本质上受到没有优秀元素的可能性的限制。在稀疏泊松条件下，这提出了必要的可行性条件α≥e^{-λ}。当目标成功概率是可行的，我们证明最优的预期测试次数随总体规模对数增长。上界通过结合初始存在测试和第二轮分离设计获得；下界则来自信息计数论证。数值示例展示了可行性边界和由此产生的对数尺度。

英文摘要

We formulate and study a statistical version of Katona's two-round search problem of finding at least one excellent element in a set. A population of $n$ elements is considered, where each element is independently excellent with probability $λ/n$, $λ> 0$. A subset test is noiseless: it returns positive exactly when the queried subset contains at least one excellent element. The goal is to minimize the expected number of tests subject to finding one excellent element with probability at least $1-α$, where $0<α<1$, under the restriction that testing is performed in two rounds. Unlike classical group testing, the objective is not to recover the full set of excellent elements, but only to identify one of them. We first show that success is fundamentally limited by the possibility that no excellent element exists. In the sparse Poisson regime, this imposes the necessary feasibility condition $α\ge e^{-λ}$. When the target success probability is feasible, we prove that the optimal expected number of tests grows logarithmically with the population size. The upper bound is obtained by combining an initial existence test with a second-round separating design; the lower bound follows from an information-counting argument. Numerical illustrations show the feasibility boundary and the resulting logarithmic scaling.

URL PDF HTML ☆

赞 0 踩 0

2605.15596 2026-05-18 stat.ME

Tail postcoloring in long-run variance estimation of time series

长程方差估计中的尾部后着色

Xu Liu, Kin Wai Chan

AI总结本文提出尾部后着色方法，通过参数模型将非参数估计中被忽视的尾部自协方差投影到最终估计器，实现非参数方差估计与参数着色模型的连接，提高稳健性和效率。

详情

AI中文摘要

预白化是处理强自相关性的常见方法。本文提出一种新的方法，称为尾部后着色，通过参数模型将非参数估计中被忽视的尾部自协方差投影到最终估计器。该方法通过缩放因子连接非参数方差估计器和参数着色模型，并通过带宽参数自动切换这两种方法，无需将整个数据集转换为残差。当着色模型正确指定时，可获得参数速率。在有限样本中，它比标准方法中的白化模型更稳健，且避免了标准方法中由于着色因子导致的严重方差膨胀或功率下降。本文展示了多种参数模型可构建多重稳健的尾部后着色估计器，并自然适用于多元时间序列。通过马尔可夫链蒙特卡罗输出分析的实证数据示例进行了说明。

英文摘要

Prewhitening is a common approach to deal with strong autocorrelation. In this article, we propose a new approach called tail postcoloring, motivated by it. It uses parametric models to project, or color back, the neglected tail autocovariances in nonparametric estimators onto the final estimator. This approach bridges the non-parametric variance estimator and the parametric coloring model through a scaling factor. It automatically switches between these two arms using a bandwidth parameter, without the need to transform the entire dataset into residuals, as in the standard prewhitening approach. When the coloring model is well-specified, a parametric rate can be achieved. In finite samples, it is also more robust to misspecification of the coloring model compared to the whitening model in the standard approach. Besides, it avoids severe potential variance inflation or power reduction caused by the recoloring factor in the standard approach. We show that multiple parametric models can be used to construct a multiply robust tail postcolored estimator. It also naturally works for multivariate time series. A real-data example in Markov chain Monte Carlo output analysis is provided.

URL PDF HTML ☆

赞 0 踩 0

2605.15571 2026-05-18 stat.ML cs.LG

MaxSketch: Robust Distinct Counting in Streams via Random Projections

MaxSketch：通过随机投影在数据流中实现鲁棒的唯一计数

Nikos Tsikouras, Constantine Caramanis, Christos Tzamos

AI总结本文提出MaxSketch，利用随机高斯投影在高维噪声数据流中实现鲁棒的唯一计数，证明在几何结构下可将内存需求降低至~O(log n / ε²)。

详情

AI中文摘要

估计数据流中不同元素的数量在重复元素相同的情况下已知。然而在现代设置中，观测是高维且噪声的，相同对象的重复实例仅近似相似——例如不同个体的图像在像素层面可能有显著差异。经典草图如HyperLogLog依赖一致的哈希值来处理相同元素，在这种情况下会失效。最近在一般度量空间中关于鲁棒唯一计数的研究实现了~Θ(√n)的内存需求，这是最坏情况下的最优。本文证明在学习表示中常见的几何结构下，可以实现显著改进的内存保证。我们介绍了MaxSketch，一种由随机高斯投影构建的简单max线性草图，并证明其能够估计潜在对象的数量。具体而言，我们证明在这一假设下，m = ~O(log n / ε²)的随机投影（因此~O(log n / ε²)的内存）足以在(1+ε)因子内恢复真实的唯一计数。在图像流上的实验证实MaxSketch能够准确估计唯一计数，并在训练范围外泛化。我们的结果将经典流算法与现代表示学习连接起来，展示了几何结构如何从根本上减少唯一计数的复杂性。

英文摘要

Estimating the number of distinct elements in a data stream is well understood when repeated elements are identical. In modern settings, however, observations are high-dimensional and noisy, so repeated instances of the same object are only approximately similar -- for example, different images of the same individual may vary significantly at the pixel level. Classical sketches such as HyperLogLog rely on consistent hash values for identical elements and break down in this regime. Recent work on robust distinct counting in general metric spaces achieves $\widetildeΘ(\sqrt{n})$ memory, which is tight in the worst case. We show that substantially improved memory guarantees are possible under geometric structure common in learned representations. We introduce MaxSketch, a simple max-linear sketch built from random Gaussian projections, and prove that it succeeds in estimating the number of distinct latent objects. Concretely, we show that under this assumption $m = \widetilde{O} (\log n / \varepsilon^2)$ random projections (and hence $\widetilde{O} (\log n/\varepsilon^2)$ memory) suffice to recover the true distinct count within a $(1+\varepsilon)$ factor. Experiments on image streams confirm that MaxSketch accurately estimates distinct counts and generalizes beyond the training regime. Our results bridge classical streaming algorithms and modern representation learning, showing how geometric structure can fundamentally reduce the complexity of distinct counting.

URL PDF HTML ☆

赞 0 踩 0

2605.15531 2026-05-18 math.ST math.CO stat.TH

Bounds on the Number of Modes of a Gaussian Mixture Density

高斯混合密度模态数的界限

Hien Duy Nguyen

AI总结研究高斯混合密度非退化临界点和模态数的上界及下界，提出直接Pfaffian界和增强界，并通过Morse理论改进有限模态上界，同时给出同方差情况下的改进界和下界。

详情

AI中文摘要

我们推导了高斯混合密度在R^d中k个分量的非退化临界点数量的显式上界，以及当模态集有限时的模态数上界和下界。通过将临界点方程归一化于参考分量，对于k≥2，得到直接Pfaffian界U_het(d,k)=2^{d+组合数(k-1,2)}(d+2min(d,k-1)+1)^{k-1}。对于相同参数范围，通过精确消元和代数倒数变量得到替代界U_aug(d,k)=2^{组合数(k-1,2)}(d+1)((2k-1)d+2k-1)^{k-1}。因此，对于k≥2，最佳临界点界是它们的最小值。Morse理论将对应的有限模态上界改进为floor(min{U_het(d,k),U_aug(d,k)}+1)/2。在同方差情况下，对于k≥2，直接界改进为U_hom(d,k)=2^{d+组合数(k-1,2)}(d+min(d,k-1)+1)^{k-1}，仿射秩减少将d替换为组件均值的仿射秩，而增强同方差减少给出无维度界U_aug,hom(k)=2^{组合数(k-1,2)+1}(2k)^{k-1}。在下界方面，对于d,k≥2，我们得到L_bin(d,k)=k+max_{2≤r≤min(d,k)}组合数(k,r)，并给出一个填充-产品族，特别说明线性下界d+k-1，以及一个种子-闭包原理，将产品和填充构造打包。我们进一步给出了临界集连通分支数的显式界。

英文摘要

We derive explicit upper bounds for the number of nondegenerate critical points of a $k$-component Gaussian mixture density in $\mathbb{R}^d$, and the number of modes when the modal set is finite, together with lower bounds. By normalizing the critical-point equations by a reference component, for $k\ge2$ we get the direct Pfaffian bound \[ U_{\mathrm{het}}(d,k)=2^{\,d+\binom{k-1}{2}}\left(d+2\min(d,k-1)+1\right)^{k-1}. \] For the same parameter range, an exact elimination augmented by an algebraic reciprocal variable gives the alternative bound \[ U_{\mathrm{aug}}(d,k)= 2^{\binom{k-1}{2}}(d+1)\left((2k-1)d+2k-1\right)^{k-1}. \] Thus, for $k\ge2$, the best critical-point bound is their minimum. A Morse-theoretic argument improves the corresponding finite-mode upper bound to \[ \left\lfloor \frac{\min\{U_{\mathrm{het}}(d,k),U_{\mathrm{aug}}(d,k)\}+1}{2}\right\rfloor. \] In the homoscedastic case, for $k\ge2$, the direct bound improves to \[ U_{\mathrm{hom}}(d,k)=2^{\,d+\binom{k-1}{2}}\left(d+\min(d,k-1)+1\right)^{k-1}, \] an affine-rank reduction replaces $d$ by the affine rank of the component means, and an augmented homoscedastic reduction gives the dimension-free bound \[ U_{\mathrm{aug,hom}}(k)=2^{\binom{k-1}{2}+1}(2k)^{k-1}. \] On the lower-bound side, for $d,k\ge 2$ we obtain \[ L_{\mathrm{bin}}(d,k)=k+\max_{2\le r\le \min(d,k)}\binom{k}{r}, \] together with a padding-product family that in particular implies the linear lower bound $d+k-1$, and a seed-closure principle that packages product and padding constructions. We further give explicit bounds for the number of connected components of the critical set.

URL PDF HTML ☆

赞 0 踩 0

2605.15524 2026-05-18 cs.LG cs.AI math.DG math.ST stat.TH

Neural Point-Forms

神经点形

Bruno Trentini, Jacob Hume, Vincenzo Antonio Isoldi, Philipp Misof, Ekaterina S. Ivshina, Kelly Maggs

AI总结本文提出神经点形（NPFs），通过扩散几何中的拉普拉斯技术，构建点云的可学习几何特征，用于比较微分形式，并在合成和生物相关实验中展示其在处理采样密度、流形结构和群体几何时的优势。

详情

AI中文摘要

点云学习通常基于观察样本是嵌入高维特征空间的底层几何对象的噪声轨迹的假设。然而，许多几何特性无法仅通过坐标、成对距离或学习的图邻域直接捕捉。在光滑情况下，微分形式用于编码高阶切线信息。本文引入了一种新的可学习几何特征家族，称为神经点形（NPFs）。在没有自然切线结构的情况下，我们使用来自扩散几何的拉普拉斯技术，通过内积构建点云的离散模型，以比较微分形式。在连续情况下，共享环境特征空间的子流形表示为比较矩阵，其条目描述了特征形式对偶切线信息的相互作用。我们通过证明在标准采样、带宽、密度和流形假设下比较矩阵的长期一致性，使这一直觉精确化。这产生了一个紧凑、高效且可交换的神经层，其输出是一个学习的形比较矩阵。在合成和生物相关实验中，我们展示了NPFs提供了一个竞争性且可解释的表示，当标签依赖于采样密度、流形结构或响应相关群体几何时，其优势最为明显。

英文摘要

Point cloud learning often rests on the premise that observed samples are noisy traces of an underlying geometric object, such as a manifold embedded in a high-dimensional feature space. Yet much of this geometry is not captured directly by coordinates, pairwise distances, or learned graph neighborhoods alone. In the smooth setting, differential forms are devices to encode higher order tangency information. In this work, we introduce a new family of principled learnable geometric features for point clouds called neural point-forms (NPFs). In the absence of a natural tangency structure, we instead use Laplacian-based techniques from Diffusion Geometry to build a discrete model for comparing differential forms on point clouds via inner products. In the continuum, submanifolds of a shared ambient feature space are represented as comparison matrices, whose entries describe how pairs of feature forms interact with extrinsic tangency information. We make this intuition precise by proving the long-run consistency of comparison matrices under standard sampling, bandwidth, density, and manifold-hypothesis assumptions. This yields a compact, efficient and permutation-invariant neural layer whose output is a learned form-comparison matrix. Across synthetic and biologically relevant experiments, we show that NPFs provide a competitive, and interpretable representation, with the strongest benefits appearing when labels depend on sampling density, manifold-like structure, or response-relevant population geometry.

URL PDF HTML ☆

赞 0 踩 0

2605.15516 2026-05-18 eess.SY cs.SY stat.AP

Co-Design Optimization for Data Center Cooling System via Digital Twin

通过数字孪生实现数据中心冷却系统协同优化

Shrenik Jadhav, Zheng Liu

AI总结本文提出三层优化框架，解决超算中心冷却单元分配与流体分配问题，通过模型简化评估所有可行方案，实现年度冷却能耗降低35.48%。

Comments 12 pages, 8 figures

详情

AI中文摘要

液冷超算系统通过多并行子回路冷却装置散热，但如何分配冷却单元（CDUs）及流体分配尚未系统解决。本文提出三层优化框架，联合确定CDUs在子回路中的整数划分、连续流体分配及每时间步总流量和供温的协同优化，满足子回路热安全约束。基于前沿超算的数据构建Modelica仿真模型，开发降阶替代模型，评估所有611种可行划分方案。比较三种渐进丰富操作策略，最终最优设计为双子回路系统，实现35.48%年度冷却能耗节省，仅比现有三子回路前沿设计高出0.18%。流体分配优化可补偿任何可行CDU到子回路分配，降低设计敏感性93%，提供低成本软件路径实现现有前沿硬件近最优性能。该框架可迁移至其他液冷高性能计算系统。

英文摘要

Liquid-cooled exascale supercomputers dissipate heat through cooling plants organized as multiple parallel subloops, but how to allocate coolant distribution units (CDUs) across subloops and how to distribute flow among them has not been systematically addressed for facilities at this scale. This paper presents a three-layer optimization framework that jointly determines the integer partition of CDUs across subloops, the continuous flow fraction allocation, and the per-timestep co-design optimization of total flow rate and supply temperature subject to per-subloop thermal safety constraints. The Modelica simulation model is built based on the data of Frontier exascale supercomputer at Oak Ridge National Laboratory. By developing a reduced-order surrogate model, all 611 feasible partitions of 25 CDUs are evaluated across the full year operational dataset of 49,353 timesteps. Three progressively richer operational strategies are compared, ranging from flow control optimization to full three-layer co-design optimization with dynamically adjusted flow fractions. The globally optimal design is a two-subloop plant achieving 35.48% annual cooling energy savings, only 0.18% above the current three-subloop Frontier design at 35.30%. Flow fraction optimization is shown to compensate for any feasible CDU-to-subloop assignment, reducing the design sensitivity by 93% and providing a low-cost software-only pathway to near-optimal performance on the existing Frontier hardware. The framework is transferable to other liquid-cooled high-performance computing plants.

URL PDF HTML ☆

赞 0 踩 0

2605.15488 2026-05-18 cs.LG stat.ML

SurvivalPFN: Amortizing Survival Prediction via In-Context Bayesian Inference

SurvivalPFN: 通过上下文贝叶斯推断实现生存预测的 amortization

Shi-ang Qi, Vahid Balazadeh, Michael Cooper, Russell Greiner, Rahul G. Krishnan

AI总结 SurvivalPFN 通过上下文学习实现生存预测的 amortization，利用预训练的网络在单次前向传递中处理右删失数据，避免了参数假设，产生校准的生存分布，在61个数据集上表现优异。

详情

AI中文摘要

生存分析提供了一个强大的统计框架，用于在删失存在的情况下建模时间到事件的结果。然而，从众多专门的生存方法中选择合适的估计器通常需要大量方法论和领域专业知识。我们引入了SurvivalPFN，这是一种先验-数据拟合网络，通过上下文学习实现对删失观测的贝叶斯推断的amortization。SurvivalPFN 在多样化的合成、可识别和右删失数据生成过程中进行预训练，使其能够在推理过程中单次前向传递中实现生存分析的amortization。结果，模型适应每个数据集的有效复杂性，而无需任务特定的训练或超参数调整，避免了限制性的参数假设，并产生校准的生存分布。在涵盖61个数据集、21种方法和5种评估指标的大型基准测试中，SurvivalPFN实现了强大的预测性能，并经常优于已建立的生存模型。这些结果表明，SurvivalPFN为生存分析提供了一个原理上和实用的基础模型，潜在应用领域包括医疗、金融和工程（https://github.com/rgklab/SurvivalPFN）

英文摘要

Survival analysis provides a powerful statistical framework for modeling time-to-event outcomes in the presence of censoring. However, selecting an appropriate estimator from the many specialized survival approaches often requires substantial methodological and domain expertise. We introduce SurvivalPFN, a prior-data fitted network that amortizes Bayesian inference for censored observations through in-context learning. SurvivalPFN is pretrained on a diverse family of synthetic, identifiable, and right-censored data-generating processes, enabling it to amortize survival analysis in a single forward pass during inference. As a result, the model adapts to the effective complexity of each dataset without task-specific training or hyperparameter tuning, avoids restrictive parametric assumptions, and produces calibrated survival distributions. In a large-scale benchmark spanning 61 datasets, 21 methods, and 5 evaluation metrics, SurvivalPFN achieves strong predictive performance and often improves upon established survival models. These results suggest that SurvivalPFN offers a principled and practical foundation model for survival analysis, with potential applications in high-impact domains such as healthcare, finance, and engineering (https://github.com/rgklab/SurvivalPFN).

URL PDF HTML ☆

赞 0 踩 0

2605.15483 2026-05-18 stat.ME stat.ML

Improving the Efficiency of Subgroup Analysis in Randomized Controlled Trials with TMLE

利用TMLE改进随机对照试验中的亚组分析效率

Sky Qiu, Nerissa Nance, Rachael Phillips, Jens Tarp, Maya Petersen, Mark van der Laan

AI总结本文提出TMLE-PR和A-TMLE方法，通过利用非亚组参与者信息提升亚组治疗效应估计的精度，避免外部数据偏倚，用于心血管试验中亚组风险降低估计。

详情

AI中文摘要

在随机对照试验中，亚组分析常因样本量不足而缺乏统计效力。本文通过利用非亚组参与者的信息来增强亚组估计。具体而言，我们研究了两种目标最大似然估计器（TMLE）：一种使用池化回归的TMLE（TMLE-PR）和一种自适应目标最大似然估计器（A-TMLE）。这两种估计器能够在不依赖外部真实世界数据的情况下共享信息，从而利用试验的关键优势：随机治疗带来的偏倚保护，以及数据收集的一致性和定义的一致性。本文提出的一般策略直接服务于关键监管机构（如FDA）的优先事项，通过在不引入外部偏倚的情况下提高亚组特定治疗效应估计的精度，从而促进严谨推断以支持公平的标签、可及性和市场后评估。在基于心血管结局试验（LEADER，NCT01179048）数据的案例研究中，我们使用所提出的估计器估计了利拉鲁肽治疗下黑人和亚洲亚组中主要不良心脏事件（MACE）的风险降低，这两个亚组各自占试验人口的不到10%。使用A-TMLE，我们发现亚洲参与者在365、540和730天时的估计绝对MACE风险降低分别为1.6、1.5和1.5个百分点，黑人参与者分别为2.1、2.0和2.1个百分点，95%置信区间在每个时间点均不包含零。

英文摘要

Subgroup analyses within randomized controlled trials are often underpowered due to limited sample sizes. We address this challenge by leveraging trial participants outside the subgroup of interest to augment estimation within the subgroup. Specifically, we study two Targeted Maximum Likelihood Estimators (TMLEs) that borrow information from non-subgroup participants within the same trial: a TMLE with pooled regression (TMLE-PR) and an Adaptive Targeted Maximum Likelihood Estimator (A-TMLE). Both estimators enable information sharing without relying on any external real-world data, thereby capitalizing on key strengths of the trial: most importantly, the protection against bias afforded by the randomized treatment, but also harmonized data collection, and consistent treatment and outcome definitions. The general strategy proposed here directly advances the priorities of key regulatory agencies, including the FDA, by improving the precision of subgroup-specific treatment effect estimates without introducing external sources of bias, thereby facilitating rigorous inference to support equitable labeling, access, and post-market evaluation. In a case study based on analysis of data from a cardiovascular outcome trial (LEADER, NCT01179048), we estimate the risk reduction of major adverse cardiac events (MACE) under liraglutide treatment among Black and Asian subgroups -- each comprising less than 10\% of the trial population -- using the proposed estimators that borrow information from the remainder of the trial. Using A-TMLE, in particular, we find estimated absolute MACE risk reductions of 1.6, 1.5, and 1.5 percentage points among Asian participants and 2.1, 2.0, and 2.1 percentage points among Black participants at 365, 540, and 730 days, respectively, with 95\% confidence intervals excluding the null at each time point.

URL PDF HTML ☆

赞 0 踩 0

2605.15469 2026-05-18 stat.ME

Tree-aggregated regression for compositional data with measurement errors

基于测量误差的树聚合回归

Zhenghan Li, Tianying Wang

AI总结本文提出TARCO方法，通过整合偏差校正估计量和树感知正半定稳定化，解决树聚合与测量误差交互导致的偏差和不稳定性问题，提升微生物组研究的估计精度和解释性。

详情

AI中文摘要

高维组成型协变量，通常源自计数数据，易受测量误差影响，常通过预设树聚合以提高可解释性。现有方法通常处理树引导的组成回归或误差修正，但未考虑其交互引起的层级污染。本文提出TARCO方法，整合偏差校正估计量与树感知正半定稳定化及稀疏正则化，通过交叉验证选择超参数。所得凸优化问题可通过可扩展算法求解。建立预测和估计误差的有限样本界，并在反映树异质性的条件下证明符号一致性。当测量误差协方差被一致估计器替代时，保证仍成立。多树深度模拟和微生物组应用显示，相比忽略树聚合与测量误差交互的方法，TARCO在估计精度、支持恢复和聚合层面解释性上表现更优。

英文摘要

High-dimensional compositional covariates, often derived from count data, are subject to measurement error and are frequently analyzed after aggregation along a prespecified tree to improve interpretability in applications such as microbiome studies. Existing approaches typically handle either tree-guided compositional regression or errors-in-variables correction, but they do not account for the hierarchical contamination induced by their interaction. We show that tree aggregation turns leaf-level measurement error into level-dependent, correlated contamination across aggregated nodes, which inflates bias, weakens concentration rates for corrected estimating quantities, and leads to unstable variable selection for naive approaches. We propose Tree-Aggregated Regression with Correction for Observation Error (TARCO), which integrates bias-corrected estimating quantities with a tree-aware positive semidefinite stabilization and sparse regularization, with tuning selected by cross-validation based on the corrected objective. The resulting convex program can be solved with scalable algorithms. We establish finite-sample bounds for prediction and estimation errors and prove sign consistency under conditions that explicitly reflect tree heterogeneity. The guarantees persist when the measurement-error covariance is replaced by a consistent estimator. Simulations across multiple tree depths and a microbiome application demonstrate improved estimation accuracy, support recovery, and aggregation-level interpretability compared with methods that ignore the interaction between tree aggregation and measurement error.

URL PDF HTML ☆

赞 0 踩 0

2605.15459 2026-05-18 cs.LG stat.ML

Don't Stop Me Yet: Sampling Loss Minima via Dissipative Riemannian Mechanics

别停止我：通过耗散黎曼流形力学采样损失极小值

Albert Kjøller Jacobsen, Leo Uhre Jakobsen, Johanna Marie Gegenfurtner, Georgios Arvanitidis

AI总结本文提出DiMS方法，通过耗散黎曼流形力学精确采样损失极小值，解决传统方法无法准确采样重参数化不变解的问题，并在贝叶斯推断中验证其有效性。

详情

AI中文摘要

现代神经网络损失函数的极小值通常不是孤立的，而是形成在训练数据上重参数化不变解的连通组件。分析这些解是一个难题，但采样方法是可行的。现有方法要么在低损失区域扩散，无法精确采样重参数化不变解，要么本质上是局部的，限制了对其他极小值盆地的探索。本文提出基于动能的动力系统，受重力和摩擦项驱动，以精确采样极小水平集。DiMS方法依赖物理动机的超参数，允许控制采样器的探索能力。我们以不确定性量化作为动机问题，在贝叶斯推断中观察到比之前方法更好的性能。

英文摘要

The minima of modern neural network loss functions are typically not isolated, rather they form connected components of reparameterization invariant solutions on the training data. Analytically characterizing these solutions is a hard problem, but sampling approaches are feasible. By construction, existing methods either spread over low-loss regions, and thus do not sample reparameterization invariant solutions exactly, or are inherently local, which limits exploration of other minima valleys. We propose sampling such reparameterization invariant models using a dynamical system based on kinetic energy, subject to a gravitational pull and a friction term that dissipates energy from the system. Our proposed sampler, DiMS, is guaranteed to sample exactly from the minimum level sets and depends on physically motivated hyperparameters which allows control over the exploration capabilities of the sampler. We consider uncertainty quantification in Bayesian inference as the motivating problem and observe improved performance compared to previously proposed approaches.

URL PDF HTML ☆

赞 0 踩 0

2605.15428 2026-05-18 stat.ME

Modeling Misclassification in Spousal Violence Reporting: Evidence from Bayesian Quantile Regression

夫妻暴力报告中的误分类建模：来自贝叶斯分位数回归的证据

Joon Jin Song, Mohammad Arshad Rahman, Yoo-Mi Chin, James Stamey

AI总结本文提出一种贝叶斯分位数回归框架，用于处理二元误分类数据，通过引入潜在真实响应和显式建模假阴性和假阳性报告误差，改进了对敏感二元结果的推断。

详情

AI中文摘要

分位数回归扩展了回归分析，超越条件均值，提供更丰富的协变量效果特征。然而，对于敏感的二元结果，由于漏报导致的误分类可能显著偏误推断。本文提出一种贝叶斯分位数回归框架，用于误分类的二元结果，引入潜在真实响应并显式建模假阴性和假阳性报告误差。估计通过一种新的马尔可夫链蒙特卡罗（MCMC）算法进行。在不同先验规格和误分类率下的模拟研究显示，该方法在忽略误分类的模型上表现更优。本文将该方法应用于自我报告的夫妻暴力数据，研究就业状况和家庭财富与关联，同时调整社会人口因素。结果表明，各分位数中漏报超过过报，并且考虑误分类可以改变实质性结论。

英文摘要

Quantile regression extends regression analysis beyond the conditional mean, providing a richer characterization of covariate effects across the outcome distribution. For sensitive binary outcomes, however, misclassification due to underreporting can substantially bias inference. We propose a Bayesian quantile regression framework for misclassified binary outcomes that introduces a latent true response and explicitly models false negative and false positive reporting errors. Estimation is performed through a novel Markov chain Monte Carlo (MCMC) algorithm. Simulation studies under varying prior specifications and misclassification rates demonstrate improved performance over models that ignore misclassification. We apply the method to self-reported spousal violence data, examining associations with employment status and household wealth while adjusting for socio-demographic factors. The results indicate that underreporting exceeds overreporting across quantiles and that accounting for misclassification can change substantive conclusions.

URL PDF HTML ☆

赞 0 踩 0

2605.15411 2026-05-18 stat.ML cs.LG math.OC

Harnessing Unimodality in Semiparametric Contextual Pricing via Oracle Price Map Learning

通过Oracle价格图学习利用单峰性在半参数上下文定价中

Yingying Fan, Yuxuan Han, Jinchi Lv, Xiaocong Xu, Zhengyuan Zhou

AI总结本文研究了半参数标量指数估值模型中的上下文动态定价，通过Oracle价格图学习方法，利用β-Hölder光滑性和收益几何条件，提出了一种模块化粗到细策略，实现非参数Oracle图学习的最优 regret 界。

详情

AI中文摘要

我们研究了半参数标量指数估值模型中的上下文动态定价，其中潜在价值为 $v_t=μ_\ast(\mathsf c_t)+ξ_t$，其中未知效用图 $μ_\ast$ 和未知加性噪声分布。关键决策对象是通过标量指数 $u=μ_\ast(\mathsf c)$ 和噪声尾部诱导的一维Oracle价格图 $u\mapsto p^\ast(u)$。在 $β$-Hölder光滑性（$β\geq 2$）和收益几何条件（提供唯一、稳定的内部最大化器）下，该Oracle图本身为 $(β-1)$-光滑。我们通过 $\mathsf{ORBIT}$，一种模块化粗到细策略，利用标量试点指数作为输入，在每个活跃区间内局部化基准价格，并通过多臂凸优化学习Oracle图的局部多项式近似。对于基线线性效用模型 $μ_\ast(\mathsf c)=\mathsf c^\topθ_\ast$，自适应椭圆探索方案在不假设上下文分布的情况下构建所需的标量试点在线。所得到的策略达到 regret $\widetilde{O}\big(T^{\frac{2β-1}{4β-3}}+\sqrt{dT}\big)$。对于固定 $d$，我们建立了在时间范围依赖上的匹配下界，揭示了非参数Oracle图学习项的最小最大尖锐性。相同的标量试点接口还扩展到稀疏高维线性效用和非参数Hölder效用。

英文摘要

We study contextual dynamic pricing in a semiparametric scalar-index valuation model where the latent value is $v_t=μ_\ast(\mathsf c_t)+ξ_t$, with an unknown utility map $μ_\ast$ and an unknown additive noise distribution. The key decision object is the one-dimensional oracle price map $u\mapsto p^\ast(u)$ induced by the scalar index $u=μ_\ast(\mathsf c)$ and the noise tail. Under the $β$-Hölder smoothness of the tail function for $β\geq 2$ and a revenue-geometry condition that gives a unique, stable, interior maximizer, this oracle map is itself $(β-1)$-smooth. We exploit such structure through $\mathsf{ORBIT}$, a modular coarse-to-fine policy that takes a scalar pilot index as input, localizes a benchmark price in each active bin, and learns a local polynomial approximation of the oracle map inside a trust region via bandit convex optimization. For the baseline linear utility model $μ_\ast(\mathsf c)=\mathsf c^\topθ_\ast$, an adaptive elliptical exploration scheme constructs the required scalar pilot online without distributional assumptions on the contexts. The resulting policy achieves regret $\widetilde{O}\big(T^{\frac{2β-1}{4β-3}}+\sqrt{dT}\big)$. For fixed $d$, we establish a matching lower bound in the horizon dependence, unveiling that the nonparametric oracle-map learning term is minimax sharp. The same scalar-pilot interface also yields extensions to sparse high-dimensional linear utility and nonparametric Hölder utility.

URL PDF HTML ☆

赞 0 踩 0

2605.15405 2026-05-18 econ.GN q-fin.EC stat.ME

Estimating Social Norm Complementarities

估计社会规范的互补性

Eliana La Ferrara, Cheaheon Lim, Davide Viviano

AI总结本文通过实证研究探讨社会规范在技术和社会维度的互补性，发现女性割礼和童婚在塞拉利昂存在互补性，而多妻制与童婚在尼日利亚存在替代性，为政策制定提供依据。

详情

AI中文摘要

Diego Martinez-Taboada, Aaditya Ramdas

AI总结本文研究了有界自伴算子和的经验型伯内特和伯恩斯坦不等式，旨在解决传统算子值浓度不等式在实际应用中对先验方差依赖过强、且依赖环境维度的问题。作者提出了一种完全基于数据驱动的方法，用经验方差替代未知方差，并基于算子的内在维度而非环境维度进行分析，从而获得了更精确且适用于无限维空间的浓度界。该方法在非各向同性随机矩阵中表现更优，并且在理论保证上达到了已知最优的渐近精度。

2605.15240 2026-05-18 stat.ML cs.LG

On Kernel Eigen-alignments of KRR: Reconstruction and Generalization

Yang Liu, Ernest Fokoue, Richard Lange, Daniel Krutz

AI总结本文研究了核矩阵与学习目标之间的特征对齐在实现鲁棒泛化中的关键作用，建立了核方法泛化性能与矩阵特征向量和特征值估计之间的直接联系。通过分析核矩阵扰动对预测结果的影响，作者推导出基于特征值和特征向量估计稳定性的泛化误差上界，并指出在高秩核条件下，重建误差对泛化能力的预测作用有限。研究从特征值估计的角度提出了新的泛化界，表明强泛化能力需要增强特征向量对齐、增大特征值幅度或增大相邻特征值之间的间隔。

2605.15234 2026-05-18 math.NA cs.NA math.SP math.ST stat.CO stat.TH

Sampling pseudospectrum for data-driven matrices

Caroline Wormell

AI总结许多复杂系统可以通过对捕捉其动态的矩阵进行谱分解来简化为关键组件，这些矩阵通常通过最小二乘拟合等方法从数据中构建。然而，现有方法难以区分所得离散特征值是数据有限性引起的误差还是系统真实特征。本文提出了一种采样伪谱 $P(λ)$，用于在复平面上提供有限数据特征值行为的概率信息，并给出了其估计量 $\hat P(λ)$，可通过重新处理数据得到。该估计量计算高效，能够统计检验真实特征值的位置，从而严格且普遍地判断从有限数据中提取的模式是信号还是噪声。

Comments 30 pages. Comments welcome

2605.05179 2026-05-18 cs.LG cond-mat.dis-nn stat.ML

Estimating the expected output of wide random MLPs more efficiently than sampling

Wilson Wu, Victor Lecomte, Michael Winer, George Robinson, Jacob Hilton, Paul Christiano

AI总结本文提出了一种比采样更高效的方法，用于估计初始化后的宽随机多层感知机（MLP）在高斯输入下的期望输出。该方法通过构建每一层激活值的近似分布，利用累积量和Hermite展开等工具，避免了传统采样方式中逐个输入计算的耗时过程。实验表明，该方法在保证均方误差的前提下，显著减少了计算量，尤其在估计小概率事件和模型训练中表现出色，为降低模型尾部风险提供了新思路。

Comments 68 pages. Code is available at https://github.com/alignment-research-center/mlp_cumulant_propagation

2604.15598 2026-05-18 nlin.CG q-bio.QM stat.AP

When do trajectories matter? Identifiability analysis for stochastic transport phenomena

Matthew J Simpson, Michael J Plank

AI总结该研究探讨了在随机扩散模型中，轨迹数据对模型参数可识别性的影响。通过结合基于代理的模拟、偏微分方程近似、似然估计与可识别性分析等方法，研究发现仅使用计数数据可能导致结构不可识别问题，而引入个体轨迹数据可有效改善参数估计的准确性。研究还分析了不同实验设计对参数可识别性的影响，并提供了开源代码供进一步使用。

Comments 7 Figures

2604.13137 2026-05-18 stat.CO math.NT math.ST stat.TH

$p$-adic Linear Regression for Random Sampling with Digitwise Noise

Tomoki Mihara

AI总结本文提出了一种新的$p$-adic线性回归概率算法，用于处理带有逐位噪声的随机采样问题。该方法包含一种新的模$p$线性回归概率算法，能够在噪声环境下更准确地估计回归参数。研究的主要贡献在于将$p$-adic分析引入统计回归问题，为处理高噪声数据提供了新的理论工具和计算方法。

2602.16274 2026-05-18 cs.LG stat.ML

Regret and Sample Complexity of Online Q-Learning via Concentration of Stochastic Approximation with Time-Inhomogeneous Markov Chains

Rahul Singh, Siddharth Chandak, Eric Moulines, Vivek S. Borkar, Nicholas Bambos

AI总结本文首次为无限时间折扣马尔可夫决策过程中的经典在线Q学习提供了悔恨界，无需依赖乐观或奖励项。研究分析了衰减温度的玻尔兹曼Q学习，并提出了一种结合ε_n-贪心与玻尔兹曼探索的平滑探索策略，证明其悔恨界对子优化间隙具有鲁棒性，达到近似O(N^{9/10})的上界。同时，作者还给出了高概率下的样本复杂度保证，并发展了一种适用于合缩马尔可夫随机逼近的高概率集中界，该结果具有独立研究价值。

2602.14342 2026-05-18 math.ST cs.DS cs.LG math.PR stat.TH

High-accuracy log-concave sampling with stochastic queries

Fan Chen, Sinho Chewi, Constantinos Daskalakis, Alexander Rakhlin

AI总结本文研究了在对数凹函数采样中如何实现高精度的采样保证，提出使用具有亚指数尾部的随机梯度可以达到迭代和查询复杂度与 $\mathrm{poly}\log(1/δ)$ 相关的高精度采样。这与凸优化问题形成对比，后者在梯度存在随机性时需要 $\mathrm{poly}(1/δ)$ 的查询次数。研究还从信息论角度论证了轻尾随机梯度对于实现高精度采样的必要性，并给出了针对零阶随机查询和有限和势函数采样的改进复杂度结果。

2601.23030 2026-05-18 stat.ML cs.LG stat.ME

Neural Backward Filtering Forward Guiding

Gefan Yang, Frank van der Meulen, Stefan Sommer

AI总结本文提出了一种名为“神经反向滤波正向引导”（NBFFG）的统一框架，用于解决树状非线性连续随机过程中的推断问题，尤其适用于观测稀疏且拓扑结构复杂的情形。该方法通过构造一个近似的线性高斯过程，得到闭式反向滤波器以引导生成路径向高似然区域移动，并利用神经网络残差捕捉非线性偏差，从而实现无偏的路径子采样，显著降低训练复杂度。实验表明，NBFFG在合成数据集和高维系统发育分析任务中均优于现有方法。

2601.21294 2026-05-18 cs.LG stat.ML

Missing-Data-Induced Phase Transitions in Spectral PLS for Multimodal Learning

Anders Gjølbye, Ida Kargaard, Emma Kargaard, Lina Skerath, Lars Kai Hansen

AI总结本文研究了在多模态学习中，缺失数据对谱偏最小二乘（PLS）方法性能的影响。通过在高维尖峰模型下分析独立缺失的完全随机掩码对交叉协方差矩阵的影响，发现缺失数据会削弱信号强度，并导致类似BBP类型的相变现象：当信号与噪声比低于临界阈值时，主奇异向量无法有效捕捉潜在共享结构；高于该阈值时则能实现非平凡对齐。研究还提出了有限秩扩展的猜想，并通过仿真和半合成实验验证了理论预测的相图和恢复曲线。

Comments Preprint

2512.18250 2026-05-18 stat.ME

NMF-FFB: Non-negative matrix factorization with feedforward-feedback structure

Kenichi Satoh

AI总结本文提出了一种具有前馈-反馈结构的非负矩阵分解方法（NMF-FFB），用于处理非负数据中的内生变量关系。该方法在传统NMF基础上引入了内生变量之间的潜在反馈机制，通过同时方程建模实现内生与外生变量路径的分离。NMF-FFB适用于小样本、非负加法数据场景，能够自动发现潜在因子并区分直接与累积反馈效应，在多个实际数据集上展示了良好的解释性与应用效果。

2512.09673 2026-05-18 cs.LG cs.AI cs.NE stat.ML

Drawback of Enforcing Equivariance and its Compensation via the Lens of Expressive Power

Yuzhu Chen, Tian Qin, Xinmei Tian, Fengxiang He, Dacheng Tao

AI总结本文研究了强制等变性对神经网络表达能力的影响，发现这种约束可能削弱模型的表达能力。通过分析边界超平面和通道向量，作者构造性地证明了这一问题，并指出可通过扩大模型规模来补偿这一缺陷，同时证明了所需扩大的上界。令人意外的是，扩大的网络结构反而降低了假设空间的维度，可能带来更好的泛化能力。

2512.00242 2026-05-18 cs.LG cs.AI cs.ET stat.ML

Polynomial Neural Sheaf Diffusion: A Spectral Filtering Approach on Cellular Sheaves

Alessio Borgi, Fabrizio Silvestri, Pietro Liò

AI总结本文提出了一种名为多项式神经束扩散（PolyNSD）的新方法，用于改进神经束网络在图结构上的扩散过程。该方法通过在归一化束拉普拉斯矩阵上应用K次多项式传播算子，实现了与束维数无关的K跳感受野，并通过凸混合的正交多项式基响应进行可训练的谱响应建模。相比传统方法，PolyNSD在保持模型稳定性的同时，降低了计算和内存需求，并在同质和异质图基准测试中取得了新的最先进结果。

2511.17426 2026-05-18 cs.LG cs.CV stat.ML

Self-Supervised Learning by Curvature Alignment

Benyamin Ghojogh, M. Hadi Sepanj, Paul Fieguth

AI总结本文提出了一种基于曲率对齐的自监督学习方法CurvSSL及其核空间扩展kernel CurvSSL，旨在通过显式建模数据流形的局部几何结构来提升表征学习效果。该方法在传统非对比学习框架中引入曲率正则化项，通过计算嵌入特征的局部曲率并对其在不同数据增强视图间进行对齐和去相关，从而增强表示的不变性和几何一致性。实验表明，该方法在MNIST和CIFAR-10数据集上取得了优于现有方法的线性评估性能。

Comments A shorter version of this paper has been published in: Journal of Computational Vision and Imaging Systems, Vol. 11, No. 1, Special Issue: Proceedings of CVIS 2025

2511.03606 2026-05-18 stat.ML cs.LG math.ST stat.TH

Vector-valued self-normalized concentration inequalities beyond sub-Gaussianity

Diego Martinez-Taboada, Tomas Gonzalez, Aaditya Ramdas

AI总结本文研究了超越次高斯分布的向量值自归一化过程的集中不等式，填补了该领域在非次高斯条件下的理论空白。作者提出了适用于轻尾分布（如贝内特或伯努利分布）的集中界，扩展了传统自归一化分析的适用范围。研究成果在在线线性回归及核化线性强盗算法中具有重要应用价值。

2510.20741 2026-05-18 stat.ME stat.AP

A comparison of methods for designing hybrid type 2 cluster-randomized trials with continuous effectiveness and implementation endpoints

Melody Owen, Fan Li, Ruyi Liu, Donna Spiegelman

AI总结本文比较了五种用于设计具有连续有效性及实施终点的混合型II类集群随机试验的方法，旨在为研究者提供统计功效分析的实用指导。研究通过理论分析和大规模数值模拟，揭示了不同方法在不同情境下的功效差异，发现当处理效应不同时，离散型两自由度检验具有优势，而处理效应相同时，单自由度检验更为有效。文章还介绍了用于计算试验功效和样本量的R包 crt2power，为该类试验的设计提供了重要工具。

2509.01685 2026-05-18 stat.ML cs.LG math.OC stat.CO

Preconditioned Regularized Wasserstein Proximal Sampling

Hong Ye Tan, Stanley Osher, Wuchen Li

AI总结本文研究如何通过有限粒子的演化从吉布斯分布中进行采样，提出了一种预条件正则化Wasserstein近端采样方法。该方法通过正则化Wasserstein近端算子的数值可计算得分函数来近似得分函数，并基于各向异性热方程的Cole-Hopf变换推导出其核形式。实验表明，该方法在多种对数凹和非对数凹分布以及贝叶斯图像去卷积和神经网络训练任务中表现出加速和稳定性优势。

2506.18673 2026-05-18 math.PR math-ph math.MP math.ST stat.TH

Asymptotic Expansions of Gaussian and Laguerre Ensembles at the Soft Edge III: Generating Functions

Folkmar Bornemann

AI总结本文研究了高维高斯和拉盖尔系综在软边缘处的渐近展开，重点分析了间隙概率生成函数的结构。作者证明了渐近展开中的修正项是主导项高阶导数的多线性形式，并具有与生成函数变量无关的有理多项式系数。这一结构同样适用于由线性诱导得到的量，如第 $k$ 大水平的分布。对于正交和辛系综，研究基于某些假设，并通过数值模拟验证了假设的合理性。

2506.12532 2026-05-18 stat.ME

Bayesian inference for the learning rate in Generalised Bayesian inference

Jeong Eun Lee, Sitong Liu, Geoff K. Nicholls

AI总结本文研究了广义贝叶斯推断（GBI）中学习率和损失函数超参数的估计问题。作者提出了一种基于留出数据的贝叶斯方法，用于推断这些超参数的后验分布，并定义了两种不同的超参数后验形式，分别基于ELPPD效用和伪真参数覆盖。该方法支持对多个超参数进行联合估计与不确定性量化，实验表明其在模拟数据和实际文本分析任务中均优于传统贝叶斯方法，尤其适用于多数据集融合场景。

Comments 33 pages, 7 figures, 1 Table with 32 pages of appendices including 18 further figures and 4 further tables

2506.00182 2026-05-18 stat.ML cs.IT cs.LG math.IT math.ST stat.TH

Overfitting has a limitation: a model-independent generalization gap bound based on Rényi entropy

Atsushi Suzuki, Jing Wang

AI总结本文研究了机器学习模型泛化能力的限制，提出了一个与模型无关的泛化间隙上界，该上界仅依赖于数据生成分布的Rényi熵。研究指出，即使模型规模无限增大，只要数据量相对于Rényi熵足够，仍可保持较小的泛化间隙。该框架不仅解释了数据中注入噪声导致性能下降的现象，还拓展了无免费午餐定理，强调了数据分布熵在成功学习中的关键作用。

2503.16589 2026-05-18 cs.LG cs.ET math.ST stat.TH

A Statistical Analysis for Per-Instance Evaluation of Stochastic Optimizers: Avoiding Unreliable Conclusions

Moslem Noori, Elisabetta Valiante, Thomas Van Vaerenbergh, Masoud Mohseni, Ignacio Rozada

AI总结本文针对随机优化器的性能评估问题，提出了一种统计分析方法，以避免因实验设计不当导致的不可靠结论。研究分析了常用性能指标的置信区间及其与实验重复次数的关系，并推导出保证指标精度所需的最小重复次数下界。基于此，作者提出了一种自适应调整重复次数的算法，以提高评估的准确性和可靠性。实验结果验证了该方法在基准测试和超参数调优中的有效性。

2503.00326 2026-05-18 stat.ME stat.ML

A Bayesian Additive Regression Tree Model for Learning Conditional Average Treatment Effects in Regression Discontinuity Designs

Rafael Alcantara, P. Richard Hahn, Hedibert F. Lopes

AI总结本文提出了一种高效的贝叶斯方法，用于回归不连续设计（RDD）中的条件平均处理效应（CATE）估计。该方法基于贝叶斯加性回归树（BART）模型，通过在叶节点引入对运行变量和处理虚拟变量的线性回归，实现了对处理效应的可解释估计。该模型能够自适应地划分协变量空间，识别运行变量斜率显著变化的区域，避免了传统方法对基函数展开的严格假设，提升了模型的灵活性和适用性。

2502.12187 2026-05-18 cs.CL cs.FL cs.LG math.ST stat.ML stat.TH

Hallucinations are inevitable but can be made statistically negligible

Atsushi Suzuki, Yulan He, Feng Tian, Zhongyuan Wang

AI总结本文探讨了语言模型中不可避免的“幻觉”现象，即模型生成非事实内容的问题。尽管已有研究从可计算性理论角度证明，任何语言模型在无限输入集上都会产生幻觉，但本文从概率论角度提出，只要训练数据的质量和数量足够，幻觉在统计意义上可以被显著降低。研究指出，虽然可计算性理论结果具有理论意义，但概率理论结果更符合实际应用需求，为缓解幻觉问题提供了新的理论依据。

2407.08094 2026-05-18 stat.ML cs.LG physics.chem-ph physics.data-an

Density Estimation via Binless Multidimensional Integration

Matteo Carli, Alex Rodriguez, Alessandro Laio, Aldo Glielmo

AI总结本文提出了一种名为无箱多维热力学积分（BMTI）的非参数密度估计方法，用于高效、稳健地估计高维数据的密度。该方法通过计算相邻数据点之间的对数密度差异，并结合最大似然框架对其进行加权积分，从而估计密度的对数。BMTI无需对数据进行分箱或空间划分，而是基于自适应带宽选择构建邻域图，利用流形假设在数据的内在流形上进行估计，有效克服了传统非参数密度估计方法的局限性，并在高维空间中表现出优越的性能。

2404.04775 2026-05-18 stat.ME

Bipartite causal inference with interference, time series data, and a random network

Zhaoyan Song, Georgia Papadogeorgou

AI总结本文研究了在存在干扰、时间序列数据和随机网络结构下的二分图因果推断问题，旨在估计干预单元对结果单元的即时和持续影响效应。作者在暴露映射框架下定义了这些因果效应，并基于干预单元的处理分配和随机网络的无混淆假设，建立了结果单元暴露的无混淆性。研究提出了适用于二元、连续和多元暴露映射的因果效应估计方法，并在二元暴露情形下设计了结合匹配与协变量平衡的算法，证明了估计偏差的有界性。实证研究表明，野火烟雾对旧金山自行车通勤存在即时负面影响。

2404.03099 2026-05-18 cs.LG cs.AI cs.CE cs.IT math.IT stat.ML

Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks

Leonardo Ferreira Guilhoto, Paris Perdikaris

AI总结本文提出了一种名为NEON的神经网络架构，用于在无限维函数空间中进行带有不确定性的预测，其参数数量远少于性能相当的深度集成方法。研究聚焦于复合贝叶斯优化问题，即优化由未知函数映射和已知函数组成的复合函数，并通过实验表明NEON在多个场景下取得了领先的优化效果，同时显著降低了模型复杂度。

2311.03658 2026-05-18 cs.CL cs.AI cs.LG stat.ML

The Linear Representation Hypothesis and the Geometry of Large Language Models

Kiho Park, Yo Joong Choe, Victor Veitch

AI总结本文探讨了“线性表示假设”，即高层概念在表示空间中以线性方向形式表示的问题，提出了“线性表示”的两种形式化定义，并分别对应输出（词）空间和输入（句子）空间。通过引入因果内积，作者建立了一个非欧几里得的内积结构，能够统一各种线性表示的概念，并用于构建探针和引导向量。实验表明，大型语言模型中确实存在概念的线性表示，且内积的选择对解释与控制模型具有基础性作用。

Comments Accepted for a presentation at ICML 2024 and an oral presentation at NeurIPS 2023 Workshop on Causal Representation Learning. Code is available at https://github.com/KihoPark/linear_rep_geometry

2306.15199 2026-05-18 stat.ME

Rank-Transformed Dissimilarity Profiles for High-Dimensional Classification

Xiangbo Mo, Hao Chen

AI总结在小样本高维分类任务中，由于样本量有限且类别间信号变化复杂，分类仍面临挑战。本文提出了一种基于差异性分析的分类框架，通过构建每个样本相对于各类别的差异性分布，将其转化为低维表示，从而捕捉类内和类间系统性的差异模式。该方法进一步引入秩变换，提升对异常值的鲁棒性，并在多种高维数据集上表现出优于或接近现有分类器的性能。

2212.05524 2026-05-18 stat.ME stat.AP

Bayesian inference for partial orders from random linear extensions: power relations from 12th Century Royal Acta

Geoff K. Nicholls, Jeong Eun Lee, Nicholas Karn, David Johnson, Rukuang Huang, Alexis Muir-Watt

AI总结本文研究了12世纪英格兰、威尔士和诺曼底皇家法令中主教名单的顺序变化，以揭示社会地位和权力的变化。研究将社会秩序建模为一个随时间演化的偏序集（poset），并构建了一个隐马尔可夫模型，其中隐藏状态为演化中的偏序集，观测数据为符合该偏序集的随机全序列表。该方法能够处理噪声并考虑主教在等级中的位置变化，通过模型拟合发现了社会地位随时间演变的证据，并在法院政治背景下对结果进行了解释。

Comments 64 pages, 37 figures and 3 tables including appendices

详情

DOI: 10.1214/24-AOAS2002
Journal ref: Annals of Applied Statistics, 19(2), 1663-1690, (June 2025)

英文摘要

In the eleventh and twelfth centuries in England, Wales and Normandy, Royal Acta were legal documents in which witnesses were listed in order of social status. Any bishops present were listed as a group. For our purposes, each witness-list is an ordered permutation of bishop names with a known date or date-range. Changes over time in the order bishops are listed may reflect changes in their authority. Historians would like to detect and quantify these changes. There is no reason to assume that the underlying social order which constrains bishop-order within lists is a complete order. We therefore model the evolving social order as an evolving partial ordered set or {\it poset}. We construct a Hidden Markov Model for these data. The hidden state is an evolving poset (the evolving social hierarchy) and the emitted data are random total orders (dated lists) respecting the poset present at the time the order was observed. This generalises existing models for rank-order data such as Mallows and Plackett-Luce. We account for noise via a random ``queue-jumping'' process. Our latent-variable prior for the random process of posets is marginally consistent. A parameter controls poset depth and actor-covariates inform the position of actors in the hierarchy. We fit the model, estimate posets and find evidence for changes in status over time. We interpret our results in terms of court politics. Simpler models, based on Bucket Orders and vertex-series-parallel orders, are rejected. We compare our results with a time-series extension of the Plackett-Luce model. Our software is publicly available.

URL PDF HTML ☆

赞 0 踩 0

2003.06804 2026-05-18 stat.ME math.ST stat.ML stat.TH

Semi-Modular Inference: enhanced learning in multi-modular models by tempering the influence of components

Chris U. Carmona, Geoff K. Nicholls

AI总结本文提出了一种半模块化推断（Semi-Modular Inference, SMI）方法，旨在提升多模块模型中的学习效果。该方法通过引入一个影响参数，灵活调节模块间的推理影响，既包含贝叶斯推断和Cut模型作为特例，又实现了信息流的可调和定向控制。研究还提供了一种元学习准则用于选择最佳推断方案，并在多个测试案例和考古数据集上验证了方法的有效性。

Comments for associated R package to reproduce results, see https://github.com/christianu7/aistats2020smi

1904.04185 2026-05-18 stat.ME

Multiple imputation in data that grow over time: A comparison of three strategies

X. M. Kavelaars, S. van Buuren, J. R. van Ginkel

AI总结该研究比较了三种处理随时间增长的纵向数据中缺失值的多重插补策略。核心方法包括重新插补、嵌套插补和附加插补，研究通过模拟分析发现，所有方法在单调缺失模式下均能提供有效推断，而非单调缺失模式下则可能产生偏差。研究指出，时间点内的相关性需强于时间点间相关性，才能保证推断有效性，并认为附加插补在存在退出缺失的纵向数据中尤为有益。

Comments 15 pages, 5 tables, 1 figure

1702.00971 2026-05-18 stat.ME

Multiple imputation for multilevel data with continuous and binary variables

Vincent Audigier, Ian R. White, Shahab Jolani, Thomas P. A. Debray, Matteo Quartagno, James Carpenter, Stef van Buuren, Matthieu Resche-Rigon

AI总结本文研究了针对包含连续变量和二元变量的多层数据的多重插补方法，重点比较了不同方法在系统性缺失和随机缺失情况下的表现。通过理论分析和基于真实数据集的模拟研究，发现异方差插补方法在多数情况下比同方差方法更准确，且有效推断需要数据包含大量聚类单元。研究还指出不同方法适用于不同类型的聚类规模和变量类型，为多层数据缺失值处理提供了重要参考。