arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2606.20480 2026-06-19 math.ST stat.ML stat.TH 新提交

Leveraging tails for adaptation

利用尾部进行自适应

Sergios Agapiou, Ismaël Castillo, Paul Egels

AI总结研究非参数贝叶斯中基于p-指数尾先验的后验收缩率，发现p越小收缩越快，且p→0时可实现光滑性自适应，应用于白噪声回归和ReLU神经网络。

Comments 59 pages, 3 figures

2606.20427 2026-06-19 math.ST stat.ME stat.TH 新提交

Private Rate-Double-Robust Inference

私有率双稳健推断

Máté Kormos, Aad van der Vaart

AI总结本文通过局部隐私机制注入噪声保护个体隐私，同时利用率双稳健性实现目标参数的无偏和半参数有效推断，并开发了私有化非参数和参数 nuisance 估计方法。

详情

AI中文摘要

我们协调了隐私保护和率双稳健推断。个体隐私通过局部隐私机制得到保护：向敏感数据注入噪声，仅揭示用于推断的噪声数据。因此，隐私保护阻碍了推断。相比之下，当目标参数的估计量的大样本偏差由另外两个 nuisance 参数的估计误差之间的权衡表征时，该参数的推断是率双稳健的。因此，率双稳健性促进了推断。我们协调的起点是一类由无限维线性索引和低维非线性回归索引的率双稳健目标参数。这包括因果参数等。为了私有地推断这些目标，我们展示了合适的隐私机制如何将敏感数据模型的半参数性质转移到私有设置中。率双稳健性被转移，从而实现了对目标参数的局部私有、无偏和半参数有效推断。最后，我们将一般的非参数 nuisance 估计量转化为私有估计量，这些估计量继承了其非私有对应物的收敛性质。对于参数 nuisance 模型，我们开发了一种私有矩估计方法及其大样本推断理论。

英文摘要

We reconcile privacy protection and rate-double-robust inference. The privacy of individuals is protected by a local privacy mechanism: injecting noise into their sensitive data, revealing only the noisy data for inference. Hence, privacy protection hinders inference. In contrast, the inference of a target parameter is rate-double-robust when the large-sample bias of an estimator of the parameter is characterised by a trade-off between the estimation errors of two other, nuisance, parameters. Hence, rate-double-robustness facilitates inference. Our starting point of reconciliation is a class of rate-double-robust target parameters indexed linearly by an infinite-dimensional and nonlinearly by a low-dimensional regression. Among others, this includes causal parameters. To infer these targets privately, we show how suitable privacy mechanisms transfer the semiparametric properties of the sensitive-data model to the private setting. Rate-double-robustness is transferred, enabling locally-private, unbiased and semiparametrically efficient inference of our target parameters. Finally, we transform general nonparametric nuisance estimators into private ones, which inherit convergence properties of their nonprivate counterparts. For parametric nuisance models, we develop a private method-of-moments estimator and its large-sample inference theory.

URL PDF HTML ☆

赞 0 踩 0

2606.19726 2026-06-19 math.ST stat.TH 新提交

A Laplace equation approach to the Behrens--Fisher problem

Behrens-Fisher问题的拉普拉斯方程方法

Nagananda K G, Jong Sung Kim

AI总结针对两独立正态样本方差未知且不等的情况，提出偏微分方程公式，通过正交分解和球面楔概率将分布问题转化为拉普拉斯-狄利克雷边值问题，导出累积分布函数和概率密度的精确有限样本表示，并得到尾部分布展开。

Comments 31 pages, 4 figures

详情

AI中文摘要

我们针对两个独立正态样本（方差未知且不等）的Behrens-Fisher问题，发展了一种偏微分方程公式。通过正交分解分离均值分量和残差分量（对应于去除均值方向后中心化的样本内变异），并将样本均值的学生化差异重新表述为尺度不变的几何约束。这种简化将分布问题转化为球面楔概率的评估，这些概率被识别为调和测度以及拉普拉斯-狄利克雷边值问题在原点的值。在此框架下，我们导出了累积分布函数和概率密度函数的精确有限样本表示，形式为贝塔函数，仅依赖于样本量和方差比。这些表示将Behrens-Fisher分布置于标准特殊函数形式中，可直接在广泛可用的商业软件（包括Microsoft Excel）中使用，从而便于分布评估和分位数计算。我们还得到了相关调和延拓及其阈值导数的Gegenbauer分离变量展开，系数为封闭的贝塔-伽马形式，并导出了具有显式首项常数和高阶修正的尖锐尾部分布展开。

英文摘要

We develop a partial differential equation formulation of the Behrens-Fisher problem for two independent normal samples with unknown and unequal variances. An orthogonal decomposition separates mean and residual components (corresponding to the centered within-sample variation left after removal of the mean directions) and recasts the studentized difference of sample means as a scale-invariant geometric constraint. This reduction transforms the distributional problem into the evaluation of spherical wedge probabilities, which are identified with harmonic measure and with the value at the origin of a Laplace-Dirichlet boundary value problem. From this framework, we derive exact finite-sample representations for the cumulative distribution function and the probability density function in terms of beta functions, with dependence only on the sample sizes and the variance ratio. These representations place the Behrens-Fisher law in a standard special-function form that is directly accessible in widely available commercial software -- including Microsoft Excel -- thereby facilitating distributional evaluation and quantile computation. We also obtain a Gegenbauer separation-of-variables expansion for the associated harmonic extension and its threshold derivative, with coefficients in closed Beta-Gamma form, and derive sharp tail expansions with explicit leading constants and higher-order corrections.

URL PDF HTML ☆

赞 0 踩 0

2606.20557 2026-06-19 cs.LG math.ST stat.ML stat.TH 交叉投稿

Optimal Deterministic Multicalibration and Omniprediction

最优确定性多校准与全预测

Georgy Noarov, Aaron Roth

发表机构 * University of Pennsylvania（宾夕法尼亚大学）

AI总结本文提出一种确定性算法，实现多校准的极小化最优样本复杂度，并推广到结果不可区分性，解决确定性预测器是否必要的问题。

详情

AI中文摘要

一个模型在一组群体权重 $G$ 上是多校准的，如果它是校准的——即即使以其预测为条件也是无偏的——不仅整体上，而且在通过每个 $g \in G$ 对上下文重新加权后也是如此。这对于许多下游应用是一个有用的性质，也是可信机器学习的基本要求。在这项工作之前，所有已知达到 $\varepsilon$-多校准的极小化最优 $\widetilde O(\varepsilon^{-3})$ 样本复杂度的预测器都是随机化的，而确定性预测器仅以更差的样本复杂度已知。多校准中随机化对于最优样本复杂度是否必要的问题由 [CLNR26] 明确提出，并在之前的几项工作中隐含提出。我们通过给出一个输出确定性预测器的极小化最优多校准算法解决了这个开放问题。然后我们将该算法推广到产生满足关于有限或有限覆盖测试集合的结果不可区分性（OI）的最优确定性预测器。作为一个应用，这也给出了具有最优样本复杂度的确定性全预测器和泛预测器，解决了 [OKK25] 和 [BHHLZ25] 提出的开放问题。

英文摘要

A model is multicalibrated on a collection of group weights $G$ if it is calibrated -- i.e. unbiased even conditional on its prediction -- not just overall, but also after reweighting contexts by each $g \in G$. It is a useful property for many downstream applications and is a basic desideratum of trustworthy machine learning. Before this work, all predictors known to attain the minimax-optimal $\widetilde O(\varepsilon^{-3})$ sample complexity rate for $\varepsilon$-multicalibration were randomized, while deterministic predictors were known only with substantially worse sample complexity. Whether randomization is necessary for optimal sample complexity in multicalibration was explicitly asked by [CLNR26] and implicitly in several prior works. We resolve this open problem by giving a minimax-optimal multicalibration algorithm that outputs a deterministic predictor. We then generalize the algorithm to produce optimal deterministic predictors that satisfy outcome indistinguishability (OI) with respect to finite or finitely covered collections of tests. As an application, this also gives deterministic omnipredictors and panpredictors with optimal sample complexity, resolving open problems posed by [OKK25] and [BHHLZ25].

URL PDF HTML ☆

赞 0 踩 0

2606.19655 2026-06-19 stat.CO math.ST stat.TH 交叉投稿

A Flat Connection: The Pooling Factor and the Geometry of Centring in Hierarchical MCMC

平坦联络：分层MCMC中的汇集因子与中心化几何

Aidan D. Bindoff

AI总结研究分层MCMC中中心化/非中心化障碍的几何原因，证明Fisher信息诱导的联络是平坦的，障碍源于统计上的汇集因子π_j，并据此提出诊断方法。

Comments 39 pages, 9 figures, accompanying R package

详情

AI中文摘要

标准MCMC诊断（$\hat{R}$、有效样本量、发散计数）检测链是否混合，但不检测为何未混合。我们询问分层模型中的中心化/非中心化障碍是否具有度量之外的几何原因。联合参数空间是一个纤维丛（超参数为底，组级参数为纤维），Fisher信息度量诱导一个Ehresmann联络$A = -G_{FF}^{-1}G_{BF}$；自然假设是障碍是其曲率，采样器将其感受为和乐。我们证明这是错误的。对于任何光滑的分层后验，不仅是高斯情况，联络是平坦的，因为其水平叶是纤维得分$\partial_\alpha \log p$的水平集：度量之上没有几何障碍。剩下的障碍是统计的，而非几何的，平坦联络将其识别为一个单一量：纤维对底的条件依赖性，由每组的先验比例$\pi_j$（经典汇集因子）控制。该框架由此恢复了已有图景：先验主导的组混合缓慢，每组的非中心化最优权重有闭式解，并且一项模拟研究通过它们对分层方差的相反依赖性，将这种底-纤维耦合与漏斗（一种不同的底空间病态）区分开来。一项直接归因测试确认NUTS不运输纤维：链级足迹是先验主导组中多余的条件自相关，正如$\pi_j$所预测。真正的、甚至旋转的曲率确实出现，但仅针对由采样器工作度量（固定质量矩阵）构建的联络，此时和乐作为算法现象而非几何现象重新出现。先验比例诊断作为R包fibr分发，几何方法作为附带的复现代码。

英文摘要

Standard MCMC diagnostics ($\hat{R}$, effective sample size, divergence counts) detect whether a chain has mixed, but not why it has not. We ask whether the centring/non-centring obstruction in hierarchical models has a geometric cause beyond the metric. The joint parameter space is a fiber bundle (hyperparameters the base, group-level parameters the fibers), and the Fisher information metric induces an Ehresmann connection $A = -G_{FF}^{-1}G_{BF}$; the natural hypothesis is that the obstruction is its curvature, felt by the sampler as holonomy. We prove this false. The connection is flat for any smooth hierarchical posterior, not only the Gaussian case, because its horizontal leaves are the level sets of the fiber score $\partial_α\log p$: there is no geometric obstruction above the metric. What remains is statistical, not geometric, and the flat connection identifies it as a single quantity: the conditional dependence of fiber on base, governed per group by the prior fraction $π_j$, the classical pooling factor. From it the framework recovers the established picture, that prior-dominated groups mix slowly and that the optimal per-group non-centring weight follows in closed form, and a simulation study separates this base-fiber coupling from the funnel, a distinct base-space pathology, by their opposite dependence on the hierarchical variance. A direct attribution test confirms that NUTS does not transport the fiber: the chain-level footprint is excess conditional autocorrelation in prior-dominated groups, exactly as $π_j$ predicts. Genuine, even rotational, curvature does appear, but only for connections built from a sampler's working metric (a fixed mass matrix), where holonomy re-enters as an algorithmic rather than geometric phenomenon. The prior-fraction diagnostic is distributed as the R package fibr, with the geometric methods as accompanying reproduction code.

URL PDF HTML ☆

赞 0 踩 0

2606.19859 2026-06-19 cs.IT cs.LG math.IT math.PR math.ST stat.TH 交叉投稿

Doeblin Curves

Doeblin 曲线

Dongmin Lee, William Lu, Anuran Makur, Japneet Singh

AI总结提出 Doeblin 曲线概念，量化马尔可夫核在不同散度和功率水平下的收缩行为，并应用于噪声迭代优化、噪声电路可靠计算和差分隐私等领域的更细粒度收缩分析。

Comments 42 pages, 2 figures

Journal ref IEEE Transactions on Information Theory, vol. 72, no. 6, pp. 3556-3596, June 2026

详情

DOI: 10.1109/TIT.2026.3678229

AI中文摘要

近期关于 Doeblin 系数的研究揭示了它们作为 TV 距离的 Dobrushin 收缩系数的多路泛化的有用性，这与它们在马尔可夫链遍历性理论中的经典作用不同。然而，为了建立信息收缩的存在性，通常需要强条件，例如远离 0。基于最近提出的非线性信息收缩概念，我们旨在提出一种更细粒度的基于 Doeblin 的多路收缩行为刻画，即使对于 Doeblin 系数为 0 的信道，也能产生非平凡的收缩保证。为此，我们引入了 Doeblin 曲线的概念——一种非线性函数，它量化了马尔可夫核在特定散度和功率水平下对输入分布集合的收缩行为。在我们的分析过程中，我们发展了 Doeblin 系数的新变分刻画，提出了 Doeblin 曲线的若干性质，定义了功率约束 Doeblin 曲线的几个版本，并利用上述变分刻画推导了上下界。然后，我们将这些结果应用于不同领域，包括噪声迭代优化的泛化界、噪声电路可靠计算的误差界以及在线迭代算法的差分隐私保证。特别是，我们将这些领域的结果扩展到更广泛的领域或群体设置，利用 Doeblin 曲线揭示比 Doeblin 系数更细粒度的收缩现象。

英文摘要

Recent research on Doeblin coefficients has shed light on their usefulness as a multi-way generalization of the Dobrushin contraction coefficient for TV distance, in a separate vein from their classic role in the theory of Markov chain ergodicity. However, strong conditions, such as being bounded away from 0, are typically necessary for Doeblin coefficients to establish the existence of information contraction. Building on recently formulated concepts of nonlinear information contraction, we aim to propose a finer-grained Doeblin-based characterization of multi-way contraction behavior which yields non-vacuous contraction guarantees even for channels whose Doeblin coefficient is 0. To this end, we introduce the notion of a Doeblin curve -- a nonlinear function which quantifies the contraction behavior of a Markov kernel on collections of input distributions at specific levels of divergence and power. Through the course of our analysis, we develop a new variational characterization of Doeblin coefficients, present several properties of Doeblin curves, define several versions of power-constrained Doeblin curves, and derive upper and lower bounds using our aforementioned variational characterization. We then utilize these results in diverse areas, including generalization bounds for noisy iterative optimization, error bounds for reliable computation with noisy circuits, and differential privacy guarantees for online iterative algorithms. In particular, we extend results in these areas to broader domains or group settings, leveraging Doeblin curves to reveal finer-grained contraction phenomena than Doeblin coefficients.

URL PDF HTML ☆

赞 0 踩 0

2606.17165 2026-06-19 stat.ME cs.AI econ.EM math.ST stat.TH 交叉投稿

Statistical Foundations of LLM-based A/B Testing: A Surrogacy Framework for Human Causal Inference

基于LLM的A/B测试的统计基础：用于人类因果推断的替代指标框架

Joel Persson, Mårten Schultzberg, Sebastian Ankargren

发表机构 * Spotify USA, Inc.（Spotify美国公司）

AI总结提出替代指标理论框架，证明在弱于分布等价条件下，校准LLM输出可识别平均处理效应，并分析随机性带来的偏差与方差。

详情

AI中文摘要

组织和研究者越来越有兴趣在A/B测试中使用大型语言模型（LLM）代替人类参与者，以期更快、更低成本地进行实验。我们研究当在LLM结果上估计的处理效应何时能够恢复在感兴趣的人类群体上测量的效应。LLM与人类结果之间的分布等价性会使任何标准估计量有效，但这不现实。因此，我们开发了一个统计框架，将替代终点理论适配到LLM。该框架表明，将LLM结果校准到人类结果，在替代性和可比性条件（联合弱于分布等价性）下，可以识别平均处理效应。当这些条件不成立时，感兴趣的效应仅部分可识别，我们提供了诊断方法，可以在历史实验上证伪替代性，并给出有限重叠下最坏情况偏差的界限。我们进一步证明，LLM固有的随机性会引入偏差和方差，但使用多次抽取的平均值作为替代指标可以同时缓解两者。我们在模拟和Upworthy标题的A/B测试应用中展示了方法和理论。我们工作的一个核心结论是，LLM结果作为替代指标的有效性只能对过去的处理被证伪，而无法对新处理被验证，因此对于新颖干预，人类实验仍然不可或缺。我们讨论了LLM选择、提示和温度作为设计变量的作用，以及如何确定人类实验的规模以进行验证。

英文摘要

Organizations and researchers show increasing interest in using large language models (LLMs) in place of human participants in A/B tests, in the hope of experimenting faster and at lower cost. We study when a treatment effect estimated on LLM outcomes can recover the effect that would have been measured on the human population of interest. Distributional equivalence between LLM and human outcomes would make any standard estimator valid but is unrealistic. We therefore develop a statistical framework that adapts surrogate endpoint theory to LLMs, showing that calibrating LLM outcomes to human outcomes identifies the average treatment effect under surrogacy and comparability conditions that are jointly weaker than distributional equivalence. We present a falsification test for surrogacy and a bound on the worst-case bias from limited overlap between the LLM and human samples. We further show that the stochasticity inherent to LLMs can weaken surrogacy for identification while also introducing bias and variance during estimation, but that using an average over multiple LLM draws per unit as the surrogate mitigates these issues. Simulations validate the results, and an empirical application to A/B tests on Upworthy headlines shows that raw LLM predictions recover only 39\% of the human treatment effect while nonparametric calibration closes the gap. A central takeaway is that A/B testing on LLMs yields correct results only by assumption, whereas A/B testing on humans is correct by design, and that the required assumptions are hardest to justify precisely where A/B testing on LLMs promises the greatest benefit. We discuss the role of LLM choice, prompting, and temperature as design variables, the compounded challenge posed by long-term outcomes, and how to size human pilot studies for validation.

URL PDF HTML ☆

赞 0 踩 0

2606.11171 2026-06-19 cs.LG cond-mat.stat-mech cs.IT math.IT math.OC math.ST stat.TH 交叉投稿

Indexed Bellman Information Complexity

核赌博机中的算法与极小极大复杂度

Yunbei Xu

AI总结本文通过统一MAIR框架，将GP-UCB与MAMS算法置于共同语言下，提出结合两者优势的安全主算法，并证明在过参数化模型中算法复杂度比类宽极小极大或DEC证书更具信息性。

详情

AI中文摘要

高斯过程上置信界（GP-UCB）和决策估计系数（DEC）方法乍看之下可能属于不同的理论。本文将这两种观点置于一个共同的算法信息语言中，用于频率学派RKHS赌博机。GP-UCB固定了一个算法性的（而非真实的）高斯过程先验，并利用实现轨迹的复杂度以及计算可处理性，而MAMS优化了一个鲁棒的类宽MAIR/DEC包络。通过统一的MAIR框架和异质半正定算法先验，我们推广了GP-UCB分析和MAMS算法，提出了一种结合两者优势的安全主算法，并提供了一个核赌博机构造，表明在过参数化模型中算法复杂度可以比类宽极小极大或DEC证书更具信息性。由此得出的信息是：算法信息和类宽极小极大系数回答不同的问题，并可能导致不同的差距；核赌博机提供了一个干净的环境，使得这种区别在数学上变得可见。

英文摘要

We develop indexed Bellman information complexity, a representation-level theory of interactive decision making centered on information indices and reference histories. The representation strips away problem-specific syntax and retains only the ingredients needed for dynamic programming and information accounting, thereby unifying the earlier framework of indexed algorithmic information ratios (AIR). On the upper-bound side, regret is controlled by Bellman supersolutions or potential identities whose gradient bracket is paid for by indexed information. Upper-confidence-bound (UCB), estimation-to-decision/decision-estimation-coefficient (E2D/DEC), and adaptive-minimax-sampling or exploration-by-optimization (AMS/EBO) methods appear as three relaxations of this same identity. On the lower-bound side, the posterior-reference trajectory supplies both the information telescope and the ghost quantile of small-regret trajectories. The resulting critical radius in the lower bound is an effective-dimension-scale quantity, as in Fano and local-prior-mass lower bounds, rather than the constant radius of a two-point Le Cam argument. The examples show that DEC is best viewed as a one-step relaxation of indexed Bellman information complexity, not as a universally tight conversion mechanism. We illustrate the framework through several applications, with particular emphasis on kernel bandits. In this setting, the active action marginal provides a concrete basis for comparing UCB, E2D, and AMS/EBO.

URL PDF HTML ☆

赞 0 踩 0

2605.20541 2026-06-19 math.ST math.PR stat.TH 版本更新

Finite-Sample Bounds for Expected Signature Estimation under Weak Dependence

有限样本下弱依赖条件下期望签名估计的界限

Bryson Schenck

AI总结本文研究了在弱依赖条件下，从单一长依赖轨迹估计期望签名的有限样本界限，通过块平均估计器证明了非渐近的均方误差界，并探讨了在不同Hurst指数下的收敛性。

Comments 59 pages, 1 figure

详情

AI中文摘要

期望签名在满足矩增长条件时唯一确定随机粗糙路径的分布，但此前缺乏从单一长依赖轨迹估计其有限样本界限。本文研究了一个平稳随机过程，其样本路径可解释为几何粗糙路径，被划分为等间距观测的块，并证明了块平均估计器的非渐近均方误差界。当路径的Hölder正则性至多为1/2时，需要粗糙路径理论来定义估计量，因为Young积分和Riemann-Stieltjes积分无法定义签名的迭代积分。在矩、平稳性和块签名协方差衰减条件（严格弱于α-混合且适用于长程依赖驱动器）下，误差分为离散化项和波动项，其速率分别由路径正则性和依赖强度决定。通过逐层粗糙因子方差分析，保持有限截断常数显式，并在固定观测预算下获得最优分配规则。本文验证了分数奥本海姆-乌伦贝克过程在三个制度下的假设，即粗糙（Hurst H<1/2）、半鞅（H=1/2）和长程（H>1/2）。蒙特卡罗实验显示经验收敛速率快于理论上界。

英文摘要

The expected signature uniquely determines the law of a random rough path under a moment-growth condition, yet finite-sample bounds for estimating its truncations from a single long dependent trajectory remain unavailable. We study a strictly stationary stochastic process equipped with a geometric rough-path lift, observed in non-overlapping blocks of equally-spaced samples, and prove a non-asymptotic mean-squared error (MSE) bound for the block-averaging estimator of its truncated expected signature. Under moment and stationarity assumptions together with a direct covariance-decay condition on block signatures -- strictly weaker than $α$-mixing and applicable to long-range-dependent processes -- the error separates into a discretization term and a fluctuation term, with rates determined respectively by path regularity and dependence strength. A levelwise rough-factorial variance analysis keeps finite-truncation constants explicit and yields an optimal allocation rule under a fixed observation budget. We verify the assumptions for independent-coordinate fractional Ornstein--Uhlenbeck processes in three regimes: short-range (Hurst $1/4<H<1/2$), semimartingale ($H=1/2$), and long-range ($H>1/2$); in all three, the block-signature covariance is summable, so the fluctuation term decays at the same rate as in the independent-block case, even under long memory at $H>1/2$. Monte Carlo experiments show empirical slopes steeper than the guaranteed upper-bound rates.

URL PDF HTML ☆

赞 0 踩 0

2412.17470 2026-06-19 math.ST econ.EM stat.ME stat.TH 版本更新

A Necessary and Sufficient Condition for Size Controllability of Heteroskedasticity Robust Test Statistics

异方差稳健检验统计量尺寸可控性的一个充要条件

Benedikt M. Pötscher, David Preinerstorfer

AI总结针对回归模型中单个约束检验，给出了异方差稳健检验统计量尺寸可控性的充要条件，改进了现有仅充分条件的结果。

Comments Clarification in Footnote 15 added

2604.02336 2026-06-19 math.FA math.ST stat.TH 版本更新

The Shift Operator Calculus for Stationary Time Series Analysis

平稳时间序列分析的移位算子演算

Anand Ganesh, Babhrubahan Bose, Anand Rajagopalan

AI总结本文为平稳时间序列建模建立了严格的移位算子演算，证明了不同函数族下转移函数算子的存在性和等距性，并统一了平稳过程可逆性与转移函数算子可逆性的概念。

Comments 7 pages

2602.04550 2026-06-19 quant-ph math.ST stat.TH 版本更新

Locally Gentle State Certification for High Dimensional Quantum Systems

高维量子系统的局部温和态认证

Cristina Butucea, Jan Johannes, Henning Stein

AI总结研究局部温和量子态认证中非破坏性测量的信息代价，推导出样本复杂度为Θ(d³/(ε²α²))，揭示了α-温和性惩罚与希尔伯特空间维度d的线性关系。

详情

AI中文摘要

量子统计推断的标准方法依赖于引起波函数坍缩的测量，从而消耗量子态以提取信息。在本工作中，我们研究了\emph{局部温和}量子态认证的基本极限，其中学习算法被限制在迹范数下最多扰动态$\alpha$，从而允许样本重用。我们分析了区分未知态$\rho$等于参考态$\rho_0$还是与其$\epsilon$-远的问题。我们推导了该问题的极小极大样本复杂度，量化了非破坏性测量的信息代价。具体地，通过构造显式测量算子，我们证明了$\alpha$-温和性约束施加了$\frac{d}{\alpha^2}$的样本量惩罚，导致总样本复杂度为$n = \Theta(\frac{d^3}{\epsilon^2 \alpha^2})$。我们的结果阐明了信息提取与态扰动之间的权衡，并突出了量子学习中物理测量约束与隐私机制之间的深层联系。关键地，我们发现施加$\alpha$-温和性所导致的样本量惩罚与希尔伯特空间维度$d$呈线性关系，而非高维私有估计中典型的参数数量$d^2-1$。

英文摘要

Standard approaches to quantum statistical inference rely on measurements that induce a collapse of the wave function, effectively consuming the quantum state to extract information. In this work, we investigate the fundamental limits of \emph{locally-gentle} quantum state certification, where the learning algorithm is constrained to perturb the state by at most $α$ in trace norm, thereby allowing for the reuse of samples. We analyze the hypothesis testing problem of distinguishing whether an unknown state $ρ$ is equal to a reference $ρ_0$ or $ε$-far from it. We derive the minimax sample complexity for this problem, quantifying the information-theoretic price of non-destructive measurements. Specifically, by constructing explicit measurement operators, we show that the constraint of $α$-gentleness imposes a sample size penalty of $\frac{d}{α^2}$, yielding a total sample complexity of $n = Θ(\frac{d^3}{ε^2 α^2})$. Our results clarify the trade-off between information extraction and state disturbance, and highlight deep connections between physical measurement constraints and privacy mechanisms in quantum learning. Crucially, we find that the sample size penalty incurred by enforcing $α$-gentleness scales linearly with the Hilbert-space dimension $d$ rather than the number of parameters $d^2-1$ typical for high-dimensional private estimation.

URL PDF HTML ☆

赞 0 踩 0

2509.15822 2026-06-19 stat.ML cs.LG math.PR math.ST stat.TH 版本更新

Phase Transition for Stochastic Block Model with more than $\sqrt{n}$ Communities

具有多于 $\sqrt{n}$ 个社区的随机块模型的相变

Alexandra Carpentier, Christophe Giraud, Nicolas Verzelen

发表机构 * Institut für Mathematik – Universität Potsdam, Potsdam, Germany（波恩大学数学研究所，德国波恩）； Laboratoire de Mathématiques d’Orsay, Université Paris-Saclay, CNRS, France（奥赛数学实验室，巴黎-萨克雷大学，法国 CNRS）； INRAE, Institut Agro, MISTEA, Univ. Montpellier, France（国家农业研究院，蒙彼利埃大学，法国）

AI总结本文证明在随机块模型中，当社区数 $K\geq \sqrt{n}$ 时，低度多项式在 Chin 等人提出的阈值以下无法恢复社区，而通过计数特定子图可在多项式时间内实现恢复，支持了新相变阈值的猜想。

详情

AI中文摘要

统计物理的预测表明，在随机块模型（SBM）中，当社区数 $K$ 固定时，社区恢复在 Kesten-Stigum (KS) 阈值以上（且仅在其以上）可以在多项式时间内实现。这一猜想催生了丰富的文献，证明在 KS 阈值以上的 SBM 中，非平凡社区恢复确实是可能的。只要 $K\ll \sqrt{n}$（其中 $n$ 是观测图中的节点数），KS 阈值以下低度多项式（LDP）的失败也被证明。当 $K\geq \sqrt{n}$ 时，Chin 等人（2025）最近证明，在稀疏机制中，通过计数非回溯路径，可以在 KS 阈值以下的多项式时间内实现社区恢复。这一突破使他们提出了多社区机制 $K\geq \sqrt{n}$ 的新阈值。在这项工作中，我们为他们的猜想提供了证据：\n1- 我们证明，对于任意图密度，LDP 无法在 Chin 等人（2025）提出的阈值以下恢复社区；\n2- 我们证明，在所提出的阈值以上，不仅是在 Chin 等人（2025）考虑的稀疏机制中，而且在适度稀疏机制中，通过计数受 LDP 分析启发的某些特定子图，可以在多项式时间内实现社区恢复。\n特别地，计数长度为 $\log(n)$ 的自避路径（这与基于非回溯算子的谱算法密切相关）仅在稀疏机制中是最优的。在更密集的机制中，必须考虑基于循环放大的更复杂子图。

英文摘要

Predictions from statistical physics postulate that recovery of the communities in the Stochastic Block Model (SBM) with a fixed number $K$ of communities is possible in polynomial time above, and only above, the Kesten-Stigum (KS) threshold. This conjecture has given rise to a rich literature, proving that non-trivial community recovery is indeed possible in SBM above the KS threshold. Failure of low-degree polynomials (LDP) below the KS threshold was also proven, as long as $K\ll \sqrt{n}$, where $n$ is the number of nodes in the observed graph. When $K\geq \sqrt{n}$, Chin et al.(2025) recently proved that, in a \emph{sparse regime}, community recovery in polynomial time is possible below the KS threshold by counting non-backtracking paths. This breakthrough led them to postulate a new threshold for the many-communities regime $K\geq \sqrt{n}$. In this work, we provide evidence supporting their conjecture:\\ 1- We prove that, for \emph{any graph density}, LDP fail to recover communities below the threshold postulated by Chin et al.(2025) ;\\ 2- We prove that community recovery is possible in polynomial time above the postulated threshold, not only in the \emph{sparse regime} considered in Chin et al.~(2025), but also in \emph{moderately sparse regimes}, by counting occurrences of some specific motifs inspired by the LDP analysis.\\ In particular, counting self-avoiding paths of length $\log(n)$, which is closely related to spectral algorithms based on the Non-Backtracking operator, is optimal only in the sparse regime. More complex motifs based on the blow-up of a cycle must be considered in denser regimes.

URL PDF HTML ☆

赞 0 踩 0

2408.15920 2026-06-19 math.ST math.PR stat.TH 版本更新

Nonlinear Filtering and Spatial Asymptotic Consistency for SPDEs Observed via Spatio-Temporal Point Processes

Jan Szalankiewicz, Cristina Martinez-Torres, Wilhelm Stannat

Comments Fixed several typos throughout the manuscript, substantially revised Section 4 with improved theoretical bounds, and updated simulations with corresponding code base improvements

Journal ref Stoch PDE: Anal Comp (2026)

2504.09564 2026-06-19 math.ST stat.TH 版本更新

The weak-feature-impact effect on the NPMLE in monotone binary regression

单调二元回归中弱特征影响对NPMLE的影响

Dario Kieffer, Angelika Rohde

AI总结研究单调二元回归中非参数最大似然估计在弱特征关系下的极限分布，发现一种新的分布连续插值于两个极端情况，并改进了小样本近似。

Comments Added Theorem 3.3 and several visualizations

详情

AI中文摘要

统计文献提供了单调二元回归中非参数最大似然估计（NPMLE）在两种极端情况下的逐点极限分布：如果特征-标签关系严格单调且足够光滑，则以立方根$n$速率收敛，具有缩放Chernoff型极限分布；如果底层关系平坦，则以参数$\sqrt{n}$速率收敛。本文提供了NPMLE分布演变的完整图景，揭示了一种新的极限分布，在弱特征-标签关系的情况下，为小样本提供了显著更好的分布近似。该分布被证明连续插值于两个极端情况之间。确定该分布的创新方法是将其作为新引入的弱特征影响三角阵列中NPMLE的极限，针对特定的参数-样本量配置。此外，在适当缩放的$L^{1}$误差中同样观察到弱特征影响场景下的相变。作为副产品，获得了平坦回归函数下的极限分布，这是先前未知的。证明开发了一种全新的策略，特别是不基于开关关系。伴随这些结果的新型局部极小极大下界。

英文摘要

Statistical literature provides pointwise limiting distributions of the nonparametric maximum likelihood estimator (NPMLE) in monotone binary regression for the two extremal cases: If the feature-label relation is strictly monotone and sufficiently smooth, it converges at a cube-root-$n$ rate with scaled Chernoff-type limiting distribution, and it converges at the parametric $\sqrt{n}$-rate if the underlying relation is flat. In this article, we provide the complete picture of the distributional metamorphosis of the NPMLE, revealing a new limiting distribution which provides a significantly better distributional approximation for small samples in case of a weak feature-label relationship. It is shown to continuously interpolate between the two extremal cases. The innovative way to determine this distribution is to generate it as a limit of the NPMLE in the newly introduced weak-feature-impact triangular array for a particular parameter-sample-size constellation. Moreover, the phase transition is likewise observed for the suitably rescaled $L^{1}$-error in this weak-feature-impact scenario. As a by-product, its limiting distribution for flat regression functions is obtained, which was unknown before. The proof develops a completely new strategy, notably not based on the switch relation. A novel type of local minimax lower bounds accompanies these results.

URL PDF HTML ☆

赞 0 踩 0

2503.11479 2026-06-19 stat.CO math.PR math.ST stat.ME stat.TH 版本更新

Towards practical PDMP sampling: Metropolis adjustments, locally adaptive step-sizes, and NUTS-based time lengths

走向实用的PDMP采样：Metropolis调整、局部自适应步长和基于NUTS的时间长度

Augustin Chevallier, Sam Power, Matthew Sutton

AI总结针对PDMP采样需要计算模型特定界限的难题，提出Metropolis调整近似、自适应步长机制和NUTS启发的路径长度选择，集成得到双重自适应PDMP采样器，提升鲁棒性和效率。

详情

AI中文摘要

分段确定性马尔可夫过程（PDMP）在从复杂概率分布中采样方面具有重要前景。然而，其实践应用受到需要计算模型特定界限的限制。相反，虽然哈密顿蒙特卡洛（HMC）提供了一种普遍有效的采样方法，但其无法自适应调整步长，导致在采样漏斗形等复杂分布时性能受损。为解决这些限制，我们引入了三个创新概念：(a) 一种Metropolis调整的PDMP模拟近似，无需显式界限且不破坏不变测度；(b) 一种与Metropolis校正兼容的自适应步长机制；(c) 一种受无U型转弯采样器（NUTS）启发的方案，用于动态选择PDMP中的路径长度。这三个想法可以无缝集成到一个单一的“双重自适应”PDMP采样器中，具有良好的鲁棒性和效率特性。

英文摘要

Piecewise-Deterministic Markov Processes (PDMPs) hold significant promise for sampling from complex probability distributions. However, their practical implementation is hindered by the need to compute model-specific bounds. Conversely, while Hamiltonian Monte Carlo (HMC) offers a generally efficient approach to sampling, its inability to adaptively tune step sizes impedes its performance when sampling complex distributions like funnels. To address these limitations, we introduce three innovative concepts: (a) a Metropolis-adjusted approximation for PDMP simulation that eliminates the need for explicit bounds without compromising the invariant measure, (b) an adaptive step size mechanism compatible with the Metropolis correction, and (c) a No U-Turn Sampler (NUTS)-inspired scheme for dynamically selecting path lengths in PDMPs. These three ideas can be seamlessly integrated into a single, `doubly-adaptive' PDMP sampler with favourable robustness and efficiency properties.

URL PDF HTML ☆

赞 0 踩 0

2309.15769 2026-06-19 math.ST cs.LG stat.ME stat.TH 版本更新

Benign overfitting beyond prediction: The ordinary least squares interpolator

超越预测的良性过拟合：普通最小二乘插值器

Dennis Shen, Dogyoon Song, Peng Ding, Jasjeet S. Sekhon

发表机构 * Department of Data Sciences & Operations, University of Southern California（数据科学与运营系，南加州大学）； Department of Statistics, University of California, Davis（统计学系，加州大学戴维斯分校）； Department of Statistics, University of California, Berkeley（统计学系，加州大学伯克利分校）； Google DeepMind（谷歌DeepMind）

AI总结本文研究过参数化线性模型中最小ℓ2范数OLS插值器的参数估计与推断性质，推导了留k法、遗漏变量偏误公式和Frisch-Waugh-Lovell定理的过参数化版本，并扩展了高斯-马尔可夫定理。

Comments This work is accepted for publication in Biometrika

详情

AI中文摘要

深度学习的最新进展突显了过参数化统计模型中良性过拟合的现象，引发了对其基础理解的浓厚兴趣。由于其简单性和实际相关性，普通最小二乘（OLS）插值器已成为从理论上理解这一现象的关键研究对象。虽然OLS在经典欠参数化设置下的性质已得到充分理解，但其在过参数化区域中的行为——与岭回归或lasso不同——仍相对较少被探索。我们通过为最小$\ell_2$范数OLS插值器推导新的代数和统计结果，为这一不断增长的文献做出贡献。与现有大部分关注预测风险的工作不同，我们的分析集中于参数估计和推断，这对于许多统计学和因果推断应用至关重要。具体地，我们建立了以下内容的过参数化类比：(i) 留$k$法公式，(ii) 遗漏变量偏误公式，以及(iii) Frisch-Waugh-Lovell定理。在高斯-马尔可夫模型下，我们进一步扩展了高斯-马尔可夫定理，并分析了过参数化设置下同方差性时的方差估计。这些结果共同为研究过参数化线性模型中的参数估计和推断提供了一个系统框架，为超越预测含义的良性过拟合提供了新视角。

英文摘要

Recent advances in deep learning have highlighted the phenomenon of benign overfitting in overparameterized statistical models, sparking significant interest in understanding its foundations. Owing to its simplicity and practical relevance, the ordinary least squares (OLS) interpolator has become a key object of study for gaining theoretical insight into this phenomenon. While the properties of OLS are well understood in classical underparameterized settings, its behavior in the overparameterized regime -- unlike that of ridge regression or the lasso -- remains comparatively less explored. We contribute to this growing literature by deriving new algebraic and statistical results for the minimum $\ell_2$-norm OLS interpolator. In contrast to much of the existing work, which focuses on prediction risk, we center our analysis on parameter estimation and inference, which are fundamental for many statistics and causal inference applications. Specifically, we establish overparameterized analogues of (i) the leave-$k$-out formulas, (ii) the omitted variable bias formula, and (iii) the Frisch-Waugh-Lovell theorem. Under the Gauss-Markov model, we further extend the Gauss-Markov theorem and analyze variance estimation under homoskedasticity in the overparameterized setting. Collectively, these results provide a systematic framework for studying parameter estimation and inference in overparameterized linear models, offering a novel perspective on benign overfitting beyond its implications for prediction.

URL PDF HTML ☆

赞 0 踩 0

2405.00118 2026-06-19 math.ST stat.ME stat.TH 版本更新

Causal Inference with High-dimensional Discrete Covariates

高维离散协变量下的因果推断

Zhenghao Zeng, Sivaraman Balakrishnan, Yanjun Han, Edward H. Kennedy

AI总结研究高维离散协变量下因果效应的估计问题，证明常用估计量的均方误差界为d²/n²+1/n，并给出极小化下界，提出利用效应同质性和先验知识的新估计量以加速收敛。

Comments 74 pages, 9 figures

详情

AI中文摘要

在从观察性研究估计因果效应时，研究人员通常需要调整许多协变量以消除暴露与结果之间的非因果关系，其中许多协变量是离散的。常用估计量在存在许多离散协变量时的行为尚不明确，因为它们的性质通常是在稀疏性和平滑性等结构假设下分析的，而这些假设不适用于离散设置。在这项工作中，我们研究了一个模型中因果效应的估计，其中用于混杂调整的协变量是离散但高维的，意味着类别数量$d$与样本量$n$相当甚至更大。具体来说，我们证明了常用回归、加权和双稳健估计量的均方误差以$\frac{d^2}{n^2}+\frac{1}{n}$为界。然后，我们证明了平均处理效应的极小化下界为$\frac{d^2}{n^2 \log^2 n}+\frac{1}{n}$量级，这刻画了高维离散设置下因果效应估计的基本难度，并表明上述估计量在忽略对数因子时是速率最优的。我们进一步考虑了可以利用的额外结构，即效应同质性和协变量分布的先验知识，并提出了新的估计量，这些估计量具有更快的收敛速率$\frac{d}{n^2} + \frac{1}{n}$，从而在更广泛的范围内实现一致性。通过模拟研究对结果进行了实证说明。

英文摘要

When estimating causal effects from observational studies, researchers often need to adjust for many covariates to deconfound the non-causal relationship between exposure and outcome, among which many covariates are discrete. The behavior of commonly used estimators in the presence of many discrete covariates is not well understood since their properties are often analyzed under structural assumptions including sparsity and smoothness, which do not apply in discrete settings. In this work, we study the estimation of causal effects in a model where the covariates required for confounding adjustment are discrete but high-dimensional, meaning the number of categories $d$ is comparable with or even larger than sample size $n$. Specifically, we show the mean squared error of commonly used regression, weighting and doubly robust estimators is bounded by $\frac{d^2}{n^2}+\frac{1}{n}$. We then prove the minimax lower bound for the average treatment effect is of order $\frac{d^2}{n^2 \log^2 n}+\frac{1}{n}$, which characterizes the fundamental difficulty of causal effect estimation in the high-dimensional discrete setting, and shows the estimators mentioned above are rate-optimal up to log-factors. We further consider additional structures that can be exploited, namely effect homogeneity and prior knowledge of the covariate distribution, and propose new estimators that enjoy faster convergence rates of order $\frac{d}{n^2} + \frac{1}{n}$, which achieve consistency in a broader regime. The results are illustrated empirically via simulation studies.

URL PDF HTML ☆

赞 0 踩 0

1909.03488 2026-06-19 math.AT cs.CG math.PR math.ST stat.TH 版本更新

Probabilistic Convergence and Stability of Random Mapper Graphs

Adam Brown, Omer Bobrowski, Elizabeth Munch, Bei Wang