arXivDaily arXiv每日学术速递 周一至周五更新
重置
stat.ME统计方法53
2606.12317 2026-06-11 stat.ME stat.CO 新提交

ShrinkageTrees: An R Package for Bayesian Tree Ensembles for Survival Analysis and Causal Inference

ShrinkageTrees: 用于生存分析和因果推断的贝叶斯树集成R包

Tijn Jacobs

AI总结 ShrinkageTrees是一个R包,通过贝叶斯加性回归树模型处理右删失和区间删失生存数据,支持因果推断中的预后和治疗效应分解,并引入深度惩罚、Dirichlet分裂和马蹄铁先验等正则化策略,适用于高维场景。

详情
AI中文摘要

ShrinkageTrees是一个用于生存分析和因果推断的贝叶斯树集成R包。该包在加速失效时间(AFT)框架下实现了针对右删失和区间删失生存结果的贝叶斯加性回归树模型,并可选择分解为预后和治疗效应成分以进行因果推断。提供两种互补的正则化形式:通过深度惩罚先验和Dirichlet分裂先验对树结构进行正则化,以及通过全局-局部收缩先验对步高进行正则化。ShrinkageTrees首次实现了马蹄铁森林,即对步高施加马蹄铁先验。这些正则化策略将贝叶斯树集成扩展到高维设置。高效的Rcpp后端、多链MCMC和S3方法支持完整的流程:拟合、预测、因果效应估计和收敛诊断。

英文摘要

ShrinkageTrees is an R package for Bayesian tree ensembles in survival analysis and causal inference. The package implements Bayesian additive regression tree models for right- and interval-censored survival outcomes within an accelerated failure time (AFT) framework, with optional decomposition into prognostic and treatment-effect components for causal inference. Two complementary forms of regularisation are available: regularisation of the tree structure, via depth-penalising priors and Dirichlet splitting priors, and regularisation of the step heights, via global-local shrinkage priors. ShrinkageTrees provides the first implementation of the Horseshoe Forest, which places a horseshoe prior on the step heights. These regularisation strategies extend Bayesian tree ensembles to high-dimensional settings. An efficient Rcpp backend, multi-chain MCMC, and S3 methods support the full workflow: fitting, prediction, causal effect estimation, and convergence diagnostics.

2606.12305 2026-06-11 stat.ME 新提交

Bayesian nonparametric Mallows model for clustering preference data

贝叶斯非参数Mallows模型用于偏好数据聚类

Lorenzo Zuccato, Veronica Vinciotti, Valeria Vitelli

AI总结 提出基于狄利克雷过程混合模型的贝叶斯非参数Mallows模型,实现聚类数自动推断与聚类分配联合学习,在R包BayesMallows中实现,模拟与真实数据验证有效。

详情
Comments
21 pages (main text), 28 pages including supplementary material. Submitted for peer review
AI中文摘要

偏好学习是指从不同类型的排序和偏好数据中学习潜在模式。偏好学习的典型目标是推断共享共识排序、学习个体级偏好以及进行无监督聚类。Mallows模型是少数能够同时实现所有这些目标的方法之一。先前的工作基于MCMC Metropolis-Hastings方案开发了计算上可行的贝叶斯推断方法,其中通过有限混合Mallows模型进行聚类,然后对聚类数进行后验推断。这里我们提出基于狄利克雷过程混合模型的贝叶斯非参数Mallows模型,允许对非空聚类数和聚类分配进行联合推断,以及对聚类特定参数进行后验推断。所提出的采样算法已集成到现有的R包BayesMallows中,该包还支持不完整排序和成对比较形式的数据。模拟数据表明,与有限混合模型相比,非参数模型在恢复正确聚类数方面表现良好,而电影评分的实证数据展示了该模型在丢弃评分上提供个性化电影推荐的有效性。

英文摘要

Preference learning refers to the learning of latent patterns from ranking and preference data of different kinds. Typical aims of preference learning are to infer a shared consensus ranking, to learn individual-level preferences, and to perform unsupervised clustering. The Mallows model is among the few approaches that can achieve all these objectives jointly. Previous work has developed computationally tractable methods for Bayesian inference based on a MCMC Metropolis-Hastings scheme, where clustering is performed via a finite mixture of Mallows models. Inference on the number of clusters is then conducted a posteriori. Here we propose a Bayesian nonparametric Mallows model, based on a Dirichlet process mixture model. This allows joint inference on the number of non-empty clusters and on the clustering allocation, as well as posterior inference on cluster-specific parameters. The implementation of the proposed sampling algorithm is integrated into the existing R package BayesMallows, which also supports data in the form of incomplete rankings and pairwise comparisons. Simulated data show good performance of the nonparametric model compared to a finite mixture model in terms of recovery of the correct number of clusters, while empirical data on movie ratings show the model's effectiveness in providing personalized movie recommendations on discarded ratings.

2606.12296 2026-06-11 stat.ME 新提交

Bayesian Triangulation Splines: Spatial Adaptation on Irregular Domains

贝叶斯三角剖分样条:不规则域上的空间自适应

Sihyeon Pyeon, Sunwoo Lim, Seonghyun Jeong

AI总结 提出贝叶斯三角剖分样条方法,通过约束Delaunay三角剖分处理不规则域边界和异质性平滑,实现空间自适应,并证明其最优后验收缩率和Oracle性质。

详情
AI中文摘要

针对二维非矩形域的传统非参数回归方法常常忽略域几何结构,允许跨边界平滑。在空间和地质统计应用中,这一假设通常无效,因为域边界通常约束观测之间的相互作用。适应空间变化的平滑度也比单变量设置更具挑战性,大多数现有方法未能充分捕捉目标函数的局部结构。为了解决这些问题,我们提出了贝叶斯三角剖分样条,该方法在多边形域上构造局部自适应样条。该方法采用约束Delaunay三角剖分来尊重边界几何并适应异质性平滑。精心设计的先验进一步提高了经验性能。在全局Sobolev平滑假设下,我们证明了所提方法实现了最优后验收缩率,并适应未知平滑度。我们还表明,该方法在实现非均匀或局部变化结构特征的Oracle率方面表现出理想的空间适应性。至关重要的是,这种Oracle保证并非特定于约束Delaunay三角剖分,而是适用于任何满足弱形状正则条件的三角剖分。模拟研究证实,所提方法通过实现更高的估计精度同时保持低模型复杂度,优于现有方法。

英文摘要

Conventional nonparametric regression methods for two-dimensional non-rectangular domains often overlook domain geometry and allow smoothing across boundaries. In spatial and geostatistical applications, this assumption is frequently invalid because domain boundaries typically constrain interactions among observations. Accommodating spatially varying smoothness is also substantially more challenging than in the univariate setting, and most existing methods do not adequately capture this local structure of the target function. To address these challenges, we propose Bayesian triangulation splines, which constructs locally adaptive splines over a polygonal domain. The method employs constrained Delaunay triangulations to respect boundary geometry and adapt to heterogeneous smoothness. A carefully designed prior further improves empirical performance. Under a global Sobolev smoothness assumption, we show that the proposed method achieves the optimal posterior contraction rate and adapts to unknown smoothness. We also show that the method exhibits ideal spatial adaptation in the sense that it achieves the oracle rate for inhomogeneous or locally varying structural features. Crucially, this oracle guarantee is not specific to constrained Delaunay triangulations, but holds over any triangulation satisfying weak shape-regularity conditions. Simulation studies confirm that the proposed method outperforms existing approaches by achieving higher estimation accuracy while maintaining low model complexity.

2606.12174 2026-06-11 stat.AP stat.ME 新提交

The data-driven extreme value distribution: non-parametric tail estimation with a derived stability criterion

数据驱动的极值分布:基于导出稳定性准则的非参数尾部估计

Michael Sandbichler, Tobias Hell

AI总结 提出数据驱动极值分布(DDEVD),一种非参数估计器,通过核方法重建基分布并导出稳定性准则,在降水与冶金数据中优于传统极值模型。

详情
Comments
28 pages, 6 figures
AI中文摘要

量化极端事件的可能性是风险评估的基础,然而经典极值理论依赖于渐近假设,这在数据稀疏、非平稳的情况下失效,而实践者越来越常遇到这种情况。我们引入了数据驱动极值分布(DDEVD),一种非参数估计器,它元统计地聚合所有观测值,并用核重建基分布,去除了参数尾部假设。我们推导了其最优带宽,并证明了一个稳定性定律 $m < C\\,n^{1+\gamma/2}$,将可靠外推与极值指数 $\gamma$ 联系起来。在亚小时尺度的阿尔卑斯降水数据中,DDEVD 从单个十年中恢复了稳定的100年重现水平(校准比率 $0.96$),与完整记录参考值的偏差超过 $50\\%$ 的情况在不到五十分之一的窗口中发生——而 GEV 拟合则为五分之一。在冶金显微图像中,它在安全相关的晶粒尺寸尾部上与广义极值拟合相匹配,而标准对数正态分布在 $1\\,\mathrm{cm}^{2}$ 处高估了 $58\\%$。

英文摘要

Quantifying the likelihood of extreme events underpins risk assessment, yet classical Extreme Value Theory relies on asymptotic assumptions that fail in the data-sparse, non-stationary regimes practitioners increasingly face. We introduce the Data-Driven Extreme Value Distribution (DDEVD), a non-parametric estimator that aggregates all observations metastatistically and reconstructs the base distribution with a kernel, removing parametric tail assumptions. We derive its optimal bandwidth and prove a stability law $m < C\,n^{1+\gamma/2}$ relating reliable extrapolation to the extreme value index $\gamma$. In sub-hourly Alpine precipitation, DDEVD recovers stable 100-year return levels from single decades (calibration ratio $0.96$), departing from the full-record reference by over $50\,\%$ in fewer than one window in fifty -- versus one in five for a GEV fit. In metallurgical micrographs, it matches a generalised extreme-value fit on the safety-relevant grain-size tail, where the standard log-normal over-predicts by $58\,\%$ at $1\,\mathrm{cm}^{2}$.

2606.12164 2026-06-11 stat.ME 新提交

Bayesian Effect Selection for Additive Quantile Regression with an Application to Air Pollution Thresholds

加性分位数回归的贝叶斯效应选择及其在空气污染阈值中的应用

Nadja Klein, Aaron Wei Qi Lee, Jorge Mateu

AI总结 提出一种贝叶斯效应选择方法,通过Demmler-Reinsch基展开正交分解加性效应的线性和非线性部分,并使用尖峰-板先验进行选择,应用于马德里空气污染数据分析,揭示极端NO2浓度的驱动因素。

详情
Comments
arXiv admin note: substantial text overlap with arXiv:2105.10890
AI中文摘要

空气污染监管限值通常以浓度阈值超标来定义,这些阈值自然与污染物分布的条件分位数相关,因此直接关系到严重污染事件的评估。同时,不仅要确定协变量是否影响空气污染,还要确定这种影响是线性、非线性还是两者兼有。我们通过开发加性分位数回归的贝叶斯效应选择方法来解决这些问题。虽然惩罚样条的常用混合模型表示(MMR)允许灵活的非线性效应,但它们不能提供线性和非线性效应成分的有意义分离。因此,我们采用Demmler-Reinsch基展开,将每个加性效应正交分解为线性和非线性部分,并从理论上证明两个效应成分可以一致估计。为了促进数据驱动的模型构建,我们提出贝叶斯效应选择,对与线性和非线性成分相关的标量重要性参数分别使用尖峰-板先验,并实现高效的Gibbs采样器。通过模拟研究,我们展示了该方法对非对称拉普拉斯工作似然引起的误设具有鲁棒性,并显示出相对于MMR的优越性能。在对西班牙马德里空气污染数据的详细分析中,我们强调了灵活建模极端二氧化氮(NO$_2$)浓度的附加价值,并揭示了阈值相关的污染水平受气候变量和交通相关空间结构的不同驱动。这些发现强调了需要先进的统计模型来支持短期决策,并帮助地方当局减轻或潜在防止NO$_2$浓度限值超标。

英文摘要

Air pollution regulatory limits are typically defined in terms of exceedances of concentration thresholds which are naturally related to conditional quantiles of the pollutant distribution and are therefore of direct relevance for assessing severe pollution events. At the same time, it is important to determine not only whether a covariate affects air pollution but also whether this effect is linear, nonlinear, or both. We address these issues by developing a Bayesian effect selection approach for additive quantile regression. While commonly used mixed model representations (MMRs) of penalized splines allow for flexible nonlinear effects, they do not provide a meaningful separation of linear and nonlinear effect components. We therefore employ a Demmler-Reinsch basis expansion, which yields an orthogonal decomposition of each additive effect into linear and nonlinear parts and show theoretically that both effect components can be estimated consistently. To facilitate data-driven model building, we propose Bayesian effect selection with separate spike and slab priors on the scalar importance parameters associated with the linear and nonlinear components and implement an efficient Gibbs sampler. Through simulation studies, we demonstrate robustness to the misspecification induced by the employed asymmetric Laplace working likelihood and show superior performance relative to the MMR. In a detailed analysis of air pollution data in Madrid, Spain we highlight the added value of flexibly modeling extreme nitrogen dioxide (NO$_2$) concentrations and reveal that threshold-relevant pollution levels are driven differently by climatological variables and traffic-related spatial structure. These findings underline the need for advanced statistical models that support short-term decision-making and help local authorities mitigate, or potentially prevent, exceedances of NO$_2$ concentration limits.

2606.12021 2026-06-11 stat.ME 新提交

Adaptive spatial blocking for scalable clustering inference with applications to high-throughput spatial proteomics

自适应空间分块用于可扩展聚类推断及其在高通量空间蛋白质组学中的应用

Mingyu Go, Julia Wrobel, Hoseung Song

AI总结 提出自适应空间分块算法,通过构造满足点计数和形状约束的局部块,利用渐近正态近似实现大规模点模式数据的快速聚类推断,平衡统计功效与计算效率。

详情
AI中文摘要

Ripley's K函数是一种广泛用于评估点模式聚类的空间汇总统计量。然而,现有的基于K的方法在处理大规模数据时计算成本高昂,特别是在高通量空间蛋白质组学中,因为它们依赖于图像中所有点的空间信息。为应对这一挑战,我们提出了一种计算高效的基于分块的测试框架,该框架从图像中提取不相交的局部块,并跨块聚合聚类证据。所提出的自适应空间分块算法构造满足点计数和形状约束的块,通过渐近正态近似实现可扩展的空间聚类推断和快速p值计算。数值研究表明,所提出的方法在统计功效和计算效率之间提供了良好的平衡。在健康人肠道空间蛋白质组学数据的应用中,我们的方法检测到浆细胞的强空间聚集以及浆细胞与巨噬细胞之间的共定位,同时在大图像上具有良好的可扩展性。

英文摘要

Ripley's K-function is a widely used spatial summary statistic for assessing clustering in point patterns. However, existing K-based methods can be computationally prohibitive for large-scale data, particularly in high-throughput spatial proteomics, because they rely on spatial information from all points in the image. To address this challenge, we propose a computationally efficient block-based testing framework that extracts disjoint local blocks from an image and aggregates clustering evidence across them. The proposed adaptive spatial blocking algorithm constructs blocks satisfying point-count and shape constraints, enabling scalable spatial clustering inference and fast p-value computation through an asymptotic normal approximation. Numerical studies demonstrate that the proposed method provides a favorable balance between statistical power and computational efficiency. In an application to healthy human intestine spatial proteomics data, our method detects strong spatial aggregation of plasma cells and colocalization between plasma cells and macrophages, while scaling favorably to large images.

2606.12015 2026-06-11 stat.ME 新提交

Introducing precision-weighted bias as a performance measure to inform the inclusion of adaptive designs in meta-analysis

引入精度加权偏倚作为性能度量以指导元分析中适应性设计的纳入

Martin Law (1 and 2), David S. Robertson (1), Sofia S. Villar (1), Tim P. Morris (3), Babak Choodari-Oskooei (4), Thomas Jaki (1 and 5), Ian R. White (4) ((1) Medical Research Council Biostatistics Unit, University of Cambridge, (2) Royal Papworth Hospital, Cambridge, (3) Statistical Methodology, Novartis Pharmaceuticals UK Ltd., (4) UCL Innovative Clinical Trials Unit, University College London, (5) Department of Machine Learning and Statistics, University of Regensburg, DE)

AI总结 提出精度加权偏倚作为新的统计性能指标,证明元分析中适应性设计的偏倚可忽略,建议将其作为模拟研究的标准补充。

详情
Comments
9 pages, 2 figures
AI中文摘要

我们提出一种新颖、直观的统计性能度量:精度加权偏倚。精度加权偏倚定义为估计量的无条件偏倚以其所含信息量(精度)加权。当前指南(如GRADE和CONSORT)常将适应性设计中潜在的偏倚增加视为系统综述中纳入此类设计的阻碍。然而,我们证明共同效应元分析中的偏倚近似等于其组成研究的精度加权偏倚的精度加权平均,而非其未加权无条件偏倚的平均。通过模拟研究,我们表明虽然适应性设计可能表现出未加权偏倚,但它们通常具有零精度加权偏倚。因此,纳入这些设计通常导致整体元分析偏倚的微小变化。这些结果表明,精度加权偏倚是决定是否将适应性设计纳入元分析的更优指标。我们建议在模拟研究中使用精度加权偏倚作为未加权无条件偏倚和条件偏倚的标准补充,以支持更具包容性和准确的证据合成。

英文摘要

We propose a novel, intuitive measure of statistical performance: precision-weighted bias. Precision-weighted bias is defined as the unconditional bias of an estimator weighted by the degree of information (precision) it contains. Current guidelines, such as GRADE and CONSORT, often view the potential for increased bias in adaptive designs as a deterrent for the inclusion of such designs in systematic reviews. However, we demonstrate that the bias in a common-effect meta-analysis is approximately equal to the precision-weighted average of the precision-weighted biases of its constituent studies, rather than of their unweighted unconditional biases. Through simulation studies, we show that while adaptive designs may exhibit unweighted bias, they frequently have zero precision-weighted bias. Consequently, including these designs often results in a negligible change to the overall meta-analysis bias. These results suggest that precision-weighted bias is a superior indicator for determining whether to include an adaptive design in a meta-analysis. We recommend that precision-weighted bias be used as a standard complement to unweighted unconditional and conditional bias in simulation studies to support more inclusive and accurate evidence synthesis.

2606.11962 2026-06-11 stat.ME q-fin.ST stat.CO 新提交

Composite likelihood inference of fractional Gaussian processes with sequentially optimal subset selection

具有顺序最优子集选择的分数高斯过程的复合似然推断

Mathis Fourreau, Matthieu Garcin

AI总结 针对分数高斯过程,提出通过顺序最大化Godambe信息来选择子集,以平衡估计精度与计算成本,并推导了Fisher信息和Godambe信息的理论表达式。

详情
AI中文摘要

复合似然方法通过考虑观测的几个子集而非全部来降低时间序列参数估计的计算成本。该方法的渐近性质与Godambe信息相关,Godambe信息是Fisher信息的扩展,考虑了观测子集之间的依赖性。我们旨在将该方法应用于线性高斯模型,特别是分数布朗运动和分数高斯噪声。我们推导了其Fisher信息和Godambe信息的理论表达式,并推导出一种顺序最大化Godambe信息的子集选择设计。子集的大小使我们能够控制估计精度与计算成本之间的权衡。通过模拟,我们将该方法与矩方法和最大似然估计进行比较,并将其应用于真实数据,即股票指数的波动率序列和风速时间序列。

英文摘要

The composite likelihood method reduces the computational cost of parameter estimation in time series by considering several subsets of observations instead of all observations at once. The asymptotic properties of this method are related to the Godambe information, an extension of the Fisher information that accounts for the dependence between subsets of observations. We aim to apply this method to linear Gaussian models, in particular fractional Brownian motion and fractional Gaussian noise. We derive theoretical expressions for their Fisher information and their Godambe information and deduce a subset selection design that sequentially maximizes the Godambe information. The size of the subsets then allows us to control the trade-off between estimation accuracy and computational cost. Through simulations, we compare this method with the method of moments and maximum likelihood estimation, and we apply it to real data, namely volatility series of a stock index and a wind speed time series.

2606.11933 2026-06-11 math.ST stat.ME 新提交

Testing axial symmetry in multivariate location-scale linear regression

多元位置尺度线性回归中的轴向对称性检验

Šárka Hudecová, Miroslav Šiman

AI总结 提出基于积分秩得分的检验方法,用于多元线性异方差回归中条件轴向对称性的检验,推导渐近分布,并通过模拟和实际数据验证。

详情
AI中文摘要

本文研究多元线性异方差回归框架下条件轴向对称性的检验问题。提出了一种基于积分秩得分的新检验,并推导了其渐近分布。所提出的方法将针对多元数据开发的类似程序扩展到回归设定中。该检验也可用于评估关于误差项分布特性的特定假设。通过一个小型模拟研究和实际经济数据说明了其性能和应用。本文还包含一些关于轴向对称性的理论结果,这些结果可能具有独立的意义。

英文摘要

The article deals with the problem of testing conditional axial symmetry within a~multivariate linear heteroscedastic regression framework. A new test based on integrated rank scores is introduced and its asymptotic distribution is derived. The proposed method extends a similar procedure developed for multivariate data to the regression setting. The test may also be employed to assess specific hypotheses concerning distributional properties of the error term. Its performance and application is illustrated in a small simulation study and with real economic data. The article also contains a few theoretical results regarding axial symmetry that may be of independent interest.

2606.11887 2026-06-11 stat.ME 新提交

Model-based sparse mixed-type PCA

基于模型的稀疏混合类型PCA

Lauri Heinonen, Joni Virta

AI总结 针对混合类型数据,提出一种基于矩估计的潜在协方差矩阵估计方法,实现稀疏主成分分析,并通过模拟和实际数据验证性能。

详情
AI中文摘要

本文提出了一种新的主成分分析方法,用于处理由连续、二元、整数和正连续变量组成的混合类型数据。假设数据来自一个概率模型,其中指数族分布的参数由一组共享的高斯潜在变量决定。所提出的方法MTPCA基于通过矩估计来估计这些潜在混合物的协方差矩阵。提出了一种稀疏化成分载荷的方法,并与经典稀疏PCA理论一致。我们提出了一种估计主成分得分的策略,并讨论了潜在维度的选择。通过模拟混合类型数据研究了该方法的性能,并在由二元动物特征组成的Zoo数据集上展示了该模型。

英文摘要

This work presents a new method for principal component analysis (PCA) of a mixed-type data consisting of continuous, binary, integer-valued and positive continuous variables. The data are assumed to come from a probability model, where the parameters of the exponential family distributions are determined by a set of shared Gaussian latent variables. The proposed method, MTPCA, is based on estimating the covariance matrix of these latent mixtures through the method of moments. A way to sparsify the component loadings is presented and aligns with the classical theory of sparse PCA. We propose a strategy for estimating the principal component scores and discuss the choice of the latent dimension. The method's performance is studied with a simulated mixed-type data and we illustrate the model on the Zoo data set consisting of binary animal characteristics.

2606.11876 2026-06-11 q-bio.QM cs.LG stat.ME 新提交

Seeing Below the Limit of Detection: A Censored-Poisson Bayesian Latent-Growth Change-Point Detector (the Span Detector) for Serial ctDNA in HR+/HER2- Metastatic Breast Cancer

检测限以下:用于HR+/HER2-转移性乳腺癌连续ctDNA的删失泊松贝叶斯潜在增长变点检测器(Span检测器)

Aarchi Singh Thakur, Abhijoy Sarkar

AI总结 提出Span检测器,利用删失泊松贝叶斯潜在增长变点模型处理ctDNA非检测作为左删失观测,通过序贯广义似然比统计量检测变异检测率上升点,在10%假警报率下将提前三个月捕获进展的比例从11%提升至25%。

详情
Comments
9 pages, 4 figures, 2 tables. Code and synthetic data generator: this https URL
AI中文摘要

循环肿瘤DNA(ctDNA)在影像学显示耐药性数月前就已携带证据,但最早证据存在于检测限(LoD)以下:新生亚克隆仅被间歇性检测到,产生微弱检测和非检测的闪烁序列。商业液体活检将每次抽取视为独立快照,并将非检测视为无信号。我们认为非检测是左删失观测,而随时间变化的非检测和微弱检测模式在单个值可信之前就携带了可操作的生长证据。我们引入Span,一种删失泊松贝叶斯潜在增长变点检测器,它对二元检测过程建模,为每个变异的检测率累积一个向上变点的序贯广义似然比统计量,并以校准的假警报控制发出竞争风险警报。Span没有学习权重,因此没有过拟合风险。在一线CDK4/6抑制剂联合内分泌治疗的HR+/HER2-转移性乳腺癌合成队列中,在匹配的10%假警报率下,Span将提前三个月捕获的即将进展比例大约翻倍(惰性出现:25% vs 快照的11%),具有可证伪的剂量反应:对惰性出现效果显著,对快速出现效果消失。值轨迹基线表现与快照相同,将增益归因于删失检测模型。生存主干在真实乳腺癌数据(GBSG-2,n=686;C指数0.67 vs 0.68)上与Cox基线匹配,在具有清洁生物标志物的真实纵向队列(PBC2,n=312)上,同一管道正确拒绝获胜,这是一个可证伪的边界测试,确认机制是特定于状态的。所有ctDNA轨迹均为合成数据。

英文摘要

Circulating-tumour DNA (ctDNA) carries evidence of drug resistance months before imaging shows it, but the earliest evidence lives below the assay's limit of detection (LoD): a nascent subclone is detected only intermittently, producing a flickering sequence of faint detects and non-detects. Commercial liquid biopsies treat each draw as an independent snapshot and a non-detect as nothing. We argue a non-detect is a left-censored observation, and the pattern of non-detects and faint detects over time carries actionable evidence of growth before any single value is trustworthy. We introduce Span, a censored-Poisson Bayesian latent-growth change-point detector that models the binary detection process, accumulates a sequential generalised-likelihood-ratio statistic for an upward change-point in the per-variant detection rate, and raises a competing-risks alarm with calibrated false-alarm control. Span has no learned weights, so there is nothing to overfit. On a synthetic cohort of HR+/HER2- metastatic breast cancer on first-line CDK4/6-inhibitor plus endocrine therapy, at a matched 10% false-alarm rate, Span roughly doubles the fraction of impending progressions caught three months ahead (indolent regime: 25% vs 11% for the snapshot), with a falsifiable dose-response: large for indolent emergence, vanishing for fast emergence. A value-trajectory baseline performs identically to the snapshot, isolating the gain to the censored detection model. The survival backbone matches a Cox baseline on real breast-cancer data (GBSG-2, n=686; C-index 0.67 vs 0.68), and on a real longitudinal cohort with clean biomarkers (PBC2, n=312) the same pipeline correctly declines to win, a falsifiable boundary test confirming the mechanism is regime-specific. All ctDNA trajectories are synthetic.

2606.11768 2026-06-11 stat.ME stat.AP 新提交

Hierarchical excitatory processes for modelling event-time data in the presence of exogenous stimuli

外源刺激下事件时间数据建模的分层激发过程

Francesco Sanna Passino, Nicholas A. Heard, Jeffrey W. Brown, William N. Frost, Vince P. Lyzinski

AI总结 提出分层激发过程(HEP)模型,通过动态演化核函数叠加外源刺激的激发效应,实现对重复刺激下事件时间数据的灵活建模,并嵌入聚类框架识别潜在响应模式。

详情
AI中文摘要

我们引入了分层激发过程(HEP),一种用于在重复外部刺激下观察到的事件时间数据的灵活点过程模型。所提出的框架将点过程的条件强度建模为外部刺激引起的激发效应的叠加,其特征由参数随时间动态演化的核函数刻画。这种分层结构使得能够跨重复刺激调节激发强度,提供了一种可解释的结构。我们为所提出的模型建立了基于似然的推断,并将HEP嵌入到基于模型的聚类框架中,以识别具有相似响应动态的潜在组。模拟研究证明了该模型恢复演化潜在模式的能力,而对海蛞蝓足神经节尖峰序列记录的应用展示了HEP如何能够在不同实验条件下表征重复刺激下神经元的刺激驱动兴奋性。

英文摘要

We introduce the Hierarchical Excitatory Process (HEP), a flexible point process model for event-time data observed under repeated external stimuli. The proposed framework models the conditional intensity of a point process as a superposition of excitation effects induced by external stimuli, characterised by kernels with parameters dynamically evolving over time. This hierarchical construction enables modulation of excitation strength across repeated stimuli, providing an interpretable structure. We establish likelihood-based inference for the proposed model and embed HEP within a model-based clustering framework to identify latent groups sharing similar response dynamics. Simulation studies demonstrate the model's ability to recover evolving latent patterns, and an application to spike train recordings from the sea slug Aplysia pedal ganglion illustrates how HEPs are able to characterise stimulus-driven excitability of neurons across repeated stimulation under different experimental conditions.

2606.11715 2026-06-11 stat.ME 新提交

Bracketing Relationships of Weighted Average Treatment Effects

加权平均处理效应的括号关系

Pengfei Tian, Fan Yang, Peng Ding

AI总结 在因果推断的观测研究规范设定下,证明了在倾向得分与条件平均处理效应满足单调关系时,重叠权重的平均处理效应介于处理组和对照组的平均处理效应之间,并推广到加权局部平均处理效应及其他权重,建议使用CP图。

详情
AI中文摘要

在因果推断的观测研究规范设定下,我们证明了在倾向得分与条件平均处理效应之间存在单调关系时,重叠权重(权重与给定协变量下处理的条件方差成比例)下的平均处理效应介于处理组和对照组的平均处理效应之间。我们进一步将结果推广到具有二元工具变量和二元处理的规范设定下的加权局部平均处理效应。我们还将结果推广到其他权重。基于该理论,我们建议绘制估计的条件平均处理效应与估计的倾向得分的“CP图”。

英文摘要

Under the canonical setting of observational studies for causal inference, we show that the average treatment effect under the overlap weight, the weight that is proportional to the conditional variance of the treatment given the covariates, is bounded between the average treatment effects on the treated and control, under a monotonic relationship between the propensity score and the conditional average treatment effect. We further extend the result to weighted local average treatment effects, under the canonical setting with a binary instrumental variable and a binary treatment. We also extend the results to other weights. Based on the theory, we recommend the ``CP-plot'' of the estimated conditional average treatment effect against the estimated propensity score.

2606.11624 2026-06-11 stat.ME 新提交

The Triply-Randomized Negative Binomial Beta for Robust Regression and Conjugate Models of Bounded Support Data

三重随机负二项贝塔分布用于鲁棒回归和有界支持数据的共轭模型

Jimmy Lederman, Aaron Schein

AI总结 提出三重随机负二项贝塔分布(TNBbeta),通过随机化标准贝塔分布的参数,解决其对异常值敏感、无法处理零观测及缺乏共轭先验的问题,并利用Pólya-gamma增广实现高效吉布斯采样。

详情
AI中文摘要

贝塔分布是许多响应变量支持为$[0,1]$的回归问题中默认的似然函数选择,尽管它对异常值敏感、无法处理精确为零的观测值,并且缺乏闭式共轭先验。我们通过引入三重随机负二项贝塔分布(记为$\mathrm{TNBbeta}(p,\\,q,\\,\varepsilon)$)来解决这些缺陷,该分布由中位数$p$、浓度参数$q$和允许在$0$和$1$处具有正密度的边界参数$\varepsilon$参数化。TNBbeta通过随机化标准贝塔分布的参数(使用三个相依的负二项随机变量)得到,我们证明了每个随机变量的完全条件分布本身也是负二项分布。此外,将$p$和$q$与具有logit链接函数的高斯潜变量连接,通过Pólya-gamma增广得到闭式更新。这些性质共同为有界支持数据的回归模型提供了简单的辅助变量吉布斯采样器,在有效样本量每秒和留一预测方面通常优于标准贝塔回归方法,尤其是在存在异常值的情况下。在森林冠层覆盖度的案例研究中,我们证明了该框架可以轻松融入空间结构和精确零观测。总体而言,这项工作大大扩展了可高效拟合的$[0,1]$有界支持数据的贝叶斯模型类别。

英文摘要

The beta distribution is the default choice of likelihood in many regression problems with a $[0,1]$-bounded support response despite its sensitivity to outliers, inability to accommodate exact zero observations, and a lack of closed-form conjugate priors. We address these shortcomings by introducing the triply-randomized negative binomial beta distribution, denoted $\mathrm{TNBbeta}(p,\,q,\,\varepsilon)$, parameterized by a median $p$, concentration parameter $q$, and boundary parameter $\varepsilon$ which permits positive density at $0$ and $1$. The TNBbeta arises by randomizing the parameters of a standard beta distribution with three dependent negative binomial random variables, each of whose complete conditional distribution we show is itself negative binomial. Moreover, connecting $p$ and $q$ to Gaussian latent variables with logit link functions yields closed-form updates via Pólya-gamma augmentation. Together, these properties yield simple auxiliary-variable Gibbs samplers for regression models of bounded-support data, which often outperform standard beta regression approaches in terms of effective sample size per second and held-out prediction, especially in the presence of outliers. In a case study of forest canopy cover, we demonstrate that this framework can easily incorporate spatial structure and exact zero observations. Overall, this work substantially expands the class of Bayesian models for $[0,1]$-bounded support data that can be fit efficiently.

2606.11570 2026-06-11 stat.ML cs.LG stat.ME 新提交

Enhancing Spectral Embedding through Robust and Flexible Knowledge Transfer in Electronic Health Records

通过电子健康记录中的鲁棒且灵活的知识迁移增强谱嵌入

Feiqing Huang, Zongqi Xia, Rong Ma, Tianxi Cai

AI总结 提出一种基于谱的无监督表示学习框架,通过从更广泛人群提取知识矩阵并放松信号对齐假设,为罕见病队列生成低维嵌入,在模拟和真实多发性硬化症数据中优于现有方法。

详情
AI中文摘要

我们提出了一种基于谱的无监督表示学习框架,用于从电子健康记录中为罕见病队列的临床概念和患者导出低维嵌入,其中数据是高维的但样本量有限。为了克服这一挑战,我们引入了一个从更广泛人群中提取的知识矩阵,该矩阵与罕见病队列共享部分重叠的子空间。我们的方法不同于现有方法,它放松了潜在数据矩阵和知识矩阵之间严格的一对一信号对齐假设,允许更灵活和现实的结构化共享形式。我们引入了一种新颖的两步谱嵌入过程:首先,我们从知识矩阵中识别并移除不相关的成分;然后,我们应用基于投影的方法分别恢复共享和异质成分。模拟和对真实世界多发性硬化症队列的分析表明,所提出的方法优于竞争方法,特别是在共享信号较弱且仅部分对齐的挑战性场景中,这在罕见病数据中很常见。

英文摘要

We propose a spectral-based, unsupervised representation learning framework to derive low-dimensional embeddings for clinical concepts and patients in rare disease cohorts from electronic health records, where data are high-dimensional but sample sizes are limited. To overcome this challenge, we incorporate a knowledge matrix extracted from a broader population that shares a partially overlapping subspace with the rare-disease cohort. Our method departs from existing approaches by relaxing restrictive one-to-one signal-alignment assumptions between the latent data matrix and knowledge matrix, allowing more flexible and realistic forms of structured sharing. We introduce a novel two-step spectral embedding procedure: first, we identify and remove irrelevant components from the knowledge matrix; then, we apply a projection-based method to separately recover shared and heterogeneous components. Simulations and an analysis of a real-world multiple sclerosis cohort show that the proposed method outperforms competing approaches, particularly in challenging scenarios where shared signals are weak and only partially aligned, as is common in rare-disease data.

2606.11548 2026-06-11 stat.ME 新提交

Estimating the local false discovery rate under an unknown symmetric null

在未知对称零假设下估计局部错误发现率

Daniel Xiang, William Fithian, Nikolaos Ignatiadis, Jake A. Soloff, Asaf Weinstein

AI总结 针对零分布仅对称于零的双组模型,提出基于逻辑回归和自然三次样条的局部错误发现率估计方法,并证明该估计可渐近控制多重检验的局部错误发现率。

详情
AI中文摘要

本文关注在双组模型中估计局部错误发现率(lfdr),其中关于零分布的唯一假设是它关于零对称。我们的动机来自当代多重假设检验框架,特别适用于变量选择问题,该框架将任何用户指定的分数转换为统计量,其零分布关于零对称,而非零分布通常预期在零右侧富集。虽然现代方法如knockoff滤波器(Barber and Candes; 2015)能够利用零性质来控制错误发现率(FDR),但一个更合适的目标是针对被拒绝的假设控制局部错误发现率,如Soloff等人(2024)所提出的,其中分析了标准的双组模型(已知$f_0$和独立性)。在这里,我们朝这个方向迈出一步,提出通过针对替代密度比$f(-w)/f(w)$($w>0$)来估计lfdr,其中$f$是上述“简化”双组模型中的边际密度。我们研究了几种估计量,并提出了一种基于自然三次样条基的逻辑回归方法。我们还证明了该替代的任何一致估计量都能使以名义水平阈值估计的多重检验过程渐近控制lfdr。

英文摘要

This paper is concerned with estimating the local false discovery rate (lfdr) in a two-groups model where the only assumption regarding the null distribution is symmetry about zero. Our motivation comes from the contemporary framework for multiple hypothesis testing, particularly relevant in variable selection problems, which transforms any user-specified scores into statistics whose null distributions are symmetric about zero, whereas enrichment to the right of zero is generally expected for the non-nulls. While modern methods such as the knockoff filter (Barber and Candes; 2015) are able to exploit the null property for controlling the false discovery rate (FDR), an arguably more appropriate goal is to target control of the local false discovery rate for the rejected hypotheses, as proposed in Soloff et al. (2024) where the standard two-groups model (known $f_0$ and independence) is analyzed. Here, we take a step in this direction and propose to estimate the lfdr by targeting the surrogate density ratio $f(-w)/f(w)$, for $w>0$, where $f$ is the marginal density in the aforementioned ``stripped-down'' two-groups model. We study several estimators and propose a logistic regression based method with natural cubic spline basis. We also show that any consistent estimator of this surrogate yields asymptotic lfdr control of the multiple testing procedure that thresholds the estimate at the nominal level.

2606.11526 2026-06-11 stat.ME econ.EM 新提交

What is the Long-Term Value of Reliability?

可靠性的长期价值是什么?

Chenyu Qiu, Xu Kuang, Inessa Liskovich, Ali Rauh, Stefan Wager

AI总结 提出Chronos LTV系统,利用马尔可夫决策过程建模客户交互,通过协变量平衡算法估计延迟率对业务指标的长期影响。

详情
AI中文摘要

我们描述了Chronos LTV,一个用于衡量延迟和其他服务缺陷对关键业务指标长期影响的系统。我们使用马尔可夫决策过程对客户随时间推移的交互进行建模,并将我们的目标估计量形式化为相对于移动平均延迟率的边际政策效应。在此设定下,我们表明,在给定观察到的订单特征的情况下,延迟在顺序无混淆假设下(即延迟近似随机)可以识别长期效应;并且可以使用简单的协变量平衡算法来估计这些效应。

英文摘要

We describe Chronos LTV, a system to measure the long-term impact of delays and other service defects on key business metrics. We use Markov decision processes to model customer interactions over time, and formalize our target estimand as the marginal policy effect with respect to moving the average delay rate. Given this setup, we show that we can identify long-term effects under a sequential unconfoundedness assumption where delays are as good as random given observed order characteristics; and can estimate these effects using a simple covariate-balancing algorithm.

2606.11439 2026-06-11 stat.ME 新提交

A Likelihood Ratio Testing Approach for Interval-Censored Data

区间删失数据的似然比检验方法

Yuan Wu, Susan Halabi

AI总结 针对区间删失数据,提出基于样条筛的稳健似然比检验,解决Wald检验在小样本中的不稳定性,理论推导渐近分布,模拟和实例验证其优越性。

详情
AI中文摘要

区间删失数据在临床研究中经常出现,其中事件时间仅已知落在特定的评估窗口内。尽管Cox比例风险模型是处理此类数据的标准方法,但现有的Wald型检验在小样本中常常存在不稳定性或性能较差。在本文中,我们提出了一种基于样条筛的稳健似然比检验,用于区间删失数据。我们开发了一个计算高效的估计框架,确保了数值稳定性。此外,我们严格建立了所提出的似然比统计量的渐近分布,为统计推断提供了坚实的理论基础。广泛的模拟研究表明,与传统方法相比,我们的方法实现了更优的错误控制和更高的功效。通过一个真实临床数据集的分析,进一步说明了该方法的实用性。

英文摘要

Interval-censored data frequently arise in clinical research where event times are only known to fall within specific assessment windows. Although the Cox proportional hazards model is a standard approach for such data, existing Wald-type tests often suffer from instability or poor performance in small samples. In this paper, we propose a robust spline-sieve-based likelihood ratio test for interval-censored data. We develop a computationally efficient estimation framework that ensures numerical stability. Furthermore, we rigorously establish the asymptotic distribution of the proposed likelihood ratio statistic, providing a solid theoretical foundation for statistical inference. Extensive simulation studies demonstrate that our approach achieves superior error control and higher power compared with traditional approaches. The practical utility of the method is further illustrated through the analysis of a real-world clinical dataset.

2606.11421 2026-06-11 stat.ME math.ST stat.CO 新提交

Second-Order Least Squares as a Special Case of the Polynomial Maximization Method

二阶最小二乘法作为多项式最大化方法的特例

Serhii Zabolotnii

AI总结 证明在条件同方差非高斯误差下,最优加权二阶最小二乘法与二次广义多项式最大化方法等价,并揭示高阶效率储备。

详情
Comments
26 pages, 3 figures, 7 tables. Includes Lean 4 formal verification and Monte Carlo simulation
AI中文摘要

我们证明,对于具有条件同方差非高斯误差的线性回归,最优加权二阶最小二乘法(SLS)与二次广义多项式最大化方法(PMM)是相同的总体估计方程:它们选择前两个中心残差矩的最优线性组合,求解同一个总体正规方程组,共享同一个影响函数,并达到相同的渐近方差 $c_2g_2/N$——普通最小二乘斜率方差因子 $c_2$ 乘以 PMM 方差缩减系数 $g_2=1-\gamma_3^2/(2+\gamma_4)$(其中 $\gamma_3,\gamma_4$ 为误差偏度和超额峰度)。因此,可行的插件实现是一阶等价的,仅存在高阶有限样本差异。这一等价性是尖锐的:在异方差下,无条件 PMM 主体与条件 SLS 加权分离,导致对称误差的效率损失和不对称误差的一致性损失。在二次以上,PMM 拥有 SLS 在其二阶矩范围内无法达到的效率储备。对于对称的尖峰误差,SLS 退化为普通最小二乘法估计斜率,而三次 PMM 通过闭式系数 $g_3$ 利用 SLS 矩范围之外的峰度信息;对于典型非对称分布,在三次多项式矩类中,这一储备为 $30$--$50\\%$。Lean 4 开发环境机器检验了特定次数的代数核心——$g_2$ 和 $g_3$ 的闭式、$g_2\le1$ 结果、设计抵消和对称退化——而一般单调性 $g_{S+1}\le g_S\le1$ 通过嵌套分析证明。蒙特卡洛研究说明了等价性、储备和异方差边界在有限样本中的表现。

英文摘要

We prove that optimally weighted second-order least squares (SLS) and the degree-two generalized polynomial maximization method (PMM) are the same population estimating equation for linear regression with conditionally homoskedastic non-Gaussian errors: they choose the same optimal linear combination of the first two centered residual moments, solve one population normal system, share one influence function, and attain the common asymptotic variance $c_2g_2/N$ -- the ordinary-least-squares slope-variance factor $c_2$ scaled by the PMM variance-reduction coefficient $g_2=1-\gamma_3^2/(2+\gamma_4)$ (with $\gamma_3,\gamma_4$ the error skewness and excess kurtosis). Feasible plug-in implementations are therefore first-order equivalent, with only higher-order finite-sample differences. The identity is sharp: under heteroskedasticity the unconditional PMM body and the conditional SLS weighting separate, costing efficiency for symmetric errors and consistency for asymmetric errors. Beyond degree two, PMM holds an efficiency reserve that SLS cannot reach within its second-moment span. For symmetric platykurtic errors SLS collapses to ordinary least squares for the slope, while degree-three PMM exploits kurtosis information outside the SLS moment span through a closed-form coefficient $g_3$; for canonical asymmetric laws this reserve is $30$--$50\%$ within the degree-three polynomial moment class. The Lean 4 development machine-checks the degree-specific algebraic core -- the closed forms for $g_2$ and $g_3$, the $g_2\le1$ result, the design cancellations, and the symmetric collapse -- while the general monotonicity $g_{S+1}\le g_S\le1$ is proved analytically by nesting. A Monte Carlo study illustrates the equivalence, the reserve, and the heteroskedastic boundary at finite samples.

2606.11414 2026-06-11 stat.ME 新提交

Group Sequential Sample Size for Comparing Two Survival Probabilities at a Specific Time Point

比较特定时间点两个生存概率的组序贯样本量

Susan Halabi, Lu Liu, Chenxi Yu, Yuan Wu

AI总结 提出一种新方法,在固定和组序贯试验设计中同时确定检验两个生存概率的样本量,控制I类错误,适用于比例风险假设不成立或含新辅助治疗的随机试验。

详情
AI中文摘要

我们提出了一种新方法,该方法在固定和组序贯试验设计中同时确定检验两个预先指定时间点的生存概率所需的样本量,同时保证I类错误控制。在不同假设差异、失效分布、删失比例和名义功效下的模拟显示出一致的性能,而中期分析则突出了每次分析时降低的I类错误和增加的功效,无论潜在的失效时间分布或花费函数如何。重要的是,我们的方法特别适用于评估随机试验中固定时间的生存结局,其中一组治疗包括术前新辅助治疗,而另一组仅进行手术。此外,当比例风险假设不满足时,该方法也具有优势,这常见于具有延迟或时变治疗效果或生存曲线交叉的免疫治疗试验中。该方法也适用于随机II期试验,其中较小的样本量和中间或替代时间至事件终点的使用要求高效的数据利用和稳健的错误控制。我们通过肾癌和前列腺癌的激励性例子说明了该方法。附带的R Shiny应用程序使研究者能够交互式地计算样本量,从而促进不同环境下的实际试验规划。

英文摘要

We propose a novel method that simultaneously determines the sample size for testing two survival probabilities at a pre-specified ltime while guaranteeing type I error control in both fixed and group-sequential trial designs. Simulations across varying hypothesized differences, failure distributions, censoring proportions, and nominal powers demonstrate consistent performance, while interim analyses highlight reduced type I error and increased power at each look, regardless of the underlying failure time distribution or spending function. Importantly, our method is especially useful for evaluating survival outcomes at a fixed time in randomized trials where one treatment arm includes neoadjuvant therapy prior to surgery while the other involves surgery alone. Furthermore, it is advantageous when the proportional hazards assumption is not satisfied, as often occurs in immunotherapy trials with delayed or time-varying treatment effects or crossing survival curves. The method is also applicable to randomized phase II trials, where smaller sample sizes and the use of intermediate or surrogate time-to-event endpoints demand efficient data use and robust error control. We illustrate the approach with motivating examples in renal and prostate cancer. An accompanying R Shiny application enables investigators to compute sample sizes interactively, facilitating practical trial planning in diverse settings.

2606.11405 2026-06-11 stat.ME stat.AP 新提交

Bayesian Causal Machine Learning for Cure Models

治愈模型的贝叶斯因果机器学习

Antonio R. Linero, F. Javier Rubio, Piyali Basak

AI总结 针对治愈模型中治疗对治愈概率和未治愈患者生存时间的不同影响,提出贝叶斯因果机器学习方法BartCure,分解受限平均生存时间的因果效应,并在乳腺癌试验中验证其有效性。

详情
AI中文摘要

在生存研究中,治疗可以通过不同机制使患者受益:治疗可能增加治愈的概率,或延迟未治愈患者的失败时间。量化哪种机制占主导地位,以及它是否在不同亚群中变化,具有临床重要性,但因果机器学习文献中针对此问题的研究有限。标准的因果生存学习器针对有限时间生存或受限平均生存时间,而许多治愈模型在未估计因果效应的情况下捕捉治愈结构。在这项工作中,我们在存在治愈亚群的情况下定义了有意义的因果效应,并引入了BartCure,一种用于估计这些效应的贝叶斯因果机器学习方法。我们推荐的因果效应将受限平均生存时间的因果效应分解为随机治愈和随机潜伏期成分,并将这些新效应与随机干预效应和主层中的因果效应联系起来。在模拟中,BartCure在估计平均效应方面具有竞争力,并且在保守地检测治疗效应异质性的方向方面特别有效。我们将BartCure应用于CALGB 40101乳腺癌试验,以估计平均和亚组因果效应,并识别治疗效应异质性。

英文摘要

In survival studies, treatments can benefit patients through different mechanisms: a treatment may increase the probability of being cured or delay failure among patients who are not cured. Quantifying which mechanism is dominant, and whether it varies across subpopulations, is clinically important, yet there is limited work in the causal machine learning literature addressing this problem. Standard causal survival learners target finite-horizon survival or restricted mean survival time, while many cure models capture cure structures without estimating causal effects. In this work, we define meaningful causal effects in the presence of a cured subpopulation and introduce BartCure, a Bayesian causal machine learning approach for estimating them. The causal effects we recommend decompose the causal effect on restricted mean survival time into a stochastic cure and stochastic latency component, and we relate these new effects to both stochastic intervention effects and causal effects in principal strata. In simulations, BartCure is competitive for estimating average effects and is especially effective at conservatively detecting the direction of treatment-effect heterogeneity. We apply BartCure to estimate average and subgroup causal effects and to identify treatment effect heterogeneity in the CALGB 40101 breast cancer trial.

2606.11235 2026-06-11 cs.LG cs.DB stat.ME 新提交

Few-Shot Resampling for Scalable Statistically-Sound Data Mining

少样本重采样:可扩展的统计可靠数据挖掘

Leonardo Pellegrina, Fabio Vandin

发表机构 * Department of Information Engineering, University of Padova(帕多瓦大学信息工程系)

AI总结 提出FewRS方法,基于重采样评估数据挖掘结果的统计显著性,通过推导新的上界偏差界,仅需极少量重采样数据集即可保证假发现概率,显著提升可扩展性。

详情
Comments
Accepted to KDD 2026
AI中文摘要

知识发现的一个关键步骤是评估数据挖掘结果。在包括模式挖掘、图分析等多个应用中,此步骤包括评估结果的统计显著性,以避免仅由噪声或数据随机波动导致的虚假发现。虽然针对某些特定应用已经开发了专门程序,但基于重采样的方法被广泛使用,尤其是在无法推导解析结果的复杂分析中。然而,当前基于重采样的方法需要生成和分析数千个重采样数据集,因此对于大型数据集或计算密集型分析不实用。本文中,我们介绍了FewRS,一种简单有效的基于重采样的方法,用于评估数据挖掘结果的统计显著性,并对错误发现概率提供严格保证。我们的方法可应用于任何使用重采样方法的情况。FewRS基于我们对表示数据挖掘结果质量的检验统计量的上确界偏差推导出的新界。我们证明FewRS需要生成和分析极少数量的重采样数据集,从而得到高度可扩展且广泛适用的方法。我们在常见任务(如模式挖掘和网络分析)上测试了我们的方法。在所有情况下,与现有技术相比,我们的方法在运行时间上减少了多达两个数量级,同时保持高统计功效,使得能够在大型真实世界数据集上对数据挖掘结果进行统计验证。

英文摘要

A key step in knowledge discovery is the evaluation of data mining results. In several applications, including pattern mining, graph analysis, and others, this step includes the evaluation of the statistical significance of the results, to avoid spurious discoveries due only to noise or random fluctuations in the data. While specialized procedures have been developed for some specific applications, resampling-based approaches are widely used, in particular for complex analyses where analytical results cannot be derived. However, current resampling-based approaches require the generation and analysis of thousands of resampled datasets, and are therefore impractical for large datasets or computationally intensive analyses. In this paper, we introduce FewRS, a simple and effective resampling-based approach to assess the statistical significance of data mining results with rigorous guarantees on the probability of false discoveries. Our approach can be used in every situation where resampling-based approaches are applied. FewRS builds on our derivation of a novel bound to the supremum deviation of test statistics representing the quality of data mining results. We prove that FewRS needs to generate and analyze an extremely small number of resampled datasets, leading to a highly scalable approach with wide applicability. We test our approach on common tasks such as pattern mining and network analysis. In all cases, our approach results in a reduction of up to two orders of magnitude in running time compared to the state of the art, while preserving high statistical power, enabling the statistical validation of data mining results on large-scale real-world datasets.

2606.01854 2026-06-11 stat.ME 版本更新

A Uniform Improvement of the Benjamini-Hochberg Procedure via e-Closure

使用e-闭包对Benjamini-Hochberg方法的统一改进

Jelle Goeman

AI总结 提出closed BH方法,基于e-闭包原理统一改进BH程序,在相同假设下不减少拒绝但增加功效,尤其当假零假设数量大时。

详情
AI中文摘要

本文提出了closed BH,这是Benjamini和Hochberg(BH)的假发现率控制方法的一种统一改进。Closed BH在BH相同的子集正回归依赖(PRDS)假设下有效。作为一种统一改进,closed BH从不比BH拒绝更少的假设,但它可能拒绝更多。功效的增加尤其当假零假设数量大时观察到。该新方法是使用e-闭包原理构建的,这是最近推导出的多重检验的一般原理。

英文摘要

This paper presents closed BH, a uniform improvement of the False Discovery Rate controlling method of Benjamini and Hochberg (BH). Closed BH is valid under the same assumption of Positive Regression Dependency on a Subset (PRDS) as BH, but also under an alternative and weaker minimal sufficient condition. As a uniform improvement, closed BH never rejects fewer hypotheses than BH, but it may reject quite a few more. An increase in power is observed especially when the number of false null hypotheses is large. The novel method is constructed using the e-Closure principle, a recently derived general principle for multiple testing. The method is implemented in the eClosure package in R.

2606.01650 2026-06-11 q-fin.PM q-fin.TR stat.AP stat.ME 版本更新

Post Selection Estimation of Sharpe Ratios

夏普比率的事后选择估计

Steven E. Pav

AI总结 针对从众多资产中选择具有最高样本内夏普比率的资产,研究基于多面体引理、James-Stein收缩、期望最大夏普比率去偏、阈值法和经验贝叶斯的估计器,并通过模拟评估其偏差、均方根误差和秩相关性。

详情
AI中文摘要

我们考虑估计一个资产的真实夏普比率的问题,该资产因在众多资产中具有最高的样本内夏普比率而被选中。我们讨论了基于多面体引理、James-Stein收缩、期望最大夏普比率去偏、阈值法和经验贝叶斯的估计器。我们在模拟中测试了这些估计器,计算了不同样本量、资产数量以及总体夏普比率的分布范围和形状下的偏差和均方根误差。我们还计算了估计器与潜在真实值的秩相关性,模拟了这些估计器如何用于比较或排序执行此选择过程的不同团队的结果。我们发现James-Stein估计器在相关参数的许多不同实际值下提供了最佳性能,其次是Jiang和Zhang的GMLEB估计器。这些结果对资产收益的相关性相当稳健,但有一些注意事项。

英文摘要

We consider the problem of estimating the true Sharpe ratio of an asset selected for having the highest observed in-sample Sharpe ratio among many assets. We discuss estimators based on the polyhedral lemma, James Stein shrinkage, debiasing the expected maximum Sharpe ratio, thresholding and empirical Bayes. We test these estimators in simulations, computing bias and root mean square error across different values of sample size, number of assets, and spread and shape of population Sharpe ratios. We also compute rank correlation of the estimators against the underlying quantity, simulating how these estimators might be used to compare or rank the output of different teams which perform this selection process. We find that the James Stein estimator provides the best performance across many different realistic values of the relevant parameters, followed by the GMLEB estimator of Jiang and Zhang. These results are fairly robust to correlation of asset returns, with some caveats.

2605.21641 2026-06-11 stat.ME stat.CO 版本更新

Stable direct estimation for GPLSIAMs using P-splines with dynamically updated boundaries

使用动态更新边界的P样条实现GPLSIAMs的稳定直接估计

Danilo V. Silva, Gilberto A. Paula

AI总结 本文提出了一种稳定直接估计GPLSIAMs的方法,通过使用模型矩阵和惩罚完全鱼尔信息矩阵动态更新单指数协变量的边界,在统一的迭代框架中实现快速计算有效自由度和点wise置信区间。

详情
AI中文摘要

广义部分线性单指数加法模型(GPLSIAMs)因其在功能灵活性与参数维度缩减之间的平衡而被广泛应用于不同领域。然而,估计过程面临严重的计算挑战。本文介绍了一种新的稳定方法,利用每个单指数效应的模型矩阵,定义为其单指数系数,并通过惩罚完全鱼尔信息矩阵动态更新单指数协变量的边界,以统一的迭代框架实现。推导出的模型矩阵使得能够快速计算估计的有效自由度和单指数效应的点wise置信区间。通过广义Fellner-Schall方法将平滑参数更新整合到迭代过程中,从而提供对全局惩罚优化问题的高效近似。在中等样本量和非高斯分布下的模拟研究证实了估计在多个场景下的经验一致性。值得注意的是,所提出的方法在最先进竞争方法无法恢复真实单指数系数和非线性函数的稳定情况下仍保持稳定,并且在计算最密集的场景中比常规两步方法快80.13倍。通过应用于Capital Bike Sharing数据集,展示了该方法的建模优势,其中处理每年的单指数交互效应,具有不同的单指数系数和复杂的结构,使得竞争方法不适用。所提出的方法在R中实现,提供了可重复和透明的比较功能。

英文摘要

Generalized partially linear single-index additive models (GPLSIAMs) have been increasingly applied across diverse areas due to their versatility in integrating functional flexibility with parametric dimension reduction while maintaining interpretability. However, the estimation presents severe computational challenges. This paper introduces a novel stable method that uses the model matrix for each single-index effect, defined by its single-index coefficients, and the penalized complete Fisher information matrix to dynamically update the boundaries of the single-index covariates within a unified iterative framework. The derived model matrices enable the fast computation of the estimated effective degrees of freedom and pointwise confidence bands for the single-index effects. The smoothing parameter updates are integrated into the iterative process via the generalized Fellner-Schall method, which recycles the derived matrix decompositions, thereby providing an efficient approximation to the global penalized optimization problem. Simulation studies with moderate sample sizes under non-Gaussian distributions confirm the empirical consistency of the estimation across multiple scenarios. Notably, the proposed approach remains stable where state-of-the-art competitive methods fail to recover true single-index coefficients and nonlinear functions, and is 80.13 times faster than the usual two-step method in the most computationally intensive scenario. The modeling advantage is illustrated through an application to Capital Bike Sharing data, where we deal with a single-index interaction effect for each year, with distinct single-index coefficients, a complex structure that makes competitive methods inapplicable. The proposed method is implemented in R, with functions available for reproducibility and transparency in comparisons.

2411.10959 2026-06-11 econ.EM cs.LG math.ST stat.AP stat.ME stat.ML 版本更新

Program Evaluation with Remotely Sensed Outcomes

利用遥感结果的程序评估

Ashesh Rambachan, Rahul Singh, Davide Viviano

AI总结 本文研究了在实验和准实验中,由于遥感变量不完全测量经济结果而引起的因果推断问题,提出了一种非参数识别因果参数的方法,结合实验和观测数据进行n^{-1/2}推断。

详情
AI中文摘要

我们研究了在实验和准实验中,经济结果由遥感变量不完全测量的因果推断问题。遥感变量是低成本、可扩展且在观测数据中预测经济结果的变量,例如卫星图像和移动电话活动。我们将遥感变量视为后结果:经济结果的变化导致遥感变量的变化。例如,环境质量的变化导致卫星图像的变化,而不是相反。在这一假设下,我们提出了一种结合实验和观测数据的公式,以非参数方式识别因果参数。我们开发了一种n^{-1/2}推断方法,该方法对规格不正确具有鲁棒性,并且不限制用于处理遥感变量的算法。

英文摘要

We study causal inference in experiments and quasi-experiments, where the economic outcome is imperfectly measured by a remotely sensed variable. The remotely sensed variable is low-cost, scalable, and predictive of the economic outcome in observational data; examples include satellite imagery and mobile phone activity. We model the remotely sensed variable as post-outcome: variation in the economic outcome causes variation in the remotely sensed variable. For example, changes in environmental quality cause changes in satellite imagery, not vice versa. Under this assumption, we propose a formula to nonparametrically identify the causal parameter by combining experimental and observational data. We develop a method for n^{-1/2} inference that is robust to misspecification and that does not restrict the algorithms used to process remotely sensed variables.

2605.11340 2026-06-11 stat.ME 版本更新

Hyperbolic Latent Space Models for Network Embedding: Model Specification and Bayesian Inference

双曲潜空间模型用于网络嵌入:模型规范与贝叶斯推断

Yiwei Gong, Anna L. Smith, Dena Asta, Catherine A. Calder

AI总结 本文提出双曲潜空间模型,通过贝叶斯推断解决网络嵌入中的树状结构和厚尾度分布问题,强调温度参数对网络拓扑的重要性。

详情
AI中文摘要

许多现实世界网络表现出分层、树状结构和厚尾度分布,这些现象无法被传统网络数据统计模型轻易捕捉。本文基于统计物理的见解,提出具有双曲几何基础的连续潜空间模型,以概率方式将节点嵌入具有恒定负曲率的潜空间。然而,大多数统计实现简化了原始物理模型,忽略了控制潜距离到概率映射锐度的温度参数。本文认为这一省略是关键性的。我们证明温度是控制网络树状拓扑的根本参数,未能推断温度会削弱模型表达能力。我们正式提出一个具有未知可学习温度参数的贝叶斯双曲连续潜空间模型。然后开发了两种推断程序:用于严谨后验特征化的哈密顿蒙特卡罗方法和用于大规模网络的可扩展自编码变分贝叶斯算法。通过模拟和实际数据示例,我们证明在大多数情况下,本文模型在图重建任务中优于具有固定温度和错误指定欧几里得几何的模型,确认温度是复杂网络的关键且可推断的特征。

英文摘要

Many real-world networks exhibit hierarchical, tree-like structure and heavy-tailed degree distributions, phenomena not readily captured by standard statistical models for network data. Extensions of the popular continuous latent space modeling framework have been proposed to accommodate such networks. Drawing on insights from statistical physics, continuous latent space models with underlying hyperbolic geometry have been proposed as a natural framework, probabilistically embedding nodes in a latent Riemannian manifold with constant negative curvature. Most statistical implementations, however, simplify the original physics-based model by omitting the ``temperature parameter," which controls the sharpness of the latent distance-to-probability mapping. We argue this omission is critical. We demonstrate that temperature is the fundamental parameter governing a network's tree-like topology, and that failing to infer it weakens model expressiveness. We formalize a Bayesian hyperbolic continuous latent space model with an unknown, learnable temperature parameter. We then develop two inferential procedures: a Hamiltonian Monte Carlo approach for rigorous posterior characterization and a scalable auto-encoding variational Bayes algorithm for large-scale networks. Through simulation and real data examples, we show that our model outperforms models with fixed temperature and misspecified Euclidean geometries in graph reconstruction tasks in most settings, confirming temperature is a crucial and inferable feature of complex networks.

2604.23464 2026-06-11 stat.ME stat.AP 版本更新

Design-Based Cross-Validation for Comparing Small Area Estimators

关于小区域估计器的交叉验证

Qianyu Dong, Zehang Richard Li

AI总结 本文提出一种适用于复杂调查设计的小区域估计器交叉验证框架,通过分解交叉验证平方误差,揭示可识别偏差与不可识别成分,提升模型比较的稳健性和可解释性。

详情
Comments
Previous title: "On cross-validation for small area estimators"
AI中文摘要

地方公共卫生监测常常依赖住户调查,但所需空间分辨率的数据稀少。小区域估计(SAE)方法通过跨区域借用强度和辅助信息解决这一挑战。然而,在缺乏真实数据的情况下,比较这些估计器仍然困难。我们提出了一种适用于复杂调查设计的交叉验证框架,用于评估小区域估计器。我们的方法使能够对区域级和单元级SAE模型进行模型无关的比较。框架的核心是交叉验证平方误差的分解,揭示了可识别偏差和不可识别成分,后者可以被界定。我们的理论结果和模拟研究显示,传统方法如留一区域法交叉验证可能导致误导性的模型排名,而所提方法提供了更稳健和可解释的模型比较,并具有不确定性量化。我们通过比较赞比亚Demographic and Health Surveys中估计的亚国家女性识字率的小区域估计模型,展示了该框架。

英文摘要

Subnational monitoring of public health and development indicators often relies on household surveys where data are sparse at the desired spatial resolution. Small area estimation (SAE) methods address this challenge by borrowing strength across areas and incorporating auxiliary information. However, comparing these estimators remains difficult in the absence of ground truth. We propose a design-based cross-validation framework for evaluating small area estimators that accommodates complex survey designs. Our approach enables model-agnostic comparisons between area-level and unit-level SAE models. We derive a decomposition of the conditional mean squared error that yields a consistent cross-validation score, show that finite-sample comparisons carry an unidentifiable bias that can be bounded, and use this bound as a principled threshold for ranking models. We further show that leave-one-area-out cross-validation, a popular alternative, targets extrapolation rather than smoothing error and can reverse the correct ranking. We evaluate the framework through extensive design-based simulations. We apply the framework to compare subnational female literacy estimators in Zambia using the 2024 Demographic and Health Survey. The framework applies broadly across prevalence mapping and other SAE problems and is applicable to any small area estimator irrespective of the underlying model class.

2603.27843 2026-06-11 math.ST stat.ME 版本更新

Empirical Bayes Estimation and Inference via Smooth Nonparametric Maximum Likelihood

经验贝叶斯估计与推断:基于光滑非参数最大似然法

Taehyun Kim, Bodhisattva Sen

AI总结 针对非参数最大似然估计的离散性和慢对数解卷积率,引入高斯平滑层,提出光滑NPMLE,实现多项式解卷积率、近参数去噪性能及后验一致估计,并构建最优边际覆盖集。

详情
AI中文摘要

基于非参数最大似然估计(NPMLE)的经验贝叶斯 $g$-建模方法一直是正态均值问题中大规模估计和推断的核心。然而,不确定性量化的理论保证仍然很少。一个关键障碍是NPMLE必然是离散的,这导致离散的后验可信集和缓慢的对数解卷积率。我们通过引入一个分层高斯平滑层来解决这两个限制,该平滑层将混合分布限制为高斯位置混合。我们的光滑NPMLE继承了经典NPMLE的优良性质:它可以通过凸优化计算,并实现近乎参数的降噪性能。此外,它实现了多项式解卷积率,在相应类别上是渐近极小极大的。我们的过程还导致估计的光滑后验以多项式率收敛到真实后验。进一步,我们刻画了在期望长度上最优的边际覆盖集,构造了这些集的插件估计量,并在覆盖概率和期望长度方面为估计集建立了理论保证。我们还将理论扩展到模型误设和异方差高斯观测的设置,并研究了所提分层模型的可识别性。

英文摘要

The empirical Bayes $g$-modeling approach based on the nonparametric maximum likelihood estimator (NPMLE) has been central to large-scale estimation and inference in the normal means problem. However, theoretical guarantees for uncertainty quantification remain scarce. A key obstacle is that the NPMLE is necessarily discrete, which yields discrete posterior credible sets and a slow logarithmic deconvolution rate. We address both limitations by introducing a hierarchical Gaussian smoothing layer that restricts the mixing distribution to a Gaussian location mixture. Our smooth NPMLE inherits the favorable properties of the classical NPMLE: it is computable via convex optimization and achieves nearly parametric denoising performance. Moreover, it achieves a polynomial deconvolution rate that is asymptotically minimax over the corresponding class. Our procedure also leads to estimated smooth posteriors that converge to the true posteriors at a polynomial rate. Further, we characterize marginal coverage sets that are optimal in expected length, construct plug-in estimators of these sets, and establish theoretical guarantees for the estimated sets in terms of both coverage probability and expected length. We also extend the theory to settings with model misspecification and heteroscedastic Gaussian observations, and study identifiability of the proposed hierarchical model.

2603.22668 2026-06-11 math.ST stat.ME 版本更新

Fixed-level calibration of the Cauchy combination test

柯西组合检验的固定水平校准

Hirofumi Ota

AI总结 研究柯西组合检验在固定显著性水平下的渐近精确性,发现原始CCT在固定水平下不精确,提出边界层校准CCT(BL-CCT)通过修正参考分布而非统计量实现渐近精确,并在多种备择假设下保持功效。

详情
Comments
Added several related references, conducted power analyses and polished the proofs and the simulation section
AI中文摘要

柯西组合检验(CCT)被广泛使用,因为它能产生闭式组合$p$值,并且在宽依赖结构下当名义水平$\alpha\downarrow0$时渐近有效。我们研究了一个不同的渐近问题:当组合$p$值的数量$K$在依赖下增长时,通常的柯西截断值在普通固定水平下是否仍然准确。在典型单因子等相关高斯copula模型下,我们证明原始CCT在固定$\alpha$下通常不是渐近精确的。在固定正相关下,统计量收敛到随机潜在因子极限,因此不存在通用的固定水平参考分布。当公共相关$\rho_K$随$K$减弱时,固定水平行为由边界层尺度$s_K=\sqrt{\rho_K}(\log K)^{3/2}$控制,且原始CCT渐近精确当且仅当$\rho_K(\log K)^3\to0$。由于大小失真完全来自参考分布而非统计量,因此可以在不修改检验统计量本身的情况下进行校正。我们提出了边界层校准CCT(BL-CCT),它用单参数高斯平滑柯西族替代标准柯西参考分布。与最近修改检验统计量的变体不同,BL-CCT保持统计量不变,仅校正参考分布。BL-CCT在更弱的条件$\rho_K\log K\to0$下渐近精确,并在有界边界层上提供有用的有限$K$近似。我们还进行了若干功效分析:尽管BL-CCT仅提高了截断值,但在局部密集、稀疏和密集高斯备择假设下,它在精确度尺度上相对于原始CCT没有一阶功效损失。数值实验支持校准理论。

英文摘要

The Cauchy combination test (CCT) is widely used because it yields a closed-form combined $p$-value and is known to be asymptotically valid as the nominal level $\alpha\downarrow0$ under broad dependence structures. We study a different asymptotic question: whether the usual Cauchy cutoff remains accurate at an ordinary fixed level when the number $K$ of combined $p$-values grows under dependence. Under a canonical one-factor equicorrelated Gaussian copula model, we show that the raw CCT is generally not asymptotically exact at fixed $\alpha$. With fixed positive correlation, the statistic converges to a random latent-factor limit, so there is no universal fixed-level reference law. When the common correlation $\rho_K$ weakens with $K$, fixed-level behaviour is governed by the boundary-layer scale $s_K=\sqrt{\rho_K}(\log K)^{3/2}$, and the raw CCT is asymptotically exact if and only if $\rho_K(\log K)^3\to0$. Because the size distortion arises entirely from the reference law and not from the statistic, it can be corrected without modifying the test statistic itself. We propose the boundary-layer calibrated CCT (BL-CCT), which replaces the standard Cauchy reference by a one-parameter Gaussian-smoothed Cauchy family. Unlike recent variants that modify the test statistic, BL-CCT leaves the statistic unchanged and corrects only the reference law. BL-CCT is asymptotically exact under the weaker condition $\rho_K\log K\to0$ and provides a useful finite-$K$ approximation on bounded boundary layers. We also conduct several power analyses: although BL-CCT only raises the cutoff, it incurs no first-order power loss relative to the raw CCT on the exactness scale, under local dense, sparse, and dense Gaussian alternatives. Numerical experiments support the calibration theory.