Covariance-Adaptive Residualization and Stagewise Calibration for Dependent Multiple Testing
协方差自适应残差化与分步校准用于相依多重检验
Prasenjit Ghosh, Arijit Chakrabarti
AI总结 针对任意协方差相依下的多元高斯均值同时假设检验问题,提出一种结合协方差自适应残差化与广义分步临界常数的分步校准程序,在降低计算复杂度的同时实现更优的信号恢复和错误控制。
详情
本文研究在任意协方差相依下多元高斯均值的同步假设检验。基于Cohen等人(2009)的最大残差向下(MRD)程序,我们探索了一种基于Gavrilov等人(2009)的广义分步临界常数的新校准策略。所得程序保留了MRD的协方差自适应残差化机制,同时将原始模型依赖的阈值设定替换为简单的分步校准规则。由于所提程序属于Ghosh和Chakrabarti(2026)研究的单调残差基分步程序类,其可容许性直接由其理论得出。我们还推导了MRD残差统计量的替代表示,将所有活动残差通过单个活动精度矩阵表达,大幅降低了计算复杂度。在广泛相依结构下的模拟研究表明,所提方法通常比几种广泛使用的边际检验程序获得更低的归一化误分类风险。在几种结构化相依模型下,该程序还表现出强大的信号恢复能力,实现了接近名义水平的错误发现率、极小的错误非发现率、接近1的功效以及接近预期真实信号数的平均拒绝数。这些发现提供了经验证据,表明协方差自适应残差化和分步校准在相依多重检验中可能以高度有利的方式相互作用。
In this paper, we study simultaneous hypothesis testing for multivariate Gaussian means under arbitrary covariance dependence. Building on the Maximum Residual Down (MRD) procedure of Cohen et al. (2009), we investigate a new calibration strategy based on the generalized step-down critical constants of Gavrilov et al. (2009). The resulting procedure retains the covariance-adaptive residualization mechanism of MRD while replacing the original model-dependent threshold specification with a simple stagewise calibration rule. Since the proposed procedure belongs to the class of monotone residual-based step-down procedures studied by Ghosh and Chakrabarti (2026), its admissibility follows directly from their theory. We also derive alternative representations of the MRD residual statistics that express all active residuals through a single active precision matrix, substantially reducing computational complexity. Simulation studies across a broad range of dependence structures show that the proposed methodology often achieves a lower normalized misclassification risk than several widely used marginal testing procedures. Under several structured dependence models, the procedure also exhibits strong signal-recovery behavior, attaining false discovery rates near the nominal level, extremely small false non-discovery rates, powers approaching one, and average numbers of rejections close to the expected number of true signals. These findings provide empirical evidence that covariance-adaptive residualization and stagewise calibration may interact in a highly favorable manner for dependent multiple testing.