Statistically and Computationally Optimal Estimation and Inference of Common Subspaces
公共子空间的统计与计算最优估计与推断
Joshua Agterberg
AI总结 针对多个对称低秩矩阵共享公共子空间的问题,提出基于投影梯度下降和谱平方和初始化的估计器,在强估计信噪比下达到最优 sinΘ 误差率,并在强推断信噪比下实现渐近正态分布的自适应置信区间。
详情
给定多个数据矩阵,统计和数据科学中的许多问题依赖于估计一个捕获所有数据矩阵共享的某种结构的公共子空间。在本文中,我们研究了公共子空间模型的统计和计算极限,其中观测到一组由噪声扰动的对称低秩矩阵,每个低秩矩阵共享相同的公共子空间。我们的主要结果识别了信噪比(SNR)的几个区域,使得估计和推断在统计或计算上最优,我们将这些区域称为弱SNR、中等SNR、强估计SNR和强推断SNR。首先,我们提出了一种基于投影梯度下降的估计器,通过谱平方和初始化,并证明它在强估计SNR下达到了最优的$\sinΘ$误差率。这些结果由统计和计算下界补充,这些下界识别了弱和中等估计SNR区域。接下来,我们转向$\sinΘ$距离本身的统计推断,并证明我们的估计器在强推断SNR区域具有渐近高斯分布。基于这一极限结果,我们提出了置信区间,并证明它们在强推断SNR区域是自适应极小化最优的,其中自适应性以SNR衡量。最后,我们证明在强推断SNR区域以下,自适应置信区间在信息论上是不可能的。因此,我们的结果揭示了一个新现象:尽管SNR“高于”估计的计算极限,但自适应统计推断在信息论上可能仍然是不可能的。
Given multiple data matrices, many problems in statistics and data science rely on estimating a common subspace that captures certain structure shared by all the data matrices. In this paper we investigate the statistical and computational limits for the common subspace model in which one observes a collection of symmetric low-rank matrices perturbed by noise, where each low-rank matrix shares the same common subspace. Our main results identify several regimes of the signal-to-noise ratio (SNR) such that estimation and inference are statistically or computationally optimal, and we refer to these regimes as weak SNR, moderate SNR, strong estimation SNR, and strong inference SNR. First, we propose an estimator based on projected gradient descent initialized via spectral sum of squares and show that it achieves the optimal $\sinΘ$ error rate under strong estimation SNR. These results are complemented by both statistical and computational lower bounds identifying the weak and moderate estimation SNR regimes. Next, we turn to statistical inference for the $\sinΘ$ distance itself, and we show that our estimator has an asymptotically Gaussian distribution in the strong inference SNR regime. Based on this limiting result we propose confidence intervals and show that they are adaptively minimax optimal in the strong inference SNR regime, where adaptivity is measured in terms of the SNR. Finally, we show that adaptive confidence intervals are information-theoretically impossible below the strong inference SNR regime. Consequently, our results unveil a novel phenomenon: despite the SNR being ``above'' the computational limit for estimation, adaptive statistical inference may still be information-theoretically impossible.