Sparse PCA via Covariance Thresholding

Yash Deshpande; Andrea Montanari

doi:10.48550/arxiv.1311.5179

Sparse PCA via Covariance Thresholding

Yash Deshpande(Stanford University), Andrea Montanari(Stanford University)

arXiv (Cornell University)

November 20, 2013

10.48550/arxiv.1311.5179

Cited by 45Open Access

Full Text

Abstract

In sparse principal component analysis we are given noisy observations of a low-rank matrix of dimension $n\times p$ and seek to reconstruct it under additional sparsity assumptions. In particular, we assume here each of the principal components $\mathbf{v}_1,\dots,\mathbf{v}_r$ has at most $s_0$ non-zero entries. We are particularly interested in the high dimensional regime wherein $p$ is comparable to, or even much larger than $n$. In an influential paper, \cite{johnstone2004sparse} introduced a simple algorithm that estimates the support of the principal vectors $\mathbf{v}_1,\dots,\mathbf{v}_r$ by the largest entries in the diagonal of the empirical covariance. This method can be shown to identify the correct support with high probability if $s_0\le K_1\sqrt{n/\log p}$, and to fail with high probability if $s_0\ge K_2 \sqrt{n/\log p}$ for two constants $0

Related Papers

No related papers found

Powered by citation graph analysis