Document Zbl 1269.65043

Halko, N.; Martinsson, P. G.; Tropp, J. A.

Finding structure with randomness: probabilistic algorithms for constructing approximate matrix decompositions. (English) Zbl 1269.65043

SIAM Rev. 53, No. 2, 217-288 (2011).

Summary: Low-rank matrix approximations, such as the truncated singular value decomposition and the rank-revealing QR decomposition, play a central role in data analysis and scientific computing. This work surveys and extends recent research which demonstrates that randomization offers a powerful tool for performing low-rank matrix approximation. These techniques exploit modern computational architectures more fully than classical methods and open the possibility of dealing with truly massive data sets.
This paper presents a modular framework for constructing randomized algorithms that compute partial matrix decompositions. These methods use random sampling to identify a subspace that captures most of the action of a matrix. The input matrix is then compressed-either explicitly or implicitly-to this subspace, and the reduced matrix is manipulated deterministically to obtain the desired low-rank factorization. In many cases, this approach beats its classical competitors in terms of accuracy, robustness, and/or speed. These claims are supported by extensive numerical experiments and a detailed error analysis. The specific benefits of randomized techniques depend on the computational environment. Consider the model problem of finding the \(k\) dominant components of the singular value decomposition of an \(m \times n\) matrix. (i) For a dense input matrix, randomized algorithms require \(\big (mn \log(k))\) floating-point operations (flops) in contrast to \( \big (mnk)\) for classical algorithms. (ii) For a sparse input matrix, the flop count matches classical Krylov subspace methods, but the randomized approach is more robust and can easily be reorganized to exploit multiprocessor architectures. (iii) For a matrix that is too large to fit in fast memory, the randomized techniques require only a constant number of passes over the data, as opposed to \(\big (k)\) passes for classical algorithms. In fact, it is sometimes possible to perform matrix approximation with a single pass over the data.

Cited in 7 Reviews

Cited in 649 Documents

MSC:

65F30	Other matrix algorithms (MSC2010)
68W20	Randomized algorithms
60B20	Random matrices (probabilistic aspects)
65F20	Numerical solutions to overdetermined systems, pseudoinverses
65Y05	Parallel numerical computation
15B52	Random matrices (algebraic aspects)
62-07	Data analysis (statistics) (MSC2010)
68W30	Symbolic computation and algebraic computation

Keywords:

dimension reduction; eigenvalue decomposition; interpolative decomposition; Johnson-lindenstrauss lemma; matrix approximation; parallel algorithm; pass-efficient algorithm; principal component analysis; randomized algorithm; random matrix; rank-revealing QR factorization; singular value decomposition; streaming algorithm; numerical examples; data analysis; scientific computing;sparse input matrix; Krylov subspace methods

Cite Review PDF

Full Text: DOI arXiv Link