Dictionary identification: sparse matrix-factorization via l1-minimization

Rémi Gribonval, Karin Schnass

Published: 2010, Last Modified: 14 May 2025IEEE Trans. Inf. Theory 2010EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: This paper treats the problem of learning a dictionary providing sparse representations for a given signal class, via ℓ1-minimization. The problem can also be seen as factorizing a d × N matrix Y = (y1 . . . yN), yn ∈ ℝd of training signals into a d × K dictionary matrix Φ and a K × N coefficient matrix X = (x1 . . . xN), xn ∈ ℝK, which is sparse. The exact question studied here is when a dictionary coefficient pair (Φ, X) can be recovered as local minimum of a (nonconvex) ℓ1-criterion with input Y = Φ X. First, for general dictionaries and coefficient matrices, algebraic conditions ensuring local identifiability are derived, which are then specialized to the case when the dictionary is a basis. Finally, assuming a random Bernoulli-Gaussian sparse model on the coefficient matrix, it is shown that sufficiently incoherent bases are locally identifiable with high probability. The perhaps surprising result is that the typically sufficient number of training samples N grows up to a logarithmic factor only linearly with the signal dimension, i.e., N ≈ CK log K, in contrast to previous approaches requiring combinatorially many samples.