Co-manifold learning with missing data

Gal Mishne; Eric C. Chi; Ronald R. Coifman

Co-manifold learning with missing data

Gal Mishne, Eric C. Chi, Ronald R. Coifman

27 Sept 2018 (modified: 05 May 2023)ICLR 2019 Conference Blind SubmissionReaders: Everyone

Abstract: Representation learning is typically applied to only one mode of a data matrix, either its rows or columns. Yet in many applications, there is an underlying geometry to both the rows and the columns. We propose utilizing this coupled structure to perform co-manifold learning: uncovering the underlying geometry of both the rows and the columns of a given matrix, where we focus on a missing data setting. Our unsupervised approach consists of three components. We first solve a family of optimization problems to estimate a complete matrix at multiple scales of smoothness. We then use this collection of smooth matrix estimates to compute pairwise distances on the rows and columns based on a new multi-scale metric that implicitly introduces a coupling between the rows and the columns. Finally, we construct row and column representations from these multi-scale metrics. We demonstrate that our approach outperforms competing methods in both data visualization and clustering.

Keywords: nonlinear dimensionality reduction, missing data, manifold learning, co-clustering, optimization

TL;DR: Nonlinear representations of observations and features of a data matrix with missing entries and coupled geometries

12 Replies

Loading