Neural Collective Matrix Factorization for integrated analysis of heterogeneous biomedical data

Ragunathan Mariappan, Aishwarya Jayagopal, Ho Zong Sien, Vaibhav Rajan

Published: 01 Jan 2022, Last Modified: 06 Mar 2024Bioinform. 2022Readers: Everyone

Abstract: In many biomedical studies, there arises the need to integrate data from multiple directly or indirectly related sources. Collective matrix factorization (CMF) and its variants are models designed to collectively learn from arbitrary collections of matrices. The latent factors learnt are rich integrative representations that can be used in downstream tasks, such as clustering or relation prediction with standard machine-learning models. Previous CMF-based methods have numerous modeling limitations. They do not adequately capture complex non-linear interactions and do not explicitly model varying sparsity and noise levels in the inputs, and some cannot model inputs with multiple datatypes. These inadequacies limit their use on many biomedical datasets.

0 Replies