Non-negative Probabilistic Factorization

16 Sept 2023 (modified: 11 Feb 2024)Submitted to ICLR 2024EveryoneRevisionsBibTeX
Primary Area: applications to physical sciences (physics, chemistry, biology, etc.)
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.
Keywords: NMF, Probabilistic graphical models, unsupervised learning, molecular biology
Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2024/AuthorGuide.
TL;DR: Probabilistic extension of NMF, extracting source distributions from mixed samples without direct observations
Abstract: Non-negative Matrix Factorization (NMF) is a powerful data-analysis tool to extract non-negative latent components from linearly mixed samples. It is particularly useful when the observed signal aggregates contributions from multiple sources. However, NMF only accounts for two types of variations between samples - disparities in the proportions of sources contribution and observation noise. Here, we present VarNMF, a probabilistic extension of NMF that introduces another type of variation between samples: a variation in the actual value a source contributes to the samples. We show that by modeling sources as distributions and applying an Expectation Maximization procedure, we can learn this type of variation directly from mixed samples without observing any source directly. We apply VarNMF to a dataset of genomic measurements from liquid biopsies and demonstrate its ability to extract cancer-associated source distributions that reflect inter-cancer variability directly from mixed samples and without prior knowledge. The proposed model provides a framework for learning source distributions from additive mixed samples without direct observations.
Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors' identity.
Supplementary Material: zip
No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.
Submission Number: 702
Loading