Interaction Models and Generalized Score Matching for Compositional Data

Published: 18 Nov 2023, Last Modified: 29 Nov 2023LoG 2023 PosterEveryoneRevisionsBibTeX
Keywords: Compositional data, Graphical model, High-dimensional statistics, Interaction, Sparsity
TL;DR: We provide estimation methodology for flexible statistical models that allow one to explore interactions in high-dimensional compositional data.
Abstract: Applications such as the analysis of microbiome data have led to renewed interest in statistical methods for compositional data, i.e., data in the form of relative proportions. In particular, there is considerable interest in modelling interactions among such proportions. To this end we propose a class of exponential family models that accommodate arbitrary patterns of pairwise interaction. Special cases include Dirichlet distributions as well as Aitchison's additive logistic normal distributions. Generally, the distributions we consider have a density that features a difficult-to-compute normalizing constant. To circumvent this issue, we design effective estimation methods based on generalized versions of score matching.
Submission Type: Full paper proceedings track submission (max 9 main pages).
Agreement: Check this if you are okay with being contacted to participate in an anonymous survey.
Software: https://github.com/sqyu/genscore
Poster: png
Poster Preview: png
Submission Number: 9
Loading