SiBBlInGS: Similarity-driven Building-Block Inference using Graphs across States

Published: 17 Jun 2024, Last Modified: 17 Jul 2024ICML2024-AI4Science PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: dictionary learning, neuroscience, data analysis, EEG, tensor factorization, sparse representation, variability, time series, multi-way data, neural ensembles
TL;DR: We propose a graph-based method to identify interpretable sparse hidden components underlying time-series data recorded across trials and states. Our method captures inter- and intra-state variability and is robust to noise and missing samples.
Abstract: Time series data across scientific domains are often collected under distinct states (e.g., tasks), wherein latent processes (e.g., biological factors) create complex inter- and intra-state variability. A key approach to capture this complexity is to uncover fundamental interpretable units within the data, Building Blocks (BBs), which modulate their activity and adjust their structure across observations. Existing methods for identifying BBs in multi-way data often overlook inter- vs. intra-state variability, produce uninterpretable components, or do not align with properties of real-world data, such as missing samples and sessions of different durations. Here, we present a framework for Similarity-driven Building Block Inference using Graphs across States (SiBBlInGS). SiBBlInGS offers a graph-based dictionary learning approach for discovering sparse BBs along with their temporal traces, based on co-activity patterns and inter- vs. intra-state relationships. Moreover, SiBBlInGS captures per-trial temporal variability and controlled cross-state structural BB adaptations, identifies state-specific vs. state-invariant components, and accommodates variability in the number and duration of observed sessions across states. We demonstrate SiBBlINGS’s ability to reveal insights into complex biological and medical phenomena through several synthetic and real-world examples. Specifically, we found that SiBBlInGS recovers meaningful functional neural ensembles underlying Macaque neural recordings and can leverage human EEG data to localize the source of epileptic seizures moments before their onset.
Submission Number: 182
Loading