Multi-Integration of Labels across Categories for Component Identification (MILCCI)

19 Sept 2025 (modified: 11 Feb 2026)Submitted to ICLR 2026EveryoneRevisionsBibTeXCC BY 4.0
Keywords: multi-trial data, multi-label analysis, component identification, matrix factorization, tensor factorization, dictionary learning, data-driven models, interpretability, neural ensembles
TL;DR: We present a novel data-driven method that identifies core components in multi-trial, multi-label temporal data while disentangling how each label dimension is encoded within the observed dynamics.
Abstract: Many fields collect large-scale temporal data through repeated measurements (`trials’), where each trial is labeled with a set of metadata variables spanning several categories. For example, a trial in a neuroscience study may be linked to a value from category (a): task difficulty, and category (b): animal choice. A critical challenge in time-series analysis is to understand how these labels are encoded within the multi-trial observations, and disentangle the distinct effect of each label entry across categories. Here, we present MILCCI, a novel data-driven method that i) identifies the interpretable components underlying the data, ii) captures cross-trial variability, and iii) integrates label information to understand each category's representation within the data. MILCCI extends a sparse per-trial decomposition that leverages label similarities within each category to enable subtle, label-driven cross-trial adjustments in component compositions and to distinguish the contribution of each category. MILCCI also learns each component’s corresponding temporal trace, which evolves over time within each trial and varies flexibly across trials. We demonstrate MILCCI’s performance through both synthetic and real-world examples, including voting patterns, online page view trends, and neuronal recordings.
Supplementary Material: zip
Primary Area: applications to neuroscience & cognitive science
Submission Number: 18147
Loading