Multi-modal, multi-species, and multi-task latent-space model for decoding level of consciousness
Keywords: foundation model, multi-modal, multi-species, consciousness, representation learning
TL;DR: A multi-modal latent-space model trained on data across species can decode depth of anesthesia, as measured by behavioral scores and responses to an auditory stimulus paradigm.
Abstract: Assessing the degree and characteristics of consciousness is central to caring for patients with Disorders of Consciousness (DoC), yet the current standard of care is a bedside questionnaire. Using laminar neural probe recordings, we introduce a self-supervised, multi-modal representation learning approach that constructs an interpretable 2-D latent space of brain state from continuous neural activity. We jointly encode local field potentials (LFPs), single-unit firing rates, and a movement proxy with a time-aware autoregressive VAE (TN-VAE). In rats, pigs, and humans undergoing controlled-depth anesthesia experiments, the learned latent trajectories nonlinearly, but smoothly, separate four expert-defined states (awake, light, moderate, deep) at 2 s resolution, far exceeding the temporal granularity of behavioral scoring (15 min). The latent space supports linear readout of both coarse state and individual behavioral components, and its axes align with known physiology: delta/alpha power differentiates unconscious sub-states, while gamma power and unit firing distinguish wakefulness. To enable cross-species use and accommodate incomplete modality sets, we add lightweight, species-specific stitching layers. Pretrained on tri-modal rat data and fine-tuned on unimodal (LFP) pig data and unimodal human intraoperative data, the model successfully separates awake from anesthetized states in new, unseen human subjects, demonstrating zero-shot multi-subject transfer without per-patient calibration. The model further generalizes, in a multi-task manner, to predict responses to additional stimuli. These results highlight a path toward a foundation model for DoC that generalizes across sessions, subjects, species, and stimuli to enable scalable, continuous brain-state monitoring and clinical decision support.
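The abstract's core architectural idea — species-specific "stitching" layers that map heterogeneous, possibly incomplete modality sets into a shared space feeding a time-aware autoregressive VAE with a 2-D latent — can be sketched as below. This is a minimal illustrative sketch under assumed dimensions and layer choices (the `StitchedTNVAE` name, GRU encoder, and mean-fusion of modalities are hypothetical stand-ins, not the paper's exact design):

```python
import torch
import torch.nn as nn

class StitchedTNVAE(nn.Module):
    """Hypothetical sketch: time-aware multi-modal VAE with lightweight
    per-(species, modality) 'stitching' input layers. All dimensions and
    layer choices are illustrative assumptions."""

    def __init__(self, modality_dims, hidden=32, latent=2):
        super().__init__()
        # One lightweight linear stitch per modality stream, mapping raw
        # features (LFP bands, unit rates, movement proxy) into a shared space.
        self.stitch = nn.ModuleDict(
            {name: nn.Linear(d, hidden) for name, d in modality_dims.items()}
        )
        # Autoregressive (GRU) encoder provides temporal context per time bin.
        self.rnn = nn.GRU(hidden, hidden, batch_first=True)
        self.to_mu = nn.Linear(hidden, latent)
        self.to_logvar = nn.Linear(hidden, latent)
        # Decoder reconstructs the shared embedding from the 2-D latent.
        self.decode = nn.Linear(latent, hidden)

    def forward(self, inputs):
        # inputs: dict of (batch, time, features) tensors; a missing
        # modality is simply absent from the dict (incomplete modality sets).
        embeds = [self.stitch[k](x) for k, x in inputs.items()]
        fused = torch.stack(embeds).mean(0)  # average available modalities
        h, _ = self.rnn(fused)
        mu, logvar = self.to_mu(h), self.to_logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()  # reparameterize
        return self.decode(z), mu, logvar

# Example: tri-modal rat session, 4 segments of 10 time bins each.
dims = {"rat_lfp": 16, "rat_units": 8, "rat_move": 1}
model = StitchedTNVAE(dims)
x = {k: torch.randn(4, 10, d) for k, d in dims.items()}
recon, mu, logvar = model(x)
# mu has shape (4, 10, 2): one point in the 2-D latent space per time bin.
```

Transfer to a new species would, under this sketch, add new entries to `self.stitch` while keeping the shared GRU encoder and 2-D latent frozen or fine-tuned.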
Submission Number: 43