Multi-modal single-cell foundation models via dynamic token adaptation

Published: 05 Mar 2025, Last Modified: 17 Apr 2025MLGenX 2025 TinyPapersEveryoneRevisionsBibTeXCC BY 4.0
Track: Tiny paper track (up to 4 pages)
Abstract:

Recent advances in applying deep learning in genomics include DNA-language and single-cell foundation models. However, these models take only one data type as input. We introduce dynamic token adaptation and demonstrate how it allows combining these models to predict gene regulation at single-cell level in different genetic contexts. Although the method is generalisable, we focus on an illustrative example by training an adapter from DNA-sequence embeddings to a single-cell foundation model's token embedding space. As qualitative evaluation, we assess the impact of DNA sequence changes on the model’s learned gene regulatory networks by mutating the transcriptional start site of the transcription factor \textit{GATA4} \textit{in silico}, observing predicted expression changes in its target genes in fetal cardiomyocytes.

Submission Number: 32
Loading