ELISA: A Generative AI Agent for Expression Grounded Discovery in Single-Cell Genomics

Published: 02 Mar 2026, Last Modified: 10 Mar 2026Gen² 2026 PosterEveryoneRevisionsCC BY 4.0
Track: Full / long paper (5-8 pages)
Keywords: Single Cell, Generative AI, Agent
TL;DR: A generative AI–based agent capable of analyzing single-cell data.
Abstract: We present ELISA, a retrieval-augmented AI agent for interpretable, hypothesisdriven exploration of single-cell RNA sequencing (scRNA-seq) data. ELISA enables natural-language querying of cell populations through a queryconditioned retrieval framework that explicitly integrates semantic biological priors with expression-derived evidence. In semantic mode, cluster-level biological summaries are embedded using BioBERT to align user queries with ontology-supported annotations. Hybrid mode extends this approach by constructing a query-adaptive expression representation from semantically relevant clusters and combining it with scGPT-derived transcriptional embeddings, prioritizing cell populations that are both semantically relevant and transcriptionally coherent with the query intent.scGPT mode relies exclusively on transcriptional structure captured in the scGPT latent embedding space, emphasizing genes that dominantly shape expression-derived representations of retrieved clusters, independent of semantic annotations or curated biological knowledge. Finally, discovery mode contrasts dataset-specific expression signals with prior biological knowledge to surface context-shifted gene programs and generate cautious, datagrounded hypotheses. By explicitly separating retrieval, expression evidence, and language-model interpretation, ELISA prioritizes transparency and reproducibility over speculative inference. The system is designed as a human-in the-loop analytical tool that supports expert reasoning and hypothesis generation rather than fully autonomous biological discovery.
Anonymization: This submission has been anonymized for double-blind review via the removal of identifying information such as names, affiliations, and identifying URLs.
Submission Number: 9
Loading