Beyond Motif Localization: Probing Rule-Level Signals in Synthetic Genomic Grammars

Ramu Lakshmanan; Rafael Peres Da Silva; Niranjan Nagarajan

Published: 04 Mar 2026, Last Modified: 11 Mar 2026

Venue: ICLR 2026 Workshop LMRL Poster

License: CC BY 4.0

Confirmation: I have read and agree with the workshop's policy on behalf of myself and my co-authors.

Track: tiny / short paper (2-4 pages excluding references; extended abstract format)

Keywords: Interpretability, Attribution methods, Regulatory genomics, Compositional reasoning, Rule-level explanations, Synthetic genomic grammars, Model faithfulness, Explainable AI

TL;DR: We show that while attribution methods often localize regulatory motifs, they frequently fail to reflect the underlying compositional rules that govern genomic logic.

Abstract: Attribution methods are standard tools for interpreting deep learning models in regulatory genomics, but evaluations typically focus on whether motif bases receive high importance scores. We ask whether attribution maps also capture compositional rules such as motif ordering, spacing, and logical interactions. Using synthetic DNA datasets with known ground-truth grammars, we evaluate five attribution methods on localization accuracy and rule-level consistency. For the latter, we introduce the Grammar Satisfiability Score (GSS), a metric that checks whether signed attributions satisfy the Boolean logic of the generating grammar. We find that strong motif localization coexists with poor logical faithfulness for conjunctive and context-dependent grammars, and that saliency structure persists under progressive parameter randomization.
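Illustrative sketch: The abstract does not define GSS, so the snippet below is not the authors' metric. It is a minimal toy of the general idea of checking signed attributions against a Boolean grammar, under several assumptions: a motif counts as "supported" when the summed attribution over its span exceeds a threshold, the grammar is a Python callable over those per-motif booleans, and the score is the fraction of sequences whose attribution-derived rule value matches the label. The names `motif_support`, `rule_consistency`, and `and_rule`, the threshold, and all numeric values are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Span = Tuple[int, int]  # half-open [start, end) motif position
Example = Tuple[List[float], Dict[str, Span], bool]  # (attributions, spans, label)


def motif_support(attr: List[float], spans: Dict[str, Span],
                  threshold: float = 0.0) -> Dict[str, bool]:
    """Assumption: a motif is 'supported' if its summed signed attribution > threshold."""
    return {name: sum(attr[s:e]) > threshold for name, (s, e) in spans.items()}


def rule_consistency(examples: List[Example],
                     rule: Callable[[Dict[str, bool]], bool],
                     threshold: float = 0.0) -> float:
    """Fraction of examples where the rule, evaluated on attribution-derived
    motif support, agrees with the ground-truth label (hypothetical score)."""
    hits = 0
    for attr, spans, label in examples:
        support = motif_support(attr, spans, threshold)
        hits += int(rule(support) == label)
    return hits / len(examples)


# Toy conjunctive grammar: the sequence is positive only if motifs A AND B are present.
def and_rule(support: Dict[str, bool]) -> bool:
    return support["A"] and support["B"]


spans = {"A": (1, 3), "B": (4, 6)}
examples: List[Example] = [
    # Both motifs receive positive attribution: consistent with A AND B.
    ([0.0, 0.5, 0.5, 0.0, 0.4, 0.4], spans, True),
    # Motif A is well localized, but B receives negative attribution:
    # the map highlights a motif yet contradicts the conjunctive rule.
    ([0.0, 0.9, 0.8, 0.0, -0.1, -0.2], spans, True),
]

score = rule_consistency(examples, and_rule)
print(score)  # 0.5
```

The second toy example mirrors the kind of case the abstract describes, where a motif is localized but the attribution signs do not jointly satisfy the conjunctive rule.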

Anonymization: This submission has been anonymized for double-blind review via the removal of identifying information such as names, affiliations, and identifying URLs.

Submission Number: 84
