Agentic Memory Should Localize Compression

Published: 03 Mar 2026, Last Modified: 25 Apr 2026ICLR 2026 Workshop MemAgentsEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Position, Compression, Architecture, Modularity
TL;DR: For agentic memory, the key question is not whether to compress, but where. We prove that modularizing memory localizes compression, minimizing retrieval overlap and mathematically bounding update-induced behavioral interference.
Abstract: Long-horizon LLM agents require memory, but unbounded storage is unusable at inference time, making compression unavoidable. In continual deployment, compression becomes repeated updates to accessible state and can induce behavioural drift on previously supported queries. We formalize this as _interference_: expected divergence between the agent’s policies before and after an update. Our position is that stability is governed by retrieval–update overlap; modular designs minimize overlap and thus localize update effects. Under routing stability, expected interference is controlled by the probability that updated modules are retrieved.
Submission Number: 63
Loading