Keywords: AI agents, web agents, procedural memory, state-dependent memory, retrieval-augmented action selection, reinforcement learning, post-training
TL;DR: We introduce a state-indexed procedural memory that lets AI agents learn from past experiences, improving accuracy, reliability, and efficiency on web browsing tasks by retrieving relevant action traces when encountering similar situations.
Abstract: Learning how to do things from trial and error in real time is a hallmark of biological intelligence, yet most LLM-based agents lack mechanisms to acquire procedural knowledge after deployment. We propose Procedural Recall for Agents with eXperiences Indexed by State (PRAXIS), a lightweight post-training learning mechanism that stores the consequences of actions and retrieves them by jointly matching environmental and internal states of past episodes to the current state. PRAXIS augments agentic action selection with retrieved state-action-result exemplars that are generated in real time. When evaluated on the REAL web browsing benchmark, PRAXIS improves task completion accuracy, reliability, and cost efficiency across different foundation model backbones, and shows preliminary generalization to unseen tasks in similar environments. These results demonstrate that PRAXIS enables the practical adoption of AI agents in fast-evolving stateful environments by helping them learn new procedures effectively.
Submission Number: 8
Loading