Cognitive Architectures for Language Agents

Theodore Sumers; Shunyu Yao; Karthik Narasimhan; Thomas Griffiths

Cognitive Architectures for Language Agents

Theodore Sumers, Shunyu Yao, Karthik Narasimhan, Thomas Griffiths

Published: 22 Feb 2024, Last Modified: 17 Sept 2024Accepted by TMLREveryoneRevisionsBibTeXCC BY 4.0

Abstract: Recent efforts have augmented large language models (LLMs) with external resources (e.g., the Internet) or internal control flows (e.g., prompt chaining) for tasks requiring grounding or reasoning, leading to a new class of language agents. While these agents have achieved substantial empirical success, we lack a framework to organize existing agents and plan future developments. In this paper, we draw on the rich history of cognitive science and symbolic artificial intelligence to propose Cognitive Architectures for Language Agents (CoALA). CoALA describes a language agent with modular memory components, a structured action space to interact with internal memory and external environments, and a generalized decision-making process to choose actions. We use CoALA to retrospectively survey and organize a large body of recent work, and prospectively identify actionable directions towards more capable agents. Taken together, CoALA contextualizes today's language agents within the broader history of AI and outlines a path towards language-based general intelligence.

Certifications: Survey Certification

Submission Length: Long submission (more than 12 pages of main content)

Assigned Action Editor: ~Shixiang_Gu1

License: Creative Commons Attribution 4.0 International (CC BY 4.0)

Submission Number: 1618

Loading