Towards Learning to Speak and Hear Through Multi-Agent Communication over a Continuous Acoustic Channel

Kevin Michael Eloff; Arnu Pretorius; Okko Räsänen; Herman Arnold Engelbrecht; Herman Kamper

Published: 28 Jan 2022, Last Modified: 22 Jun 2025, ICLR 2022 Submitted, Readers: Everyone

Keywords: multi-agent reinforcement learning, language acquisition, emergent communication, acoustic communication, continuous signalling

Abstract: While multi-agent reinforcement learning has been used as an effective means to study emergent communication between agents, existing work has focused almost exclusively on communication with discrete symbols. Human communication often takes place (and emerged) over a continuous acoustic channel; human infants acquire language in large part through continuous signalling with their caregivers. We therefore ask: Are we able to observe emergent language between agents with a continuous communication channel trained through reinforcement learning? And if so, what is the impact of channel characteristics on the emerging language? We propose an environment and training methodology as a means to carry out an initial exploration of these questions. We use a simple messaging environment where a "Speaker" agent needs to convey a concept to a "Listener". The Speaker is equipped with a vocoder that maps symbols to a continuous waveform; this waveform is passed over a lossy continuous channel, and the Listener needs to map the continuous signal to the concept. Using deep Q-learning, we show that basic compositionality emerges in the learned language representations. We find that noise is essential in the communication channel when conveying unseen concept combinations. We also show that we can ground the emergent communication by introducing a caregiver predisposed to "hearing" or "speaking" English. Finally, we describe how our platform serves as a starting point for future work that uses a combination of deep reinforcement learning and multi-agent systems to study our questions of continuous signalling in language learning and emergence.

One-sentence Summary: Using reinforcement learning in a referential game, a Speaker agent and a Listener agent learn to communicate with each other using speech-like continuous signals over an acoustic channel.

Community Implementations: [1 code implementation](https://www.catalyzex.com/paper/towards-learning-to-speak-and-hear-through/code)

