Canary Vocal Sensorimotor Model with RNN Decoder and Low-dimensional GAN Generator

Silvia Pagliarini, Arthur Leblois, Xavier Hinaut

2021 (modified: 22 Feb 2022)ICDL 2021Readers: Everyone

Abstract: Songbirds, like humans, learn to imitate sounds produced by adult conspecifics. Similarly, a complete vocal learning model should be able to produce, perceive and imitate realistic sounds. We propose (1) to use a low-dimensional generator model obtained from training WaveGAN on a canary vocalizations, (2) to use a RNN-classifier to model sensory processing. In this scenario, can a simple Hebbian learning rule drive the learning of the inverse model linking the perceptual space and the motor space? First, we study how the motor latent space topology affects the learning process. We then investigate the influence of the learning rate and of the motor latent space dimension. We observe that a simple Hebbian rule is able to drive the learning of realistic sounds produced via a low-dimensional GAN.

0 Replies