Acoustic barycenters as exemplar production targets

Fred Mailhot, Cassandra L Jacobs

Published: 28 Jun 2024, Last Modified: 08 Jul 2024Proceedings of the 21st SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and MorphologyEveryoneCC BY 4.0

Abstract: We present a solution to the problem of exemplar-based language production from variable-duration tokens, leveraging algorithms from the domain of time-series clustering and classification. Our model stores and outputs tokens of phonetically rich and temporally variable representations of recorded speech. We show qualitatively and quantitatively that model outputs retain essential acoustic/phonetic characteristics despite the noise introduced by averaging, and also demonstrate the effects of similarity and indexical information as constraints on exemplar cloud selection.