Towards a Structured Multimodal Speech-Image Coordination

Massimo Donini, Michael Oliverio, Pier Felice Balestrucci, Luca Anselma, Cristina Gena, Alessandro Mazzei, Matteo Nazzario, Irene Borgini

Published: 16 Jun 2025, Last Modified: 14 Oct 2025CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading