EMOVA: Emotion-driven neural volumetric avatar

Published: 01 Jan 2024, Last Modified: 19 Jan 2025Image Vis. Comput. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A novel network, EMOVA, which uses two emotional stimuli from images and the corresponding voice to learn subtle differences in facial expression.•The attention-based fusion method to efficiently combine the information of images and voice, resulting in the high-quality reconstruction of faces.•Through exhaustive evaluations, EMOVA is demonstrated to reconstruct significantly more realistic and expressive faces than state-of-the-art methods.
Loading