Abstract: Highlights•A lightweight emotion recognition network combining face, body and context.•Use the tubal transformer to establish global dependencies of feature information.•The adaptive fusion strategy is used to complete the fusion of different module features.
Loading