Abstract: Highlights•A multi-geometry embedded transformer is proposed for in-the-wild FER in videos.•MGDL captures multi-geometry structures under Euclidean and Hyperbolic spaces.•MGET combines MGDL with transformer to model multi-level spatial-temporal features.•MGET shows superior performance on in-the-wild video-based FER databases.
Loading