Abstract: Highlights•Group-wise interactive region learning (GIRL) model is proposed.•Attentive region interaction (ARI) refines the region features.•Holistic semantic embedding (HSE) maps region features to semantic space.•Show the effectiveness of the proposed model.
Loading