Abstract: Highlights•We identify the challenges of Video Relation Detection task.•We propose a new approach called Temporal Span Proposal Network (TSPN).•We propose two key modules: (i) relationness scoring module and (ii) temporal span proposal module.•We show that TSPN is not only effective but also efficient.•We conduct comprehensive ablation studies to show the efficacy of our design choices.
Loading