Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video CaptioningOpen Website

2023 (modified: 07 Mar 2023)CoRR 2023Readers: Everyone
0 Replies

Loading