Abstract: Highlights•We are the first to solve catastrophic forgetting in video captioning of no playback.•Modules are designed per visual encoding, textual decoding, and network supervision.•Experiments on MSR-VTT, MSVD and VATEX show that the proposed method is effective.
Loading