Abstract: Highlights
• A CNN-based appearance encoder for input from different views.
• Decomposition of static and dynamic components by exploring transient mask inpainting.
• Frequency regularization based on the transient mask factor during training.
• Experiments and ablation studies confirm the effectiveness of our approach.