1. 1st training stage (sh warmup/run_full.sh)
2. generate corresponding segmentation.npy files via "dynamic_grounding/process.py"
3. 2nd training stage (sh dynamic_grounding/run.sh)