file 92 is an example of the results obtained when validating on our test set, experiments setting can be seen in the article.

video_head.mp4 provides the results in a first-person view.

video_third.mp4 provides the results in a third-person view.

Each subfolder contains the results of detailed execution process in both text, images and video format.

conversation.json records input and output of the model in every step in a single execution trajectory.

sft_rewrite.py will regenerate the task and analysis descriptions

data_process.py will convert EMMOE data into SFT and DPO data, DPO Augmentation can also be found in it.

task_demo is an original data in EMMOE, except for the components mentioned in the article, test.json is a scenery file to build environment in Habitat-lab v2.3.