{
    "Textual_Faithfulness": "The score is 2. Reason: The edited video does not show a woman breakdancing as instructed by the text condition. \n",
    "Frame_Consistency": "The score is 1. Reason: This video is heavily edited and makes no sense visually. There is no consistency in how the people are edited into the video. \n",
    "Video_Fidelity": "The score is 2. Reason: The video has significant visual quality issues and inconsistencies. While the model somewhat attempts to follow the text condition by replacing the man with women, the scene is very blurry,  highly distorted, and unrealistic. \n"
}