{
    "Textual_Faithfulness": "The score is 1. Reason: The text condition describes a woman riding a donkey, but the video still shows a horse jumping over the hurdles. \n",
    "Frame_Consistency": "The score is 1. Reason: The text condition asks to replace the horse with a donkey, but the generated video still shows a horse jumping over hurdles, with no visual indication of a donkey. \n",
    "Video_Fidelity": "The score is 1. Reason: There is no donkey in the video, and the editing model fails to fulfill the text condition. \n"
}