{
    "Textual_Faithfulness": "The score is 1. Reason: The edited video still shows two men practicing Judo, not two girls as the text condition requests. \n",
    "Frame_Consistency": "The score is 1. Reason: The generated video depicts two men practicing Judo, not two girls as specified by the text condition. This means there is zero adherence to the text condition, resulting in extremely poor frame consistency. \n",
    "Video_Fidelity": "The score is 3. Reason: The video has slight color distortion and is generally acceptable, but some unnatural elements are still noticeable. \n"
}