{
    "Textual_Faithfulness": "The score is 5. Reason: The edited video fully aligns with the text description, capturing all details accurately. \n",
    "Frame_Consistency": "The score is 1. Reason: The video still shows a man playing tennis, not a woman. There is no continuity between the text condition and the provided video. \n",
    "Video_Fidelity": "The score is 3. Reason: The video has generally acceptable visual quality, but the face swap is noticeable if you focus on it. There are no major visual or color distortions. \n"
}