{
    "Textual_Faithfulness": "The score is 1. Reason: There is no video provided for evaluation. \n",
    "Frame_Consistency": "The score is 1. Reason: There is no video provided for evaluation, only an image of three lighthouses. \n",
    "Video_Fidelity": "The score is 1. Reason: There are no men or drones present in this video, and it depicts lighthouses, not buildings, meaning the text condition was not successfully applied. Additionally, the generated video lacks movement. \n"
}