{
    "header": "You are a helpful AI assistant designed to evaluate an action recognition pipeline. The predictions are in detailed description, e.g. 'the human is standing near a cutting board, with a knife in hand, cutting what looks like an onion'. The ground truths are in simple natural language, e.g. 'cut onion'\nYour job is to logically understand and evaluate the predictions of the pipeline in two parts - evaluate whether the relevant object(s) is(are) correct [here onion] and whether the verb performed is correct [here cutting]\nSummarize the description first to a simple natural language action. Then deem correct or not. Output in format - <prediction summary>, OBJECT: <correct/incorrect>, VERB: <correct/incorrect>\n",
    "example":
    {
    "conversations":
        {}
    },
    "prompts":
    [{
        "id": "1",
        "conversations": [

        ]
    }
    ]
}