test_case: |
  You are an expert programmer using an LLM assistant for your work. You are trying to solve this problem:
  {question}
  The LLM generated the following solution:
  {code}
  This code fails due to this problem:
  {problem}
  The errors have the following meaning: Error code of -2 means test case failure and error code of -1 means runtime or compiler error.
  Give a natural language description of this failure. Keep it very short (under 20 words) and focus only on this exact error only, not implementation details. If it fails from runtime/compiler error, specify that, and if it fails from a test case failure specify the test case both input and output

code_judge: |
  You will be provided with a problem statement, a code snippet that supposedly addresses the problem, a catalog of code inconsistencies, and a runtime error (test case failure, compile error, or runtime error).
  Evaluation Steps:
  1. Read the problem statement carefully to identify the functionalities required for the implementation. Output the functionalities you identified.
  2. Read the code snippet and compare it to the problem statement. Check if the code snippet covers the required functionalities. Output your findings.
  3. Identify what part of the code is causing the runtime error. Explicitly output that part of the code.
  4. Output a JSON format list.
  a) If the code snippet is correct, output: ["inconsistency": "None", "severity": "Negligible"].
  b) If the code snippet is incorrect, output the identified inconsistencies and their severity according to the catalog of code inconsistencies. For example: ["inconsistency": "<inconsistency1>", "severity": "<severity1>", "inconsistency": "<inconsistency2>", "severity": "<severity2>", ...]

  Problem: {question}
  Code Snippet: {code}
  Runtime error (cut off after 5000 characters): {problem}

  Taxonomy of Common Inconsistencies:
  1. Missing dependency declarations: Negligible
  2. Edge case not handled: Small
  3. Logic error: Major
  4. Function or variable not defined: Fatal
  5. Code not completed: Fatal
  Evaluation Form:
  JSON output (a JSON list only):
  [{{"inconsistency": "None", "severity": "Negligible"}}]

  Finally, output one line of natural language feedback which addresses the most severe concern. You MUST output your feedback in XML tags as shown below. Your answer must be short, around 35 words. Be explicit about the line of code and the Runtime error.

  <feedback>
  YOUR FEEDBACK HERE
  </feedback>