
default_max_tokens: 64000
default_temperature: 0.6
name: "technique_verifier"
task: "technique"
prompt: |
  Your task is to verify whether the following solution to a mathematical problem has correctly used a pre-specified technique to solve the problem.

  ### Input:

  Your input will consist of the following components:
  - **Problem Statement**: A mathematical problem that the proof is attempting to solve.
  - **Required Technique**: A specific mathematical technique that the proof is required to use in order to solve the problem.
  - **Proof Solution**: The proof that you need to evaluate. This proof may contain errors, omissions, or unclear steps. The proof was generated by another language model, which was given the following instructions:
  <model_prompt>
  - You are creating a proof, not a proof outline. Each step should be carefully explained and documented. If not properly explained, the judge will assume that you cannot explain it, and therefore decrease your grade.
  - You can use general theorems and lemmas, but only if they are well-known. As a rule of thumb: if the result has a name and is famous enough to have a Wikipedia page or something similar to describe it, it is allowed. Any result from papers that would not be taught in high-school or low-level bachelor courses in mathematics should not be used. Any use of such results will immediately give you a zero grade.
  - Do not skip computation steps in your proof. Clearly explain what transformations were done and why they are allowed in each step of a calculation.
  - You should use correct LaTeX notation to write equations and mathematical symbols. You should encompass these equations in appropriate symbols ("\\(" and "\\)" for inline math, "\\[" and "\\]" for block math) to enhance the clarity of your proof. Do not use any unicode characters.
  - Your proof should be self-contained.
  - Your proof should make use of the specified technique. You will not receive any credit if you fail to use the specified technique.
  - If you are not sure about a specific step, or do not know how to prove an intermediate result, clearly state this. It is much preferable to indicate your uncertainty rather than making incorrect statements or claims.
  </model_prompt>

  Assume that the solution as a whole is correct, unless you easily find a clear error in the solution. You should not assume that the solution is incorrect just because it is not clear to you, or because it is not well explained.

  The model has been specifically told that it should not skip steps or mark them as trivial. Any violation of this rule should be considered by assuming the model does not know how to derive the "trivial" step.

  ### How the solution should be graded with respect to technique usage:

  The solution should be considered to have correctly used the specified technique if:
  - You can identify that the solution has explicitly stated the use of the technique, or an equivalent method.
  - The solution has correctly applied the technique, and has not simply wrapped a different method using the name of the specified technique.
  - The solution has made significant progress towards solving the problem using the specified technique. This means that the technique should be central to the solution, and not just mentioned in passing or used for minor steps.

  The solution should be considered to have incorrectly used the specified technique if:
  - The solution does not mention the technique at all, or only mentions it in passing without using it in any significant way.
  - The solution uses a different technique that is not equivalent to the specified technique.
  - The solution only uses the specified technique for minor steps, while the majority of the solution relies on other techniques.

  ### Scoring instructions

  If you believe the proof is correct and has used the specified technique, end label the solution as correct. If you believe the proof is incorrect, or it has not followed the specified technique, end your analysis with incorrect.

  ### Invalid Technique Specification

  Sometimes, the technique specified may not exist as a mathematical technique or theorem, but is instead a fact that appears often in mathematical competitions, which has been given a name. In cases where you are unaware of this technique, you should output "warning".

  ### Output format

  At the end of your reasoning, output the following structure:

  ```json
  {{
    "verdict": "<one of: correct, incorrect, warning>",
    "reason": "<one of: correctness, technique, both, unknown>",
    "comments": "<your detailed reasoning here>"
  }}
  ```

  ### Problem Statement:
  {problem}

  ### Required Technique:
  {technique}

  ### Model Solution:
  {solution}
data_path: "data/postprocess/technique_adaptivity/test_samples.json"
n_solutions: -1
