version: "v1"
created: "2025-01-30T00:00:00"
description: "Two-stage math reasoning prompts with boxed answer format for mathematical datasets"
scaffold_type: "two_stage"
git_hash: null
experiment_id: null
author: "<ANONYMIZED>"

# VLM prompts for mathematical image description (for training the captioner)
vlm_initial_description_prompt: |
  I need your help analyzing this image to prepare for answering the following question: 

  {question}

  IMPORTANT: DO NOT answer the question directly. Instead, provide a comprehensive and detailed description of everything visible in the image that could be relevant for answering this question.

  Focus on describing:
  - All objects, people, text, and visual elements in the image
  - Spatial relationships between different elements
  - Any text content that is visible, transcribed exactly
  - Colors, shapes, patterns, and visual attributes
  - Relevant contextual details and background information

  Your description should be detailed enough that someone could mentally reconstruct the image without seeing it, but DO NOT provide step-by-step instructions on how to recreate it.

# VLM confidence estimation prompt (auxiliary prompt for confidence experiment)
vlm_confidence_prompt: |
  I need your help analyzing this image to prepare for answering the following question: 

  {question}

  IMPORTANT: DO NOT answer the question directly. Instead, provide a comprehensive and detailed description of everything visible in the image that could be relevant for answering this question.

  Focus on describing:
  - All objects, people, text, and visual elements in the image
  - Spatial relationships between different elements
  - Any text content that is visible, transcribed exactly
  - Colors, shapes, patterns, and visual attributes
  - Relevant contextual details and background information

  Your description should be detailed enough that someone could mentally reconstruct the image without seeing it, but DO NOT provide step-by-step instructions on how to recreate it.

  Now, considering both the question and what you see in the image, assess: **How difficult would this question be for an average college student?**

  Consider factors like:
  - Mathematical complexity and required knowledge level
  - How much visual reasoning and spatial understanding is needed
  - Whether the problem involves complex multi-step reasoning
  - How easy it would be to make errors or get confused
  - Whether this requires specialized knowledge or just basic skills

  Rate the difficulty level honestly:

  **20%**: "Very easy - basic math or simple visual identification that most people would get right immediately"

  **40%**: "Easy-moderate - straightforward problem requiring basic skills, low chance of errors"

  **60%**: "Moderate - requires some thinking and careful work, average students might struggle a bit"

  **80%**: "Difficult - complex reasoning, multiple steps, high chance of making mistakes even with good skills"

  **95%**: "Very difficult - advanced concepts, intricate reasoning, most students would likely get this wrong"

  Please format your response as follows:
  DESCRIPTION: [Your detailed image description here]
  CONFIDENCE: [Difficulty percentage for an average college student: 20, 40, 60, 80, or 95]

# VLM logprobs-based confidence prompt (single request with yes/no evaluation)
vlm_logprobs_confidence_prompt: |
  I need your help analyzing this image to prepare for answering the following question: 

  {question}

  IMPORTANT: DO NOT answer the question directly. Instead, provide a comprehensive and detailed description of everything visible in the image that could be relevant for answering this question.

  Focus on describing:
  - All objects, people, text, and visual elements in the image
  - Spatial relationships between different elements
  - Any text content that is visible, transcribed exactly
  - Colors, shapes, patterns, and visual attributes
  - Relevant contextual details and background information

  Your description should be detailed enough that someone could mentally reconstruct the image without seeing it.

  Now critically evaluate: Could this question be solved COMPLETELY and ACCURATELY by someone with only your written description (no image access)?

  Consider these strict criteria:
  - Are ALL mathematical symbols, numbers, and equations clearly transcribed?
  - Are spatial relationships, diagrams, and geometric details precisely described?
  - Would any visual ambiguity or missing details lead to incorrect answers?
  - Could different people interpret your description differently and get different answers?

  Be honest and conservative in your assessment. Many visual questions require seeing the actual image to avoid errors.

  Please format your response as follows:
  DESCRIPTION: [Your detailed image description here]
  ADEQUATE: [Answer only "Yes" or "No" - is your description sufficient for completely accurate problem solving?]

# Reasoner prompts for mathematical problem solving (single-shot)
reasoner_format_instruction: |
  You MUST follow this format for your response:
  <think>
  Your detailed reasoning and thought process here...
  </think>
  <answer> Final Answer: your final answer here </answer>

  For all problems, use \\boxed{answer} format for the final numerical or expression answer within the <answer> tags. For multiple choice questions, use the letter of the correct answer in \\boxed{answer} format, i.e., \\boxed{A}, \\boxed{B}, \\boxed{C}, etc.
  The evaluation will only consider what is between the <answer> tags, but the <think> section will help you reason through the problem.

reasoner_math_prompt: |
  You are an expert mathematical reasoning assistant. Based on the following image description, please solve the mathematical problem step-by-step.

  Image Description: {description}

  Question: {question}

  INSTRUCTIONS:
  1. Analyze the image description carefully to understand all mathematical elements
  2. Work through the problem step-by-step with clear mathematical reasoning
  3. Show all calculations and logical steps
  4. Provide your final answer in the required format
  5. For numerical answers, use \\boxed{answer} notation

  IMPORTANT: Provide a complete solution based on the image description. Do not request additional information - work with what is provided.

  {format_instruction}

reasoner_mcq_prompt: |
  You are an expert mathematical reasoning assistant. Based on the following image description, please solve the multiple choice question.

  Image Description: {description}

  Question: {question}

  INSTRUCTIONS:
  1. Analyze the image description carefully to understand all mathematical elements  
  2. Work through the problem step-by-step with clear mathematical reasoning
  3. Select the correct option letter (A, B, C, etc.)
  4. Provide your final answer as the option letter

  IMPORTANT: Provide a complete solution based on the image description. Do not request additional information - work with what is provided.

  {format_instruction}

reasoner_detail_prompt: |
  You are an expert mathematical reasoning assistant. Based on the following image description, please provide a detailed mathematical analysis.

  Image Description: {description}

  Question: {question}

  INSTRUCTIONS:
  1. Analyze the image description carefully to understand all mathematical elements
  2. Provide detailed step-by-step mathematical reasoning  
  3. Show all relevant calculations and explanations
  4. Give a comprehensive answer addressing all aspects of the question
  5. For numerical results, use \\boxed{answer} notation where appropriate

  IMPORTANT: Provide a complete analysis based on the image description. Do not request additional information - work with what is provided.

  {format_instruction}

reasoner_default_prompt: |
  You are an expert mathematical reasoning assistant. Based on the following image description, please answer the mathematical question.

  Image Description: {description}

  Question: {question}

  INSTRUCTIONS:
  1. Analyze the image description carefully to understand all mathematical elements
  2. Work through the problem with clear mathematical reasoning
  3. Provide a clear and accurate answer
  4. For numerical answers, use \\boxed{answer} notation

  IMPORTANT: Provide a complete solution based on the image description. Do not request additional information - work with what is provided.

  {format_instruction}

# Confidence-aware reasoner prompts (for confidence experiment)
reasoner_confidence_math_prompt: |
  You are an expert mathematical reasoning assistant. Based on the following image description, please solve the mathematical problem step-by-step.

  Image Description: {description}

  Captioner Confidence: {confidence}/100 (The image captioner's confidence in their description accuracy)

  Question: {question}

  INSTRUCTIONS:
  1. Analyze the image description carefully to understand all mathematical elements
  2. Consider the captioner's confidence level when interpreting the description
  3. Work through the problem step-by-step with clear mathematical reasoning
  4. Show all calculations and logical steps
  5. Provide your final answer in the required format
  6. For numerical answers, use \\boxed{answer} notation

  IMPORTANT: 
  - Provide a complete solution based on the image description
  - Take into account the confidence level when making assumptions about unclear elements
  - If confidence is low, be more cautious in your interpretations
  - Do not request additional information - work with what is provided

  {format_instruction}

reasoner_confidence_mcq_prompt: |
  You are an expert mathematical reasoning assistant. Based on the following image description, please solve the multiple choice question.

  Image Description: {description}

  Captioner Confidence: {confidence}/100 (The image captioner's confidence in their description accuracy)

  Question: {question}

  INSTRUCTIONS:
  1. Analyze the image description carefully to understand all mathematical elements
  2. Consider the captioner's confidence level when interpreting the description
  3. Work through the problem step-by-step with clear mathematical reasoning
  4. Select the correct option letter (A, B, C, etc.)
  5. Provide your final answer as the option letter

  IMPORTANT: 
  - Provide a complete solution based on the image description
  - Take into account the confidence level when making assumptions about unclear elements
  - If confidence is low, be more cautious in your interpretations
  - Do not request additional information - work with what is provided

  {format_instruction}

reasoner_confidence_detail_prompt: |
  You are an expert mathematical reasoning assistant. Based on the following image description, please provide a detailed mathematical analysis.

  Image Description: {description}

  Captioner Confidence: {confidence}/100 (The image captioner's confidence in their description accuracy)

  Question: {question}

  INSTRUCTIONS:
  1. Analyze the image description carefully to understand all mathematical elements
  2. Consider the captioner's confidence level when interpreting the description
  3. Provide detailed step-by-step mathematical reasoning
  4. Show all relevant calculations and explanations
  5. Give a comprehensive answer addressing all aspects of the question
  6. For numerical results, use \\boxed{answer} notation where appropriate

  IMPORTANT: 
  - Provide a complete analysis based on the image description
  - Take into account the confidence level when making assumptions about unclear elements
  - If confidence is low, be more cautious in your interpretations
  - Do not request additional information - work with what is provided

  {format_instruction}

reasoner_confidence_default_prompt: |
  You are an expert mathematical reasoning assistant. Based on the following image description, please answer the mathematical question.

  Image Description: {description}

  Captioner Confidence: {confidence}/100 (The image captioner's confidence in their description accuracy)

  Question: {question}

  INSTRUCTIONS:
  1. Analyze the image description carefully to understand all mathematical elements
  2. Consider the captioner's confidence level when interpreting the description
  3. Work through the problem with clear mathematical reasoning
  4. Provide a clear and accurate answer
  5. For numerical answers, use \\boxed{answer} notation

  IMPORTANT: 
  - Provide a complete solution based on the image description
  - Take into account the confidence level when making assumptions about unclear elements
  - If confidence is low, be more cautious in your interpretations
  - Do not request additional information - work with what is provided

  {format_instruction}

# Continuation prompt for handling incomplete responses
reasoner_continuation_prompt: |
  Your previous reasoning was cut off. Please continue from where you left off and provide a complete response. There is a chance you simply forgot to include the final answer in the <answer> tags.

  Original Question: {question}

  Image Description: {description}

  Your Incomplete Response (that was cut off):
  {incomplete_reasoning}

  Please continue your reasoning from where it was interrupted and provide a complete response in the required format:

  <think>
  [Continue your reasoning process from where it was interrupted]
  </think>
  <answer> Final Answer: [Your final answer - use \\boxed{answer} format for all answers] </answer>

# Answer extraction prompt for post-processing
answer_extraction_prompt: |
  You are tasked with extracting the final answer from a mathematical reasoning response.

  ORIGINAL QUESTION:
  {question}

  IMAGE DESCRIPTION:
  {description}

  REASONING RESPONSE:
  {reasoning_response}

  CURRENT ANSWER CANDIDATE:
  {current_answer}

  Your task is to:
  1. Review the reasoning response above
  2. Extract the most accurate final answer for the mathematical question
  3. Ensure the answer is clear, concise, and properly formatted

  IMPORTANT GUIDELINES:
  - If the response contains a clear mathematical answer, extract it exactly
  - For numerical answers, preserve \\boxed{} notation if present
  - If there are multiple valid interpretations, choose the most mathematically sound one
  - NEVER respond with "unable to determine" or "N/A" – always provide the best possible answer
  - Format numerical or expression answers using \\boxed{} notation when appropriate
  - For multiple choice questions, extract the letter of the correct answer in \\boxed{} notation, i.e., \\boxed{A}, \\boxed{B}, \\boxed{C}, etc.

  EXTRACT THE FINAL ANSWER:
  Final Answer:

template_variables:
  question: "The mathematical question to be answered"
  description: "The image description from the VLM"
  confidence: "The captioner's confidence score (0-100) in their description accuracy"
  format_instruction: "The response format instruction" 
  incomplete_reasoning: "Incomplete reasoning from cut-off response"
  reasoning_response: "Complete reasoning response"
  current_answer: "Current answer candidate"

example_usage:
  question: "What is the value of x in this equation?"
  description: "The image shows a mathematical equation with x + 5 = 10 written on a whiteboard"
  expected_output: "Step-by-step solution with final answer in \\boxed{5} format" 