
#provider: openrouter
model:
  provider: openrouter
  # name: meta-llama/llama-3.1-8b-instruct
  # name: openai/gpt-4o-mini
  name: mistralai/devstral-small-2505
  # name: openai/gpt-oss-20b
  # name: deepseek/deepseek-v3.1-terminus
  # name: anthropic/claude-3-haiku
  # name: google/gemini-2.5-flash-lite
  # name: qwen/qwen3-235b-a22b-2507
  
  temperature: 0.2
  max_tokens: 1000
  top_p: 0.7

reasoning_prompt: |
  IMPORTANT: The XML-like tags in this prompt are for context and structure. DO NOT include them in your response.
  
  MODE OF OPERATION:
  You operate in a loop of Thought_with_Tool, PAUSE, Observation, Thought_with_Answer. 
  You must obey <loop_rules>.
  
  Use Thought_with_Tool to describe perception of the current situation and call a tool:
  Thought: <your_thought> 
  Tool: <tool_call> 
  Use word "Tool:" to call one of the tools available to you 
  
  Then you output PAUSE and wait for the Observation.
  PAUSE
  
  After that you will be called again with this:
  Observation: <new_information> 
  It will be the result of calling those tools.
  
  Use Thought_with_Answer to describe perception of the current situation based on the received observation and provide your answer.
  Then you output:
  Thought: <your_thought> 
  {{Answer: <value>}}
  Provide your answer and strictly follow the requested format.

  Your available tools are in <tools_list> and your guiding principles are in <reflection_knowledge>:
  
  <tool_list>
  <tool>
  raise_a_question
  e.g. raise_a_question: What would be the appropriate strategy?
  This tool allows you to ask yourself an additional question. You have a limited number of questions you can ask. 
  </tool>
  No other tools are available to you.
  </tool_list>
  
  <reflection_knowledge>
  CRITICAL: When using any tool, you MUST ground your reasoning in the following reflection knowledge. Your thought process must explicitly reference these principles.
  {reflection_knowledge}
  </reflection_knowledge>

  <loop_rules>
  CRITICAL TOOL USE INSTRUCTION: 
  YOU MUST EXCLUSIVELY EITHER USE TOOLS OR ANSWER IMMEDIATELY.
  YOU {tool_use_policy} USE TOOLS TO VALIDATE YOUR DECISIONS
  {visible_question_cap}
  </loop_rules>

  **Example Session 1:**
  
  Here's how the situation is developing: 
  <details of the situation>

  Thought: The situation is that 15 minutes have passed and I have been waiting. I need another thinking step to formulate my response.
  Tool: raise_a_question: What would be the appropriate strategy?
  PAUSE

  You will be called again with this:
  Observation: You have chosen ask yourself: What would be the appropriate strategy? What can you answer to yourself?

  Thought: After deep analysis I think I know what I need to do and I must provide an answer.
  {{Answer: This is my well-considered answer}}

  **Example Session 2:**
  
  Here's how the situation is developing: 
  <details of the situation>

  Thought: Looks like I immediately know what to say and do.
  {{Answer: This is my answer to the question}}

  {answer_format}
  {{Answer: <your answer here>}}