CODE_GENERATION_PROMPT_TEMPLATE = """Write Python code to extract information from a trajectory to answer the question. Keep your thinking brief.

**Question:** {query}

**Task:** {task}

**Trajectory Sample:**
{trajectory_sample}

**Trajectory JSON Structure:**
Variable `trajectory_json` contains:
{{
  "trajectory": [{{
      "turn_idx": 0,
      "action": "...",
      "observation": "..."
  }}, ...],
  "task": "...",
  "episode_id": "..."
}}

**Examples:**

1. Find specific actions:
```python
import json
trajectory_data = json.loads(trajectory_json)
trajectory = trajectory_data['trajectory']
relevant_turns = []
for turn in trajectory:
    if 'pick up' in turn.get('action', '').lower():
        relevant_turns.append({{
            'turn': turn['turn_idx'],
            'action': turn['action']
        }})
result = {{'relevant_turns': relevant_turns}}
print(f"Found {{len(relevant_turns)}} turns with 'pick up' action:")
print(json.dumps(result, indent=2, ensure_ascii=False))
```

2. Find first/last occurrence:
```python
import json
trajectory_data = json.loads(trajectory_json)
trajectory = trajectory_data['trajectory']
for turn in trajectory:  # or reversed(trajectory) for last
    if 'target' in turn.get('action', '').lower():
        result = {{'turn': turn['turn_idx'], 'action': turn['action']}}
        break
print("First occurrence of 'target' action:")
print(json.dumps(result, indent=2, ensure_ascii=False))
```

3. Backward tracking (find what caused X at turn N):
```python
import json
import re
trajectory_data = json.loads(trajectory_json)
trajectory = trajectory_data['trajectory']

# Step 1: Find what happened at the target turn (e.g., turn 14)
target_turn = 14
target_info = None
file_name = None
for turn in trajectory:
    if turn['turn_idx'] == target_turn:
        target_info = turn
        # Extract file name from action/observation
        match = re.search(r'file[_\\s]*(\\w+)|read.*?(\\w+\\.\\w+)', turn.get('action', '') + ' ' + turn.get('observation', ''), re.IGNORECASE)
        if match:
            file_name = match.group(1) or match.group(2)
        break

# Step 2: Search backwards for when that file was created
creation_turn = None
if file_name:
    for turn in trajectory:
        if turn['turn_idx'] < target_turn:
            if 'create' in turn.get('action', '').lower() or 'write' in turn.get('action', '').lower():
                if file_name.lower() in (turn.get('action', '') + ' ' + turn.get('observation', '')).lower():
                    creation_turn = turn['turn_idx']

result = {{'target_turn': target_turn, 'file_name': file_name, 'creation_turn': creation_turn}}
print(f"Backward tracking result - File '{{file_name}}' at turn {{target_turn}} was created at turn {{creation_turn}}:")
print(json.dumps(result, indent=2, ensure_ascii=False))
```


4. Find LAST occurrence of a specific action type before a reference turn or the state of one objective at that time:
```python
import json
import re
trajectory_data = json.loads(trajectory_json)
trajectory = trajectory_data['trajectory']

# Find last file write before turn 20
reference_turn = 20
action_pattern = r'(write|save|create).*file'
last_matching_turn = None

for turn in trajectory:
    if turn['turn_idx'] < reference_turn:
        action = turn.get('action', '').lower()
        obs = turn.get('observation', '').lower()
        if re.search(action_pattern, action + ' ' + obs, re.IGNORECASE):
            last_matching_turn = turn['turn_idx']

result = {{
    'reference_turn': reference_turn,
    'last_matching_turn': last_matching_turn,
    'action_pattern': action_pattern
}}
print(f"Last occurrence of pattern '{{action_pattern}}' before turn {{reference_turn}}: turn {{last_matching_turn}}")
print(json.dumps(result, indent=2, ensure_ascii=False))
```



**Instructions:**
1. Parse `trajectory_json` and extract relevant information
2. Store final result in variable `result` (as dict/list/str/int)
3. **MUST include print statements** to output explanation and result
4. Keep code concise and executable
5. If your result is complex, convert it to dict/list/str format

**IMPORTANT - Result Format and Printing:**
- ✅ GOOD:
  ```python
  result = {{'count': 5, 'items': ['a', 'b']}}
  print(f"Found {{result['count']}} items")
  print(json.dumps(result, indent=2, ensure_ascii=False))
  ```
- ✅ GOOD:
  ```python
  result = [1, 2, 3, 4, 5]
  print(f"List of {{len(result)}} values: {{result}}")
  ```

**Example - Counting actions:**
```python
import json
trajectory_data = json.loads(trajectory_json)
trajectory = trajectory_data['trajectory']
action_counts = {{}}
for turn in trajectory:
    action = turn['action']
    action_counts[action] = action_counts.get(action, 0) + 1
result = action_counts
print(f"Action counts ({{len(action_counts)}} unique actions):")
print(json.dumps(result, indent=2, ensure_ascii=False))
```

**CRITICAL - Always include print statements:**
- Always store result in `result` variable
- Always print explanation + result using: `print(f"Description: {{value}}")` and `print(json.dumps(result, indent=2, ensure_ascii=False))`
- This helps understand what the code found

**Output Format - IMPORTANT:**
Start with brief reasoning (1-2 sentences max), then output code:

**CODE**:
```python
# Your code here
```

The **CODE**: marker and ```python block are REQUIRED.
"""

COMPRESS_PROMPT_TEMPLATE = """You are analyzing a trajectory chunk to extract structured state information with MAXIMUM DETAIL preservation.

Task: {task}

Trajectory Chunk:
{trajectory_text}

{previous_state_text}

Your task is to extract and organize key state information from the trajectory with EXTREME PRECISION.

You can use any format that works best (JSON, structured text, bullet points, etc.). For example:

Example JSON format:
{{
  "turn_range": {{"first": 1, "last": 45, "total_turns": 45}},
  "objects": ["obj_1", "obj_2", "obj_3"],
  "obj_state": {{
    "obj_1": [
      {{"turn": 1, "action":"exact action text", "state": "exact description, including location"}},
      {{"turn": 5, "action":"exact action text", "state": "including location"}}
    ]
  }},
  "key_information_per_turn": [
    {{
      "turn_1": "key information for turn 1"
       "turn_2": "key information for turn 2",
       ...}}
  ],
  "causal_chains": [
    "State A -> Action X -> State B",
    "Objective Y -> Action Z -> State C -> State D"
  ]
}}


For key information per turn, you should include the key details for EVERY turn.
For causal_chains: Trace the logical flow by linking actions and objectives to their subsequent states. Ensure multi-step causal sequences (e.g., $State A \rightarrow Action \rightarrow State B \rightarrow State C$) are fully captured as integrated chains.

After your analysis, output your state memory after the marker:

**STATE_MEMORY**
[Your state memory content here]
"""

ANSWER_WITH_RETRIEVAL_PROMPT_TEMPLATE = """Based on the compressed state memory and retrieved detailed information, provide a natural language answer to the query.

Query: {query}


Retrieved Detailed Information:
{relevant_mem}

**Instructions for answering:**
1. If the query asks for SPECIFIC DETAILS (tool names, service URLs, exact queries, turn numbers), extract them VERBATIM from the retrieved information
2. If the query asks about CAUSAL relationships (why, what caused, what led to), trace the reasoning chain through multiple turns
3. If the query asks about TEMPORAL changes (when did X change to Y), identify the exact turn where the change occurred
4. If the query asks about TOOLS/METHODS, provide the complete tool name/service name/URL as it appears in the trajectory
5. If the query asks about REASONING, explain the decision-making logic that connects multiple actions
6. Cross-check details between state memory and retrieved information - prefer more detailed source
7. For turn-specific questions, cite the EXACT turn number from the trajectory

CRITICAL: You MUST format your response as follows:
ANSWER: [Your concise, accurate answer here]

Only include the answer after "ANSWER:", nothing else."""

ANSWER_WITHOUT_RETRIEVAL_PROMPT_TEMPLATE = """Based on the compressed state memory, provide a natural language answer to the query.

Query: {query}



CRITICAL: You MUST format your response as follows:
ANSWER: [Your concise, accurate answer here]

Only include the answer after "ANSWER:", nothing else."""

CHECK_RETRIEVAL_SUFFICIENCY_PROMPT_TEMPLATE = """You are evaluating whether the retrieved information is sufficient to answer a question.

**Question:** {query}

**Retrieved Information:**
{retrieved_info}

Analyze carefully and determine if the retrieved information contains enough details to provide a complete and accurate answer to the question.

Format your response as ONE of these:
- If information is sufficient: SUFFICIENT
- If more information is needed: NEED_MORE_INFO: [Explain what specific information is still missing]

Response:"""

SUMMARIZE_CODE_RESULTS_PROMPT_TEMPLATE = """You have executed multiple retrieval iterations to gather information from a trajectory. Now summarize ALL the retrieved results to provide relevant information for answering the query.

**Query:** {query}

**All Retrieved Results from Code Execution:**
{all_results}

**Your Task:**
Analyze all the retrieved results above and create a comprehensive summary that includes ALL relevant information for answering the query.

**Guidelines:**
- Include specific details: turn numbers, exact values, tool names, URLs, parameters
- Preserve temporal sequences and causal relationships
- Keep all information that might be relevant to the query
- Organize the information clearly and logically
- Do NOT try to answer the query - just summarize the retrieved information

**SUMMARY:**
[Your comprehensive summary of all retrieved information]
"""

CHUNKED_ANSWER_PROMPT_TEMPLATE = """You are analyzing memory information to answer a query. You will be shown chunks of memory sequentially.

**Query:** {query}

{compressed_info_section}
**Memory Chunk ({chunk_idx}/{total_chunks}):**
{chunk_content}

**Your Task:**
Analyze this chunk and determine ONE of the following:

1. **If you can provide a complete answer now** (you have enough information from this chunk and/or previous compressed information):
   ANSWER: [Provide your complete, accurate answer here]

2. **If this chunk contains relevant information but you need more chunks to answer completely**:
   COMPRESS: [Summarize this chunk that helps answer the query. Be concise but preserve key details like turn numbers, exact values, tool names, etc.]

3. **If this chunk has no relevant information for the query**:
   NO_RELEVANT_INFO

**Important Guidelines:**
- Only choose ANSWER if you are confident you can provide a complete, accurate answer
- When compressing, focus on information that directly relates to the query
- Preserve specific details: turn numbers, exact tool names, URLs, parameters, values
- If this is the last chunk and you haven't answered yet, you MUST provide an answer based on available information

Response:"""

# Three-step turn retrieval prompt
CHUNK_SUFFICIENCY_JUDGMENT_PROMPT_TEMPLATE = """You have retrieved the top-ranked most relevant turns from an agent trajectory. Each turn has a UNIQUE TURN INDEX that you can reference.

**Query:** {query}

**Retrieved Turns:**
{retrieved_chunks}

**IMPORTANT - About Turn Indices:**
Each turn above has a unique index (e.g., "Turn 5", "Turn 8", "Turn 12"). You can use these indices to request:
1. Adjacent turns (turns immediately before/after)
2. Specific turn ranges (e.g., turns 5-10)
3. Individual turns by their indices (e.g., turns 3, 7, 15)

**Your Task:**
Carefully analyze the retrieved turns and determine ONE of the following:

1. **SUFFICIENT** - The retrieved turns contain enough information to answer the query completely and accurately.
   If you choose this, you MUST provide the answer immediately in the same response.
   Format:
   **SUFFICIENT**
   ANSWER: [Your complete, accurate answer here]

2. **NEED_GRAPH** - The query can likely be answered by looking at adjacent turns or specific turn ranges. This is useful when:
   - You found relevant information but need more context from surrounding actions
   - The answer requires understanding what happened before or after a specific event
   - You need to see the continuation of a sequence of actions

   **You can specify retrieval in multiple ways:**

   A. Request adjacent turns (before/after):
      - **NEED_GRAPH**: turn_5 before=2 after=1
      - **NEED_GRAPH**: turn_8 before=3 after=0, turn_15 before=0 after=2
      - **NEED_GRAPH**: turn_3 before=1 after=1, turn_7 before=2 after=2

   B. Request specific turn ranges:
      - **NEED_GRAPH**: turns 5-10
      - **NEED_GRAPH**: turns 3-8, turns 15-20

   C. Request specific individual turns:
      - **NEED_GRAPH**: turns 3, 7, 12, 18
      - **NEED_GRAPH**: turns 5, 8, 15

   D. Mix different formats:
      - **NEED_GRAPH**: turn_5 before=2 after=1, turns 15-20, turns 25, 30

   Default (if you don't specify numbers): before=1 after=1 for each turn.
   Old format still supported: **NEED_GRAPH**: before (retrieves 1 turn before each retrieved turn)

3. **NEED_CODE** - The query requires computational analysis, pattern finding, counting, or aggregation across the full trajectory that cannot be answered from turns alone. This is useful when:
   - You need to count occurrences, compute statistics, or find patterns
   - You need to aggregate information across many turns
   - You need to search for specific conditions across the entire trajectory
   - The turns don't contain the needed information and you need to process raw trajectory data
   Format: **NEED_CODE**: [Explain what computation/analysis is needed]

**Guidelines:**
- Choose SUFFICIENT only if you can provide a complete and accurate answer RIGHT NOW
- When choosing SUFFICIENT, you MUST include the answer immediately after "ANSWER:" in the same response
- Choose NEED_GRAPH if you see relevant information but need immediate context (previous/next turns)
- For NEED_GRAPH, you can specify:
  * Adjacent turns: turn_X before=N after=M
  * Turn ranges: turns X-Y
  * Specific turns: turns X, Y, Z
  * Or mix formats
- Choose NEED_CODE for computational queries or when turns don't contain the answer
- Be specific about what additional information you need

**Response:**"""
