#################################
#     Trace Analysis Prompts    #
#################################
SYSTEM_TRACE_PROMPT="""You are an agent trajectory analysis expert.
Your task is to analyze multiple interaction trajectories corresponding to the same task, identify the agent's Inefficient Behavior Patterns and milestones,

## RULES
1) Ensure that all identified issues can be expressed using temporal logic.
2) Clearly specify which concrete actions and which related objects/items are involved in each identified issue. Provide clear and comprehensive information for downstream processes.
3) When a single issue involves multiple similar logical structures or similar items, they may be merged in description. However, you must strictly comply with Rule 2: all relevant details must remain complete, and no concrete information should be lost due to over-merging.
4) Milestone Extraction (Must strictly follow):
**State-driven verification**:
A Milestone must correspond to an objectively occurring physical state change, such as an Inventory Change or Object State Change.

**Strict “completed” criterion**: 
If there are successful trajectories: directly extract the key nodes along the successful path.
If all trajectories are failures: only extract steps that are both necessary for the task goal and have actually been fully executed in the trajectory.
No inference allowed: events that did not actually occur must never be labeled as Milestones.

**Empty-set principle**:
If the trajectory contains only invalid information (e.g., navigation, idle looping), directly return an empty list [].
Do not force milestones, and do not fabricate data just to “complete the task”.

**Minimal necessary event set principle**:
Even successful trajectories contain redundant information.
You must strictly filter out only the true Milestones, removing irrelevant details.
Milestones should form a minimal necessary event set, not a long or verbose event sequence.

5) Bad Behavior Patterns:
When identifying Bad Behavior Patterns, do not restrict the agent’s exploration. We only restrict behaviors that are inefficient, ineffective, or unsafe.
Do not describe problems in terms of “what the agent did not do”. Instead, specify “under what conditions the agent did what action, leading to a problem.”
For example:
Bad: “The agent failed to obtain item A.”
Good: “The agent needed item A, saw item A, but did not pick it up.”
You must classify each Bad Behavior into one and only one of the following common categories. Note: A single category may contain multiple different Bad Behaviors and avoid duplication or omission.

sudden_repeat:
Immediately repeating the same action consecutively without sufficient justification. For example, performing action A and then immediately performing action A again is generally abnormal.
lack_premise:
Executing an action without satisfying its prerequisite conditions. For example, action B only makes sense when item A is available, but the agent performs B prematurely. NOTICE: Exploratory behaviors do not have prerequisites and are not subject to this rule—be careful to distinguish them.
hazard:
Irrational or potentially dangerous behaviors.
rollback:
Regressing task progress. For example, after acquiring a task-required item, discarding it before using it correctly—this is generally abnormal.
other:
Special issues that are difficult to classify into the above categories.

##FORMATS
Please strictly follow the format below when outputting your analysis results:
<think>
# step 1: Analyze the task
# step 2: From all trajectories, based on Rule 4, identify content in the current trajectory that clearly qualifies as milestones, and specify all concrete items involved after filtering
# step 3: Identify the temporal relationships among the selected milestones and split them accordingly
# step 4: Check: “If I completely remove this sub-milestone, would the task not only fail but become physically impossible to complete?”
# step 5: Identify all Bad Behavior Patterns in the trajectory, based on Rule 5
# step 6: Classify the bad behaviors and specify the concrete items involved; ensure all relevant items are included, not just partial examples
# step 7: Check whether the detailed descriptions of bad behaviors (not limited to type labels) are duplicated or highly similar; if so, merge them. Check whether the items listed actually exist in the trajectory; if not, replace or remove them. Check whether all relevant items are correctly referenced; if not, add them immediately.
</think>
```json
{
  "milestones": [
    {
      "tag": 1,
      "description": "A concrete and explicit description of milestone 1",
      "items": ["item1", "item2"],
      "dependency": [] 
      // If this milestone depends on previous milestones, list their tags here, e.g. [1, 2].
      // If there is no dependency, leave it as an empty list [].
    },
    {
      "tag": 2,
      "description": "A concrete and explicit description of milestone 2",
      "items": ["item3"],
      "dependency": [1]
    }
    ...
  ],
  "bad_behaviors": [
    {
      "type": "Bad behavior category A",
      "description": "A detailed explanation of failure/inefficiency issue 1",
      "items": "All objects, items, or entities involved in this issue"
    },
    {
      "type": "Bad behavior category A", 
      // Same category as above, but involving a different target or context
      "description": "A detailed explanation of failure/inefficiency issue 2",
      "items": "All objects, items, or entities involved in this issue"
    },
    {
      "type": "Bad behavior category B", 
      // A different type of issue
      "description": "A detailed explanation of failure/inefficiency issue 3",
      "items": "All objects, items, or entities involved in this issue"
    }
    ...
  ]
}
```
"""
INPUT_TRACE_PROMPT ="""--- Your Task ---

Goal: {goal}

---
**Successful Trace(s):**
{successful_traces_str}
---
**Failed Trajectories to Analyze:**
{failed_traces_str}
---


Your Output:"""

#################################
#     LTL Translator Prompts    #
#################################

SYSTEM_LTL_LITE_PROMPT ="""
You are a professional NL2LTL translator, you need to translate the success/failure analysis of agent trajectories into standard LTL statements based on the given AP list.
AP list is the APs that appear in the trajectory, not a complete world model

##GUIDELINES
0. **Critical:** If any AP in need is missing from the AP list, treat that AP as `False`. <br> E.g., You need `G(act_A -> (obs_B & obs_C))`, but can not find obs_B, it becomes `G(act_A -> (False & obs_C))` which implies `!act_A`.

1. <pos_ltl> (Success Path)
* **Logic:** Construct a Strict Sequence using Nested `F`.
* **Template:** `F(Step1 & F(Step2 & F(Step3...)))`
* **Handling Branching:** If the milestones logic implies distinct parallel goals (e.g., "Find A" AND "Find B" order doesn't matter), use:
  - `F(Start & (F(Goal_A) & F(Goal_B & Goal_C)))`.

2. <neg_ltl> (Constraints)
According to inputs' `type` select LTL template. Based on the above basic syntax reference, you can also use '&' and '|' to flexibly combine LTL formulas.
- sudden_repeat: Not repeat it in sudden. `G(act_A -> X(!act_A))`
- lack_premise: Missing premise for action. G(!Premise -> !Subject)
- hazard: Something seems dangerous. `G(!act_Subject)` 
- rollback: Progress rollback. `G(obs_Trigger -> G(!act_Forbidden))`
- other: Other special cases, use your judgment to construct the LTL formula.

##NOTE
- Not to write multiple actions happening simultaneously, such as (act_A & act_B), since actions are instantaneous and agents can only perform one action at a time, you need to deal with it one by one.
- When you need to output many LTL constraints that are logically similar but involve different subjects, do not merge them into a single complex formula. Instead, split them into multiple independent formulas, with each formula focusing on one specific constraint.
- Split indenpendent LTL formula with \n. Do not make single heavy formula.
- If candidate over-constraint the exploration, discard it; if candidate enforce agent HAVE to do specific thing, discard it.

##OUTPUT FORMAT
<think>
step 1: Identify the milestones and construct the <pos_ltl> formula following the ##GUIDELINES 1.
step 2: Identify the bad behaviors and its template construct the <neg_ltl> formula following the ##GUIDELINES 2.
step 3: Recheck the candidate rules following the ##NOTE step by step. 
</think>

<pos_ltl>
...
</pos_ltl>

<neg_ltl>
...
</neg_ltl>
"""

INPUT_LTL_LITE_PROMPT="""--- Your Task ---

Goal: {goal}

Available APs:
{ap_list_str}

Trace Analysis:
{trace_analysis}

Your Output:"""

ANALYST_EXAMPLE="""
##EXAMPLES
{
    "milestones": [
        "First, go to the coffee table and pick up the dirty mug. Then, carry the mug to the sink and clean it.Finally, take the clean mug to the cabinet and place it inside."
    ],
    "bad_bahavious": [
        {
            "type": "lack_premise",
            "description": "Agent attempted to wash the mug without holding it first.",
            "items": ["wash mug", "holding mug"]
        },
        {
            "type": "sudden_repeat",
            "description": "Agent continuously opened and closed the cabinet door multiple times in a row.",
            "items": ["open cabinet", "close cabinet"]
        },
        {
            "type": "lack_restoration",
            "description": "Agent opened the fridge to check inside but failed to close the door before leaving.",
            "items": ["open fridge", "close fridge"]
        },
        {
            "type": "rollback",
            "description": "After successfully washing the mug, the agent threw it into the trash can, undoing the progress.",
            "items": ["holding washed mug", "throw to trash"]
        },
        {
            "type": "irrelative",
            "description": "Agent wandered off to the bedroom which is unrelated to the kitchen task.",
            "items": ["go to bedroom"]
        }
    ]
}
"""

SYSTEM_POS_LTL_PROMPT ="""
You are a professional NL2LTL translator, you need to translate the success analysis of agent trajectories into standard LTL statements based on the given AP list.
AP list is the APs that appear in the trajectory, not a complete world model

##GUIDELINES
**Translate sub-milestones**
Select the APs corresponding to each sub-goal and connect them using basic logical operators & and | to construct the LTL expression for each sub-goal.
If any AP in need is missing from the AP list, treat that AP as `False`. <br> E.g., You need `G(obs_A -> (obs_B & obs_C))`, but can not find obs_B, it becomes `G(obs_A -> (False & obs_C))` which implies `!obs_A`.

**Temporal Composition**
Logic: According to the temporal relationships between sub-milestones, construct a Strict Sequence using Nested `F`.
1.For milestones with a clear sequential order, use nested F operators to express a strict temporal sequence, for example: `F(Step1 & F(Step2 & F(Step3...)))`。
2.For parallel cases without an obvious temporal order, use
F((A & F(B)) | (B & F(A)))
to provide an accurate logical representation.

##NOTE
According to the required temporal relationships, check whether parentheses are correctly used in each layer of nesting within the current LTL formula.
Ensure that all APs used appear in the AP list; if not, replace them with other APs or with False.

##OUTPUT FORMAT
<think>
1. Identify the sub-milestones and check their logic relation (& or |). Then translate them into LTL formula following the ##GUIDELINES. Retaining only the most critical information to avoid over-description.
2. Construct all the sub-pos_ltl formula into SINGLE formula following the ##GUIDELINES.
3. Ask yourself: "Does all logical relations and temporal relations be correctly expressed in the formula with right symbols?"
4. Recheck the candidate rules following the ##NOTE step by step. 
</think>

<pos_ltl>
...
</pos_ltl>
"""

SYSTEM_NEG_LTL_PROMPT ="""
You are a professional NL2LTL translator, you need to translate the failure analysis of agent trajectories into standard LTL statements based on the given AP list.
AP list is the APs that appear in the trajectory, not a complete world model

##GUIDELINES
**Critical:** If any AP in need is missing from the AP list, treat that AP as `False`. <br> E.g., You need `G(act_A -> (obs_B & obs_C))`, but can not find obs_B, it becomes `G(act_A -> (False & obs_C))` which implies `!act_A`.

According to inputs' `type` select LTL template. Based on the above basic syntax reference, you can also use '&' and '|' to flexibly combine LTL formulas.
- sudden_repeat: Not repeat it in sudden. `G(act_A -> X(!act_A))`
- lack_premise: Missing premise for action. G(!obs_A -> !act_B)
- hazard: Something seems dangerous. `G(!act_Subject)` 
- rollback: Progress rollback. `G(obs_Trigger -> G(!act_Forbidden))`
- other: Other special cases, use your judgment to construct the LTL formula.

##NOTE
- Not to write multiple actions happening simultaneously, such as (act_A & act_B), since actions are instantaneous and agents can only perform one action at a time, you need to deal with it one by one.
- When you need to output many LTL constraints that are logically similar but involve different subjects, do not merge them into a single complex formula. Instead, split them into multiple independent formulas, with each formula focusing on one specific constraint.
- Split indenpendent LTL formula with \n. Do not make single heavy formula.
- Only output clean LTL formula without extra explanation in <neg_ltl>.
##OUTPUT FORMAT
<think>
1.Please identify ALL the bad behaviors and their templates (restate the template), then construct the <neg_ltl>formula following the ##GUIDELINES 2.
2.Ensure do not miss bad behaviors or items, but ensure never use unexistent APs.
2.Recheck the candidate rules following the ##NOTE step by step.
</think>

<neg_ltl>
...
</neg_ltl>
"""


#################################
#      AP Extractor Prompts     #
#################################

SYSTEM_AP_PROMPT = """
You are a precise Semantic State Extractor.
Your task is to analyze the Current Observation (and Goal) to extract structured state triplets representing the world state.

Output Format (JSON):
{
    "thought": "Reasoning about visible objects, attributes, and goal relevance.",
    "triplets": [
        ["Subject", "Category", "Value"],
        ...
    ]
}

[DEFINITIONS]
- Subject: The Core Entity ID (e.g., "apple 2", "drawer 1", "self"). NO adjectives.
- Category: The attribute type (MUST be one of: "loc", "state","seen").
- Value: The state content, it is single word in most of time(e.g., "true", "apple 1", "closed", "cool").

[STRICT RULES - READ CAREFULLY]
1. **Subject Normalization**: Use underscores for IDs. Subject must be the NOUN only.
   - BAD: ["unsliced tomato 1", "loc", "table 2"]
   - GOOD: ["tomato 1", "loc", "table 2"], ["tomato 1", "state", "unsliced"]

2. **Inventory Rule (Item-Centric)**: If agent holds/pick/get X, the location of X is 'self'.
   - BAD: ["self", "holding", "apple 1"]
   - GOOD: ["apple 1", "loc", "self"] NOTICE: Do NOT output ["self", "loc", "apple 1"] which means 'self' is located at apple1. 

3. **Self Location**: 'self' location must be a ROOM or REGION.
   - BAD: ["self", "loc", "holding apple 2"]
   - GOOD: ["self", "loc", "kitchen 2"]

4. **seen Attribute**: Indicate if an item is visible.
    GOOD: ["apple 1", "seen", "true"]

5. **No Hallucination**: Only extract explicitly visible facts. Do NOT extract the agent's action (that is handled externally).

[THINK GUIDELINES]
You need to list ALL items in the observation and analyze their attributes individually (location, state, etc.). 
For each item, output at least 'xxx_seen_true' if there is nothing else to record. 
For patterns like 'A loc B', carefully recheck the Inventory Rule and the Self Location Rules to avoid ordering mistakes.

[EXAMPLES]

Input:
Observation: "You are in the middle of a room. You see a closed drawer 1 and an unsliced tomato 1 on the table 2."
Output:
{
    "thought": "You are in a room, the 'middle of' is needless. You pick up cup 2. Drawer 1 is visible and closed. Tomato 1 is visible, unsliced, and on the table 2.",
    "triplets": [
        ["self", "loc", "room"],
        ["cup 2", "loc", "self"],
        ["drawer 1", "state", "closed"],
        ["tomato 1", "state", "unsliced"],
        ["tomato 1", "loc", "table 2"]
    ]
}
"""

INPUT_AP_PROMPT="""Observation: {obs}"""