type,prompt
base,"I'm $AGENT_NAME$. I'm in a hurry to finish the housework with my friend $OPPO_NAME$ together. Given our shared goal, dialogue history, and my progress and previous actions, please help me choose the best available action to achieve the goal as soon as possible. Do not choose actions related to goals that have already been achieved or are likely to have been achieved by the opponent. Note that I can hold two objects at a time. All objects are denoted as <name> (id), such as <table> (712).
Goal: $GOAL$
Progress: $PROGRESS$
Dialogue history:
$DIALOGUE_HISTORY$
Previous actions: $ACTION_HISTORY$
Available actions:
$AVAILABLE_ACTIONS$
Answer:"
check,"I'm $AGENT_NAME$. I'm examining my reasoning trace for uncertainties before acting.
Reasoning Trace = the planner's internal chain-of-thought used to select the next action in the previous planning step.
Goal: $GOAL$
Progress: $PROGRESS$
Dialogue history:
$DIALOGUE_HISTORY$
Previous actions: $ACTION_HISTORY$
Available Actions:
$AVAILABLE_ACTIONS$
Reasoning Trace:
$REASONING_TRACE$

Your task is to convert the Reasoning Trace into a deeper tree-of-thinking that enumerates major uncertainties (including multi-agent cases), where each leaf maps to exactly one next action (copied verbatim from Available Actions). The evaluator will score each scenario leaf (Likelihood, Performance) versus cost (distance/communication) to select the best action.

Rules:
1. Identify the uncertain statements in the Reasoning Trace that materially affect action choice.
   - Uncertain = assumptions, guesses, speculation, or information that may be invalid due to collaborators’ past actions.
   - Example: 'The apple might be in the kitchen.' or 'Bob probably already checked the cabinet.'

2. For each uncertain statement, create a binary branch and limit the maximum tree depth to 3.
   - 'True': the assumption is correct.
   - 'False': the assumption is incorrect.

3. Expand the tree in order of (i) higher uncertainty, then (ii) greater expected impact on goal success, steps, or token usage.
   - Include multi-agent third-cases (e.g., the object existed but was already taken by the collaborator and my information is stale).
   - For each branch, continue with the next uncertainty; if none remain, attach exactly one action string copied verbatim from Available Actions.

4. Ground leaf nodes using only actions from Available Actions.
   - Do not invent new objects/IDs. Use only entities present in Reasoning Trace, Progress, or Available Actions.
   - You may communicate with the collaborator via [send_message] <'...'> even if it is not listed in Available Actions. All other actions must come from Available Actions.
   - If two different scenarios lead to the same action, keep separate leaves (the evaluator still needs distinct scenarios for expected-value computation).

5. Output only the nested JSON object that represents the scenario tree (no extra text).
   - Keys = uncertain statements (or chosen actions).
   - Values = dicts with 'True' and 'False' branches OR direct action strings.
   - Each action must preserve its full format: <name> (id), copied verbatim.

Example (paired with Reasoning Trace and Available Actions):

Example Reasoning Trace:
- You are in the bathroom and holding nothing. Goal: put 3 cutlery forks into the dishwasher.
- Distances — Kitchen: 7.38m, Living room: 13.89m, Bedroom: 8.39m.
- Checking the bathroom cabinet is cheapest (3.35m) and might contain a fork.

Available Actions:
- [gocheck] <bathroomcabinet> (190) - my cost: 3.35 meters
- [goexplore] <kitchen> (11) - my cost: 7.38 meters
- [goexplore] <livingroom> (267) - my cost: 13.89 meters
- [goexplore] <bedroom> (172) - my cost: 8.39 meters
- [send_message]

Example Output (valid JSON):
{
  'type': 'hypothesis',
  'text': 'The <bathroomcabinet> (190) contains a <cutleryfork>',
  'reason': 'Nearby low-cost container; utensils may be stored here.',
  'True': {
    'type': 'action',
    'action': '[gocheck] <bathroomcabinet> (190) - my cost: 3.35 meters'
  },
  'False': {
    'type': 'hypothesis',
    'text': 'Bob already took a <cutleryfork> from the <bathroomcabinet> (190)',
    'reason': 'Lack of observation about collaborator’s past actions; object could have been collected earlier.',
    'True': {
      'type': 'action',
      'action': '[send_message] <\'Bob, did you already take a fork from the bathroom cabinet?\'>'
    },
    'False': {
      'type': 'hypothesis',
      'text': 'Forks are in the <Kitchen> (11) (more likely than other rooms)',
      'reason': 'Utensils are commonly in the kitchen; cost to kitchen (7.38m) is lower than living room (13.89m).',
      'True': {
        'type': 'action',
        'action': '[goexplore] <kitchen> (11) - my cost: 7.38 meters'
      },
      'False': {
        'type': 'hypothesis',
        'text': 'Next best room to explore (by distance) is the <Bedroom> (172) over the <Living room> (267)',
        'reason': 'Bedroom cost (8.39m) < Living room cost (13.89m); both unexplored.',
        'True': {
          'type': 'action',
          'action': '[goexplore] <bedroom> (172) - my cost: 8.39 meters'
        },
        'False': {
          'type': 'action',
          'action': '[goexplore] <livingroom> (267) - my cost: 13.89 meters'
        }
      }
    }
  }
}"
evaluate,"I'm $AGENT_NAME$. I have received a Scenario Tree produced by the checker. Each leaf corresponds to a concrete next-action choice under a specific set of assumptions (including multi-agent cases such as collaborator-taken objects or stale information).

Tree format you may receive:
- Hypothesis node:
  {'type': 'hypothesis', 'text': '<assumption>', 'reason': '<why this is considered>', 'True': {..}, 'False': {..}}
- Action node (leaf):
  {'type': 'action', 'action': '[action_string]'}

Your task: Evaluate all actionable leaves (each leaf that contains a concrete action) and return a compact mapping that the selector can rank. For each scenario (leaf), provide:

1. Likelihood (1–5):
   - How plausible it is that this branch (set of assumptions) is true.
   - 1 = very unlikely, 5 = highly likely.

2. Gain (1–5):
   - How much this action advances/satisfies the goal under this branch.
   - 5 = directly satisfies or the most impactful next step; 3 = moderate progress; 1 = marginal.

3. CostPenalty (1–5):
   - How costly, risky, or burdensome this action is to perform (in terms of effort, time, or communication overhead).
   - Consider physical movement, communication, and resource usage
   - 1 = very low cost: communication ([send_message]), or a short move less than 3m
   - 3 = moderate cost: physical move less than 10m, or multiple small actions combined.
   - 5 = very high cost: long movement across multiple rooms (>15m), heavy resource usage, or actions likely to waste significant time/effort.

4. Action:
   - Exactly one action from Available Actions that corresponds to this scenario branch.
   - Copy the full action string verbatim (e.g., '[gograb] <apple> (36)'). If the leaf already contains a concrete action string, use that.

Constraints:
- Use only actions that appear in Available Actions (verbatim match; do not invent objects/IDs).
- You may communicate with the collaborator via [send_message] <'...'> even if it is not listed in Available Actions.
- Evaluate every leaf that contains an action. If the Scenario Tree has N actionable leaves, return N scenarios.
- Provide Likelihood, Gain, and CostPenalty as defined above. Performance will be computed by the system.
- Number scenarios in a stable order (e.g., left-to-right depth-first traversal): 'Scenario 1', 'Scenario 2', ...

Input:
Goal: $GOAL$
Progress: $PROGRESS$
Dialogue history:
$DIALOGUE_HISTORY$
Previous actions: $ACTION_HISTORY$
Available Actions:
$AVAILABLE_ACTIONS$
Scenario Tree:
$SCENARIO_TREE$

Output Format:
Return a JSON dictionary (no extra text) where:
- Keys = scenario identifiers (e.g., 'Scenario 1', 'Scenario 2', ...).
- Values = a dictionary with:
  - 'Likelihood': integer (1–5)
  - 'Gain': integer (1–5)
  - 'CostPenalty': integer (1–5)
  - 'Action': string

Example Output:
{
  'Scenario 1': {
    'Likelihood': 4,
    'Gain': 5,
    'CostPenalty': 3,
    'Action': '[gograb] <cutleryfork> (373)'
  },
  'Scenario 2': {
    'Likelihood': 3,
    'Gain': 4,
    'CostPenalty': 5,
    'Action': '[gocheck] <kitchencabinet> (75)'
  },
  'Scenario 3': {
    'Likelihood': 4,
    'Gain': 2,
    'CostPenalty': 1,
    'Action': '[send_message] <''Bob, did you check the kitchen?''>'
  }
}"
reply,"I'm $AGENT_NAME$. My collaborator $OPPO_NAME$ just sent a message and I must reply concisely and helpfully to keep us coordinated on the shared goal.
Goal: $GOAL$
My Progress: $PROGRESS$
Dialogue history:
$DIALOGUE_HISTORY$
Previous actions: $ACTION_HISTORY$
Last incoming message from $OPPO_NAME$:
$LAST_MESSAGE$

First, directly ANSWER any explicit question in the last message.

Then, provide exactly ONE short follow-up (status, location, or confirmation) that reduces uncertainty most relevant to the goal (e.g., where an item is, who will grab it, whether a room was checked).

If making/confirming commitments, reference specific objects with their full notation <name> (id) ONLY if that exact id is known from context; otherwise use just the name without inventing ids.

Keep it brief. Be polite, action-oriented, and unambiguous.

OUTPUT FORMAT: return exactly one action string in this format:
[send_message] <'...'>
Do not add any extra text before or after.

Answer:"