purpose: "In Context Learning HCraft, should only work for known dynamics"
domain: HCraft
date: "2025Feb26"
icl_prompt_action_advising: "
\n Below are examples about reasoning over the best action. \n
\n Question: 
\n You are playing MineCraft and you want to craft diamond. 
\n You are in forest.
\n Items in your current zone are:
\n Nothing
You have the following items in your inventory:\n
dirt: 8\n
wood plank: 4\n
stick: 8\n
\n
You can take these actions:\n
craft stick\n
craft crafting table\n
search for dirt at forest\n
search for wood at forest\n
move to swamp from forest\n
move to meadow from forest\n
Which action do you choose? Please think step by step.\n
Only give the answer in a new line in JSON format:\n
{\"reasoning\": <REASONING>, \"action\": <ACTION>} \n
Answer:
{\"reasoning\": \"To craft a diamond, I will eventually need tools to mine resources. Since crafting requires a crafting table and I do not have one yet, my best next step is to craft a crafting table so I can start making tools.\", \"action\": \"craft crafting table\"}
\n \n
Question: \n
You are playing MineCraft and you want to craft diamond. \n
You are in forest.\n
Items in your current zone are:\n
crafting table: 1\n
You have the following items in your inventory:\n
dirt: 12\n
reeds: 3\n
wood plank: 1\n
stick: 9\n
paper: 3\n
wood pickaxe: 6\n
wood sword: 10\n
cobblestone: 4\n
\n
You can take these actions:\n
pickup crafting table\n
craft paper\n
craft wood shovel\n
craft stone pickaxe\n
craft stone axe\n
craft stone shovel\n
craft stone  sword\n
search for dirt at forest\n
search for wood at forest\n
search for cobblestone with wood pickaxe at forest\n
move to swamp from forest\n
move to meadow from forest\n
Which action do you choose? Please think step by step.\n
Only give the answer in a new line in JSON format:\n
{\"reasoning\": <REASONING>, \"action\": <ACTION>} \n
Answer: \n
{\"reasoning\": \"To craft a diamond, I will eventually need better tools, such as an iron pickaxe, which requires iron ingots. To obtain iron, I first need a stone pickaxe, which requires cobblestone. Since I currently have 4 cobblestone and need 3 to craft a stone pickaxe, I should use my wood pickaxe to collect more cobblestone so I can make multiple stone tools later.\", \"action\": \"search for cobblestone with wood pickaxe at forest\"}
\n \n
Question: \n
You are playing MineCraft and you want to craft diamond.  \n
You are in underground. \n
Items in your current zone are: \n
Nothing \n
You have the following items in your inventory: \n
wood plank: 5 \n
stick: 4 \n
wood pickaxe: 7 \n
cobblestone: 6 \n
stone pickaxe: 16 \n
 \n
You can take these actions: \n
craft stick \n
craft crafting table \n
search for cobblestone with stone pickaxe at underground \n
search for cobblestone with wood pickaxe at underground \n
search for coal with stone pickaxe at underground \n
search for coal with wood pickaxe at underground \n
search for iron ore with stone pickaxe at underground \n
move to forest from underground \n
move to swamp from underground \n
move to meadow from underground \n
Which action do you choose? Please think step by step. \n
Only give the answer in a new line in JSON format: \n
{\"reasoning\": <REASONING>, \"action\": <ACTION>} \n
Answer: \n
{\"reasoning\": \"To craft a diamond, I need to mine diamonds, which requires an iron pickaxe. To craft an iron pickaxe, I need iron ingots, which are obtained by smelting iron ore. Since I have stone pickaxes, I should search for iron ore first to start the process of acquiring iron.\", \"action\": \"search for iron ore with stone pickaxe at underground\"}
"

icl_prompt_preference: "
\n Below are examples about reasoning over the actions given a state. \n
Question: \n
You are playing MineCraft and you want to craft diamond.You will be asked to compare two actions given a state and provide preference. \n\nYou are in forest.\nItems in your current zone are:\n\ncrafting table: 1\nYou have the following items in your inventory:\nwood plank: 5\nstick: 4\nwood pickaxe: 7\nstone pickaxe: 20\n\nYou can take these actions:\npickup crafting table\ncraft stick\ncraft crafting table\ncraft wood pickaxe\ncraft wood axe\ncraft wood shovel\ncraft wood sword\nsearch for dirt at forest\nsearch for wood at forest\nsearch for cobblestone with stone pickaxe at forest\nsearch for cobblestone with wood pickaxe at forest\nmove to swamp from forest\nmove to meadow from forest\nmove to underground from forest with stone pickaxe\n    \nGiven search for cobblestone with stone pickaxe at forest or search for dirt at forest, which action is better? Please think step by step.\nOnly give the answer in a new line in JSON format:\n{\"reasoning\": <REASONING>, \"preference\": <PREFERENCE>}\nWhere <PREFERENCE> is one of \"FIRST\" or \"SECOND\", <REASONING> is a string of your thinking steps. \n
Answer: \n
{\"reasoning\": \"To eventually get diamonds, we must obtain and smelt iron. We need cobblestone to craft a furnace, so searching for cobblestone is more useful than collecting dirt.\", \"preference\": \"FIRST\"} \n
Question: \n You are playing MineCraft and you want to craft diamond.You will be asked to compare two actions given a state and provide preference. \n\nYou are in forest.\nItems in your current zone are:\n\ncrafting table: 1\nYou have the following items in your inventory:\nwood plank: 4\nstick: 8\n\nYou can take these actions:\npickup crafting table\ncraft stick\ncraft crafting table\ncraft wood pickaxe\ncraft wood axe\ncraft wood shovel\ncraft wood sword\nsearch for dirt at forest\nsearch for wood at forest\nmove to swamp from forest\nmove to meadow from forest\n    \nGiven craft crafting table or craft wood pickaxe, which action is better? Please think step by step.\nOnly give the answer in a new line in JSON format:\n{\"reasoning\": <REASONING>, \"preference\": <PREFERENCE>}\nWhere <PREFERENCE> is one of \"FIRST\" or \"SECOND\", <REASONING> is a string of your thinking steps. \n
Answer: {\"reasoning\": \"We already have access to a crafting table in the forest, so crafting another is not immediately useful. We need a wood pickaxe to gather stone and progress toward diamonds.\", \"preference\": \"SECOND\"} \n
Question: You are playing MineCraft and you want to craft diamond.You will be asked to compare two actions given a state and provide preference. \n\nYou are in forest.\nItems in your current zone are:\n\ncrafting table: 1\nYou have the following items in your inventory:\nwood plank: 5\nstick: 4\nwood pickaxe: 7\ncobblestone: 6\nstone pickaxe: 17\n\nYou can take these actions:\npickup crafting table\ncraft stick\ncraft crafting table\ncraft wood pickaxe\ncraft wood axe\ncraft wood shovel\ncraft wood sword\ncraft stone pickaxe\ncraft stone axe\ncraft stone shovel\ncraft stone sword\nsearch for dirt at forest\nsearch for wood at forest\nsearch for cobblestone with stone pickaxe at forest\nsearch for cobblestone with wood pickaxe at forest\nmove to swamp from forest\nmove to meadow from forest\nmove to underground from forest with stone pickaxe\n    \nGiven move to underground from forest with stone pickaxe or craft stone sword, which action is better? Please think step by step.\nOnly give the answer in a new line in JSON format:\n{\"reasoning\": <REASONING>, \"preference\": <PREFERENCE>}\nWhere <PREFERENCE> is one of \"FIRST\" or \"SECOND\", <REASONING> is a string of your thinking steps.
\n
Answer: {\"reasoning\": \"To get diamonds, we must find and mine them in the underground. A stone sword might be useful later for combat, but it does not directly advance finding diamonds right now, so moving underground is more important.\", \"preference\": \"FIRST\"} \n
"
icl_prompt_binary_feedback: "
Below are examples about how to judge an action \n
Question: \n
You are playing MineCraft and you want to craft diamond.You will be asked to judge an action whether it is the optimal action, given a state. \n\nYou are in underground.\nItems in your current zone are:\nNothing\nYou have the following items in your inventory:\nwood plank: 5\nstick: 4\nwood pickaxe: 7\ncobblestone: 6\nstone pickaxe: 16\n\nYou can take these actions:\ncraft stick\ncraft crafting table\nsearch for cobblestone with stone pickaxe at underground\nsearch for cobblestone with wood pickaxe at underground\nsearch for coal with stone pickaxe at underground\nsearch for coal with wood pickaxe at underground\nsearch for iron ore with stone pickaxe at underground\nmove to forest from underground\nmove to swamp from underground\nmove to meadow from underground\n        \nIs action search for cobblestone with wood pickaxe at underground the best action you can take? Please think step by step.  \nOnly give the answer in a new line in JSON format:\n{\"reasoning\": <REASONING>, \"feedback\": <FEEDBACK>}\nWhere <FEEDBACK> is one of \"YES\" or \"NO\", <REASONING> is a string of your thinking steps. \n
Answer: \n
{\"reasoning\": \"We need more cobblestone to craft a furnace for smelting iron. Using the stone pickaxe is more efficient than the wood pickaxe. Therefore, it’s better to search for cobblestone or iron ore with the stone pickaxe rather than the wood pickaxe.\", \"feedback\": \"NO\"}
\n
Question: \n
You are playing MineCraft and you want to craft diamond.You will be asked to judge an action whether it is the optimal action, given a state. \n\nYou are in forest.\nItems in your current zone are:\nNothing\nYou have the following items in your inventory:\nwood: 1\n\nYou can take these actions:\ncraft wood plank\nsearch for dirt at forest\nsearch for wood at forest\nmove to swamp from forest\nmove to meadow from forest\n        \nIs action search for wood at forest the best action you can take? Please think step by step.  \nOnly give the answer in a new line in JSON format:\n{\"reasoning\": <REASONING>, \"feedback\": <FEEDBACK>}\nWhere <FEEDBACK> is one of \"YES\" or \"NO\", <REASONING> is a string of your thinking steps.
Answer: \n
{\"reasoning\": \"We only have one piece of wood and no crafting table. Getting more wood is the best next step so we can eventually craft the tools (crafting table, pickaxes) needed to progress toward diamonds.\", \"feedback\": \"YES\"}
\n
Question: \n
You are playing MineCraft and you want to craft diamond.You will be asked to judge an action whether it is the optimal action, given a state. \n\nYou are in forest.\nItems in your current zone are:\n\ncrafting table: 1\nYou have the following items in your inventory:\nwood plank: 5\nstick: 6\nwood pickaxe: 9\ncobblestone: 1\n\nYou can take these actions:\npickup crafting table\ncraft stick\ncraft crafting table\ncraft wood pickaxe\ncraft wood axe\ncraft wood shovel\ncraft wood sword\ncraft stone shovel\nsearch for dirt at forest\nsearch for wood at forest\nsearch for cobblestone with wood pickaxe at forest\nmove to swamp from forest\nmove to meadow from forest\n        \nIs action craft wood axe the best action you can take? Please think step by step.  \nOnly give the answer in a new line in JSON format:\n{\"reasoning\": <REASONING>, \"feedback\": <FEEDBACK>}\nWhere <FEEDBACK> is one of \"YES\" or \"NO\", <REASONING> is a string of your thinking steps. 
\n
Answer: \n
{\"reasoning\": \"We only have 1 cobblestone, but we need more to craft a furnace and move towards getting an iron pickaxe. Crafting a wood axe doesn’t help us progress; we should gather more cobblestone first.\", \"feedback\": \"NO\"}
\n
"