{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Quick start\n",
    "\n",
    "In this quickstart guide, you'll learn how to:\n",
    "- Run pre-built hierarchical agents on PettingZoo environments \n",
    "- Configure and customize agent parameters\n",
    "- Evaluate trained agents\n",
    "- Set up your own experiments\n",
    "\n",
    "\n",
    "TAG provides a set of agents that you could easily apply to any `pettingzoo.ParallelEnv`.\n",
    "\n",
    "The agents are in `scr/tame/agents`.\n",
    "\n",
    "In order to run them, you just have to prepare a config file and run the `run_experiment` script.\n",
    "\n",
    "You can find most of the config files to run the agents on the `VMAS Balance` environment and on the `MPE Simple spread` environment in `exp_configs`."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "As an example, to run `3-PPO` on `MPE Simple Spread`, you can just run in your terminal:\n",
    "\n",
    "```bash\n",
    "python scripts/run_experiment.py --config-path exp_configs/mpe_simple_spread --config-name cfg_3ppo_221 --n-agents 4 --seed 42 --cuda 0\n",
    "```"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "When you run this command, you should see:\n",
    "\n",
    "1. Initial environment setup information\n",
    "2. A progress bar showing training steps\n",
    "\n",
    "The agent will save checkpoints periodically in the directory specified by SAVE_PATH in your config.\n",
    "You can then plot the results by using the notebook `src/tame/plotting/plot_reward.ipynb`\n",
    "\n",
    "---"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "You can pass many parameters to the `run_experiment` script. To see what you can pass, just run:\n",
    "\n",
    "```bash\n",
    "python scripts/run_experiment.py --help\n",
    "```\n",
    "\n",
    "This will print:\n",
    "```bash\n",
    "Usage: run_experiment.py [OPTIONS]\n",
    "\n",
    "Options:\n",
    "  --config-path PATH        Path to config folder\n",
    "  --config-name TEXT        Name of config file\n",
    "  --n-agents INTEGER        Number of agents in the environment\n",
    "  --seed INTEGER            Random seed\n",
    "  --cuda INTEGER            Index of cuda to use\n",
    "  --training-steps INTEGER  Number of environment training steps\n",
    "  --train-agent BOOLEAN     Trains the agent if True. If False, it will load the trained agent from the SAVE_PATH / RUN_NAME specified in the config file\n",
    "  --exp-name TEXT           Name of the experiment\n",
    "  --help                    Show this message and exit.\n",
    "```"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## The config file"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The config file is simply a python file in which you specify the agent, it's arguments, the environments and other related experiment paramters.\n",
    "\n",
    "The config for `3-PPO` on `Simple spread` looks like this:\n",
    "\n",
    "```python\n",
    "from pathlib import Path\n",
    "from tame.agents.ppo3 import Agent, Args  # Import agent and arguments\n",
    "from tame.envs.mpe_simple_spread import Env  # Import environment\n",
    "\n",
    "# Agent and Env\n",
    "MAX_TS = 100 # Max number of ts per episode\n",
    "\n",
    "\n",
    "# Agent args\n",
    "agent_args = Args(\n",
    "    total_timesteps=2000000,\n",
    "    cuda=2,\n",
    "    seed=10,\n",
    "    verbose=False,\n",
    "    ifreq_top=2,\n",
    "    ifreq_mid=2,\n",
    "    learn_comm=False,\n",
    "    save_all_trace=False,\n",
    ")\n",
    "\n",
    "# Env args\n",
    "TOTAL_AGENTS = 4  # Total agents in the environment (it's overwritten if --n-agents is passed when calling run_experiment.py)\n",
    "\n",
    "# Exp args\n",
    "EVAL_RUNS = 1  # Total evaluation runs\n",
    "SAVE_PATH = Path(\"./exps/simple_spread\") \n",
    "RUN_NAME = f\"3ppo_221__a{TOTAL_AGENTS}_s{agent_args.seed}\"\n",
    "TRAIN = True # If false it loads the trained agent from SAVE_PATH/RUN_NAME to evaluate it\n",
    "```"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "---\n",
    "\n",
    "### Note\n",
    "The `run_experiment.py` script can run any agent as long as it respects the `BaseAgent` interface from `src/tame/hierarchy/base_agent.py`\n",
    "\n",
    "---"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Next steps\n",
    "Now that you have a working agent, here are some ways to expand:\n",
    "\n",
    "### Try Different Environments\n",
    "\n",
    "- Check `exp_configs/` for example configurations on other environments\n",
    "- Adapt configs for your own PettingZoo environments by:\n",
    "    - Adjusting n-agents to match your environment\n",
    "    - Modifying agent parameters for your specific task\n",
    "    - Adjusting training steps based on task complexity\n",
    "\n",
    "### Experiment with Agents\n",
    "TAG includes several agent architectures in `src/tame/agents/` :\n",
    "- `ppo3`: Three-level PPO hierarchy\n",
    "- `ppo2`: Two-level PPO hierarchy\n",
    "- `mappo_ppo`: Two-level mixed hierarchy with bottom level PPO agents and top level a MAPPO agent\n",
    "- `monolithic_ppo`: Single-level PPO\n",
    "\n",
    "Each agent has its own set of configurable parameters in its `Args` class.\n",
    "\n",
    "\n",
    "### Advanced Usage\n",
    "For more complex scenarios, consider:\n",
    "- Building a simple 2 level hierarchy ([see 03_build_a_hierarchy](tutorials/03_build_a_hierarchy.ipynb))\n",
    "- Implementing your own agents (see Tutorial 3)\n",
    "- Using learned communication between levels\n",
    "- Experimenting with different action frequencies between levels\n",
    "\n",
    "See the full documentation for detailed explanations of these advanced features."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "tame",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "name": "python",
   "version": "3.10.15"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
