{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "d231a36b",
   "metadata": {},
   "source": [
    "# Tutorial\n",
    "\n",
    "You can skip to Step 4, if you decide to use our released model and directly for inference."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "b28d750c",
   "metadata": {},
   "outputs": [],
   "source": [
    "from OrderFusion import *"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "8553be8a",
   "metadata": {},
   "source": [
    "# Step 1: define settings\n",
    "\n",
    "- define global configurations for data and model\n",
    "- read comments in the following block"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "60e7fce2",
   "metadata": {},
   "outputs": [],
   "source": [
    "save_path = train_status('local') # 'local' if run locally or 'cloud' run on Google Colab\n",
    "country = 'germany' # 'germany', 'austria' \n",
    "resolution = 'h' \n",
    "indice = 'ID3' # 'ID3', 'ID2', 'ID1'\n",
    "train_start_date = '2022-01-01' # start date for training data\n",
    "split_len = (24, 6, 6) # e.g. (24, 6, 6): first 24 months for training, next 6 months for validation, next 6 months for testing\n",
    "years = [2022, 2023, 2024] # years that you purschased the data for\n",
    "quantiles = [0.1, 0.5, 0.9] # target quantiles to forecast, could be many e.g. [0.05, 0.1, 0.5, 0.9, 0.95]\n",
    "model_shape = [16, 4, 1] # optimized hidden_dimension, max. degree of interactions, number of attention heads. Change might lead to unstable training.\n",
    "epoch, batch_size = 50, 256 # number of epochs and batch size. \n",
    "num_trade = 16 # number of trades. \n",
    "seed = 42 # random seeds for reproducibility\n",
    "show_progress_bar = False # whether to show the training progress bar\n",
    "model_mode = 'OrderFusion'\n",
    "data_config = (country, resolution, indice, save_path, split_len, train_start_date, years)\n",
    "model_config = (model_mode, model_shape, epoch, batch_size, num_trade, quantiles, seed, show_progress_bar)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "f011b082",
   "metadata": {},
   "source": [
    "# Step 2: preprocess orderbook\n",
    "- it saves scaler, separate input and label\n",
    "- only need to do it once (i.e. first time running the code)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "8f8e65e5",
   "metadata": {},
   "outputs": [],
   "source": [
    "phase = 'prepare'\n",
    "execute_main(data_config, model_config, phase)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "ccc13af9",
   "metadata": {},
   "source": [
    "# Step 3: train the model\n",
    "- train the model out of many epochs\n",
    "- the best model with lowest validation loss is saved under \"Model\" folder\n",
    "- the testing metrics are saved under \"Result\" folder"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "d210f49a",
   "metadata": {},
   "outputs": [],
   "source": [
    "phase = 'train'\n",
    "execute_main(data_config, model_config, phase)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "49cac4cd",
   "metadata": {},
   "source": [
    "# Step 4: monitor the predictions\n",
    "- load the trained model from the \"Model\" folder\n",
    "- produce forecasts versus true prices, figures saved under \"Figure\" folder\n",
    "- generate GIFs using testing data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "3e2858eb",
   "metadata": {},
   "outputs": [],
   "source": [
    "phase = 'inference'\n",
    "execute_main(data_config, model_config, phase)"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "base",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.10.15"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}
