{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "84c4bb4c",
   "metadata": {},
   "source": [
    "# Load Lib"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "id": "d19603ca",
   "metadata": {},
   "outputs": [],
   "source": [
    "import pandas as pd\n",
    "import numpy as np\n",
    "import scipy.io\n",
    "import os\n",
    "\n",
    "def load_arff(path,capitalize=False):\n",
    "    \n",
    "    data_start = \"@DATA\" if capitalize else \"@data\"\n",
    "    attr_start = \"@ATTRIBUTE\" if capitalize else \"@attribute\"\n",
    "    \n",
    "    skip_before_data = 0\n",
    "    with open(path,\"r\") as f:\n",
    "        for no, line in enumerate(f):\n",
    "            if line.startswith(data_start):\n",
    "                skip_before_data =no+1\n",
    "                while True:\n",
    "                    if next(f)==\"\\n\":\n",
    "                        skip_before_data+=1\n",
    "                    else:\n",
    "                        break\n",
    "                break\n",
    "    \n",
    "    columns = []\n",
    "    with open(path,\"r\") as f:\n",
    "        for no,line in enumerate(f):\n",
    "            if line.startswith(attr_start):\n",
    "                col = line[11:]\n",
    "                if \"\\t\" in col:\n",
    "                    col = col.split(\"\\t\")[0]\n",
    "                else:\n",
    "                    col = col.split(\" \")[0]\n",
    "                columns.append(col)\n",
    "            elif line.startswith(data_start):\n",
    "                break\n",
    "            else:\n",
    "                continue\n",
    "                \n",
    "    return pd.read_csv(path,skiprows=skip_before_data,header=None,names=columns)\n",
    "                "
   ]
  },
  {
   "cell_type": "markdown",
   "id": "feeb8a58",
   "metadata": {},
   "source": [
    "# Data Parsing\n",
    "* Datasets are prvoided in various structure. Thus we convert them into text files with their 1st row being feature names, including a target column. User should save objects(train_data, test_data or data) to csv files, without index\n",
    "    * train_data and test_data, when the test data is already defined: adlt,....\n",
    "* Assume a user download the file on the link in the paper(unzip if necessary), then place the files into a folder named as each headers.\n",
    "* \\* means that the dataset does not need parsing; already in the format we want to convert"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "ae42a6ba",
   "metadata": {},
   "source": [
    "# adlt(Adult)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "id": "23a945c3",
   "metadata": {},
   "outputs": [],
   "source": [
    "adlt_columns = ['age', 'workclass', 'fnlwgt', 'education', 'education-num',\n",
    "                 'marital-status', 'occupation', 'relationship', 'race', 'sex',\n",
    "                 'capital-gain', 'capital-loss', 'hours-per-week', 'native-country', 'class']"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "id": "e6ff3737",
   "metadata": {},
   "outputs": [],
   "source": [
    "train_data = pd.read_csv(os.path.join(\"adlt\",\"adult.data\"), header=None, names=adlt_columns)\n",
    "test_data = pd.read_csv(os.path.join(\"adlt\",\"adult.test\"), header=None, skiprows=1,  names=adlt_columns)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "8157b92a",
   "metadata": {},
   "source": [
    "# blst(Blastchar)*"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "a55492c8",
   "metadata": {},
   "source": [
    "# chrn(KDD Churn)\n",
    "* Download 2 files, feature(orange_small_train.data.zip) and label(orange_large_train_churn.labels)\n",
    "* Unzip the feature file and move the label file to the same folder\n",
    "* Basically, follows CatBoost (https://github.com/catboost/benchmarks/tree/master/quality_benchmarks/prepare_appetency_churn_upselling)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "3136f0d2",
   "metadata": {},
   "source": [
    "# Census"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "id": "df93a5dd",
   "metadata": {},
   "outputs": [],
   "source": [
    "cnss_columns = ['age', 'class of worker', 'industry code', 'occupation code', 'adjusted gross income',\n",
    "                'education', 'wage per hour', 'enrolled in edu inst last wk', 'marital status', 'major industry code',\n",
    "                'major occupation code', 'mace', 'hispanic Origin', 'sex', 'member of a labor union',\n",
    "                'reason for unemployment', 'full or part time employment stat', 'capital gains', 'capital losses', 'divdends from stocks',\n",
    "                'federal income tax liability', 'tax filer status', 'region of previous residence', 'state of previous residence', 'detailed household and family stat',\n",
    "                'detailed household summary in household', 'instance weight', 'migration code-change in msa', 'migration code-change in reg', 'migration code-move within reg',\n",
    "                'live in this house 1 year ago', 'migration prev res in sunbelt', 'num persons worked for employer', 'family members under 18', 'total person earnings',\n",
    "                'country of birth father', 'country of birth mother', 'country of birth self', 'citizenship', 'total person income',\n",
    "                'own business or self employed', 'taxable income amount', \"fill inc questionnaire for veteran's admin\", 'veterans benefits', 'weeks worked in year']"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "bc509428",
   "metadata": {},
   "outputs": [],
   "source": [
    "train_data = pd.read_csv(os.path.join(\"cnss\",\"census-income.data\"), header=None, names=cnss_columns)\n",
    "train_data = pd.read_csv(os.path.join(\"cnss\",\"census-income.test\"), header=None, names=cnss_columns)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "9be5e35b",
   "metadata": {},
   "source": [
    "# CoverType"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "dc4dfa03",
   "metadata": {},
   "outputs": [],
   "source": [
    "covr_columns = ['Elevation', 'Aspect', 'Slope', 'Horizontal_Distance_To_Hydrology', 'Vertical_Distance_To_Hydrology',\n",
    "                'Horizontal_Distance_To_Roadways', 'Hillshade_9am', 'Hillshade_Noon', 'Hillshade_3pm', 'Horizontal_Distance_To_Fire_Points',\n",
    "                'Wilderness_Area1', 'Soil_Type1', 'Soil_Type2', 'Soil_Type3', 'Soil_Type4',\n",
    "                'Soil_Type5', 'Soil_Type6', 'Soil_Type7', 'Soil_Type8', 'Soil_Type9',\n",
    "                'Soil_Type10', 'Soil_Type11', 'Soil_Type12', 'Soil_Type13', 'Soil_Type14',\n",
    "                'Soil_Type15', 'Soil_Type16', 'Soil_Type17', 'Soil_Type18', 'Soil_Type19',\n",
    "                'Soil_Type20', 'Soil_Type21', 'Soil_Type22', 'Soil_Type23', 'Soil_Type24',\n",
    "                'Soil_Type25', 'Soil_Type26', 'Soil_Type27', 'Soil_Type28', 'Soil_Type29',\n",
    "                'Soil_Type30', 'Soil_Type31', 'Soil_Type32', 'Soil_Type33', 'Soil_Type34',\n",
    "                'Soil_Type35', 'Soil_Type36', 'Soil_Type37', 'Soil_Type38', 'Soil_Type39',\n",
    "                'Soil_Type40', 'Cover_Type', 'Wilderness_Area2', 'Wilderness_Area3', 'Wilderness_Area4']"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "9059d9d8",
   "metadata": {},
   "outputs": [],
   "source": [
    "data = pd.read_csv(os.path.join(\"covr\",\"covtype.data\"),header=None,names=covr_columns)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "0836eafc",
   "metadata": {},
   "source": [
    "# Eye"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "id": "bad2e01d",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>lineNo</th>\n",
       "      <th>assgNo</th>\n",
       "      <th>fixcount</th>\n",
       "      <th>firstPassCnt</th>\n",
       "      <th>P1stFixation</th>\n",
       "      <th>P2stFixation</th>\n",
       "      <th>prevFixDur</th>\n",
       "      <th>firstfixDur</th>\n",
       "      <th>firstPassFixDur</th>\n",
       "      <th>nextFixDur</th>\n",
       "      <th>...</th>\n",
       "      <th>regressLen</th>\n",
       "      <th>nextWordRegress</th>\n",
       "      <th>regressDur</th>\n",
       "      <th>pupilDiamMax</th>\n",
       "      <th>pupilDiamLag</th>\n",
       "      <th>timePrtctg</th>\n",
       "      <th>nWordsInTitle</th>\n",
       "      <th>titleNo</th>\n",
       "      <th>wordNo</th>\n",
       "      <th>label</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>100</td>\n",
       "      <td>100</td>\n",
       "      <td>99</td>\n",
       "      <td>...</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0.0095</td>\n",
       "      <td>0.145</td>\n",
       "      <td>0.0131</td>\n",
       "      <td>7</td>\n",
       "      <td>4</td>\n",
       "      <td>3</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>99</td>\n",
       "      <td>278</td>\n",
       "      <td>278</td>\n",
       "      <td>159</td>\n",
       "      <td>...</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0.0095</td>\n",
       "      <td>0.183</td>\n",
       "      <td>0.0363</td>\n",
       "      <td>7</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>278</td>\n",
       "      <td>159</td>\n",
       "      <td>159</td>\n",
       "      <td>159</td>\n",
       "      <td>...</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0.0370</td>\n",
       "      <td>0.183</td>\n",
       "      <td>0.0208</td>\n",
       "      <td>7</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>159</td>\n",
       "      <td>159</td>\n",
       "      <td>159</td>\n",
       "      <td>139</td>\n",
       "      <td>...</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0.0370</td>\n",
       "      <td>0.183</td>\n",
       "      <td>0.0208</td>\n",
       "      <td>7</td>\n",
       "      <td>1</td>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>159</td>\n",
       "      <td>139</td>\n",
       "      <td>139</td>\n",
       "      <td>239</td>\n",
       "      <td>...</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0.0390</td>\n",
       "      <td>0.183</td>\n",
       "      <td>0.0182</td>\n",
       "      <td>7</td>\n",
       "      <td>1</td>\n",
       "      <td>6</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10931</th>\n",
       "      <td>10932</td>\n",
       "      <td>336</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>199</td>\n",
       "      <td>139</td>\n",
       "      <td>139</td>\n",
       "      <td>219</td>\n",
       "      <td>...</td>\n",
       "      <td>0</td>\n",
       "      <td>1</td>\n",
       "      <td>99</td>\n",
       "      <td>0.4730</td>\n",
       "      <td>0.069</td>\n",
       "      <td>0.0119</td>\n",
       "      <td>7</td>\n",
       "      <td>10</td>\n",
       "      <td>6</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10932</th>\n",
       "      <td>10933</td>\n",
       "      <td>336</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>139</td>\n",
       "      <td>219</td>\n",
       "      <td>219</td>\n",
       "      <td>99</td>\n",
       "      <td>...</td>\n",
       "      <td>914</td>\n",
       "      <td>0</td>\n",
       "      <td>358</td>\n",
       "      <td>0.4730</td>\n",
       "      <td>0.069</td>\n",
       "      <td>0.0187</td>\n",
       "      <td>7</td>\n",
       "      <td>10</td>\n",
       "      <td>7</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10933</th>\n",
       "      <td>10934</td>\n",
       "      <td>336</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>199</td>\n",
       "      <td>139</td>\n",
       "      <td>139</td>\n",
       "      <td>219</td>\n",
       "      <td>...</td>\n",
       "      <td>0</td>\n",
       "      <td>1</td>\n",
       "      <td>99</td>\n",
       "      <td>0.4730</td>\n",
       "      <td>0.473</td>\n",
       "      <td>0.0084</td>\n",
       "      <td>7</td>\n",
       "      <td>10</td>\n",
       "      <td>6</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10934</th>\n",
       "      <td>10935</td>\n",
       "      <td>336</td>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>139</td>\n",
       "      <td>219</td>\n",
       "      <td>219</td>\n",
       "      <td>99</td>\n",
       "      <td>...</td>\n",
       "      <td>914</td>\n",
       "      <td>0</td>\n",
       "      <td>358</td>\n",
       "      <td>0.2150</td>\n",
       "      <td>0.215</td>\n",
       "      <td>0.0305</td>\n",
       "      <td>7</td>\n",
       "      <td>10</td>\n",
       "      <td>7</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10935</th>\n",
       "      <td>10936</td>\n",
       "      <td>336</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>80</td>\n",
       "      <td>219</td>\n",
       "      <td>219</td>\n",
       "      <td>139</td>\n",
       "      <td>...</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>457</td>\n",
       "      <td>0.0690</td>\n",
       "      <td>0.069</td>\n",
       "      <td>0.0390</td>\n",
       "      <td>7</td>\n",
       "      <td>10</td>\n",
       "      <td>1</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>10936 rows × 28 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "       lineNo  assgNo  fixcount  firstPassCnt  P1stFixation  P2stFixation  \\\n",
       "0           1       1         1             1             1             0   \n",
       "1           2       1         1             1             1             0   \n",
       "2           3       1         1             1             1             0   \n",
       "3           4       1         1             1             1             0   \n",
       "4           5       1         1             1             1             0   \n",
       "...       ...     ...       ...           ...           ...           ...   \n",
       "10931   10932     336         1             1             1             0   \n",
       "10932   10933     336         1             1             1             0   \n",
       "10933   10934     336         1             1             1             0   \n",
       "10934   10935     336         2             1             1             0   \n",
       "10935   10936     336         1             1             1             1   \n",
       "\n",
       "       prevFixDur  firstfixDur  firstPassFixDur  nextFixDur  ...  regressLen  \\\n",
       "0               0          100              100          99  ...           0   \n",
       "1              99          278              278         159  ...           0   \n",
       "2             278          159              159         159  ...           0   \n",
       "3             159          159              159         139  ...           0   \n",
       "4             159          139              139         239  ...           0   \n",
       "...           ...          ...              ...         ...  ...         ...   \n",
       "10931         199          139              139         219  ...           0   \n",
       "10932         139          219              219          99  ...         914   \n",
       "10933         199          139              139         219  ...           0   \n",
       "10934         139          219              219          99  ...         914   \n",
       "10935          80          219              219         139  ...           0   \n",
       "\n",
       "       nextWordRegress  regressDur  pupilDiamMax  pupilDiamLag  timePrtctg  \\\n",
       "0                    0           0        0.0095         0.145      0.0131   \n",
       "1                    0           0        0.0095         0.183      0.0363   \n",
       "2                    0           0        0.0370         0.183      0.0208   \n",
       "3                    0           0        0.0370         0.183      0.0208   \n",
       "4                    0           0        0.0390         0.183      0.0182   \n",
       "...                ...         ...           ...           ...         ...   \n",
       "10931                1          99        0.4730         0.069      0.0119   \n",
       "10932                0         358        0.4730         0.069      0.0187   \n",
       "10933                1          99        0.4730         0.473      0.0084   \n",
       "10934                0         358        0.2150         0.215      0.0305   \n",
       "10935                0         457        0.0690         0.069      0.0390   \n",
       "\n",
       "       nWordsInTitle  titleNo  wordNo  label  \n",
       "0                  7        4       3      0  \n",
       "1                  7        1       1      0  \n",
       "2                  7        1       3      0  \n",
       "3                  7        1       5      0  \n",
       "4                  7        1       6      0  \n",
       "...              ...      ...     ...    ...  \n",
       "10931              7       10       6      2  \n",
       "10932              7       10       7      2  \n",
       "10933              7       10       6      2  \n",
       "10934              7       10       7      2  \n",
       "10935              7       10       1      2  \n",
       "\n",
       "[10936 rows x 28 columns]"
      ]
     },
     "execution_count": 2,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data=load_arff(\"eyem/eye_movements.arff\")\n",
    "data"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "e41fb37a",
   "metadata": {},
   "source": [
    "# Gas"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "id": "ad70493d",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>V1</th>\n",
       "      <th>V2</th>\n",
       "      <th>V3</th>\n",
       "      <th>V4</th>\n",
       "      <th>V5</th>\n",
       "      <th>V6</th>\n",
       "      <th>V7</th>\n",
       "      <th>V8</th>\n",
       "      <th>V9</th>\n",
       "      <th>V10</th>\n",
       "      <th>...</th>\n",
       "      <th>V121</th>\n",
       "      <th>V122</th>\n",
       "      <th>V123</th>\n",
       "      <th>V124</th>\n",
       "      <th>V125</th>\n",
       "      <th>V126</th>\n",
       "      <th>V127</th>\n",
       "      <th>V128</th>\n",
       "      <th>V129</th>\n",
       "      <th>Class</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>12285.6582</td>\n",
       "      <td>4.076635</td>\n",
       "      <td>4.842317</td>\n",
       "      <td>7.509393</td>\n",
       "      <td>10.822436</td>\n",
       "      <td>-1.312657</td>\n",
       "      <td>-1.853717</td>\n",
       "      <td>-6.924985</td>\n",
       "      <td>11800.9233</td>\n",
       "      <td>4.483500</td>\n",
       "      <td>...</td>\n",
       "      <td>1784.5324</td>\n",
       "      <td>1.907000</td>\n",
       "      <td>1.729200</td>\n",
       "      <td>4.881194</td>\n",
       "      <td>8.623828</td>\n",
       "      <td>-0.314110</td>\n",
       "      <td>-0.661556</td>\n",
       "      <td>-3.521663</td>\n",
       "      <td>10.0</td>\n",
       "      <td>4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>-35.6889</td>\n",
       "      <td>0.993944</td>\n",
       "      <td>0.166099</td>\n",
       "      <td>0.489363</td>\n",
       "      <td>3.484663</td>\n",
       "      <td>-0.130298</td>\n",
       "      <td>-0.528364</td>\n",
       "      <td>-3.735347</td>\n",
       "      <td>266.4145</td>\n",
       "      <td>1.053988</td>\n",
       "      <td>...</td>\n",
       "      <td>904.9898</td>\n",
       "      <td>1.433707</td>\n",
       "      <td>1.068069</td>\n",
       "      <td>2.532958</td>\n",
       "      <td>5.369720</td>\n",
       "      <td>-0.183779</td>\n",
       "      <td>-0.534087</td>\n",
       "      <td>-4.635975</td>\n",
       "      <td>50.0</td>\n",
       "      <td>3</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>63927.2217</td>\n",
       "      <td>14.956941</td>\n",
       "      <td>19.971376</td>\n",
       "      <td>29.188512</td>\n",
       "      <td>33.291320</td>\n",
       "      <td>-10.433776</td>\n",
       "      <td>-16.062245</td>\n",
       "      <td>-49.490143</td>\n",
       "      <td>57405.8483</td>\n",
       "      <td>15.613843</td>\n",
       "      <td>...</td>\n",
       "      <td>14585.7879</td>\n",
       "      <td>8.189021</td>\n",
       "      <td>6.099452</td>\n",
       "      <td>12.127991</td>\n",
       "      <td>15.709651</td>\n",
       "      <td>-3.887082</td>\n",
       "      <td>-6.731473</td>\n",
       "      <td>-19.326895</td>\n",
       "      <td>250.0</td>\n",
       "      <td>4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2992.9019</td>\n",
       "      <td>1.380553</td>\n",
       "      <td>0.808910</td>\n",
       "      <td>1.288259</td>\n",
       "      <td>4.660135</td>\n",
       "      <td>-0.755903</td>\n",
       "      <td>-1.120470</td>\n",
       "      <td>-4.075213</td>\n",
       "      <td>4301.4033</td>\n",
       "      <td>1.652701</td>\n",
       "      <td>...</td>\n",
       "      <td>6044.5554</td>\n",
       "      <td>3.488295</td>\n",
       "      <td>2.662288</td>\n",
       "      <td>5.938297</td>\n",
       "      <td>8.544508</td>\n",
       "      <td>-1.567322</td>\n",
       "      <td>-2.701235</td>\n",
       "      <td>-6.472439</td>\n",
       "      <td>600.0</td>\n",
       "      <td>3</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>57524.7812</td>\n",
       "      <td>11.912566</td>\n",
       "      <td>14.631496</td>\n",
       "      <td>19.809240</td>\n",
       "      <td>23.715868</td>\n",
       "      <td>-9.084750</td>\n",
       "      <td>-11.770585</td>\n",
       "      <td>-39.234003</td>\n",
       "      <td>50051.0703</td>\n",
       "      <td>11.732548</td>\n",
       "      <td>...</td>\n",
       "      <td>10580.1006</td>\n",
       "      <td>5.752675</td>\n",
       "      <td>3.880740</td>\n",
       "      <td>8.545897</td>\n",
       "      <td>11.831716</td>\n",
       "      <td>-2.655521</td>\n",
       "      <td>-4.312744</td>\n",
       "      <td>-8.510591</td>\n",
       "      <td>150.0</td>\n",
       "      <td>4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>13905</th>\n",
       "      <td>13384.8262</td>\n",
       "      <td>2.820931</td>\n",
       "      <td>4.007378</td>\n",
       "      <td>6.618008</td>\n",
       "      <td>11.386095</td>\n",
       "      <td>-2.142994</td>\n",
       "      <td>-3.110327</td>\n",
       "      <td>-11.296786</td>\n",
       "      <td>19013.4575</td>\n",
       "      <td>4.050907</td>\n",
       "      <td>...</td>\n",
       "      <td>7548.3501</td>\n",
       "      <td>2.377836</td>\n",
       "      <td>3.400734</td>\n",
       "      <td>8.571174</td>\n",
       "      <td>11.555882</td>\n",
       "      <td>-1.617656</td>\n",
       "      <td>-2.457614</td>\n",
       "      <td>-6.226359</td>\n",
       "      <td>10.0</td>\n",
       "      <td>6</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>13906</th>\n",
       "      <td>13382.9619</td>\n",
       "      <td>2.825174</td>\n",
       "      <td>4.010915</td>\n",
       "      <td>6.483989</td>\n",
       "      <td>10.356127</td>\n",
       "      <td>-2.156512</td>\n",
       "      <td>-3.179563</td>\n",
       "      <td>-10.184803</td>\n",
       "      <td>19034.2495</td>\n",
       "      <td>4.066463</td>\n",
       "      <td>...</td>\n",
       "      <td>7510.4946</td>\n",
       "      <td>2.364505</td>\n",
       "      <td>3.401381</td>\n",
       "      <td>8.512949</td>\n",
       "      <td>12.149638</td>\n",
       "      <td>-1.613554</td>\n",
       "      <td>-2.493870</td>\n",
       "      <td>-6.859804</td>\n",
       "      <td>10.0</td>\n",
       "      <td>6</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>13907</th>\n",
       "      <td>13336.8725</td>\n",
       "      <td>2.822288</td>\n",
       "      <td>3.980818</td>\n",
       "      <td>6.487103</td>\n",
       "      <td>10.936979</td>\n",
       "      <td>-2.146688</td>\n",
       "      <td>-3.273109</td>\n",
       "      <td>-11.067489</td>\n",
       "      <td>18997.7222</td>\n",
       "      <td>4.055524</td>\n",
       "      <td>...</td>\n",
       "      <td>7530.0010</td>\n",
       "      <td>2.369898</td>\n",
       "      <td>3.400592</td>\n",
       "      <td>8.494436</td>\n",
       "      <td>11.839013</td>\n",
       "      <td>-1.612525</td>\n",
       "      <td>-2.504918</td>\n",
       "      <td>-6.263872</td>\n",
       "      <td>10.0</td>\n",
       "      <td>6</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>13908</th>\n",
       "      <td>13351.1318</td>\n",
       "      <td>2.824358</td>\n",
       "      <td>3.987819</td>\n",
       "      <td>6.554427</td>\n",
       "      <td>11.331002</td>\n",
       "      <td>-2.143651</td>\n",
       "      <td>-3.257854</td>\n",
       "      <td>-11.795109</td>\n",
       "      <td>19035.9926</td>\n",
       "      <td>4.071607</td>\n",
       "      <td>...</td>\n",
       "      <td>7599.0201</td>\n",
       "      <td>2.391834</td>\n",
       "      <td>3.358804</td>\n",
       "      <td>8.457260</td>\n",
       "      <td>11.297346</td>\n",
       "      <td>-1.606879</td>\n",
       "      <td>-2.438701</td>\n",
       "      <td>-6.044784</td>\n",
       "      <td>10.0</td>\n",
       "      <td>6</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>13909</th>\n",
       "      <td>13314.9336</td>\n",
       "      <td>2.816502</td>\n",
       "      <td>3.982182</td>\n",
       "      <td>6.620679</td>\n",
       "      <td>11.848481</td>\n",
       "      <td>-2.138961</td>\n",
       "      <td>-3.259621</td>\n",
       "      <td>-11.065998</td>\n",
       "      <td>18973.1426</td>\n",
       "      <td>4.048421</td>\n",
       "      <td>...</td>\n",
       "      <td>7485.3593</td>\n",
       "      <td>2.358200</td>\n",
       "      <td>3.397591</td>\n",
       "      <td>8.570913</td>\n",
       "      <td>13.110304</td>\n",
       "      <td>-1.630569</td>\n",
       "      <td>-2.463957</td>\n",
       "      <td>-5.726544</td>\n",
       "      <td>10.0</td>\n",
       "      <td>6</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>13910 rows × 130 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "               V1         V2         V3         V4         V5         V6  \\\n",
       "0      12285.6582   4.076635   4.842317   7.509393  10.822436  -1.312657   \n",
       "1        -35.6889   0.993944   0.166099   0.489363   3.484663  -0.130298   \n",
       "2      63927.2217  14.956941  19.971376  29.188512  33.291320 -10.433776   \n",
       "3       2992.9019   1.380553   0.808910   1.288259   4.660135  -0.755903   \n",
       "4      57524.7812  11.912566  14.631496  19.809240  23.715868  -9.084750   \n",
       "...           ...        ...        ...        ...        ...        ...   \n",
       "13905  13384.8262   2.820931   4.007378   6.618008  11.386095  -2.142994   \n",
       "13906  13382.9619   2.825174   4.010915   6.483989  10.356127  -2.156512   \n",
       "13907  13336.8725   2.822288   3.980818   6.487103  10.936979  -2.146688   \n",
       "13908  13351.1318   2.824358   3.987819   6.554427  11.331002  -2.143651   \n",
       "13909  13314.9336   2.816502   3.982182   6.620679  11.848481  -2.138961   \n",
       "\n",
       "              V7         V8          V9        V10  ...        V121      V122  \\\n",
       "0      -1.853717  -6.924985  11800.9233   4.483500  ...   1784.5324  1.907000   \n",
       "1      -0.528364  -3.735347    266.4145   1.053988  ...    904.9898  1.433707   \n",
       "2     -16.062245 -49.490143  57405.8483  15.613843  ...  14585.7879  8.189021   \n",
       "3      -1.120470  -4.075213   4301.4033   1.652701  ...   6044.5554  3.488295   \n",
       "4     -11.770585 -39.234003  50051.0703  11.732548  ...  10580.1006  5.752675   \n",
       "...          ...        ...         ...        ...  ...         ...       ...   \n",
       "13905  -3.110327 -11.296786  19013.4575   4.050907  ...   7548.3501  2.377836   \n",
       "13906  -3.179563 -10.184803  19034.2495   4.066463  ...   7510.4946  2.364505   \n",
       "13907  -3.273109 -11.067489  18997.7222   4.055524  ...   7530.0010  2.369898   \n",
       "13908  -3.257854 -11.795109  19035.9926   4.071607  ...   7599.0201  2.391834   \n",
       "13909  -3.259621 -11.065998  18973.1426   4.048421  ...   7485.3593  2.358200   \n",
       "\n",
       "           V123       V124       V125      V126      V127       V128   V129  \\\n",
       "0      1.729200   4.881194   8.623828 -0.314110 -0.661556  -3.521663   10.0   \n",
       "1      1.068069   2.532958   5.369720 -0.183779 -0.534087  -4.635975   50.0   \n",
       "2      6.099452  12.127991  15.709651 -3.887082 -6.731473 -19.326895  250.0   \n",
       "3      2.662288   5.938297   8.544508 -1.567322 -2.701235  -6.472439  600.0   \n",
       "4      3.880740   8.545897  11.831716 -2.655521 -4.312744  -8.510591  150.0   \n",
       "...         ...        ...        ...       ...       ...        ...    ...   \n",
       "13905  3.400734   8.571174  11.555882 -1.617656 -2.457614  -6.226359   10.0   \n",
       "13906  3.401381   8.512949  12.149638 -1.613554 -2.493870  -6.859804   10.0   \n",
       "13907  3.400592   8.494436  11.839013 -1.612525 -2.504918  -6.263872   10.0   \n",
       "13908  3.358804   8.457260  11.297346 -1.606879 -2.438701  -6.044784   10.0   \n",
       "13909  3.397591   8.570913  13.110304 -1.630569 -2.463957  -5.726544   10.0   \n",
       "\n",
       "       Class  \n",
       "0          4  \n",
       "1          3  \n",
       "2          4  \n",
       "3          3  \n",
       "4          4  \n",
       "...      ...  \n",
       "13905      6  \n",
       "13906      6  \n",
       "13907      6  \n",
       "13908      6  \n",
       "13909      6  \n",
       "\n",
       "[13910 rows x 130 columns]"
      ]
     },
     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data=load_arff(\"gddc/phpN4gaxw.arff\")\n",
    "data"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "352d20d5",
   "metadata": {},
   "source": [
    "# Gesture"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "id": "76ed8ad7",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X1</th>\n",
       "      <th>X2</th>\n",
       "      <th>X3</th>\n",
       "      <th>X4</th>\n",
       "      <th>X5</th>\n",
       "      <th>X6</th>\n",
       "      <th>X7</th>\n",
       "      <th>X8</th>\n",
       "      <th>X9</th>\n",
       "      <th>X10</th>\n",
       "      <th>...</th>\n",
       "      <th>X24</th>\n",
       "      <th>X25</th>\n",
       "      <th>X26</th>\n",
       "      <th>X27</th>\n",
       "      <th>X28</th>\n",
       "      <th>X29</th>\n",
       "      <th>X30</th>\n",
       "      <th>X31</th>\n",
       "      <th>X32</th>\n",
       "      <th>Phase</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>-0.005009</td>\n",
       "      <td>-0.000964</td>\n",
       "      <td>0.000573</td>\n",
       "      <td>0.008623</td>\n",
       "      <td>0.005667</td>\n",
       "      <td>0.001302</td>\n",
       "      <td>-0.000631</td>\n",
       "      <td>0.000130</td>\n",
       "      <td>-0.000048</td>\n",
       "      <td>0.007762</td>\n",
       "      <td>...</td>\n",
       "      <td>1.880800e-04</td>\n",
       "      <td>0.005133</td>\n",
       "      <td>0.010400</td>\n",
       "      <td>0.000646</td>\n",
       "      <td>0.007871</td>\n",
       "      <td>0.004631</td>\n",
       "      <td>0.000963</td>\n",
       "      <td>0.000092</td>\n",
       "      <td>0.000438</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>0.004905</td>\n",
       "      <td>0.001209</td>\n",
       "      <td>-0.000649</td>\n",
       "      <td>0.004737</td>\n",
       "      <td>0.003166</td>\n",
       "      <td>0.000819</td>\n",
       "      <td>-0.000572</td>\n",
       "      <td>-0.000015</td>\n",
       "      <td>0.000023</td>\n",
       "      <td>0.002706</td>\n",
       "      <td>...</td>\n",
       "      <td>-7.500000e-07</td>\n",
       "      <td>0.005093</td>\n",
       "      <td>0.005756</td>\n",
       "      <td>0.000573</td>\n",
       "      <td>0.003459</td>\n",
       "      <td>0.000730</td>\n",
       "      <td>0.000332</td>\n",
       "      <td>0.000012</td>\n",
       "      <td>0.000433</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>-0.002393</td>\n",
       "      <td>-0.000216</td>\n",
       "      <td>0.000136</td>\n",
       "      <td>0.003028</td>\n",
       "      <td>0.001212</td>\n",
       "      <td>0.000336</td>\n",
       "      <td>-0.000449</td>\n",
       "      <td>0.000017</td>\n",
       "      <td>0.000047</td>\n",
       "      <td>0.002868</td>\n",
       "      <td>...</td>\n",
       "      <td>-3.920000e-05</td>\n",
       "      <td>0.002406</td>\n",
       "      <td>0.003279</td>\n",
       "      <td>0.000452</td>\n",
       "      <td>0.003261</td>\n",
       "      <td>0.002412</td>\n",
       "      <td>0.000852</td>\n",
       "      <td>0.000042</td>\n",
       "      <td>0.000202</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>-0.001394</td>\n",
       "      <td>-0.000242</td>\n",
       "      <td>0.000056</td>\n",
       "      <td>0.001182</td>\n",
       "      <td>0.000575</td>\n",
       "      <td>0.000225</td>\n",
       "      <td>-0.000479</td>\n",
       "      <td>-0.000050</td>\n",
       "      <td>0.000104</td>\n",
       "      <td>0.001171</td>\n",
       "      <td>...</td>\n",
       "      <td>-3.184000e-05</td>\n",
       "      <td>0.001416</td>\n",
       "      <td>0.001334</td>\n",
       "      <td>0.000493</td>\n",
       "      <td>0.001358</td>\n",
       "      <td>0.000313</td>\n",
       "      <td>0.000611</td>\n",
       "      <td>0.000029</td>\n",
       "      <td>0.000596</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>-0.000156</td>\n",
       "      <td>-0.000004</td>\n",
       "      <td>0.000023</td>\n",
       "      <td>0.001585</td>\n",
       "      <td>0.000630</td>\n",
       "      <td>0.000094</td>\n",
       "      <td>-0.000303</td>\n",
       "      <td>0.000097</td>\n",
       "      <td>0.000065</td>\n",
       "      <td>0.001579</td>\n",
       "      <td>...</td>\n",
       "      <td>-2.015000e-05</td>\n",
       "      <td>0.000158</td>\n",
       "      <td>0.001709</td>\n",
       "      <td>0.000325</td>\n",
       "      <td>0.001713</td>\n",
       "      <td>0.000203</td>\n",
       "      <td>0.000069</td>\n",
       "      <td>0.000038</td>\n",
       "      <td>0.000069</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9868</th>\n",
       "      <td>-0.003709</td>\n",
       "      <td>-0.006168</td>\n",
       "      <td>0.000786</td>\n",
       "      <td>-0.000155</td>\n",
       "      <td>0.001088</td>\n",
       "      <td>-0.000144</td>\n",
       "      <td>-0.003815</td>\n",
       "      <td>-0.004658</td>\n",
       "      <td>0.000656</td>\n",
       "      <td>0.000060</td>\n",
       "      <td>...</td>\n",
       "      <td>-9.400000e-07</td>\n",
       "      <td>0.007240</td>\n",
       "      <td>0.001109</td>\n",
       "      <td>0.006056</td>\n",
       "      <td>0.000173</td>\n",
       "      <td>0.001976</td>\n",
       "      <td>0.000137</td>\n",
       "      <td>0.001575</td>\n",
       "      <td>0.000051</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9869</th>\n",
       "      <td>-0.000727</td>\n",
       "      <td>0.001536</td>\n",
       "      <td>-0.000211</td>\n",
       "      <td>0.000700</td>\n",
       "      <td>-0.000975</td>\n",
       "      <td>0.000067</td>\n",
       "      <td>-0.001147</td>\n",
       "      <td>0.000177</td>\n",
       "      <td>0.000008</td>\n",
       "      <td>0.000423</td>\n",
       "      <td>...</td>\n",
       "      <td>1.234000e-05</td>\n",
       "      <td>0.001712</td>\n",
       "      <td>0.001202</td>\n",
       "      <td>0.001160</td>\n",
       "      <td>0.000515</td>\n",
       "      <td>0.002600</td>\n",
       "      <td>0.000701</td>\n",
       "      <td>0.001738</td>\n",
       "      <td>0.000124</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9870</th>\n",
       "      <td>0.003074</td>\n",
       "      <td>0.007870</td>\n",
       "      <td>-0.000962</td>\n",
       "      <td>0.000526</td>\n",
       "      <td>-0.000779</td>\n",
       "      <td>0.000090</td>\n",
       "      <td>0.002786</td>\n",
       "      <td>0.005035</td>\n",
       "      <td>-0.000606</td>\n",
       "      <td>0.000413</td>\n",
       "      <td>...</td>\n",
       "      <td>2.327000e-05</td>\n",
       "      <td>0.008504</td>\n",
       "      <td>0.000944</td>\n",
       "      <td>0.005786</td>\n",
       "      <td>0.000431</td>\n",
       "      <td>0.002007</td>\n",
       "      <td>0.000071</td>\n",
       "      <td>0.001697</td>\n",
       "      <td>0.000112</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9871</th>\n",
       "      <td>0.003297</td>\n",
       "      <td>0.008467</td>\n",
       "      <td>-0.001035</td>\n",
       "      <td>0.000578</td>\n",
       "      <td>-0.000740</td>\n",
       "      <td>0.000101</td>\n",
       "      <td>0.002947</td>\n",
       "      <td>0.005385</td>\n",
       "      <td>-0.000652</td>\n",
       "      <td>0.000580</td>\n",
       "      <td>...</td>\n",
       "      <td>1.541000e-05</td>\n",
       "      <td>0.009145</td>\n",
       "      <td>0.000944</td>\n",
       "      <td>0.006173</td>\n",
       "      <td>0.000699</td>\n",
       "      <td>0.000229</td>\n",
       "      <td>0.000023</td>\n",
       "      <td>0.000139</td>\n",
       "      <td>0.000113</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9872</th>\n",
       "      <td>0.000204</td>\n",
       "      <td>-0.000040</td>\n",
       "      <td>0.000058</td>\n",
       "      <td>0.000586</td>\n",
       "      <td>-0.000619</td>\n",
       "      <td>0.000087</td>\n",
       "      <td>0.000229</td>\n",
       "      <td>0.000003</td>\n",
       "      <td>0.000061</td>\n",
       "      <td>0.000644</td>\n",
       "      <td>...</td>\n",
       "      <td>3.200000e-07</td>\n",
       "      <td>0.000216</td>\n",
       "      <td>0.000857</td>\n",
       "      <td>0.000237</td>\n",
       "      <td>0.000808</td>\n",
       "      <td>0.002763</td>\n",
       "      <td>0.000037</td>\n",
       "      <td>0.001840</td>\n",
       "      <td>0.000036</td>\n",
       "      <td>'D'</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>9873 rows × 33 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "            X1        X2        X3        X4        X5        X6        X7  \\\n",
       "0    -0.005009 -0.000964  0.000573  0.008623  0.005667  0.001302 -0.000631   \n",
       "1     0.004905  0.001209 -0.000649  0.004737  0.003166  0.000819 -0.000572   \n",
       "2    -0.002393 -0.000216  0.000136  0.003028  0.001212  0.000336 -0.000449   \n",
       "3    -0.001394 -0.000242  0.000056  0.001182  0.000575  0.000225 -0.000479   \n",
       "4    -0.000156 -0.000004  0.000023  0.001585  0.000630  0.000094 -0.000303   \n",
       "...        ...       ...       ...       ...       ...       ...       ...   \n",
       "9868 -0.003709 -0.006168  0.000786 -0.000155  0.001088 -0.000144 -0.003815   \n",
       "9869 -0.000727  0.001536 -0.000211  0.000700 -0.000975  0.000067 -0.001147   \n",
       "9870  0.003074  0.007870 -0.000962  0.000526 -0.000779  0.000090  0.002786   \n",
       "9871  0.003297  0.008467 -0.001035  0.000578 -0.000740  0.000101  0.002947   \n",
       "9872  0.000204 -0.000040  0.000058  0.000586 -0.000619  0.000087  0.000229   \n",
       "\n",
       "            X8        X9       X10  ...           X24       X25       X26  \\\n",
       "0     0.000130 -0.000048  0.007762  ...  1.880800e-04  0.005133  0.010400   \n",
       "1    -0.000015  0.000023  0.002706  ... -7.500000e-07  0.005093  0.005756   \n",
       "2     0.000017  0.000047  0.002868  ... -3.920000e-05  0.002406  0.003279   \n",
       "3    -0.000050  0.000104  0.001171  ... -3.184000e-05  0.001416  0.001334   \n",
       "4     0.000097  0.000065  0.001579  ... -2.015000e-05  0.000158  0.001709   \n",
       "...        ...       ...       ...  ...           ...       ...       ...   \n",
       "9868 -0.004658  0.000656  0.000060  ... -9.400000e-07  0.007240  0.001109   \n",
       "9869  0.000177  0.000008  0.000423  ...  1.234000e-05  0.001712  0.001202   \n",
       "9870  0.005035 -0.000606  0.000413  ...  2.327000e-05  0.008504  0.000944   \n",
       "9871  0.005385 -0.000652  0.000580  ...  1.541000e-05  0.009145  0.000944   \n",
       "9872  0.000003  0.000061  0.000644  ...  3.200000e-07  0.000216  0.000857   \n",
       "\n",
       "           X27       X28       X29       X30       X31       X32  Phase  \n",
       "0     0.000646  0.007871  0.004631  0.000963  0.000092  0.000438    'D'  \n",
       "1     0.000573  0.003459  0.000730  0.000332  0.000012  0.000433    'D'  \n",
       "2     0.000452  0.003261  0.002412  0.000852  0.000042  0.000202    'D'  \n",
       "3     0.000493  0.001358  0.000313  0.000611  0.000029  0.000596    'D'  \n",
       "4     0.000325  0.001713  0.000203  0.000069  0.000038  0.000069    'D'  \n",
       "...        ...       ...       ...       ...       ...       ...    ...  \n",
       "9868  0.006056  0.000173  0.001976  0.000137  0.001575  0.000051    'D'  \n",
       "9869  0.001160  0.000515  0.002600  0.000701  0.001738  0.000124    'D'  \n",
       "9870  0.005786  0.000431  0.002007  0.000071  0.001697  0.000112    'D'  \n",
       "9871  0.006173  0.000699  0.000229  0.000023  0.000139  0.000113    'D'  \n",
       "9872  0.000237  0.000808  0.002763  0.000037  0.001840  0.000036    'D'  \n",
       "\n",
       "[9873 rows x 33 columns]"
      ]
     },
     "execution_count": 4,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data=load_arff(os.path.join(\"gpsp\",\"phpYLeydd.arff\"))\n",
    "data"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "a378c9dc",
   "metadata": {},
   "source": [
    "# Shrutime*"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "e4c583f0",
   "metadata": {},
   "source": [
    "# Rossmann\n",
    "* Does same as the caboost. The link is in the Appendix E"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "3501ce1f",
   "metadata": {},
   "source": [
    "# Sarcos"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "b998e181",
   "metadata": {},
   "outputs": [],
   "source": [
    "srcs_columns =[f\"positions{no}\" for no in range(1,8)]+\\\n",
    "                [f\"velocities{no}\" for no in range(1,8)]+\\\n",
    "                [f\"accelerations{no}\" for no in range(1,8)]+\\\n",
    "                [f\"torques{no}\" for no in range(1,8)]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "4c989ca3",
   "metadata": {},
   "outputs": [],
   "source": [
    "train_data = pd.DataFrame(scipy.io.loadmat(os.path.join(\"srcs\",\"sarcos_inv\"))[\"sarcos_inv\"],columns=srcs_columns)\n",
    "test_data = pd.DataFrame(scipy.io.loadmat(os.path.join(\"srcs\",\"sarcos_inv_test\"))[\"sarcos_inv_test\"],columns=srcs_columns)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "38db987a",
   "metadata": {},
   "source": [
    "# Year"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "241d06e5",
   "metadata": {},
   "outputs": [],
   "source": [
    "data = pd.read_csv(os.path.join(\"year\",\"YearPredictionMSD.txt\"),header=None,\n",
    "                   names=[\"Y\"]+[f\"timbreAvg{no}\" for no in range(1,13)]+[f\"timbreCov{no}\" for no in range(1,79)])"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "600c8d8d",
   "metadata": {},
   "source": [
    "# California*"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "557645f5",
   "metadata": {},
   "source": [
    "# Heloc*"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "06b93c47",
   "metadata": {},
   "source": [
    "# Higgs"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "id": "a44d5012",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>lepton_pT</th>\n",
       "      <th>lepton_eta</th>\n",
       "      <th>lepton_phi</th>\n",
       "      <th>missing_energy_magnitude</th>\n",
       "      <th>missing_energy_phi</th>\n",
       "      <th>jet_1_pt</th>\n",
       "      <th>jet_1_eta</th>\n",
       "      <th>jet_1_phi</th>\n",
       "      <th>jet_2_pt</th>\n",
       "      <th>jet_2_eta</th>\n",
       "      <th>...</th>\n",
       "      <th>jet_4_eta</th>\n",
       "      <th>jet_4_phi</th>\n",
       "      <th>m_jj</th>\n",
       "      <th>m_jjj</th>\n",
       "      <th>m_lv</th>\n",
       "      <th>m_jlv</th>\n",
       "      <th>m_bb</th>\n",
       "      <th>m_wbb</th>\n",
       "      <th>m_wwbb</th>\n",
       "      <th>target</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>0.730938</td>\n",
       "      <td>-0.862016</td>\n",
       "      <td>1.304896</td>\n",
       "      <td>1.203239</td>\n",
       "      <td>-0.937731</td>\n",
       "      <td>1.263905</td>\n",
       "      <td>-0.407010</td>\n",
       "      <td>-0.044292</td>\n",
       "      <td>0.738928</td>\n",
       "      <td>-0.392375</td>\n",
       "      <td>...</td>\n",
       "      <td>0.674959</td>\n",
       "      <td>-1.543477</td>\n",
       "      <td>1.502878</td>\n",
       "      <td>1.119366</td>\n",
       "      <td>1.170362</td>\n",
       "      <td>0.706155</td>\n",
       "      <td>0.537799</td>\n",
       "      <td>0.839494</td>\n",
       "      <td>0.807032</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>0.824456</td>\n",
       "      <td>-0.017586</td>\n",
       "      <td>-1.094985</td>\n",
       "      <td>0.338968</td>\n",
       "      <td>-1.236657</td>\n",
       "      <td>1.254561</td>\n",
       "      <td>-0.678333</td>\n",
       "      <td>0.087198</td>\n",
       "      <td>1.610054</td>\n",
       "      <td>-0.788695</td>\n",
       "      <td>...</td>\n",
       "      <td>1.025577</td>\n",
       "      <td>-0.642300</td>\n",
       "      <td>2.209537</td>\n",
       "      <td>1.271808</td>\n",
       "      <td>0.993527</td>\n",
       "      <td>1.455711</td>\n",
       "      <td>1.157583</td>\n",
       "      <td>1.172590</td>\n",
       "      <td>1.101483</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>0.592217</td>\n",
       "      <td>-0.614628</td>\n",
       "      <td>0.600777</td>\n",
       "      <td>1.598253</td>\n",
       "      <td>0.388530</td>\n",
       "      <td>0.829687</td>\n",
       "      <td>1.490271</td>\n",
       "      <td>-1.401960</td>\n",
       "      <td>0.669303</td>\n",
       "      <td>0.395409</td>\n",
       "      <td>...</td>\n",
       "      <td>1.175486</td>\n",
       "      <td>-0.388148</td>\n",
       "      <td>0.736590</td>\n",
       "      <td>0.889902</td>\n",
       "      <td>0.988300</td>\n",
       "      <td>1.303566</td>\n",
       "      <td>0.940743</td>\n",
       "      <td>1.864889</td>\n",
       "      <td>1.440432</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>0.474177</td>\n",
       "      <td>-0.863964</td>\n",
       "      <td>0.021501</td>\n",
       "      <td>1.243103</td>\n",
       "      <td>-1.368377</td>\n",
       "      <td>0.524176</td>\n",
       "      <td>0.152470</td>\n",
       "      <td>1.473136</td>\n",
       "      <td>1.470679</td>\n",
       "      <td>0.187535</td>\n",
       "      <td>...</td>\n",
       "      <td>0.202748</td>\n",
       "      <td>-0.611225</td>\n",
       "      <td>0.976940</td>\n",
       "      <td>0.827594</td>\n",
       "      <td>1.009862</td>\n",
       "      <td>1.224960</td>\n",
       "      <td>1.053299</td>\n",
       "      <td>0.898413</td>\n",
       "      <td>0.863015</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>1.349509</td>\n",
       "      <td>1.182344</td>\n",
       "      <td>1.483561</td>\n",
       "      <td>1.152123</td>\n",
       "      <td>-1.447959</td>\n",
       "      <td>0.587569</td>\n",
       "      <td>0.345565</td>\n",
       "      <td>-0.912446</td>\n",
       "      <td>1.609298</td>\n",
       "      <td>0.926751</td>\n",
       "      <td>...</td>\n",
       "      <td>-0.205335</td>\n",
       "      <td>1.277246</td>\n",
       "      <td>0.810361</td>\n",
       "      <td>1.120607</td>\n",
       "      <td>0.989155</td>\n",
       "      <td>0.480870</td>\n",
       "      <td>0.781452</td>\n",
       "      <td>0.880766</td>\n",
       "      <td>0.778491</td>\n",
       "      <td>0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>940155</th>\n",
       "      <td>0.587276</td>\n",
       "      <td>0.313564</td>\n",
       "      <td>-0.993445</td>\n",
       "      <td>0.782753</td>\n",
       "      <td>-0.888669</td>\n",
       "      <td>1.123929</td>\n",
       "      <td>0.332692</td>\n",
       "      <td>0.371592</td>\n",
       "      <td>1.156927</td>\n",
       "      <td>-1.131590</td>\n",
       "      <td>...</td>\n",
       "      <td>0.206079</td>\n",
       "      <td>0.766727</td>\n",
       "      <td>0.851075</td>\n",
       "      <td>0.943668</td>\n",
       "      <td>0.987178</td>\n",
       "      <td>1.798033</td>\n",
       "      <td>2.160513</td>\n",
       "      <td>1.500113</td>\n",
       "      <td>1.208763</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>940156</th>\n",
       "      <td>0.596793</td>\n",
       "      <td>0.289215</td>\n",
       "      <td>1.474128</td>\n",
       "      <td>1.210700</td>\n",
       "      <td>0.509391</td>\n",
       "      <td>1.171381</td>\n",
       "      <td>-0.634763</td>\n",
       "      <td>-0.471715</td>\n",
       "      <td>0.401003</td>\n",
       "      <td>2.376040</td>\n",
       "      <td>...</td>\n",
       "      <td>-0.949046</td>\n",
       "      <td>0.366081</td>\n",
       "      <td>0.922222</td>\n",
       "      <td>1.126429</td>\n",
       "      <td>0.991863</td>\n",
       "      <td>0.779673</td>\n",
       "      <td>0.987974</td>\n",
       "      <td>0.846424</td>\n",
       "      <td>0.757657</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>940157</th>\n",
       "      <td>1.170526</td>\n",
       "      <td>0.264865</td>\n",
       "      <td>0.995283</td>\n",
       "      <td>0.669898</td>\n",
       "      <td>-1.625072</td>\n",
       "      <td>0.624944</td>\n",
       "      <td>-1.218008</td>\n",
       "      <td>-0.401309</td>\n",
       "      <td>0.772796</td>\n",
       "      <td>-2.585736</td>\n",
       "      <td>...</td>\n",
       "      <td>0.913146</td>\n",
       "      <td>0.140231</td>\n",
       "      <td>0.866371</td>\n",
       "      <td>0.915057</td>\n",
       "      <td>0.990963</td>\n",
       "      <td>0.706775</td>\n",
       "      <td>1.047065</td>\n",
       "      <td>0.810152</td>\n",
       "      <td>0.688482</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>940158</th>\n",
       "      <td>0.753082</td>\n",
       "      <td>-0.740270</td>\n",
       "      <td>-0.325390</td>\n",
       "      <td>0.928037</td>\n",
       "      <td>-1.310634</td>\n",
       "      <td>0.609554</td>\n",
       "      <td>0.759481</td>\n",
       "      <td>0.546775</td>\n",
       "      <td>0.334903</td>\n",
       "      <td>-2.610021</td>\n",
       "      <td>...</td>\n",
       "      <td>1.627709</td>\n",
       "      <td>-0.753837</td>\n",
       "      <td>3.007397</td>\n",
       "      <td>1.614819</td>\n",
       "      <td>0.985834</td>\n",
       "      <td>1.255718</td>\n",
       "      <td>0.402101</td>\n",
       "      <td>1.169045</td>\n",
       "      <td>1.053299</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>940159</th>\n",
       "      <td>1.663553</td>\n",
       "      <td>0.747954</td>\n",
       "      <td>-0.489629</td>\n",
       "      <td>0.382838</td>\n",
       "      <td>-1.117248</td>\n",
       "      <td>1.274440</td>\n",
       "      <td>0.787207</td>\n",
       "      <td>1.207590</td>\n",
       "      <td>0.685167</td>\n",
       "      <td>0.489633</td>\n",
       "      <td>...</td>\n",
       "      <td>1.778450</td>\n",
       "      <td>-0.399246</td>\n",
       "      <td>0.556347</td>\n",
       "      <td>1.046078</td>\n",
       "      <td>0.988520</td>\n",
       "      <td>0.750873</td>\n",
       "      <td>1.103920</td>\n",
       "      <td>0.972404</td>\n",
       "      <td>0.807859</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>940160 rows × 25 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "        lepton_pT  lepton_eta  lepton_phi  missing_energy_magnitude  \\\n",
       "0        0.730938   -0.862016    1.304896                  1.203239   \n",
       "1        0.824456   -0.017586   -1.094985                  0.338968   \n",
       "2        0.592217   -0.614628    0.600777                  1.598253   \n",
       "3        0.474177   -0.863964    0.021501                  1.243103   \n",
       "4        1.349509    1.182344    1.483561                  1.152123   \n",
       "...           ...         ...         ...                       ...   \n",
       "940155   0.587276    0.313564   -0.993445                  0.782753   \n",
       "940156   0.596793    0.289215    1.474128                  1.210700   \n",
       "940157   1.170526    0.264865    0.995283                  0.669898   \n",
       "940158   0.753082   -0.740270   -0.325390                  0.928037   \n",
       "940159   1.663553    0.747954   -0.489629                  0.382838   \n",
       "\n",
       "        missing_energy_phi  jet_1_pt  jet_1_eta  jet_1_phi  jet_2_pt  \\\n",
       "0                -0.937731  1.263905  -0.407010  -0.044292  0.738928   \n",
       "1                -1.236657  1.254561  -0.678333   0.087198  1.610054   \n",
       "2                 0.388530  0.829687   1.490271  -1.401960  0.669303   \n",
       "3                -1.368377  0.524176   0.152470   1.473136  1.470679   \n",
       "4                -1.447959  0.587569   0.345565  -0.912446  1.609298   \n",
       "...                    ...       ...        ...        ...       ...   \n",
       "940155           -0.888669  1.123929   0.332692   0.371592  1.156927   \n",
       "940156            0.509391  1.171381  -0.634763  -0.471715  0.401003   \n",
       "940157           -1.625072  0.624944  -1.218008  -0.401309  0.772796   \n",
       "940158           -1.310634  0.609554   0.759481   0.546775  0.334903   \n",
       "940159           -1.117248  1.274440   0.787207   1.207590  0.685167   \n",
       "\n",
       "        jet_2_eta  ...  jet_4_eta  jet_4_phi      m_jj     m_jjj      m_lv  \\\n",
       "0       -0.392375  ...   0.674959  -1.543477  1.502878  1.119366  1.170362   \n",
       "1       -0.788695  ...   1.025577  -0.642300  2.209537  1.271808  0.993527   \n",
       "2        0.395409  ...   1.175486  -0.388148  0.736590  0.889902  0.988300   \n",
       "3        0.187535  ...   0.202748  -0.611225  0.976940  0.827594  1.009862   \n",
       "4        0.926751  ...  -0.205335   1.277246  0.810361  1.120607  0.989155   \n",
       "...           ...  ...        ...        ...       ...       ...       ...   \n",
       "940155  -1.131590  ...   0.206079   0.766727  0.851075  0.943668  0.987178   \n",
       "940156   2.376040  ...  -0.949046   0.366081  0.922222  1.126429  0.991863   \n",
       "940157  -2.585736  ...   0.913146   0.140231  0.866371  0.915057  0.990963   \n",
       "940158  -2.610021  ...   1.627709  -0.753837  3.007397  1.614819  0.985834   \n",
       "940159   0.489633  ...   1.778450  -0.399246  0.556347  1.046078  0.988520   \n",
       "\n",
       "           m_jlv      m_bb     m_wbb    m_wwbb  target  \n",
       "0       0.706155  0.537799  0.839494  0.807032       0  \n",
       "1       1.455711  1.157583  1.172590  1.101483       0  \n",
       "2       1.303566  0.940743  1.864889  1.440432       0  \n",
       "3       1.224960  1.053299  0.898413  0.863015       0  \n",
       "4       0.480870  0.781452  0.880766  0.778491       0  \n",
       "...          ...       ...       ...       ...     ...  \n",
       "940155  1.798033  2.160513  1.500113  1.208763       1  \n",
       "940156  0.779673  0.987974  0.846424  0.757657       1  \n",
       "940157  0.706775  1.047065  0.810152  0.688482       1  \n",
       "940158  1.255718  0.402101  1.169045  1.053299       1  \n",
       "940159  0.750873  1.103920  0.972404  0.807859       1  \n",
       "\n",
       "[940160 rows x 25 columns]"
      ]
     },
     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data=load_arff(os.path.join(\"higg\",\"dataset_\"),capitalize=True)\n",
    "data"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.11.5"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}
