{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Example: reproducing PeaBERT's MRPC result of 92.9%"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [0] Experiments were run on two NVIDIA T4 GPUs\n",
    "\n",
    "### * The procedures for [1] and [2] follow the [GitHub repository](https://github.com/intersun/PKD-for-BERT-Model-Compression) for [Patient Knowledge Distillation for BERT Model Compression](https://arxiv.org/abs/1908.09355)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [1] 12-layer teacher\n",
    "-> Saved at '/data/outputs/KD/MRPC/MRPC_patient_kd_teacher_12layer_result_summary.pkl'\n",
    "\n",
    "-> The file is removed for now because of the 100 MB size limit for submission."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [2] BERT6-PKD\n",
    "-> Run several times to check the results of the original BERT6-PKD"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The scores at the bottom of each cell are the best validation values reached during training. We compare these scores on the dev set."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 12:38:36 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 12:38:38 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 12:38:38 - INFO - src.argument_parser -   random seed = 53690224\n",
      "10/02/2020 12:38:38 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 12:38:38 - INFO - __main__ -   Seed changed to : 53690224\n",
      "10/02/2020 12:38:38 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 12:38:38 - INFO - __main__ -   teacher acc = 94.11, teacher loss = 0.17171\n",
      "10/02/2020 12:38:38 - INFO - __main__ -   teacher acc = 89.22, teacher loss = 0.26664\n",
      "10/02/2020 12:38:38 - INFO - src.nli_data_processing -   Writing example 0 of 3668\n",
      "10/02/2020 12:38:40 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 12:38:40 - INFO - __main__ -     Num examples = 3668\n",
      "10/02/2020 12:38:40 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 12:38:40 - INFO - __main__ -     Num steps = 342\n",
      "10/02/2020 12:38:40 - INFO - src.nli_data_processing -   Writing example 0 of 408\n",
      "10/02/2020 12:38:41 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 12:38:41 - INFO - __main__ -     Num examples = 408\n",
      "10/02/2020 12:38:41 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 12:38:41 - INFO - src.nli_data_processing -   Writing example 0 of 1725\n",
      "10/02/2020 12:38:42 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 12:38:42 - INFO - __main__ -     Num examples = 1725\n",
      "10/02/2020 12:38:42 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 12:38:42 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 12:38:42 - INFO - src.nli_data_processing -   predicting for MRPC\n",
      "10/02/2020 12:38:42 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 12:38:42 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 12:38:43 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 12:38:46 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 12:38:46 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 12:38:46 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 12:38:46 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 12:38:46 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 12:38:46 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 12:38:52 - INFO - __main__ -   \n",
      "10/02/2020 12:38:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:38:52 - INFO - __main__ -   Validation f1 improved! 0 -> 0.8122270742358079\n",
      "10/02/2020 12:38:52 - INFO - __main__ -   Validation Acc : 0.6838235294117647\n",
      "10/02/2020 12:38:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:38:52 - INFO - __main__ -   \n",
      "10/02/2020 12:38:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:38:52 - INFO - __main__ -   Validation Acc improved! 0 -> 0.6838235294117647\n",
      "10/02/2020 12:38:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   2%|▌                                | 1/58 [00:05<05:24,  5.70s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:07<04:19,  4.63s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:09<03:33,  3.88s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:12<03:01,  3.37s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:14<02:39,  3.00s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:16<02:23,  2.76s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:18<02:12,  2.59s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:20<02:04,  2.48s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:23<01:57,  2.41s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:25<01:53,  2.36s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:27<01:49,  2.33s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:29<01:46,  2.31s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:32<01:43,  2.31s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:34<01:41,  2.31s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:36<01:39,  2.32s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:39<01:37,  2.31s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:41<01:34,  2.31s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:43<01:31,  2.30s/it]\u001b[A10/02/2020 12:39:32 - INFO - __main__ -   \n",
      "10/02/2020 12:39:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:32 - INFO - __main__ -   Validation f1 improved! 0.8122270742358079 -> 0.8134110787172011\n",
      "10/02/2020 12:39:32 - INFO - __main__ -   Validation Acc : 0.6862745098039216\n",
      "10/02/2020 12:39:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:32 - INFO - __main__ -   \n",
      "10/02/2020 12:39:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:32 - INFO - __main__ -   Validation Acc improved! 0.6838235294117647 -> 0.6862745098039216\n",
      "10/02/2020 12:39:32 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:45<01:28,  2.28s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:48<01:26,  2.28s/it]\u001b[A10/02/2020 12:39:37 - INFO - __main__ -   \n",
      "10/02/2020 12:39:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:37 - INFO - __main__ -   Validation f1 improved! 0.8134110787172011 -> 0.8169838945827232\n",
      "10/02/2020 12:39:37 - INFO - __main__ -   Validation Acc : 0.6936274509803921\n",
      "10/02/2020 12:39:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:37 - INFO - __main__ -   \n",
      "10/02/2020 12:39:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:37 - INFO - __main__ -   Validation Acc improved! 0.6862745098039216 -> 0.6936274509803921\n",
      "10/02/2020 12:39:37 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:50<01:23,  2.27s/it]\u001b[A10/02/2020 12:39:39 - INFO - __main__ -   \n",
      "10/02/2020 12:39:39 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:39 - INFO - __main__ -   Validation f1 improved! 0.8169838945827232 -> 0.8181818181818181\n",
      "10/02/2020 12:39:39 - INFO - __main__ -   Validation Acc : 0.696078431372549\n",
      "10/02/2020 12:39:39 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:39 - INFO - __main__ -   \n",
      "10/02/2020 12:39:39 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:39 - INFO - __main__ -   Validation Acc improved! 0.6936274509803921 -> 0.696078431372549\n",
      "10/02/2020 12:39:39 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  38%|████████████▏                   | 22/58 [00:52<01:21,  2.25s/it]\u001b[A10/02/2020 12:39:41 - INFO - __main__ -   \n",
      "10/02/2020 12:39:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:41 - INFO - __main__ -   Validation f1 improved! 0.8181818181818181 -> 0.822289156626506\n",
      "10/02/2020 12:39:41 - INFO - __main__ -   Validation Acc : 0.7107843137254902\n",
      "10/02/2020 12:39:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:41 - INFO - __main__ -   \n",
      "10/02/2020 12:39:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:41 - INFO - __main__ -   Validation Acc improved! 0.696078431372549 -> 0.7107843137254902\n",
      "10/02/2020 12:39:41 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:54<01:18,  2.24s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:57<01:15,  2.22s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:59<01:12,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [01:01<01:10,  2.21s/it]\u001b[A10/02/2020 12:39:50 - INFO - __main__ -   \n",
      "10/02/2020 12:39:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:50 - INFO - __main__ -   Validation Acc improved! 0.7107843137254902 -> 0.7156862745098039\n",
      "10/02/2020 12:39:50 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [01:03<01:08,  2.20s/it]\u001b[A10/02/2020 12:39:52 - INFO - __main__ -   \n",
      "10/02/2020 12:39:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:52 - INFO - __main__ -   Validation f1 improved! 0.822289156626506 -> 0.8233333333333334\n",
      "10/02/2020 12:39:52 - INFO - __main__ -   Validation Acc : 0.7401960784313726\n",
      "10/02/2020 12:39:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:52 - INFO - __main__ -   \n",
      "10/02/2020 12:39:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:52 - INFO - __main__ -   Validation Acc improved! 0.7156862745098039 -> 0.7401960784313726\n",
      "10/02/2020 12:39:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:05<01:05,  2.20s/it]\u001b[A10/02/2020 12:39:54 - INFO - __main__ -   \n",
      "10/02/2020 12:39:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:54 - INFO - __main__ -   Validation f1 improved! 0.8233333333333334 -> 0.835314091680815\n",
      "10/02/2020 12:39:54 - INFO - __main__ -   Validation Acc : 0.7622549019607843\n",
      "10/02/2020 12:39:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:54 - INFO - __main__ -   \n",
      "10/02/2020 12:39:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:54 - INFO - __main__ -   Validation Acc improved! 0.7401960784313726 -> 0.7622549019607843\n",
      "10/02/2020 12:39:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  50%|████████████████                | 29/58 [01:08<01:03,  2.19s/it]\u001b[A10/02/2020 12:39:56 - INFO - __main__ -   \n",
      "10/02/2020 12:39:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:39:56 - INFO - __main__ -   Validation f1 improved! 0.835314091680815 -> 0.8422764227642278\n",
      "10/02/2020 12:39:56 - INFO - __main__ -   Validation Acc : 0.7622549019607843\n",
      "10/02/2020 12:39:56 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:10<01:01,  2.18s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:12<00:58,  2.17s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:14<00:56,  2.17s/it]\u001b[A10/02/2020 12:40:03 - INFO - __main__ -   \n",
      "10/02/2020 12:40:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:40:03 - INFO - __main__ -   Validation f1 improved! 0.8422764227642278 -> 0.84472049689441\n",
      "10/02/2020 12:40:03 - INFO - __main__ -   Validation Acc : 0.7549019607843137\n",
      "10/02/2020 12:40:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:16<00:54,  2.17s/it]\u001b[A10/02/2020 12:40:05 - INFO - __main__ -   \n",
      "10/02/2020 12:40:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:40:05 - INFO - __main__ -   Validation Acc improved! 0.7622549019607843 -> 0.7794117647058824\n",
      "10/02/2020 12:40:05 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:18<00:51,  2.16s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:20<00:49,  2.16s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:23<00:47,  2.16s/it]\u001b[A10/02/2020 12:40:11 - INFO - __main__ -   \n",
      "10/02/2020 12:40:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:40:11 - INFO - __main__ -   Validation f1 improved! 0.84472049689441 -> 0.8614864864864864\n",
      "10/02/2020 12:40:11 - INFO - __main__ -   Validation Acc : 0.7990196078431373\n",
      "10/02/2020 12:40:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:40:11 - INFO - __main__ -   \n",
      "10/02/2020 12:40:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:40:11 - INFO - __main__ -   Validation Acc improved! 0.7794117647058824 -> 0.7990196078431373\n",
      "10/02/2020 12:40:11 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:25<00:45,  2.16s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:27<00:43,  2.16s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:29<00:41,  2.16s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:31<00:39,  2.17s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:33<00:36,  2.17s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:36<00:34,  2.17s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:38<00:32,  2.17s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:40<00:30,  2.17s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:42<00:28,  2.19s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:44<00:26,  2.19s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:47<00:24,  2.20s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:49<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:51<00:19,  2.20s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:53<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:55<00:15,  2.20s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:58<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [02:00<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [02:02<00:08,  2.22s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:04<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:07<00:04,  2.22s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:09<00:02,  2.21s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:11<00:00,  2.26s/it]\u001b[A\n",
      "Epoch:  17%|██████▏                              | 1/6 [02:11<10:55, 131.19s/it]\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A10/02/2020 12:40:59 - INFO - __main__ -   \n",
      "10/02/2020 12:40:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:40:59 - INFO - __main__ -   Validation Acc improved! 0.7990196078431373 -> 0.8014705882352942\n",
      "10/02/2020 12:40:59 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:05,  2.20s/it]\u001b[A10/02/2020 12:41:02 - INFO - __main__ -   \n",
      "10/02/2020 12:41:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:02 - INFO - __main__ -   Validation f1 improved! 0.8614864864864864 -> 0.8618968386023296\n",
      "10/02/2020 12:41:02 - INFO - __main__ -   Validation Acc : 0.7965686274509803\n",
      "10/02/2020 12:41:02 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.19s/it]\u001b[A10/02/2020 12:41:04 - INFO - __main__ -   \n",
      "10/02/2020 12:41:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:04 - INFO - __main__ -   Validation f1 improved! 0.8618968386023296 -> 0.8639240506329114\n",
      "10/02/2020 12:41:04 - INFO - __main__ -   Validation Acc : 0.7892156862745098\n",
      "10/02/2020 12:41:04 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:56,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:53,  2.19s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.20s/it]\u001b[A10/02/2020 12:41:15 - INFO - __main__ -   \n",
      "10/02/2020 12:41:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:15 - INFO - __main__ -   Validation f1 improved! 0.8639240506329114 -> 0.867986798679868\n",
      "10/02/2020 12:41:15 - INFO - __main__ -   Validation Acc : 0.803921568627451\n",
      "10/02/2020 12:41:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:15 - INFO - __main__ -   \n",
      "10/02/2020 12:41:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:15 - INFO - __main__ -   Validation Acc improved! 0.8014705882352942 -> 0.803921568627451\n",
      "10/02/2020 12:41:15 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:49,  2.20s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A10/02/2020 12:41:19 - INFO - __main__ -   \n",
      "10/02/2020 12:41:19 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:19 - INFO - __main__ -   Validation Acc improved! 0.803921568627451 -> 0.8063725490196079\n",
      "10/02/2020 12:41:19 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:45,  2.21s/it]\u001b[A10/02/2020 12:41:21 - INFO - __main__ -   \n",
      "10/02/2020 12:41:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:21 - INFO - __main__ -   Validation Acc improved! 0.8063725490196079 -> 0.8088235294117647\n",
      "10/02/2020 12:41:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:44,  2.21s/it]\u001b[A10/02/2020 12:41:24 - INFO - __main__ -   \n",
      "10/02/2020 12:41:24 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:24 - INFO - __main__ -   Validation Acc improved! 0.8088235294117647 -> 0.8112745098039216\n",
      "10/02/2020 12:41:24 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.20s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:38,  2.20s/it]\u001b[A10/02/2020 12:41:28 - INFO - __main__ -   \n",
      "10/02/2020 12:41:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:28 - INFO - __main__ -   Validation f1 improved! 0.867986798679868 -> 0.8685524126455907\n",
      "10/02/2020 12:41:28 - INFO - __main__ -   Validation Acc : 0.8063725490196079\n",
      "10/02/2020 12:41:28 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:35,  2.21s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.20s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.20s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.21s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.22s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.21s/it]\u001b[A10/02/2020 12:41:46 - INFO - __main__ -   \n",
      "10/02/2020 12:41:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:46 - INFO - __main__ -   Validation f1 improved! 0.8685524126455907 -> 0.870307167235495\n",
      "10/02/2020 12:41:46 - INFO - __main__ -   Validation Acc : 0.8137254901960784\n",
      "10/02/2020 12:41:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:46 - INFO - __main__ -   \n",
      "10/02/2020 12:41:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:46 - INFO - __main__ -   Validation Acc improved! 0.8112745098039216 -> 0.8137254901960784\n",
      "10/02/2020 12:41:46 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A10/02/2020 12:41:48 - INFO - __main__ -   \n",
      "10/02/2020 12:41:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:48 - INFO - __main__ -   Validation f1 improved! 0.870307167235495 -> 0.8726003490401396\n",
      "10/02/2020 12:41:48 - INFO - __main__ -   Validation Acc : 0.821078431372549\n",
      "10/02/2020 12:41:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:48 - INFO - __main__ -   \n",
      "10/02/2020 12:41:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:41:48 - INFO - __main__ -   Validation Acc improved! 0.8137254901960784 -> 0.821078431372549\n",
      "10/02/2020 12:41:48 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.20s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.20s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.21s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.22s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:03<01:03,  2.21s/it]\u001b[A10/02/2020 12:42:03 - INFO - __main__ -   \n",
      "10/02/2020 12:42:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:03 - INFO - __main__ -   Validation f1 improved! 0.8726003490401396 -> 0.8775510204081631\n",
      "10/02/2020 12:42:03 - INFO - __main__ -   Validation Acc : 0.8235294117647058\n",
      "10/02/2020 12:42:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:03 - INFO - __main__ -   \n",
      "10/02/2020 12:42:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:03 - INFO - __main__ -   Validation Acc improved! 0.821078431372549 -> 0.8235294117647058\n",
      "10/02/2020 12:42:03 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A10/02/2020 12:42:06 - INFO - __main__ -   \n",
      "10/02/2020 12:42:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:06 - INFO - __main__ -   Validation f1 improved! 0.8775510204081631 -> 0.880415944540728\n",
      "10/02/2020 12:42:06 - INFO - __main__ -   Validation Acc : 0.8308823529411765\n",
      "10/02/2020 12:42:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:06 - INFO - __main__ -   \n",
      "10/02/2020 12:42:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:06 - INFO - __main__ -   Validation Acc improved! 0.8235294117647058 -> 0.8308823529411765\n",
      "10/02/2020 12:42:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.22s/it]\u001b[A10/02/2020 12:42:08 - INFO - __main__ -   \n",
      "10/02/2020 12:42:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:08 - INFO - __main__ -   Validation f1 improved! 0.880415944540728 -> 0.8857644991212652\n",
      "10/02/2020 12:42:08 - INFO - __main__ -   Validation Acc : 0.8406862745098039\n",
      "10/02/2020 12:42:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:08 - INFO - __main__ -   \n",
      "10/02/2020 12:42:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:42:08 - INFO - __main__ -   Validation Acc improved! 0.8308823529411765 -> 0.8406862745098039\n",
      "10/02/2020 12:42:08 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.22s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.22s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:42,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.22s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.22s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:30,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.22s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.22s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.22s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.22s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.22s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.20s/it]\u001b[A\n",
      "Epoch:  33%|████████████▎                        | 2/6 [04:19<08:40, 130.19s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:04,  2.19s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.19s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:55,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:53,  2.19s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.20s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:49,  2.20s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:21<01:45,  2.20s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.21s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.21s/it]\u001b[A10/02/2020 12:43:36 - INFO - __main__ -   \n",
      "10/02/2020 12:43:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:43:36 - INFO - __main__ -   Validation f1 improved! 0.8857644991212652 -> 0.8862068965517241\n",
      "10/02/2020 12:43:36 - INFO - __main__ -   Validation Acc : 0.8382352941176471\n",
      "10/02/2020 12:43:36 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:35,  2.22s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.22s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.22s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.21s/it]\u001b[A10/02/2020 12:43:51 - INFO - __main__ -   \n",
      "10/02/2020 12:43:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:43:51 - INFO - __main__ -   Validation f1 improved! 0.8862068965517241 -> 0.889655172413793\n",
      "10/02/2020 12:43:51 - INFO - __main__ -   Validation Acc : 0.8431372549019608\n",
      "10/02/2020 12:43:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:43:51 - INFO - __main__ -   \n",
      "10/02/2020 12:43:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:43:51 - INFO - __main__ -   Validation Acc improved! 0.8406862745098039 -> 0.8431372549019608\n",
      "10/02/2020 12:43:51 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:22,  2.22s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.22s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:12,  2.20s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.20s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:03<01:03,  2.21s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.20s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.22s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:42,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.22s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A10/02/2020 12:44:38 - INFO - __main__ -   \n",
      "10/02/2020 12:44:38 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:44:38 - INFO - __main__ -   Validation f1 improved! 0.889655172413793 -> 0.8900343642611684\n",
      "10/02/2020 12:44:38 - INFO - __main__ -   Validation Acc : 0.8431372549019608\n",
      "10/02/2020 12:44:38 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:31,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.20s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.20s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A10/02/2020 12:45:00 - INFO - __main__ -   \n",
      "10/02/2020 12:45:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:45:00 - INFO - __main__ -   Validation f1 improved! 0.8900343642611684 -> 0.8923076923076922\n",
      "10/02/2020 12:45:00 - INFO - __main__ -   Validation Acc : 0.8455882352941176\n",
      "10/02/2020 12:45:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:45:00 - INFO - __main__ -   \n",
      "10/02/2020 12:45:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:45:00 - INFO - __main__ -   Validation Acc improved! 0.8431372549019608 -> 0.8455882352941176\n",
      "10/02/2020 12:45:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.22s/it]\u001b[A10/02/2020 12:45:02 - INFO - __main__ -   \n",
      "10/02/2020 12:45:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:45:02 - INFO - __main__ -   Validation f1 improved! 0.8923076923076922 -> 0.8934707903780068\n",
      "10/02/2020 12:45:02 - INFO - __main__ -   Validation Acc : 0.8480392156862745\n",
      "10/02/2020 12:45:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:45:02 - INFO - __main__ -   \n",
      "10/02/2020 12:45:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:45:02 - INFO - __main__ -   Validation Acc improved! 0.8455882352941176 -> 0.8480392156862745\n",
      "10/02/2020 12:45:02 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.22s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.21s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.21s/it]\u001b[A\n",
      "Epoch:  50%|██████████████████▌                  | 3/6 [06:26<06:28, 129.50s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:05,  2.21s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:03,  2.21s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.20s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:56,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:53,  2.19s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:51,  2.19s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:49,  2.20s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:21<01:45,  2.20s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.21s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.20s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.21s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.22s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:32<01:34,  2.21s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.22s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.21s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.22s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.21s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.22s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.21s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.22s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:03<01:04,  2.21s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:02,  2.22s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.21s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.21s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.22s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:41,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.22s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.22s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:31,  2.22s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.22s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.22s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.20s/it]\u001b[A\n",
      "Epoch:  67%|████████████████████████▋            | 4/6 [08:34<04:18, 129.00s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:04,  2.19s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.19s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:55,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:53,  2.19s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.20s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:49,  2.20s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.22s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.20s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.20s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:34,  2.21s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.20s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.22s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:23,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.21s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.22s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.21s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:12,  2.20s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.22s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.22s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.21s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.20s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.22s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.22s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:42,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.22s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.22s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:31,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.20s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.22s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.22s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.20s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.22s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.22s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.20s/it]\u001b[A\n",
      "Epoch:  83%|██████████████████████████████▊      | 5/6 [10:42<02:08, 128.66s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:04,  2.19s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.18s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:55,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:54,  2.20s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:51,  2.20s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.21s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.21s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:44,  2.22s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.20s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:34,  2.21s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.22s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.21s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.21s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.22s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.21s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.22s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.22s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.22s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.22s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.20s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.21s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.22s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:41,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.22s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.22s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:30,  2.20s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.20s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.20s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.20s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.20s/it]\u001b[A10/02/2020 12:51:26 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A10/02/2020 12:51:28 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.22s/it]\u001b[A10/02/2020 12:51:31 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.22s/it]\u001b[A10/02/2020 12:51:33 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.21s/it]\u001b[A10/02/2020 12:51:35 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.21s/it]\u001b[A\n",
      "Epoch: 100%|█████████████████████████████████████| 6/6 [12:50<00:00, 128.42s/it]\n",
      "10/02/2020 12:51:37 - INFO - __main__ -   \n",
      "10/02/2020 12:51:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:37 - INFO - __main__ -   Validation Accuracy : 0.8480392156862745 Validation f1 : 0.8934707903780068\n",
      "10/02/2020 12:51:37 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_MRPC.py  # best validation F1 (%): 89.34"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 15,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 12:51:39 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 12:51:40 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 12:51:40 - INFO - src.argument_parser -   random seed = 51660145\n",
      "10/02/2020 12:51:40 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 12:51:40 - INFO - __main__ -   Seed changed to : 51660145\n",
      "10/02/2020 12:51:40 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 12:51:40 - INFO - __main__ -   teacher acc = 94.11, teacher loss = 0.17171\n",
      "10/02/2020 12:51:40 - INFO - __main__ -   teacher acc = 89.22, teacher loss = 0.26664\n",
      "10/02/2020 12:51:40 - INFO - src.nli_data_processing -   Writing example 0 of 3668\n",
      "10/02/2020 12:51:43 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 12:51:43 - INFO - __main__ -     Num examples = 3668\n",
      "10/02/2020 12:51:43 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 12:51:43 - INFO - __main__ -     Num steps = 342\n",
      "10/02/2020 12:51:43 - INFO - src.nli_data_processing -   Writing example 0 of 408\n",
      "10/02/2020 12:51:43 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 12:51:43 - INFO - __main__ -     Num examples = 408\n",
      "10/02/2020 12:51:43 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 12:51:43 - INFO - src.nli_data_processing -   Writing example 0 of 1725\n",
      "10/02/2020 12:51:44 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 12:51:44 - INFO - __main__ -     Num examples = 1725\n",
      "10/02/2020 12:51:44 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 12:51:44 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 12:51:44 - INFO - src.nli_data_processing -   predicting for MRPC\n",
      "10/02/2020 12:51:44 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 12:51:44 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 12:51:45 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 12:51:48 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 12:51:48 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 12:51:48 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 12:51:48 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 12:51:48 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 12:51:48 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 12:51:54 - INFO - __main__ -   \n",
      "10/02/2020 12:51:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:54 - INFO - __main__ -   Validation f1 improved! 0 -> 0.2506963788300836\n",
      "10/02/2020 12:51:54 - INFO - __main__ -   Validation Acc : 0.34068627450980393\n",
      "10/02/2020 12:51:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:54 - INFO - __main__ -   \n",
      "10/02/2020 12:51:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:54 - INFO - __main__ -   Validation Acc improved! 0 -> 0.34068627450980393\n",
      "10/02/2020 12:51:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   2%|▌                                | 1/58 [00:05<05:28,  5.77s/it]\u001b[A10/02/2020 12:51:56 - INFO - __main__ -   \n",
      "10/02/2020 12:51:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:56 - INFO - __main__ -   Validation f1 improved! 0.2506963788300836 -> 0.3990384615384615\n",
      "10/02/2020 12:51:56 - INFO - __main__ -   Validation Acc : 0.3872549019607843\n",
      "10/02/2020 12:51:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:56 - INFO - __main__ -   \n",
      "10/02/2020 12:51:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:56 - INFO - __main__ -   Validation Acc improved! 0.34068627450980393 -> 0.3872549019607843\n",
      "10/02/2020 12:51:56 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|█▏                               | 2/58 [00:07<04:21,  4.67s/it]\u001b[A10/02/2020 12:51:58 - INFO - __main__ -   \n",
      "10/02/2020 12:51:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:58 - INFO - __main__ -   Validation f1 improved! 0.3990384615384615 -> 0.6690909090909091\n",
      "10/02/2020 12:51:58 - INFO - __main__ -   Validation Acc : 0.553921568627451\n",
      "10/02/2020 12:51:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:58 - INFO - __main__ -   \n",
      "10/02/2020 12:51:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:51:58 - INFO - __main__ -   Validation Acc improved! 0.3872549019607843 -> 0.553921568627451\n",
      "10/02/2020 12:51:58 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 3/58 [00:10<03:35,  3.91s/it]\u001b[A10/02/2020 12:52:01 - INFO - __main__ -   \n",
      "10/02/2020 12:52:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:01 - INFO - __main__ -   Validation f1 improved! 0.6690909090909091 -> 0.8064992614475628\n",
      "10/02/2020 12:52:01 - INFO - __main__ -   Validation Acc : 0.678921568627451\n",
      "10/02/2020 12:52:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:01 - INFO - __main__ -   \n",
      "10/02/2020 12:52:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:01 - INFO - __main__ -   Validation Acc improved! 0.553921568627451 -> 0.678921568627451\n",
      "10/02/2020 12:52:01 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   7%|██▎                              | 4/58 [00:12<03:02,  3.39s/it]\u001b[A10/02/2020 12:52:03 - INFO - __main__ -   \n",
      "10/02/2020 12:52:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:03 - INFO - __main__ -   Validation f1 improved! 0.8064992614475628 -> 0.8122270742358079\n",
      "10/02/2020 12:52:03 - INFO - __main__ -   Validation Acc : 0.6838235294117647\n",
      "10/02/2020 12:52:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:03 - INFO - __main__ -   \n",
      "10/02/2020 12:52:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:03 - INFO - __main__ -   Validation Acc improved! 0.678921568627451 -> 0.6838235294117647\n",
      "10/02/2020 12:52:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   9%|██▊                              | 5/58 [00:14<02:41,  3.04s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:16<02:25,  2.80s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:18<02:13,  2.62s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:21<02:05,  2.51s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:23<01:59,  2.43s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:25<01:54,  2.38s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:27<01:50,  2.34s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:30<01:47,  2.34s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:32<01:44,  2.33s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:34<01:42,  2.33s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:37<01:40,  2.33s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:39<01:38,  2.33s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:41<01:34,  2.31s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:44<01:32,  2.30s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:46<01:29,  2.28s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:48<01:26,  2.26s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:50<01:23,  2.25s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:52<01:20,  2.24s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:55<01:17,  2.22s/it]\u001b[A10/02/2020 12:52:46 - INFO - __main__ -   \n",
      "10/02/2020 12:52:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:46 - INFO - __main__ -   Validation f1 improved! 0.8122270742358079 -> 0.8157894736842105\n",
      "10/02/2020 12:52:46 - INFO - __main__ -   Validation Acc : 0.6911764705882353\n",
      "10/02/2020 12:52:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:46 - INFO - __main__ -   \n",
      "10/02/2020 12:52:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:46 - INFO - __main__ -   Validation Acc improved! 0.6838235294117647 -> 0.6911764705882353\n",
      "10/02/2020 12:52:46 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:57<01:15,  2.22s/it]\u001b[A10/02/2020 12:52:48 - INFO - __main__ -   \n",
      "10/02/2020 12:52:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:48 - INFO - __main__ -   Validation f1 improved! 0.8157894736842105 -> 0.8193832599118943\n",
      "10/02/2020 12:52:48 - INFO - __main__ -   Validation Acc : 0.6985294117647058\n",
      "10/02/2020 12:52:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:48 - INFO - __main__ -   \n",
      "10/02/2020 12:52:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:48 - INFO - __main__ -   Validation Acc improved! 0.6911764705882353 -> 0.6985294117647058\n",
      "10/02/2020 12:52:48 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:59<01:12,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [01:01<01:10,  2.20s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [01:03<01:07,  2.19s/it]\u001b[A10/02/2020 12:52:54 - INFO - __main__ -   \n",
      "10/02/2020 12:52:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:54 - INFO - __main__ -   Validation Acc improved! 0.6985294117647058 -> 0.7058823529411765\n",
      "10/02/2020 12:52:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:06<01:05,  2.20s/it]\u001b[A10/02/2020 12:52:57 - INFO - __main__ -   \n",
      "10/02/2020 12:52:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:57 - INFO - __main__ -   Validation Acc improved! 0.7058823529411765 -> 0.7156862745098039\n",
      "10/02/2020 12:52:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  50%|████████████████                | 29/58 [01:08<01:03,  2.20s/it]\u001b[A10/02/2020 12:52:59 - INFO - __main__ -   \n",
      "10/02/2020 12:52:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:52:59 - INFO - __main__ -   Validation Acc improved! 0.7156862745098039 -> 0.7254901960784313\n",
      "10/02/2020 12:52:59 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:10<01:01,  2.20s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:12<00:59,  2.20s/it]\u001b[A10/02/2020 12:53:03 - INFO - __main__ -   \n",
      "10/02/2020 12:53:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:03 - INFO - __main__ -   Validation f1 improved! 0.8193832599118943 -> 0.828006088280061\n",
      "10/02/2020 12:53:03 - INFO - __main__ -   Validation Acc : 0.7230392156862745\n",
      "10/02/2020 12:53:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:14<00:57,  2.20s/it]\u001b[A10/02/2020 12:53:05 - INFO - __main__ -   \n",
      "10/02/2020 12:53:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:05 - INFO - __main__ -   Validation f1 improved! 0.828006088280061 -> 0.8359133126934984\n",
      "10/02/2020 12:53:05 - INFO - __main__ -   Validation Acc : 0.7401960784313726\n",
      "10/02/2020 12:53:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:05 - INFO - __main__ -   \n",
      "10/02/2020 12:53:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:05 - INFO - __main__ -   Validation Acc improved! 0.7254901960784313 -> 0.7401960784313726\n",
      "10/02/2020 12:53:05 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:17<00:54,  2.19s/it]\u001b[A10/02/2020 12:53:08 - INFO - __main__ -   \n",
      "10/02/2020 12:53:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:08 - INFO - __main__ -   Validation f1 improved! 0.8359133126934984 -> 0.8388214904679375\n",
      "10/02/2020 12:53:08 - INFO - __main__ -   Validation Acc : 0.7720588235294118\n",
      "10/02/2020 12:53:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:08 - INFO - __main__ -   \n",
      "10/02/2020 12:53:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:08 - INFO - __main__ -   Validation Acc improved! 0.7401960784313726 -> 0.7720588235294118\n",
      "10/02/2020 12:53:08 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:19<00:52,  2.18s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:21<00:49,  2.17s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:23<00:47,  2.17s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:25<00:45,  2.17s/it]\u001b[A10/02/2020 12:53:16 - INFO - __main__ -   \n",
      "10/02/2020 12:53:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:16 - INFO - __main__ -   Validation f1 improved! 0.8388214904679375 -> 0.8478260869565217\n",
      "10/02/2020 12:53:16 - INFO - __main__ -   Validation Acc : 0.7598039215686274\n",
      "10/02/2020 12:53:16 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:27<00:43,  2.16s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:30<00:41,  2.16s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:32<00:39,  2.18s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:34<00:37,  2.19s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:36<00:34,  2.19s/it]\u001b[A10/02/2020 12:53:27 - INFO - __main__ -   \n",
      "10/02/2020 12:53:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:27 - INFO - __main__ -   Validation f1 improved! 0.8478260869565217 -> 0.8492307692307691\n",
      "10/02/2020 12:53:27 - INFO - __main__ -   Validation Acc : 0.7598039215686274\n",
      "10/02/2020 12:53:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:38<00:32,  2.19s/it]\u001b[A10/02/2020 12:53:29 - INFO - __main__ -   \n",
      "10/02/2020 12:53:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:29 - INFO - __main__ -   Validation f1 improved! 0.8492307692307691 -> 0.8557844690966719\n",
      "10/02/2020 12:53:29 - INFO - __main__ -   Validation Acc : 0.7769607843137255\n",
      "10/02/2020 12:53:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:29 - INFO - __main__ -   \n",
      "10/02/2020 12:53:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:29 - INFO - __main__ -   Validation Acc improved! 0.7720588235294118 -> 0.7769607843137255\n",
      "10/02/2020 12:53:29 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:41<00:30,  2.19s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:43<00:28,  2.19s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:45<00:26,  2.19s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:47<00:24,  2.19s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:49<00:21,  2.19s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:51<00:19,  2.19s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:54<00:17,  2.20s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:56<00:15,  2.20s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:58<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [02:00<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [02:03<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:05<00:06,  2.22s/it]\u001b[A10/02/2020 12:53:56 - INFO - __main__ -   \n",
      "10/02/2020 12:53:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:56 - INFO - __main__ -   Validation f1 improved! 0.8557844690966719 -> 0.8571428571428571\n",
      "10/02/2020 12:53:56 - INFO - __main__ -   Validation Acc : 0.7941176470588235\n",
      "10/02/2020 12:53:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:56 - INFO - __main__ -   \n",
      "10/02/2020 12:53:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:56 - INFO - __main__ -   Validation Acc improved! 0.7769607843137255 -> 0.7941176470588235\n",
      "10/02/2020 12:53:56 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:07<00:04,  2.21s/it]\u001b[A10/02/2020 12:53:58 - INFO - __main__ -   \n",
      "10/02/2020 12:53:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:58 - INFO - __main__ -   Validation f1 improved! 0.8571428571428571 -> 0.8591065292096219\n",
      "10/02/2020 12:53:58 - INFO - __main__ -   Validation Acc : 0.7990196078431373\n",
      "10/02/2020 12:53:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:58 - INFO - __main__ -   \n",
      "10/02/2020 12:53:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:53:58 - INFO - __main__ -   Validation Acc improved! 0.7941176470588235 -> 0.7990196078431373\n",
      "10/02/2020 12:53:58 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:09<00:02,  2.20s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:11<00:00,  2.27s/it]\u001b[A\n",
      "Epoch:  17%|██████▏                              | 1/6 [02:11<10:58, 131.63s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:07,  2.24s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:05,  2.24s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:02,  2.23s/it]\u001b[A10/02/2020 12:54:09 - INFO - __main__ -   \n",
      "10/02/2020 12:54:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:09 - INFO - __main__ -   Validation f1 improved! 0.8591065292096219 -> 0.8616352201257863\n",
      "10/02/2020 12:54:09 - INFO - __main__ -   Validation Acc : 0.7843137254901961\n",
      "10/02/2020 12:54:09 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<02:00,  2.23s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:11<01:57,  2.21s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:55,  2.22s/it]\u001b[A10/02/2020 12:54:15 - INFO - __main__ -   \n",
      "10/02/2020 12:54:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:15 - INFO - __main__ -   Validation f1 improved! 0.8616352201257863 -> 0.8639455782312925\n",
      "10/02/2020 12:54:15 - INFO - __main__ -   Validation Acc : 0.803921568627451\n",
      "10/02/2020 12:54:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:15 - INFO - __main__ -   \n",
      "10/02/2020 12:54:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:15 - INFO - __main__ -   Validation Acc improved! 0.7990196078431373 -> 0.803921568627451\n",
      "10/02/2020 12:54:15 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.21s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.22s/it]\u001b[A10/02/2020 12:54:20 - INFO - __main__ -   \n",
      "10/02/2020 12:54:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:20 - INFO - __main__ -   Validation f1 improved! 0.8639455782312925 -> 0.8670120898100172\n",
      "10/02/2020 12:54:20 - INFO - __main__ -   Validation Acc : 0.8112745098039216\n",
      "10/02/2020 12:54:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:20 - INFO - __main__ -   \n",
      "10/02/2020 12:54:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:20 - INFO - __main__ -   Validation Acc improved! 0.803921568627451 -> 0.8112745098039216\n",
      "10/02/2020 12:54:20 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A10/02/2020 12:54:22 - INFO - __main__ -   \n",
      "10/02/2020 12:54:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:22 - INFO - __main__ -   Validation f1 improved! 0.8670120898100172 -> 0.8692699490662139\n",
      "10/02/2020 12:54:22 - INFO - __main__ -   Validation Acc : 0.8112745098039216\n",
      "10/02/2020 12:54:22 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.21s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.20s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.21s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:34,  2.21s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A10/02/2020 12:54:38 - INFO - __main__ -   \n",
      "10/02/2020 12:54:38 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:38 - INFO - __main__ -   Validation f1 improved! 0.8692699490662139 -> 0.8710217755443886\n",
      "10/02/2020 12:54:38 - INFO - __main__ -   Validation Acc : 0.8112745098039216\n",
      "10/02/2020 12:54:38 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:42<01:26,  2.22s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:23,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.22s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.21s/it]\u001b[A10/02/2020 12:54:53 - INFO - __main__ -   \n",
      "10/02/2020 12:54:53 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:53 - INFO - __main__ -   Validation Acc improved! 0.8112745098039216 -> 0.8161764705882353\n",
      "10/02/2020 12:54:53 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:53<01:14,  2.20s/it]\u001b[A10/02/2020 12:54:55 - INFO - __main__ -   \n",
      "10/02/2020 12:54:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:55 - INFO - __main__ -   Validation f1 improved! 0.8710217755443886 -> 0.8741496598639455\n",
      "10/02/2020 12:54:55 - INFO - __main__ -   Validation Acc : 0.8186274509803921\n",
      "10/02/2020 12:54:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:55 - INFO - __main__ -   \n",
      "10/02/2020 12:54:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:54:55 - INFO - __main__ -   Validation Acc improved! 0.8161764705882353 -> 0.8186274509803921\n",
      "10/02/2020 12:54:55 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:12,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.22s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.21s/it]\u001b[A10/02/2020 12:55:06 - INFO - __main__ -   \n",
      "10/02/2020 12:55:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:06 - INFO - __main__ -   Validation f1 improved! 0.8741496598639455 -> 0.8754208754208753\n",
      "10/02/2020 12:55:06 - INFO - __main__ -   Validation Acc : 0.8186274509803921\n",
      "10/02/2020 12:55:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:02,  2.22s/it]\u001b[A10/02/2020 12:55:09 - INFO - __main__ -   \n",
      "10/02/2020 12:55:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:09 - INFO - __main__ -   Validation f1 improved! 0.8754208754208753 -> 0.8786324786324787\n",
      "10/02/2020 12:55:09 - INFO - __main__ -   Validation Acc : 0.8259803921568627\n",
      "10/02/2020 12:55:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:09 - INFO - __main__ -   \n",
      "10/02/2020 12:55:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:09 - INFO - __main__ -   Validation Acc improved! 0.8186274509803921 -> 0.8259803921568627\n",
      "10/02/2020 12:55:09 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.21s/it]\u001b[A10/02/2020 12:55:11 - INFO - __main__ -   \n",
      "10/02/2020 12:55:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:11 - INFO - __main__ -   Validation f1 improved! 0.8786324786324787 -> 0.8808290155440415\n",
      "10/02/2020 12:55:11 - INFO - __main__ -   Validation Acc : 0.8308823529411765\n",
      "10/02/2020 12:55:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:11 - INFO - __main__ -   \n",
      "10/02/2020 12:55:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:11 - INFO - __main__ -   Validation Acc improved! 0.8259803921568627 -> 0.8308823529411765\n",
      "10/02/2020 12:55:11 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.22s/it]\u001b[A10/02/2020 12:55:13 - INFO - __main__ -   \n",
      "10/02/2020 12:55:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:13 - INFO - __main__ -   Validation f1 improved! 0.8808290155440415 -> 0.8828522920203735\n",
      "10/02/2020 12:55:13 - INFO - __main__ -   Validation Acc : 0.8308823529411765\n",
      "10/02/2020 12:55:13 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:51,  2.22s/it]\u001b[A10/02/2020 12:55:20 - INFO - __main__ -   \n",
      "10/02/2020 12:55:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:20 - INFO - __main__ -   Validation f1 improved! 0.8828522920203735 -> 0.8842832469775476\n",
      "10/02/2020 12:55:20 - INFO - __main__ -   Validation Acc : 0.8357843137254902\n",
      "10/02/2020 12:55:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:20 - INFO - __main__ -   \n",
      "10/02/2020 12:55:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:20 - INFO - __main__ -   Validation Acc improved! 0.8308823529411765 -> 0.8357843137254902\n",
      "10/02/2020 12:55:20 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A10/02/2020 12:55:22 - INFO - __main__ -   \n",
      "10/02/2020 12:55:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:22 - INFO - __main__ -   Validation f1 improved! 0.8842832469775476 -> 0.8853615520282188\n",
      "10/02/2020 12:55:22 - INFO - __main__ -   Validation Acc : 0.8406862745098039\n",
      "10/02/2020 12:55:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:22 - INFO - __main__ -   \n",
      "10/02/2020 12:55:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:22 - INFO - __main__ -   Validation Acc improved! 0.8357843137254902 -> 0.8406862745098039\n",
      "10/02/2020 12:55:22 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.21s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:24<00:44,  2.21s/it]\u001b[A10/02/2020 12:55:26 - INFO - __main__ -   \n",
      "10/02/2020 12:55:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:26 - INFO - __main__ -   Validation f1 improved! 0.8853615520282188 -> 0.8873239436619718\n",
      "10/02/2020 12:55:26 - INFO - __main__ -   Validation Acc : 0.8431372549019608\n",
      "10/02/2020 12:55:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:26 - INFO - __main__ -   \n",
      "10/02/2020 12:55:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:26 - INFO - __main__ -   Validation Acc improved! 0.8406862745098039 -> 0.8431372549019608\n",
      "10/02/2020 12:55:26 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:42,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.22s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.22s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:35<00:33,  2.20s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:30,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.22s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A10/02/2020 12:55:51 - INFO - __main__ -   \n",
      "10/02/2020 12:55:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:55:51 - INFO - __main__ -   Validation f1 improved! 0.8873239436619718 -> 0.888888888888889\n",
      "10/02/2020 12:55:51 - INFO - __main__ -   Validation Acc : 0.8431372549019608\n",
      "10/02/2020 12:55:51 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.22s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:55<00:13,  2.22s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.22s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.22s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:06<00:02,  2.21s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:08<00:00,  2.21s/it]\u001b[A\n",
      "Epoch:  33%|████████████▎                        | 2/6 [04:19<08:42, 130.56s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:07,  2.23s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:05,  2.23s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:02,  2.22s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<02:00,  2.22s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:11<01:57,  2.21s/it]\u001b[A10/02/2020 12:56:21 - INFO - __main__ -   \n",
      "10/02/2020 12:56:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:56:21 - INFO - __main__ -   Validation f1 improved! 0.888888888888889 -> 0.8900169204737733\n",
      "10/02/2020 12:56:21 - INFO - __main__ -   Validation Acc : 0.8406862745098039\n",
      "10/02/2020 12:56:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:55,  2.22s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.21s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:51,  2.22s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.22s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.21s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.22s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:31<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:35,  2.22s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.22s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:42<01:26,  2.21s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.22s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.21s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.21s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:53<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.22s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.21s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.22s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.20s/it]\u001b[A10/02/2020 12:57:19 - INFO - __main__ -   \n",
      "10/02/2020 12:57:19 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:57:19 - INFO - __main__ -   Validation f1 improved! 0.8900169204737733 -> 0.8904109589041096\n",
      "10/02/2020 12:57:19 - INFO - __main__ -   Validation Acc : 0.8431372549019608\n",
      "10/02/2020 12:57:19 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A10/02/2020 12:57:21 - INFO - __main__ -   \n",
      "10/02/2020 12:57:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:57:21 - INFO - __main__ -   Validation f1 improved! 0.8904109589041096 -> 0.8923076923076922\n",
      "10/02/2020 12:57:21 - INFO - __main__ -   Validation Acc : 0.8455882352941176\n",
      "10/02/2020 12:57:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:57:21 - INFO - __main__ -   \n",
      "10/02/2020 12:57:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:57:21 - INFO - __main__ -   Validation Acc improved! 0.8431372549019608 -> 0.8455882352941176\n",
      "10/02/2020 12:57:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:13<00:55,  2.21s/it]\u001b[A10/02/2020 12:57:23 - INFO - __main__ -   \n",
      "10/02/2020 12:57:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:57:23 - INFO - __main__ -   Validation f1 improved! 0.8923076923076922 -> 0.8930390492359932\n",
      "10/02/2020 12:57:23 - INFO - __main__ -   Validation Acc : 0.8455882352941176\n",
      "10/02/2020 12:57:23 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.22s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.22s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.21s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:24<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:41,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:35<00:33,  2.21s/it]\u001b[A10/02/2020 12:57:45 - INFO - __main__ -   \n",
      "10/02/2020 12:57:45 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:57:45 - INFO - __main__ -   Validation f1 improved! 0.8930390492359932 -> 0.8934010152284263\n",
      "10/02/2020 12:57:45 - INFO - __main__ -   Validation Acc : 0.8455882352941176\n",
      "10/02/2020 12:57:45 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:31,  2.22s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.22s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:55<00:13,  2.22s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.22s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:06<00:02,  2.21s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:08<00:00,  2.21s/it]\u001b[A\n",
      "Epoch:  50%|██████████████████▌                  | 3/6 [06:27<06:29, 129.81s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:07,  2.24s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:05,  2.24s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:02,  2.22s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<02:00,  2.23s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:11<01:57,  2.22s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:55,  2.22s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.21s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.22s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.22s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.21s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.22s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A10/02/2020 12:58:49 - INFO - __main__ -   \n",
      "10/02/2020 12:58:49 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:58:49 - INFO - __main__ -   Validation f1 improved! 0.8934010152284263 -> 0.8934707903780068\n",
      "10/02/2020 12:58:49 - INFO - __main__ -   Validation Acc : 0.8480392156862745\n",
      "10/02/2020 12:58:49 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:58:49 - INFO - __main__ -   \n",
      "10/02/2020 12:58:49 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:58:49 - INFO - __main__ -   Validation Acc improved! 0.8455882352941176 -> 0.8480392156862745\n",
      "10/02/2020 12:58:49 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:35,  2.22s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.22s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:42<01:26,  2.22s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:23,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.20s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.21s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:53<01:15,  2.22s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:12,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.20s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.21s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:02,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.21s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.22s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:51,  2.22s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.22s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:24<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:42,  2.22s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:35<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:31,  2.22s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.22s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.20s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.22s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:55<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.22s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:06<00:02,  2.22s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:08<00:00,  2.21s/it]\u001b[A\n",
      "Epoch:  67%|████████████████████████▋            | 4/6 [08:35<04:18, 129.27s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:04,  2.18s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.19s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A10/02/2020 13:00:33 - INFO - __main__ -   \n",
      "10/02/2020 13:00:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:00:33 - INFO - __main__ -   Validation f1 improved! 0.8934707903780068 -> 0.8938356164383561\n",
      "10/02/2020 13:00:33 - INFO - __main__ -   Validation Acc : 0.8480392156862745\n",
      "10/02/2020 13:00:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:55,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:54,  2.20s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.20s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.21s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:47,  2.20s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.21s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.20s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.21s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:34,  2.21s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A10/02/2020 13:01:04 - INFO - __main__ -   \n",
      "10/02/2020 13:01:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:01:04 - INFO - __main__ -   Validation f1 improved! 0.8938356164383561 -> 0.8953687821612349\n",
      "10/02/2020 13:01:04 - INFO - __main__ -   Validation Acc : 0.8504901960784313\n",
      "10/02/2020 13:01:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:01:04 - INFO - __main__ -   \n",
      "10/02/2020 13:01:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:01:04 - INFO - __main__ -   Validation Acc improved! 0.8480392156862745 -> 0.8504901960784313\n",
      "10/02/2020 13:01:04 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.21s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.21s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.22s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.22s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.21s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.22s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.21s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:52,  2.20s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.22s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.21s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:41,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:30,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.22s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.22s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.22s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.20s/it]\u001b[A\n",
      "Epoch:  83%|██████████████████████████████▊      | 5/6 [10:43<02:08, 128.85s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:05,  2.19s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.19s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:56,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:54,  2.20s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.20s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.21s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:47,  2.20s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.21s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.21s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.21s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.22s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:34,  2.21s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.20s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.22s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:23,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.20s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.21s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.22s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:12,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.22s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.21s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.21s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:02,  2.22s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.21s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.22s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.21s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:41,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:31,  2.22s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.22s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.22s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A10/02/2020 13:04:30 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A10/02/2020 13:04:32 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.22s/it]\u001b[A10/02/2020 13:04:34 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A10/02/2020 13:04:36 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.22s/it]\u001b[A10/02/2020 13:04:38 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.20s/it]\u001b[A\n",
      "Epoch: 100%|█████████████████████████████████████| 6/6 [12:51<00:00, 128.58s/it]\n",
      "10/02/2020 13:04:40 - INFO - __main__ -   \n",
      "10/02/2020 13:04:40 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:04:40 - INFO - __main__ -   Validation Accuracy : 0.8504901960784313 Validation f1 : 0.8953687821612349\n",
      "10/02/2020 13:04:40 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_MRPC.py  # best validation F1: 89.53"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 16,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 13:04:42 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 13:04:43 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 13:04:43 - INFO - src.argument_parser -   random seed = 85600668\n",
      "10/02/2020 13:04:43 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 13:04:43 - INFO - __main__ -   Seed changed to : 85600668\n",
      "10/02/2020 13:04:43 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 13:04:43 - INFO - __main__ -   teacher acc = 94.11, teacher loss = 0.17171\n",
      "10/02/2020 13:04:43 - INFO - __main__ -   teacher acc = 89.22, teacher loss = 0.26664\n",
      "10/02/2020 13:04:43 - INFO - src.nli_data_processing -   Writing example 0 of 3668\n",
      "10/02/2020 13:04:46 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 13:04:46 - INFO - __main__ -     Num examples = 3668\n",
      "10/02/2020 13:04:46 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 13:04:46 - INFO - __main__ -     Num steps = 342\n",
      "10/02/2020 13:04:46 - INFO - src.nli_data_processing -   Writing example 0 of 408\n",
      "10/02/2020 13:04:46 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 13:04:46 - INFO - __main__ -     Num examples = 408\n",
      "10/02/2020 13:04:46 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 13:04:46 - INFO - src.nli_data_processing -   Writing example 0 of 1725\n",
      "10/02/2020 13:04:47 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 13:04:47 - INFO - __main__ -     Num examples = 1725\n",
      "10/02/2020 13:04:47 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 13:04:47 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 13:04:47 - INFO - src.nli_data_processing -   predicting for MRPC\n",
      "10/02/2020 13:04:47 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 13:04:47 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 13:04:49 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 13:04:52 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 13:04:52 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 13:04:52 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 13:04:52 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 13:04:52 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 13:04:52 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 13:04:57 - INFO - __main__ -   \n",
      "10/02/2020 13:04:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:04:57 - INFO - __main__ -   Validation f1 improved! 0 -> 0.0071428571428571435\n",
      "10/02/2020 13:04:57 - INFO - __main__ -   Validation Acc : 0.31862745098039214\n",
      "10/02/2020 13:04:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:04:57 - INFO - __main__ -   \n",
      "10/02/2020 13:04:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:04:57 - INFO - __main__ -   Validation Acc improved! 0 -> 0.31862745098039214\n",
      "10/02/2020 13:04:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   2%|▌                                | 1/58 [00:05<05:25,  5.71s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:07<04:19,  4.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:09<03:33,  3.89s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:12<03:01,  3.37s/it]\u001b[A10/02/2020 13:05:06 - INFO - __main__ -   \n",
      "10/02/2020 13:05:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:06 - INFO - __main__ -   Validation f1 improved! 0.0071428571428571435 -> 0.04123711340206186\n",
      "10/02/2020 13:05:06 - INFO - __main__ -   Validation Acc : 0.3161764705882353\n",
      "10/02/2020 13:05:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   9%|██▊                              | 5/58 [00:14<02:39,  3.01s/it]\u001b[A10/02/2020 13:05:08 - INFO - __main__ -   \n",
      "10/02/2020 13:05:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:08 - INFO - __main__ -   Validation f1 improved! 0.04123711340206186 -> 0.38740920096852294\n",
      "10/02/2020 13:05:08 - INFO - __main__ -   Validation Acc : 0.3799019607843137\n",
      "10/02/2020 13:05:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:08 - INFO - __main__ -   \n",
      "10/02/2020 13:05:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:08 - INFO - __main__ -   Validation Acc improved! 0.31862745098039214 -> 0.3799019607843137\n",
      "10/02/2020 13:05:08 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 6/58 [00:16<02:23,  2.76s/it]\u001b[A10/02/2020 13:05:10 - INFO - __main__ -   \n",
      "10/02/2020 13:05:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:10 - INFO - __main__ -   Validation f1 improved! 0.38740920096852294 -> 0.807017543859649\n",
      "10/02/2020 13:05:10 - INFO - __main__ -   Validation Acc : 0.6764705882352942\n",
      "10/02/2020 13:05:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:10 - INFO - __main__ -   \n",
      "10/02/2020 13:05:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:10 - INFO - __main__ -   Validation Acc improved! 0.3799019607843137 -> 0.6764705882352942\n",
      "10/02/2020 13:05:10 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  12%|███▉                             | 7/58 [00:18<02:12,  2.61s/it]\u001b[A10/02/2020 13:05:13 - INFO - __main__ -   \n",
      "10/02/2020 13:05:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:13 - INFO - __main__ -   Validation f1 improved! 0.807017543859649 -> 0.8122270742358079\n",
      "10/02/2020 13:05:13 - INFO - __main__ -   Validation Acc : 0.6838235294117647\n",
      "10/02/2020 13:05:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:13 - INFO - __main__ -   \n",
      "10/02/2020 13:05:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:13 - INFO - __main__ -   Validation Acc improved! 0.6764705882352942 -> 0.6838235294117647\n",
      "10/02/2020 13:05:13 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  14%|████▌                            | 8/58 [00:20<02:04,  2.49s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:23<01:58,  2.43s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:25<01:53,  2.37s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:27<01:50,  2.34s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:30<01:47,  2.34s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:32<01:44,  2.32s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:34<01:42,  2.32s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:36<01:39,  2.31s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:39<01:37,  2.32s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:41<01:34,  2.30s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:43<01:31,  2.29s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:46<01:29,  2.28s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:48<01:25,  2.26s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:50<01:23,  2.25s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:52<01:20,  2.23s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:54<01:17,  2.22s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:57<01:14,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:59<01:12,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [01:01<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [01:03<01:08,  2.20s/it]\u001b[A10/02/2020 13:05:57 - INFO - __main__ -   \n",
      "10/02/2020 13:05:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:57 - INFO - __main__ -   Validation f1 improved! 0.8122270742358079 -> 0.8169838945827232\n",
      "10/02/2020 13:05:57 - INFO - __main__ -   Validation Acc : 0.6936274509803921\n",
      "10/02/2020 13:05:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:57 - INFO - __main__ -   \n",
      "10/02/2020 13:05:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:05:57 - INFO - __main__ -   Validation Acc improved! 0.6838235294117647 -> 0.6936274509803921\n",
      "10/02/2020 13:05:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:05<01:05,  2.19s/it]\u001b[A10/02/2020 13:06:00 - INFO - __main__ -   \n",
      "10/02/2020 13:06:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:00 - INFO - __main__ -   Validation f1 improved! 0.8169838945827232 -> 0.8207407407407408\n",
      "10/02/2020 13:06:00 - INFO - __main__ -   Validation Acc : 0.7034313725490197\n",
      "10/02/2020 13:06:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:00 - INFO - __main__ -   \n",
      "10/02/2020 13:06:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:00 - INFO - __main__ -   Validation Acc improved! 0.6936274509803921 -> 0.7034313725490197\n",
      "10/02/2020 13:06:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  50%|████████████████                | 29/58 [01:08<01:03,  2.18s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:10<01:00,  2.17s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:12<00:58,  2.17s/it]\u001b[A10/02/2020 13:06:06 - INFO - __main__ -   \n",
      "10/02/2020 13:06:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:06 - INFO - __main__ -   Validation f1 improved! 0.8207407407407408 -> 0.8212121212121212\n",
      "10/02/2020 13:06:06 - INFO - __main__ -   Validation Acc : 0.7107843137254902\n",
      "10/02/2020 13:06:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:06 - INFO - __main__ -   \n",
      "10/02/2020 13:06:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:06 - INFO - __main__ -   Validation Acc improved! 0.7034313725490197 -> 0.7107843137254902\n",
      "10/02/2020 13:06:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:14<00:56,  2.17s/it]\u001b[A10/02/2020 13:06:08 - INFO - __main__ -   \n",
      "10/02/2020 13:06:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:08 - INFO - __main__ -   Validation f1 improved! 0.8212121212121212 -> 0.8217522658610272\n",
      "10/02/2020 13:06:08 - INFO - __main__ -   Validation Acc : 0.7107843137254902\n",
      "10/02/2020 13:06:08 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:16<00:54,  2.16s/it]\u001b[A10/02/2020 13:06:10 - INFO - __main__ -   \n",
      "10/02/2020 13:06:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:10 - INFO - __main__ -   Validation f1 improved! 0.8217522658610272 -> 0.8246153846153846\n",
      "10/02/2020 13:06:10 - INFO - __main__ -   Validation Acc : 0.7205882352941176\n",
      "10/02/2020 13:06:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:10 - INFO - __main__ -   \n",
      "10/02/2020 13:06:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:10 - INFO - __main__ -   Validation Acc improved! 0.7107843137254902 -> 0.7205882352941176\n",
      "10/02/2020 13:06:10 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:18<00:51,  2.16s/it]\u001b[A10/02/2020 13:06:13 - INFO - __main__ -   \n",
      "10/02/2020 13:06:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:13 - INFO - __main__ -   Validation Acc improved! 0.7205882352941176 -> 0.75\n",
      "10/02/2020 13:06:13 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:21<00:50,  2.17s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:23<00:48,  2.18s/it]\u001b[A10/02/2020 13:06:17 - INFO - __main__ -   \n",
      "10/02/2020 13:06:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:17 - INFO - __main__ -   Validation Acc improved! 0.75 -> 0.7549019607843137\n",
      "10/02/2020 13:06:17 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:25<00:45,  2.18s/it]\u001b[A10/02/2020 13:06:19 - INFO - __main__ -   \n",
      "10/02/2020 13:06:19 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:19 - INFO - __main__ -   Validation f1 improved! 0.8246153846153846 -> 0.8375\n",
      "10/02/2020 13:06:19 - INFO - __main__ -   Validation Acc : 0.7450980392156863\n",
      "10/02/2020 13:06:19 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:27<00:43,  2.18s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:29<00:41,  2.17s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:31<00:39,  2.17s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:34<00:37,  2.19s/it]\u001b[A10/02/2020 13:06:28 - INFO - __main__ -   \n",
      "10/02/2020 13:06:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:28 - INFO - __main__ -   Validation f1 improved! 0.8375 -> 0.8379204892966362\n",
      "10/02/2020 13:06:28 - INFO - __main__ -   Validation Acc : 0.7401960784313726\n",
      "10/02/2020 13:06:28 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:36<00:34,  2.19s/it]\u001b[A10/02/2020 13:06:30 - INFO - __main__ -   \n",
      "10/02/2020 13:06:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:30 - INFO - __main__ -   Validation f1 improved! 0.8379204892966362 -> 0.8462757527733756\n",
      "10/02/2020 13:06:30 - INFO - __main__ -   Validation Acc : 0.7622549019607843\n",
      "10/02/2020 13:06:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:30 - INFO - __main__ -   \n",
      "10/02/2020 13:06:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:30 - INFO - __main__ -   Validation Acc improved! 0.7549019607843137 -> 0.7622549019607843\n",
      "10/02/2020 13:06:30 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:38<00:32,  2.20s/it]\u001b[A10/02/2020 13:06:32 - INFO - __main__ -   \n",
      "10/02/2020 13:06:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:32 - INFO - __main__ -   Validation Acc improved! 0.7622549019607843 -> 0.7647058823529411\n",
      "10/02/2020 13:06:32 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:40<00:30,  2.21s/it]\u001b[A10/02/2020 13:06:34 - INFO - __main__ -   \n",
      "10/02/2020 13:06:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:34 - INFO - __main__ -   Validation f1 improved! 0.8462757527733756 -> 0.8525641025641025\n",
      "10/02/2020 13:06:34 - INFO - __main__ -   Validation Acc : 0.7745098039215687\n",
      "10/02/2020 13:06:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:34 - INFO - __main__ -   \n",
      "10/02/2020 13:06:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:34 - INFO - __main__ -   Validation Acc improved! 0.7647058823529411 -> 0.7745098039215687\n",
      "10/02/2020 13:06:34 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:42<00:28,  2.20s/it]\u001b[A10/02/2020 13:06:37 - INFO - __main__ -   \n",
      "10/02/2020 13:06:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:37 - INFO - __main__ -   Validation f1 improved! 0.8525641025641025 -> 0.8621236133122028\n",
      "10/02/2020 13:06:37 - INFO - __main__ -   Validation Acc : 0.7867647058823529\n",
      "10/02/2020 13:06:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:37 - INFO - __main__ -   \n",
      "10/02/2020 13:06:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:06:37 - INFO - __main__ -   Validation Acc improved! 0.7745098039215687 -> 0.7867647058823529\n",
      "10/02/2020 13:06:37 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:45<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:47<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:49<00:22,  2.22s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:51<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:54<00:17,  2.22s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:56<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:58<00:13,  2.22s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [02:00<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [02:02<00:08,  2.22s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:05<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:07<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:09<00:02,  2.21s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:11<00:00,  2.27s/it]\u001b[A\n",
      "Epoch:  17%|██████▏                              | 1/6 [02:11<10:57, 131.44s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:05,  2.20s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.20s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A10/02/2020 13:07:14 - INFO - __main__ -   \n",
      "10/02/2020 13:07:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:14 - INFO - __main__ -   Validation Acc improved! 0.7867647058823529 -> 0.7916666666666666\n",
      "10/02/2020 13:07:14 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:56,  2.19s/it]\u001b[A10/02/2020 13:07:16 - INFO - __main__ -   \n",
      "10/02/2020 13:07:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:16 - INFO - __main__ -   Validation Acc improved! 0.7916666666666666 -> 0.7965686274509803\n",
      "10/02/2020 13:07:16 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:53,  2.19s/it]\u001b[A10/02/2020 13:07:18 - INFO - __main__ -   \n",
      "10/02/2020 13:07:18 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:18 - INFO - __main__ -   Validation Acc improved! 0.7965686274509803 -> 0.7990196078431373\n",
      "10/02/2020 13:07:18 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:51,  2.19s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.20s/it]\u001b[A10/02/2020 13:07:23 - INFO - __main__ -   \n",
      "10/02/2020 13:07:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:23 - INFO - __main__ -   Validation f1 improved! 0.8621236133122028 -> 0.8636363636363636\n",
      "10/02/2020 13:07:23 - INFO - __main__ -   Validation Acc : 0.7941176470588235\n",
      "10/02/2020 13:07:23 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:47,  2.20s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:21<01:46,  2.21s/it]\u001b[A10/02/2020 13:07:27 - INFO - __main__ -   \n",
      "10/02/2020 13:07:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:27 - INFO - __main__ -   Validation f1 improved! 0.8636363636363636 -> 0.8668831168831169\n",
      "10/02/2020 13:07:27 - INFO - __main__ -   Validation Acc : 0.7990196078431373\n",
      "10/02/2020 13:07:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.20s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.20s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:35,  2.22s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A10/02/2020 13:07:40 - INFO - __main__ -   \n",
      "10/02/2020 13:07:40 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:40 - INFO - __main__ -   Validation f1 improved! 0.8668831168831169 -> 0.8715447154471545\n",
      "10/02/2020 13:07:40 - INFO - __main__ -   Validation Acc : 0.8063725490196079\n",
      "10/02/2020 13:07:40 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:40 - INFO - __main__ -   \n",
      "10/02/2020 13:07:40 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:40 - INFO - __main__ -   Validation Acc improved! 0.7990196078431373 -> 0.8063725490196079\n",
      "10/02/2020 13:07:40 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.21s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:23,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:22,  2.22s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A10/02/2020 13:07:54 - INFO - __main__ -   \n",
      "10/02/2020 13:07:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:07:54 - INFO - __main__ -   Validation Acc improved! 0.8063725490196079 -> 0.8137254901960784\n",
      "10/02/2020 13:07:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.22s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.22s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.22s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.21s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.21s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.22s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.22s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.21s/it]\u001b[A10/02/2020 13:08:27 - INFO - __main__ -   \n",
      "10/02/2020 13:08:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:08:27 - INFO - __main__ -   Validation f1 improved! 0.8715447154471545 -> 0.8741721854304637\n",
      "10/02/2020 13:08:27 - INFO - __main__ -   Validation Acc : 0.8137254901960784\n",
      "10/02/2020 13:08:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:42,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.22s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.22s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:30,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.22s/it]\u001b[A10/02/2020 13:08:45 - INFO - __main__ -   \n",
      "10/02/2020 13:08:45 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:08:45 - INFO - __main__ -   Validation Acc improved! 0.8137254901960784 -> 0.8235294117647058\n",
      "10/02/2020 13:08:45 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.22s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A10/02/2020 13:09:07 - INFO - __main__ -   \n",
      "10/02/2020 13:09:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:09:07 - INFO - __main__ -   Validation f1 improved! 0.8741721854304637 -> 0.8752136752136752\n",
      "10/02/2020 13:09:07 - INFO - __main__ -   Validation Acc : 0.821078431372549\n",
      "10/02/2020 13:09:07 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.21s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.20s/it]\u001b[A\n",
      "Epoch:  33%|████████████▎                        | 2/6 [04:19<08:41, 130.36s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:05,  2.20s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:03,  2.20s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:56,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:54,  2.19s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.20s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:49,  2.20s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:45,  2.20s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.20s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.21s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:34,  2.21s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:33,  2.22s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.22s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.21s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.22s/it]\u001b[A10/02/2020 13:09:57 - INFO - __main__ -   \n",
      "10/02/2020 13:09:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:09:57 - INFO - __main__ -   Validation f1 improved! 0.8752136752136752 -> 0.8786885245901639\n",
      "10/02/2020 13:09:57 - INFO - __main__ -   Validation Acc : 0.8186274509803921\n",
      "10/02/2020 13:09:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.21s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.22s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.21s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:14,  2.20s/it]\u001b[A10/02/2020 13:10:06 - INFO - __main__ -   \n",
      "10/02/2020 13:10:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:10:06 - INFO - __main__ -   Validation f1 improved! 0.8786885245901639 -> 0.881578947368421\n",
      "10/02/2020 13:10:06 - INFO - __main__ -   Validation Acc : 0.8235294117647058\n",
      "10/02/2020 13:10:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.21s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.20s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.21s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.22s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.22s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.20s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:53,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.20s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:42,  2.22s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.22s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:30,  2.20s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.22s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.20s/it]\u001b[A10/02/2020 13:11:01 - INFO - __main__ -   \n",
      "10/02/2020 13:11:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:11:01 - INFO - __main__ -   Validation f1 improved! 0.881578947368421 -> 0.8821548821548822\n",
      "10/02/2020 13:11:01 - INFO - __main__ -   Validation Acc : 0.8284313725490197\n",
      "10/02/2020 13:11:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:11:01 - INFO - __main__ -   \n",
      "10/02/2020 13:11:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:11:01 - INFO - __main__ -   Validation Acc improved! 0.8235294117647058 -> 0.8284313725490197\n",
      "10/02/2020 13:11:01 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A10/02/2020 13:11:03 - INFO - __main__ -   \n",
      "10/02/2020 13:11:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:11:03 - INFO - __main__ -   Validation f1 improved! 0.8821548821548822 -> 0.8866328257191202\n",
      "10/02/2020 13:11:03 - INFO - __main__ -   Validation Acc : 0.8357843137254902\n",
      "10/02/2020 13:11:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:11:03 - INFO - __main__ -   \n",
      "10/02/2020 13:11:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:11:03 - INFO - __main__ -   Validation Acc improved! 0.8284313725490197 -> 0.8357843137254902\n",
      "10/02/2020 13:11:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.22s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.22s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.22s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.22s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.20s/it]\u001b[A\n",
      "Epoch:  50%|██████████████████▌                  | 3/6 [06:27<06:28, 129.61s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:04,  2.18s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.18s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.18s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:57,  2.18s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:56,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:53,  2.19s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.20s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.21s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:47,  2.20s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.21s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.20s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.20s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:35,  2.22s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.22s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:26,  2.22s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:23,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:22,  2.22s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.22s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.22s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.22s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.22s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.22s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.22s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:52,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.22s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:42,  2.22s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:30,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.22s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.22s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.22s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.22s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.21s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.21s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.20s/it]\u001b[A\n",
      "Epoch:  67%|████████████████████████▋            | 4/6 [08:35<04:18, 129.09s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:05,  2.19s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:02,  2.19s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:00,  2.19s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<01:58,  2.19s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:10<01:56,  2.19s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:53,  2.19s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.20s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.20s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:21<01:45,  2.20s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:44,  2.21s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.21s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.21s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:30<01:37,  2.21s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:35,  2.22s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:32,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.20s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.22s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:41<01:25,  2.20s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:21,  2.21s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.22s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.21s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:52<01:15,  2.22s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:12,  2.21s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.22s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.21s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.22s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.21s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:01,  2.21s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.22s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:12<00:55,  2.21s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:52,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.21s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.22s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:23<00:44,  2.21s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:41,  2.20s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.21s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:34<00:33,  2.21s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:31,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.21s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.22s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.21s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.22s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.21s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:54<00:13,  2.22s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.22s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:05<00:02,  2.21s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:07<00:00,  2.21s/it]\u001b[A\n",
      "Epoch:  83%|██████████████████████████████▊      | 5/6 [10:42<02:08, 128.74s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<02:07,  2.24s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:05,  2.24s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:06<02:02,  2.22s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:08<02:00,  2.23s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:11<01:57,  2.22s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:13<01:55,  2.22s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:15<01:52,  2.21s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:17<01:50,  2.22s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:19<01:48,  2.21s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:22<01:46,  2.22s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:24<01:43,  2.21s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:26<01:41,  2.22s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:28<01:39,  2.21s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:31<01:37,  2.22s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:33<01:34,  2.21s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:35<01:33,  2.21s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:37<01:30,  2.21s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:39<01:28,  2.21s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:42<01:25,  2.20s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  34%|███████████                     | 20/58 [00:44<01:24,  2.21s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:46<01:22,  2.22s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:48<01:19,  2.21s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:50<01:17,  2.22s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:53<01:15,  2.21s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:55<01:13,  2.22s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:57<01:10,  2.21s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:59<01:08,  2.21s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:01<01:06,  2.21s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:04<01:04,  2.21s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:06<01:02,  2.22s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:08<00:59,  2.21s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:10<00:57,  2.21s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:13<00:55,  2.22s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:15<00:52,  2.21s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:17<00:50,  2.21s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:19<00:48,  2.20s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [01:21<00:46,  2.21s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [01:24<00:44,  2.22s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [01:26<00:41,  2.21s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [01:28<00:39,  2.21s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [01:30<00:37,  2.20s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [01:32<00:35,  2.21s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [01:35<00:33,  2.22s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [01:37<00:30,  2.21s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:39<00:28,  2.21s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:41<00:26,  2.22s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:43<00:24,  2.21s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:46<00:22,  2.21s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:48<00:19,  2.22s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:50<00:17,  2.21s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:52<00:15,  2.22s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:55<00:13,  2.21s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:57<00:11,  2.21s/it]\u001b[A10/02/2020 13:17:32 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:59<00:08,  2.22s/it]\u001b[A10/02/2020 13:17:34 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:01<00:06,  2.21s/it]\u001b[A10/02/2020 13:17:37 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [02:03<00:04,  2.21s/it]\u001b[A10/02/2020 13:17:39 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [02:06<00:02,  2.20s/it]\u001b[A10/02/2020 13:17:41 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [02:08<00:00,  2.21s/it]\u001b[A\n",
      "Epoch: 100%|█████████████████████████████████████| 6/6 [12:50<00:00, 128.50s/it]\n",
      "10/02/2020 13:17:43 - INFO - __main__ -   \n",
      "10/02/2020 13:17:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:17:43 - INFO - __main__ -   Validation Accuracy : 0.8357843137254902 Validation f1 : 0.8866328257191202\n",
      "10/02/2020 13:17:43 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_MRPC.py  # --> 88.66 (best validation F1, in %)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [3] PTP pretraining with SPS-student"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Settings: [seed = 80301814 / train_batch_size = 64 / learning_rate = 2e-5 / epoch = 6]\n",
    "\n",
    "Note that line 285 of `PTP_MRPC.py` is `(global_step %2 == 1) & (epoch > 0)`."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 23,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 13:42:42 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 13:42:43 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 13:42:43 - INFO - src.argument_parser -   random seed = 48622814\n",
      "10/02/2020 13:42:43 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 13:42:43 - INFO - __main__ -   Input Argument Information\n",
      "10/02/2020 13:42:44 - INFO - __main__ -   \n",
      "10/02/2020 13:42:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:42:44 - INFO - __main__ -   PTP_label.eq(0).sum() = tensor(1775)\n",
      "10/02/2020 13:42:44 - INFO - __main__ -   PTP_label.eq(1).sum() = tensor(1677)\n",
      "10/02/2020 13:42:44 - INFO - __main__ -   PTP_label.eq(2).sum() = tensor(105)\n",
      "10/02/2020 13:42:44 - INFO - __main__ -   PTP_label.eq(3).sum() = tensor(111)\n",
      "10/02/2020 13:42:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:42:44 - INFO - src.nli_data_processing -   Initializing model for pretraining\n",
      "10/02/2020 13:42:44 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 13:42:44 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 13:42:45 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 13:42:48 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 13:42:48 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 13:42:48 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 13:42:48 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 13:42:48 - INFO - __main__ -   num parameters in student model are 66955008 and 3076\n",
      "10/02/2020 13:42:48 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:04<03:58,  4.19s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:04<02:55,  3.13s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:05<02:11,  2.38s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:06<01:40,  1.87s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:06<01:19,  1.50s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:07<01:04,  1.25s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:08<00:54,  1.08s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:08<00:47,  1.05it/s]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:09<00:42,  1.16it/s]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:10<00:38,  1.25it/s]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:10<00:35,  1.32it/s]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:11<00:33,  1.37it/s]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:12<00:31,  1.41it/s]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:12<00:30,  1.44it/s]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:13<00:29,  1.46it/s]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:14<00:28,  1.47it/s]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:14<00:27,  1.48it/s]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:15<00:26,  1.49it/s]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:16<00:26,  1.49it/s]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:16<00:25,  1.49it/s]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:17<00:24,  1.49it/s]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:18<00:24,  1.49it/s]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:18<00:23,  1.49it/s]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:19<00:22,  1.49it/s]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:20<00:22,  1.49it/s]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:20<00:21,  1.49it/s]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:21<00:21,  1.46it/s]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [00:22<00:20,  1.47it/s]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [00:22<00:19,  1.47it/s]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [00:23<00:19,  1.47it/s]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [00:24<00:18,  1.47it/s]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [00:24<00:17,  1.47it/s]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [00:25<00:17,  1.47it/s]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [00:26<00:16,  1.47it/s]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [00:26<00:15,  1.46it/s]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [00:27<00:15,  1.46it/s]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [00:28<00:14,  1.46it/s]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [00:29<00:13,  1.45it/s]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [00:29<00:13,  1.46it/s]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [00:30<00:12,  1.45it/s]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [00:31<00:11,  1.45it/s]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [00:31<00:11,  1.45it/s]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [00:32<00:10,  1.45it/s]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [00:33<00:09,  1.45it/s]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [00:33<00:08,  1.45it/s]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [00:34<00:08,  1.45it/s]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [00:35<00:07,  1.45it/s]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [00:35<00:06,  1.45it/s]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [00:36<00:06,  1.45it/s]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [00:37<00:05,  1.45it/s]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [00:37<00:04,  1.45it/s]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [00:38<00:04,  1.45it/s]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [00:39<00:03,  1.45it/s]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [00:40<00:02,  1.45it/s]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [00:40<00:02,  1.45it/s]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [00:41<00:01,  1.45it/s]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [00:42<00:00,  1.46it/s]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [00:42<00:00,  1.37it/s]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [00:42<03:31, 42.40s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A10/02/2020 13:43:33 - INFO - __main__ -   \n",
      "10/02/2020 13:43:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:33 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.7549019607843137\n",
      "10/02/2020 13:43:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:33 - INFO - __main__ -   \n",
      "10/02/2020 13:43:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:33 - INFO - __main__ -   Validation improved! 100 -> 0.7500695481019861\n",
      "10/02/2020 13:43:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   2%|▌                                | 1/58 [00:02<01:58,  2.08s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:02<01:33,  1.66s/it]\u001b[A10/02/2020 13:43:35 - INFO - __main__ -   \n",
      "10/02/2020 13:43:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:35 - INFO - __main__ -   Validation Accuracy improved! 0.7549019607843137 -> 0.7622549019607843\n",
      "10/02/2020 13:43:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:35 - INFO - __main__ -   \n",
      "10/02/2020 13:43:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:35 - INFO - __main__ -   Validation improved! 0.7500695481019861 -> 0.741798947839176\n",
      "10/02/2020 13:43:35 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 3/58 [00:04<01:38,  1.78s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:05<01:18,  1.45s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:07<01:26,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:08<01:10,  1.35s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:10<01:19,  1.56s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:10<01:04,  1.29s/it]\u001b[A10/02/2020 13:43:44 - INFO - __main__ -   \n",
      "10/02/2020 13:43:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:44 - INFO - __main__ -   Validation Accuracy improved! 0.7622549019607843 -> 0.7769607843137255\n",
      "10/02/2020 13:43:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:44 - INFO - __main__ -   \n",
      "10/02/2020 13:43:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:44 - INFO - __main__ -   Validation improved! 0.741798947839176 -> 0.720411125351401\n",
      "10/02/2020 13:43:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  16%|█████                            | 9/58 [00:13<01:14,  1.52s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:13<01:00,  1.26s/it]\u001b[A10/02/2020 13:43:46 - INFO - __main__ -   \n",
      "10/02/2020 13:43:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:43:46 - INFO - __main__ -   Validation improved! 0.720411125351401 -> 0.6879689553204704\n",
      "10/02/2020 13:43:46 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  19%|██████                          | 11/58 [00:15<01:10,  1.49s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:16<00:57,  1.25s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:18<01:06,  1.48s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:19<00:54,  1.24s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:21<01:03,  1.47s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:21<00:51,  1.23s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:23<01:00,  1.47s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:24<00:49,  1.23s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [00:26<00:56,  1.46s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [00:27<00:46,  1.22s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [00:29<00:53,  1.46s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [00:29<00:43,  1.22s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [00:31<00:50,  1.46s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [00:32<00:41,  1.22s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [00:34<00:48,  1.46s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [00:35<00:38,  1.22s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [00:37<00:45,  1.46s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [00:37<00:36,  1.22s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [00:39<00:42,  1.45s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [00:40<00:34,  1.22s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [00:42<00:39,  1.45s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [00:43<00:31,  1.22s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [00:45<00:36,  1.45s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [00:45<00:29,  1.22s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [00:47<00:33,  1.45s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [00:48<00:26,  1.22s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [00:50<00:30,  1.45s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [00:51<00:24,  1.22s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [00:53<00:27,  1.46s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [00:53<00:21,  1.22s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [00:55<00:24,  1.46s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [00:56<00:19,  1.22s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [00:58<00:21,  1.46s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [00:59<00:17,  1.22s/it]\u001b[A10/02/2020 13:44:32 - INFO - __main__ -   \n",
      "10/02/2020 13:44:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:44:32 - INFO - __main__ -   Validation Accuracy improved! 0.7769607843137255 -> 0.8014705882352942\n",
      "10/02/2020 13:44:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:44:32 - INFO - __main__ -   \n",
      "10/02/2020 13:44:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:44:32 - INFO - __main__ -   Validation improved! 0.6879689553204704 -> 0.6513415136758018\n",
      "10/02/2020 13:44:32 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [01:01<00:19,  1.47s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [01:01<00:14,  1.23s/it]\u001b[A10/02/2020 13:44:35 - INFO - __main__ -   \n",
      "10/02/2020 13:44:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:44:35 - INFO - __main__ -   Validation Accuracy improved! 0.8014705882352942 -> 0.803921568627451\n",
      "10/02/2020 13:44:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:44:35 - INFO - __main__ -   \n",
      "10/02/2020 13:44:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:44:35 - INFO - __main__ -   Validation improved! 0.6513415136758018 -> 0.6237880307085374\n",
      "10/02/2020 13:44:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:44:35 - INFO - __main__ -   Saving the model...\n",
      "\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [01:04<00:16,  1.54s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [01:04<00:12,  1.28s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [01:06<00:13,  1.51s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [01:07<00:10,  1.26s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [01:09<00:10,  1.49s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [01:10<00:07,  1.25s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [01:12<00:07,  1.49s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [01:13<00:04,  1.25s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [01:15<00:04,  1.48s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [01:15<00:02,  1.24s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [01:17<00:01,  1.48s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [01:18<00:00,  1.35s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:00<03:32, 53.11s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A10/02/2020 13:44:49 - INFO - __main__ -   \n",
      "10/02/2020 13:44:49 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 13:44:49 - INFO - __main__ -   Validation Loss : 0.6237880307085374 Validation Accuracy : 0.803921568627451\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:01<04:02, 60.59s/it]\n",
      "Traceback (most recent call last):\n",
      "  File \"KDAP/PTP_MRPC.py\", line 282, in <module>\n",
      "    raise ValueError('%s KD not found, please use kd or kd.full' % args.kd)\n",
      "AttributeError: 'Namespace' object has no attribute 'kd'\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/PTP_MRPC.py"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [4] PeaBERT6 - MRPC 92.93%"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Settings: [seed = 87139341 / train_batch_size = 64 / learning_rate = 2e-5 / epoch = 10 / alpha = 0.7 / beta = 40] \n",
    "\n",
    "The result is shown in the following cell."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 12:28:56 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 12:28:57 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 12:28:57 - INFO - src.argument_parser -   random seed = 61999997\n",
      "10/02/2020 12:28:57 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 12:28:57 - INFO - __main__ -   Seed changed to : 87139341\n",
      "10/02/2020 12:28:57 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 12:28:57 - INFO - __main__ -   teacher acc = 94.11, teacher loss = 0.17171\n",
      "10/02/2020 12:28:57 - INFO - __main__ -   teacher acc = 89.22, teacher loss = 0.26664\n",
      "10/02/2020 12:28:57 - INFO - src.nli_data_processing -   Writing example 0 of 3668\n",
      "10/02/2020 12:29:00 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 12:29:00 - INFO - __main__ -     Num examples = 3668\n",
      "10/02/2020 12:29:00 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 12:29:00 - INFO - __main__ -     Num steps = 570\n",
      "10/02/2020 12:29:00 - INFO - src.nli_data_processing -   Writing example 0 of 408\n",
      "10/02/2020 12:29:00 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 12:29:00 - INFO - __main__ -     Num examples = 408\n",
      "10/02/2020 12:29:00 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 12:29:00 - INFO - src.nli_data_processing -   Writing example 0 of 1725\n",
      "10/02/2020 12:29:01 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 12:29:01 - INFO - __main__ -     Num examples = 1725\n",
      "10/02/2020 12:29:01 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 12:29:01 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 12:29:01 - INFO - src.nli_data_processing -   predicting for MRPC\n",
      "10/02/2020 12:29:01 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 12:29:01 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 12:29:05 - INFO - src.utils -   delete 0 layers, keep 103 layers\n",
      "10/02/2020 12:29:05 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 12:29:05 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 12:29:05 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 12:29:05 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 12:29:05 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 12:29:05 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                             | 0/10 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 12:29:12 - INFO - __main__ -   \n",
      "10/02/2020 12:29:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:12 - INFO - __main__ -   Validation Acc improved! 0 -> 0.3161764705882353\n",
      "10/02/2020 12:29:12 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   2%|▌                                | 1/58 [00:06<06:19,  6.66s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:09<05:14,  5.61s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:12<04:27,  4.86s/it]\u001b[A10/02/2020 12:29:21 - INFO - __main__ -   \n",
      "10/02/2020 12:29:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:21 - INFO - __main__ -   Validation f1 improved! 0 -> 0.14617940199335547\n",
      "10/02/2020 12:29:21 - INFO - __main__ -   Validation Acc : 0.3700980392156863\n",
      "10/02/2020 12:29:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:21 - INFO - __main__ -   \n",
      "10/02/2020 12:29:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:21 - INFO - __main__ -   Validation Acc improved! 0.3161764705882353 -> 0.3700980392156863\n",
      "10/02/2020 12:29:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   7%|██▎                              | 4/58 [00:16<03:55,  4.35s/it]\u001b[A10/02/2020 12:29:25 - INFO - __main__ -   \n",
      "10/02/2020 12:29:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:25 - INFO - __main__ -   Validation f1 improved! 0.14617940199335547 -> 0.4905660377358491\n",
      "10/02/2020 12:29:25 - INFO - __main__ -   Validation Acc : 0.5367647058823529\n",
      "10/02/2020 12:29:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:25 - INFO - __main__ -   \n",
      "10/02/2020 12:29:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:25 - INFO - __main__ -   Validation Acc improved! 0.3700980392156863 -> 0.5367647058823529\n",
      "10/02/2020 12:29:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   9%|██▊                              | 5/58 [00:19<03:32,  4.00s/it]\u001b[A10/02/2020 12:29:28 - INFO - __main__ -   \n",
      "10/02/2020 12:29:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:28 - INFO - __main__ -   Validation f1 improved! 0.4905660377358491 -> 0.6822429906542057\n",
      "10/02/2020 12:29:28 - INFO - __main__ -   Validation Acc : 0.6666666666666666\n",
      "10/02/2020 12:29:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:28 - INFO - __main__ -   \n",
      "10/02/2020 12:29:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:28 - INFO - __main__ -   Validation Acc improved! 0.5367647058823529 -> 0.6666666666666666\n",
      "10/02/2020 12:29:28 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 6/58 [00:22<03:15,  3.76s/it]\u001b[A10/02/2020 12:29:31 - INFO - __main__ -   \n",
      "10/02/2020 12:29:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:31 - INFO - __main__ -   Validation f1 improved! 0.6822429906542057 -> 0.8016701461377871\n",
      "10/02/2020 12:29:31 - INFO - __main__ -   Validation Acc : 0.7671568627450981\n",
      "10/02/2020 12:29:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:31 - INFO - __main__ -   \n",
      "10/02/2020 12:29:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:31 - INFO - __main__ -   Validation Acc improved! 0.6666666666666666 -> 0.7671568627450981\n",
      "10/02/2020 12:29:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  12%|███▉                             | 7/58 [00:25<03:02,  3.58s/it]\u001b[A10/02/2020 12:29:34 - INFO - __main__ -   \n",
      "10/02/2020 12:29:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:34 - INFO - __main__ -   Validation f1 improved! 0.8016701461377871 -> 0.8631984585741811\n",
      "10/02/2020 12:29:34 - INFO - __main__ -   Validation Acc : 0.8259803921568627\n",
      "10/02/2020 12:29:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:34 - INFO - __main__ -   \n",
      "10/02/2020 12:29:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:34 - INFO - __main__ -   Validation Acc improved! 0.7671568627450981 -> 0.8259803921568627\n",
      "10/02/2020 12:29:34 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  14%|████▌                            | 8/58 [00:28<02:52,  3.45s/it]\u001b[A10/02/2020 12:29:37 - INFO - __main__ -   \n",
      "10/02/2020 12:29:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:37 - INFO - __main__ -   Validation f1 improved! 0.8631984585741811 -> 0.8844765342960288\n",
      "10/02/2020 12:29:37 - INFO - __main__ -   Validation Acc : 0.8431372549019608\n",
      "10/02/2020 12:29:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:37 - INFO - __main__ -   \n",
      "10/02/2020 12:29:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:37 - INFO - __main__ -   Validation Acc improved! 0.8259803921568627 -> 0.8431372549019608\n",
      "10/02/2020 12:29:37 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  16%|█████                            | 9/58 [00:31<02:45,  3.37s/it]\u001b[A10/02/2020 12:29:41 - INFO - __main__ -   \n",
      "10/02/2020 12:29:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:41 - INFO - __main__ -   Validation f1 improved! 0.8844765342960288 -> 0.899488926746167\n",
      "10/02/2020 12:29:41 - INFO - __main__ -   Validation Acc : 0.8553921568627451\n",
      "10/02/2020 12:29:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:41 - INFO - __main__ -   \n",
      "10/02/2020 12:29:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:29:41 - INFO - __main__ -   Validation Acc improved! 0.8431372549019608 -> 0.8553921568627451\n",
      "10/02/2020 12:29:41 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:35<02:39,  3.32s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:38<02:34,  3.29s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:41<02:30,  3.28s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:44<02:27,  3.28s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:48<02:23,  3.27s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:51<02:20,  3.27s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:54<02:17,  3.27s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:58<02:14,  3.29s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [01:01<02:11,  3.30s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [01:04<02:08,  3.31s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [01:08<02:05,  3.31s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [01:11<02:02,  3.31s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [01:14<01:58,  3.30s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [01:17<01:55,  3.30s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [01:21<01:51,  3.29s/it]\u001b[A10/02/2020 12:30:30 - INFO - __main__ -   \n",
      "10/02/2020 12:30:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:30:30 - INFO - __main__ -   Validation f1 improved! 0.899488926746167 -> 0.9\n",
      "10/02/2020 12:30:30 - INFO - __main__ -   Validation Acc : 0.8627450980392157\n",
      "10/02/2020 12:30:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:30:30 - INFO - __main__ -   \n",
      "10/02/2020 12:30:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:30:30 - INFO - __main__ -   Validation Acc improved! 0.8553921568627451 -> 0.8627450980392157\n",
      "10/02/2020 12:30:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [01:24<01:48,  3.28s/it]\u001b[A10/02/2020 12:30:33 - INFO - __main__ -   \n",
      "10/02/2020 12:30:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:30:33 - INFO - __main__ -   Validation f1 improved! 0.9 -> 0.9015544041450777\n",
      "10/02/2020 12:30:33 - INFO - __main__ -   Validation Acc : 0.8602941176470589\n",
      "10/02/2020 12:30:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [01:27<01:45,  3.28s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [01:30<01:41,  3.27s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:34<01:37,  3.26s/it]\u001b[A10/02/2020 12:30:43 - INFO - __main__ -   \n",
      "10/02/2020 12:30:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:30:43 - INFO - __main__ -   Validation f1 improved! 0.9015544041450777 -> 0.9023090586145649\n",
      "10/02/2020 12:30:43 - INFO - __main__ -   Validation Acc : 0.8651960784313726\n",
      "10/02/2020 12:30:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:30:43 - INFO - __main__ -   \n",
      "10/02/2020 12:30:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:30:43 - INFO - __main__ -   Validation Acc improved! 0.8627450980392157 -> 0.8651960784313726\n",
      "10/02/2020 12:30:43 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  50%|████████████████                | 29/58 [01:37<01:34,  3.25s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:40<01:30,  3.24s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:43<01:27,  3.23s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:47<01:23,  3.23s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:50<01:20,  3.22s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:53<01:17,  3.22s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:56<01:14,  3.22s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:59<01:10,  3.22s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [02:03<01:07,  3.22s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [02:06<01:04,  3.22s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [02:09<01:01,  3.23s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [02:12<00:58,  3.23s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [02:16<00:55,  3.24s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [02:19<00:52,  3.25s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [02:22<00:48,  3.25s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [02:25<00:45,  3.25s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [02:29<00:42,  3.26s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [02:32<00:39,  3.27s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [02:35<00:35,  3.27s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [02:38<00:32,  3.26s/it]\u001b[A10/02/2020 12:31:48 - INFO - __main__ -   \n",
      "10/02/2020 12:31:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:31:48 - INFO - __main__ -   Validation f1 improved! 0.9023090586145649 -> 0.902439024390244\n",
      "10/02/2020 12:31:48 - INFO - __main__ -   Validation Acc : 0.8627450980392157\n",
      "10/02/2020 12:31:48 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [02:42<00:29,  3.26s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [02:45<00:26,  3.27s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [02:48<00:22,  3.26s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [02:52<00:19,  3.26s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [02:55<00:16,  3.25s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [02:58<00:13,  3.26s/it]\u001b[A10/02/2020 12:32:07 - INFO - __main__ -   \n",
      "10/02/2020 12:32:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:07 - INFO - __main__ -   Validation f1 improved! 0.902439024390244 -> 0.9061946902654867\n",
      "10/02/2020 12:32:07 - INFO - __main__ -   Validation Acc : 0.8700980392156863\n",
      "10/02/2020 12:32:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:07 - INFO - __main__ -   \n",
      "10/02/2020 12:32:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:07 - INFO - __main__ -   Validation Acc improved! 0.8651960784313726 -> 0.8700980392156863\n",
      "10/02/2020 12:32:07 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [03:01<00:09,  3.26s/it]\u001b[A10/02/2020 12:32:10 - INFO - __main__ -   \n",
      "10/02/2020 12:32:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:10 - INFO - __main__ -   Validation f1 improved! 0.9061946902654867 -> 0.9071803852889667\n",
      "10/02/2020 12:32:10 - INFO - __main__ -   Validation Acc : 0.8700980392156863\n",
      "10/02/2020 12:32:10 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [03:05<00:06,  3.26s/it]\u001b[A10/02/2020 12:32:14 - INFO - __main__ -   \n",
      "10/02/2020 12:32:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:14 - INFO - __main__ -   Validation f1 improved! 0.9071803852889667 -> 0.9090909090909091\n",
      "10/02/2020 12:32:14 - INFO - __main__ -   Validation Acc : 0.8700980392156863\n",
      "10/02/2020 12:32:14 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [03:08<00:03,  3.25s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [03:11<00:00,  3.30s/it]\u001b[A\n",
      "Epoch:  10%|███▌                                | 1/10 [03:11<28:40, 191.18s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A10/02/2020 12:32:20 - INFO - __main__ -   \n",
      "10/02/2020 12:32:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:20 - INFO - __main__ -   Validation f1 improved! 0.9090909090909091 -> 0.9244288224956063\n",
      "10/02/2020 12:32:20 - INFO - __main__ -   Validation Acc : 0.8946078431372549\n",
      "10/02/2020 12:32:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:20 - INFO - __main__ -   \n",
      "10/02/2020 12:32:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:20 - INFO - __main__ -   Validation Acc improved! 0.8700980392156863 -> 0.8946078431372549\n",
      "10/02/2020 12:32:20 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   2%|▌                                | 1/58 [00:03<03:06,  3.27s/it]\u001b[A10/02/2020 12:32:23 - INFO - __main__ -   \n",
      "10/02/2020 12:32:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:23 - INFO - __main__ -   Validation f1 improved! 0.9244288224956063 -> 0.9293286219081272\n",
      "10/02/2020 12:32:23 - INFO - __main__ -   Validation Acc : 0.9019607843137255\n",
      "10/02/2020 12:32:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:23 - INFO - __main__ -   \n",
      "10/02/2020 12:32:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:32:23 - INFO - __main__ -   Validation Acc improved! 0.8946078431372549 -> 0.9019607843137255\n",
      "10/02/2020 12:32:23 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|█▏                               | 2/58 [00:06<03:02,  3.27s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:09<02:59,  3.26s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:13<02:55,  3.25s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:16<02:52,  3.25s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:19<02:49,  3.25s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:22<02:46,  3.26s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:26<02:42,  3.26s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:29<02:39,  3.25s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:32<02:35,  3.25s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:35<02:32,  3.24s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:38<02:29,  3.25s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:42<02:26,  3.25s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:45<02:23,  3.25s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:48<02:19,  3.25s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:51<02:16,  3.25s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:55<02:12,  3.24s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:58<02:09,  3.24s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [01:01<02:06,  3.25s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [01:05<02:03,  3.25s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [01:08<02:00,  3.25s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [01:11<01:56,  3.25s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [01:14<01:53,  3.25s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [01:18<01:50,  3.25s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [01:21<01:47,  3.26s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [01:24<01:44,  3.26s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [01:27<01:40,  3.25s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:30<01:37,  3.25s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:34<01:34,  3.25s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:37<01:31,  3.26s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:40<01:27,  3.26s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:44<01:24,  3.25s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:47<01:21,  3.25s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:50<01:17,  3.25s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:53<01:14,  3.26s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:57<01:11,  3.26s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [02:00<01:08,  3.26s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [02:03<01:05,  3.25s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [02:06<01:01,  3.25s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [02:10<00:58,  3.25s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [02:13<00:55,  3.26s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [02:16<00:52,  3.26s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [02:19<00:48,  3.26s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [02:23<00:45,  3.25s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [02:26<00:42,  3.26s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [02:29<00:39,  3.26s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [02:32<00:35,  3.26s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  83%|██████████████████████████▍     | 48/58 [02:36<00:32,  3.25s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [02:39<00:29,  3.25s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [02:42<00:26,  3.26s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [02:45<00:22,  3.26s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [02:49<00:19,  3.26s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [02:52<00:16,  3.25s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [02:55<00:13,  3.25s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:58<00:09,  3.26s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [03:02<00:06,  3.26s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [03:05<00:03,  3.26s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [03:08<00:00,  3.25s/it]\u001b[A\n",
      "Epoch:  20%|███████▏                            | 2/10 [06:19<25:22, 190.32s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   2%|▌                                | 1/58 [00:03<03:07,  3.28s/it]\u001b[A\n",
      "Iteration:   3%|█▏                               | 2/58 [00:06<03:03,  3.28s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 3/58 [00:09<02:59,  3.27s/it]\u001b[A\n",
      "Iteration:   7%|██▎                              | 4/58 [00:13<02:56,  3.26s/it]\u001b[A\n",
      "Iteration:   9%|██▊                              | 5/58 [00:16<02:52,  3.26s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 6/58 [00:19<02:49,  3.26s/it]\u001b[A\n",
      "Iteration:  12%|███▉                             | 7/58 [00:22<02:46,  3.26s/it]\u001b[A\n",
      "Iteration:  14%|████▌                            | 8/58 [00:26<02:42,  3.26s/it]\u001b[A\n",
      "Iteration:  16%|█████                            | 9/58 [00:29<02:39,  3.25s/it]\u001b[A\n",
      "Iteration:  17%|█████▌                          | 10/58 [00:32<02:36,  3.26s/it]\u001b[A\n",
      "Iteration:  19%|██████                          | 11/58 [00:35<02:33,  3.26s/it]\u001b[A\n",
      "Iteration:  21%|██████▌                         | 12/58 [00:39<02:30,  3.26s/it]\u001b[A\n",
      "Iteration:  22%|███████▏                        | 13/58 [00:42<02:26,  3.26s/it]\u001b[A\n",
      "Iteration:  24%|███████▋                        | 14/58 [00:45<02:23,  3.26s/it]\u001b[A\n",
      "Iteration:  26%|████████▎                       | 15/58 [00:48<02:20,  3.26s/it]\u001b[A\n",
      "Iteration:  28%|████████▊                       | 16/58 [00:52<02:17,  3.26s/it]\u001b[A\n",
      "Iteration:  29%|█████████▍                      | 17/58 [00:55<02:13,  3.26s/it]\u001b[A\n",
      "Iteration:  31%|█████████▉                      | 18/58 [00:58<02:10,  3.26s/it]\u001b[A\n",
      "Iteration:  33%|██████████▍                     | 19/58 [01:01<02:07,  3.26s/it]\u001b[A\n",
      "Iteration:  34%|███████████                     | 20/58 [01:05<02:04,  3.27s/it]\u001b[A\n",
      "Iteration:  36%|███████████▌                    | 21/58 [01:08<02:00,  3.26s/it]\u001b[A\n",
      "Iteration:  38%|████████████▏                   | 22/58 [01:11<01:57,  3.26s/it]\u001b[A\n",
      "Iteration:  40%|████████████▋                   | 23/58 [01:15<01:54,  3.26s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 24/58 [01:18<01:51,  3.27s/it]\u001b[A\n",
      "Iteration:  43%|█████████████▊                  | 25/58 [01:21<01:47,  3.26s/it]\u001b[A\n",
      "Iteration:  45%|██████████████▎                 | 26/58 [01:24<01:44,  3.26s/it]\u001b[A\n",
      "Iteration:  47%|██████████████▉                 | 27/58 [01:28<01:40,  3.26s/it]\u001b[A\n",
      "Iteration:  48%|███████████████▍                | 28/58 [01:31<01:37,  3.26s/it]\u001b[A\n",
      "Iteration:  50%|████████████████                | 29/58 [01:34<01:34,  3.26s/it]\u001b[A\n",
      "Iteration:  52%|████████████████▌               | 30/58 [01:37<01:31,  3.26s/it]\u001b[A\n",
      "Iteration:  53%|█████████████████               | 31/58 [01:41<01:27,  3.26s/it]\u001b[A\n",
      "Iteration:  55%|█████████████████▋              | 32/58 [01:44<01:24,  3.26s/it]\u001b[A\n",
      "Iteration:  57%|██████████████████▏             | 33/58 [01:47<01:21,  3.26s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 34/58 [01:50<01:18,  3.26s/it]\u001b[A\n",
      "Iteration:  60%|███████████████████▎            | 35/58 [01:54<01:14,  3.26s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▊            | 36/58 [01:57<01:11,  3.26s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▍           | 37/58 [02:00<01:08,  3.27s/it]\u001b[A\n",
      "Iteration:  66%|████████████████████▉           | 38/58 [02:03<01:05,  3.27s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▌          | 39/58 [02:07<01:01,  3.26s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████          | 40/58 [02:10<00:58,  3.26s/it]\u001b[A\n",
      "Iteration:  71%|██████████████████████▌         | 41/58 [02:13<00:55,  3.26s/it]\u001b[A\n",
      "Iteration:  72%|███████████████████████▏        | 42/58 [02:17<00:52,  3.27s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▋        | 43/58 [02:20<00:48,  3.26s/it]\u001b[A\n",
      "Iteration:  76%|████████████████████████▎       | 44/58 [02:23<00:45,  3.26s/it]\u001b[A\n",
      "Iteration:  78%|████████████████████████▊       | 45/58 [02:26<00:42,  3.27s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 46/58 [02:30<00:39,  3.27s/it]\u001b[A\n",
      "Iteration:  81%|█████████████████████████▉      | 47/58 [02:33<00:35,  3.26s/it]\u001b[A\n",
      "Iteration:  83%|██████████████████████████▍     | 48/58 [02:36<00:32,  3.26s/it]\u001b[A\n",
      "Iteration:  84%|███████████████████████████     | 49/58 [02:39<00:29,  3.27s/it]\u001b[A\n",
      "Iteration:  86%|███████████████████████████▌    | 50/58 [02:43<00:26,  3.27s/it]\u001b[A\n",
      "Iteration:  88%|████████████████████████████▏   | 51/58 [02:46<00:22,  3.26s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 52/58 [02:49<00:19,  3.26s/it]\u001b[A\n",
      "Iteration:  91%|█████████████████████████████▏  | 53/58 [02:52<00:16,  3.26s/it]\u001b[A\n",
      "Iteration:  93%|█████████████████████████████▊  | 54/58 [02:56<00:13,  3.27s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 55/58 [02:59<00:09,  3.26s/it]\u001b[A\n",
      "Iteration:  97%|██████████████████████████████▉ | 56/58 [03:02<00:06,  3.26s/it]\u001b[A\n",
      "Iteration:  98%|███████████████████████████████▍| 57/58 [03:05<00:03,  3.26s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 58/58 [03:08<00:00,  3.26s/it]\u001b[A\n",
      "Epoch:  30%|██████████▊                         | 3/10 [09:28<22:09, 189.87s/it]\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\u001b[A10/02/2020 12:38:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 12:38:34 - INFO - __main__ -   Best f1 : 0.9293286219081272\n",
      "Iteration:   0%|                                         | 0/58 [00:00<?, ?it/s]\n",
      "Epoch:  30%|██████████▊                         | 3/10 [09:29<22:07, 189.68s/it]\n",
      "Traceback (most recent call last):\n",
      "  File \"KDAP/NLI_KD_training_MRPC.py\", line 376, in <module>\n",
      "    #                 raise ValueError('%s KD not found, please use kd or kd.full' % args.kd)\n",
      "AttributeError: 'Namespace' object has no attribute 'kd'\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_MRPC.py  # best dev F1: 92.93"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.7.8"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 4
}
