{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## *** This is an example of reproducing PeaBERT's result of RTE 73.64% "
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [0] Experiments are done with two NVIDIA T4 GPUs\n",
    "\n",
    "### * The Processes for [1] and [2] are equivalent to the  [GitHub repository](https://github.com/intersun/PKD-for-BERT-Model-Compression) for [Patient Knowledge Distillation for BERT Model Compression](https://arxiv.org/abs/1908.09355)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [1] 12 layer teacher \n",
    "checking validation -> (global_step % 2 == 0) in NLI_KD_training_RTE.py \n",
    "\n",
    "=> settings : [seed = 33725599 / learning_rate = 2e-5 / epoch = 6 / training_batch_size = 64]\n",
    "\n",
    "=> used the checkpoint saved at [epoch 1 step 11] = val[0.5860, 68.95%] as the 12 layer teacher throughout the experiment.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 42,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 11:24:37 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 11:24:38 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 11:24:38 - INFO - src.argument_parser -   random seed = 66406298\n",
      "10/02/2020 11:24:38 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 11:24:38 - INFO - __main__ -   Seed changed to : 33725599\n",
      "10/02/2020 11:24:38 - INFO - __main__ -   runing simple fine-tuning because teacher's prediction is not provided\n",
      "10/02/2020 11:24:38 - INFO - src.nli_data_processing -   Writing example 0 of 2490\n",
      "10/02/2020 11:24:40 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 11:24:40 - INFO - __main__ -     Num examples = 2490\n",
      "10/02/2020 11:24:40 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 11:24:40 - INFO - __main__ -     Num steps = 228\n",
      "10/02/2020 11:24:40 - INFO - src.nli_data_processing -   Writing example 0 of 277\n",
      "10/02/2020 11:24:40 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 11:24:40 - INFO - __main__ -     Num examples = 277\n",
      "10/02/2020 11:24:40 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 11:24:40 - INFO - src.nli_data_processing -   Writing example 0 of 3000\n",
      "10/02/2020 11:24:43 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 11:24:43 - INFO - __main__ -     Num examples = 3000\n",
      "10/02/2020 11:24:43 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 11:24:43 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 11:24:43 - INFO - src.nli_data_processing -   predicting for RTE\n",
      "10/02/2020 11:24:43 - INFO - src.modeling -   num hidden layer is set as 12\n",
      "10/02/2020 11:24:43 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 12,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 11:24:45 - INFO - src.utils -   delete 8 layers, keep 199 layers\n",
      "10/02/2020 11:24:48 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 11:24:48 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 11:24:48 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 11:24:48 - INFO - __main__ -   number of layers in student model = 12\n",
      "10/02/2020 11:24:48 - INFO - __main__ -   num parameters in student model are 109482240 and 1538\n",
      "10/02/2020 11:24:48 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:04<02:50,  4.49s/it]\u001b[A10/02/2020 11:24:56 - INFO - __main__ -   \n",
      "10/02/2020 11:24:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:24:56 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.5270758122743683\n",
      "10/02/2020 11:24:56 - INFO - __main__ -   Validation Loss is : 0.7209380938257982\n",
      "10/02/2020 11:24:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:24:56 - INFO - __main__ -   \n",
      "10/02/2020 11:24:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:24:56 - INFO - __main__ -   Validation Loss improved! 100 -> 0.7209380938257982\n",
      "10/02/2020 11:24:56 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 11:24:56 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:07<02:31,  4.09s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:08<01:53,  3.16s/it]\u001b[A10/02/2020 11:25:00 - INFO - __main__ -   \n",
      "10/02/2020 11:25:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:00 - INFO - __main__ -   Validation Loss improved! 0.7209380938257982 -> 0.7104968163941311\n",
      "10/02/2020 11:25:00 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 11:25:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:11<01:50,  3.17s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:12<01:25,  2.51s/it]\u001b[A10/02/2020 11:25:04 - INFO - __main__ -   \n",
      "10/02/2020 11:25:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:04 - INFO - __main__ -   Validation Accuracy improved! 0.5270758122743683 -> 0.5342960288808665\n",
      "10/02/2020 11:25:04 - INFO - __main__ -   Validation Loss is : 0.6929104491691727\n",
      "10/02/2020 11:25:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:04 - INFO - __main__ -   \n",
      "10/02/2020 11:25:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:04 - INFO - __main__ -   Validation Loss improved! 0.7104968163941311 -> 0.6929104491691727\n",
      "10/02/2020 11:25:04 - INFO - __main__ -   Validation Accuracy is :0.5342960288808665\n",
      "10/02/2020 11:25:04 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  15%|█████                            | 6/39 [00:16<01:29,  2.72s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:16<01:10,  2.20s/it]\u001b[A10/02/2020 11:25:08 - INFO - __main__ -   \n",
      "10/02/2020 11:25:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:08 - INFO - __main__ -   Validation Accuracy improved! 0.5342960288808665 -> 0.5631768953068592\n",
      "10/02/2020 11:25:08 - INFO - __main__ -   Validation Loss is : 0.6865879740525669\n",
      "10/02/2020 11:25:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:08 - INFO - __main__ -   \n",
      "10/02/2020 11:25:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:08 - INFO - __main__ -   Validation Loss improved! 0.6929104491691727 -> 0.6865879740525669\n",
      "10/02/2020 11:25:08 - INFO - __main__ -   Validation Accuracy is :0.5631768953068592\n",
      "10/02/2020 11:25:08 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:20<01:18,  2.52s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:21<01:01,  2.05s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:24<01:10,  2.43s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:25<00:56,  2.01s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:28<01:04,  2.40s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:29<00:51,  1.98s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:33<00:59,  2.38s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:34<00:47,  1.97s/it]\u001b[A10/02/2020 11:25:26 - INFO - __main__ -   \n",
      "10/02/2020 11:25:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:26 - INFO - __main__ -   Validation Accuracy improved! 0.5631768953068592 -> 0.6028880866425993\n",
      "10/02/2020 11:25:26 - INFO - __main__ -   Validation Loss is : 0.6833383511980519\n",
      "10/02/2020 11:25:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:26 - INFO - __main__ -   \n",
      "10/02/2020 11:25:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:26 - INFO - __main__ -   Validation Loss improved! 0.6865879740525669 -> 0.6833383511980519\n",
      "10/02/2020 11:25:26 - INFO - __main__ -   Validation Accuracy is :0.6028880866425993\n",
      "10/02/2020 11:25:26 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  41%|█████████████▏                  | 16/39 [00:37<00:55,  2.39s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:38<00:43,  1.97s/it]\u001b[A10/02/2020 11:25:30 - INFO - __main__ -   \n",
      "10/02/2020 11:25:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:30 - INFO - __main__ -   Validation Loss improved! 0.6833383511980519 -> 0.6830096124311646\n",
      "10/02/2020 11:25:30 - INFO - __main__ -   Validation Accuracy is :0.5703971119133574\n",
      "10/02/2020 11:25:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:41<00:50,  2.38s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:42<00:39,  1.96s/it]\u001b[A10/02/2020 11:25:34 - INFO - __main__ -   \n",
      "10/02/2020 11:25:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:34 - INFO - __main__ -   Validation Loss improved! 0.6830096124311646 -> 0.6829597665945115\n",
      "10/02/2020 11:25:34 - INFO - __main__ -   Validation Accuracy is :0.5703971119133574\n",
      "10/02/2020 11:25:34 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:46<00:44,  2.36s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:47<00:35,  1.95s/it]\u001b[A10/02/2020 11:25:38 - INFO - __main__ -   \n",
      "10/02/2020 11:25:38 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:38 - INFO - __main__ -   Validation Loss improved! 0.6829597665945115 -> 0.6799788629965662\n",
      "10/02/2020 11:25:38 - INFO - __main__ -   Validation Accuracy is :0.5595667870036101\n",
      "10/02/2020 11:25:38 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:50<00:39,  2.35s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:51<00:30,  1.93s/it]\u001b[A10/02/2020 11:25:43 - INFO - __main__ -   \n",
      "10/02/2020 11:25:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:43 - INFO - __main__ -   Validation Accuracy improved! 0.6028880866425993 -> 0.6101083032490975\n",
      "10/02/2020 11:25:43 - INFO - __main__ -   Validation Loss is : 0.6772337992699138\n",
      "10/02/2020 11:25:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:43 - INFO - __main__ -   \n",
      "10/02/2020 11:25:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:43 - INFO - __main__ -   Validation Loss improved! 0.6799788629965662 -> 0.6772337992699138\n",
      "10/02/2020 11:25:43 - INFO - __main__ -   Validation Accuracy is :0.6101083032490975\n",
      "10/02/2020 11:25:43 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:54<00:34,  2.33s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:55<00:26,  1.92s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:58<00:30,  2.32s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:59<00:22,  1.91s/it]\u001b[A10/02/2020 11:25:51 - INFO - __main__ -   \n",
      "10/02/2020 11:25:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:51 - INFO - __main__ -   Validation Loss improved! 0.6772337992699138 -> 0.6758093885566353\n",
      "10/02/2020 11:25:51 - INFO - __main__ -   Validation Accuracy is :0.5848375451263538\n",
      "10/02/2020 11:25:51 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [01:03<00:25,  2.31s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:04<00:19,  1.91s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:07<00:20,  2.30s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:08<00:15,  1.91s/it]\u001b[A10/02/2020 11:25:59 - INFO - __main__ -   \n",
      "10/02/2020 11:25:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:25:59 - INFO - __main__ -   Validation Loss improved! 0.6758093885566353 -> 0.6669984363046364\n",
      "10/02/2020 11:25:59 - INFO - __main__ -   Validation Accuracy is :0.592057761732852\n",
      "10/02/2020 11:25:59 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:11<00:16,  2.30s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:12<00:11,  1.91s/it]\u001b[A10/02/2020 11:26:04 - INFO - __main__ -   \n",
      "10/02/2020 11:26:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:04 - INFO - __main__ -   Validation Loss improved! 0.6669984363046364 -> 0.6624025145162313\n",
      "10/02/2020 11:26:04 - INFO - __main__ -   Validation Accuracy is :0.5884476534296029\n",
      "10/02/2020 11:26:04 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:15<00:11,  2.30s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:16<00:07,  1.91s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:19<00:06,  2.30s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:20<00:03,  1.91s/it]\u001b[A10/02/2020 11:26:12 - INFO - __main__ -   \n",
      "10/02/2020 11:26:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:12 - INFO - __main__ -   Validation Accuracy improved! 0.6101083032490975 -> 0.6137184115523465\n",
      "10/02/2020 11:26:12 - INFO - __main__ -   Validation Loss is : 0.6588022218284194\n",
      "10/02/2020 11:26:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:12 - INFO - __main__ -   \n",
      "10/02/2020 11:26:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:12 - INFO - __main__ -   Validation Loss improved! 0.6624025145162313 -> 0.6588022218284194\n",
      "10/02/2020 11:26:12 - INFO - __main__ -   Validation Accuracy is :0.6137184115523465\n",
      "10/02/2020 11:26:12 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:24<00:02,  2.30s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:24<00:00,  2.18s/it]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [01:24<07:04, 85.00s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A10/02/2020 11:26:16 - INFO - __main__ -   \n",
      "10/02/2020 11:26:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:16 - INFO - __main__ -   Validation Accuracy improved! 0.6137184115523465 -> 0.631768953068592\n",
      "10/02/2020 11:26:16 - INFO - __main__ -   Validation Loss is : 0.6469965841795994\n",
      "10/02/2020 11:26:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:16 - INFO - __main__ -   \n",
      "10/02/2020 11:26:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:16 - INFO - __main__ -   Validation Loss improved! 0.6588022218284194 -> 0.6469965841795994\n",
      "10/02/2020 11:26:16 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 11:26:16 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:03<02:01,  3.20s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:34,  2.54s/it]\u001b[A10/02/2020 11:26:20 - INFO - __main__ -   \n",
      "10/02/2020 11:26:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:20 - INFO - __main__ -   Validation Loss improved! 0.6469965841795994 -> 0.6286313645676155\n",
      "10/02/2020 11:26:20 - INFO - __main__ -   Validation Accuracy is :0.6245487364620939\n",
      "10/02/2020 11:26:20 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:   8%|██▌                              | 3/39 [00:07<01:38,  2.74s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:08<01:17,  2.22s/it]\u001b[A10/02/2020 11:26:25 - INFO - __main__ -   \n",
      "10/02/2020 11:26:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:25 - INFO - __main__ -   Validation Accuracy improved! 0.631768953068592 -> 0.6534296028880866\n",
      "10/02/2020 11:26:25 - INFO - __main__ -   Validation Loss is : 0.6099105001786986\n",
      "10/02/2020 11:26:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:25 - INFO - __main__ -   \n",
      "10/02/2020 11:26:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:25 - INFO - __main__ -   Validation Loss improved! 0.6286313645676155 -> 0.6099105001786986\n",
      "10/02/2020 11:26:25 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 11:26:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:25,  2.52s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:12<01:08,  2.06s/it]\u001b[A10/02/2020 11:26:29 - INFO - __main__ -   \n",
      "10/02/2020 11:26:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:29 - INFO - __main__ -   Validation Accuracy improved! 0.6534296028880866 -> 0.6750902527075813\n",
      "10/02/2020 11:26:29 - INFO - __main__ -   Validation Loss is : 0.604484595977012\n",
      "10/02/2020 11:26:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:29 - INFO - __main__ -   \n",
      "10/02/2020 11:26:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:29 - INFO - __main__ -   Validation Loss improved! 0.6099105001786986 -> 0.604484595977012\n",
      "10/02/2020 11:26:29 - INFO - __main__ -   Validation Accuracy is :0.6750902527075813\n",
      "10/02/2020 11:26:29 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:15<01:17,  2.41s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<01:01,  1.99s/it]\u001b[A10/02/2020 11:26:33 - INFO - __main__ -   \n",
      "10/02/2020 11:26:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:33 - INFO - __main__ -   Validation Accuracy improved! 0.6750902527075813 -> 0.6895306859205776\n",
      "10/02/2020 11:26:33 - INFO - __main__ -   Validation Loss is : 0.6012662687886923\n",
      "10/02/2020 11:26:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:33 - INFO - __main__ -   \n",
      "10/02/2020 11:26:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:33 - INFO - __main__ -   Validation Loss improved! 0.604484595977012 -> 0.6012662687886923\n",
      "10/02/2020 11:26:33 - INFO - __main__ -   Validation Accuracy is :0.6895306859205776\n",
      "10/02/2020 11:26:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:20<01:10,  2.35s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:21<00:56,  1.95s/it]\u001b[A10/02/2020 11:26:37 - INFO - __main__ -   \n",
      "10/02/2020 11:26:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:37 - INFO - __main__ -   Validation Loss improved! 0.6012662687886923 -> 0.586049682396844\n",
      "10/02/2020 11:26:37 - INFO - __main__ -   Validation Accuracy is :0.6895306859205776\n",
      "10/02/2020 11:26:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:38 - INFO - __main__ -   Saving the model...\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:24<01:08,  2.46s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:25<00:54,  2.02s/it]\u001b[A10/02/2020 11:26:42 - INFO - __main__ -   \n",
      "10/02/2020 11:26:42 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:26:42 - INFO - __main__ -   Validation Accuracy improved! 0.6895306859205776 -> 0.7111913357400722\n",
      "10/02/2020 11:26:42 - INFO - __main__ -   Validation Loss is : 0.5935920205787631\n",
      "10/02/2020 11:26:42 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:28<01:01,  2.38s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:29<00:49,  1.97s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:33<00:56,  2.35s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:34<00:44,  1.94s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:37<00:51,  2.33s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:38<00:40,  1.93s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:41<00:46,  2.33s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:42<00:36,  1.93s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:45<00:41,  2.32s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:46<00:32,  1.92s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:50<00:37,  2.32s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:51<00:28,  1.92s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:54<00:32,  2.32s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:55<00:24,  1.92s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:58<00:27,  2.32s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:59<00:21,  1.92s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:02<00:23,  2.31s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:03<00:17,  1.92s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:07<00:18,  2.31s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:08<00:13,  1.92s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:11<00:13,  2.31s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:12<00:09,  1.91s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:15<00:09,  2.31s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:16<00:05,  1.91s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:19<00:04,  2.31s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:20<00:01,  1.91s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:23<00:00,  2.15s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:48<05:38, 84.66s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<00:38,  1.00s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:01,  1.67s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:05<00:52,  1.47s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:08<01:09,  2.00s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:09<00:57,  1.70s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:12<01:11,  2.16s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:13<00:57,  1.81s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<01:09,  2.23s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:17<00:55,  1.86s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:21<01:05,  2.27s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:22<00:52,  1.89s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:25<01:01,  2.29s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:26<00:49,  1.90s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:29<00:57,  2.30s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:30<00:45,  1.91s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:33<00:53,  2.31s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:34<00:42,  1.91s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:38<00:48,  2.31s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:39<00:38,  1.91s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:42<00:43,  2.30s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:43<00:34,  1.90s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:46<00:39,  2.30s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:47<00:30,  1.90s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:50<00:34,  2.30s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:51<00:26,  1.91s/it]\u001b[A10/02/2020 11:28:32 - INFO - __main__ -   \n",
      "10/02/2020 11:28:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:28:32 - INFO - __main__ -   Validation Accuracy improved! 0.7111913357400722 -> 0.7184115523465704\n",
      "10/02/2020 11:28:32 - INFO - __main__ -   Validation Loss is : 0.6812592516737294\n",
      "10/02/2020 11:28:32 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:54<00:29,  2.31s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:55<00:22,  1.91s/it]\u001b[A10/02/2020 11:28:36 - INFO - __main__ -   \n",
      "10/02/2020 11:28:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:28:36 - INFO - __main__ -   Validation Accuracy improved! 0.7184115523465704 -> 0.7220216606498195\n",
      "10/02/2020 11:28:36 - INFO - __main__ -   Validation Loss is : 0.6906600256689188\n",
      "10/02/2020 11:28:36 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:59<00:25,  2.30s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:00<00:19,  1.91s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:03<00:20,  2.30s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:04<00:15,  1.91s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:07<00:16,  2.31s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:08<00:11,  1.91s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:11<00:11,  2.31s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:12<00:07,  1.91s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:15<00:06,  2.31s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:16<00:03,  1.92s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:20<00:02,  2.31s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:21<00:00,  2.08s/it]\u001b[A\n",
      "Epoch:  50%|███████████████████                   | 3/6 [04:09<04:10, 83.60s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:03<02:02,  3.23s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:34,  2.57s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:07<01:39,  2.77s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:08<01:18,  2.24s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:26,  2.54s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:12<01:08,  2.08s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:15<01:17,  2.42s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<01:01,  1.99s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:20<01:11,  2.37s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:21<00:56,  1.95s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:24<01:05,  2.34s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:25<00:52,  1.94s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:28<01:00,  2.33s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:29<00:48,  1.93s/it]\u001b[A10/02/2020 11:29:31 - INFO - __main__ -   \n",
      "10/02/2020 11:29:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:29:31 - INFO - __main__ -   Validation Accuracy improved! 0.7220216606498195 -> 0.7328519855595668\n",
      "10/02/2020 11:29:31 - INFO - __main__ -   Validation Loss is : 0.7566816436684949\n",
      "10/02/2020 11:29:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:32<00:55,  2.32s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:33<00:44,  1.92s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:37<00:50,  2.31s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:38<00:40,  1.92s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:41<00:46,  2.31s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:42<00:36,  1.92s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:45<00:41,  2.31s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:46<00:32,  1.92s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:49<00:36,  2.31s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:50<00:28,  1.92s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:54<00:32,  2.31s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:55<00:24,  1.92s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:58<00:27,  2.31s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:59<00:21,  1.91s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:02<00:23,  2.31s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:03<00:17,  1.92s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:06<00:18,  2.31s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:07<00:13,  1.92s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:10<00:13,  2.31s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:11<00:09,  1.92s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:15<00:09,  2.31s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:16<00:05,  1.92s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:19<00:04,  2.31s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:20<00:01,  1.91s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:23<00:00,  2.14s/it]\u001b[A\n",
      "Epoch:  67%|█████████████████████████▎            | 4/6 [05:33<02:47, 83.58s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<00:38,  1.00s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:01,  1.67s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:05<00:52,  1.47s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:08<01:09,  2.00s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:09<00:57,  1.70s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:12<01:11,  2.16s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:13<00:58,  1.81s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<01:09,  2.24s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:17<00:55,  1.87s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:21<01:06,  2.28s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  28%|█████████                       | 11/39 [00:22<00:53,  1.90s/it]\u001b[A10/02/2020 11:30:47 - INFO - __main__ -   \n",
      "10/02/2020 11:30:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:30:47 - INFO - __main__ -   Validation Accuracy improved! 0.7328519855595668 -> 0.7472924187725631\n",
      "10/02/2020 11:30:47 - INFO - __main__ -   Validation Loss is : 0.8296041764076867\n",
      "10/02/2020 11:30:47 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:25<01:01,  2.29s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:26<00:49,  1.90s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:29<00:57,  2.31s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:30<00:45,  1.91s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:33<00:53,  2.31s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:34<00:42,  1.91s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:38<00:48,  2.31s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:39<00:38,  1.92s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:42<00:44,  2.32s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:43<00:34,  1.92s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:46<00:39,  2.32s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:47<00:30,  1.92s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:50<00:34,  2.31s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:51<00:26,  1.91s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:55<00:30,  2.31s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:56<00:22,  1.91s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:59<00:25,  2.31s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:00<00:19,  1.91s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:03<00:20,  2.31s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:04<00:15,  1.91s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:07<00:16,  2.32s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:08<00:11,  1.92s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:11<00:11,  2.32s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:12<00:07,  1.92s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:16<00:06,  2.31s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:17<00:03,  1.92s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:20<00:02,  2.32s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:21<00:00,  2.09s/it]\u001b[A\n",
      "Epoch:  83%|███████████████████████████████▋      | 5/6 [06:54<01:22, 82.91s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:03<02:03,  3.24s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:35,  2.57s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:07<01:39,  2.77s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:08<01:18,  2.24s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:26,  2.54s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:12<01:08,  2.07s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:15<01:17,  2.42s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<01:01,  1.99s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:20<01:10,  2.37s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:21<00:56,  1.96s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:24<01:05,  2.34s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:25<00:52,  1.94s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:28<01:00,  2.33s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:29<00:48,  1.93s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:32<00:55,  2.32s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:33<00:44,  1.92s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:37<00:50,  2.31s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:38<00:40,  1.92s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:41<00:46,  2.32s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:42<00:36,  1.92s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:45<00:41,  2.31s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:46<00:32,  1.92s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:49<00:36,  2.31s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:50<00:28,  1.92s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:54<00:32,  2.31s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:54<00:24,  1.92s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:58<00:27,  2.31s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:59<00:21,  1.91s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:02<00:23,  2.31s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:03<00:17,  1.91s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:06<00:18,  2.31s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:07<00:13,  1.92s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:10<00:13,  2.31s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:11<00:09,  1.92s/it]\u001b[A10/02/2020 11:32:56 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:15<00:09,  2.31s/it]\u001b[A10/02/2020 11:32:59 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:16<00:05,  1.92s/it]\u001b[A10/02/2020 11:33:00 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:19<00:04,  2.31s/it]\u001b[A10/02/2020 11:33:03 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:20<00:01,  1.92s/it]\u001b[A10/02/2020 11:33:04 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:23<00:00,  2.14s/it]\u001b[A\n",
      "Epoch: 100%|██████████████████████████████████████| 6/6 [08:18<00:00, 83.07s/it]\n",
      "10/02/2020 11:33:06 - INFO - __main__ -   \n",
      "10/02/2020 11:33:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:33:06 - INFO - __main__ -   Validation Loss : 0.586049682396844 Validation Accuracy : 0.7472924187725631\n",
      "10/02/2020 11:33:06 - INFO - __main__ -   Loss_ACC : 0.6895306859205776\n",
      "10/02/2020 11:33:06 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_RTE.py"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "run the below cell to save the teacher model's outputs => used for KD or Patient KD"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 44,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 11:34:50 - INFO - __main__ -   sub_dir = teacher_12layer\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   prediction_mode = teacher\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   interested_set = train,dev,test\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   predicting for task QQP\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   using model from kd_MRPC_nlayer.12_lr.2e-05_T.10.0_alpha.0.0_beta.0.0_bs.32-run-3 epoch 2\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   Skipped because not interested\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   predicting for task SST-2\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   using model from kd_SST-2_nlayer.12_lr.2e-05_T.10.0_alpha.0.0_beta.0.0_bs.32-run-1 epoch 3\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   Skipped because not interested\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   predicting for task RTE\n",
      "10/02/2020 11:34:50 - INFO - __main__ -   using model from kd_RTE_nlayer.12_lr.2e-05_T.10.0_alpha.0.0_beta.0.0_bs.32-run-1 epoch 3\n",
      "10/02/2020 11:34:50 - INFO - src.nli_data_processing -   predicting for RTE\n",
      "10/02/2020 11:34:50 - INFO - src.modeling -   num hidden layer is set as 12\n",
      "10/02/2020 11:34:50 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 12,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 11:34:55 - INFO - src.nli_data_processing -   Writing example 0 of 277\n",
      "10/02/2020 11:34:58 - INFO - __main__ -   for dev, acc = 0.6895306859205776, loss = 0.586049682396844\n",
      "10/02/2020 11:34:58 - INFO - src.nli_data_processing -   Writing example 0 of 2490\n",
      "10/02/2020 11:35:21 - INFO - __main__ -   for training, acc = 0.793574297188755, loss = 0.4677860079998951\n",
      "10/02/2020 11:35:21 - INFO - src.nli_data_processing -   Writing example 0 of 3000\n",
      "10/02/2020 11:35:49 - INFO - __main__ -   for test, acc = 0.742, loss = 0.7630566501617432\n",
      "10/02/2020 11:35:49 - INFO - __main__ -   debug test acc = 0.742\n",
      "10/02/2020 11:35:49 - INFO - __main__ -   saving teacher results\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   predicting for task RTE Done!\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   predicting for task MNLI\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   using model from kd_MNLI_nlayer.12_lr.2e-05_T.10.0_alpha.0.0_beta.0.0_bs.32-run-3 epoch 3\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   Skipped because not interested\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   predicting for task QNLI\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   using model from kd_QNLI_nlayer.12_lr.2e-05_T.10.0_alpha.0.0_beta.0.0_bs.32-run-1 epoch 1\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   Skipped because not interested\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   predicting for task QQP\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   using model from kd_QQP_nlayer.12_lr.2e-05_T.10.0_alpha.0.0_beta.0.0_bs.32-run-1 epoch 3\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   Skipped because not interested\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   predicting for task MNLI-mm\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   using model from kd_MNLI-mm_nlayer.12_lr.2e-05_T.10.0_alpha.0.0_beta.0.0_bs.32-run-3 epoch 3\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   Skipped because not interested\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   predicting for task race-merge\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   using model from kd_race-merge_nlayer.12_lr.2e-05_T.10.0_alpha.0.0_beta.0.0_bs.32-run-1 epoch 3\n",
      "10/02/2020 11:35:51 - INFO - __main__ -   Skipped because not interested\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/run_glue_benchmark.py"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [2] BERT6-PKD \n",
    "-> run 5 times to check the result of original BERT6-PKD "
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "the scores on the bottom of each cell is the best values during training. We compare these scores for the dev set."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 30,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 10:13:15 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 10:13:16 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 10:13:16 - INFO - src.argument_parser -   random seed = 24203079\n",
      "10/02/2020 10:13:16 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 10:13:16 - INFO - __main__ -   Seed changed to : 24203079\n",
      "10/02/2020 10:13:16 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 10:13:17 - INFO - __main__ -   teacher acc = 79.36, teacher loss = 0.46779\n",
      "10/02/2020 10:13:17 - INFO - __main__ -   teacher acc = 68.95, teacher loss = 0.58605\n",
      "10/02/2020 10:13:17 - INFO - src.nli_data_processing -   Writing example 0 of 2490\n",
      "10/02/2020 10:13:19 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 10:13:19 - INFO - __main__ -     Num examples = 2490\n",
      "10/02/2020 10:13:19 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 10:13:19 - INFO - __main__ -     Num steps = 228\n",
      "10/02/2020 10:13:19 - INFO - src.nli_data_processing -   Writing example 0 of 277\n",
      "10/02/2020 10:13:19 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:13:19 - INFO - __main__ -     Num examples = 277\n",
      "10/02/2020 10:13:19 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:13:19 - INFO - src.nli_data_processing -   Writing example 0 of 3000\n",
      "10/02/2020 10:13:21 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:13:21 - INFO - __main__ -     Num examples = 3000\n",
      "10/02/2020 10:13:21 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:13:21 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 10:13:21 - INFO - src.nli_data_processing -   predicting for RTE\n",
      "10/02/2020 10:13:21 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 10:13:21 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 10:13:23 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 10:13:26 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:13:26 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 10:13:26 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:13:26 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 10:13:26 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 10:13:26 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 10:13:31 - INFO - __main__ -   \n",
      "10/02/2020 10:13:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:31 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.5270758122743683\n",
      "10/02/2020 10:13:31 - INFO - __main__ -   Validation Loss is : 0.6987469497570492\n",
      "10/02/2020 10:13:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:31 - INFO - __main__ -   \n",
      "10/02/2020 10:13:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:31 - INFO - __main__ -   Validation Loss improved! 100 -> 0.6987469497570492\n",
      "10/02/2020 10:13:31 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:13:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:05<03:14,  5.13s/it]\u001b[A10/02/2020 10:13:33 - INFO - __main__ -   \n",
      "10/02/2020 10:13:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:33 - INFO - __main__ -   Validation Loss improved! 0.6987469497570492 -> 0.6983697870578146\n",
      "10/02/2020 10:13:33 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:13:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:06<02:30,  4.07s/it]\u001b[A10/02/2020 10:13:34 - INFO - __main__ -   \n",
      "10/02/2020 10:13:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:34 - INFO - __main__ -   Validation Loss improved! 0.6983697870578146 -> 0.6976722517598837\n",
      "10/02/2020 10:13:34 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:13:34 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:08<01:59,  3.33s/it]\u001b[A10/02/2020 10:13:36 - INFO - __main__ -   \n",
      "10/02/2020 10:13:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:36 - INFO - __main__ -   Validation Loss improved! 0.6976722517598837 -> 0.6969382461658024\n",
      "10/02/2020 10:13:36 - INFO - __main__ -   Validation Accuracy is :0.5234657039711191\n",
      "10/02/2020 10:13:36 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:09<01:38,  2.81s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:23,  2.44s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:13<01:12,  2.19s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:14<01:04,  2.01s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<00:58,  1.89s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:17<00:54,  1.81s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:19<00:50,  1.76s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:21<00:48,  1.72s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:22<00:45,  1.69s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:24<00:43,  1.68s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:26<00:41,  1.67s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:27<00:40,  1.67s/it]\u001b[A10/02/2020 10:13:55 - INFO - __main__ -   \n",
      "10/02/2020 10:13:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:55 - INFO - __main__ -   Validation Loss improved! 0.6969382461658024 -> 0.6933763276799061\n",
      "10/02/2020 10:13:55 - INFO - __main__ -   Validation Accuracy is :0.49458483754512633\n",
      "10/02/2020 10:13:55 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:29<00:38,  1.67s/it]\u001b[A10/02/2020 10:13:57 - INFO - __main__ -   \n",
      "10/02/2020 10:13:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:57 - INFO - __main__ -   Validation Accuracy improved! 0.5270758122743683 -> 0.5342960288808665\n",
      "10/02/2020 10:13:57 - INFO - __main__ -   Validation Loss is : 0.6876900944899136\n",
      "10/02/2020 10:13:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:57 - INFO - __main__ -   \n",
      "10/02/2020 10:13:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:57 - INFO - __main__ -   Validation Loss improved! 0.6933763276799061 -> 0.6876900944899136\n",
      "10/02/2020 10:13:57 - INFO - __main__ -   Validation Accuracy is :0.5342960288808665\n",
      "10/02/2020 10:13:57 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  44%|█████████████▉                  | 17/39 [00:31<00:36,  1.68s/it]\u001b[A10/02/2020 10:13:59 - INFO - __main__ -   \n",
      "10/02/2020 10:13:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:59 - INFO - __main__ -   Validation Accuracy improved! 0.5342960288808665 -> 0.5703971119133574\n",
      "10/02/2020 10:13:59 - INFO - __main__ -   Validation Loss is : 0.6848098396824586\n",
      "10/02/2020 10:13:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:59 - INFO - __main__ -   \n",
      "10/02/2020 10:13:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:13:59 - INFO - __main__ -   Validation Loss improved! 0.6876900944899136 -> 0.6848098396824586\n",
      "10/02/2020 10:13:59 - INFO - __main__ -   Validation Accuracy is :0.5703971119133574\n",
      "10/02/2020 10:13:59 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:32<00:35,  1.68s/it]\u001b[A10/02/2020 10:14:00 - INFO - __main__ -   \n",
      "10/02/2020 10:14:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:00 - INFO - __main__ -   Validation Accuracy improved! 0.5703971119133574 -> 0.5812274368231047\n",
      "10/02/2020 10:14:00 - INFO - __main__ -   Validation Loss is : 0.683818369565888\n",
      "10/02/2020 10:14:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:00 - INFO - __main__ -   \n",
      "10/02/2020 10:14:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:00 - INFO - __main__ -   Validation Loss improved! 0.6848098396824586 -> 0.683818369565888\n",
      "10/02/2020 10:14:00 - INFO - __main__ -   Validation Accuracy is :0.5812274368231047\n",
      "10/02/2020 10:14:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:34<00:33,  1.68s/it]\u001b[A10/02/2020 10:14:02 - INFO - __main__ -   \n",
      "10/02/2020 10:14:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:02 - INFO - __main__ -   Validation Loss improved! 0.683818369565888 -> 0.6838044672666473\n",
      "10/02/2020 10:14:02 - INFO - __main__ -   Validation Accuracy is :0.5631768953068592\n",
      "10/02/2020 10:14:02 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:36<00:32,  1.68s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:37<00:30,  1.69s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:39<00:28,  1.69s/it]\u001b[A10/02/2020 10:14:07 - INFO - __main__ -   \n",
      "10/02/2020 10:14:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:07 - INFO - __main__ -   Validation Loss improved! 0.6838044672666473 -> 0.6805365076994637\n",
      "10/02/2020 10:14:07 - INFO - __main__ -   Validation Accuracy is :0.5812274368231047\n",
      "10/02/2020 10:14:07 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:41<00:27,  1.71s/it]\u001b[A10/02/2020 10:14:09 - INFO - __main__ -   \n",
      "10/02/2020 10:14:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:09 - INFO - __main__ -   Validation Accuracy improved! 0.5812274368231047 -> 0.592057761732852\n",
      "10/02/2020 10:14:09 - INFO - __main__ -   Validation Loss is : 0.67646045202813\n",
      "10/02/2020 10:14:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:09 - INFO - __main__ -   \n",
      "10/02/2020 10:14:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:09 - INFO - __main__ -   Validation Loss improved! 0.6805365076994637 -> 0.67646045202813\n",
      "10/02/2020 10:14:09 - INFO - __main__ -   Validation Accuracy is :0.592057761732852\n",
      "10/02/2020 10:14:09 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:43<00:25,  1.71s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:44<00:23,  1.71s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:46<00:22,  1.71s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:48<00:20,  1.70s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:49<00:18,  1.70s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:51<00:16,  1.69s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:53<00:15,  1.69s/it]\u001b[A10/02/2020 10:14:21 - INFO - __main__ -   \n",
      "10/02/2020 10:14:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:21 - INFO - __main__ -   Validation Accuracy improved! 0.592057761732852 -> 0.6064981949458483\n",
      "10/02/2020 10:14:21 - INFO - __main__ -   Validation Loss is : 0.6654999178669513\n",
      "10/02/2020 10:14:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:21 - INFO - __main__ -   \n",
      "10/02/2020 10:14:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:21 - INFO - __main__ -   Validation Loss improved! 0.67646045202813 -> 0.6654999178669513\n",
      "10/02/2020 10:14:21 - INFO - __main__ -   Validation Accuracy is :0.6064981949458483\n",
      "10/02/2020 10:14:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:54<00:13,  1.68s/it]\u001b[A10/02/2020 10:14:22 - INFO - __main__ -   \n",
      "10/02/2020 10:14:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:22 - INFO - __main__ -   Validation Loss improved! 0.6654999178669513 -> 0.6628943281483564\n",
      "10/02/2020 10:14:22 - INFO - __main__ -   Validation Accuracy is :0.6064981949458483\n",
      "10/02/2020 10:14:22 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:56<00:11,  1.67s/it]\u001b[A10/02/2020 10:14:24 - INFO - __main__ -   \n",
      "10/02/2020 10:14:24 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:24 - INFO - __main__ -   Validation Accuracy improved! 0.6064981949458483 -> 0.6101083032490975\n",
      "10/02/2020 10:14:24 - INFO - __main__ -   Validation Loss is : 0.6681024296619401\n",
      "10/02/2020 10:14:24 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:58<00:10,  1.67s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:59<00:08,  1.67s/it]\u001b[A10/02/2020 10:14:27 - INFO - __main__ -   \n",
      "10/02/2020 10:14:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:27 - INFO - __main__ -   Validation Loss improved! 0.6628943281483564 -> 0.6623001374062218\n",
      "10/02/2020 10:14:27 - INFO - __main__ -   Validation Accuracy is :0.6028880866425993\n",
      "10/02/2020 10:14:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:01<00:06,  1.66s/it]\u001b[A10/02/2020 10:14:29 - INFO - __main__ -   \n",
      "10/02/2020 10:14:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:29 - INFO - __main__ -   Validation Accuracy improved! 0.6101083032490975 -> 0.6245487364620939\n",
      "10/02/2020 10:14:29 - INFO - __main__ -   Validation Loss is : 0.6533059385279025\n",
      "10/02/2020 10:14:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:29 - INFO - __main__ -   \n",
      "10/02/2020 10:14:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:29 - INFO - __main__ -   Validation Loss improved! 0.6623001374062218 -> 0.6533059385279025\n",
      "10/02/2020 10:14:29 - INFO - __main__ -   Validation Accuracy is :0.6245487364620939\n",
      "10/02/2020 10:14:29 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:03<00:04,  1.66s/it]\u001b[A10/02/2020 10:14:31 - INFO - __main__ -   \n",
      "10/02/2020 10:14:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:31 - INFO - __main__ -   Validation Accuracy improved! 0.6245487364620939 -> 0.6425992779783394\n",
      "10/02/2020 10:14:31 - INFO - __main__ -   Validation Loss is : 0.6534866064464142\n",
      "10/02/2020 10:14:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:04<00:03,  1.66s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:06<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:08<00:00,  1.75s/it]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [01:08<05:40, 68.07s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.63s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:58,  1.63s/it]\u001b[A10/02/2020 10:14:40 - INFO - __main__ -   \n",
      "10/02/2020 10:14:40 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:40 - INFO - __main__ -   Validation Loss improved! 0.6533059385279025 -> 0.6513845257810738\n",
      "10/02/2020 10:14:40 - INFO - __main__ -   Validation Accuracy is :0.6101083032490975\n",
      "10/02/2020 10:14:40 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.63s/it]\u001b[A10/02/2020 10:14:42 - INFO - __main__ -   \n",
      "10/02/2020 10:14:42 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:42 - INFO - __main__ -   Validation Loss improved! 0.6513845257810738 -> 0.6441065585139857\n",
      "10/02/2020 10:14:42 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:14:42 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.63s/it]\u001b[A10/02/2020 10:14:44 - INFO - __main__ -   \n",
      "10/02/2020 10:14:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:44 - INFO - __main__ -   Validation Accuracy improved! 0.6425992779783394 -> 0.6462093862815884\n",
      "10/02/2020 10:14:44 - INFO - __main__ -   Validation Loss is : 0.638830539551883\n",
      "10/02/2020 10:14:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:44 - INFO - __main__ -   \n",
      "10/02/2020 10:14:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:44 - INFO - __main__ -   Validation Loss improved! 0.6441065585139857 -> 0.638830539551883\n",
      "10/02/2020 10:14:44 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:14:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:53,  1.63s/it]\u001b[A10/02/2020 10:14:45 - INFO - __main__ -   \n",
      "10/02/2020 10:14:45 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:45 - INFO - __main__ -   Validation Loss improved! 0.638830539551883 -> 0.6379922976993051\n",
      "10/02/2020 10:14:45 - INFO - __main__ -   Validation Accuracy is :0.628158844765343\n",
      "10/02/2020 10:14:45 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.63s/it]\u001b[A10/02/2020 10:14:47 - INFO - __main__ -   \n",
      "10/02/2020 10:14:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:47 - INFO - __main__ -   Validation Loss improved! 0.6379922976993051 -> 0.6374766353235348\n",
      "10/02/2020 10:14:47 - INFO - __main__ -   Validation Accuracy is :0.6209386281588448\n",
      "10/02/2020 10:14:47 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.63s/it]\u001b[A10/02/2020 10:14:49 - INFO - __main__ -   \n",
      "10/02/2020 10:14:49 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:49 - INFO - __main__ -   Validation Loss improved! 0.6374766353235348 -> 0.6338456391427492\n",
      "10/02/2020 10:14:49 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:14:49 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:48,  1.63s/it]\u001b[A10/02/2020 10:14:50 - INFO - __main__ -   \n",
      "10/02/2020 10:14:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:50 - INFO - __main__ -   Validation Loss improved! 0.6338456391427492 -> 0.6315051020267638\n",
      "10/02/2020 10:14:50 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:14:50 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.63s/it]\u001b[A10/02/2020 10:14:52 - INFO - __main__ -   \n",
      "10/02/2020 10:14:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:14:52 - INFO - __main__ -   Validation Accuracy improved! 0.6462093862815884 -> 0.6498194945848376\n",
      "10/02/2020 10:14:52 - INFO - __main__ -   Validation Loss is : 0.6315597127921314\n",
      "10/02/2020 10:14:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:45,  1.62s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:43,  1.62s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.62s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:40,  1.62s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.63s/it]\u001b[A10/02/2020 10:15:00 - INFO - __main__ -   \n",
      "10/02/2020 10:15:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:00 - INFO - __main__ -   Validation Loss improved! 0.6315051020267638 -> 0.6287516308175097\n",
      "10/02/2020 10:15:00 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:15:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.62s/it]\u001b[A10/02/2020 10:15:02 - INFO - __main__ -   \n",
      "10/02/2020 10:15:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:02 - INFO - __main__ -   Validation Loss improved! 0.6287516308175097 -> 0.6261948540752975\n",
      "10/02/2020 10:15:02 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:15:02 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:35,  1.63s/it]\u001b[A10/02/2020 10:15:03 - INFO - __main__ -   \n",
      "10/02/2020 10:15:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:03 - INFO - __main__ -   Validation Loss improved! 0.6261948540752975 -> 0.6249229796096306\n",
      "10/02/2020 10:15:03 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:15:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.63s/it]\u001b[A10/02/2020 10:15:05 - INFO - __main__ -   \n",
      "10/02/2020 10:15:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:05 - INFO - __main__ -   Validation Loss improved! 0.6249229796096306 -> 0.6242712585504304\n",
      "10/02/2020 10:15:05 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:15:05 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:30<00:32,  1.63s/it]\u001b[A10/02/2020 10:15:06 - INFO - __main__ -   \n",
      "10/02/2020 10:15:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:06 - INFO - __main__ -   Validation Loss improved! 0.6242712585504304 -> 0.6242091939767775\n",
      "10/02/2020 10:15:06 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:15:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:30,  1.63s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.63s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:35<00:27,  1.63s/it]\u001b[A10/02/2020 10:15:11 - INFO - __main__ -   \n",
      "10/02/2020 10:15:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:11 - INFO - __main__ -   Validation Loss improved! 0.6242091939767775 -> 0.6229664771565462\n",
      "10/02/2020 10:15:11 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:15:11 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.63s/it]\u001b[A10/02/2020 10:15:13 - INFO - __main__ -   \n",
      "10/02/2020 10:15:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:13 - INFO - __main__ -   Validation Loss improved! 0.6229664771565462 -> 0.6224743192376643\n",
      "10/02/2020 10:15:13 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:15:13 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:40<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A10/02/2020 10:15:18 - INFO - __main__ -   \n",
      "10/02/2020 10:15:18 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:18 - INFO - __main__ -   Validation Accuracy improved! 0.6498194945848376 -> 0.6534296028880866\n",
      "10/02/2020 10:15:18 - INFO - __main__ -   Validation Loss is : 0.6300271925943423\n",
      "10/02/2020 10:15:18 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A10/02/2020 10:15:20 - INFO - __main__ -   \n",
      "10/02/2020 10:15:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:20 - INFO - __main__ -   Validation Accuracy improved! 0.6534296028880866 -> 0.6570397111913358\n",
      "10/02/2020 10:15:20 - INFO - __main__ -   Validation Loss is : 0.6306729299497088\n",
      "10/02/2020 10:15:20 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:48<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:53<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:58<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.63s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:11<04:27, 66.77s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A10/02/2020 10:15:41 - INFO - __main__ -   \n",
      "10/02/2020 10:15:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:41 - INFO - __main__ -   Validation Accuracy improved! 0.6570397111913358 -> 0.6606498194945848\n",
      "10/02/2020 10:15:41 - INFO - __main__ -   Validation Loss is : 0.6325235229106586\n",
      "10/02/2020 10:15:41 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:01,  1.65s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A10/02/2020 10:15:44 - INFO - __main__ -   \n",
      "10/02/2020 10:15:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:44 - INFO - __main__ -   Validation Accuracy improved! 0.6606498194945848 -> 0.6678700361010831\n",
      "10/02/2020 10:15:44 - INFO - __main__ -   Validation Loss is : 0.625676003604159\n",
      "10/02/2020 10:15:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.65s/it]\u001b[A10/02/2020 10:15:46 - INFO - __main__ -   \n",
      "10/02/2020 10:15:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:46 - INFO - __main__ -   Validation Accuracy improved! 0.6678700361010831 -> 0.6714801444043321\n",
      "10/02/2020 10:15:46 - INFO - __main__ -   Validation Loss is : 0.621304821882007\n",
      "10/02/2020 10:15:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:46 - INFO - __main__ -   \n",
      "10/02/2020 10:15:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:46 - INFO - __main__ -   Validation Loss improved! 0.6224743192376643 -> 0.621304821882007\n",
      "10/02/2020 10:15:46 - INFO - __main__ -   Validation Accuracy is :0.6714801444043321\n",
      "10/02/2020 10:15:46 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:56,  1.65s/it]\u001b[A10/02/2020 10:15:48 - INFO - __main__ -   \n",
      "10/02/2020 10:15:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:48 - INFO - __main__ -   Validation Accuracy improved! 0.6714801444043321 -> 0.6750902527075813\n",
      "10/02/2020 10:15:48 - INFO - __main__ -   Validation Loss is : 0.617303541851388\n",
      "10/02/2020 10:15:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:48 - INFO - __main__ -   \n",
      "10/02/2020 10:15:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:48 - INFO - __main__ -   Validation Loss improved! 0.621304821882007 -> 0.617303541851388\n",
      "10/02/2020 10:15:48 - INFO - __main__ -   Validation Accuracy is :0.6750902527075813\n",
      "10/02/2020 10:15:48 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A10/02/2020 10:15:49 - INFO - __main__ -   \n",
      "10/02/2020 10:15:49 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:49 - INFO - __main__ -   Validation Loss improved! 0.617303541851388 -> 0.615752719369606\n",
      "10/02/2020 10:15:49 - INFO - __main__ -   Validation Accuracy is :0.6642599277978339\n",
      "10/02/2020 10:15:49 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.65s/it]\u001b[A10/02/2020 10:15:51 - INFO - __main__ -   \n",
      "10/02/2020 10:15:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:51 - INFO - __main__ -   Validation Loss improved! 0.615752719369606 -> 0.6154854254602095\n",
      "10/02/2020 10:15:51 - INFO - __main__ -   Validation Accuracy is :0.6750902527075813\n",
      "10/02/2020 10:15:51 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:51,  1.65s/it]\u001b[A10/02/2020 10:15:52 - INFO - __main__ -   \n",
      "10/02/2020 10:15:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:52 - INFO - __main__ -   Validation Loss improved! 0.6154854254602095 -> 0.6146255451849651\n",
      "10/02/2020 10:15:52 - INFO - __main__ -   Validation Accuracy is :0.6714801444043321\n",
      "10/02/2020 10:15:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.65s/it]\u001b[A10/02/2020 10:15:54 - INFO - __main__ -   \n",
      "10/02/2020 10:15:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:54 - INFO - __main__ -   Validation Loss improved! 0.6146255451849651 -> 0.6126853550384191\n",
      "10/02/2020 10:15:54 - INFO - __main__ -   Validation Accuracy is :0.6642599277978339\n",
      "10/02/2020 10:15:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.65s/it]\u001b[A10/02/2020 10:15:56 - INFO - __main__ -   \n",
      "10/02/2020 10:15:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:56 - INFO - __main__ -   Validation Loss improved! 0.6126853550384191 -> 0.6119192791329394\n",
      "10/02/2020 10:15:56 - INFO - __main__ -   Validation Accuracy is :0.6678700361010831\n",
      "10/02/2020 10:15:56 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.65s/it]\u001b[A10/02/2020 10:15:57 - INFO - __main__ -   \n",
      "10/02/2020 10:15:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:15:57 - INFO - __main__ -   Validation Loss improved! 0.6119192791329394 -> 0.6113529136464915\n",
      "10/02/2020 10:15:57 - INFO - __main__ -   Validation Accuracy is :0.6642599277978339\n",
      "10/02/2020 10:15:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.65s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.65s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.65s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:28<00:36,  1.65s/it]\u001b[A10/02/2020 10:16:07 - INFO - __main__ -   \n",
      "10/02/2020 10:16:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:16:07 - INFO - __main__ -   Validation Loss improved! 0.6113529136464915 -> 0.610916588710964\n",
      "10/02/2020 10:16:07 - INFO - __main__ -   Validation Accuracy is :0.6714801444043321\n",
      "10/02/2020 10:16:07 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.65s/it]\u001b[A10/02/2020 10:16:09 - INFO - __main__ -   \n",
      "10/02/2020 10:16:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:16:09 - INFO - __main__ -   Validation Loss improved! 0.610916588710964 -> 0.6106635895877108\n",
      "10/02/2020 10:16:09 - INFO - __main__ -   Validation Accuracy is :0.6570397111913358\n",
      "10/02/2020 10:16:09 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.65s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.65s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:51<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  50%|███████████████████                   | 3/6 [03:15<03:17, 65.97s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:41,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A10/02/2020 10:17:10 - INFO - __main__ -   \n",
      "10/02/2020 10:17:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:17:10 - INFO - __main__ -   Validation Accuracy improved! 0.6750902527075813 -> 0.6787003610108303\n",
      "10/02/2020 10:17:10 - INFO - __main__ -   Validation Loss is : 0.6181280397766333\n",
      "10/02/2020 10:17:10 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.65s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  67%|█████████████████████████▎            | 4/6 [04:20<02:10, 65.41s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.65s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.65s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:51,  1.65s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.65s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.65s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.65s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.65s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.65s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.65s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:28<00:36,  1.65s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.65s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.65s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.65s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.65s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.65s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:51<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.65s/it]\u001b[A\n",
      "Epoch:  83%|███████████████████████████████▋      | 5/6 [05:24<01:05, 65.05s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:01,  1.65s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.65s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.65s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.65s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.65s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A10/02/2020 10:19:46 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A10/02/2020 10:19:48 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A10/02/2020 10:19:50 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A10/02/2020 10:19:51 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A10/02/2020 10:19:53 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch: 100%|██████████████████████████████████████| 6/6 [06:28<00:00, 64.72s/it]\n",
      "10/02/2020 10:19:54 - INFO - __main__ -   \n",
      "10/02/2020 10:19:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:19:54 - INFO - __main__ -   Validation Loss : 0.6106635895877108 Validation Accuracy : 0.6787003610108303\n",
      "10/02/2020 10:19:54 - INFO - __main__ -   Loss_ACC : 0.6570397111913358\n",
      "10/02/2020 10:19:54 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_RTE.py"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 32,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 10:25:07 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 10:25:09 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 10:25:09 - INFO - src.argument_parser -   random seed = 47057918\n",
      "10/02/2020 10:25:09 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 10:25:09 - INFO - __main__ -   Seed changed to : 47057918\n",
      "10/02/2020 10:25:09 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 10:25:09 - INFO - __main__ -   teacher acc = 79.36, teacher loss = 0.46779\n",
      "10/02/2020 10:25:09 - INFO - __main__ -   teacher acc = 68.95, teacher loss = 0.58605\n",
      "10/02/2020 10:25:09 - INFO - src.nli_data_processing -   Writing example 0 of 2490\n",
      "10/02/2020 10:25:11 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 10:25:11 - INFO - __main__ -     Num examples = 2490\n",
      "10/02/2020 10:25:11 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 10:25:11 - INFO - __main__ -     Num steps = 228\n",
      "10/02/2020 10:25:11 - INFO - src.nli_data_processing -   Writing example 0 of 277\n",
      "10/02/2020 10:25:11 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:25:11 - INFO - __main__ -     Num examples = 277\n",
      "10/02/2020 10:25:11 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:25:11 - INFO - src.nli_data_processing -   Writing example 0 of 3000\n",
      "10/02/2020 10:25:14 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:25:14 - INFO - __main__ -     Num examples = 3000\n",
      "10/02/2020 10:25:14 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:25:14 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 10:25:14 - INFO - src.nli_data_processing -   predicting for RTE\n",
      "10/02/2020 10:25:14 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 10:25:14 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 10:25:15 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 10:25:18 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:25:18 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 10:25:18 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:25:18 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 10:25:18 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 10:25:18 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 10:25:23 - INFO - __main__ -   \n",
      "10/02/2020 10:25:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:23 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.5270758122743683\n",
      "10/02/2020 10:25:23 - INFO - __main__ -   Validation Loss is : 0.7129062074186139\n",
      "10/02/2020 10:25:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:23 - INFO - __main__ -   \n",
      "10/02/2020 10:25:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:23 - INFO - __main__ -   Validation Loss improved! 100 -> 0.7129062074186139\n",
      "10/02/2020 10:25:23 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:25:23 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:05<03:15,  5.15s/it]\u001b[A10/02/2020 10:25:25 - INFO - __main__ -   \n",
      "10/02/2020 10:25:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:25 - INFO - __main__ -   Validation Loss improved! 0.7129062074186139 -> 0.7116972961150352\n",
      "10/02/2020 10:25:25 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:25:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:06<02:31,  4.08s/it]\u001b[A10/02/2020 10:25:27 - INFO - __main__ -   \n",
      "10/02/2020 10:25:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:27 - INFO - __main__ -   Validation Loss improved! 0.7116972961150352 -> 0.7090091326606833\n",
      "10/02/2020 10:25:27 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:25:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:08<02:00,  3.35s/it]\u001b[A10/02/2020 10:25:28 - INFO - __main__ -   \n",
      "10/02/2020 10:25:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:28 - INFO - __main__ -   Validation Loss improved! 0.7090091326606833 -> 0.7050064376114938\n",
      "10/02/2020 10:25:28 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:25:28 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:09<01:38,  2.83s/it]\u001b[A10/02/2020 10:25:30 - INFO - __main__ -   \n",
      "10/02/2020 10:25:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:30 - INFO - __main__ -   Validation Loss improved! 0.7050064376114938 -> 0.7003122185111477\n",
      "10/02/2020 10:25:30 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:25:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:23,  2.47s/it]\u001b[A10/02/2020 10:25:31 - INFO - __main__ -   \n",
      "10/02/2020 10:25:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:31 - INFO - __main__ -   Validation Loss improved! 0.7003122185111477 -> 0.6963633423677851\n",
      "10/02/2020 10:25:31 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:25:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  15%|█████                            | 6/39 [00:13<01:13,  2.22s/it]\u001b[A10/02/2020 10:25:33 - INFO - __main__ -   \n",
      "10/02/2020 10:25:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:33 - INFO - __main__ -   Validation Loss improved! 0.6963633423677851 -> 0.6943240182924787\n",
      "10/02/2020 10:25:33 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:25:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:14<01:05,  2.04s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<00:59,  1.93s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:18<00:55,  1.84s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:19<00:51,  1.79s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:21<00:49,  1.75s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:23<00:46,  1.73s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:24<00:44,  1.72s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:26<00:42,  1.71s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:28<00:41,  1.71s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:30<00:39,  1.71s/it]\u001b[A10/02/2020 10:25:50 - INFO - __main__ -   \n",
      "10/02/2020 10:25:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:50 - INFO - __main__ -   Validation Loss improved! 0.6943240182924787 -> 0.6877892077614685\n",
      "10/02/2020 10:25:50 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 10:25:50 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:31<00:37,  1.72s/it]\u001b[A10/02/2020 10:25:52 - INFO - __main__ -   \n",
      "10/02/2020 10:25:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:52 - INFO - __main__ -   Validation Accuracy improved! 0.5270758122743683 -> 0.5884476534296029\n",
      "10/02/2020 10:25:52 - INFO - __main__ -   Validation Loss is : 0.6826485933379576\n",
      "10/02/2020 10:25:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:52 - INFO - __main__ -   \n",
      "10/02/2020 10:25:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:52 - INFO - __main__ -   Validation Loss improved! 0.6877892077614685 -> 0.6826485933379576\n",
      "10/02/2020 10:25:52 - INFO - __main__ -   Validation Accuracy is :0.5884476534296029\n",
      "10/02/2020 10:25:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:33<00:36,  1.72s/it]\u001b[A10/02/2020 10:25:53 - INFO - __main__ -   \n",
      "10/02/2020 10:25:53 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:53 - INFO - __main__ -   Validation Accuracy improved! 0.5884476534296029 -> 0.592057761732852\n",
      "10/02/2020 10:25:53 - INFO - __main__ -   Validation Loss is : 0.6807224879626332\n",
      "10/02/2020 10:25:53 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:53 - INFO - __main__ -   \n",
      "10/02/2020 10:25:53 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:53 - INFO - __main__ -   Validation Loss improved! 0.6826485933379576 -> 0.6807224879626332\n",
      "10/02/2020 10:25:53 - INFO - __main__ -   Validation Accuracy is :0.592057761732852\n",
      "10/02/2020 10:25:53 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:35<00:34,  1.72s/it]\u001b[A10/02/2020 10:25:55 - INFO - __main__ -   \n",
      "10/02/2020 10:25:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:25:55 - INFO - __main__ -   Validation Loss improved! 0.6807224879626332 -> 0.6806402309706926\n",
      "10/02/2020 10:25:55 - INFO - __main__ -   Validation Accuracy is :0.5523465703971119\n",
      "10/02/2020 10:25:55 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:36<00:32,  1.72s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:38<00:30,  1.71s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:40<00:29,  1.71s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:41<00:27,  1.70s/it]\u001b[A10/02/2020 10:26:02 - INFO - __main__ -   \n",
      "10/02/2020 10:26:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:02 - INFO - __main__ -   Validation Loss improved! 0.6806402309706926 -> 0.6768108051176106\n",
      "10/02/2020 10:26:02 - INFO - __main__ -   Validation Accuracy is :0.5523465703971119\n",
      "10/02/2020 10:26:02 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:43<00:25,  1.69s/it]\u001b[A10/02/2020 10:26:03 - INFO - __main__ -   \n",
      "10/02/2020 10:26:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:03 - INFO - __main__ -   Validation Accuracy improved! 0.592057761732852 -> 0.6173285198555957\n",
      "10/02/2020 10:26:03 - INFO - __main__ -   Validation Loss is : 0.6724035903434891\n",
      "10/02/2020 10:26:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:03 - INFO - __main__ -   \n",
      "10/02/2020 10:26:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:03 - INFO - __main__ -   Validation Loss improved! 0.6768108051176106 -> 0.6724035903434891\n",
      "10/02/2020 10:26:03 - INFO - __main__ -   Validation Accuracy is :0.6173285198555957\n",
      "10/02/2020 10:26:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:45<00:23,  1.69s/it]\u001b[A10/02/2020 10:26:05 - INFO - __main__ -   \n",
      "10/02/2020 10:26:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:05 - INFO - __main__ -   Validation Loss improved! 0.6724035903434891 -> 0.6718189260159159\n",
      "10/02/2020 10:26:05 - INFO - __main__ -   Validation Accuracy is :0.6028880866425993\n",
      "10/02/2020 10:26:05 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:46<00:21,  1.68s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:48<00:20,  1.67s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:50<00:18,  1.67s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:51<00:16,  1.66s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:53<00:14,  1.66s/it]\u001b[A10/02/2020 10:26:13 - INFO - __main__ -   \n",
      "10/02/2020 10:26:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:13 - INFO - __main__ -   Validation Accuracy improved! 0.6173285198555957 -> 0.6462093862815884\n",
      "10/02/2020 10:26:13 - INFO - __main__ -   Validation Loss is : 0.66458318896242\n",
      "10/02/2020 10:26:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:13 - INFO - __main__ -   \n",
      "10/02/2020 10:26:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:13 - INFO - __main__ -   Validation Loss improved! 0.6718189260159159 -> 0.66458318896242\n",
      "10/02/2020 10:26:13 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:26:13 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:55<00:13,  1.65s/it]\u001b[A10/02/2020 10:26:15 - INFO - __main__ -   \n",
      "10/02/2020 10:26:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:15 - INFO - __main__ -   Validation Loss improved! 0.66458318896242 -> 0.6612188876320739\n",
      "10/02/2020 10:26:15 - INFO - __main__ -   Validation Accuracy is :0.6064981949458483\n",
      "10/02/2020 10:26:15 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:56<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:58<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:00<00:08,  1.64s/it]\u001b[A10/02/2020 10:26:20 - INFO - __main__ -   \n",
      "10/02/2020 10:26:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:20 - INFO - __main__ -   Validation Loss improved! 0.6612188876320739 -> 0.6605860782444262\n",
      "10/02/2020 10:26:20 - INFO - __main__ -   Validation Accuracy is :0.592057761732852\n",
      "10/02/2020 10:26:20 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:01<00:06,  1.64s/it]\u001b[A10/02/2020 10:26:22 - INFO - __main__ -   \n",
      "10/02/2020 10:26:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:22 - INFO - __main__ -   Validation Loss improved! 0.6605860782444262 -> 0.6553719930270089\n",
      "10/02/2020 10:26:22 - INFO - __main__ -   Validation Accuracy is :0.6064981949458483\n",
      "10/02/2020 10:26:22 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:03<00:04,  1.64s/it]\u001b[A10/02/2020 10:26:23 - INFO - __main__ -   \n",
      "10/02/2020 10:26:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:23 - INFO - __main__ -   Validation Loss improved! 0.6553719930270089 -> 0.6519996629294936\n",
      "10/02/2020 10:26:23 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:26:23 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:05<00:03,  1.63s/it]\u001b[A10/02/2020 10:26:25 - INFO - __main__ -   \n",
      "10/02/2020 10:26:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:25 - INFO - __main__ -   Validation Accuracy improved! 0.6462093862815884 -> 0.6498194945848376\n",
      "10/02/2020 10:26:25 - INFO - __main__ -   Validation Loss is : 0.6522417206196148\n",
      "10/02/2020 10:26:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:06<00:01,  1.63s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:08<00:00,  1.75s/it]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [01:08<05:41, 68.26s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:01,  1.63s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.63s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:58,  1.63s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:56,  1.63s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.63s/it]\u001b[A10/02/2020 10:26:36 - INFO - __main__ -   \n",
      "10/02/2020 10:26:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:36 - INFO - __main__ -   Validation Accuracy improved! 0.6498194945848376 -> 0.6534296028880866\n",
      "10/02/2020 10:26:36 - INFO - __main__ -   Validation Loss is : 0.6456054281241627\n",
      "10/02/2020 10:26:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:36 - INFO - __main__ -   \n",
      "10/02/2020 10:26:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:36 - INFO - __main__ -   Validation Loss improved! 0.6519996629294936 -> 0.6456054281241627\n",
      "10/02/2020 10:26:36 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 10:26:36 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:53,  1.62s/it]\u001b[A10/02/2020 10:26:38 - INFO - __main__ -   \n",
      "10/02/2020 10:26:38 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:38 - INFO - __main__ -   Validation Loss improved! 0.6456054281241627 -> 0.6411170615185899\n",
      "10/02/2020 10:26:38 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:26:38 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:51,  1.62s/it]\u001b[A10/02/2020 10:26:39 - INFO - __main__ -   \n",
      "10/02/2020 10:26:39 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:39 - INFO - __main__ -   Validation Loss improved! 0.6411170615185899 -> 0.6388392190210226\n",
      "10/02/2020 10:26:39 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:26:39 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.63s/it]\u001b[A10/02/2020 10:26:41 - INFO - __main__ -   \n",
      "10/02/2020 10:26:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:41 - INFO - __main__ -   Validation Loss improved! 0.6388392190210226 -> 0.6372027070083343\n",
      "10/02/2020 10:26:41 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:26:41 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:48,  1.62s/it]\u001b[A10/02/2020 10:26:43 - INFO - __main__ -   \n",
      "10/02/2020 10:26:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:43 - INFO - __main__ -   Validation Loss improved! 0.6372027070083343 -> 0.6362124418954126\n",
      "10/02/2020 10:26:43 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:26:43 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.62s/it]\u001b[A10/02/2020 10:26:44 - INFO - __main__ -   \n",
      "10/02/2020 10:26:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:44 - INFO - __main__ -   Validation Accuracy improved! 0.6534296028880866 -> 0.6570397111913358\n",
      "10/02/2020 10:26:44 - INFO - __main__ -   Validation Loss is : 0.6361956751303552\n",
      "10/02/2020 10:26:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:44 - INFO - __main__ -   \n",
      "10/02/2020 10:26:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:44 - INFO - __main__ -   Validation Loss improved! 0.6362124418954126 -> 0.6361956751303552\n",
      "10/02/2020 10:26:44 - INFO - __main__ -   Validation Accuracy is :0.6570397111913358\n",
      "10/02/2020 10:26:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:45,  1.63s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:43,  1.63s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.63s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:40,  1.63s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.63s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.63s/it]\u001b[A10/02/2020 10:26:54 - INFO - __main__ -   \n",
      "10/02/2020 10:26:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:54 - INFO - __main__ -   Validation Loss improved! 0.6361956751303552 -> 0.6355220092332751\n",
      "10/02/2020 10:26:54 - INFO - __main__ -   Validation Accuracy is :0.6498194945848376\n",
      "10/02/2020 10:26:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:35,  1.63s/it]\u001b[A10/02/2020 10:26:56 - INFO - __main__ -   \n",
      "10/02/2020 10:26:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:56 - INFO - __main__ -   Validation Loss improved! 0.6355220092332751 -> 0.633129502031347\n",
      "10/02/2020 10:26:56 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:26:56 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A10/02/2020 10:26:57 - INFO - __main__ -   \n",
      "10/02/2020 10:26:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:57 - INFO - __main__ -   Validation Loss improved! 0.633129502031347 -> 0.6329808149096768\n",
      "10/02/2020 10:26:57 - INFO - __main__ -   Validation Accuracy is :0.6570397111913358\n",
      "10/02/2020 10:26:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:30<00:32,  1.64s/it]\u001b[A10/02/2020 10:26:59 - INFO - __main__ -   \n",
      "10/02/2020 10:26:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:26:59 - INFO - __main__ -   Validation Accuracy improved! 0.6570397111913358 -> 0.6606498194945848\n",
      "10/02/2020 10:26:59 - INFO - __main__ -   Validation Loss is : 0.6337798149577116\n",
      "10/02/2020 10:26:59 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:35<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A10/02/2020 10:27:06 - INFO - __main__ -   \n",
      "10/02/2020 10:27:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:27:06 - INFO - __main__ -   Validation Loss improved! 0.6329808149096768 -> 0.6322309893391193\n",
      "10/02/2020 10:27:06 - INFO - __main__ -   Validation Accuracy is :0.6498194945848376\n",
      "10/02/2020 10:27:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:40<00:23,  1.65s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.65s/it]\u001b[A10/02/2020 10:27:15 - INFO - __main__ -   \n",
      "10/02/2020 10:27:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:27:15 - INFO - __main__ -   Validation Loss improved! 0.6322309893391193 -> 0.6295276290673211\n",
      "10/02/2020 10:27:15 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:27:15 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A10/02/2020 10:27:17 - INFO - __main__ -   \n",
      "10/02/2020 10:27:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:27:17 - INFO - __main__ -   Validation Loss improved! 0.6295276290673211 -> 0.6276282403443264\n",
      "10/02/2020 10:27:17 - INFO - __main__ -   Validation Accuracy is :0.628158844765343\n",
      "10/02/2020 10:27:17 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.65s/it]\u001b[A10/02/2020 10:27:19 - INFO - __main__ -   \n",
      "10/02/2020 10:27:19 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:27:19 - INFO - __main__ -   Validation Loss improved! 0.6276282403443264 -> 0.6263855269669626\n",
      "10/02/2020 10:27:19 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:27:19 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A10/02/2020 10:27:20 - INFO - __main__ -   \n",
      "10/02/2020 10:27:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:27:20 - INFO - __main__ -   Validation Loss improved! 0.6263855269669626 -> 0.6255405026652753\n",
      "10/02/2020 10:27:20 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:27:20 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A10/02/2020 10:27:22 - INFO - __main__ -   \n",
      "10/02/2020 10:27:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:27:22 - INFO - __main__ -   Validation Loss improved! 0.6255405026652753 -> 0.6253655877784702\n",
      "10/02/2020 10:27:22 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 10:27:22 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:12<04:27, 66.96s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:01,  1.65s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.65s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:56,  1.65s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.65s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:51,  1.65s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.65s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.65s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A10/02/2020 10:28:34 - INFO - __main__ -   \n",
      "10/02/2020 10:28:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:28:34 - INFO - __main__ -   Validation Accuracy improved! 0.6606498194945848 -> 0.6642599277978339\n",
      "10/02/2020 10:28:34 - INFO - __main__ -   Validation Loss is : 0.6526457170286764\n",
      "10/02/2020 10:28:34 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  50%|███████████████████                   | 3/6 [03:16<03:18, 66.07s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.63s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:58,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:45,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:41,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  67%|█████████████████████████▎            | 4/6 [04:20<02:10, 65.44s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.65s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:51,  1.65s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.65s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.65s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.65s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.65s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.65s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.65s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.65s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.65s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.65s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.65s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  83%|███████████████████████████████▋      | 5/6 [05:24<01:05, 65.05s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.65s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.65s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.65s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.65s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.65s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.65s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.65s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A10/02/2020 10:31:39 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A10/02/2020 10:31:40 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A10/02/2020 10:31:42 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A10/02/2020 10:31:44 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A10/02/2020 10:31:45 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch: 100%|██████████████████████████████████████| 6/6 [06:28<00:00, 64.73s/it]\n",
      "10/02/2020 10:31:47 - INFO - __main__ -   \n",
      "10/02/2020 10:31:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:31:47 - INFO - __main__ -   Validation Loss : 0.6253655877784702 Validation Accuracy : 0.6642599277978339\n",
      "10/02/2020 10:31:47 - INFO - __main__ -   Loss_ACC : 0.6425992779783394\n",
      "10/02/2020 10:31:47 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_RTE.py"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 33,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 10:31:48 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 10:31:50 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 10:31:50 - INFO - src.argument_parser -   random seed = 22225538\n",
      "10/02/2020 10:31:50 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 10:31:50 - INFO - __main__ -   Seed changed to : 22225538\n",
      "10/02/2020 10:31:50 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 10:31:50 - INFO - __main__ -   teacher acc = 79.36, teacher loss = 0.46779\n",
      "10/02/2020 10:31:50 - INFO - __main__ -   teacher acc = 68.95, teacher loss = 0.58605\n",
      "10/02/2020 10:31:50 - INFO - src.nli_data_processing -   Writing example 0 of 2490\n",
      "10/02/2020 10:31:52 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 10:31:52 - INFO - __main__ -     Num examples = 2490\n",
      "10/02/2020 10:31:52 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 10:31:52 - INFO - __main__ -     Num steps = 228\n",
      "10/02/2020 10:31:52 - INFO - src.nli_data_processing -   Writing example 0 of 277\n",
      "10/02/2020 10:31:52 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:31:52 - INFO - __main__ -     Num examples = 277\n",
      "10/02/2020 10:31:52 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:31:52 - INFO - src.nli_data_processing -   Writing example 0 of 3000\n",
      "10/02/2020 10:31:55 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:31:55 - INFO - __main__ -     Num examples = 3000\n",
      "10/02/2020 10:31:55 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:31:55 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 10:31:55 - INFO - src.nli_data_processing -   predicting for RTE\n",
      "10/02/2020 10:31:55 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 10:31:55 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 10:31:56 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 10:31:59 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:31:59 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 10:31:59 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:31:59 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 10:31:59 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 10:31:59 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 10:32:04 - INFO - __main__ -   \n",
      "10/02/2020 10:32:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:04 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.4729241877256318\n",
      "10/02/2020 10:32:04 - INFO - __main__ -   Validation Loss is : 0.7074965776519224\n",
      "10/02/2020 10:32:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:04 - INFO - __main__ -   \n",
      "10/02/2020 10:32:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:04 - INFO - __main__ -   Validation Loss improved! 100 -> 0.7074965776519224\n",
      "10/02/2020 10:32:04 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:32:04 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:05<03:17,  5.20s/it]\u001b[A10/02/2020 10:32:06 - INFO - __main__ -   \n",
      "10/02/2020 10:32:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:06 - INFO - __main__ -   Validation Loss improved! 0.7074965776519224 -> 0.7061322277633723\n",
      "10/02/2020 10:32:06 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:32:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:06<02:32,  4.12s/it]\u001b[A10/02/2020 10:32:08 - INFO - __main__ -   \n",
      "10/02/2020 10:32:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:08 - INFO - __main__ -   Validation Loss improved! 0.7061322277633723 -> 0.7030044548778327\n",
      "10/02/2020 10:32:08 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:32:08 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:08<02:01,  3.37s/it]\u001b[A10/02/2020 10:32:09 - INFO - __main__ -   \n",
      "10/02/2020 10:32:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:09 - INFO - __main__ -   Validation Accuracy improved! 0.4729241877256318 -> 0.47653429602888087\n",
      "10/02/2020 10:32:09 - INFO - __main__ -   Validation Loss is : 0.6990680109292592\n",
      "10/02/2020 10:32:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:09 - INFO - __main__ -   \n",
      "10/02/2020 10:32:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:09 - INFO - __main__ -   Validation Loss improved! 0.7030044548778327 -> 0.6990680109292592\n",
      "10/02/2020 10:32:09 - INFO - __main__ -   Validation Accuracy is :0.47653429602888087\n",
      "10/02/2020 10:32:09 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:10<01:39,  2.85s/it]\u001b[A10/02/2020 10:32:11 - INFO - __main__ -   \n",
      "10/02/2020 10:32:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:11 - INFO - __main__ -   Validation Accuracy improved! 0.47653429602888087 -> 0.48375451263537905\n",
      "10/02/2020 10:32:11 - INFO - __main__ -   Validation Loss is : 0.6963353277543822\n",
      "10/02/2020 10:32:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:11 - INFO - __main__ -   \n",
      "10/02/2020 10:32:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:11 - INFO - __main__ -   Validation Loss improved! 0.6990680109292592 -> 0.6963353277543822\n",
      "10/02/2020 10:32:11 - INFO - __main__ -   Validation Accuracy is :0.48375451263537905\n",
      "10/02/2020 10:32:11 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:24,  2.48s/it]\u001b[A10/02/2020 10:32:12 - INFO - __main__ -   \n",
      "10/02/2020 10:32:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:12 - INFO - __main__ -   Validation Accuracy improved! 0.48375451263537905 -> 0.49097472924187724\n",
      "10/02/2020 10:32:12 - INFO - __main__ -   Validation Loss is : 0.6914980643923102\n",
      "10/02/2020 10:32:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:12 - INFO - __main__ -   \n",
      "10/02/2020 10:32:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:12 - INFO - __main__ -   Validation Loss improved! 0.6963353277543822 -> 0.6914980643923102\n",
      "10/02/2020 10:32:12 - INFO - __main__ -   Validation Accuracy is :0.49097472924187724\n",
      "10/02/2020 10:32:12 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  15%|█████                            | 6/39 [00:13<01:13,  2.23s/it]\u001b[A10/02/2020 10:32:14 - INFO - __main__ -   \n",
      "10/02/2020 10:32:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:14 - INFO - __main__ -   Validation Accuracy improved! 0.49097472924187724 -> 0.5090252707581228\n",
      "10/02/2020 10:32:14 - INFO - __main__ -   Validation Loss is : 0.6873462312057991\n",
      "10/02/2020 10:32:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:14 - INFO - __main__ -   \n",
      "10/02/2020 10:32:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:14 - INFO - __main__ -   Validation Loss improved! 0.6914980643923102 -> 0.6873462312057991\n",
      "10/02/2020 10:32:14 - INFO - __main__ -   Validation Accuracy is :0.5090252707581228\n",
      "10/02/2020 10:32:14 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:14<01:05,  2.05s/it]\u001b[A10/02/2020 10:32:16 - INFO - __main__ -   \n",
      "10/02/2020 10:32:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:16 - INFO - __main__ -   Validation Accuracy improved! 0.5090252707581228 -> 0.555956678700361\n",
      "10/02/2020 10:32:16 - INFO - __main__ -   Validation Loss is : 0.6834895137415036\n",
      "10/02/2020 10:32:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:16 - INFO - __main__ -   \n",
      "10/02/2020 10:32:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:16 - INFO - __main__ -   Validation Loss improved! 0.6873462312057991 -> 0.6834895137415036\n",
      "10/02/2020 10:32:16 - INFO - __main__ -   Validation Accuracy is :0.555956678700361\n",
      "10/02/2020 10:32:16 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<00:59,  1.93s/it]\u001b[A10/02/2020 10:32:17 - INFO - __main__ -   \n",
      "10/02/2020 10:32:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:17 - INFO - __main__ -   Validation Accuracy improved! 0.555956678700361 -> 0.5667870036101083\n",
      "10/02/2020 10:32:17 - INFO - __main__ -   Validation Loss is : 0.6813339136998146\n",
      "10/02/2020 10:32:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:17 - INFO - __main__ -   \n",
      "10/02/2020 10:32:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:17 - INFO - __main__ -   Validation Loss improved! 0.6834895137415036 -> 0.6813339136998146\n",
      "10/02/2020 10:32:17 - INFO - __main__ -   Validation Accuracy is :0.5667870036101083\n",
      "10/02/2020 10:32:17 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:18<00:55,  1.85s/it]\u001b[A10/02/2020 10:32:19 - INFO - __main__ -   \n",
      "10/02/2020 10:32:19 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:19 - INFO - __main__ -   Validation Accuracy improved! 0.5667870036101083 -> 0.5884476534296029\n",
      "10/02/2020 10:32:19 - INFO - __main__ -   Validation Loss is : 0.6797589877021872\n",
      "10/02/2020 10:32:19 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:19 - INFO - __main__ -   \n",
      "10/02/2020 10:32:19 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:19 - INFO - __main__ -   Validation Loss improved! 0.6813339136998146 -> 0.6797589877021872\n",
      "10/02/2020 10:32:19 - INFO - __main__ -   Validation Accuracy is :0.5884476534296029\n",
      "10/02/2020 10:32:19 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:19<00:52,  1.80s/it]\u001b[A10/02/2020 10:32:21 - INFO - __main__ -   \n",
      "10/02/2020 10:32:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:21 - INFO - __main__ -   Validation Accuracy improved! 0.5884476534296029 -> 0.6028880866425993\n",
      "10/02/2020 10:32:21 - INFO - __main__ -   Validation Loss is : 0.6782632304442918\n",
      "10/02/2020 10:32:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:21 - INFO - __main__ -   \n",
      "10/02/2020 10:32:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:21 - INFO - __main__ -   Validation Loss improved! 0.6797589877021872 -> 0.6782632304442918\n",
      "10/02/2020 10:32:21 - INFO - __main__ -   Validation Accuracy is :0.6028880866425993\n",
      "10/02/2020 10:32:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:21<00:49,  1.76s/it]\u001b[A10/02/2020 10:32:22 - INFO - __main__ -   \n",
      "10/02/2020 10:32:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:22 - INFO - __main__ -   Validation Accuracy improved! 0.6028880866425993 -> 0.6064981949458483\n",
      "10/02/2020 10:32:22 - INFO - __main__ -   Validation Loss is : 0.6769001165451987\n",
      "10/02/2020 10:32:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:22 - INFO - __main__ -   \n",
      "10/02/2020 10:32:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:22 - INFO - __main__ -   Validation Loss improved! 0.6782632304442918 -> 0.6769001165451987\n",
      "10/02/2020 10:32:22 - INFO - __main__ -   Validation Accuracy is :0.6064981949458483\n",
      "10/02/2020 10:32:22 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:23<00:46,  1.74s/it]\u001b[A10/02/2020 10:32:24 - INFO - __main__ -   \n",
      "10/02/2020 10:32:24 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:24 - INFO - __main__ -   Validation Loss improved! 0.6769001165451987 -> 0.6755326267614261\n",
      "10/02/2020 10:32:24 - INFO - __main__ -   Validation Accuracy is :0.5740072202166066\n",
      "10/02/2020 10:32:24 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:24<00:44,  1.72s/it]\u001b[A10/02/2020 10:32:26 - INFO - __main__ -   \n",
      "10/02/2020 10:32:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:26 - INFO - __main__ -   Validation Loss improved! 0.6755326267614261 -> 0.6741155218131275\n",
      "10/02/2020 10:32:26 - INFO - __main__ -   Validation Accuracy is :0.6028880866425993\n",
      "10/02/2020 10:32:26 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:26<00:42,  1.72s/it]\u001b[A10/02/2020 10:32:28 - INFO - __main__ -   \n",
      "10/02/2020 10:32:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:28 - INFO - __main__ -   Validation Accuracy improved! 0.6064981949458483 -> 0.6173285198555957\n",
      "10/02/2020 10:32:28 - INFO - __main__ -   Validation Loss is : 0.6725904003377425\n",
      "10/02/2020 10:32:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:28 - INFO - __main__ -   \n",
      "10/02/2020 10:32:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:28 - INFO - __main__ -   Validation Loss improved! 0.6741155218131275 -> 0.6725904003377425\n",
      "10/02/2020 10:32:28 - INFO - __main__ -   Validation Accuracy is :0.6173285198555957\n",
      "10/02/2020 10:32:28 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  38%|████████████▎                   | 15/39 [00:28<00:41,  1.71s/it]\u001b[A10/02/2020 10:32:29 - INFO - __main__ -   \n",
      "10/02/2020 10:32:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:29 - INFO - __main__ -   Validation Loss improved! 0.6725904003377425 -> 0.6713474631739867\n",
      "10/02/2020 10:32:29 - INFO - __main__ -   Validation Accuracy is :0.5956678700361011\n",
      "10/02/2020 10:32:29 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:30<00:39,  1.71s/it]\u001b[A10/02/2020 10:32:31 - INFO - __main__ -   \n",
      "10/02/2020 10:32:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:31 - INFO - __main__ -   Validation Loss improved! 0.6713474631739867 -> 0.6697205464332112\n",
      "10/02/2020 10:32:31 - INFO - __main__ -   Validation Accuracy is :0.6137184115523465\n",
      "10/02/2020 10:32:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:31<00:37,  1.72s/it]\u001b[A10/02/2020 10:32:33 - INFO - __main__ -   \n",
      "10/02/2020 10:32:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:33 - INFO - __main__ -   Validation Loss improved! 0.6697205464332112 -> 0.6680249857988598\n",
      "10/02/2020 10:32:33 - INFO - __main__ -   Validation Accuracy is :0.5992779783393501\n",
      "10/02/2020 10:32:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:33<00:36,  1.72s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:35<00:34,  1.71s/it]\u001b[A10/02/2020 10:32:36 - INFO - __main__ -   \n",
      "10/02/2020 10:32:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:36 - INFO - __main__ -   Validation Loss improved! 0.6680249857988598 -> 0.6642299645213874\n",
      "10/02/2020 10:32:36 - INFO - __main__ -   Validation Accuracy is :0.592057761732852\n",
      "10/02/2020 10:32:36 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:36<00:32,  1.71s/it]\u001b[A10/02/2020 10:32:38 - INFO - __main__ -   \n",
      "10/02/2020 10:32:38 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:38 - INFO - __main__ -   Validation Accuracy improved! 0.6173285198555957 -> 0.6209386281588448\n",
      "10/02/2020 10:32:38 - INFO - __main__ -   Validation Loss is : 0.6575534008039895\n",
      "10/02/2020 10:32:38 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:38 - INFO - __main__ -   \n",
      "10/02/2020 10:32:38 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:38 - INFO - __main__ -   Validation Loss improved! 0.6642299645213874 -> 0.6575534008039895\n",
      "10/02/2020 10:32:38 - INFO - __main__ -   Validation Accuracy is :0.6209386281588448\n",
      "10/02/2020 10:32:38 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:38<00:30,  1.71s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:40<00:28,  1.71s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:42<00:27,  1.70s/it]\u001b[A10/02/2020 10:32:43 - INFO - __main__ -   \n",
      "10/02/2020 10:32:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:43 - INFO - __main__ -   Validation Loss improved! 0.6575534008039895 -> 0.6561299403221599\n",
      "10/02/2020 10:32:43 - INFO - __main__ -   Validation Accuracy is :0.6137184115523465\n",
      "10/02/2020 10:32:43 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:43<00:25,  1.69s/it]\u001b[A10/02/2020 10:32:44 - INFO - __main__ -   \n",
      "10/02/2020 10:32:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:44 - INFO - __main__ -   Validation Accuracy improved! 0.6209386281588448 -> 0.6389891696750902\n",
      "10/02/2020 10:32:44 - INFO - __main__ -   Validation Loss is : 0.6475769001654339\n",
      "10/02/2020 10:32:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:44 - INFO - __main__ -   \n",
      "10/02/2020 10:32:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:44 - INFO - __main__ -   Validation Loss improved! 0.6561299403221599 -> 0.6475769001654339\n",
      "10/02/2020 10:32:44 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:32:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:45<00:23,  1.68s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:47<00:21,  1.68s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:48<00:20,  1.67s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:50<00:18,  1.67s/it]\u001b[A10/02/2020 10:32:51 - INFO - __main__ -   \n",
      "10/02/2020 10:32:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:51 - INFO - __main__ -   Validation Accuracy improved! 0.6389891696750902 -> 0.6425992779783394\n",
      "10/02/2020 10:32:51 - INFO - __main__ -   Validation Loss is : 0.6427141940120326\n",
      "10/02/2020 10:32:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:51 - INFO - __main__ -   \n",
      "10/02/2020 10:32:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:51 - INFO - __main__ -   Validation Loss improved! 0.6475769001654339 -> 0.6427141940120326\n",
      "10/02/2020 10:32:51 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 10:32:51 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:52<00:16,  1.66s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:53<00:14,  1.66s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:55<00:13,  1.65s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:56<00:11,  1.65s/it]\u001b[A10/02/2020 10:32:58 - INFO - __main__ -   \n",
      "10/02/2020 10:32:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:58 - INFO - __main__ -   Validation Loss improved! 0.6427141940120326 -> 0.6399803505907851\n",
      "10/02/2020 10:32:58 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 10:32:58 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:58<00:09,  1.65s/it]\u001b[A10/02/2020 10:32:59 - INFO - __main__ -   \n",
      "10/02/2020 10:32:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:59 - INFO - __main__ -   Validation Accuracy improved! 0.6425992779783394 -> 0.6462093862815884\n",
      "10/02/2020 10:32:59 - INFO - __main__ -   Validation Loss is : 0.6339461364470664\n",
      "10/02/2020 10:32:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:59 - INFO - __main__ -   \n",
      "10/02/2020 10:32:59 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:32:59 - INFO - __main__ -   Validation Loss improved! 0.6399803505907851 -> 0.6339461364470664\n",
      "10/02/2020 10:32:59 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:32:59 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:00<00:08,  1.64s/it]\u001b[A10/02/2020 10:33:01 - INFO - __main__ -   \n",
      "10/02/2020 10:33:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:01 - INFO - __main__ -   Validation Loss improved! 0.6339461364470664 -> 0.6332676987355367\n",
      "10/02/2020 10:33:01 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:33:01 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:01<00:06,  1.64s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:03<00:04,  1.63s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:05<00:03,  1.63s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:06<00:01,  1.63s/it]\u001b[A10/02/2020 10:33:07 - INFO - __main__ -   \n",
      "10/02/2020 10:33:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:07 - INFO - __main__ -   Validation Loss improved! 0.6332676987355367 -> 0.6294210585446134\n",
      "10/02/2020 10:33:07 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:33:07 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:08<00:00,  1.75s/it]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [01:08<05:41, 68.32s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A10/02/2020 10:33:09 - INFO - __main__ -   \n",
      "10/02/2020 10:33:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:09 - INFO - __main__ -   Validation Loss improved! 0.6294210585446134 -> 0.6269669549990217\n",
      "10/02/2020 10:33:09 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 10:33:09 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:01,  1.62s/it]\u001b[A10/02/2020 10:33:11 - INFO - __main__ -   \n",
      "10/02/2020 10:33:11 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:11 - INFO - __main__ -   Validation Loss improved! 0.6269669549990217 -> 0.6260168130647404\n",
      "10/02/2020 10:33:11 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:33:11 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<00:59,  1.62s/it]\u001b[A10/02/2020 10:33:12 - INFO - __main__ -   \n",
      "10/02/2020 10:33:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:12 - INFO - __main__ -   Validation Loss improved! 0.6260168130647404 -> 0.6252410833585994\n",
      "10/02/2020 10:33:12 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:33:12 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:58,  1.62s/it]\u001b[A10/02/2020 10:33:14 - INFO - __main__ -   \n",
      "10/02/2020 10:33:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:14 - INFO - __main__ -   Validation Loss improved! 0.6252410833585994 -> 0.6248808285819925\n",
      "10/02/2020 10:33:14 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:33:14 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:56,  1.62s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.62s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:53,  1.62s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:51,  1.62s/it]\u001b[A10/02/2020 10:33:20 - INFO - __main__ -   \n",
      "10/02/2020 10:33:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:20 - INFO - __main__ -   Validation Accuracy improved! 0.6462093862815884 -> 0.6498194945848376\n",
      "10/02/2020 10:33:20 - INFO - __main__ -   Validation Loss is : 0.6359432303087806\n",
      "10/02/2020 10:33:20 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:12<00:50,  1.63s/it]\u001b[A10/02/2020 10:33:22 - INFO - __main__ -   \n",
      "10/02/2020 10:33:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:22 - INFO - __main__ -   Validation Accuracy improved! 0.6498194945848376 -> 0.6534296028880866\n",
      "10/02/2020 10:33:22 - INFO - __main__ -   Validation Loss is : 0.6334801925218493\n",
      "10/02/2020 10:33:22 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:48,  1.63s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.63s/it]\u001b[A10/02/2020 10:33:25 - INFO - __main__ -   \n",
      "10/02/2020 10:33:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:25 - INFO - __main__ -   Validation Loss improved! 0.6248808285819925 -> 0.624379154577152\n",
      "10/02/2020 10:33:25 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:33:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:45,  1.63s/it]\u001b[A10/02/2020 10:33:27 - INFO - __main__ -   \n",
      "10/02/2020 10:33:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:27 - INFO - __main__ -   Validation Loss improved! 0.624379154577152 -> 0.6232539359412899\n",
      "10/02/2020 10:33:27 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 10:33:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.63s/it]\u001b[A10/02/2020 10:33:29 - INFO - __main__ -   \n",
      "10/02/2020 10:33:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:29 - INFO - __main__ -   Validation Accuracy improved! 0.6534296028880866 -> 0.6570397111913358\n",
      "10/02/2020 10:33:29 - INFO - __main__ -   Validation Loss is : 0.6241938456731583\n",
      "10/02/2020 10:33:29 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.63s/it]\u001b[A10/02/2020 10:33:30 - INFO - __main__ -   \n",
      "10/02/2020 10:33:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:30 - INFO - __main__ -   Validation Accuracy improved! 0.6570397111913358 -> 0.6606498194945848\n",
      "10/02/2020 10:33:30 - INFO - __main__ -   Validation Loss is : 0.625271205006954\n",
      "10/02/2020 10:33:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:40,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.65s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A10/02/2020 10:33:43 - INFO - __main__ -   \n",
      "10/02/2020 10:33:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:33:43 - INFO - __main__ -   Validation Accuracy improved! 0.6606498194945848 -> 0.6678700361010831\n",
      "10/02/2020 10:33:43 - INFO - __main__ -   Validation Loss is : 0.6317035144847223\n",
      "10/02/2020 10:33:43 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  56%|██████████████████              | 22/39 [00:35<00:28,  1.65s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.65s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:40<00:23,  1.65s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.65s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.65s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:12<04:28, 67.02s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.65s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:56,  1.65s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A10/02/2020 10:34:30 - INFO - __main__ -   \n",
      "10/02/2020 10:34:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:34:30 - INFO - __main__ -   Validation Loss improved! 0.6232539359412899 -> 0.6221003825053412\n",
      "10/02/2020 10:34:30 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 10:34:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.64s/it]\u001b[A10/02/2020 10:34:31 - INFO - __main__ -   \n",
      "10/02/2020 10:34:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:34:31 - INFO - __main__ -   Validation Loss improved! 0.6221003825053412 -> 0.6202677750845679\n",
      "10/02/2020 10:34:31 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:34:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A10/02/2020 10:34:41 - INFO - __main__ -   \n",
      "10/02/2020 10:34:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:34:41 - INFO - __main__ -   Validation Loss improved! 0.6202677750845679 -> 0.6164718139042493\n",
      "10/02/2020 10:34:41 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 10:34:41 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A10/02/2020 10:34:43 - INFO - __main__ -   \n",
      "10/02/2020 10:34:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:34:43 - INFO - __main__ -   Validation Loss improved! 0.6164718139042493 -> 0.6134732518385464\n",
      "10/02/2020 10:34:43 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 10:34:43 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A10/02/2020 10:34:44 - INFO - __main__ -   \n",
      "10/02/2020 10:34:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:34:44 - INFO - __main__ -   Validation Loss improved! 0.6134732518385464 -> 0.6133914561908598\n",
      "10/02/2020 10:34:44 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 10:34:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  50%|███████████████████                   | 3/6 [03:16<03:18, 66.12s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:45,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:40,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.65s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.65s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.65s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.66s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  67%|█████████████████████████▎            | 4/6 [04:20<02:11, 65.52s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:01,  1.65s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.65s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:56,  1.65s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.65s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:51,  1.65s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.65s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.65s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.65s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.65s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.65s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.65s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:28<00:36,  1.65s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.65s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.65s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.65s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:28,  1.65s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.65s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.65s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.65s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:51<00:13,  1.65s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:56<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.65s/it]\u001b[A\n",
      "Epoch:  83%|███████████████████████████████▋      | 5/6 [05:24<01:05, 65.13s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A10/02/2020 10:38:20 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A10/02/2020 10:38:22 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A10/02/2020 10:38:23 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A10/02/2020 10:38:25 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A10/02/2020 10:38:27 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch: 100%|██████████████████████████████████████| 6/6 [06:28<00:00, 64.79s/it]\n",
      "10/02/2020 10:38:28 - INFO - __main__ -   \n",
      "10/02/2020 10:38:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:38:28 - INFO - __main__ -   Validation Loss : 0.6133914561908598 Validation Accuracy : 0.6678700361010831\n",
      "10/02/2020 10:38:28 - INFO - __main__ -   Loss_ACC : 0.6425992779783394\n",
      "10/02/2020 10:38:28 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_RTE.py"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 35,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 10:43:36 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 10:43:37 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 10:43:37 - INFO - src.argument_parser -   random seed = 8790863\n",
      "10/02/2020 10:43:37 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 10:43:37 - INFO - __main__ -   Seed changed to : 8790863\n",
      "10/02/2020 10:43:37 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 10:43:37 - INFO - __main__ -   teacher acc = 79.36, teacher loss = 0.46779\n",
      "10/02/2020 10:43:38 - INFO - __main__ -   teacher acc = 68.95, teacher loss = 0.58605\n",
      "10/02/2020 10:43:38 - INFO - src.nli_data_processing -   Writing example 0 of 2490\n",
      "10/02/2020 10:43:40 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 10:43:40 - INFO - __main__ -     Num examples = 2490\n",
      "10/02/2020 10:43:40 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 10:43:40 - INFO - __main__ -     Num steps = 228\n",
      "10/02/2020 10:43:40 - INFO - src.nli_data_processing -   Writing example 0 of 277\n",
      "10/02/2020 10:43:40 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:43:40 - INFO - __main__ -     Num examples = 277\n",
      "10/02/2020 10:43:40 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:43:40 - INFO - src.nli_data_processing -   Writing example 0 of 3000\n",
      "10/02/2020 10:43:42 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:43:42 - INFO - __main__ -     Num examples = 3000\n",
      "10/02/2020 10:43:42 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:43:42 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 10:43:42 - INFO - src.nli_data_processing -   predicting for RTE\n",
      "10/02/2020 10:43:42 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 10:43:42 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 10:43:44 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 10:43:47 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:43:47 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 10:43:47 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:43:47 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 10:43:47 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 10:43:47 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 10:43:52 - INFO - __main__ -   \n",
      "10/02/2020 10:43:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:43:52 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.4729241877256318\n",
      "10/02/2020 10:43:52 - INFO - __main__ -   Validation Loss is : 0.7484906055436668\n",
      "10/02/2020 10:43:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:43:52 - INFO - __main__ -   \n",
      "10/02/2020 10:43:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:43:52 - INFO - __main__ -   Validation Loss improved! 100 -> 0.7484906055436668\n",
      "10/02/2020 10:43:52 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:43:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:05<03:16,  5.18s/it]\u001b[A10/02/2020 10:43:53 - INFO - __main__ -   \n",
      "10/02/2020 10:43:53 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:43:53 - INFO - __main__ -   Validation Loss improved! 0.7484906055436668 -> 0.7463268596773113\n",
      "10/02/2020 10:43:53 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:43:53 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:06<02:31,  4.11s/it]\u001b[A10/02/2020 10:43:55 - INFO - __main__ -   \n",
      "10/02/2020 10:43:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:43:55 - INFO - __main__ -   Validation Loss improved! 0.7463268596773113 -> 0.7412176820775662\n",
      "10/02/2020 10:43:55 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:43:55 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:08<02:00,  3.36s/it]\u001b[A10/02/2020 10:43:57 - INFO - __main__ -   \n",
      "10/02/2020 10:43:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:43:57 - INFO - __main__ -   Validation Loss improved! 0.7412176820775662 -> 0.7334133588880408\n",
      "10/02/2020 10:43:57 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:43:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:10<01:39,  2.84s/it]\u001b[A10/02/2020 10:43:58 - INFO - __main__ -   \n",
      "10/02/2020 10:43:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:43:58 - INFO - __main__ -   Validation Loss improved! 0.7334133588880408 -> 0.7244631701858465\n",
      "10/02/2020 10:43:58 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:43:58 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:24,  2.47s/it]\u001b[A10/02/2020 10:44:00 - INFO - __main__ -   \n",
      "10/02/2020 10:44:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:00 - INFO - __main__ -   Validation Loss improved! 0.7244631701858465 -> 0.7136233085329352\n",
      "10/02/2020 10:44:00 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:44:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  15%|█████                            | 6/39 [00:13<01:13,  2.22s/it]\u001b[A10/02/2020 10:44:02 - INFO - __main__ -   \n",
      "10/02/2020 10:44:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:02 - INFO - __main__ -   Validation Loss improved! 0.7136233085329352 -> 0.7032817847461907\n",
      "10/02/2020 10:44:02 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:44:02 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:14<01:05,  2.05s/it]\u001b[A10/02/2020 10:44:03 - INFO - __main__ -   \n",
      "10/02/2020 10:44:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:03 - INFO - __main__ -   Validation Accuracy improved! 0.4729241877256318 -> 0.48375451263537905\n",
      "10/02/2020 10:44:03 - INFO - __main__ -   Validation Loss is : 0.6940397510459707\n",
      "10/02/2020 10:44:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:03 - INFO - __main__ -   \n",
      "10/02/2020 10:44:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:03 - INFO - __main__ -   Validation Loss improved! 0.7032817847461907 -> 0.6940397510459707\n",
      "10/02/2020 10:44:03 - INFO - __main__ -   Validation Accuracy is :0.48375451263537905\n",
      "10/02/2020 10:44:03 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  21%|██████▊                          | 8/39 [00:16<00:59,  1.93s/it]\u001b[A10/02/2020 10:44:05 - INFO - __main__ -   \n",
      "10/02/2020 10:44:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:05 - INFO - __main__ -   Validation Accuracy improved! 0.48375451263537905 -> 0.5306859205776173\n",
      "10/02/2020 10:44:05 - INFO - __main__ -   Validation Loss is : 0.6884483516431458\n",
      "10/02/2020 10:44:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:05 - INFO - __main__ -   \n",
      "10/02/2020 10:44:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:05 - INFO - __main__ -   Validation Loss improved! 0.6940397510459707 -> 0.6884483516431458\n",
      "10/02/2020 10:44:05 - INFO - __main__ -   Validation Accuracy is :0.5306859205776173\n",
      "10/02/2020 10:44:05 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:18<00:55,  1.85s/it]\u001b[A10/02/2020 10:44:07 - INFO - __main__ -   \n",
      "10/02/2020 10:44:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:07 - INFO - __main__ -   Validation Accuracy improved! 0.5306859205776173 -> 0.5415162454873647\n",
      "10/02/2020 10:44:07 - INFO - __main__ -   Validation Loss is : 0.6862235912777457\n",
      "10/02/2020 10:44:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:07 - INFO - __main__ -   \n",
      "10/02/2020 10:44:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:07 - INFO - __main__ -   Validation Loss improved! 0.6884483516431458 -> 0.6862235912777457\n",
      "10/02/2020 10:44:07 - INFO - __main__ -   Validation Accuracy is :0.5415162454873647\n",
      "10/02/2020 10:44:07 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:19<00:51,  1.79s/it]\u001b[A10/02/2020 10:44:08 - INFO - __main__ -   \n",
      "10/02/2020 10:44:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:08 - INFO - __main__ -   Validation Accuracy improved! 0.5415162454873647 -> 0.5487364620938628\n",
      "10/02/2020 10:44:08 - INFO - __main__ -   Validation Loss is : 0.6870644910240862\n",
      "10/02/2020 10:44:08 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:21<00:49,  1.75s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:23<00:46,  1.73s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:24<00:44,  1.72s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:26<00:42,  1.71s/it]\u001b[A10/02/2020 10:44:15 - INFO - __main__ -   \n",
      "10/02/2020 10:44:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:15 - INFO - __main__ -   Validation Accuracy improved! 0.5487364620938628 -> 0.5523465703971119\n",
      "10/02/2020 10:44:15 - INFO - __main__ -   Validation Loss is : 0.6858014371851291\n",
      "10/02/2020 10:44:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:15 - INFO - __main__ -   \n",
      "10/02/2020 10:44:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:15 - INFO - __main__ -   Validation Loss improved! 0.6862235912777457 -> 0.6858014371851291\n",
      "10/02/2020 10:44:15 - INFO - __main__ -   Validation Accuracy is :0.5523465703971119\n",
      "10/02/2020 10:44:15 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:28<00:40,  1.71s/it]\u001b[A10/02/2020 10:44:17 - INFO - __main__ -   \n",
      "10/02/2020 10:44:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:17 - INFO - __main__ -   Validation Accuracy improved! 0.5523465703971119 -> 0.555956678700361\n",
      "10/02/2020 10:44:17 - INFO - __main__ -   Validation Loss is : 0.6818924673197502\n",
      "10/02/2020 10:44:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:17 - INFO - __main__ -   \n",
      "10/02/2020 10:44:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:17 - INFO - __main__ -   Validation Loss improved! 0.6858014371851291 -> 0.6818924673197502\n",
      "10/02/2020 10:44:17 - INFO - __main__ -   Validation Accuracy is :0.555956678700361\n",
      "10/02/2020 10:44:17 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:30<00:39,  1.71s/it]\u001b[A10/02/2020 10:44:18 - INFO - __main__ -   \n",
      "10/02/2020 10:44:18 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:18 - INFO - __main__ -   Validation Accuracy improved! 0.555956678700361 -> 0.5776173285198556\n",
      "10/02/2020 10:44:18 - INFO - __main__ -   Validation Loss is : 0.6787310300751284\n",
      "10/02/2020 10:44:18 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:18 - INFO - __main__ -   \n",
      "10/02/2020 10:44:18 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:18 - INFO - __main__ -   Validation Loss improved! 0.6818924673197502 -> 0.6787310300751284\n",
      "10/02/2020 10:44:18 - INFO - __main__ -   Validation Accuracy is :0.5776173285198556\n",
      "10/02/2020 10:44:18 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:31<00:37,  1.71s/it]\u001b[A10/02/2020 10:44:20 - INFO - __main__ -   \n",
      "10/02/2020 10:44:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:20 - INFO - __main__ -   Validation Accuracy improved! 0.5776173285198556 -> 0.5884476534296029\n",
      "10/02/2020 10:44:20 - INFO - __main__ -   Validation Loss is : 0.676823464541659\n",
      "10/02/2020 10:44:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:20 - INFO - __main__ -   \n",
      "10/02/2020 10:44:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:20 - INFO - __main__ -   Validation Loss improved! 0.6787310300751284 -> 0.676823464541659\n",
      "10/02/2020 10:44:20 - INFO - __main__ -   Validation Accuracy is :0.5884476534296029\n",
      "10/02/2020 10:44:20 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:33<00:36,  1.72s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:35<00:34,  1.72s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:36<00:32,  1.72s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:38<00:30,  1.71s/it]\u001b[A10/02/2020 10:44:27 - INFO - __main__ -   \n",
      "10/02/2020 10:44:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:27 - INFO - __main__ -   Validation Loss improved! 0.676823464541659 -> 0.6736220728189076\n",
      "10/02/2020 10:44:27 - INFO - __main__ -   Validation Accuracy is :0.5631768953068592\n",
      "10/02/2020 10:44:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:40<00:29,  1.71s/it]\u001b[A10/02/2020 10:44:29 - INFO - __main__ -   \n",
      "10/02/2020 10:44:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:29 - INFO - __main__ -   Validation Accuracy improved! 0.5884476534296029 -> 0.6101083032490975\n",
      "10/02/2020 10:44:29 - INFO - __main__ -   Validation Loss is : 0.6677703065562335\n",
      "10/02/2020 10:44:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:29 - INFO - __main__ -   \n",
      "10/02/2020 10:44:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:29 - INFO - __main__ -   Validation Loss improved! 0.6736220728189076 -> 0.6677703065562335\n",
      "10/02/2020 10:44:29 - INFO - __main__ -   Validation Accuracy is :0.6101083032490975\n",
      "10/02/2020 10:44:29 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  59%|██████████████████▊             | 23/39 [00:41<00:27,  1.70s/it]\u001b[A10/02/2020 10:44:30 - INFO - __main__ -   \n",
      "10/02/2020 10:44:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:30 - INFO - __main__ -   Validation Loss improved! 0.6677703065562335 -> 0.6640920880038815\n",
      "10/02/2020 10:44:30 - INFO - __main__ -   Validation Accuracy is :0.6064981949458483\n",
      "10/02/2020 10:44:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:43<00:25,  1.70s/it]\u001b[A10/02/2020 10:44:32 - INFO - __main__ -   \n",
      "10/02/2020 10:44:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:32 - INFO - __main__ -   Validation Loss improved! 0.6640920880038815 -> 0.6622026491681591\n",
      "10/02/2020 10:44:32 - INFO - __main__ -   Validation Accuracy is :0.6064981949458483\n",
      "10/02/2020 10:44:32 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:45<00:23,  1.69s/it]\u001b[A10/02/2020 10:44:34 - INFO - __main__ -   \n",
      "10/02/2020 10:44:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:34 - INFO - __main__ -   Validation Accuracy improved! 0.6101083032490975 -> 0.6137184115523465\n",
      "10/02/2020 10:44:34 - INFO - __main__ -   Validation Loss is : 0.6602705118888552\n",
      "10/02/2020 10:44:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:34 - INFO - __main__ -   \n",
      "10/02/2020 10:44:34 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:34 - INFO - __main__ -   Validation Loss improved! 0.6622026491681591 -> 0.6602705118888552\n",
      "10/02/2020 10:44:34 - INFO - __main__ -   Validation Accuracy is :0.6137184115523465\n",
      "10/02/2020 10:44:34 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:47<00:21,  1.68s/it]\u001b[A10/02/2020 10:44:35 - INFO - __main__ -   \n",
      "10/02/2020 10:44:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:35 - INFO - __main__ -   Validation Accuracy improved! 0.6137184115523465 -> 0.6209386281588448\n",
      "10/02/2020 10:44:35 - INFO - __main__ -   Validation Loss is : 0.6588294204822086\n",
      "10/02/2020 10:44:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:35 - INFO - __main__ -   \n",
      "10/02/2020 10:44:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:35 - INFO - __main__ -   Validation Loss improved! 0.6602705118888552 -> 0.6588294204822086\n",
      "10/02/2020 10:44:35 - INFO - __main__ -   Validation Accuracy is :0.6209386281588448\n",
      "10/02/2020 10:44:35 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:48<00:20,  1.68s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:50<00:18,  1.67s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:51<00:16,  1.66s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:53<00:14,  1.66s/it]\u001b[A10/02/2020 10:44:42 - INFO - __main__ -   \n",
      "10/02/2020 10:44:42 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:42 - INFO - __main__ -   Validation Accuracy improved! 0.6209386281588448 -> 0.628158844765343\n",
      "10/02/2020 10:44:42 - INFO - __main__ -   Validation Loss is : 0.6552004934648314\n",
      "10/02/2020 10:44:42 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:42 - INFO - __main__ -   \n",
      "10/02/2020 10:44:42 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:42 - INFO - __main__ -   Validation Loss improved! 0.6588294204822086 -> 0.6552004934648314\n",
      "10/02/2020 10:44:42 - INFO - __main__ -   Validation Accuracy is :0.628158844765343\n",
      "10/02/2020 10:44:42 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:55<00:13,  1.65s/it]\u001b[A10/02/2020 10:44:44 - INFO - __main__ -   \n",
      "10/02/2020 10:44:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:44 - INFO - __main__ -   Validation Loss improved! 0.6552004934648314 -> 0.6542110408686559\n",
      "10/02/2020 10:44:44 - INFO - __main__ -   Validation Accuracy is :0.628158844765343\n",
      "10/02/2020 10:44:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:56<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:58<00:09,  1.64s/it]\u001b[A10/02/2020 10:44:47 - INFO - __main__ -   \n",
      "10/02/2020 10:44:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:47 - INFO - __main__ -   Validation Accuracy improved! 0.628158844765343 -> 0.631768953068592\n",
      "10/02/2020 10:44:47 - INFO - __main__ -   Validation Loss is : 0.6526752292894714\n",
      "10/02/2020 10:44:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:47 - INFO - __main__ -   \n",
      "10/02/2020 10:44:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:47 - INFO - __main__ -   Validation Loss improved! 0.6542110408686559 -> 0.6526752292894714\n",
      "10/02/2020 10:44:47 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 10:44:47 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:00<00:08,  1.64s/it]\u001b[A10/02/2020 10:44:48 - INFO - __main__ -   \n",
      "10/02/2020 10:44:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:48 - INFO - __main__ -   Validation Accuracy improved! 0.631768953068592 -> 0.6462093862815884\n",
      "10/02/2020 10:44:48 - INFO - __main__ -   Validation Loss is : 0.6488409834217939\n",
      "10/02/2020 10:44:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:48 - INFO - __main__ -   \n",
      "10/02/2020 10:44:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:48 - INFO - __main__ -   Validation Loss improved! 0.6526752292894714 -> 0.6488409834217939\n",
      "10/02/2020 10:44:48 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:44:48 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:01<00:06,  1.64s/it]\u001b[A10/02/2020 10:44:50 - INFO - __main__ -   \n",
      "10/02/2020 10:44:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:50 - INFO - __main__ -   Validation Loss improved! 0.6488409834217939 -> 0.6483727878611871\n",
      "10/02/2020 10:44:50 - INFO - __main__ -   Validation Accuracy is :0.6137184115523465\n",
      "10/02/2020 10:44:50 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:03<00:04,  1.63s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:05<00:03,  1.63s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:06<00:01,  1.63s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:08<00:00,  1.75s/it]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [01:08<05:41, 68.26s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A10/02/2020 10:44:57 - INFO - __main__ -   \n",
      "10/02/2020 10:44:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:57 - INFO - __main__ -   Validation Loss improved! 0.6483727878611871 -> 0.645527788017631\n",
      "10/02/2020 10:44:57 - INFO - __main__ -   Validation Accuracy is :0.6209386281588448\n",
      "10/02/2020 10:44:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:01,  1.62s/it]\u001b[A10/02/2020 10:44:58 - INFO - __main__ -   \n",
      "10/02/2020 10:44:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:44:58 - INFO - __main__ -   Validation Loss improved! 0.645527788017631 -> 0.640081560568689\n",
      "10/02/2020 10:44:58 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 10:44:58 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<00:59,  1.62s/it]\u001b[A10/02/2020 10:45:00 - INFO - __main__ -   \n",
      "10/02/2020 10:45:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:00 - INFO - __main__ -   Validation Loss improved! 0.640081560568689 -> 0.637602850848587\n",
      "10/02/2020 10:45:00 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:45:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:58,  1.62s/it]\u001b[A10/02/2020 10:45:01 - INFO - __main__ -   \n",
      "10/02/2020 10:45:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:01 - INFO - __main__ -   Validation Accuracy improved! 0.6462093862815884 -> 0.6534296028880866\n",
      "10/02/2020 10:45:01 - INFO - __main__ -   Validation Loss is : 0.6364247738669495\n",
      "10/02/2020 10:45:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:01 - INFO - __main__ -   \n",
      "10/02/2020 10:45:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:01 - INFO - __main__ -   Validation Loss improved! 0.637602850848587 -> 0.6364247738669495\n",
      "10/02/2020 10:45:01 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 10:45:01 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:56,  1.62s/it]\u001b[A10/02/2020 10:45:03 - INFO - __main__ -   \n",
      "10/02/2020 10:45:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:03 - INFO - __main__ -   Validation Loss improved! 0.6364247738669495 -> 0.6332998998759025\n",
      "10/02/2020 10:45:03 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 10:45:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.62s/it]\u001b[A10/02/2020 10:45:05 - INFO - __main__ -   \n",
      "10/02/2020 10:45:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:05 - INFO - __main__ -   Validation Loss improved! 0.6332998998759025 -> 0.6320843231807116\n",
      "10/02/2020 10:45:05 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 10:45:05 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:53,  1.62s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:51,  1.62s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:12<00:50,  1.62s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:48,  1.62s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.62s/it]\u001b[A10/02/2020 10:45:13 - INFO - __main__ -   \n",
      "10/02/2020 10:45:13 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:13 - INFO - __main__ -   Validation Loss improved! 0.6320843231807116 -> 0.6265436131170941\n",
      "10/02/2020 10:45:13 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:45:13 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:45,  1.62s/it]\u001b[A10/02/2020 10:45:14 - INFO - __main__ -   \n",
      "10/02/2020 10:45:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:14 - INFO - __main__ -   Validation Loss improved! 0.6265436131170941 -> 0.6230352553219571\n",
      "10/02/2020 10:45:14 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:45:14 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:43,  1.63s/it]\u001b[A10/02/2020 10:45:16 - INFO - __main__ -   \n",
      "10/02/2020 10:45:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:16 - INFO - __main__ -   Validation Accuracy improved! 0.6534296028880866 -> 0.6570397111913358\n",
      "10/02/2020 10:45:16 - INFO - __main__ -   Validation Loss is : 0.6221524696487812\n",
      "10/02/2020 10:45:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:16 - INFO - __main__ -   \n",
      "10/02/2020 10:45:16 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:16 - INFO - __main__ -   Validation Loss improved! 0.6230352553219571 -> 0.6221524696487812\n",
      "10/02/2020 10:45:16 - INFO - __main__ -   Validation Accuracy is :0.6570397111913358\n",
      "10/02/2020 10:45:16 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.63s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:40,  1.63s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.63s/it]\u001b[A10/02/2020 10:45:21 - INFO - __main__ -   \n",
      "10/02/2020 10:45:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:21 - INFO - __main__ -   Validation Loss improved! 0.6221524696487812 -> 0.6214246095733091\n",
      "10/02/2020 10:45:21 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 10:45:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:30<00:32,  1.65s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.65s/it]\u001b[A10/02/2020 10:45:29 - INFO - __main__ -   \n",
      "10/02/2020 10:45:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:29 - INFO - __main__ -   Validation Accuracy improved! 0.6570397111913358 -> 0.6642599277978339\n",
      "10/02/2020 10:45:29 - INFO - __main__ -   Validation Loss is : 0.626817410603327\n",
      "10/02/2020 10:45:29 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:35<00:28,  1.65s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.65s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:40<00:23,  1.65s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.65s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.65s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A10/02/2020 10:45:52 - INFO - __main__ -   \n",
      "10/02/2020 10:45:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:45:52 - INFO - __main__ -   Validation Accuracy improved! 0.6642599277978339 -> 0.6678700361010831\n",
      "10/02/2020 10:45:52 - INFO - __main__ -   Validation Loss is : 0.6467523437114399\n",
      "10/02/2020 10:45:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:12<04:27, 66.96s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:45,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A10/02/2020 10:46:25 - INFO - __main__ -   \n",
      "10/02/2020 10:46:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:46:25 - INFO - __main__ -   Validation Loss improved! 0.6214246095733091 -> 0.6212687785014349\n",
      "10/02/2020 10:46:25 - INFO - __main__ -   Validation Accuracy is :0.6606498194945848\n",
      "10/02/2020 10:46:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.65s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A10/02/2020 10:46:33 - INFO - __main__ -   \n",
      "10/02/2020 10:46:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:46:33 - INFO - __main__ -   Validation Loss improved! 0.6212687785014349 -> 0.6205860681912528\n",
      "10/02/2020 10:46:33 - INFO - __main__ -   Validation Accuracy is :0.6606498194945848\n",
      "10/02/2020 10:46:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A10/02/2020 10:46:35 - INFO - __main__ -   \n",
      "10/02/2020 10:46:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:46:35 - INFO - __main__ -   Validation Loss improved! 0.6205860681912528 -> 0.6202241573953456\n",
      "10/02/2020 10:46:35 - INFO - __main__ -   Validation Accuracy is :0.6606498194945848\n",
      "10/02/2020 10:46:35 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  50%|███████████████████                   | 3/6 [03:16<03:18, 66.09s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:45,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:41,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  67%|█████████████████████████▎            | 4/6 [04:20<02:10, 65.45s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A10/02/2020 10:48:12 - INFO - __main__ -   \n",
      "10/02/2020 10:48:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:48:12 - INFO - __main__ -   Validation Accuracy improved! 0.6678700361010831 -> 0.6714801444043321\n",
      "10/02/2020 10:48:12 - INFO - __main__ -   Validation Loss is : 0.6469117106083068\n",
      "10/02/2020 10:48:12 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.65s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.65s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.65s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.65s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.65s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:28,  1.65s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.65s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.65s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.65s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  83%|███████████████████████████████▋      | 5/6 [05:24<01:05, 65.06s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.65s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.65s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:56,  1.65s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.65s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:51,  1.65s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.65s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.65s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.65s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.65s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.65s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A10/02/2020 10:50:07 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A10/02/2020 10:50:09 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A10/02/2020 10:50:11 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A10/02/2020 10:50:12 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A10/02/2020 10:50:14 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch: 100%|██████████████████████████████████████| 6/6 [06:28<00:00, 64.74s/it]\n",
      "10/02/2020 10:50:15 - INFO - __main__ -   \n",
      "10/02/2020 10:50:15 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:15 - INFO - __main__ -   Validation Loss : 0.6202241573953456 Validation Accuracy : 0.6714801444043321\n",
      "10/02/2020 10:50:15 - INFO - __main__ -   Loss_ACC : 0.6606498194945848\n",
      "10/02/2020 10:50:15 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_RTE.py"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 36,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 10:50:17 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 10:50:18 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 10:50:18 - INFO - src.argument_parser -   random seed = 91584870\n",
      "10/02/2020 10:50:18 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 10:50:18 - INFO - __main__ -   Seed changed to : 91584870\n",
      "10/02/2020 10:50:18 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 10:50:19 - INFO - __main__ -   teacher acc = 79.36, teacher loss = 0.46779\n",
      "10/02/2020 10:50:19 - INFO - __main__ -   teacher acc = 68.95, teacher loss = 0.58605\n",
      "10/02/2020 10:50:19 - INFO - src.nli_data_processing -   Writing example 0 of 2490\n",
      "10/02/2020 10:50:21 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 10:50:21 - INFO - __main__ -     Num examples = 2490\n",
      "10/02/2020 10:50:21 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 10:50:21 - INFO - __main__ -     Num steps = 228\n",
      "10/02/2020 10:50:21 - INFO - src.nli_data_processing -   Writing example 0 of 277\n",
      "10/02/2020 10:50:21 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:50:21 - INFO - __main__ -     Num examples = 277\n",
      "10/02/2020 10:50:21 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:50:21 - INFO - src.nli_data_processing -   Writing example 0 of 3000\n",
      "10/02/2020 10:50:23 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 10:50:23 - INFO - __main__ -     Num examples = 3000\n",
      "10/02/2020 10:50:23 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 10:50:23 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 10:50:23 - INFO - src.nli_data_processing -   predicting for RTE\n",
      "10/02/2020 10:50:23 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 10:50:23 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 10:50:25 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 10:50:28 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:50:28 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 10:50:28 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 10:50:28 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 10:50:28 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 10:50:28 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 10:50:33 - INFO - __main__ -   \n",
      "10/02/2020 10:50:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:33 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.5234657039711191\n",
      "10/02/2020 10:50:33 - INFO - __main__ -   Validation Loss is : 0.7028819955212975\n",
      "10/02/2020 10:50:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:33 - INFO - __main__ -   \n",
      "10/02/2020 10:50:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:33 - INFO - __main__ -   Validation Loss improved! 100 -> 0.7028819955212975\n",
      "10/02/2020 10:50:33 - INFO - __main__ -   Validation Accuracy is :0.5234657039711191\n",
      "10/02/2020 10:50:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:05<03:20,  5.27s/it]\u001b[A10/02/2020 10:50:35 - INFO - __main__ -   \n",
      "10/02/2020 10:50:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:35 - INFO - __main__ -   Validation Loss improved! 0.7028819955212975 -> 0.7024285354338828\n",
      "10/02/2020 10:50:35 - INFO - __main__ -   Validation Accuracy is :0.51985559566787\n",
      "10/02/2020 10:50:35 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:06<02:34,  4.17s/it]\u001b[A10/02/2020 10:50:36 - INFO - __main__ -   \n",
      "10/02/2020 10:50:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:36 - INFO - __main__ -   Validation Loss improved! 0.7024285354338828 -> 0.7016014725722991\n",
      "10/02/2020 10:50:36 - INFO - __main__ -   Validation Accuracy is :0.51985559566787\n",
      "10/02/2020 10:50:36 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:08<02:02,  3.40s/it]\u001b[A10/02/2020 10:50:38 - INFO - __main__ -   \n",
      "10/02/2020 10:50:38 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:38 - INFO - __main__ -   Validation Loss improved! 0.7016014725722991 -> 0.700728712529482\n",
      "10/02/2020 10:50:38 - INFO - __main__ -   Validation Accuracy is :0.516245487364621\n",
      "10/02/2020 10:50:38 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:10<01:40,  2.87s/it]\u001b[A10/02/2020 10:50:40 - INFO - __main__ -   \n",
      "10/02/2020 10:50:40 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:40 - INFO - __main__ -   Validation Loss improved! 0.700728712529482 -> 0.700697891979011\n",
      "10/02/2020 10:50:40 - INFO - __main__ -   Validation Accuracy is :0.4657039711191336\n",
      "10/02/2020 10:50:40 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:24,  2.49s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:13<01:13,  2.23s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:14<01:05,  2.05s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<00:59,  1.93s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:18<00:55,  1.85s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:19<00:51,  1.79s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:21<00:49,  1.76s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:23<00:46,  1.73s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:24<00:44,  1.72s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:26<00:42,  1.71s/it]\u001b[A10/02/2020 10:50:56 - INFO - __main__ -   \n",
      "10/02/2020 10:50:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:56 - INFO - __main__ -   Validation Loss improved! 0.700697891979011 -> 0.6980008872407438\n",
      "10/02/2020 10:50:56 - INFO - __main__ -   Validation Accuracy is :0.4729241877256318\n",
      "10/02/2020 10:50:56 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:28<00:41,  1.71s/it]\u001b[A10/02/2020 10:50:58 - INFO - __main__ -   \n",
      "10/02/2020 10:50:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:58 - INFO - __main__ -   Validation Accuracy improved! 0.5234657039711191 -> 0.5342960288808665\n",
      "10/02/2020 10:50:58 - INFO - __main__ -   Validation Loss is : 0.690489087294155\n",
      "10/02/2020 10:50:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:58 - INFO - __main__ -   \n",
      "10/02/2020 10:50:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:50:58 - INFO - __main__ -   Validation Loss improved! 0.6980008872407438 -> 0.690489087294155\n",
      "10/02/2020 10:50:58 - INFO - __main__ -   Validation Accuracy is :0.5342960288808665\n",
      "10/02/2020 10:50:58 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  41%|█████████████▏                  | 16/39 [00:30<00:39,  1.71s/it]\u001b[A10/02/2020 10:51:00 - INFO - __main__ -   \n",
      "10/02/2020 10:51:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:00 - INFO - __main__ -   Validation Accuracy improved! 0.5342960288808665 -> 0.5667870036101083\n",
      "10/02/2020 10:51:00 - INFO - __main__ -   Validation Loss is : 0.6863699475780721\n",
      "10/02/2020 10:51:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:00 - INFO - __main__ -   \n",
      "10/02/2020 10:51:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:00 - INFO - __main__ -   Validation Loss improved! 0.690489087294155 -> 0.6863699475780721\n",
      "10/02/2020 10:51:00 - INFO - __main__ -   Validation Accuracy is :0.5667870036101083\n",
      "10/02/2020 10:51:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:31<00:37,  1.71s/it]\u001b[A10/02/2020 10:51:01 - INFO - __main__ -   \n",
      "10/02/2020 10:51:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:01 - INFO - __main__ -   Validation Loss improved! 0.6863699475780721 -> 0.6860064358487457\n",
      "10/02/2020 10:51:01 - INFO - __main__ -   Validation Accuracy is :0.5379061371841155\n",
      "10/02/2020 10:51:01 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:33<00:35,  1.71s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:35<00:34,  1.71s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:36<00:32,  1.71s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:38<00:30,  1.71s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:40<00:28,  1.71s/it]\u001b[A10/02/2020 10:51:10 - INFO - __main__ -   \n",
      "10/02/2020 10:51:10 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:10 - INFO - __main__ -   Validation Loss improved! 0.6860064358487457 -> 0.681839343873172\n",
      "10/02/2020 10:51:10 - INFO - __main__ -   Validation Accuracy is :0.5667870036101083\n",
      "10/02/2020 10:51:10 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:42<00:27,  1.70s/it]\u001b[A10/02/2020 10:51:12 - INFO - __main__ -   \n",
      "10/02/2020 10:51:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:12 - INFO - __main__ -   Validation Loss improved! 0.681839343873172 -> 0.6801259285276117\n",
      "10/02/2020 10:51:12 - INFO - __main__ -   Validation Accuracy is :0.5595667870036101\n",
      "10/02/2020 10:51:12 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:43<00:25,  1.69s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:45<00:23,  1.69s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:47<00:21,  1.68s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:48<00:20,  1.67s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:50<00:18,  1.67s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:52<00:16,  1.66s/it]\u001b[A10/02/2020 10:51:21 - INFO - __main__ -   \n",
      "10/02/2020 10:51:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:21 - INFO - __main__ -   Validation Loss improved! 0.6801259285276117 -> 0.6791235996067309\n",
      "10/02/2020 10:51:21 - INFO - __main__ -   Validation Accuracy is :0.5342960288808665\n",
      "10/02/2020 10:51:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:53<00:14,  1.66s/it]\u001b[A10/02/2020 10:51:23 - INFO - __main__ -   \n",
      "10/02/2020 10:51:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:23 - INFO - __main__ -   Validation Accuracy improved! 0.5667870036101083 -> 0.5703971119133574\n",
      "10/02/2020 10:51:23 - INFO - __main__ -   Validation Loss is : 0.6679131408030375\n",
      "10/02/2020 10:51:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:23 - INFO - __main__ -   \n",
      "10/02/2020 10:51:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:23 - INFO - __main__ -   Validation Loss improved! 0.6791235996067309 -> 0.6679131408030375\n",
      "10/02/2020 10:51:23 - INFO - __main__ -   Validation Accuracy is :0.5703971119133574\n",
      "10/02/2020 10:51:23 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:55<00:13,  1.65s/it]\u001b[A10/02/2020 10:51:25 - INFO - __main__ -   \n",
      "10/02/2020 10:51:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:25 - INFO - __main__ -   Validation Accuracy improved! 0.5703971119133574 -> 0.592057761732852\n",
      "10/02/2020 10:51:25 - INFO - __main__ -   Validation Loss is : 0.6665739789336167\n",
      "10/02/2020 10:51:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:25 - INFO - __main__ -   \n",
      "10/02/2020 10:51:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:25 - INFO - __main__ -   Validation Loss improved! 0.6679131408030375 -> 0.6665739789336167\n",
      "10/02/2020 10:51:25 - INFO - __main__ -   Validation Accuracy is :0.592057761732852\n",
      "10/02/2020 10:51:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:56<00:11,  1.65s/it]\u001b[A10/02/2020 10:51:26 - INFO - __main__ -   \n",
      "10/02/2020 10:51:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:26 - INFO - __main__ -   Validation Accuracy improved! 0.592057761732852 -> 0.5956678700361011\n",
      "10/02/2020 10:51:26 - INFO - __main__ -   Validation Loss is : 0.6701920264894782\n",
      "10/02/2020 10:51:26 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:58<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:00<00:08,  1.64s/it]\u001b[A10/02/2020 10:51:30 - INFO - __main__ -   \n",
      "10/02/2020 10:51:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:30 - INFO - __main__ -   Validation Accuracy improved! 0.5956678700361011 -> 0.6028880866425993\n",
      "10/02/2020 10:51:30 - INFO - __main__ -   Validation Loss is : 0.6618242676938053\n",
      "10/02/2020 10:51:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:30 - INFO - __main__ -   \n",
      "10/02/2020 10:51:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:30 - INFO - __main__ -   Validation Loss improved! 0.6665739789336167 -> 0.6618242676938053\n",
      "10/02/2020 10:51:30 - INFO - __main__ -   Validation Accuracy is :0.6028880866425993\n",
      "10/02/2020 10:51:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:01<00:06,  1.64s/it]\u001b[A10/02/2020 10:51:31 - INFO - __main__ -   \n",
      "10/02/2020 10:51:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:31 - INFO - __main__ -   Validation Accuracy improved! 0.6028880866425993 -> 0.6209386281588448\n",
      "10/02/2020 10:51:31 - INFO - __main__ -   Validation Loss is : 0.6572123950999567\n",
      "10/02/2020 10:51:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:31 - INFO - __main__ -   \n",
      "10/02/2020 10:51:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:31 - INFO - __main__ -   Validation Loss improved! 0.6618242676938053 -> 0.6572123950999567\n",
      "10/02/2020 10:51:31 - INFO - __main__ -   Validation Accuracy is :0.6209386281588448\n",
      "10/02/2020 10:51:31 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:03<00:04,  1.63s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:05<00:03,  1.63s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:06<00:01,  1.63s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:08<00:00,  1.75s/it]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [01:08<05:41, 68.32s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:01,  1.63s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.62s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:58,  1.62s/it]\u001b[A10/02/2020 10:51:43 - INFO - __main__ -   \n",
      "10/02/2020 10:51:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:43 - INFO - __main__ -   Validation Accuracy improved! 0.6209386281588448 -> 0.628158844765343\n",
      "10/02/2020 10:51:43 - INFO - __main__ -   Validation Loss is : 0.652075447330406\n",
      "10/02/2020 10:51:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:43 - INFO - __main__ -   \n",
      "10/02/2020 10:51:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:43 - INFO - __main__ -   Validation Loss improved! 0.6572123950999567 -> 0.652075447330406\n",
      "10/02/2020 10:51:43 - INFO - __main__ -   Validation Accuracy is :0.628158844765343\n",
      "10/02/2020 10:51:43 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:56,  1.62s/it]\u001b[A10/02/2020 10:51:44 - INFO - __main__ -   \n",
      "10/02/2020 10:51:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:44 - INFO - __main__ -   Validation Accuracy improved! 0.628158844765343 -> 0.631768953068592\n",
      "10/02/2020 10:51:44 - INFO - __main__ -   Validation Loss is : 0.6483157911885946\n",
      "10/02/2020 10:51:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:44 - INFO - __main__ -   \n",
      "10/02/2020 10:51:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:44 - INFO - __main__ -   Validation Loss improved! 0.652075447330406 -> 0.6483157911885946\n",
      "10/02/2020 10:51:44 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 10:51:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.62s/it]\u001b[A10/02/2020 10:51:46 - INFO - __main__ -   \n",
      "10/02/2020 10:51:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:46 - INFO - __main__ -   Validation Loss improved! 0.6483157911885946 -> 0.6444727832229559\n",
      "10/02/2020 10:51:46 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 10:51:46 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:53,  1.62s/it]\u001b[A10/02/2020 10:51:47 - INFO - __main__ -   \n",
      "10/02/2020 10:51:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:47 - INFO - __main__ -   Validation Loss improved! 0.6444727832229559 -> 0.6419506417284804\n",
      "10/02/2020 10:51:47 - INFO - __main__ -   Validation Accuracy is :0.6245487364620939\n",
      "10/02/2020 10:51:47 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:51,  1.62s/it]\u001b[A10/02/2020 10:51:49 - INFO - __main__ -   \n",
      "10/02/2020 10:51:49 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:49 - INFO - __main__ -   Validation Loss improved! 0.6419506417284804 -> 0.640077398141799\n",
      "10/02/2020 10:51:49 - INFO - __main__ -   Validation Accuracy is :0.628158844765343\n",
      "10/02/2020 10:51:49 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:12<00:50,  1.62s/it]\u001b[A10/02/2020 10:51:51 - INFO - __main__ -   \n",
      "10/02/2020 10:51:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:51 - INFO - __main__ -   Validation Loss improved! 0.640077398141799 -> 0.6382523099437948\n",
      "10/02/2020 10:51:51 - INFO - __main__ -   Validation Accuracy is :0.6209386281588448\n",
      "10/02/2020 10:51:51 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:48,  1.62s/it]\u001b[A10/02/2020 10:51:52 - INFO - __main__ -   \n",
      "10/02/2020 10:51:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:51:52 - INFO - __main__ -   Validation Loss improved! 0.6382523099437948 -> 0.6372655668843954\n",
      "10/02/2020 10:51:52 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 10:51:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.63s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:45,  1.63s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:43,  1.63s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.63s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:40,  1.63s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.63s/it]\u001b[A10/02/2020 10:52:02 - INFO - __main__ -   \n",
      "10/02/2020 10:52:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:02 - INFO - __main__ -   Validation Loss improved! 0.6372655668843954 -> 0.635735704580369\n",
      "10/02/2020 10:52:02 - INFO - __main__ -   Validation Accuracy is :0.6101083032490975\n",
      "10/02/2020 10:52:02 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.63s/it]\u001b[A10/02/2020 10:52:04 - INFO - __main__ -   \n",
      "10/02/2020 10:52:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:04 - INFO - __main__ -   Validation Accuracy improved! 0.631768953068592 -> 0.6353790613718412\n",
      "10/02/2020 10:52:04 - INFO - __main__ -   Validation Loss is : 0.6345505180772031\n",
      "10/02/2020 10:52:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:04 - INFO - __main__ -   \n",
      "10/02/2020 10:52:04 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:04 - INFO - __main__ -   Validation Loss improved! 0.635735704580369 -> 0.6345505180772031\n",
      "10/02/2020 10:52:04 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:52:04 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A10/02/2020 10:52:05 - INFO - __main__ -   \n",
      "10/02/2020 10:52:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:05 - INFO - __main__ -   Validation Accuracy improved! 0.6353790613718412 -> 0.6425992779783394\n",
      "10/02/2020 10:52:05 - INFO - __main__ -   Validation Loss is : 0.6339983303194011\n",
      "10/02/2020 10:52:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:05 - INFO - __main__ -   \n",
      "10/02/2020 10:52:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:05 - INFO - __main__ -   Validation Loss improved! 0.6345505180772031 -> 0.6339983303194011\n",
      "10/02/2020 10:52:05 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 10:52:05 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A10/02/2020 10:52:07 - INFO - __main__ -   \n",
      "10/02/2020 10:52:07 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:07 - INFO - __main__ -   Validation Loss improved! 0.6339983303194011 -> 0.6335884190638573\n",
      "10/02/2020 10:52:07 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 10:52:07 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:30<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:35<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.65s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:40<00:23,  1.65s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.65s/it]\u001b[A10/02/2020 10:52:25 - INFO - __main__ -   \n",
      "10/02/2020 10:52:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:25 - INFO - __main__ -   Validation Loss improved! 0.6335884190638573 -> 0.6312144861324599\n",
      "10/02/2020 10:52:25 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:52:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A10/02/2020 10:52:27 - INFO - __main__ -   \n",
      "10/02/2020 10:52:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:27 - INFO - __main__ -   Validation Loss improved! 0.6312144861324599 -> 0.6304204885710017\n",
      "10/02/2020 10:52:27 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 10:52:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.65s/it]\u001b[A10/02/2020 10:52:29 - INFO - __main__ -   \n",
      "10/02/2020 10:52:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:29 - INFO - __main__ -   Validation Loss improved! 0.6304204885710017 -> 0.6299252217427057\n",
      "10/02/2020 10:52:29 - INFO - __main__ -   Validation Accuracy is :0.6245487364620939\n",
      "10/02/2020 10:52:29 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A10/02/2020 10:52:30 - INFO - __main__ -   \n",
      "10/02/2020 10:52:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:30 - INFO - __main__ -   Validation Loss improved! 0.6299252217427057 -> 0.6296630129487075\n",
      "10/02/2020 10:52:30 - INFO - __main__ -   Validation Accuracy is :0.6209386281588448\n",
      "10/02/2020 10:52:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A10/02/2020 10:52:32 - INFO - __main__ -   \n",
      "10/02/2020 10:52:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:32 - INFO - __main__ -   Validation Loss improved! 0.6296630129487075 -> 0.6291487139484943\n",
      "10/02/2020 10:52:32 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 10:52:32 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:12<04:28, 67.01s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.65s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A10/02/2020 10:52:47 - INFO - __main__ -   \n",
      "10/02/2020 10:52:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:47 - INFO - __main__ -   Validation Accuracy improved! 0.6425992779783394 -> 0.6498194945848376\n",
      "10/02/2020 10:52:47 - INFO - __main__ -   Validation Loss is : 0.6375267634753286\n",
      "10/02/2020 10:52:47 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.65s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.65s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A10/02/2020 10:52:53 - INFO - __main__ -   \n",
      "10/02/2020 10:52:53 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:52:53 - INFO - __main__ -   Validation Loss improved! 0.6291487139484943 -> 0.6290759868139825\n",
      "10/02/2020 10:52:53 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 10:52:53 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:45,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:40,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  50%|███████████████████                   | 3/6 [03:16<03:18, 66.10s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.65s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.65s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.65s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:51,  1.65s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.65s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:46,  1.65s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.65s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.65s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.65s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.65s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.65s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.65s/it]\u001b[A10/02/2020 10:54:14 - INFO - __main__ -   \n",
      "10/02/2020 10:54:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:54:14 - INFO - __main__ -   Validation Accuracy improved! 0.6498194945848376 -> 0.6642599277978339\n",
      "10/02/2020 10:54:14 - INFO - __main__ -   Validation Loss is : 0.6361604167235887\n",
      "10/02/2020 10:54:14 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.65s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.65s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.65s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.65s/it]\u001b[A10/02/2020 10:54:20 - INFO - __main__ -   \n",
      "10/02/2020 10:54:20 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:54:20 - INFO - __main__ -   Validation Loss improved! 0.6290759868139825 -> 0.6279731468173141\n",
      "10/02/2020 10:54:20 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 10:54:20 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:28,  1.65s/it]\u001b[A10/02/2020 10:54:22 - INFO - __main__ -   \n",
      "10/02/2020 10:54:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:54:22 - INFO - __main__ -   Validation Loss improved! 0.6279731468173141 -> 0.626231899330332\n",
      "10/02/2020 10:54:22 - INFO - __main__ -   Validation Accuracy is :0.6498194945848376\n",
      "10/02/2020 10:54:22 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.65s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.65s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:23,  1.65s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.65s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.65s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.65s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.65s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:51<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.65s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.65s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.65s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.65s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.65s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.65s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.65s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.65s/it]\u001b[A\n",
      "Epoch:  67%|█████████████████████████▎            | 4/6 [04:20<02:11, 65.53s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.65s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.65s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.65s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.65s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:45,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:23<00:41,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:46<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.64s/it]\u001b[A\n",
      "Epoch:  83%|███████████████████████████████▋      | 5/6 [05:24<01:05, 65.08s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:02,  1.64s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<01:00,  1.64s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:59,  1.64s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:57,  1.64s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:55,  1.64s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:54,  1.64s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:52,  1.64s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:13<00:50,  1.64s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:49,  1.64s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:47,  1.64s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:18<00:45,  1.64s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:44,  1.64s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:21<00:42,  1.64s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:41,  1.64s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:39,  1.64s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:26<00:37,  1.64s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:36,  1.64s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:29<00:34,  1.64s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:31<00:32,  1.64s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:31,  1.64s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:34<00:29,  1.64s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:36<00:27,  1.64s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:37<00:26,  1.64s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:39<00:24,  1.64s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:41<00:22,  1.64s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:42<00:21,  1.64s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:44<00:19,  1.64s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:45<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:47<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:49<00:14,  1.64s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:50<00:13,  1.64s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:52<00:11,  1.64s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:54<00:09,  1.64s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:55<00:08,  1.64s/it]\u001b[A10/02/2020 10:56:48 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:57<00:06,  1.64s/it]\u001b[A10/02/2020 10:56:50 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:59<00:04,  1.64s/it]\u001b[A10/02/2020 10:56:52 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:00<00:03,  1.64s/it]\u001b[A10/02/2020 10:56:53 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:02<00:01,  1.64s/it]\u001b[A10/02/2020 10:56:55 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:03<00:00,  1.64s/it]\u001b[A\n",
      "Epoch: 100%|██████████████████████████████████████| 6/6 [06:28<00:00, 64.73s/it]\n",
      "10/02/2020 10:56:56 - INFO - __main__ -   \n",
      "10/02/2020 10:56:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 10:56:56 - INFO - __main__ -   Validation Loss : 0.626231899330332 Validation Accuracy : 0.6642599277978339\n",
      "10/02/2020 10:56:56 - INFO - __main__ -   Loss_ACC : 0.6498194945848376\n",
      "10/02/2020 10:56:56 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_RTE.py"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [3] PTP pretraining with SPS-student"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Settings:[seed = 12585932 / train_batch_size = 64/ learning_rate = 2e-5 / epoch = 6]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 21,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 09:49:40 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 09:49:41 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 09:49:41 - INFO - src.argument_parser -   random seed = 6237840\n",
      "10/02/2020 09:49:41 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 09:49:41 - INFO - __main__ -   Seed changed to : 12585932\n",
      "10/02/2020 09:49:43 - INFO - __main__ -   \n",
      "10/02/2020 09:49:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:43 - INFO - __main__ -   PTP_label.eq(0).sum() = tensor(545)\n",
      "10/02/2020 09:49:43 - INFO - __main__ -   PTP_label.eq(1).sum() = tensor(1431)\n",
      "10/02/2020 09:49:43 - INFO - __main__ -   PTP_label.eq(2).sum() = tensor(279)\n",
      "10/02/2020 09:49:43 - INFO - __main__ -   PTP_label.eq(3).sum() = tensor(235)\n",
      "10/02/2020 09:49:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:43 - INFO - src.nli_data_processing -   Initializing model for pretraining\n",
      "10/02/2020 09:49:43 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 09:49:43 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 09:49:44 - INFO - src.utils -   delete 104 layers, keep 103 layers\n",
      "10/02/2020 09:49:47 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 09:49:47 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 09:49:47 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 09:49:47 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 09:49:47 - INFO - __main__ -   num parameters in student model are 66955008 and 3076\n",
      "10/02/2020 09:49:47 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A10/02/2020 09:49:52 - INFO - __main__ -   \n",
      "10/02/2020 09:49:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:52 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.11913357400722022\n",
      "10/02/2020 09:49:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:52 - INFO - __main__ -   \n",
      "10/02/2020 09:49:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:52 - INFO - __main__ -   Validation improved! 100 -> 1.4560875410637701\n",
      "10/02/2020 09:49:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:05<03:12,  5.06s/it]\u001b[A10/02/2020 09:49:54 - INFO - __main__ -   \n",
      "10/02/2020 09:49:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:54 - INFO - __main__ -   Validation improved! 1.4560875410637701 -> 1.4457609429686509\n",
      "10/02/2020 09:49:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:06<02:28,  4.01s/it]\u001b[A10/02/2020 09:49:55 - INFO - __main__ -   \n",
      "10/02/2020 09:49:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:55 - INFO - __main__ -   Validation improved! 1.4457609429686509 -> 1.4229970021368363\n",
      "10/02/2020 09:49:55 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:08<01:58,  3.28s/it]\u001b[A10/02/2020 09:49:57 - INFO - __main__ -   \n",
      "10/02/2020 09:49:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:57 - INFO - __main__ -   Validation Accuracy improved! 0.11913357400722022 -> 0.1407942238267148\n",
      "10/02/2020 09:49:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:57 - INFO - __main__ -   \n",
      "10/02/2020 09:49:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:57 - INFO - __main__ -   Validation improved! 1.4229970021368363 -> 1.3888697482188255\n",
      "10/02/2020 09:49:57 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:09<01:36,  2.76s/it]\u001b[A10/02/2020 09:49:58 - INFO - __main__ -   \n",
      "10/02/2020 09:49:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:58 - INFO - __main__ -   Validation Accuracy improved! 0.1407942238267148 -> 0.51985559566787\n",
      "10/02/2020 09:49:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:58 - INFO - __main__ -   \n",
      "10/02/2020 09:49:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:49:58 - INFO - __main__ -   Validation improved! 1.3888697482188255 -> 1.3426405023581716\n",
      "10/02/2020 09:49:58 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:21,  2.41s/it]\u001b[A10/02/2020 09:50:00 - INFO - __main__ -   \n",
      "10/02/2020 09:50:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:00 - INFO - __main__ -   Validation Accuracy improved! 0.51985559566787 -> 0.5703971119133574\n",
      "10/02/2020 09:50:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:00 - INFO - __main__ -   \n",
      "10/02/2020 09:50:00 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:00 - INFO - __main__ -   Validation improved! 1.3426405023581716 -> 1.2935941963849946\n",
      "10/02/2020 09:50:00 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  15%|█████                            | 6/39 [00:12<01:11,  2.16s/it]\u001b[A10/02/2020 09:50:02 - INFO - __main__ -   \n",
      "10/02/2020 09:50:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:02 - INFO - __main__ -   Validation improved! 1.2935941963849946 -> 1.2436697659509708\n",
      "10/02/2020 09:50:02 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:14<01:03,  1.99s/it]\u001b[A10/02/2020 09:50:03 - INFO - __main__ -   \n",
      "10/02/2020 09:50:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:03 - INFO - __main__ -   Validation improved! 1.2436697659509708 -> 1.2077787210794992\n",
      "10/02/2020 09:50:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:16<00:58,  1.87s/it]\u001b[A10/02/2020 09:50:05 - INFO - __main__ -   \n",
      "10/02/2020 09:50:05 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:05 - INFO - __main__ -   Validation improved! 1.2077787210794992 -> 1.2007266773643908\n",
      "10/02/2020 09:50:05 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:17<00:54,  1.80s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:19<00:50,  1.75s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:21<00:48,  1.71s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:22<00:45,  1.69s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:24<00:43,  1.68s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:25<00:41,  1.68s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:27<00:40,  1.67s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:29<00:38,  1.67s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:30<00:36,  1.67s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:32<00:35,  1.68s/it]\u001b[A10/02/2020 09:50:21 - INFO - __main__ -   \n",
      "10/02/2020 09:50:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:21 - INFO - __main__ -   Validation improved! 1.2007266773643908 -> 1.1791466728444564\n",
      "10/02/2020 09:50:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:34<00:33,  1.68s/it]\u001b[A10/02/2020 09:50:23 - INFO - __main__ -   \n",
      "10/02/2020 09:50:23 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:23 - INFO - __main__ -   Validation improved! 1.1791466728444564 -> 1.156036034387802\n",
      "10/02/2020 09:50:23 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:36<00:31,  1.68s/it]\u001b[A10/02/2020 09:50:25 - INFO - __main__ -   \n",
      "10/02/2020 09:50:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:25 - INFO - __main__ -   Validation improved! 1.156036034387802 -> 1.1434401389924198\n",
      "10/02/2020 09:50:25 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:37<00:30,  1.68s/it]\u001b[A10/02/2020 09:50:26 - INFO - __main__ -   \n",
      "10/02/2020 09:50:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:26 - INFO - __main__ -   Validation improved! 1.1434401389924198 -> 1.1415556247483953\n",
      "10/02/2020 09:50:26 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:39<00:28,  1.67s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:41<00:26,  1.67s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:42<00:24,  1.66s/it]\u001b[A10/02/2020 09:50:31 - INFO - __main__ -   \n",
      "10/02/2020 09:50:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:31 - INFO - __main__ -   Validation Accuracy improved! 0.5703971119133574 -> 0.5740072202166066\n",
      "10/02/2020 09:50:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:44<00:23,  1.65s/it]\u001b[A10/02/2020 09:50:33 - INFO - __main__ -   \n",
      "10/02/2020 09:50:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:33 - INFO - __main__ -   Validation improved! 1.1415556247483953 -> 1.1394896107029828\n",
      "10/02/2020 09:50:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:45<00:21,  1.65s/it]\u001b[A10/02/2020 09:50:35 - INFO - __main__ -   \n",
      "10/02/2020 09:50:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:35 - INFO - __main__ -   Validation improved! 1.1394896107029828 -> 1.118181765725036\n",
      "10/02/2020 09:50:35 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:47<00:19,  1.64s/it]\u001b[A10/02/2020 09:50:36 - INFO - __main__ -   \n",
      "10/02/2020 09:50:36 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:36 - INFO - __main__ -   Validation improved! 1.118181765725036 -> 1.1066982974214243\n",
      "10/02/2020 09:50:36 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:49<00:18,  1.64s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:50<00:16,  1.63s/it]\u001b[A10/02/2020 09:50:39 - INFO - __main__ -   \n",
      "10/02/2020 09:50:39 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:39 - INFO - __main__ -   Validation improved! 1.1066982974214243 -> 1.1063266971051047\n",
      "10/02/2020 09:50:39 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:52<00:14,  1.62s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:53<00:12,  1.61s/it]\u001b[A10/02/2020 09:50:43 - INFO - __main__ -   \n",
      "10/02/2020 09:50:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:43 - INFO - __main__ -   Validation Accuracy improved! 0.5740072202166066 -> 0.5776173285198556\n",
      "10/02/2020 09:50:43 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:55<00:11,  1.61s/it]\u001b[A10/02/2020 09:50:44 - INFO - __main__ -   \n",
      "10/02/2020 09:50:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:44 - INFO - __main__ -   Validation Accuracy improved! 0.5776173285198556 -> 0.592057761732852\n",
      "10/02/2020 09:50:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:57<00:09,  1.60s/it]\u001b[A10/02/2020 09:50:46 - INFO - __main__ -   \n",
      "10/02/2020 09:50:46 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:46 - INFO - __main__ -   Validation Accuracy improved! 0.592057761732852 -> 0.6028880866425993\n",
      "10/02/2020 09:50:46 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:58<00:07,  1.59s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:00<00:06,  1.59s/it]\u001b[A10/02/2020 09:50:49 - INFO - __main__ -   \n",
      "10/02/2020 09:50:49 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:49 - INFO - __main__ -   Validation improved! 1.1063266971051047 -> 1.0674247053125705\n",
      "10/02/2020 09:50:49 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:01<00:04,  1.58s/it]\u001b[A10/02/2020 09:50:50 - INFO - __main__ -   \n",
      "10/02/2020 09:50:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:50 - INFO - __main__ -   Validation improved! 1.0674247053125705 -> 1.055322073426918\n",
      "10/02/2020 09:50:50 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:03<00:03,  1.58s/it]\u001b[A10/02/2020 09:50:52 - INFO - __main__ -   \n",
      "10/02/2020 09:50:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:52 - INFO - __main__ -   Validation improved! 1.055322073426918 -> 1.0473105606189272\n",
      "10/02/2020 09:50:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:05<00:01,  1.58s/it]\u001b[A10/02/2020 09:50:54 - INFO - __main__ -   \n",
      "10/02/2020 09:50:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:50:54 - INFO - __main__ -   Validation Accuracy improved! 0.6028880866425993 -> 0.6209386281588448\n",
      "10/02/2020 09:50:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:06<00:00,  1.71s/it]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [01:06<05:32, 66.57s/it]\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<00:59,  1.57s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<00:57,  1.57s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:56,  1.56s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:54,  1.56s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:07<00:53,  1.56s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:51,  1.56s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:10<00:49,  1.56s/it]\u001b[A10/02/2020 09:51:06 - INFO - __main__ -   \n",
      "10/02/2020 09:51:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:51:06 - INFO - __main__ -   Validation Accuracy improved! 0.6209386281588448 -> 0.6245487364620939\n",
      "10/02/2020 09:51:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:51:06 - INFO - __main__ -   \n",
      "10/02/2020 09:51:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:51:06 - INFO - __main__ -   Validation improved! 1.0473105606189272 -> 1.026071576435213\n",
      "10/02/2020 09:51:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:12<00:48,  1.56s/it]\u001b[A10/02/2020 09:51:08 - INFO - __main__ -   \n",
      "10/02/2020 09:51:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:51:08 - INFO - __main__ -   Validation Accuracy improved! 0.6245487364620939 -> 0.631768953068592\n",
      "10/02/2020 09:51:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:51:08 - INFO - __main__ -   \n",
      "10/02/2020 09:51:08 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:51:08 - INFO - __main__ -   Validation improved! 1.026071576435213 -> 1.0071974142794144\n",
      "10/02/2020 09:51:08 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:46,  1.56s/it]\u001b[A10/02/2020 09:51:09 - INFO - __main__ -   \n",
      "10/02/2020 09:51:09 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:51:09 - INFO - __main__ -   Validation improved! 1.0071974142794144 -> 1.0029516779558754\n",
      "10/02/2020 09:51:09 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:15<00:45,  1.56s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:43,  1.57s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:18<00:42,  1.57s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:20<00:40,  1.57s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:21<00:39,  1.57s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:23<00:37,  1.58s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:25<00:36,  1.58s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:26<00:34,  1.58s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:28<00:33,  1.58s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:29<00:31,  1.59s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:31<00:30,  1.59s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:33<00:28,  1.59s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:34<00:27,  1.59s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:36<00:25,  1.59s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:37<00:23,  1.60s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:39<00:22,  1.60s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:41<00:20,  1.60s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:42<00:19,  1.60s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:44<00:17,  1.61s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:45<00:16,  1.61s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:47<00:14,  1.61s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:49<00:12,  1.60s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:50<00:11,  1.61s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:52<00:09,  1.61s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:53<00:08,  1.61s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:55<00:06,  1.60s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:57<00:04,  1.60s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [00:58<00:03,  1.60s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:00<00:01,  1.61s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:01<00:00,  1.59s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [02:08<04:20, 65.16s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:00,  1.60s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<00:59,  1.61s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:58,  1.61s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:56,  1.61s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:54,  1.61s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:52,  1.61s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:51,  1.60s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:12<00:49,  1.60s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:47,  1.60s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:46,  1.60s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:44,  1.60s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:43,  1.60s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:20<00:41,  1.60s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:39,  1.60s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:38,  1.60s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:25<00:36,  1.59s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:35,  1.59s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:28<00:33,  1.60s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:30<00:31,  1.59s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:31<00:30,  1.60s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:33<00:28,  1.60s/it]\u001b[A10/02/2020 09:52:31 - INFO - __main__ -   \n",
      "10/02/2020 09:52:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:52:31 - INFO - __main__ -   Validation Accuracy improved! 0.631768953068592 -> 0.6389891696750902\n",
      "10/02/2020 09:52:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:35<00:27,  1.60s/it]\u001b[A10/02/2020 09:52:32 - INFO - __main__ -   \n",
      "10/02/2020 09:52:32 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:52:32 - INFO - __main__ -   Validation Accuracy improved! 0.6389891696750902 -> 0.6425992779783394\n",
      "10/02/2020 09:52:32 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:36<00:25,  1.60s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:38<00:23,  1.60s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:39<00:22,  1.60s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:41<00:20,  1.60s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:43<00:19,  1.60s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:44<00:17,  1.60s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:46<00:15,  1.60s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:47<00:14,  1.60s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:49<00:12,  1.60s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:51<00:11,  1.60s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:52<00:09,  1.60s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:54<00:08,  1.60s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:55<00:06,  1.60s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:57<00:04,  1.60s/it]\u001b[A10/02/2020 09:52:55 - INFO - __main__ -   \n",
      "10/02/2020 09:52:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:52:55 - INFO - __main__ -   Validation improved! 1.0029516779558754 -> 0.9904140165136179\n",
      "10/02/2020 09:52:55 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:52:55 - INFO - __main__ -   Saving the model...\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [00:59<00:03,  1.68s/it]\u001b[A10/02/2020 09:52:57 - INFO - __main__ -   \n",
      "10/02/2020 09:52:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:52:57 - INFO - __main__ -   Validation improved! 0.9904140165136179 -> 0.9812304801458916\n",
      "10/02/2020 09:52:57 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:52:59 - INFO - __main__ -   Saving the model...\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:03<00:02,  2.32s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:04<00:00,  1.66s/it]\u001b[A\n",
      "Epoch:  50%|███████████████████                   | 3/6 [03:13<03:15, 65.08s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<00:59,  1.58s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<00:58,  1.58s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:57,  1.58s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:55,  1.59s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:07<00:54,  1.59s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:52,  1.59s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:51,  1.60s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:12<00:49,  1.60s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:47,  1.60s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:15<00:46,  1.60s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:44,  1.60s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:43,  1.60s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:20<00:41,  1.60s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:40,  1.60s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:23<00:38,  1.60s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:25<00:36,  1.61s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:35,  1.61s/it]\u001b[A10/02/2020 09:53:29 - INFO - __main__ -   \n",
      "10/02/2020 09:53:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:53:29 - INFO - __main__ -   Validation improved! 0.9812304801458916 -> 0.9796221591935691\n",
      "10/02/2020 09:53:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:53:31 - INFO - __main__ -   Saving the model...\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:30<00:47,  2.25s/it]\u001b[A10/02/2020 09:53:33 - INFO - __main__ -   \n",
      "10/02/2020 09:53:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:53:33 - INFO - __main__ -   Validation improved! 0.9796221591935691 -> 0.9794008929806927\n",
      "10/02/2020 09:53:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:53:35 - INFO - __main__ -   Saving the model...\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:34<00:53,  2.69s/it]\u001b[A10/02/2020 09:53:37 - INFO - __main__ -   \n",
      "10/02/2020 09:53:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:53:37 - INFO - __main__ -   Validation Accuracy improved! 0.6425992779783394 -> 0.6462093862815884\n",
      "10/02/2020 09:53:37 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:36<00:45,  2.38s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:37<00:38,  2.14s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:39<00:33,  1.97s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:41<00:29,  1.86s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:42<00:26,  1.78s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:44<00:24,  1.72s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:45<00:21,  1.69s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:47<00:19,  1.66s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:49<00:18,  1.65s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:50<00:16,  1.64s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:52<00:14,  1.63s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:53<00:13,  1.63s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:55<00:11,  1.62s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:57<00:09,  1.62s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:58<00:08,  1.62s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:00<00:06,  1.62s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:02<00:04,  1.62s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:03<00:03,  1.61s/it]\u001b[A10/02/2020 09:54:06 - INFO - __main__ -   \n",
      "10/02/2020 09:54:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 09:54:06 - INFO - __main__ -   Validation Accuracy improved! 0.6462093862815884 -> 0.6498194945848376\n",
      "10/02/2020 09:54:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:05<00:01,  1.61s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:06<00:00,  1.71s/it]\u001b[A\n",
      "Epoch:  67%|█████████████████████████▎            | 4/6 [04:20<02:11, 65.60s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:01,  1.62s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<00:59,  1.61s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:58,  1.62s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:56,  1.62s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:08<00:54,  1.61s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:53,  1.61s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:51,  1.61s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:12<00:49,  1.61s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:48,  1.61s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:16<00:46,  1.60s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:44,  1.60s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:43,  1.60s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:20<00:41,  1.60s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:39,  1.60s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:24<00:38,  1.60s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:25<00:36,  1.60s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:35,  1.59s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:28<00:33,  1.59s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:30<00:31,  1.59s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:32<00:30,  1.59s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:33<00:28,  1.59s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:35<00:27,  1.59s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:36<00:25,  1.59s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:38<00:23,  1.59s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:39<00:22,  1.59s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:41<00:20,  1.59s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:43<00:19,  1.59s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:44<00:17,  1.59s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:46<00:15,  1.59s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:47<00:14,  1.59s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:49<00:12,  1.59s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:51<00:11,  1.59s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:52<00:09,  1.59s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:54<00:07,  1.59s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:55<00:06,  1.58s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:57<00:04,  1.59s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [00:59<00:03,  1.59s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:00<00:01,  1.59s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:02<00:00,  1.59s/it]\u001b[A\n",
      "Epoch:  83%|███████████████████████████████▋      | 5/6 [05:22<01:04, 64.57s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:01<01:00,  1.58s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:03<00:58,  1.58s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:04<00:57,  1.58s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:06<00:55,  1.59s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:07<00:53,  1.59s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:09<00:52,  1.59s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:11<00:51,  1.59s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:12<00:49,  1.59s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:14<00:47,  1.59s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:15<00:46,  1.60s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:17<00:44,  1.60s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:19<00:42,  1.59s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:20<00:41,  1.59s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:22<00:39,  1.59s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:23<00:38,  1.59s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:25<00:36,  1.59s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:27<00:35,  1.60s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:28<00:33,  1.60s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:30<00:31,  1.59s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:31<00:30,  1.59s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:33<00:28,  1.60s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:35<00:27,  1.60s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:36<00:25,  1.60s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:38<00:24,  1.60s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:39<00:22,  1.60s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [00:41<00:20,  1.60s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [00:43<00:19,  1.60s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [00:44<00:17,  1.60s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [00:46<00:16,  1.60s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [00:47<00:14,  1.60s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [00:49<00:12,  1.60s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [00:51<00:11,  1.60s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [00:52<00:09,  1.60s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [00:54<00:08,  1.60s/it]\u001b[A10/02/2020 09:56:04 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [00:55<00:06,  1.60s/it]\u001b[A10/02/2020 09:56:06 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [00:57<00:04,  1.60s/it]\u001b[A10/02/2020 09:56:07 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [00:59<00:03,  1.60s/it]\u001b[A10/02/2020 09:56:09 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:00<00:01,  1.60s/it]\u001b[A10/02/2020 09:56:11 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:02<00:00,  1.60s/it]\u001b[A\n",
      "Epoch: 100%|██████████████████████████████████████| 6/6 [06:24<00:00, 64.09s/it]\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/PTP_RTE.py"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### [4] PeaBERT6 - RTE 73.64%"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Settings: [seed = 65051690 / train_batch_size = 64 / learning_rate = 1e-5 / epoch = 6 / alpha = 0.6 / beta = 0 ] \n",
    "\n",
    "The result is shown in the following cell."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 39,
   "metadata": {
    "collapsed": true
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "10/02/2020 11:05:29 - INFO - __main__ -   IN DEBUG MODE\n",
      "10/02/2020 11:05:30 - INFO - src.argument_parser -   device: cuda n_gpu: 2, 16-bits training: False\n",
      "10/02/2020 11:05:30 - INFO - src.argument_parser -   random seed = 16954747\n",
      "10/02/2020 11:05:30 - INFO - __main__ -   actual batch size on all GPU = 64\n",
      "10/02/2020 11:05:30 - INFO - __main__ -   Seed changed to : 65051690\n",
      "10/02/2020 11:05:30 - INFO - __main__ -   loading teacher's prediction\n",
      "10/02/2020 11:05:30 - INFO - __main__ -   teacher acc = 79.36, teacher loss = 0.46779\n",
      "10/02/2020 11:05:30 - INFO - __main__ -   teacher acc = 68.95, teacher loss = 0.58605\n",
      "10/02/2020 11:05:30 - INFO - src.nli_data_processing -   Writing example 0 of 2490\n",
      "10/02/2020 11:05:32 - INFO - __main__ -   ***** Running training *****\n",
      "10/02/2020 11:05:32 - INFO - __main__ -     Num examples = 2490\n",
      "10/02/2020 11:05:32 - INFO - __main__ -     Batch size = 64\n",
      "10/02/2020 11:05:32 - INFO - __main__ -     Num steps = 228\n",
      "10/02/2020 11:05:32 - INFO - src.nli_data_processing -   Writing example 0 of 277\n",
      "10/02/2020 11:05:33 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 11:05:33 - INFO - __main__ -     Num examples = 277\n",
      "10/02/2020 11:05:33 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 11:05:33 - INFO - src.nli_data_processing -   Writing example 0 of 3000\n",
      "10/02/2020 11:05:35 - INFO - __main__ -   ***** Running evaluation *****\n",
      "10/02/2020 11:05:35 - INFO - __main__ -     Num examples = 3000\n",
      "10/02/2020 11:05:35 - INFO - __main__ -     Batch size = 32\n",
      "10/02/2020 11:05:35 - INFO - __main__ -   using normal Knowledge Distillation\n",
      "10/02/2020 11:05:35 - INFO - src.nli_data_processing -   predicting for RTE\n",
      "10/02/2020 11:05:35 - INFO - src.modeling -   num hidden layer is set as 6\n",
      "10/02/2020 11:05:35 - INFO - src.modeling -   Model config {\n",
      "  \"attention_probs_dropout_prob\": 0.1,\n",
      "  \"hidden_act\": \"gelu\",\n",
      "  \"hidden_dropout_prob\": 0.1,\n",
      "  \"hidden_size\": 768,\n",
      "  \"initializer_range\": 0.02,\n",
      "  \"intermediate_size\": 3072,\n",
      "  \"max_position_embeddings\": 512,\n",
      "  \"num_attention_heads\": 12,\n",
      "  \"num_hidden_layers\": 6,\n",
      "  \"type_vocab_size\": 2,\n",
      "  \"vocab_size\": 30522\n",
      "}\n",
      "\n",
      "10/02/2020 11:05:40 - INFO - src.utils -   delete 0 layers, keep 103 layers\n",
      "10/02/2020 11:05:40 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 11:05:40 - INFO - __main__ -   *****************************************************************************\n",
      "10/02/2020 11:05:40 - INFO - src.utils -   data parallel because more than one gpu\n",
      "10/02/2020 11:05:40 - INFO - __main__ -   number of layers in student model = 6\n",
      "10/02/2020 11:05:40 - INFO - __main__ -   num parameters in student model are 66955008 and 1538\n",
      "10/02/2020 11:05:40 - INFO - __main__ -   FP16 is not activated, use BertAdam\n",
      "Epoch:   0%|                                              | 0/6 [00:00<?, ?it/s]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A/opt/conda/lib/python3.7/site-packages/torch/nn/functional.py:1958: UserWarning: reduction: 'mean' divides the total loss by both the batch size and the support size.'batchmean' divides only by the batch size, and aligns with the KL div math definition.'mean' will be changed to behave the same as 'batchmean' in the next major release.\n",
      "  warnings.warn(\"reduction: 'mean' divides the total loss by both the batch size and the support size.\"\n",
      "10/02/2020 11:05:45 - INFO - __main__ -   \n",
      "10/02/2020 11:05:45 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:45 - INFO - __main__ -   Validation Accuracy improved! 0 -> 0.5270758122743683\n",
      "10/02/2020 11:05:45 - INFO - __main__ -   Validation Loss is : 0.6868718212692316\n",
      "10/02/2020 11:05:45 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:45 - INFO - __main__ -   \n",
      "10/02/2020 11:05:45 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:45 - INFO - __main__ -   Validation Loss improved! 100 -> 0.6868718212692316\n",
      "10/02/2020 11:05:45 - INFO - __main__ -   Validation Accuracy is :0.5270758122743683\n",
      "10/02/2020 11:05:45 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   3%|▊                                | 1/39 [00:05<03:42,  5.86s/it]\u001b[A10/02/2020 11:05:48 - INFO - __main__ -   \n",
      "10/02/2020 11:05:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:48 - INFO - __main__ -   Validation Loss improved! 0.6868718212692316 -> 0.6837985593059003\n",
      "10/02/2020 11:05:48 - INFO - __main__ -   Validation Accuracy is :0.5234657039711191\n",
      "10/02/2020 11:05:48 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   5%|█▋                               | 2/39 [00:08<02:55,  4.75s/it]\u001b[A10/02/2020 11:05:50 - INFO - __main__ -   \n",
      "10/02/2020 11:05:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:50 - INFO - __main__ -   Validation Accuracy improved! 0.5270758122743683 -> 0.5523465703971119\n",
      "10/02/2020 11:05:50 - INFO - __main__ -   Validation Loss is : 0.6759570738038432\n",
      "10/02/2020 11:05:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:50 - INFO - __main__ -   \n",
      "10/02/2020 11:05:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:50 - INFO - __main__ -   Validation Loss improved! 0.6837985593059003 -> 0.6759570738038432\n",
      "10/02/2020 11:05:50 - INFO - __main__ -   Validation Accuracy is :0.5523465703971119\n",
      "10/02/2020 11:05:50 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:   8%|██▌                              | 3/39 [00:10<02:23,  3.97s/it]\u001b[A10/02/2020 11:05:52 - INFO - __main__ -   \n",
      "10/02/2020 11:05:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:52 - INFO - __main__ -   Validation Accuracy improved! 0.5523465703971119 -> 0.5776173285198556\n",
      "10/02/2020 11:05:52 - INFO - __main__ -   Validation Loss is : 0.6650755431247533\n",
      "10/02/2020 11:05:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:52 - INFO - __main__ -   \n",
      "10/02/2020 11:05:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:52 - INFO - __main__ -   Validation Loss improved! 0.6759570738038432 -> 0.6650755431247533\n",
      "10/02/2020 11:05:52 - INFO - __main__ -   Validation Accuracy is :0.5776173285198556\n",
      "10/02/2020 11:05:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  10%|███▍                             | 4/39 [00:12<02:00,  3.43s/it]\u001b[A10/02/2020 11:05:54 - INFO - __main__ -   \n",
      "10/02/2020 11:05:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:54 - INFO - __main__ -   Validation Accuracy improved! 0.5776173285198556 -> 0.631768953068592\n",
      "10/02/2020 11:05:54 - INFO - __main__ -   Validation Loss is : 0.6543995103250773\n",
      "10/02/2020 11:05:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:54 - INFO - __main__ -   \n",
      "10/02/2020 11:05:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:54 - INFO - __main__ -   Validation Loss improved! 0.6650755431247533 -> 0.6543995103250773\n",
      "10/02/2020 11:05:54 - INFO - __main__ -   Validation Accuracy is :0.631768953068592\n",
      "10/02/2020 11:05:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  13%|████▏                            | 5/39 [00:14<01:43,  3.05s/it]\u001b[A10/02/2020 11:05:56 - INFO - __main__ -   \n",
      "10/02/2020 11:05:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:05:56 - INFO - __main__ -   Validation Loss improved! 0.6543995103250773 -> 0.6499820977772186\n",
      "10/02/2020 11:05:56 - INFO - __main__ -   Validation Accuracy is :0.5992779783393501\n",
      "10/02/2020 11:05:56 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  15%|█████                            | 6/39 [00:16<01:32,  2.79s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:18<01:23,  2.60s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:21<01:16,  2.48s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:23<01:11,  2.39s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:25<01:07,  2.33s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:27<01:04,  2.30s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:29<01:01,  2.27s/it]\u001b[A10/02/2020 11:06:12 - INFO - __main__ -   \n",
      "10/02/2020 11:06:12 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:12 - INFO - __main__ -   Validation Loss improved! 0.6499820977772186 -> 0.6411533803285675\n",
      "10/02/2020 11:06:12 - INFO - __main__ -   Validation Accuracy is :0.6245487364620939\n",
      "10/02/2020 11:06:12 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:32<00:58,  2.26s/it]\u001b[A10/02/2020 11:06:14 - INFO - __main__ -   \n",
      "10/02/2020 11:06:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:14 - INFO - __main__ -   Validation Loss improved! 0.6411533803285675 -> 0.6303044811482894\n",
      "10/02/2020 11:06:14 - INFO - __main__ -   Validation Accuracy is :0.6173285198555957\n",
      "10/02/2020 11:06:14 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:34<00:56,  2.25s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:36<00:53,  2.24s/it]\u001b[A10/02/2020 11:06:18 - INFO - __main__ -   \n",
      "10/02/2020 11:06:18 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:18 - INFO - __main__ -   Validation Accuracy improved! 0.631768953068592 -> 0.6606498194945848\n",
      "10/02/2020 11:06:18 - INFO - __main__ -   Validation Loss is : 0.6380689221599042\n",
      "10/02/2020 11:06:18 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:38<00:51,  2.24s/it]\u001b[A10/02/2020 11:06:21 - INFO - __main__ -   \n",
      "10/02/2020 11:06:21 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:21 - INFO - __main__ -   Validation Accuracy improved! 0.6606498194945848 -> 0.6714801444043321\n",
      "10/02/2020 11:06:21 - INFO - __main__ -   Validation Loss is : 0.6435677738396269\n",
      "10/02/2020 11:06:21 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:41<00:49,  2.24s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:43<00:47,  2.24s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:45<00:44,  2.24s/it]\u001b[A10/02/2020 11:06:27 - INFO - __main__ -   \n",
      "10/02/2020 11:06:27 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:27 - INFO - __main__ -   Validation Loss improved! 0.6303044811482894 -> 0.6270958835037176\n",
      "10/02/2020 11:06:27 - INFO - __main__ -   Validation Accuracy is :0.628158844765343\n",
      "10/02/2020 11:06:27 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:47<00:42,  2.24s/it]\u001b[A10/02/2020 11:06:30 - INFO - __main__ -   \n",
      "10/02/2020 11:06:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:30 - INFO - __main__ -   Validation Loss improved! 0.6270958835037176 -> 0.6257914188536496\n",
      "10/02/2020 11:06:30 - INFO - __main__ -   Validation Accuracy is :0.628158844765343\n",
      "10/02/2020 11:06:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:49<00:40,  2.24s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:52<00:38,  2.25s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:54<00:36,  2.25s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:56<00:33,  2.26s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [00:59<00:31,  2.26s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [01:01<00:29,  2.26s/it]\u001b[A10/02/2020 11:06:43 - INFO - __main__ -   \n",
      "10/02/2020 11:06:43 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:43 - INFO - __main__ -   Validation Loss improved! 0.6257914188536496 -> 0.6253571940673387\n",
      "10/02/2020 11:06:43 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 11:06:43 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [01:03<00:27,  2.27s/it]\u001b[A10/02/2020 11:06:45 - INFO - __main__ -   \n",
      "10/02/2020 11:06:45 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:45 - INFO - __main__ -   Validation Loss improved! 0.6253571940673387 -> 0.6168973385642151\n",
      "10/02/2020 11:06:45 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 11:06:45 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [01:05<00:24,  2.27s/it]\u001b[A10/02/2020 11:06:48 - INFO - __main__ -   \n",
      "10/02/2020 11:06:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:06:48 - INFO - __main__ -   Validation Loss improved! 0.6168973385642151 -> 0.616108109374339\n",
      "10/02/2020 11:06:48 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 11:06:48 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:08<00:22,  2.27s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:10<00:20,  2.28s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:12<00:18,  2.28s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:15<00:16,  2.29s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:17<00:13,  2.29s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:19<00:11,  2.30s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:21<00:09,  2.31s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:24<00:06,  2.32s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:26<00:04,  2.33s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:29<00:02,  2.33s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:31<00:00,  2.34s/it]\u001b[A\n",
      "Epoch:  17%|██████▎                               | 1/6 [01:31<07:36, 91.32s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:02<01:30,  2.37s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:27,  2.37s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:07<01:25,  2.37s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:09<01:23,  2.39s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:11<01:21,  2.40s/it]\u001b[A10/02/2020 11:07:25 - INFO - __main__ -   \n",
      "10/02/2020 11:07:25 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:25 - INFO - __main__ -   Validation Loss improved! 0.616108109374339 -> 0.6154278744859386\n",
      "10/02/2020 11:07:25 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 11:07:25 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  15%|█████                            | 6/39 [00:14<01:19,  2.41s/it]\u001b[A10/02/2020 11:07:28 - INFO - __main__ -   \n",
      "10/02/2020 11:07:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:28 - INFO - __main__ -   Validation Loss improved! 0.6154278744859386 -> 0.6138588736633962\n",
      "10/02/2020 11:07:28 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 11:07:28 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:16<01:17,  2.41s/it]\u001b[A10/02/2020 11:07:30 - INFO - __main__ -   \n",
      "10/02/2020 11:07:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:30 - INFO - __main__ -   Validation Loss improved! 0.6138588736633962 -> 0.613319710273605\n",
      "10/02/2020 11:07:30 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 11:07:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:19<01:15,  2.42s/it]\u001b[A10/02/2020 11:07:33 - INFO - __main__ -   \n",
      "10/02/2020 11:07:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:33 - INFO - __main__ -   Validation Loss improved! 0.613319710273605 -> 0.6123668278167395\n",
      "10/02/2020 11:07:33 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 11:07:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:21<01:12,  2.43s/it]\u001b[A10/02/2020 11:07:35 - INFO - __main__ -   \n",
      "10/02/2020 11:07:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:35 - INFO - __main__ -   Validation Loss improved! 0.6123668278167395 -> 0.611444469823734\n",
      "10/02/2020 11:07:35 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 11:07:35 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:24<01:10,  2.43s/it]\u001b[A10/02/2020 11:07:37 - INFO - __main__ -   \n",
      "10/02/2020 11:07:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:37 - INFO - __main__ -   Validation Loss improved! 0.611444469823734 -> 0.6113163159642409\n",
      "10/02/2020 11:07:37 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 11:07:37 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  28%|█████████                       | 11/39 [00:26<01:08,  2.43s/it]\u001b[A10/02/2020 11:07:40 - INFO - __main__ -   \n",
      "10/02/2020 11:07:40 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:40 - INFO - __main__ -   Validation Loss improved! 0.6113163159642409 -> 0.6106509267207948\n",
      "10/02/2020 11:07:40 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 11:07:40 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:28<01:05,  2.43s/it]\u001b[A10/02/2020 11:07:42 - INFO - __main__ -   \n",
      "10/02/2020 11:07:42 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:42 - INFO - __main__ -   Validation Loss improved! 0.6106509267207948 -> 0.6090335777089915\n",
      "10/02/2020 11:07:42 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 11:07:42 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:31<01:03,  2.44s/it]\u001b[A10/02/2020 11:07:45 - INFO - __main__ -   \n",
      "10/02/2020 11:07:45 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:45 - INFO - __main__ -   Validation Loss improved! 0.6090335777089915 -> 0.6074813918516524\n",
      "10/02/2020 11:07:45 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 11:07:45 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:33<01:01,  2.45s/it]\u001b[A10/02/2020 11:07:47 - INFO - __main__ -   \n",
      "10/02/2020 11:07:47 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:47 - INFO - __main__ -   Validation Loss improved! 0.6074813918516524 -> 0.606472273595927\n",
      "10/02/2020 11:07:47 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 11:07:47 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:36<00:58,  2.45s/it]\u001b[A10/02/2020 11:07:50 - INFO - __main__ -   \n",
      "10/02/2020 11:07:50 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:07:50 - INFO - __main__ -   Validation Loss improved! 0.606472273595927 -> 0.605153311030529\n",
      "10/02/2020 11:07:50 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 11:07:50 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:38<00:56,  2.45s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:41<00:53,  2.44s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:43<00:51,  2.44s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:46<00:48,  2.43s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:48<00:46,  2.44s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:50<00:43,  2.44s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:53<00:41,  2.44s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:55<00:38,  2.43s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:58<00:36,  2.43s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [01:00<00:33,  2.42s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [01:03<00:31,  2.42s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [01:05<00:28,  2.41s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [01:07<00:26,  2.42s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:10<00:24,  2.41s/it]\u001b[A10/02/2020 11:08:24 - INFO - __main__ -   \n",
      "10/02/2020 11:08:24 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:08:24 - INFO - __main__ -   Validation Loss improved! 0.605153311030529 -> 0.6034840366900612\n",
      "10/02/2020 11:08:24 - INFO - __main__ -   Validation Accuracy is :0.6498194945848376\n",
      "10/02/2020 11:08:24 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:12<00:21,  2.41s/it]\u001b[A10/02/2020 11:08:26 - INFO - __main__ -   \n",
      "10/02/2020 11:08:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:08:26 - INFO - __main__ -   Validation Loss improved! 0.6034840366900612 -> 0.601005034326216\n",
      "10/02/2020 11:08:26 - INFO - __main__ -   Validation Accuracy is :0.6498194945848376\n",
      "10/02/2020 11:08:26 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:15<00:19,  2.41s/it]\u001b[A10/02/2020 11:08:28 - INFO - __main__ -   \n",
      "10/02/2020 11:08:28 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:08:28 - INFO - __main__ -   Validation Loss improved! 0.601005034326216 -> 0.5982348238948451\n",
      "10/02/2020 11:08:28 - INFO - __main__ -   Validation Accuracy is :0.6462093862815884\n",
      "10/02/2020 11:08:28 - INFO - __main__ -   =============================================================================\n",
      "\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:17<00:16,  2.40s/it]\u001b[A10/02/2020 11:08:31 - INFO - __main__ -   \n",
      "10/02/2020 11:08:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:08:31 - INFO - __main__ -   Validation Loss improved! 0.5982348238948451 -> 0.5967195627921755\n",
      "10/02/2020 11:08:31 - INFO - __main__ -   Validation Accuracy is :0.6534296028880866\n",
      "10/02/2020 11:08:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:19<00:14,  2.41s/it]\u001b[A10/02/2020 11:08:33 - INFO - __main__ -   \n",
      "10/02/2020 11:08:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:08:33 - INFO - __main__ -   Validation Loss improved! 0.5967195627921755 -> 0.5959274932365555\n",
      "10/02/2020 11:08:33 - INFO - __main__ -   Validation Accuracy is :0.6498194945848376\n",
      "10/02/2020 11:08:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:22<00:12,  2.41s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:24<00:09,  2.41s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:27<00:07,  2.42s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:29<00:04,  2.42s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:32<00:02,  2.42s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:34<00:00,  2.42s/it]\u001b[A\n",
      "Epoch:  33%|████████████▋                         | 2/6 [03:05<06:08, 92.24s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:02<01:32,  2.43s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:29,  2.43s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:07<01:27,  2.43s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:09<01:24,  2.42s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:12<01:22,  2.43s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:14<01:19,  2.42s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:16<01:17,  2.42s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:19<01:14,  2.42s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:21<01:12,  2.42s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:24<01:10,  2.42s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:26<01:07,  2.42s/it]\u001b[A10/02/2020 11:09:14 - INFO - __main__ -   \n",
      "10/02/2020 11:09:14 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:09:14 - INFO - __main__ -   Validation Loss improved! 0.5959274932365555 -> 0.5944141876826647\n",
      "10/02/2020 11:09:14 - INFO - __main__ -   Validation Accuracy is :0.6353790613718412\n",
      "10/02/2020 11:09:14 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:29<01:05,  2.42s/it]\u001b[A10/02/2020 11:09:17 - INFO - __main__ -   \n",
      "10/02/2020 11:09:17 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:09:17 - INFO - __main__ -   Validation Loss improved! 0.5944141876826647 -> 0.5930025672224024\n",
      "10/02/2020 11:09:17 - INFO - __main__ -   Validation Accuracy is :0.6389891696750902\n",
      "10/02/2020 11:09:17 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:31<01:02,  2.42s/it]\u001b[A10/02/2020 11:09:19 - INFO - __main__ -   \n",
      "10/02/2020 11:09:19 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:09:19 - INFO - __main__ -   Validation Loss improved! 0.5930025672224024 -> 0.5921412719285876\n",
      "10/02/2020 11:09:19 - INFO - __main__ -   Validation Accuracy is :0.6425992779783394\n",
      "10/02/2020 11:09:19 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:33<01:00,  2.42s/it]\u001b[A10/02/2020 11:09:22 - INFO - __main__ -   \n",
      "10/02/2020 11:09:22 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:09:22 - INFO - __main__ -   Validation Loss improved! 0.5921412719285876 -> 0.5920253264774915\n",
      "10/02/2020 11:09:22 - INFO - __main__ -   Validation Accuracy is :0.6570397111913358\n",
      "10/02/2020 11:09:22 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:36<00:57,  2.42s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:38<00:55,  2.42s/it]\u001b[A10/02/2020 11:09:26 - INFO - __main__ -   \n",
      "10/02/2020 11:09:26 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:09:26 - INFO - __main__ -   Validation Accuracy improved! 0.6714801444043321 -> 0.6823104693140795\n",
      "10/02/2020 11:09:26 - INFO - __main__ -   Validation Loss is : 0.5931646987419266\n",
      "10/02/2020 11:09:26 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:41<00:53,  2.41s/it]\u001b[A10/02/2020 11:09:29 - INFO - __main__ -   \n",
      "10/02/2020 11:09:29 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:09:29 - INFO - __main__ -   Validation Accuracy improved! 0.6823104693140795 -> 0.6859205776173285\n",
      "10/02/2020 11:09:29 - INFO - __main__ -   Validation Loss is : 0.5954735304904759\n",
      "10/02/2020 11:09:29 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:43<00:50,  2.41s/it]\u001b[A10/02/2020 11:09:31 - INFO - __main__ -   \n",
      "10/02/2020 11:09:31 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:09:31 - INFO - __main__ -   Validation Accuracy improved! 0.6859205776173285 -> 0.6895306859205776\n",
      "10/02/2020 11:09:31 - INFO - __main__ -   Validation Loss is : 0.5979925596326697\n",
      "10/02/2020 11:09:31 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:45<00:48,  2.41s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:48<00:45,  2.41s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:50<00:43,  2.41s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:53<00:40,  2.41s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:55<00:38,  2.40s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:57<00:36,  2.41s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [01:00<00:33,  2.41s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [01:02<00:31,  2.41s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [01:05<00:28,  2.41s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [01:07<00:26,  2.41s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:10<00:24,  2.41s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:12<00:21,  2.41s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:14<00:19,  2.41s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:17<00:16,  2.42s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:19<00:14,  2.41s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:22<00:12,  2.41s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:24<00:09,  2.41s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:26<00:07,  2.42s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:29<00:04,  2.42s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:31<00:02,  2.41s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:34<00:00,  2.41s/it]\u001b[A\n",
      "Epoch:  50%|███████████████████                   | 3/6 [04:39<04:38, 92.81s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:02<01:32,  2.44s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:30,  2.44s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:07<01:27,  2.43s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:09<01:24,  2.42s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:12<01:22,  2.43s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:14<01:19,  2.42s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:16<01:17,  2.41s/it]\u001b[A10/02/2020 11:10:39 - INFO - __main__ -   \n",
      "10/02/2020 11:10:39 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:39 - INFO - __main__ -   Validation Loss improved! 0.5920253264774915 -> 0.5909542125054645\n",
      "10/02/2020 11:10:39 - INFO - __main__ -   Validation Accuracy is :0.6570397111913358\n",
      "10/02/2020 11:10:39 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:19<01:15,  2.42s/it]\u001b[A10/02/2020 11:10:41 - INFO - __main__ -   \n",
      "10/02/2020 11:10:41 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:41 - INFO - __main__ -   Validation Loss improved! 0.5909542125054645 -> 0.5881076082856216\n",
      "10/02/2020 11:10:41 - INFO - __main__ -   Validation Accuracy is :0.6678700361010831\n",
      "10/02/2020 11:10:41 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:21<01:12,  2.42s/it]\u001b[A10/02/2020 11:10:44 - INFO - __main__ -   \n",
      "10/02/2020 11:10:44 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:44 - INFO - __main__ -   Validation Loss improved! 0.5881076082856216 -> 0.5858131367376995\n",
      "10/02/2020 11:10:44 - INFO - __main__ -   Validation Accuracy is :0.6859205776173285\n",
      "10/02/2020 11:10:44 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:24<01:10,  2.41s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:26<01:07,  2.41s/it]\u001b[A10/02/2020 11:10:48 - INFO - __main__ -   \n",
      "10/02/2020 11:10:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:48 - INFO - __main__ -   Validation Accuracy improved! 0.6895306859205776 -> 0.7111913357400722\n",
      "10/02/2020 11:10:48 - INFO - __main__ -   Validation Loss is : 0.5855349358238469\n",
      "10/02/2020 11:10:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:48 - INFO - __main__ -   \n",
      "10/02/2020 11:10:48 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:48 - INFO - __main__ -   Validation Loss improved! 0.5858131367376995 -> 0.5855349358238469\n",
      "10/02/2020 11:10:48 - INFO - __main__ -   Validation Accuracy is :0.7111913357400722\n",
      "10/02/2020 11:10:48 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:29<01:05,  2.41s/it]\u001b[A10/02/2020 11:10:51 - INFO - __main__ -   \n",
      "10/02/2020 11:10:51 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:51 - INFO - __main__ -   Validation Accuracy improved! 0.7111913357400722 -> 0.7148014440433214\n",
      "10/02/2020 11:10:51 - INFO - __main__ -   Validation Loss is : 0.5855387608497151\n",
      "10/02/2020 11:10:51 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:31<01:02,  2.41s/it]\u001b[A10/02/2020 11:10:53 - INFO - __main__ -   \n",
      "10/02/2020 11:10:53 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:53 - INFO - __main__ -   Validation Accuracy improved! 0.7148014440433214 -> 0.7292418772563177\n",
      "10/02/2020 11:10:53 - INFO - __main__ -   Validation Loss is : 0.5856501995871644\n",
      "10/02/2020 11:10:53 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:33<01:00,  2.41s/it]\u001b[A10/02/2020 11:10:56 - INFO - __main__ -   \n",
      "10/02/2020 11:10:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:56 - INFO - __main__ -   Validation Accuracy improved! 0.7292418772563177 -> 0.7364620938628159\n",
      "10/02/2020 11:10:56 - INFO - __main__ -   Validation Loss is : 0.5858313357356654\n",
      "10/02/2020 11:10:56 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:56 - INFO - __main__ -   Saving the model...\n",
      "\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:36<00:59,  2.49s/it]\u001b[A10/02/2020 11:10:58 - INFO - __main__ -   \n",
      "10/02/2020 11:10:58 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:10:58 - INFO - __main__ -   Validation Loss improved! 0.5855349358238469 -> 0.585379459367332\n",
      "10/02/2020 11:10:58 - INFO - __main__ -   Validation Accuracy is :0.7328519855595668\n",
      "10/02/2020 11:10:58 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:38<00:57,  2.49s/it]\u001b[A10/02/2020 11:11:01 - INFO - __main__ -   \n",
      "10/02/2020 11:11:01 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:11:01 - INFO - __main__ -   Validation Loss improved! 0.585379459367332 -> 0.5846636458854813\n",
      "10/02/2020 11:11:01 - INFO - __main__ -   Validation Accuracy is :0.7364620938628159\n",
      "10/02/2020 11:11:01 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:41<00:54,  2.48s/it]\u001b[A10/02/2020 11:11:03 - INFO - __main__ -   \n",
      "10/02/2020 11:11:03 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:11:03 - INFO - __main__ -   Validation Loss improved! 0.5846636458854813 -> 0.5844593908812595\n",
      "10/02/2020 11:11:03 - INFO - __main__ -   Validation Accuracy is :0.7364620938628159\n",
      "10/02/2020 11:11:03 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:43<00:51,  2.47s/it]\u001b[A10/02/2020 11:11:06 - INFO - __main__ -   \n",
      "10/02/2020 11:11:06 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:11:06 - INFO - __main__ -   Validation Loss improved! 0.5844593908812595 -> 0.5840816394516707\n",
      "10/02/2020 11:11:06 - INFO - __main__ -   Validation Accuracy is :0.7328519855595668\n",
      "10/02/2020 11:11:06 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:46<00:49,  2.45s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:48<00:46,  2.44s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:51<00:43,  2.43s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:53<00:41,  2.43s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:55<00:38,  2.42s/it]\u001b[A\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Iteration:  62%|███████████████████▋            | 24/39 [00:58<00:36,  2.42s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [01:00<00:33,  2.42s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [01:03<00:31,  2.42s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [01:05<00:29,  2.42s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [01:08<00:26,  2.42s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:10<00:24,  2.41s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:12<00:21,  2.41s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:15<00:19,  2.41s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:17<00:16,  2.41s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:20<00:14,  2.41s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:22<00:12,  2.40s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:24<00:09,  2.42s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:27<00:07,  2.41s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:29<00:04,  2.41s/it]\u001b[A10/02/2020 11:11:52 - INFO - __main__ -   \n",
      "10/02/2020 11:11:52 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:11:52 - INFO - __main__ -   Validation Loss improved! 0.5840816394516707 -> 0.5835643244994676\n",
      "10/02/2020 11:11:52 - INFO - __main__ -   Validation Accuracy is :0.6823104693140795\n",
      "10/02/2020 11:11:52 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:32<00:02,  2.42s/it]\u001b[A10/02/2020 11:11:54 - INFO - __main__ -   \n",
      "10/02/2020 11:11:54 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:11:54 - INFO - __main__ -   Validation Loss improved! 0.5835643244994676 -> 0.5835256610966761\n",
      "10/02/2020 11:11:54 - INFO - __main__ -   Validation Accuracy is :0.6750902527075813\n",
      "10/02/2020 11:11:54 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:34<00:00,  2.42s/it]\u001b[A\n",
      "Epoch:  67%|█████████████████████████▎            | 4/6 [06:14<03:06, 93.33s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:02<01:32,  2.45s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:29,  2.43s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:07<01:27,  2.43s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:09<01:24,  2.42s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:12<01:22,  2.43s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:14<01:19,  2.42s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:16<01:17,  2.41s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:19<01:15,  2.42s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:21<01:12,  2.41s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:24<01:10,  2.42s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:26<01:07,  2.41s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:28<01:04,  2.41s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:31<01:02,  2.42s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:33<01:00,  2.41s/it]\u001b[A10/02/2020 11:12:30 - INFO - __main__ -   \n",
      "10/02/2020 11:12:30 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:12:30 - INFO - __main__ -   Validation Loss improved! 0.5835256610966761 -> 0.5832562635951954\n",
      "10/02/2020 11:12:30 - INFO - __main__ -   Validation Accuracy is :0.6967509025270758\n",
      "10/02/2020 11:12:30 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:36<00:58,  2.42s/it]\u001b[A10/02/2020 11:12:33 - INFO - __main__ -   \n",
      "10/02/2020 11:12:33 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:12:33 - INFO - __main__ -   Validation Loss improved! 0.5832562635951954 -> 0.5800931875456111\n",
      "10/02/2020 11:12:33 - INFO - __main__ -   Validation Accuracy is :0.6823104693140795\n",
      "10/02/2020 11:12:33 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:38<00:55,  2.42s/it]\u001b[A10/02/2020 11:12:35 - INFO - __main__ -   \n",
      "10/02/2020 11:12:35 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:12:35 - INFO - __main__ -   Validation Loss improved! 0.5800931875456111 -> 0.5783453366386331\n",
      "10/02/2020 11:12:35 - INFO - __main__ -   Validation Accuracy is :0.6859205776173285\n",
      "10/02/2020 11:12:35 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:41<00:53,  2.42s/it]\u001b[A10/02/2020 11:12:37 - INFO - __main__ -   \n",
      "10/02/2020 11:12:37 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:12:37 - INFO - __main__ -   Validation Loss improved! 0.5783453366386331 -> 0.5781198425843828\n",
      "10/02/2020 11:12:37 - INFO - __main__ -   Validation Accuracy is :0.6931407942238267\n",
      "10/02/2020 11:12:37 - INFO - __main__ -   =============================================================================\n",
      "\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:43<00:50,  2.42s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:45<00:48,  2.41s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:48<00:45,  2.41s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:50<00:43,  2.42s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:53<00:41,  2.41s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:55<00:38,  2.41s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:57<00:36,  2.41s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [01:00<00:33,  2.42s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [01:02<00:31,  2.41s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [01:05<00:28,  2.41s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [01:07<00:26,  2.42s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:10<00:24,  2.41s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:12<00:21,  2.42s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:14<00:19,  2.41s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:17<00:16,  2.41s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:19<00:14,  2.42s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:22<00:12,  2.41s/it]\u001b[A\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:24<00:09,  2.41s/it]\u001b[A\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:26<00:07,  2.42s/it]\u001b[A\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:29<00:04,  2.41s/it]\u001b[A\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:31<00:02,  2.42s/it]\u001b[A\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:34<00:00,  2.41s/it]\u001b[A\n",
      "Epoch:  83%|███████████████████████████████▋      | 5/6 [07:48<01:33, 93.57s/it]\n",
      "Iteration:   0%|                                         | 0/39 [00:00<?, ?it/s]\u001b[A\n",
      "Iteration:   3%|▊                                | 1/39 [00:02<01:32,  2.44s/it]\u001b[A\n",
      "Iteration:   5%|█▋                               | 2/39 [00:04<01:30,  2.44s/it]\u001b[A\n",
      "Iteration:   8%|██▌                              | 3/39 [00:07<01:27,  2.43s/it]\u001b[A\n",
      "Iteration:  10%|███▍                             | 4/39 [00:09<01:24,  2.42s/it]\u001b[A\n",
      "Iteration:  13%|████▏                            | 5/39 [00:12<01:22,  2.42s/it]\u001b[A\n",
      "Iteration:  15%|█████                            | 6/39 [00:14<01:19,  2.42s/it]\u001b[A\n",
      "Iteration:  18%|█████▉                           | 7/39 [00:16<01:17,  2.41s/it]\u001b[A\n",
      "Iteration:  21%|██████▊                          | 8/39 [00:19<01:15,  2.42s/it]\u001b[A\n",
      "Iteration:  23%|███████▌                         | 9/39 [00:21<01:12,  2.42s/it]\u001b[A\n",
      "Iteration:  26%|████████▏                       | 10/39 [00:24<01:09,  2.41s/it]\u001b[A\n",
      "Iteration:  28%|█████████                       | 11/39 [00:26<01:07,  2.42s/it]\u001b[A\n",
      "Iteration:  31%|█████████▊                      | 12/39 [00:29<01:05,  2.42s/it]\u001b[A\n",
      "Iteration:  33%|██████████▋                     | 13/39 [00:31<01:02,  2.42s/it]\u001b[A\n",
      "Iteration:  36%|███████████▍                    | 14/39 [00:33<01:00,  2.41s/it]\u001b[A\n",
      "Iteration:  38%|████████████▎                   | 15/39 [00:36<00:57,  2.41s/it]\u001b[A\n",
      "Iteration:  41%|█████████████▏                  | 16/39 [00:38<00:55,  2.41s/it]\u001b[A\n",
      "Iteration:  44%|█████████████▉                  | 17/39 [00:41<00:53,  2.42s/it]\u001b[A\n",
      "Iteration:  46%|██████████████▊                 | 18/39 [00:43<00:50,  2.42s/it]\u001b[A\n",
      "Iteration:  49%|███████████████▌                | 19/39 [00:45<00:48,  2.42s/it]\u001b[A\n",
      "Iteration:  51%|████████████████▍               | 20/39 [00:48<00:45,  2.41s/it]\u001b[A\n",
      "Iteration:  54%|█████████████████▏              | 21/39 [00:50<00:43,  2.41s/it]\u001b[A\n",
      "Iteration:  56%|██████████████████              | 22/39 [00:53<00:40,  2.41s/it]\u001b[A\n",
      "Iteration:  59%|██████████████████▊             | 23/39 [00:55<00:38,  2.41s/it]\u001b[A\n",
      "Iteration:  62%|███████████████████▋            | 24/39 [00:57<00:36,  2.41s/it]\u001b[A\n",
      "Iteration:  64%|████████████████████▌           | 25/39 [01:00<00:33,  2.42s/it]\u001b[A\n",
      "Iteration:  67%|█████████████████████▎          | 26/39 [01:02<00:31,  2.41s/it]\u001b[A\n",
      "Iteration:  69%|██████████████████████▏         | 27/39 [01:05<00:28,  2.41s/it]\u001b[A\n",
      "Iteration:  72%|██████████████████████▉         | 28/39 [01:07<00:26,  2.41s/it]\u001b[A\n",
      "Iteration:  74%|███████████████████████▊        | 29/39 [01:10<00:24,  2.41s/it]\u001b[A\n",
      "Iteration:  77%|████████████████████████▌       | 30/39 [01:12<00:21,  2.41s/it]\u001b[A\n",
      "Iteration:  79%|█████████████████████████▍      | 31/39 [01:14<00:19,  2.41s/it]\u001b[A\n",
      "Iteration:  82%|██████████████████████████▎     | 32/39 [01:17<00:16,  2.41s/it]\u001b[A\n",
      "Iteration:  85%|███████████████████████████     | 33/39 [01:19<00:14,  2.42s/it]\u001b[A\n",
      "Iteration:  87%|███████████████████████████▉    | 34/39 [01:22<00:12,  2.42s/it]\u001b[A10/02/2020 11:14:51 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  90%|████████████████████████████▋   | 35/39 [01:24<00:09,  2.42s/it]\u001b[A10/02/2020 11:14:53 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  92%|█████████████████████████████▌  | 36/39 [01:26<00:07,  2.41s/it]\u001b[A10/02/2020 11:14:56 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  95%|██████████████████████████████▎ | 37/39 [01:29<00:04,  2.41s/it]\u001b[A10/02/2020 11:14:58 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration:  97%|███████████████████████████████▏| 38/39 [01:31<00:02,  2.41s/it]\u001b[A10/02/2020 11:15:00 - WARNING - BERT.pytorch_pretrained_bert.optimization -   Training beyond specified 't_total' steps with schedule 'warmup_linear'. Learning rate set to 0.0. Please set 't_total' of BertAdam correctly.\n",
      "\n",
      "Iteration: 100%|████████████████████████████████| 39/39 [01:34<00:00,  2.41s/it]\u001b[A\n",
      "Epoch: 100%|██████████████████████████████████████| 6/6 [09:22<00:00, 93.78s/it]\n",
      "10/02/2020 11:15:02 - INFO - __main__ -   \n",
      "10/02/2020 11:15:02 - INFO - __main__ -   =============================================================================\n",
      "10/02/2020 11:15:02 - INFO - __main__ -   Validation Loss : 0.5781198425843828 Validation Accuracy : 0.7364620938628159\n",
      "10/02/2020 11:15:02 - INFO - __main__ -   Loss_ACC : 0.6931407942238267\n",
      "10/02/2020 11:15:02 - INFO - __main__ -   =============================================================================\n"
     ]
    }
   ],
   "source": [
    "!python KDAP/NLI_KD_training_RTE.py"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.7.8"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 4
}
