[02/28 09:04:02][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/28 09:04:02][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/28 09:04:03][DEBUG] cmd.py: 1253: Popen(['git', 'cat-file', '--batch-check'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=<valid stream>, shell=False, universal_newlines=False)
[02/28 09:04:06][INFO] train_vision.py:  207: ------------------------------------
[02/28 09:04:06][INFO] train_vision.py:  208: Environment Versions:
[02/28 09:04:06][INFO] train_vision.py:  209: - Python: 3.8.19 (default, Mar 20 2024, 19:58:24) 
[GCC 11.2.0]
[02/28 09:04:06][INFO] train_vision.py:  210: - PyTorch: 1.12.1
[02/28 09:04:06][INFO] train_vision.py:  211: - TorchVison: 0.13.1
[02/28 09:04:06][INFO] train_vision.py:  212: ------------------------------------
[02/28 09:04:06][INFO] train_vision.py:  214: {   'data': {   'batch_size': 10,
                'dataset': 'finegym',
                'image_tmpl': 'img_{:05d}.jpg',
                'input_size': 224,
                'label_list': 'lists/finegym288_labels.csv',
                'modality': 'RGB',
                'num_classes': 288,
                'num_sample': 1,
                'num_segments': 32,
                'rand_aug': False,
                'rand_erase': False,
                'random_shift': True,
                'seg_length': 1,
                'test_batch_size': 3,
                'train_list': 'lists/finegym/train_gym288_rgb_320px_60fps.txt',
                'train_root': '/home/anonymous/datasets/finegym',
                'val_list': 'lists/finegym/val_gym288_rgb_320px_60fps.txt',
                'val_root': '/home/anonymous/datasets/finegym',
                'workers': 4},
    'logging': {   'acc_per_class': True,
                   'correct_per_sample': True,
                   'eval_freq': 2,
                   'print_freq': 10,
                   'skip_epoch': []},
    'network': {   'arch': 'ViT-L/14',
                   'corr_dim': 256,
                   'corr_ext_chnls': [96],
                   'corr_func': 'cosine',
                   'corr_int_chnls': [96, 96, 192],
                   'corr_layer_index': [7],
                   'corr_num_encoders': 2,
                   'corr_window': [5, 9, 9],
                   'drop_fc': 0,
                   'dropout': 0.0,
                   'emb_dropout': 0.0,
                   'fix_clip': False,
                   'init': True,
                   'joint_st': False,
                   'my_fix_clip': True,
                   'n_emb': 448,
                   'num_checkpoints': 24,
                   'side_dim': 448,
                   'sim_header': 'None',
                   'sync_bn': False,
                   'tm': False,
                   'type': 'clip_k400'},
    'pretrain': 'exp/s4v_selfy_vitl14_16x224_k400_run3/model_best.pt',
    'resume': None,
    'seed': 2048,
    'solver': {   'betas': [0.9, 0.999],
                  'clip_ratio': 1,
                  'epoch_offset': 0,
                  'epochs': 30,
                  'evaluate': False,
                  'final_factor': 0.01,
                  'grad_accumulation_steps': 1,
                  'layer_decay': 1.0,
                  'loss_type': 'CE',
                  'lr': 0.0003,
                  'lr_warmup_step': 4,
                  'optim': 'adamw',
                  'smoothing': 0.1,
                  'start_epoch': 0,
                  'type': 'cosine',
                  'warmup_lr': 3e-07,
                  'weight_decay': 0.15},
    'wandb': {   'entity': 'anonymous',
                 'exp_name': 's4v_selfy_vitl14_32x224_finegym288_run2/train',
                 'group_name': 's4v_selfy_vitl14_32x224_finegym288_run2',
                 'key': '1234',
                 'project_name': 'corr_adapter_finegym288',
                 'use_wandb': True}}
[02/28 09:04:06][INFO] train_vision.py:  215: ------------------------------------
[02/28 09:04:06][INFO] train_vision.py:  216: storing name: ./exp/s4v_selfy_vitl14_32x224_finegym288_run2
[02/28 09:04:08][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/28 09:04:09][INFO] model.py:  444: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/28 09:04:10][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/28 09:04:11][INFO] model.py:  921: loading clip pretrained model!
[02/28 09:04:12][INFO] train_vision.py:  284: visual.class_embedding False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.positional_embedding False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.conv1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.ln_pre.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.ln_pre.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.0.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.1.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.2.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.3.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.4.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.5.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.6.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.7.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.8.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.9.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.10.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.11.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.12.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.13.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.14.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.15.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.16.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.17.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.18.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.19.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.20.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.21.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.22.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.attn.in_proj_weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.attn.in_proj_bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.attn.out_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.attn.out_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.ln_1.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.ln_1.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.mlp.c_fc.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.mlp.c_fc.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.mlp.c_proj.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.mlp.c_proj.bias False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.ln_2.weight False
[02/28 09:04:12][INFO] train_vision.py:  284: visual.transformer.resblocks.23.ln_2.bias False
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.side_spatial_position_embeddings True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.0.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.1.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.2.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.3.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.4.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.5.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.6.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.7.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.8.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.9.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.10.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.11.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.12.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.13.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.14.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.15.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.16.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.17.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.18.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.19.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.20.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.21.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.22.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.bn_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.bn_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.bn_2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.bn_2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.mlp.fc1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.mlp.fc1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.mlp.fc2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.mlp.fc2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.attn.in_proj_weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.attn.in_proj_bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.attn.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.attn.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.ln_1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.resblocks.23.ln_1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.3.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.3.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.4.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.4.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.5.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.5.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.6.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.6.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.7.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.7.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.8.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.8.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.9.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.9.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.10.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.10.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.11.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.11.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.12.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.12.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.13.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.13.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.14.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.14.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.15.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.15.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.16.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.16.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.17.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.17.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.18.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.18.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.19.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.19.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.20.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.20.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.21.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.21.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.22.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.22.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.23.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.adaptation.23.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.0.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.3.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.3.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.4.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.4.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.5.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.5.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.6.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.6.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.7.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.7.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.8.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.8.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.9.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.9.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.10.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.10.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.11.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.11.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.12.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.12.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.13.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.13.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.14.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.14.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.15.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.15.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.16.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.16.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.17.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.17.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.18.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.18.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.19.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.19.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.20.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.20.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.21.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.21.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.22.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.22.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.23.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.lns_pre.23.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.ln_pre.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.ln_pre.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.in_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.in_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.ln_pre.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.ln_pre.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.in_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.in_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.0.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.2.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.2.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.out_proj.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.out_proj.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_post_bn.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_post_bn.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_conv1.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_conv1.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_pre_bn3d.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: visual.side_pre_bn3d.bias True
[02/28 09:04:12][INFO] train_vision.py:  287: fc.weight True
[02/28 09:04:12][INFO] train_vision.py:  287: fc.bias True
[02/28 09:04:12][INFO] utils.py:  500: Model:
VideoCLIP(
  (visual): VisualTransformer(
    (conv1): Conv2d(3, 1024, kernel_size=(14, 14), stride=(14, 14), bias=False)
    (dropout): Dropout(p=0.0, inplace=False)
    (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
    (transformer): Transformer(
      (resblocks): ModuleList(
        (0): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (1): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (2): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (3): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (4): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (5): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (6): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (7): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (8): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (9): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (10): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (11): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (12): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (13): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (14): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (15): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (16): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (17): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (18): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (19): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (20): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (21): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (22): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (23): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
      )
    )
    (side_network): SideNetwork(
      (resblocks): ModuleList(
        (0): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (1): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (2): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (3): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (4): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (5): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (6): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (7): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (8): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (9): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (10): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (11): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (12): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (13): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (14): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (15): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (16): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (17): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (18): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (19): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (20): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (21): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (22): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (23): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
      )
      (adaptation): ModuleList(
        (0): Linear(in_features=1024, out_features=448, bias=True)
        (1): Linear(in_features=1024, out_features=448, bias=True)
        (2): Linear(in_features=1024, out_features=448, bias=True)
        (3): Linear(in_features=1024, out_features=448, bias=True)
        (4): Linear(in_features=1024, out_features=448, bias=True)
        (5): Linear(in_features=1024, out_features=448, bias=True)
        (6): Linear(in_features=1024, out_features=448, bias=True)
        (7): Linear(in_features=1024, out_features=448, bias=True)
        (8): Linear(in_features=1024, out_features=448, bias=True)
        (9): Linear(in_features=1024, out_features=448, bias=True)
        (10): Linear(in_features=1024, out_features=448, bias=True)
        (11): Linear(in_features=1024, out_features=448, bias=True)
        (12): Linear(in_features=1024, out_features=448, bias=True)
        (13): Linear(in_features=1024, out_features=448, bias=True)
        (14): Linear(in_features=1024, out_features=448, bias=True)
        (15): Linear(in_features=1024, out_features=448, bias=True)
        (16): Linear(in_features=1024, out_features=448, bias=True)
        (17): Linear(in_features=1024, out_features=448, bias=True)
        (18): Linear(in_features=1024, out_features=448, bias=True)
        (19): Linear(in_features=1024, out_features=448, bias=True)
        (20): Linear(in_features=1024, out_features=448, bias=True)
        (21): Linear(in_features=1024, out_features=448, bias=True)
        (22): Linear(in_features=1024, out_features=448, bias=True)
        (23): Linear(in_features=1024, out_features=448, bias=True)
      )
      (lns_pre): ModuleList(
        (0): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (4): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (5): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (6): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (7): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (8): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (9): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (10): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (11): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (12): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (13): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (14): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (15): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (16): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (17): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (18): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (19): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (20): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (21): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (22): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (23): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
      )
      (moss_layers): ModuleList(
        (0): MOSSBlock(
          (stss_encoders): ModuleList(
            (0): STSSEncoder(
              (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=1024, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
            (1): STSSEncoder(
              (ln_pre): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=448, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
          )
        )
      )
    )
    (side_post_bn): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    (side_conv1): Conv3d(3, 448, kernel_size=(3, 14, 14), stride=(1, 14, 14), padding=(1, 0, 0))
    (side_pre_bn3d): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
  )
  (fusion_model): video_header()
  (drop_out): Dropout(p=0, inplace=False)
  (fc): Linear(in_features=448, out_features=288, bias=True)
)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::div encountered 205 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::add_ encountered 58 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul encountered 463 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul_ encountered 90 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::add encountered 171 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::softmax encountered 24 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::sigmoid encountered 24 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator prim::PythonOp.CheckpointFunction encountered 72 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::pad encountered 38 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::unfold encountered 2 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::norm encountered 4 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::clamp_min encountered 4 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::expand_as encountered 4 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::diagonal encountered 36 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::gelu encountered 8 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::sum encountered 1 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  501: Unsupported operator aten::mean encountered 1 time(s)
[02/28 09:05:14][WARNING] jit_analysis.py:  513: The following submodules of the model were never called during the trace of the graph. They may be unused, or they were accessed by direct calls to .forward() or via other python methods. In the latter case they will have zeros for statistics, though their statistics will still contribute to their parent calling module.
fusion_model, visual.dropout, visual.side_network.resblocks.0.attn, visual.side_network.resblocks.0.attn.out_proj, visual.side_network.resblocks.0.conv, visual.side_network.resblocks.0.conv.0, visual.side_network.resblocks.0.conv.1, visual.side_network.resblocks.0.conv.2, visual.side_network.resblocks.0.mlp, visual.side_network.resblocks.0.mlp.act, visual.side_network.resblocks.0.mlp.drop, visual.side_network.resblocks.0.mlp.fc1, visual.side_network.resblocks.0.mlp.fc2, visual.side_network.resblocks.1.attn, visual.side_network.resblocks.1.attn.out_proj, visual.side_network.resblocks.1.conv, visual.side_network.resblocks.1.conv.0, visual.side_network.resblocks.1.conv.1, visual.side_network.resblocks.1.conv.2, visual.side_network.resblocks.1.mlp, visual.side_network.resblocks.1.mlp.act, visual.side_network.resblocks.1.mlp.drop, visual.side_network.resblocks.1.mlp.fc1, visual.side_network.resblocks.1.mlp.fc2, visual.side_network.resblocks.10.attn, visual.side_network.resblocks.10.attn.out_proj, visual.side_network.resblocks.10.conv, visual.side_network.resblocks.10.conv.0, visual.side_network.resblocks.10.conv.1, visual.side_network.resblocks.10.conv.2, visual.side_network.resblocks.10.mlp, visual.side_network.resblocks.10.mlp.act, visual.side_network.resblocks.10.mlp.drop, visual.side_network.resblocks.10.mlp.fc1, visual.side_network.resblocks.10.mlp.fc2, visual.side_network.resblocks.11.attn, visual.side_network.resblocks.11.attn.out_proj, visual.side_network.resblocks.11.conv, visual.side_network.resblocks.11.conv.0, visual.side_network.resblocks.11.conv.1, visual.side_network.resblocks.11.conv.2, visual.side_network.resblocks.11.mlp, visual.side_network.resblocks.11.mlp.act, visual.side_network.resblocks.11.mlp.drop, visual.side_network.resblocks.11.mlp.fc1, visual.side_network.resblocks.11.mlp.fc2, visual.side_network.resblocks.12.attn, visual.side_network.resblocks.12.attn.out_proj, visual.side_network.resblocks.12.conv, visual.side_network.resblocks.12.conv.0, visual.side_network.resblocks.12.conv.1, visual.side_network.resblocks.12.conv.2, visual.side_network.resblocks.12.mlp, visual.side_network.resblocks.12.mlp.act, visual.side_network.resblocks.12.mlp.drop, visual.side_network.resblocks.12.mlp.fc1, visual.side_network.resblocks.12.mlp.fc2, visual.side_network.resblocks.13.attn, visual.side_network.resblocks.13.attn.out_proj, visual.side_network.resblocks.13.conv, visual.side_network.resblocks.13.conv.0, visual.side_network.resblocks.13.conv.1, visual.side_network.resblocks.13.conv.2, visual.side_network.resblocks.13.mlp, visual.side_network.resblocks.13.mlp.act, visual.side_network.resblocks.13.mlp.drop, visual.side_network.resblocks.13.mlp.fc1, visual.side_network.resblocks.13.mlp.fc2, visual.side_network.resblocks.14.attn, visual.side_network.resblocks.14.attn.out_proj, visual.side_network.resblocks.14.conv, visual.side_network.resblocks.14.conv.0, visual.side_network.resblocks.14.conv.1, visual.side_network.resblocks.14.conv.2, visual.side_network.resblocks.14.mlp, visual.side_network.resblocks.14.mlp.act, visual.side_network.resblocks.14.mlp.drop, visual.side_network.resblocks.14.mlp.fc1, visual.side_network.resblocks.14.mlp.fc2, visual.side_network.resblocks.15.attn, visual.side_network.resblocks.15.attn.out_proj, visual.side_network.resblocks.15.conv, visual.side_network.resblocks.15.conv.0, visual.side_network.resblocks.15.conv.1, visual.side_network.resblocks.15.conv.2, visual.side_network.resblocks.15.mlp, visual.side_network.resblocks.15.mlp.act, visual.side_network.resblocks.15.mlp.drop, visual.side_network.resblocks.15.mlp.fc1, visual.side_network.resblocks.15.mlp.fc2, visual.side_network.resblocks.16.attn, visual.side_network.resblocks.16.attn.out_proj, visual.side_network.resblocks.16.conv, visual.side_network.resblocks.16.conv.0, visual.side_network.resblocks.16.conv.1, visual.side_network.resblocks.16.conv.2, visual.side_network.resblocks.16.mlp, visual.side_network.resblocks.16.mlp.act, visual.side_network.resblocks.16.mlp.drop, visual.side_network.resblocks.16.mlp.fc1, visual.side_network.resblocks.16.mlp.fc2, visual.side_network.resblocks.17.attn, visual.side_network.resblocks.17.attn.out_proj, visual.side_network.resblocks.17.conv, visual.side_network.resblocks.17.conv.0, visual.side_network.resblocks.17.conv.1, visual.side_network.resblocks.17.conv.2, visual.side_network.resblocks.17.mlp, visual.side_network.resblocks.17.mlp.act, visual.side_network.resblocks.17.mlp.drop, visual.side_network.resblocks.17.mlp.fc1, visual.side_network.resblocks.17.mlp.fc2, visual.side_network.resblocks.18.attn, visual.side_network.resblocks.18.attn.out_proj, visual.side_network.resblocks.18.conv, visual.side_network.resblocks.18.conv.0, visual.side_network.resblocks.18.conv.1, visual.side_network.resblocks.18.conv.2, visual.side_network.resblocks.18.mlp, visual.side_network.resblocks.18.mlp.act, visual.side_network.resblocks.18.mlp.drop, visual.side_network.resblocks.18.mlp.fc1, visual.side_network.resblocks.18.mlp.fc2, visual.side_network.resblocks.19.attn, visual.side_network.resblocks.19.attn.out_proj, visual.side_network.resblocks.19.conv, visual.side_network.resblocks.19.conv.0, visual.side_network.resblocks.19.conv.1, visual.side_network.resblocks.19.conv.2, visual.side_network.resblocks.19.mlp, visual.side_network.resblocks.19.mlp.act, visual.side_network.resblocks.19.mlp.drop, visual.side_network.resblocks.19.mlp.fc1, visual.side_network.resblocks.19.mlp.fc2, visual.side_network.resblocks.2.attn, visual.side_network.resblocks.2.attn.out_proj, visual.side_network.resblocks.2.conv, visual.side_network.resblocks.2.conv.0, visual.side_network.resblocks.2.conv.1, visual.side_network.resblocks.2.conv.2, visual.side_network.resblocks.2.mlp, visual.side_network.resblocks.2.mlp.act, visual.side_network.resblocks.2.mlp.drop, visual.side_network.resblocks.2.mlp.fc1, visual.side_network.resblocks.2.mlp.fc2, visual.side_network.resblocks.20.attn, visual.side_network.resblocks.20.attn.out_proj, visual.side_network.resblocks.20.conv, visual.side_network.resblocks.20.conv.0, visual.side_network.resblocks.20.conv.1, visual.side_network.resblocks.20.conv.2, visual.side_network.resblocks.20.mlp, visual.side_network.resblocks.20.mlp.act, visual.side_network.resblocks.20.mlp.drop, visual.side_network.resblocks.20.mlp.fc1, visual.side_network.resblocks.20.mlp.fc2, visual.side_network.resblocks.21.attn, visual.side_network.resblocks.21.attn.out_proj, visual.side_network.resblocks.21.conv, visual.side_network.resblocks.21.conv.0, visual.side_network.resblocks.21.conv.1, visual.side_network.resblocks.21.conv.2, visual.side_network.resblocks.21.mlp, visual.side_network.resblocks.21.mlp.act, visual.side_network.resblocks.21.mlp.drop, visual.side_network.resblocks.21.mlp.fc1, visual.side_network.resblocks.21.mlp.fc2, visual.side_network.resblocks.22.attn, visual.side_network.resblocks.22.attn.out_proj, visual.side_network.resblocks.22.conv, visual.side_network.resblocks.22.conv.0, visual.side_network.resblocks.22.conv.1, visual.side_network.resblocks.22.conv.2, visual.side_network.resblocks.22.mlp, visual.side_network.resblocks.22.mlp.act, visual.side_network.resblocks.22.mlp.drop, visual.side_network.resblocks.22.mlp.fc1, visual.side_network.resblocks.22.mlp.fc2, visual.side_network.resblocks.23.attn, visual.side_network.resblocks.23.attn.out_proj, visual.side_network.resblocks.23.conv, visual.side_network.resblocks.23.conv.0, visual.side_network.resblocks.23.conv.1, visual.side_network.resblocks.23.conv.2, visual.side_network.resblocks.23.mlp, visual.side_network.resblocks.23.mlp.act, visual.side_network.resblocks.23.mlp.drop, visual.side_network.resblocks.23.mlp.fc1, visual.side_network.resblocks.23.mlp.fc2, visual.side_network.resblocks.3.attn, visual.side_network.resblocks.3.attn.out_proj, visual.side_network.resblocks.3.conv, visual.side_network.resblocks.3.conv.0, visual.side_network.resblocks.3.conv.1, visual.side_network.resblocks.3.conv.2, visual.side_network.resblocks.3.mlp, visual.side_network.resblocks.3.mlp.act, visual.side_network.resblocks.3.mlp.drop, visual.side_network.resblocks.3.mlp.fc1, visual.side_network.resblocks.3.mlp.fc2, visual.side_network.resblocks.4.attn, visual.side_network.resblocks.4.attn.out_proj, visual.side_network.resblocks.4.conv, visual.side_network.resblocks.4.conv.0, visual.side_network.resblocks.4.conv.1, visual.side_network.resblocks.4.conv.2, visual.side_network.resblocks.4.mlp, visual.side_network.resblocks.4.mlp.act, visual.side_network.resblocks.4.mlp.drop, visual.side_network.resblocks.4.mlp.fc1, visual.side_network.resblocks.4.mlp.fc2, visual.side_network.resblocks.5.attn, visual.side_network.resblocks.5.attn.out_proj, visual.side_network.resblocks.5.conv, visual.side_network.resblocks.5.conv.0, visual.side_network.resblocks.5.conv.1, visual.side_network.resblocks.5.conv.2, visual.side_network.resblocks.5.mlp, visual.side_network.resblocks.5.mlp.act, visual.side_network.resblocks.5.mlp.drop, visual.side_network.resblocks.5.mlp.fc1, visual.side_network.resblocks.5.mlp.fc2, visual.side_network.resblocks.6.attn, visual.side_network.resblocks.6.attn.out_proj, visual.side_network.resblocks.6.conv, visual.side_network.resblocks.6.conv.0, visual.side_network.resblocks.6.conv.1, visual.side_network.resblocks.6.conv.2, visual.side_network.resblocks.6.mlp, visual.side_network.resblocks.6.mlp.act, visual.side_network.resblocks.6.mlp.drop, visual.side_network.resblocks.6.mlp.fc1, visual.side_network.resblocks.6.mlp.fc2, visual.side_network.resblocks.7.attn, visual.side_network.resblocks.7.attn.out_proj, visual.side_network.resblocks.7.conv, visual.side_network.resblocks.7.conv.0, visual.side_network.resblocks.7.conv.1, visual.side_network.resblocks.7.conv.2, visual.side_network.resblocks.7.mlp, visual.side_network.resblocks.7.mlp.act, visual.side_network.resblocks.7.mlp.drop, visual.side_network.resblocks.7.mlp.fc1, visual.side_network.resblocks.7.mlp.fc2, visual.side_network.resblocks.8.attn, visual.side_network.resblocks.8.attn.out_proj, visual.side_network.resblocks.8.conv, visual.side_network.resblocks.8.conv.0, visual.side_network.resblocks.8.conv.1, visual.side_network.resblocks.8.conv.2, visual.side_network.resblocks.8.mlp, visual.side_network.resblocks.8.mlp.act, visual.side_network.resblocks.8.mlp.drop, visual.side_network.resblocks.8.mlp.fc1, visual.side_network.resblocks.8.mlp.fc2, visual.side_network.resblocks.9.attn, visual.side_network.resblocks.9.attn.out_proj, visual.side_network.resblocks.9.conv, visual.side_network.resblocks.9.conv.0, visual.side_network.resblocks.9.conv.1, visual.side_network.resblocks.9.conv.2, visual.side_network.resblocks.9.mlp, visual.side_network.resblocks.9.mlp.act, visual.side_network.resblocks.9.mlp.drop, visual.side_network.resblocks.9.mlp.fc1, visual.side_network.resblocks.9.mlp.fc2, visual.transformer.resblocks.0.attn.out_proj, visual.transformer.resblocks.1.attn.out_proj, visual.transformer.resblocks.10.attn.out_proj, visual.transformer.resblocks.11.attn.out_proj, visual.transformer.resblocks.12.attn.out_proj, visual.transformer.resblocks.13.attn.out_proj, visual.transformer.resblocks.14.attn.out_proj, visual.transformer.resblocks.15.attn.out_proj, visual.transformer.resblocks.16.attn.out_proj, visual.transformer.resblocks.17.attn.out_proj, visual.transformer.resblocks.18.attn.out_proj, visual.transformer.resblocks.19.attn.out_proj, visual.transformer.resblocks.2.attn.out_proj, visual.transformer.resblocks.20.attn.out_proj, visual.transformer.resblocks.21.attn.out_proj, visual.transformer.resblocks.22.attn.out_proj, visual.transformer.resblocks.23.attn.out_proj, visual.transformer.resblocks.3.attn.out_proj, visual.transformer.resblocks.4.attn.out_proj, visual.transformer.resblocks.5.attn.out_proj, visual.transformer.resblocks.6.attn.out_proj, visual.transformer.resblocks.7.attn.out_proj, visual.transformer.resblocks.8.attn.out_proj, visual.transformer.resblocks.9.attn.out_proj
[02/28 09:05:14][INFO] utils.py:  502: Flops: 2.732T
[02/28 09:05:14][INFO] utils.py:  504: Params: 385.508M, tunable Params: 82.330M
[02/28 09:05:14][INFO] train_vision.py:  297: train transforms: [Compose(
    <datasets.transforms.GroupScale object at 0x148566d8f580>
    Compose(
    <datasets.transforms.GroupRandomSizedCrop object at 0x148566d8f400>
    <datasets.transforms.GroupRandomHorizontalFlip object at 0x148566d8f760>
)
    <datasets.transforms.GroupRandomGrayscale object at 0x148566d8fa90>
), Compose(
    <datasets.transforms.Stack object at 0x148566d8f310>
    <datasets.transforms.ToTorchFormatTensor object at 0x148566d8f490>
    <datasets.transforms.GroupNormalize object at 0x148566d8f190>
)]
[02/28 09:05:14][INFO] train_vision.py:  298: val transforms: [Compose(
    <datasets.transforms.GroupScale object at 0x148566d8fd30>
    <datasets.transforms.GroupCenterCrop object at 0x14856692a160>
), Compose(
    <datasets.transforms.Stack object at 0x148566d8fd60>
    <datasets.transforms.ToTorchFormatTensor object at 0x14856692aee0>
    <datasets.transforms.GroupNormalize object at 0x148566d8fca0>
)]
[02/28 09:05:14][INFO] train_vision.py:  361: => Using label smoothing: 0.1
[02/28 09:05:14][INFO] train_vision.py:  372: => loading checkpoint 'exp/s4v_selfy_vitl14_16x224_k400_run3/model_best.pt'
[02/28 09:05:15][INFO] train_vision.py:  384: => pop last fc layer
[02/28 09:05:26][INFO] train_vision.py:  668: Epoch: [0][0/367], lr: 3.00e-07, eta: 1 day, 7:58:00	Time 10.451 (10.451)	Data 2.558 (2.558)	Mem 40.67GB	Prec@1 0.000 (0.000)	Loss 5.6440 (5.6440)
[02/28 09:05:27][INFO] distributed.py:  995: Reducer buckets have been rebuilt in this iteration.
[02/28 09:05:56][INFO] train_vision.py:  668: Epoch: [0][10/367], lr: 2.14e-06, eta: 11:04:18	Time 2.944 (3.623)	Data 0.043 (0.290)	Mem 41.61GB	Prec@1 0.000 (0.000)	Loss 5.7279 (5.6958)
[02/28 09:06:25][INFO] train_vision.py:  668: Epoch: [0][20/367], lr: 4.18e-06, eta: 10:05:39	Time 2.952 (3.306)	Data 0.046 (0.180)	Mem 41.61GB	Prec@1 10.000 (0.476)	Loss 5.4956 (5.6698)
[02/28 09:06:55][INFO] train_vision.py:  668: Epoch: [0][30/367], lr: 6.22e-06, eta: 9:45:53	Time 2.988 (3.201)	Data 0.070 (0.142)	Mem 41.61GB	Prec@1 0.000 (0.645)	Loss 5.5824 (5.6791)
[02/28 09:07:25][INFO] train_vision.py:  668: Epoch: [0][40/367], lr: 8.26e-06, eta: 9:36:43	Time 3.026 (3.154)	Data 0.090 (0.122)	Mem 41.61GB	Prec@1 0.000 (0.976)	Loss 5.5915 (5.6685)
[02/28 09:07:55][INFO] train_vision.py:  668: Epoch: [0][50/367], lr: 1.03e-05, eta: 9:30:49	Time 3.016 (3.125)	Data 0.063 (0.110)	Mem 41.61GB	Prec@1 0.000 (1.176)	Loss 5.5943 (5.6490)
[02/28 09:08:25][INFO] train_vision.py:  668: Epoch: [0][60/367], lr: 1.23e-05, eta: 9:26:47	Time 2.984 (3.105)	Data 0.068 (0.102)	Mem 41.61GB	Prec@1 20.000 (1.803)	Loss 5.5046 (5.6246)
[02/28 09:08:56][INFO] train_vision.py:  668: Epoch: [0][70/367], lr: 1.44e-05, eta: 9:23:49	Time 2.996 (3.092)	Data 0.050 (0.096)	Mem 41.61GB	Prec@1 0.000 (2.535)	Loss 5.4635 (5.5995)
[02/28 09:09:26][INFO] train_vision.py:  668: Epoch: [0][80/367], lr: 1.64e-05, eta: 9:21:24	Time 3.020 (3.082)	Data 0.084 (0.092)	Mem 41.61GB	Prec@1 10.000 (3.210)	Loss 5.5378 (5.5742)
[02/28 09:09:56][INFO] train_vision.py:  668: Epoch: [0][90/367], lr: 1.85e-05, eta: 9:19:35	Time 3.003 (3.074)	Data 0.061 (0.089)	Mem 41.61GB	Prec@1 40.000 (4.286)	Loss 4.6698 (5.5225)
[02/28 09:10:26][INFO] train_vision.py:  668: Epoch: [0][100/367], lr: 2.05e-05, eta: 9:18:04	Time 3.020 (3.069)	Data 0.071 (0.086)	Mem 41.61GB	Prec@1 40.000 (5.446)	Loss 4.4555 (5.4578)
[02/28 09:10:56][INFO] train_vision.py:  668: Epoch: [0][110/367], lr: 2.26e-05, eta: 9:16:47	Time 3.024 (3.065)	Data 0.058 (0.085)	Mem 41.61GB	Prec@1 20.000 (6.126)	Loss 4.3811 (5.4055)
[02/28 09:11:26][INFO] train_vision.py:  668: Epoch: [0][120/367], lr: 2.46e-05, eta: 9:15:38	Time 3.010 (3.061)	Data 0.075 (0.083)	Mem 41.61GB	Prec@1 20.000 (6.942)	Loss 4.5429 (5.3445)
[02/28 09:11:57][INFO] train_vision.py:  668: Epoch: [0][130/367], lr: 2.66e-05, eta: 9:14:40	Time 3.037 (3.059)	Data 0.072 (0.082)	Mem 41.61GB	Prec@1 10.000 (7.786)	Loss 4.2953 (5.2908)
[02/28 09:12:27][INFO] train_vision.py:  668: Epoch: [0][140/367], lr: 2.87e-05, eta: 9:13:37	Time 3.035 (3.056)	Data 0.023 (0.080)	Mem 41.61GB	Prec@1 10.000 (8.298)	Loss 4.7820 (5.2450)
[02/28 09:12:57][INFO] train_vision.py:  668: Epoch: [0][150/367], lr: 3.07e-05, eta: 9:12:46	Time 3.045 (3.054)	Data 0.084 (0.080)	Mem 41.61GB	Prec@1 20.000 (8.742)	Loss 4.2375 (5.1870)
[02/28 09:13:27][INFO] train_vision.py:  668: Epoch: [0][160/367], lr: 3.28e-05, eta: 9:11:52	Time 3.015 (3.052)	Data 0.068 (0.079)	Mem 41.61GB	Prec@1 30.000 (9.006)	Loss 4.0920 (5.1486)
[02/28 09:13:58][INFO] train_vision.py:  668: Epoch: [0][170/367], lr: 3.48e-05, eta: 9:11:07	Time 3.035 (3.050)	Data 0.090 (0.078)	Mem 41.61GB	Prec@1 30.000 (9.825)	Loss 3.7226 (5.0905)
[02/28 09:14:28][INFO] train_vision.py:  668: Epoch: [0][180/367], lr: 3.68e-05, eta: 9:10:20	Time 2.993 (3.049)	Data 0.057 (0.078)	Mem 41.61GB	Prec@1 20.000 (10.110)	Loss 4.3856 (5.0517)
[02/28 09:14:58][INFO] train_vision.py:  668: Epoch: [0][190/367], lr: 3.89e-05, eta: 9:09:36	Time 3.034 (3.047)	Data 0.075 (0.077)	Mem 41.61GB	Prec@1 20.000 (10.890)	Loss 3.8784 (4.9929)
[02/28 09:15:28][INFO] train_vision.py:  668: Epoch: [0][200/367], lr: 4.09e-05, eta: 9:08:53	Time 2.996 (3.046)	Data 0.052 (0.076)	Mem 41.61GB	Prec@1 30.000 (11.542)	Loss 3.7491 (4.9352)
[02/28 09:15:59][INFO] train_vision.py:  668: Epoch: [0][210/367], lr: 4.30e-05, eta: 9:08:09	Time 3.026 (3.045)	Data 0.048 (0.075)	Mem 41.61GB	Prec@1 10.000 (12.322)	Loss 4.2173 (4.8876)
[02/28 09:16:29][INFO] train_vision.py:  668: Epoch: [0][220/367], lr: 4.50e-05, eta: 9:07:19	Time 3.028 (3.043)	Data 0.019 (0.074)	Mem 41.61GB	Prec@1 30.000 (12.896)	Loss 3.8310 (4.8492)
[02/28 09:16:59][INFO] train_vision.py:  668: Epoch: [0][230/367], lr: 4.71e-05, eta: 9:06:29	Time 2.998 (3.041)	Data 0.050 (0.073)	Mem 41.61GB	Prec@1 10.000 (13.290)	Loss 4.5888 (4.8110)
[02/28 09:17:29][INFO] train_vision.py:  668: Epoch: [0][240/367], lr: 4.91e-05, eta: 9:05:41	Time 3.010 (3.040)	Data 0.048 (0.072)	Mem 41.61GB	Prec@1 20.000 (13.610)	Loss 4.1251 (4.7774)
[02/28 09:17:59][INFO] train_vision.py:  668: Epoch: [0][250/367], lr: 5.11e-05, eta: 9:04:54	Time 3.032 (3.038)	Data 0.026 (0.071)	Mem 41.61GB	Prec@1 50.000 (14.263)	Loss 2.9538 (4.7316)
[02/28 09:18:29][INFO] train_vision.py:  668: Epoch: [0][260/367], lr: 5.32e-05, eta: 9:04:08	Time 3.010 (3.037)	Data 0.057 (0.070)	Mem 41.61GB	Prec@1 10.000 (14.866)	Loss 3.5957 (4.6925)
[02/28 09:18:59][INFO] train_vision.py:  668: Epoch: [0][270/367], lr: 5.52e-05, eta: 9:03:25	Time 2.976 (3.036)	Data 0.050 (0.069)	Mem 41.61GB	Prec@1 10.000 (15.461)	Loss 3.5857 (4.6487)
[02/28 09:19:29][INFO] train_vision.py:  668: Epoch: [0][280/367], lr: 5.73e-05, eta: 9:02:43	Time 3.004 (3.035)	Data 0.057 (0.068)	Mem 41.61GB	Prec@1 10.000 (15.836)	Loss 4.8066 (4.6185)
[02/28 09:19:59][INFO] train_vision.py:  668: Epoch: [0][290/367], lr: 5.93e-05, eta: 9:02:00	Time 2.991 (3.033)	Data 0.050 (0.068)	Mem 41.61GB	Prec@1 40.000 (16.254)	Loss 3.0432 (4.5843)
[02/28 09:20:29][INFO] train_vision.py:  668: Epoch: [0][300/367], lr: 6.13e-05, eta: 9:01:20	Time 3.017 (3.032)	Data 0.070 (0.067)	Mem 41.61GB	Prec@1 30.000 (16.645)	Loss 3.7915 (4.5471)
[02/28 09:20:59][INFO] train_vision.py:  668: Epoch: [0][310/367], lr: 6.34e-05, eta: 9:00:41	Time 3.019 (3.032)	Data 0.080 (0.067)	Mem 41.61GB	Prec@1 40.000 (17.138)	Loss 3.2193 (4.5126)
[02/28 09:21:29][INFO] train_vision.py:  668: Epoch: [0][320/367], lr: 6.54e-05, eta: 9:00:02	Time 3.010 (3.031)	Data 0.066 (0.066)	Mem 41.61GB	Prec@1 40.000 (17.913)	Loss 3.5687 (4.4748)
[02/28 09:21:59][INFO] train_vision.py:  668: Epoch: [0][330/367], lr: 6.75e-05, eta: 8:59:22	Time 2.999 (3.030)	Data 0.045 (0.066)	Mem 41.61GB	Prec@1 20.000 (18.127)	Loss 4.5045 (4.4488)
[02/28 09:22:29][INFO] train_vision.py:  668: Epoch: [0][340/367], lr: 6.95e-05, eta: 8:58:41	Time 3.006 (3.029)	Data 0.064 (0.065)	Mem 41.61GB	Prec@1 40.000 (18.504)	Loss 2.9662 (4.4177)
[02/28 09:22:59][INFO] train_vision.py:  668: Epoch: [0][350/367], lr: 7.16e-05, eta: 8:58:03	Time 2.996 (3.028)	Data 0.050 (0.065)	Mem 41.61GB	Prec@1 30.000 (18.946)	Loss 3.2236 (4.3867)
[02/28 09:23:29][INFO] train_vision.py:  668: Epoch: [0][360/367], lr: 7.36e-05, eta: 8:57:28	Time 3.009 (3.028)	Data 0.058 (0.065)	Mem 41.61GB	Prec@1 60.000 (19.474)	Loss 2.7127 (4.3558)
[02/28 09:23:54][INFO] train_vision.py:  668: Epoch: [1][0/367], lr: 7.52e-05, eta: 16:32:47	Time 5.596 (5.596)	Data 2.627 (2.627)	Mem 41.61GB	Prec@1 40.000 (40.000)	Loss 3.1839 (3.1839)
[02/28 09:24:24][INFO] train_vision.py:  668: Epoch: [1][10/367], lr: 7.71e-05, eta: 9:33:40	Time 3.018 (3.237)	Data 0.061 (0.289)	Mem 41.61GB	Prec@1 10.000 (28.182)	Loss 3.0159 (3.2760)
[02/28 09:24:54][INFO] train_vision.py:  668: Epoch: [1][20/367], lr: 7.91e-05, eta: 9:13:27	Time 2.986 (3.126)	Data 0.047 (0.182)	Mem 41.61GB	Prec@1 70.000 (35.714)	Loss 2.2496 (3.2752)
[02/28 09:25:24][INFO] train_vision.py:  668: Epoch: [1][30/367], lr: 8.11e-05, eta: 9:05:53	Time 2.967 (3.086)	Data 0.047 (0.142)	Mem 41.61GB	Prec@1 30.000 (36.452)	Loss 3.3796 (3.2510)
[02/28 09:25:54][INFO] train_vision.py:  668: Epoch: [1][40/367], lr: 8.32e-05, eta: 9:02:13	Time 3.012 (3.068)	Data 0.050 (0.121)	Mem 41.61GB	Prec@1 10.000 (35.610)	Loss 4.0506 (3.2694)
[02/28 09:26:24][INFO] train_vision.py:  668: Epoch: [1][50/367], lr: 8.52e-05, eta: 8:59:23	Time 2.998 (3.055)	Data 0.061 (0.109)	Mem 41.61GB	Prec@1 40.000 (36.471)	Loss 2.7193 (3.2566)
[02/28 09:26:54][INFO] train_vision.py:  668: Epoch: [1][60/367], lr: 8.73e-05, eta: 8:57:20	Time 3.007 (3.046)	Data 0.049 (0.100)	Mem 41.61GB	Prec@1 50.000 (36.721)	Loss 2.6746 (3.2411)
[02/28 09:27:24][INFO] train_vision.py:  668: Epoch: [1][70/367], lr: 8.93e-05, eta: 8:55:42	Time 2.997 (3.040)	Data 0.055 (0.093)	Mem 41.61GB	Prec@1 10.000 (35.775)	Loss 2.8267 (3.2359)
[02/28 09:27:54][INFO] train_vision.py:  668: Epoch: [1][80/367], lr: 9.14e-05, eta: 8:54:19	Time 3.003 (3.035)	Data 0.044 (0.088)	Mem 41.61GB	Prec@1 40.000 (36.667)	Loss 2.6803 (3.2026)
[02/28 09:28:24][INFO] train_vision.py:  668: Epoch: [1][90/367], lr: 9.34e-05, eta: 8:53:06	Time 3.005 (3.031)	Data 0.051 (0.084)	Mem 41.61GB	Prec@1 50.000 (37.253)	Loss 2.6499 (3.1619)
[02/28 09:28:54][INFO] train_vision.py:  668: Epoch: [1][100/367], lr: 9.54e-05, eta: 8:52:04	Time 3.013 (3.028)	Data 0.034 (0.080)	Mem 41.61GB	Prec@1 30.000 (37.723)	Loss 3.1920 (3.1485)
[02/28 09:29:24][INFO] train_vision.py:  668: Epoch: [1][110/367], lr: 9.75e-05, eta: 8:51:06	Time 3.005 (3.025)	Data 0.057 (0.077)	Mem 41.61GB	Prec@1 20.000 (38.198)	Loss 3.9224 (3.1311)
[02/28 09:29:54][INFO] train_vision.py:  668: Epoch: [1][120/367], lr: 9.95e-05, eta: 8:50:15	Time 2.993 (3.023)	Data 0.050 (0.075)	Mem 41.61GB	Prec@1 70.000 (38.926)	Loss 1.9316 (3.1062)
[02/28 09:30:24][INFO] train_vision.py:  668: Epoch: [1][130/367], lr: 1.02e-04, eta: 8:49:26	Time 2.998 (3.021)	Data 0.055 (0.073)	Mem 41.61GB	Prec@1 70.000 (39.160)	Loss 2.4330 (3.1073)
[02/28 09:30:54][INFO] train_vision.py:  668: Epoch: [1][140/367], lr: 1.04e-04, eta: 8:48:38	Time 2.995 (3.020)	Data 0.050 (0.071)	Mem 41.61GB	Prec@1 30.000 (39.149)	Loss 3.1862 (3.0999)
[02/28 09:31:24][INFO] train_vision.py:  668: Epoch: [1][150/367], lr: 1.06e-04, eta: 8:47:52	Time 2.997 (3.018)	Data 0.054 (0.070)	Mem 41.61GB	Prec@1 60.000 (39.536)	Loss 2.1829 (3.0857)
[02/28 09:31:54][INFO] train_vision.py:  668: Epoch: [1][160/367], lr: 1.08e-04, eta: 8:47:07	Time 2.995 (3.017)	Data 0.048 (0.069)	Mem 41.61GB	Prec@1 50.000 (39.876)	Loss 2.9680 (3.0620)
[02/28 09:32:24][INFO] train_vision.py:  668: Epoch: [1][170/367], lr: 1.10e-04, eta: 8:46:24	Time 2.993 (3.016)	Data 0.055 (0.068)	Mem 41.61GB	Prec@1 20.000 (40.000)	Loss 3.8435 (3.0489)
[02/28 09:32:54][INFO] train_vision.py:  668: Epoch: [1][180/367], lr: 1.12e-04, eta: 8:45:43	Time 3.033 (3.015)	Data 0.025 (0.067)	Mem 41.61GB	Prec@1 60.000 (40.221)	Loss 2.1187 (3.0320)
[02/28 09:33:24][INFO] train_vision.py:  668: Epoch: [1][190/367], lr: 1.14e-04, eta: 8:45:05	Time 3.000 (3.014)	Data 0.061 (0.066)	Mem 41.61GB	Prec@1 30.000 (40.105)	Loss 3.1043 (3.0341)
[02/28 09:33:54][INFO] train_vision.py:  668: Epoch: [1][200/367], lr: 1.16e-04, eta: 8:44:27	Time 3.000 (3.013)	Data 0.051 (0.065)	Mem 41.61GB	Prec@1 30.000 (40.597)	Loss 3.2690 (3.0245)
[02/28 09:34:24][INFO] train_vision.py:  668: Epoch: [1][210/367], lr: 1.18e-04, eta: 8:43:49	Time 3.000 (3.012)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 50.000 (40.995)	Loss 2.8317 (3.0132)
[02/28 09:34:54][INFO] train_vision.py:  668: Epoch: [1][220/367], lr: 1.20e-04, eta: 8:43:12	Time 2.987 (3.012)	Data 0.039 (0.064)	Mem 41.61GB	Prec@1 50.000 (41.357)	Loss 2.4501 (2.9991)
[02/28 09:35:24][INFO] train_vision.py:  668: Epoch: [1][230/367], lr: 1.22e-04, eta: 8:42:38	Time 3.002 (3.011)	Data 0.050 (0.063)	Mem 41.61GB	Prec@1 30.000 (41.732)	Loss 2.6290 (2.9791)
[02/28 09:35:54][INFO] train_vision.py:  668: Epoch: [1][240/367], lr: 1.24e-04, eta: 8:42:01	Time 2.998 (3.010)	Data 0.024 (0.062)	Mem 41.61GB	Prec@1 20.000 (41.494)	Loss 3.1726 (2.9775)
[02/28 09:36:24][INFO] train_vision.py:  668: Epoch: [1][250/367], lr: 1.26e-04, eta: 8:41:29	Time 3.037 (3.010)	Data 0.030 (0.062)	Mem 41.61GB	Prec@1 20.000 (41.753)	Loss 3.3869 (2.9659)
[02/28 09:36:54][INFO] train_vision.py:  668: Epoch: [1][260/367], lr: 1.28e-04, eta: 8:40:53	Time 2.963 (3.010)	Data 0.046 (0.061)	Mem 41.61GB	Prec@1 40.000 (41.648)	Loss 2.3497 (2.9574)
[02/28 09:37:24][INFO] train_vision.py:  668: Epoch: [1][270/367], lr: 1.30e-04, eta: 8:40:20	Time 2.997 (3.009)	Data 0.055 (0.061)	Mem 41.61GB	Prec@1 30.000 (41.734)	Loss 3.1605 (2.9474)
[02/28 09:37:54][INFO] train_vision.py:  668: Epoch: [1][280/367], lr: 1.32e-04, eta: 8:39:47	Time 3.000 (3.009)	Data 0.050 (0.061)	Mem 41.61GB	Prec@1 40.000 (41.815)	Loss 2.6671 (2.9441)
[02/28 09:38:24][INFO] train_vision.py:  668: Epoch: [1][290/367], lr: 1.34e-04, eta: 8:39:14	Time 2.997 (3.009)	Data 0.053 (0.060)	Mem 41.61GB	Prec@1 60.000 (41.787)	Loss 1.9507 (2.9387)
[02/28 09:38:54][INFO] train_vision.py:  668: Epoch: [1][300/367], lr: 1.36e-04, eta: 8:38:41	Time 2.986 (3.009)	Data 0.046 (0.060)	Mem 41.61GB	Prec@1 30.000 (41.827)	Loss 3.2311 (2.9298)
[02/28 09:39:24][INFO] train_vision.py:  668: Epoch: [1][310/367], lr: 1.38e-04, eta: 8:38:06	Time 2.998 (3.008)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 60.000 (41.801)	Loss 2.3051 (2.9201)
[02/28 09:39:54][INFO] train_vision.py:  668: Epoch: [1][320/367], lr: 1.40e-04, eta: 8:37:36	Time 2.999 (3.008)	Data 0.042 (0.059)	Mem 41.61GB	Prec@1 70.000 (42.150)	Loss 2.2370 (2.9064)
[02/28 09:40:24][INFO] train_vision.py:  668: Epoch: [1][330/367], lr: 1.42e-04, eta: 8:37:03	Time 3.020 (3.008)	Data 0.030 (0.059)	Mem 41.61GB	Prec@1 50.000 (42.296)	Loss 2.8663 (2.8997)
[02/28 09:40:54][INFO] train_vision.py:  668: Epoch: [1][340/367], lr: 1.44e-04, eta: 8:36:30	Time 3.004 (3.008)	Data 0.044 (0.058)	Mem 41.61GB	Prec@1 20.000 (42.405)	Loss 2.8633 (2.8931)
[02/28 09:41:24][INFO] train_vision.py:  668: Epoch: [1][350/367], lr: 1.46e-04, eta: 8:35:57	Time 2.997 (3.007)	Data 0.042 (0.058)	Mem 41.61GB	Prec@1 60.000 (42.764)	Loss 2.1345 (2.8771)
[02/28 09:41:54][INFO] train_vision.py:  668: Epoch: [1][360/367], lr: 1.49e-04, eta: 8:35:25	Time 3.000 (3.007)	Data 0.049 (0.058)	Mem 41.61GB	Prec@1 70.000 (43.019)	Loss 2.2225 (2.8703)
[02/28 09:42:19][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 72.500 (72.500)	Prec@5 96.250 (96.250)	mPrec@1 (6.615)	mPrec@5 (10.590)
[02/28 09:43:01][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 71.250 (73.068)	Prec@5 97.500 (95.682)	mPrec@1 (12.955)	mPrec@5 (28.096)
[02/28 09:43:43][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 63.750 (67.679)	Prec@5 96.250 (94.524)	mPrec@1 (15.683)	mPrec@5 (35.129)
[02/28 09:44:25][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 50.000 (65.363)	Prec@5 95.000 (94.476)	mPrec@1 (15.768)	mPrec@5 (39.805)
[02/28 09:45:08][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 53.750 (64.634)	Prec@5 91.250 (93.841)	mPrec@1 (15.818)	mPrec@5 (40.639)
[02/28 09:45:50][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 65.000 (63.627)	Prec@5 88.750 (93.211)	mPrec@1 (15.701)	mPrec@5 (40.420)
[02/28 09:46:32][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 57.500 (63.053)	Prec@5 91.250 (93.176)	mPrec@1 (15.757)	mPrec@5 (40.728)
[02/28 09:47:14][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 57.500 (63.239)	Prec@5 88.750 (93.310)	mPrec@1 (15.774)	mPrec@5 (40.837)
[02/28 09:47:56][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 38.750 (62.593)	Prec@5 82.500 (92.994)	mPrec@1 (15.727)	mPrec@5 (41.046)
[02/28 09:48:39][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 35.000 (60.783)	Prec@5 88.750 (92.418)	mPrec@1 (15.885)	mPrec@5 (41.638)
[02/28 09:49:21][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 68.750 (59.319)	Prec@5 96.250 (91.324)	mPrec@1 (16.188)	mPrec@5 (42.220)
[02/28 09:50:03][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 43.750 (59.561)	Prec@5 76.250 (91.306)	mPrec@1 (16.250)	mPrec@5 (42.116)
[02/28 09:50:44][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 27.083 (57.400)	Prec@5 33.333 (89.583)	mPrec@1 (16.322)	mPrec@5 (42.306)
[02/28 09:50:45][INFO] train_vision.py:  847: Overall Prec@1 57.400% Prec@5 89.583% mPrec@1 (16.322) mPrec@5 (42.306)
[02/28 09:50:45][INFO] train_vision.py:  464: Testing: 16.32164192199707/16.32164192199707
[02/28 09:50:45][INFO] train_vision.py:  465: Saving:
[02/28 09:50:55][INFO] train_vision.py:  668: Epoch: [2][0/367], lr: 1.50e-04, eta: 15:12:23	Time 5.327 (5.327)	Data 2.409 (2.409)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 2.1848 (2.1848)
[02/28 09:51:25][INFO] train_vision.py:  668: Epoch: [2][10/367], lr: 1.52e-04, eta: 9:11:15	Time 3.047 (3.222)	Data 0.093 (0.279)	Mem 41.61GB	Prec@1 50.000 (51.818)	Loss 2.1634 (2.4351)
[02/28 09:51:55][INFO] train_vision.py:  668: Epoch: [2][20/367], lr: 1.54e-04, eta: 8:54:52	Time 3.045 (3.129)	Data 0.093 (0.181)	Mem 41.61GB	Prec@1 30.000 (45.238)	Loss 2.6907 (2.6503)
[02/28 09:52:25][INFO] train_vision.py:  668: Epoch: [2][30/367], lr: 1.56e-04, eta: 8:48:23	Time 3.034 (3.094)	Data 0.055 (0.143)	Mem 41.61GB	Prec@1 40.000 (49.032)	Loss 2.3171 (2.5778)
[02/28 09:52:56][INFO] train_vision.py:  668: Epoch: [2][40/367], lr: 1.58e-04, eta: 8:44:53	Time 3.029 (3.076)	Data 0.054 (0.123)	Mem 41.61GB	Prec@1 30.000 (49.268)	Loss 2.9912 (2.5868)
[02/28 09:53:26][INFO] train_vision.py:  668: Epoch: [2][50/367], lr: 1.60e-04, eta: 8:42:31	Time 3.048 (3.066)	Data 0.023 (0.109)	Mem 41.61GB	Prec@1 60.000 (49.804)	Loss 2.3806 (2.5950)
[02/28 09:53:56][INFO] train_vision.py:  668: Epoch: [2][60/367], lr: 1.62e-04, eta: 8:40:51	Time 3.026 (3.059)	Data 0.083 (0.102)	Mem 41.61GB	Prec@1 40.000 (49.344)	Loss 2.9475 (2.5890)
[02/28 09:54:26][INFO] train_vision.py:  668: Epoch: [2][70/367], lr: 1.64e-04, eta: 8:39:10	Time 3.013 (3.052)	Data 0.058 (0.096)	Mem 41.61GB	Prec@1 50.000 (49.296)	Loss 2.3147 (2.5949)
[02/28 09:54:56][INFO] train_vision.py:  668: Epoch: [2][80/367], lr: 1.66e-04, eta: 8:37:41	Time 3.024 (3.046)	Data 0.025 (0.091)	Mem 41.61GB	Prec@1 30.000 (49.259)	Loss 2.8960 (2.6075)
[02/28 09:55:26][INFO] train_vision.py:  668: Epoch: [2][90/367], lr: 1.68e-04, eta: 8:36:28	Time 3.029 (3.042)	Data 0.025 (0.087)	Mem 41.61GB	Prec@1 50.000 (49.780)	Loss 2.0584 (2.5742)
[02/28 09:55:56][INFO] train_vision.py:  668: Epoch: [2][100/367], lr: 1.70e-04, eta: 8:35:25	Time 3.032 (3.039)	Data 0.026 (0.084)	Mem 41.61GB	Prec@1 50.000 (49.406)	Loss 2.5350 (2.5906)
[02/28 09:56:27][INFO] train_vision.py:  668: Epoch: [2][110/367], lr: 1.72e-04, eta: 8:34:33	Time 3.042 (3.037)	Data 0.021 (0.082)	Mem 41.61GB	Prec@1 50.000 (49.459)	Loss 2.3235 (2.5845)
[02/28 09:56:57][INFO] train_vision.py:  668: Epoch: [2][120/367], lr: 1.74e-04, eta: 8:33:38	Time 2.996 (3.034)	Data 0.086 (0.080)	Mem 41.61GB	Prec@1 40.000 (50.165)	Loss 2.5156 (2.5529)
[02/28 09:57:27][INFO] train_vision.py:  668: Epoch: [2][130/367], lr: 1.76e-04, eta: 8:32:56	Time 2.995 (3.033)	Data 0.055 (0.078)	Mem 41.61GB	Prec@1 70.000 (50.916)	Loss 2.1850 (2.5426)
[02/28 09:57:57][INFO] train_vision.py:  668: Epoch: [2][140/367], lr: 1.79e-04, eta: 8:32:07	Time 3.016 (3.031)	Data 0.082 (0.077)	Mem 41.61GB	Prec@1 50.000 (51.277)	Loss 2.2630 (2.5209)
[02/28 09:58:27][INFO] train_vision.py:  668: Epoch: [2][150/367], lr: 1.81e-04, eta: 8:31:22	Time 2.986 (3.030)	Data 0.044 (0.075)	Mem 41.61GB	Prec@1 60.000 (51.589)	Loss 2.1435 (2.5092)
[02/28 09:58:57][INFO] train_vision.py:  668: Epoch: [2][160/367], lr: 1.83e-04, eta: 8:30:39	Time 3.015 (3.028)	Data 0.080 (0.074)	Mem 41.61GB	Prec@1 70.000 (51.304)	Loss 2.1495 (2.5155)
[02/28 09:59:27][INFO] train_vision.py:  668: Epoch: [2][170/367], lr: 1.85e-04, eta: 8:29:53	Time 2.989 (3.027)	Data 0.083 (0.073)	Mem 41.61GB	Prec@1 40.000 (51.404)	Loss 2.8624 (2.5140)
[02/28 09:59:57][INFO] train_vision.py:  668: Epoch: [2][180/367], lr: 1.87e-04, eta: 8:29:15	Time 3.040 (3.026)	Data 0.022 (0.072)	Mem 41.61GB	Prec@1 70.000 (51.215)	Loss 2.3745 (2.5114)
[02/28 10:00:27][INFO] train_vision.py:  668: Epoch: [2][190/367], lr: 1.89e-04, eta: 8:28:39	Time 3.015 (3.026)	Data 0.058 (0.071)	Mem 41.61GB	Prec@1 90.000 (51.675)	Loss 1.8690 (2.5012)
[02/28 10:00:57][INFO] train_vision.py:  668: Epoch: [2][200/367], lr: 1.91e-04, eta: 8:27:58	Time 3.018 (3.025)	Data 0.069 (0.070)	Mem 41.61GB	Prec@1 40.000 (51.741)	Loss 2.4481 (2.4905)
[02/28 10:01:27][INFO] train_vision.py:  668: Epoch: [2][210/367], lr: 1.93e-04, eta: 8:27:21	Time 3.006 (3.024)	Data 0.021 (0.070)	Mem 41.61GB	Prec@1 60.000 (52.133)	Loss 2.0699 (2.4766)
[02/28 10:01:58][INFO] train_vision.py:  668: Epoch: [2][220/367], lr: 1.95e-04, eta: 8:26:44	Time 3.016 (3.023)	Data 0.041 (0.069)	Mem 41.61GB	Prec@1 60.000 (52.127)	Loss 2.1639 (2.4780)
[02/28 10:02:28][INFO] train_vision.py:  668: Epoch: [2][230/367], lr: 1.97e-04, eta: 8:26:08	Time 3.036 (3.023)	Data 0.020 (0.069)	Mem 41.61GB	Prec@1 40.000 (52.381)	Loss 2.5273 (2.4632)
[02/28 10:02:58][INFO] train_vision.py:  668: Epoch: [2][240/367], lr: 1.99e-04, eta: 8:25:31	Time 3.011 (3.022)	Data 0.079 (0.068)	Mem 41.61GB	Prec@1 50.000 (52.199)	Loss 2.0372 (2.4612)
[02/28 10:03:28][INFO] train_vision.py:  668: Epoch: [2][250/367], lr: 2.01e-04, eta: 8:24:55	Time 3.014 (3.021)	Data 0.067 (0.068)	Mem 41.61GB	Prec@1 80.000 (52.311)	Loss 2.1104 (2.4642)
[02/28 10:03:58][INFO] train_vision.py:  668: Epoch: [2][260/367], lr: 2.03e-04, eta: 8:24:22	Time 3.034 (3.021)	Data 0.081 (0.068)	Mem 41.61GB	Prec@1 80.000 (52.644)	Loss 1.9937 (2.4565)
[02/28 10:04:28][INFO] train_vision.py:  668: Epoch: [2][270/367], lr: 2.05e-04, eta: 8:23:51	Time 2.990 (3.021)	Data 0.053 (0.068)	Mem 41.61GB	Prec@1 60.000 (52.915)	Loss 2.3111 (2.4473)
[02/28 10:04:58][INFO] train_vision.py:  668: Epoch: [2][280/367], lr: 2.07e-04, eta: 8:23:17	Time 3.018 (3.021)	Data 0.083 (0.068)	Mem 41.61GB	Prec@1 70.000 (53.488)	Loss 1.8430 (2.4321)
[02/28 10:05:28][INFO] train_vision.py:  668: Epoch: [2][290/367], lr: 2.09e-04, eta: 8:22:44	Time 2.999 (3.020)	Data 0.055 (0.067)	Mem 41.61GB	Prec@1 50.000 (53.402)	Loss 2.6409 (2.4359)
[02/28 10:05:58][INFO] train_vision.py:  668: Epoch: [2][300/367], lr: 2.11e-04, eta: 8:22:08	Time 3.001 (3.020)	Data 0.047 (0.067)	Mem 41.61GB	Prec@1 50.000 (53.322)	Loss 2.6356 (2.4387)
[02/28 10:06:29][INFO] train_vision.py:  668: Epoch: [2][310/367], lr: 2.13e-04, eta: 8:21:36	Time 3.015 (3.020)	Data 0.058 (0.067)	Mem 41.61GB	Prec@1 70.000 (53.376)	Loss 2.0283 (2.4376)
[02/28 10:06:59][INFO] train_vision.py:  668: Epoch: [2][320/367], lr: 2.15e-04, eta: 8:21:04	Time 3.007 (3.019)	Data 0.054 (0.067)	Mem 41.61GB	Prec@1 20.000 (53.458)	Loss 2.6599 (2.4315)
[02/28 10:07:29][INFO] train_vision.py:  668: Epoch: [2][330/367], lr: 2.17e-04, eta: 8:20:30	Time 3.041 (3.019)	Data 0.053 (0.066)	Mem 41.61GB	Prec@1 60.000 (53.716)	Loss 2.2357 (2.4249)
[02/28 10:07:59][INFO] train_vision.py:  668: Epoch: [2][340/367], lr: 2.19e-04, eta: 8:19:56	Time 3.003 (3.019)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 50.000 (53.959)	Loss 2.2674 (2.4208)
[02/28 10:08:29][INFO] train_vision.py:  668: Epoch: [2][350/367], lr: 2.21e-04, eta: 8:19:21	Time 2.965 (3.018)	Data 0.045 (0.066)	Mem 41.61GB	Prec@1 60.000 (54.359)	Loss 2.3233 (2.4112)
[02/28 10:08:59][INFO] train_vision.py:  668: Epoch: [2][360/367], lr: 2.23e-04, eta: 8:18:48	Time 2.998 (3.018)	Data 0.053 (0.065)	Mem 41.61GB	Prec@1 40.000 (54.543)	Loss 2.7878 (2.4047)
[02/28 10:09:22][INFO] train_vision.py:  668: Epoch: [3][0/367], lr: 2.25e-04, eta: 15:16:09	Time 5.547 (5.547)	Data 2.400 (2.400)	Mem 41.61GB	Prec@1 50.000 (50.000)	Loss 3.1856 (3.1856)
[02/28 10:09:52][INFO] train_vision.py:  668: Epoch: [3][10/367], lr: 2.27e-04, eta: 8:54:16	Time 3.017 (3.238)	Data 0.053 (0.270)	Mem 41.61GB	Prec@1 50.000 (49.091)	Loss 2.2851 (2.5415)
[02/28 10:10:22][INFO] train_vision.py:  668: Epoch: [3][20/367], lr: 2.29e-04, eta: 8:35:46	Time 2.990 (3.129)	Data 0.060 (0.174)	Mem 41.61GB	Prec@1 40.000 (52.381)	Loss 2.8668 (2.4546)
[02/28 10:10:52][INFO] train_vision.py:  668: Epoch: [3][30/367], lr: 2.31e-04, eta: 8:29:16	Time 3.038 (3.093)	Data 0.074 (0.139)	Mem 41.61GB	Prec@1 50.000 (56.452)	Loss 2.8588 (2.3604)
[02/28 10:11:22][INFO] train_vision.py:  668: Epoch: [3][40/367], lr: 2.33e-04, eta: 8:25:42	Time 2.967 (3.074)	Data 0.063 (0.120)	Mem 41.61GB	Prec@1 50.000 (56.098)	Loss 2.1121 (2.3090)
[02/28 10:11:53][INFO] train_vision.py:  668: Epoch: [3][50/367], lr: 2.35e-04, eta: 8:23:12	Time 3.016 (3.062)	Data 0.079 (0.105)	Mem 41.61GB	Prec@1 60.000 (57.451)	Loss 2.1712 (2.2979)
[02/28 10:12:23][INFO] train_vision.py:  668: Epoch: [3][60/367], lr: 2.37e-04, eta: 8:21:05	Time 3.032 (3.052)	Data 0.048 (0.095)	Mem 41.61GB	Prec@1 60.000 (57.705)	Loss 1.9740 (2.2942)
[02/28 10:12:53][INFO] train_vision.py:  668: Epoch: [3][70/367], lr: 2.39e-04, eta: 8:19:30	Time 3.003 (3.046)	Data 0.054 (0.090)	Mem 41.61GB	Prec@1 60.000 (57.183)	Loss 1.8956 (2.3149)
[02/28 10:13:23][INFO] train_vision.py:  668: Epoch: [3][80/367], lr: 2.41e-04, eta: 8:18:06	Time 2.975 (3.040)	Data 0.039 (0.085)	Mem 41.61GB	Prec@1 50.000 (57.778)	Loss 2.7630 (2.3128)
[02/28 10:13:53][INFO] train_vision.py:  668: Epoch: [3][90/367], lr: 2.43e-04, eta: 8:16:53	Time 2.978 (3.036)	Data 0.052 (0.081)	Mem 41.61GB	Prec@1 50.000 (58.242)	Loss 2.4267 (2.2820)
[02/28 10:14:23][INFO] train_vision.py:  668: Epoch: [3][100/367], lr: 2.45e-04, eta: 8:15:44	Time 2.993 (3.032)	Data 0.046 (0.077)	Mem 41.61GB	Prec@1 60.000 (58.614)	Loss 1.8009 (2.2599)
[02/28 10:14:53][INFO] train_vision.py:  668: Epoch: [3][110/367], lr: 2.47e-04, eta: 8:14:40	Time 2.989 (3.029)	Data 0.051 (0.075)	Mem 41.61GB	Prec@1 40.000 (58.559)	Loss 2.9964 (2.2569)
[02/28 10:15:23][INFO] train_vision.py:  668: Epoch: [3][120/367], lr: 2.49e-04, eta: 8:13:44	Time 2.981 (3.026)	Data 0.042 (0.072)	Mem 41.61GB	Prec@1 60.000 (58.760)	Loss 2.1554 (2.2449)
[02/28 10:15:52][INFO] train_vision.py:  668: Epoch: [3][130/367], lr: 2.51e-04, eta: 8:12:51	Time 3.010 (3.024)	Data 0.051 (0.070)	Mem 41.61GB	Prec@1 70.000 (58.855)	Loss 2.1228 (2.2456)
[02/28 10:16:22][INFO] train_vision.py:  668: Epoch: [3][140/367], lr: 2.53e-04, eta: 8:12:00	Time 2.963 (3.022)	Data 0.046 (0.069)	Mem 41.61GB	Prec@1 70.000 (59.007)	Loss 1.8038 (2.2434)
[02/28 10:16:52][INFO] train_vision.py:  668: Epoch: [3][150/367], lr: 2.55e-04, eta: 8:11:19	Time 3.021 (3.020)	Data 0.061 (0.068)	Mem 41.61GB	Prec@1 70.000 (59.073)	Loss 1.6878 (2.2395)
[02/28 10:17:22][INFO] train_vision.py:  668: Epoch: [3][160/367], lr: 2.58e-04, eta: 8:10:35	Time 2.992 (3.019)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 50.000 (59.193)	Loss 2.8293 (2.2371)
[02/28 10:17:52][INFO] train_vision.py:  668: Epoch: [3][170/367], lr: 2.60e-04, eta: 8:09:50	Time 3.005 (3.017)	Data 0.042 (0.065)	Mem 41.61GB	Prec@1 60.000 (59.240)	Loss 3.1886 (2.2363)
[02/28 10:18:22][INFO] train_vision.py:  668: Epoch: [3][180/367], lr: 2.62e-04, eta: 8:09:09	Time 2.989 (3.016)	Data 0.054 (0.065)	Mem 41.61GB	Prec@1 60.000 (59.337)	Loss 2.2605 (2.2360)
[02/28 10:18:52][INFO] train_vision.py:  668: Epoch: [3][190/367], lr: 2.64e-04, eta: 8:08:28	Time 3.001 (3.015)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 60.000 (59.319)	Loss 2.1312 (2.2393)
[02/28 10:19:22][INFO] train_vision.py:  668: Epoch: [3][200/367], lr: 2.66e-04, eta: 8:07:49	Time 2.993 (3.014)	Data 0.044 (0.063)	Mem 41.61GB	Prec@1 60.000 (59.552)	Loss 1.8668 (2.2290)
[02/28 10:19:52][INFO] train_vision.py:  668: Epoch: [3][210/367], lr: 2.68e-04, eta: 8:07:09	Time 3.004 (3.013)	Data 0.053 (0.062)	Mem 41.61GB	Prec@1 60.000 (59.858)	Loss 1.9257 (2.2207)
[02/28 10:20:22][INFO] train_vision.py:  668: Epoch: [3][220/367], lr: 2.70e-04, eta: 8:06:31	Time 2.993 (3.013)	Data 0.042 (0.062)	Mem 41.61GB	Prec@1 50.000 (59.502)	Loss 2.2550 (2.2304)
[02/28 10:20:52][INFO] train_vision.py:  668: Epoch: [3][230/367], lr: 2.72e-04, eta: 8:05:52	Time 2.998 (3.012)	Data 0.049 (0.061)	Mem 41.61GB	Prec@1 50.000 (59.957)	Loss 2.3755 (2.2162)
[02/28 10:21:22][INFO] train_vision.py:  668: Epoch: [3][240/367], lr: 2.74e-04, eta: 8:05:15	Time 2.971 (3.011)	Data 0.039 (0.060)	Mem 41.61GB	Prec@1 70.000 (60.207)	Loss 2.0272 (2.2076)
[02/28 10:21:52][INFO] train_vision.py:  668: Epoch: [3][250/367], lr: 2.76e-04, eta: 8:04:37	Time 2.998 (3.010)	Data 0.047 (0.060)	Mem 41.61GB	Prec@1 80.000 (60.279)	Loss 1.6382 (2.2055)
[02/28 10:22:22][INFO] train_vision.py:  668: Epoch: [3][260/367], lr: 2.78e-04, eta: 8:04:00	Time 2.993 (3.009)	Data 0.045 (0.059)	Mem 41.61GB	Prec@1 70.000 (60.077)	Loss 1.7174 (2.2005)
[02/28 10:22:52][INFO] train_vision.py:  668: Epoch: [3][270/367], lr: 2.80e-04, eta: 8:03:24	Time 2.977 (3.009)	Data 0.041 (0.059)	Mem 41.61GB	Prec@1 80.000 (60.406)	Loss 1.9641 (2.1892)
[02/28 10:23:22][INFO] train_vision.py:  668: Epoch: [3][280/367], lr: 2.82e-04, eta: 8:02:49	Time 2.974 (3.008)	Data 0.044 (0.058)	Mem 41.61GB	Prec@1 70.000 (60.534)	Loss 1.7073 (2.1839)
[02/28 10:23:52][INFO] train_vision.py:  668: Epoch: [3][290/367], lr: 2.84e-04, eta: 8:02:17	Time 2.993 (3.008)	Data 0.049 (0.058)	Mem 41.61GB	Prec@1 50.000 (60.825)	Loss 2.7365 (2.1740)
[02/28 10:24:22][INFO] train_vision.py:  668: Epoch: [3][300/367], lr: 2.86e-04, eta: 8:01:41	Time 2.988 (3.007)	Data 0.045 (0.057)	Mem 41.61GB	Prec@1 20.000 (60.698)	Loss 3.1005 (2.1762)
[02/28 10:24:52][INFO] train_vision.py:  668: Epoch: [3][310/367], lr: 2.88e-04, eta: 8:01:07	Time 3.020 (3.007)	Data 0.025 (0.057)	Mem 41.61GB	Prec@1 50.000 (60.450)	Loss 2.2252 (2.1821)
[02/28 10:25:21][INFO] train_vision.py:  668: Epoch: [3][320/367], lr: 2.90e-04, eta: 8:00:32	Time 2.976 (3.006)	Data 0.042 (0.057)	Mem 41.61GB	Prec@1 40.000 (60.125)	Loss 2.5414 (2.1876)
[02/28 10:25:51][INFO] train_vision.py:  668: Epoch: [3][330/367], lr: 2.92e-04, eta: 7:59:57	Time 3.014 (3.006)	Data 0.032 (0.056)	Mem 41.61GB	Prec@1 80.000 (60.302)	Loss 1.6819 (2.1839)
[02/28 10:26:21][INFO] train_vision.py:  668: Epoch: [3][340/367], lr: 2.94e-04, eta: 7:59:24	Time 2.993 (3.006)	Data 0.041 (0.056)	Mem 41.61GB	Prec@1 50.000 (60.352)	Loss 2.4390 (2.1845)
[02/28 10:26:51][INFO] train_vision.py:  668: Epoch: [3][350/367], lr: 2.96e-04, eta: 7:58:53	Time 3.008 (3.006)	Data 0.046 (0.056)	Mem 41.61GB	Prec@1 40.000 (60.285)	Loss 3.0616 (2.1832)
[02/28 10:27:21][INFO] train_vision.py:  668: Epoch: [3][360/367], lr: 2.98e-04, eta: 7:58:21	Time 2.991 (3.005)	Data 0.039 (0.056)	Mem 41.61GB	Prec@1 50.000 (60.194)	Loss 2.7306 (2.1869)
[02/28 10:27:46][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 82.500 (82.500)	Prec@5 98.750 (98.750)	mPrec@1 (8.380)	mPrec@5 (11.389)
[02/28 10:28:29][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 88.750 (81.818)	Prec@5 100.000 (98.182)	mPrec@1 (20.242)	mPrec@5 (31.603)
[02/28 10:29:11][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 77.500 (78.274)	Prec@5 98.750 (97.202)	mPrec@1 (24.917)	mPrec@5 (41.499)
[02/28 10:29:53][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 65.000 (77.298)	Prec@5 98.750 (97.500)	mPrec@1 (27.143)	mPrec@5 (48.938)
[02/28 10:30:35][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 71.250 (76.951)	Prec@5 96.250 (97.012)	mPrec@1 (28.210)	mPrec@5 (51.574)
[02/28 10:31:17][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 70.000 (75.319)	Prec@5 96.250 (96.520)	mPrec@1 (27.810)	mPrec@5 (52.733)
[02/28 10:32:00][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 71.250 (75.574)	Prec@5 96.250 (96.557)	mPrec@1 (27.840)	mPrec@5 (54.305)
[02/28 10:32:42][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 70.000 (75.581)	Prec@5 97.500 (96.725)	mPrec@1 (27.894)	mPrec@5 (55.105)
[02/28 10:33:24][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 65.000 (75.494)	Prec@5 95.000 (96.667)	mPrec@1 (28.188)	mPrec@5 (56.317)
[02/28 10:34:06][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 61.250 (74.560)	Prec@5 96.250 (96.593)	mPrec@1 (29.029)	mPrec@5 (57.982)
[02/28 10:34:48][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 80.000 (73.317)	Prec@5 98.750 (95.866)	mPrec@1 (29.219)	mPrec@5 (59.615)
[02/28 10:35:31][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 55.000 (73.367)	Prec@5 83.750 (95.845)	mPrec@1 (29.252)	mPrec@5 (59.498)
[02/28 10:36:11][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 29.167 (71.766)	Prec@5 43.750 (94.652)	mPrec@1 (28.959)	mPrec@5 (59.870)
[02/28 10:36:11][INFO] train_vision.py:  847: Overall Prec@1 71.766% Prec@5 94.652% mPrec@1 (28.959) mPrec@5 (59.870)
[02/28 10:36:11][INFO] train_vision.py:  464: Testing: 28.95897102355957/28.95897102355957
[02/28 10:36:11][INFO] train_vision.py:  465: Saving:
[02/28 10:36:31][INFO] train_vision.py:  668: Epoch: [4][0/367], lr: 3.00e-04, eta: 13:34:32	Time 5.121 (5.121)	Data 2.231 (2.231)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.7935 (1.7935)
[02/28 10:37:01][INFO] train_vision.py:  668: Epoch: [4][10/367], lr: 3.00e-04, eta: 8:27:34	Time 3.024 (3.195)	Data 0.069 (0.259)	Mem 41.61GB	Prec@1 40.000 (62.727)	Loss 2.5539 (2.0603)
[02/28 10:37:31][INFO] train_vision.py:  668: Epoch: [4][20/367], lr: 3.00e-04, eta: 8:13:51	Time 3.025 (3.112)	Data 0.049 (0.163)	Mem 41.61GB	Prec@1 40.000 (59.048)	Loss 3.1592 (2.2187)
[02/28 10:38:01][INFO] train_vision.py:  668: Epoch: [4][30/367], lr: 3.00e-04, eta: 8:07:55	Time 2.993 (3.077)	Data 0.053 (0.128)	Mem 41.61GB	Prec@1 50.000 (59.355)	Loss 1.7399 (2.1701)
[02/28 10:38:31][INFO] train_vision.py:  668: Epoch: [4][40/367], lr: 3.00e-04, eta: 8:04:52	Time 2.985 (3.061)	Data 0.042 (0.109)	Mem 41.61GB	Prec@1 70.000 (60.732)	Loss 2.1071 (2.1774)
[02/28 10:39:01][INFO] train_vision.py:  668: Epoch: [4][50/367], lr: 3.00e-04, eta: 8:02:39	Time 2.997 (3.051)	Data 0.057 (0.097)	Mem 41.61GB	Prec@1 60.000 (61.569)	Loss 2.2707 (2.1401)
[02/28 10:39:31][INFO] train_vision.py:  668: Epoch: [4][60/367], lr: 3.00e-04, eta: 8:01:16	Time 3.008 (3.045)	Data 0.054 (0.090)	Mem 41.61GB	Prec@1 50.000 (61.475)	Loss 2.5804 (2.1423)
[02/28 10:40:01][INFO] train_vision.py:  668: Epoch: [4][70/367], lr: 3.00e-04, eta: 8:00:04	Time 3.002 (3.041)	Data 0.054 (0.085)	Mem 41.61GB	Prec@1 80.000 (61.972)	Loss 1.7055 (2.1276)
[02/28 10:40:32][INFO] train_vision.py:  668: Epoch: [4][80/367], lr: 3.00e-04, eta: 7:59:05	Time 3.003 (3.038)	Data 0.055 (0.081)	Mem 41.61GB	Prec@1 50.000 (61.235)	Loss 2.1184 (2.1393)
[02/28 10:41:02][INFO] train_vision.py:  668: Epoch: [4][90/367], lr: 3.00e-04, eta: 7:58:01	Time 2.999 (3.034)	Data 0.048 (0.078)	Mem 41.61GB	Prec@1 100.000 (62.418)	Loss 1.2598 (2.1083)
[02/28 10:41:32][INFO] train_vision.py:  668: Epoch: [4][100/367], lr: 3.00e-04, eta: 7:57:07	Time 3.024 (3.032)	Data 0.024 (0.075)	Mem 41.61GB	Prec@1 60.000 (62.178)	Loss 2.3392 (2.1193)
[02/28 10:42:02][INFO] train_vision.py:  668: Epoch: [4][110/367], lr: 3.00e-04, eta: 7:56:22	Time 3.021 (3.030)	Data 0.024 (0.073)	Mem 41.61GB	Prec@1 50.000 (62.162)	Loss 2.2227 (2.1246)
[02/28 10:42:32][INFO] train_vision.py:  668: Epoch: [4][120/367], lr: 3.00e-04, eta: 7:55:31	Time 3.005 (3.028)	Data 0.060 (0.070)	Mem 41.61GB	Prec@1 80.000 (62.645)	Loss 1.8356 (2.1100)
[02/28 10:43:02][INFO] train_vision.py:  668: Epoch: [4][130/367], lr: 3.00e-04, eta: 7:54:40	Time 2.993 (3.026)	Data 0.044 (0.068)	Mem 41.61GB	Prec@1 80.000 (62.977)	Loss 1.5619 (2.0986)
[02/28 10:43:32][INFO] train_vision.py:  668: Epoch: [4][140/367], lr: 3.00e-04, eta: 7:53:53	Time 2.972 (3.024)	Data 0.055 (0.067)	Mem 41.61GB	Prec@1 70.000 (62.766)	Loss 1.9460 (2.1053)
[02/28 10:44:02][INFO] train_vision.py:  668: Epoch: [4][150/367], lr: 3.00e-04, eta: 7:53:10	Time 3.026 (3.023)	Data 0.049 (0.065)	Mem 41.61GB	Prec@1 50.000 (62.715)	Loss 2.4476 (2.1160)
[02/28 10:44:32][INFO] train_vision.py:  668: Epoch: [4][160/367], lr: 3.00e-04, eta: 7:52:28	Time 3.006 (3.021)	Data 0.045 (0.064)	Mem 41.61GB	Prec@1 50.000 (62.857)	Loss 2.1377 (2.1048)
[02/28 10:45:02][INFO] train_vision.py:  668: Epoch: [4][170/367], lr: 3.00e-04, eta: 7:51:45	Time 2.969 (3.020)	Data 0.042 (0.064)	Mem 41.61GB	Prec@1 50.000 (63.275)	Loss 2.2537 (2.0977)
[02/28 10:45:32][INFO] train_vision.py:  668: Epoch: [4][180/367], lr: 3.00e-04, eta: 7:51:06	Time 3.002 (3.019)	Data 0.052 (0.062)	Mem 41.61GB	Prec@1 80.000 (62.762)	Loss 1.8295 (2.0999)
[02/28 10:46:02][INFO] train_vision.py:  668: Epoch: [4][190/367], lr: 3.00e-04, eta: 7:50:28	Time 2.986 (3.018)	Data 0.048 (0.062)	Mem 41.61GB	Prec@1 60.000 (62.723)	Loss 2.2796 (2.1032)
[02/28 10:46:32][INFO] train_vision.py:  668: Epoch: [4][200/367], lr: 3.00e-04, eta: 7:49:50	Time 3.023 (3.017)	Data 0.032 (0.061)	Mem 41.61GB	Prec@1 40.000 (62.935)	Loss 2.3170 (2.0963)
[02/28 10:47:02][INFO] train_vision.py:  668: Epoch: [4][210/367], lr: 3.00e-04, eta: 7:49:14	Time 2.999 (3.017)	Data 0.053 (0.061)	Mem 41.61GB	Prec@1 70.000 (63.081)	Loss 1.8650 (2.0939)
[02/28 10:47:32][INFO] train_vision.py:  668: Epoch: [4][220/367], lr: 3.00e-04, eta: 7:48:37	Time 2.998 (3.016)	Data 0.050 (0.060)	Mem 41.61GB	Prec@1 60.000 (62.805)	Loss 2.0723 (2.0981)
[02/28 10:48:02][INFO] train_vision.py:  668: Epoch: [4][230/367], lr: 3.00e-04, eta: 7:47:59	Time 3.001 (3.015)	Data 0.029 (0.059)	Mem 41.61GB	Prec@1 70.000 (62.857)	Loss 2.0095 (2.0998)
[02/28 10:48:32][INFO] train_vision.py:  668: Epoch: [4][240/367], lr: 3.00e-04, eta: 7:47:22	Time 3.009 (3.014)	Data 0.034 (0.059)	Mem 41.61GB	Prec@1 70.000 (63.071)	Loss 1.6534 (2.0942)
[02/28 10:49:02][INFO] train_vision.py:  668: Epoch: [4][250/367], lr: 3.00e-04, eta: 7:46:47	Time 2.994 (3.014)	Data 0.040 (0.058)	Mem 41.61GB	Prec@1 90.000 (63.068)	Loss 1.1813 (2.0885)
[02/28 10:49:32][INFO] train_vision.py:  668: Epoch: [4][260/367], lr: 2.99e-04, eta: 7:46:12	Time 3.001 (3.013)	Data 0.054 (0.058)	Mem 41.61GB	Prec@1 60.000 (63.180)	Loss 2.1188 (2.0880)
[02/28 10:50:02][INFO] train_vision.py:  668: Epoch: [4][270/367], lr: 2.99e-04, eta: 7:45:38	Time 2.999 (3.013)	Data 0.044 (0.057)	Mem 41.61GB	Prec@1 80.000 (63.321)	Loss 1.6305 (2.0842)
[02/28 10:50:32][INFO] train_vision.py:  668: Epoch: [4][280/367], lr: 2.99e-04, eta: 7:45:05	Time 3.009 (3.013)	Data 0.055 (0.057)	Mem 41.61GB	Prec@1 50.000 (63.559)	Loss 2.4461 (2.0740)
[02/28 10:51:02][INFO] train_vision.py:  668: Epoch: [4][290/367], lr: 2.99e-04, eta: 7:44:30	Time 2.969 (3.012)	Data 0.049 (0.057)	Mem 41.61GB	Prec@1 80.000 (63.780)	Loss 2.1113 (2.0731)
[02/28 10:51:32][INFO] train_vision.py:  668: Epoch: [4][300/367], lr: 2.99e-04, eta: 7:43:57	Time 2.999 (3.012)	Data 0.051 (0.056)	Mem 41.61GB	Prec@1 60.000 (63.821)	Loss 2.2448 (2.0695)
[02/28 10:52:02][INFO] train_vision.py:  668: Epoch: [4][310/367], lr: 2.99e-04, eta: 7:43:22	Time 2.985 (3.011)	Data 0.048 (0.056)	Mem 41.61GB	Prec@1 70.000 (63.826)	Loss 2.4520 (2.0699)
[02/28 10:52:32][INFO] train_vision.py:  668: Epoch: [4][320/367], lr: 2.99e-04, eta: 7:42:49	Time 2.999 (3.011)	Data 0.049 (0.056)	Mem 41.61GB	Prec@1 40.000 (63.551)	Loss 2.4854 (2.0731)
[02/28 10:53:02][INFO] train_vision.py:  668: Epoch: [4][330/367], lr: 2.99e-04, eta: 7:42:16	Time 2.994 (3.011)	Data 0.048 (0.055)	Mem 41.61GB	Prec@1 80.000 (63.656)	Loss 1.6190 (2.0706)
[02/28 10:53:32][INFO] train_vision.py:  668: Epoch: [4][340/367], lr: 2.99e-04, eta: 7:41:41	Time 2.998 (3.010)	Data 0.053 (0.055)	Mem 41.61GB	Prec@1 70.000 (63.783)	Loss 2.2429 (2.0687)
[02/28 10:54:02][INFO] train_vision.py:  668: Epoch: [4][350/367], lr: 2.99e-04, eta: 7:41:08	Time 2.999 (3.010)	Data 0.048 (0.055)	Mem 41.61GB	Prec@1 80.000 (63.789)	Loss 1.8324 (2.0689)
[02/28 10:54:32][INFO] train_vision.py:  668: Epoch: [4][360/367], lr: 2.99e-04, eta: 7:40:34	Time 2.990 (3.009)	Data 0.040 (0.055)	Mem 41.61GB	Prec@1 70.000 (63.823)	Loss 1.9429 (2.0710)
[02/28 10:54:55][INFO] train_vision.py:  668: Epoch: [5][0/367], lr: 2.99e-04, eta: 14:18:24	Time 5.613 (5.613)	Data 2.333 (2.333)	Mem 41.61GB	Prec@1 60.000 (60.000)	Loss 2.4276 (2.4276)
[02/28 10:55:25][INFO] train_vision.py:  668: Epoch: [5][10/367], lr: 2.99e-04, eta: 8:15:42	Time 3.061 (3.245)	Data 0.074 (0.266)	Mem 41.61GB	Prec@1 70.000 (74.545)	Loss 1.9083 (1.9164)
[02/28 10:55:55][INFO] train_vision.py:  668: Epoch: [5][20/367], lr: 2.99e-04, eta: 7:57:42	Time 2.983 (3.130)	Data 0.062 (0.168)	Mem 41.61GB	Prec@1 60.000 (70.000)	Loss 2.1460 (1.9846)
[02/28 10:56:25][INFO] train_vision.py:  668: Epoch: [5][30/367], lr: 2.99e-04, eta: 7:50:48	Time 3.021 (3.089)	Data 0.053 (0.129)	Mem 41.61GB	Prec@1 60.000 (70.000)	Loss 2.0451 (1.9601)
[02/28 10:56:55][INFO] train_vision.py:  668: Epoch: [5][40/367], lr: 2.99e-04, eta: 7:47:00	Time 2.998 (3.067)	Data 0.048 (0.110)	Mem 41.61GB	Prec@1 80.000 (69.024)	Loss 1.3339 (1.9584)
[02/28 10:57:25][INFO] train_vision.py:  668: Epoch: [5][50/367], lr: 2.99e-04, eta: 7:44:30	Time 3.006 (3.054)	Data 0.049 (0.098)	Mem 41.61GB	Prec@1 80.000 (69.608)	Loss 1.8921 (1.9445)
[02/28 10:57:55][INFO] train_vision.py:  668: Epoch: [5][60/367], lr: 2.99e-04, eta: 7:42:39	Time 2.982 (3.045)	Data 0.042 (0.090)	Mem 41.61GB	Prec@1 90.000 (70.164)	Loss 1.7008 (1.9357)
[02/28 10:58:25][INFO] train_vision.py:  668: Epoch: [5][70/367], lr: 2.98e-04, eta: 7:41:03	Time 3.002 (3.038)	Data 0.045 (0.084)	Mem 41.61GB	Prec@1 80.000 (70.282)	Loss 1.5939 (1.9284)
[02/28 10:58:55][INFO] train_vision.py:  668: Epoch: [5][80/367], lr: 2.98e-04, eta: 7:39:47	Time 2.991 (3.033)	Data 0.048 (0.080)	Mem 41.61GB	Prec@1 60.000 (69.753)	Loss 2.5684 (1.9175)
[02/28 10:59:25][INFO] train_vision.py:  668: Epoch: [5][90/367], lr: 2.98e-04, eta: 7:38:42	Time 2.999 (3.029)	Data 0.044 (0.076)	Mem 41.61GB	Prec@1 60.000 (69.890)	Loss 1.7012 (1.9086)
[02/28 10:59:55][INFO] train_vision.py:  668: Epoch: [5][100/367], lr: 2.98e-04, eta: 7:37:45	Time 3.001 (3.026)	Data 0.029 (0.073)	Mem 41.61GB	Prec@1 60.000 (69.307)	Loss 2.1937 (1.9374)
[02/28 11:00:25][INFO] train_vision.py:  668: Epoch: [5][110/367], lr: 2.98e-04, eta: 7:36:55	Time 3.003 (3.024)	Data 0.048 (0.071)	Mem 41.61GB	Prec@1 60.000 (69.189)	Loss 1.8767 (1.9278)
[02/28 11:00:55][INFO] train_vision.py:  668: Epoch: [5][120/367], lr: 2.98e-04, eta: 7:36:09	Time 2.992 (3.022)	Data 0.050 (0.069)	Mem 41.61GB	Prec@1 60.000 (68.843)	Loss 2.0336 (1.9309)
[02/28 11:01:25][INFO] train_vision.py:  668: Epoch: [5][130/367], lr: 2.98e-04, eta: 7:35:22	Time 3.002 (3.020)	Data 0.048 (0.068)	Mem 41.61GB	Prec@1 80.000 (68.321)	Loss 1.8937 (1.9446)
[02/28 11:01:55][INFO] train_vision.py:  668: Epoch: [5][140/367], lr: 2.98e-04, eta: 7:34:38	Time 2.989 (3.019)	Data 0.046 (0.066)	Mem 41.61GB	Prec@1 80.000 (68.369)	Loss 1.6029 (1.9408)
[02/28 11:02:25][INFO] train_vision.py:  668: Epoch: [5][150/367], lr: 2.98e-04, eta: 7:33:53	Time 2.986 (3.017)	Data 0.047 (0.065)	Mem 41.61GB	Prec@1 70.000 (68.278)	Loss 1.9548 (1.9343)
[02/28 11:02:55][INFO] train_vision.py:  668: Epoch: [5][160/367], lr: 2.98e-04, eta: 7:33:11	Time 2.997 (3.016)	Data 0.046 (0.064)	Mem 41.61GB	Prec@1 50.000 (68.323)	Loss 2.0714 (1.9334)
[02/28 11:03:25][INFO] train_vision.py:  668: Epoch: [5][170/367], lr: 2.98e-04, eta: 7:32:31	Time 3.006 (3.015)	Data 0.048 (0.063)	Mem 41.61GB	Prec@1 70.000 (68.129)	Loss 1.8494 (1.9321)
[02/28 11:03:55][INFO] train_vision.py:  668: Epoch: [5][180/367], lr: 2.98e-04, eta: 7:31:53	Time 3.002 (3.014)	Data 0.041 (0.063)	Mem 41.61GB	Prec@1 60.000 (68.011)	Loss 2.0120 (1.9312)
[02/28 11:04:25][INFO] train_vision.py:  668: Epoch: [5][190/367], lr: 2.98e-04, eta: 7:31:18	Time 3.001 (3.013)	Data 0.051 (0.062)	Mem 41.61GB	Prec@1 60.000 (67.853)	Loss 1.9993 (1.9335)
[02/28 11:04:55][INFO] train_vision.py:  668: Epoch: [5][200/367], lr: 2.97e-04, eta: 7:30:41	Time 3.013 (3.013)	Data 0.021 (0.061)	Mem 41.61GB	Prec@1 80.000 (67.861)	Loss 1.4786 (1.9257)
[02/28 11:05:25][INFO] train_vision.py:  668: Epoch: [5][210/367], lr: 2.97e-04, eta: 7:30:05	Time 3.008 (3.012)	Data 0.041 (0.061)	Mem 41.61GB	Prec@1 80.000 (67.867)	Loss 2.1405 (1.9282)
[02/28 11:05:55][INFO] train_vision.py:  668: Epoch: [5][220/367], lr: 2.97e-04, eta: 7:29:30	Time 3.002 (3.011)	Data 0.051 (0.060)	Mem 41.61GB	Prec@1 60.000 (67.873)	Loss 2.3200 (1.9270)
[02/28 11:06:25][INFO] train_vision.py:  668: Epoch: [5][230/367], lr: 2.97e-04, eta: 7:28:53	Time 2.987 (3.011)	Data 0.050 (0.060)	Mem 41.61GB	Prec@1 60.000 (67.792)	Loss 1.9351 (1.9284)
[02/28 11:06:55][INFO] train_vision.py:  668: Epoch: [5][240/367], lr: 2.97e-04, eta: 7:28:18	Time 2.997 (3.010)	Data 0.049 (0.059)	Mem 41.61GB	Prec@1 60.000 (67.676)	Loss 2.1138 (1.9291)
[02/28 11:07:25][INFO] train_vision.py:  668: Epoch: [5][250/367], lr: 2.97e-04, eta: 7:27:46	Time 3.010 (3.010)	Data 0.048 (0.059)	Mem 41.61GB	Prec@1 100.000 (67.570)	Loss 1.1561 (1.9297)
[02/28 11:07:55][INFO] train_vision.py:  668: Epoch: [5][260/367], lr: 2.97e-04, eta: 7:27:13	Time 3.001 (3.010)	Data 0.049 (0.059)	Mem 41.61GB	Prec@1 70.000 (67.893)	Loss 1.9996 (1.9286)
[02/28 11:08:25][INFO] train_vision.py:  668: Epoch: [5][270/367], lr: 2.97e-04, eta: 7:26:40	Time 3.001 (3.009)	Data 0.046 (0.059)	Mem 41.61GB	Prec@1 80.000 (68.081)	Loss 1.5112 (1.9252)
[02/28 11:08:55][INFO] train_vision.py:  668: Epoch: [5][280/367], lr: 2.97e-04, eta: 7:26:05	Time 2.989 (3.009)	Data 0.050 (0.059)	Mem 41.61GB	Prec@1 80.000 (68.185)	Loss 1.4116 (1.9192)
[02/28 11:09:25][INFO] train_vision.py:  668: Epoch: [5][290/367], lr: 2.97e-04, eta: 7:25:30	Time 2.993 (3.008)	Data 0.046 (0.058)	Mem 41.61GB	Prec@1 80.000 (68.213)	Loss 1.2220 (1.9202)
[02/28 11:09:55][INFO] train_vision.py:  668: Epoch: [5][300/367], lr: 2.96e-04, eta: 7:24:56	Time 2.998 (3.008)	Data 0.049 (0.058)	Mem 41.61GB	Prec@1 80.000 (68.239)	Loss 1.7652 (1.9254)
[02/28 11:10:25][INFO] train_vision.py:  668: Epoch: [5][310/367], lr: 2.96e-04, eta: 7:24:22	Time 2.972 (3.007)	Data 0.048 (0.058)	Mem 41.61GB	Prec@1 80.000 (68.392)	Loss 1.5026 (1.9217)
[02/28 11:10:55][INFO] train_vision.py:  668: Epoch: [5][320/367], lr: 2.96e-04, eta: 7:23:52	Time 2.999 (3.007)	Data 0.053 (0.057)	Mem 41.61GB	Prec@1 90.000 (68.318)	Loss 1.4769 (1.9265)
[02/28 11:11:25][INFO] train_vision.py:  668: Epoch: [5][330/367], lr: 2.96e-04, eta: 7:23:20	Time 2.980 (3.007)	Data 0.047 (0.057)	Mem 41.61GB	Prec@1 70.000 (68.248)	Loss 2.0420 (1.9252)
[02/28 11:11:55][INFO] train_vision.py:  668: Epoch: [5][340/367], lr: 2.96e-04, eta: 7:22:48	Time 2.987 (3.007)	Data 0.041 (0.057)	Mem 41.61GB	Prec@1 90.000 (68.299)	Loss 1.4709 (1.9253)
[02/28 11:12:25][INFO] train_vision.py:  668: Epoch: [5][350/367], lr: 2.96e-04, eta: 7:22:15	Time 2.978 (3.006)	Data 0.045 (0.056)	Mem 41.61GB	Prec@1 80.000 (68.262)	Loss 1.6354 (1.9246)
[02/28 11:12:55][INFO] train_vision.py:  668: Epoch: [5][360/367], lr: 2.96e-04, eta: 7:21:43	Time 3.001 (3.006)	Data 0.048 (0.056)	Mem 41.61GB	Prec@1 60.000 (68.366)	Loss 2.0108 (1.9211)
[02/28 11:13:19][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 90.000 (90.000)	Prec@5 100.000 (100.000)	mPrec@1 (10.058)	mPrec@5 (11.458)
[02/28 11:14:01][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 92.500 (90.909)	Prec@5 100.000 (99.886)	mPrec@1 (25.788)	mPrec@5 (32.523)
[02/28 11:14:43][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 87.500 (87.917)	Prec@5 100.000 (98.810)	mPrec@1 (32.999)	mPrec@5 (43.562)
[02/28 11:15:25][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 83.750 (87.298)	Prec@5 97.500 (98.790)	mPrec@1 (36.811)	mPrec@5 (51.328)
[02/28 11:16:08][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 87.500 (87.043)	Prec@5 100.000 (98.445)	mPrec@1 (39.398)	mPrec@5 (55.828)
[02/28 11:16:50][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 81.250 (85.662)	Prec@5 97.500 (97.990)	mPrec@1 (39.802)	mPrec@5 (56.904)
[02/28 11:17:32][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 83.750 (85.553)	Prec@5 98.750 (98.053)	mPrec@1 (39.559)	mPrec@5 (59.365)
[02/28 11:18:14][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 82.500 (85.915)	Prec@5 97.500 (98.169)	mPrec@1 (39.604)	mPrec@5 (59.897)
[02/28 11:18:56][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 77.500 (85.988)	Prec@5 98.750 (98.272)	mPrec@1 (41.056)	mPrec@5 (63.749)
[02/28 11:19:39][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 76.250 (85.549)	Prec@5 97.500 (98.242)	mPrec@1 (41.768)	mPrec@5 (65.657)
[02/28 11:20:21][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 83.750 (84.257)	Prec@5 100.000 (97.710)	mPrec@1 (41.610)	mPrec@5 (69.185)
[02/28 11:21:03][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 70.000 (84.482)	Prec@5 83.750 (97.624)	mPrec@1 (41.733)	mPrec@5 (68.998)
[02/28 11:21:44][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 35.417 (82.991)	Prec@5 62.500 (96.984)	mPrec@1 (42.174)	mPrec@5 (71.575)
[02/28 11:21:44][INFO] train_vision.py:  847: Overall Prec@1 82.991% Prec@5 96.984% mPrec@1 (42.174) mPrec@5 (71.575)
[02/28 11:21:44][INFO] train_vision.py:  464: Testing: 42.1744384765625/42.1744384765625
[02/28 11:21:44][INFO] train_vision.py:  465: Saving:
[02/28 11:22:04][INFO] train_vision.py:  668: Epoch: [6][0/367], lr: 2.96e-04, eta: 13:06:50	Time 5.359 (5.359)	Data 2.474 (2.474)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.4248 (1.4248)
[02/28 11:22:34][INFO] train_vision.py:  668: Epoch: [6][10/367], lr: 2.96e-04, eta: 7:50:37	Time 3.006 (3.209)	Data 0.060 (0.290)	Mem 41.61GB	Prec@1 90.000 (74.545)	Loss 1.2991 (1.6876)
[02/28 11:23:04][INFO] train_vision.py:  668: Epoch: [6][20/367], lr: 2.95e-04, eta: 7:35:59	Time 2.988 (3.113)	Data 0.044 (0.178)	Mem 41.61GB	Prec@1 70.000 (70.952)	Loss 1.9272 (1.8588)
[02/28 11:23:34][INFO] train_vision.py:  668: Epoch: [6][30/367], lr: 2.95e-04, eta: 7:30:56	Time 3.007 (3.082)	Data 0.053 (0.142)	Mem 41.61GB	Prec@1 70.000 (70.323)	Loss 2.0140 (1.8846)
[02/28 11:24:04][INFO] train_vision.py:  668: Epoch: [6][40/367], lr: 2.95e-04, eta: 7:28:06	Time 3.031 (3.066)	Data 0.065 (0.122)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.9534 (1.8746)
[02/28 11:24:34][INFO] train_vision.py:  668: Epoch: [6][50/367], lr: 2.95e-04, eta: 7:25:54	Time 3.001 (3.055)	Data 0.054 (0.110)	Mem 41.61GB	Prec@1 40.000 (70.392)	Loss 2.6752 (1.8520)
[02/28 11:25:04][INFO] train_vision.py:  668: Epoch: [6][60/367], lr: 2.95e-04, eta: 7:24:20	Time 3.002 (3.047)	Data 0.058 (0.101)	Mem 41.61GB	Prec@1 50.000 (68.033)	Loss 2.5118 (1.8998)
[02/28 11:25:34][INFO] train_vision.py:  668: Epoch: [6][70/367], lr: 2.95e-04, eta: 7:22:56	Time 3.001 (3.041)	Data 0.057 (0.095)	Mem 41.61GB	Prec@1 80.000 (68.732)	Loss 1.4578 (1.8702)
[02/28 11:26:04][INFO] train_vision.py:  668: Epoch: [6][80/367], lr: 2.95e-04, eta: 7:21:49	Time 3.038 (3.037)	Data 0.069 (0.091)	Mem 41.61GB	Prec@1 60.000 (69.383)	Loss 1.5154 (1.8492)
[02/28 11:26:34][INFO] train_vision.py:  668: Epoch: [6][90/367], lr: 2.95e-04, eta: 7:20:46	Time 3.000 (3.033)	Data 0.057 (0.087)	Mem 41.61GB	Prec@1 70.000 (68.791)	Loss 1.8045 (1.8528)
[02/28 11:27:04][INFO] train_vision.py:  668: Epoch: [6][100/367], lr: 2.94e-04, eta: 7:19:52	Time 3.007 (3.030)	Data 0.056 (0.083)	Mem 41.61GB	Prec@1 80.000 (68.911)	Loss 1.6789 (1.8618)
[02/28 11:27:35][INFO] train_vision.py:  668: Epoch: [6][110/367], lr: 2.94e-04, eta: 7:19:04	Time 3.002 (3.028)	Data 0.062 (0.081)	Mem 41.61GB	Prec@1 80.000 (68.829)	Loss 1.5123 (1.8648)
[02/28 11:28:05][INFO] train_vision.py:  668: Epoch: [6][120/367], lr: 2.94e-04, eta: 7:18:20	Time 3.003 (3.027)	Data 0.062 (0.078)	Mem 41.61GB	Prec@1 70.000 (68.512)	Loss 2.2241 (1.8839)
[02/28 11:28:35][INFO] train_vision.py:  668: Epoch: [6][130/367], lr: 2.94e-04, eta: 7:17:32	Time 2.957 (3.025)	Data 0.057 (0.076)	Mem 41.61GB	Prec@1 70.000 (69.313)	Loss 1.5041 (1.8648)
[02/28 11:29:05][INFO] train_vision.py:  668: Epoch: [6][140/367], lr: 2.94e-04, eta: 7:16:51	Time 3.002 (3.024)	Data 0.063 (0.075)	Mem 41.61GB	Prec@1 60.000 (69.433)	Loss 2.4110 (1.8635)
[02/28 11:29:35][INFO] train_vision.py:  668: Epoch: [6][150/367], lr: 2.94e-04, eta: 7:16:09	Time 3.009 (3.022)	Data 0.064 (0.073)	Mem 41.61GB	Prec@1 60.000 (69.801)	Loss 1.7655 (1.8522)
[02/28 11:30:05][INFO] train_vision.py:  668: Epoch: [6][160/367], lr: 2.94e-04, eta: 7:15:26	Time 3.002 (3.021)	Data 0.055 (0.072)	Mem 41.61GB	Prec@1 70.000 (69.938)	Loss 1.4807 (1.8462)
[02/28 11:30:35][INFO] train_vision.py:  668: Epoch: [6][170/367], lr: 2.93e-04, eta: 7:14:46	Time 3.033 (3.020)	Data 0.026 (0.071)	Mem 41.61GB	Prec@1 70.000 (70.117)	Loss 2.5450 (1.8480)
[02/28 11:31:05][INFO] train_vision.py:  668: Epoch: [6][180/367], lr: 2.93e-04, eta: 7:14:11	Time 3.018 (3.019)	Data 0.041 (0.070)	Mem 41.61GB	Prec@1 90.000 (70.055)	Loss 1.4624 (1.8455)
[02/28 11:31:35][INFO] train_vision.py:  668: Epoch: [6][190/367], lr: 2.93e-04, eta: 7:13:35	Time 3.001 (3.018)	Data 0.059 (0.069)	Mem 41.61GB	Prec@1 60.000 (70.052)	Loss 1.9270 (1.8432)
[02/28 11:32:05][INFO] train_vision.py:  668: Epoch: [6][200/367], lr: 2.93e-04, eta: 7:13:00	Time 3.003 (3.018)	Data 0.055 (0.068)	Mem 41.61GB	Prec@1 70.000 (69.851)	Loss 2.2227 (1.8557)
[02/28 11:32:35][INFO] train_vision.py:  668: Epoch: [6][210/367], lr: 2.93e-04, eta: 7:12:25	Time 3.030 (3.017)	Data 0.055 (0.068)	Mem 41.61GB	Prec@1 70.000 (69.905)	Loss 2.3050 (1.8552)
[02/28 11:33:05][INFO] train_vision.py:  668: Epoch: [6][220/367], lr: 2.93e-04, eta: 7:11:49	Time 3.001 (3.017)	Data 0.055 (0.067)	Mem 41.61GB	Prec@1 60.000 (70.181)	Loss 1.8807 (1.8478)
[02/28 11:33:35][INFO] train_vision.py:  668: Epoch: [6][230/367], lr: 2.93e-04, eta: 7:11:13	Time 3.003 (3.016)	Data 0.052 (0.066)	Mem 41.61GB	Prec@1 70.000 (70.087)	Loss 2.1326 (1.8527)
[02/28 11:34:05][INFO] train_vision.py:  668: Epoch: [6][240/367], lr: 2.92e-04, eta: 7:10:40	Time 3.005 (3.016)	Data 0.063 (0.066)	Mem 41.61GB	Prec@1 80.000 (70.083)	Loss 1.8707 (1.8541)
[02/28 11:34:35][INFO] train_vision.py:  668: Epoch: [6][250/367], lr: 2.92e-04, eta: 7:10:07	Time 3.027 (3.015)	Data 0.057 (0.065)	Mem 41.61GB	Prec@1 60.000 (70.080)	Loss 2.3630 (1.8577)
[02/28 11:35:05][INFO] train_vision.py:  668: Epoch: [6][260/367], lr: 2.92e-04, eta: 7:09:34	Time 3.002 (3.015)	Data 0.059 (0.065)	Mem 41.61GB	Prec@1 70.000 (70.077)	Loss 2.2758 (1.8642)
[02/28 11:35:35][INFO] train_vision.py:  668: Epoch: [6][270/367], lr: 2.92e-04, eta: 7:08:59	Time 2.997 (3.014)	Data 0.056 (0.064)	Mem 41.61GB	Prec@1 80.000 (70.185)	Loss 1.5122 (1.8594)
[02/28 11:36:05][INFO] train_vision.py:  668: Epoch: [6][280/367], lr: 2.92e-04, eta: 7:08:24	Time 3.002 (3.014)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 80.000 (70.463)	Loss 1.7410 (1.8526)
[02/28 11:36:35][INFO] train_vision.py:  668: Epoch: [6][290/367], lr: 2.92e-04, eta: 7:07:51	Time 2.980 (3.013)	Data 0.056 (0.063)	Mem 41.61GB	Prec@1 60.000 (70.309)	Loss 1.7225 (1.8571)
[02/28 11:37:05][INFO] train_vision.py:  668: Epoch: [6][300/367], lr: 2.91e-04, eta: 7:07:19	Time 2.994 (3.013)	Data 0.052 (0.063)	Mem 41.61GB	Prec@1 80.000 (70.432)	Loss 1.7157 (1.8557)
[02/28 11:37:35][INFO] train_vision.py:  668: Epoch: [6][310/367], lr: 2.91e-04, eta: 7:06:45	Time 2.991 (3.013)	Data 0.052 (0.063)	Mem 41.61GB	Prec@1 70.000 (70.450)	Loss 1.7384 (1.8506)
[02/28 11:38:05][INFO] train_vision.py:  668: Epoch: [6][320/367], lr: 2.91e-04, eta: 7:06:10	Time 3.006 (3.012)	Data 0.070 (0.062)	Mem 41.61GB	Prec@1 90.000 (70.685)	Loss 1.8063 (1.8466)
[02/28 11:38:35][INFO] train_vision.py:  668: Epoch: [6][330/367], lr: 2.91e-04, eta: 7:05:36	Time 2.999 (3.012)	Data 0.057 (0.062)	Mem 41.61GB	Prec@1 80.000 (70.846)	Loss 1.1932 (1.8406)
[02/28 11:39:05][INFO] train_vision.py:  668: Epoch: [6][340/367], lr: 2.91e-04, eta: 7:05:01	Time 2.993 (3.011)	Data 0.066 (0.062)	Mem 41.61GB	Prec@1 70.000 (71.144)	Loss 2.0046 (1.8338)
[02/28 11:39:35][INFO] train_vision.py:  668: Epoch: [6][350/367], lr: 2.91e-04, eta: 7:04:28	Time 3.018 (3.011)	Data 0.028 (0.062)	Mem 41.61GB	Prec@1 100.000 (71.197)	Loss 1.1338 (1.8311)
[02/28 11:40:05][INFO] train_vision.py:  668: Epoch: [6][360/367], lr: 2.90e-04, eta: 7:03:56	Time 3.007 (3.011)	Data 0.064 (0.062)	Mem 41.61GB	Prec@1 70.000 (71.108)	Loss 2.2243 (1.8345)
[02/28 11:40:28][INFO] train_vision.py:  668: Epoch: [7][0/367], lr: 2.90e-04, eta: 13:08:11	Time 5.602 (5.602)	Data 2.439 (2.439)	Mem 41.61GB	Prec@1 60.000 (60.000)	Loss 1.9712 (1.9712)
[02/28 11:40:58][INFO] train_vision.py:  668: Epoch: [7][10/367], lr: 2.90e-04, eta: 7:34:46	Time 3.001 (3.236)	Data 0.082 (0.268)	Mem 41.61GB	Prec@1 70.000 (70.909)	Loss 1.6204 (1.8377)
[02/28 11:41:28][INFO] train_vision.py:  668: Epoch: [7][20/367], lr: 2.90e-04, eta: 7:18:39	Time 3.007 (3.125)	Data 0.073 (0.169)	Mem 41.61GB	Prec@1 60.000 (69.524)	Loss 1.8820 (1.9353)
[02/28 11:41:58][INFO] train_vision.py:  668: Epoch: [7][30/367], lr: 2.90e-04, eta: 7:12:47	Time 3.012 (3.087)	Data 0.061 (0.135)	Mem 41.61GB	Prec@1 80.000 (72.258)	Loss 1.3835 (1.8664)
[02/28 11:42:28][INFO] train_vision.py:  668: Epoch: [7][40/367], lr: 2.90e-04, eta: 7:09:28	Time 3.008 (3.067)	Data 0.070 (0.117)	Mem 41.61GB	Prec@1 70.000 (72.683)	Loss 2.2004 (1.8552)
[02/28 11:42:58][INFO] train_vision.py:  668: Epoch: [7][50/367], lr: 2.89e-04, eta: 7:07:11	Time 3.013 (3.054)	Data 0.032 (0.105)	Mem 41.61GB	Prec@1 70.000 (71.961)	Loss 1.9928 (1.8610)
[02/28 11:43:28][INFO] train_vision.py:  668: Epoch: [7][60/367], lr: 2.89e-04, eta: 7:05:28	Time 2.998 (3.046)	Data 0.051 (0.098)	Mem 41.61GB	Prec@1 90.000 (72.459)	Loss 1.1936 (1.8426)
[02/28 11:43:58][INFO] train_vision.py:  668: Epoch: [7][70/367], lr: 2.89e-04, eta: 7:04:06	Time 3.002 (3.039)	Data 0.066 (0.091)	Mem 41.61GB	Prec@1 70.000 (70.845)	Loss 1.8336 (1.8735)
[02/28 11:44:28][INFO] train_vision.py:  668: Epoch: [7][80/367], lr: 2.89e-04, eta: 7:03:01	Time 3.038 (3.035)	Data 0.027 (0.087)	Mem 41.61GB	Prec@1 50.000 (69.753)	Loss 2.2174 (1.8915)
[02/28 11:44:58][INFO] train_vision.py:  668: Epoch: [7][90/367], lr: 2.89e-04, eta: 7:01:56	Time 2.976 (3.031)	Data 0.055 (0.083)	Mem 41.61GB	Prec@1 70.000 (69.890)	Loss 1.5792 (1.8839)
[02/28 11:45:28][INFO] train_vision.py:  668: Epoch: [7][100/367], lr: 2.89e-04, eta: 7:01:03	Time 2.994 (3.028)	Data 0.050 (0.080)	Mem 41.61GB	Prec@1 60.000 (70.297)	Loss 2.0117 (1.8728)
[02/28 11:45:58][INFO] train_vision.py:  668: Epoch: [7][110/367], lr: 2.88e-04, eta: 7:00:11	Time 2.996 (3.026)	Data 0.059 (0.078)	Mem 41.61GB	Prec@1 50.000 (70.180)	Loss 1.8634 (1.8653)
[02/28 11:46:28][INFO] train_vision.py:  668: Epoch: [7][120/367], lr: 2.88e-04, eta: 6:59:22	Time 2.993 (3.024)	Data 0.047 (0.076)	Mem 41.61GB	Prec@1 80.000 (70.579)	Loss 1.4109 (1.8562)
[02/28 11:46:58][INFO] train_vision.py:  668: Epoch: [7][130/367], lr: 2.88e-04, eta: 6:58:35	Time 2.997 (3.022)	Data 0.056 (0.074)	Mem 41.61GB	Prec@1 80.000 (70.534)	Loss 1.4272 (1.8629)
[02/28 11:47:28][INFO] train_vision.py:  668: Epoch: [7][140/367], lr: 2.88e-04, eta: 6:57:52	Time 2.986 (3.020)	Data 0.051 (0.072)	Mem 41.61GB	Prec@1 60.000 (70.709)	Loss 2.4368 (1.8674)
[02/28 11:47:58][INFO] train_vision.py:  668: Epoch: [7][150/367], lr: 2.88e-04, eta: 6:57:12	Time 3.005 (3.019)	Data 0.044 (0.071)	Mem 41.61GB	Prec@1 90.000 (70.464)	Loss 1.3698 (1.8750)
[02/28 11:48:29][INFO] train_vision.py:  668: Epoch: [7][160/367], lr: 2.87e-04, eta: 6:56:34	Time 3.008 (3.018)	Data 0.031 (0.069)	Mem 41.61GB	Prec@1 80.000 (70.932)	Loss 2.0625 (1.8641)
[02/28 11:48:59][INFO] train_vision.py:  668: Epoch: [7][170/367], lr: 2.87e-04, eta: 6:55:56	Time 2.981 (3.017)	Data 0.044 (0.068)	Mem 41.61GB	Prec@1 90.000 (71.170)	Loss 1.6506 (1.8578)
[02/28 11:49:29][INFO] train_vision.py:  668: Epoch: [7][180/367], lr: 2.87e-04, eta: 6:55:19	Time 2.995 (3.016)	Data 0.049 (0.067)	Mem 41.61GB	Prec@1 70.000 (71.271)	Loss 1.8979 (1.8552)
[02/28 11:49:59][INFO] train_vision.py:  668: Epoch: [7][190/367], lr: 2.87e-04, eta: 6:54:42	Time 2.991 (3.015)	Data 0.042 (0.066)	Mem 41.61GB	Prec@1 90.000 (71.623)	Loss 1.4082 (1.8483)
[02/28 11:50:29][INFO] train_vision.py:  668: Epoch: [7][200/367], lr: 2.87e-04, eta: 6:54:07	Time 3.006 (3.015)	Data 0.044 (0.065)	Mem 41.61GB	Prec@1 70.000 (71.692)	Loss 1.8799 (1.8445)
[02/28 11:50:59][INFO] train_vision.py:  668: Epoch: [7][210/367], lr: 2.86e-04, eta: 6:53:33	Time 3.013 (3.014)	Data 0.042 (0.064)	Mem 41.61GB	Prec@1 90.000 (71.943)	Loss 1.3891 (1.8433)
[02/28 11:51:29][INFO] train_vision.py:  668: Epoch: [7][220/367], lr: 2.86e-04, eta: 6:52:59	Time 2.984 (3.014)	Data 0.050 (0.064)	Mem 41.61GB	Prec@1 80.000 (72.036)	Loss 1.8156 (1.8444)
[02/28 11:51:59][INFO] train_vision.py:  668: Epoch: [7][230/367], lr: 2.86e-04, eta: 6:52:26	Time 3.018 (3.013)	Data 0.023 (0.063)	Mem 41.61GB	Prec@1 70.000 (71.948)	Loss 1.8961 (1.8466)
[02/28 11:52:29][INFO] train_vision.py:  668: Epoch: [7][240/367], lr: 2.86e-04, eta: 6:51:52	Time 2.983 (3.013)	Data 0.036 (0.063)	Mem 41.61GB	Prec@1 60.000 (71.867)	Loss 2.4506 (1.8482)
[02/28 11:52:59][INFO] train_vision.py:  668: Epoch: [7][250/367], lr: 2.86e-04, eta: 6:51:20	Time 3.001 (3.013)	Data 0.038 (0.062)	Mem 41.61GB	Prec@1 90.000 (71.992)	Loss 1.4864 (1.8459)
[02/28 11:53:29][INFO] train_vision.py:  668: Epoch: [7][260/367], lr: 2.85e-04, eta: 6:50:49	Time 3.004 (3.013)	Data 0.050 (0.061)	Mem 41.61GB	Prec@1 70.000 (71.877)	Loss 1.8563 (1.8465)
[02/28 11:53:59][INFO] train_vision.py:  668: Epoch: [7][270/367], lr: 2.85e-04, eta: 6:50:18	Time 3.014 (3.012)	Data 0.043 (0.061)	Mem 41.61GB	Prec@1 80.000 (71.993)	Loss 1.6268 (1.8435)
[02/28 11:54:29][INFO] train_vision.py:  668: Epoch: [7][280/367], lr: 2.85e-04, eta: 6:49:45	Time 2.999 (3.012)	Data 0.051 (0.060)	Mem 41.61GB	Prec@1 70.000 (71.779)	Loss 1.8264 (1.8448)
[02/28 11:54:59][INFO] train_vision.py:  668: Epoch: [7][290/367], lr: 2.85e-04, eta: 6:49:13	Time 3.009 (3.012)	Data 0.043 (0.060)	Mem 41.61GB	Prec@1 70.000 (71.856)	Loss 2.0295 (1.8437)
[02/28 11:55:29][INFO] train_vision.py:  668: Epoch: [7][300/367], lr: 2.85e-04, eta: 6:48:42	Time 2.995 (3.012)	Data 0.049 (0.060)	Mem 41.61GB	Prec@1 90.000 (72.060)	Loss 1.5460 (1.8393)
[02/28 11:55:59][INFO] train_vision.py:  668: Epoch: [7][310/367], lr: 2.84e-04, eta: 6:48:08	Time 3.001 (3.011)	Data 0.046 (0.059)	Mem 41.61GB	Prec@1 70.000 (72.058)	Loss 1.6409 (1.8357)
[02/28 11:56:29][INFO] train_vision.py:  668: Epoch: [7][320/367], lr: 2.84e-04, eta: 6:47:35	Time 2.987 (3.011)	Data 0.049 (0.059)	Mem 41.61GB	Prec@1 70.000 (72.118)	Loss 1.7000 (1.8332)
[02/28 11:56:59][INFO] train_vision.py:  668: Epoch: [7][330/367], lr: 2.84e-04, eta: 6:47:05	Time 3.017 (3.011)	Data 0.044 (0.059)	Mem 41.61GB	Prec@1 100.000 (71.934)	Loss 1.1731 (1.8336)
[02/28 11:57:29][INFO] train_vision.py:  668: Epoch: [7][340/367], lr: 2.84e-04, eta: 6:46:33	Time 3.001 (3.011)	Data 0.043 (0.059)	Mem 41.61GB	Prec@1 60.000 (71.965)	Loss 1.8822 (1.8320)
[02/28 11:57:59][INFO] train_vision.py:  668: Epoch: [7][350/367], lr: 2.83e-04, eta: 6:46:02	Time 3.014 (3.011)	Data 0.041 (0.058)	Mem 41.61GB	Prec@1 60.000 (71.738)	Loss 1.5511 (1.8352)
[02/28 11:58:29][INFO] train_vision.py:  668: Epoch: [7][360/367], lr: 2.83e-04, eta: 6:45:30	Time 3.036 (3.010)	Data 0.021 (0.058)	Mem 41.61GB	Prec@1 40.000 (71.717)	Loss 2.7282 (1.8348)
[02/28 11:58:53][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 93.750 (93.750)	Prec@5 100.000 (100.000)	mPrec@1 (10.260)	mPrec@5 (11.458)
[02/28 11:59:36][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 93.750 (91.250)	Prec@5 100.000 (99.659)	mPrec@1 (26.946)	mPrec@5 (32.397)
[02/28 12:00:18][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 87.500 (89.940)	Prec@5 100.000 (99.107)	mPrec@1 (34.616)	mPrec@5 (43.928)
[02/28 12:01:00][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 88.750 (89.758)	Prec@5 98.750 (99.113)	mPrec@1 (40.039)	mPrec@5 (51.875)
[02/28 12:01:42][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 88.750 (89.451)	Prec@5 98.750 (98.750)	mPrec@1 (42.560)	mPrec@5 (56.719)
[02/28 12:02:24][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 77.500 (88.186)	Prec@5 98.750 (98.554)	mPrec@1 (42.648)	mPrec@5 (58.385)
[02/28 12:03:06][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 85.000 (88.258)	Prec@5 100.000 (98.607)	mPrec@1 (43.912)	mPrec@5 (61.025)
[02/28 12:03:49][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 92.500 (88.539)	Prec@5 98.750 (98.697)	mPrec@1 (44.061)	mPrec@5 (62.255)
[02/28 12:04:31][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 86.250 (88.765)	Prec@5 98.750 (98.704)	mPrec@1 (45.424)	mPrec@5 (65.716)
[02/28 12:05:13][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 85.000 (88.214)	Prec@5 97.500 (98.668)	mPrec@1 (45.774)	mPrec@5 (68.104)
[02/28 12:05:55][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 91.250 (86.955)	Prec@5 100.000 (98.267)	mPrec@1 (45.886)	mPrec@5 (72.962)
[02/28 12:06:37][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 65.000 (87.038)	Prec@5 95.000 (98.266)	mPrec@1 (46.154)	mPrec@5 (72.909)
[02/28 12:07:18][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 33.333 (85.551)	Prec@5 70.833 (97.751)	mPrec@1 (46.601)	mPrec@5 (77.435)
[02/28 12:07:18][INFO] train_vision.py:  847: Overall Prec@1 85.551% Prec@5 97.751% mPrec@1 (46.601) mPrec@5 (77.435)
[02/28 12:07:18][INFO] train_vision.py:  464: Testing: 46.60130310058594/46.60130310058594
[02/28 12:07:18][INFO] train_vision.py:  465: Saving:
[02/28 12:07:37][INFO] train_vision.py:  668: Epoch: [8][0/367], lr: 2.83e-04, eta: 11:45:38	Time 5.243 (5.243)	Data 2.348 (2.348)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.7902 (1.7902)
[02/28 12:08:07][INFO] train_vision.py:  668: Epoch: [8][10/367], lr: 2.83e-04, eta: 7:09:43	Time 2.996 (3.197)	Data 0.071 (0.272)	Mem 41.61GB	Prec@1 60.000 (75.455)	Loss 2.1026 (1.8341)
[02/28 12:08:38][INFO] train_vision.py:  668: Epoch: [8][20/367], lr: 2.83e-04, eta: 6:58:08	Time 3.021 (3.115)	Data 0.080 (0.175)	Mem 41.61GB	Prec@1 50.000 (76.667)	Loss 2.3802 (1.7606)
[02/28 12:09:08][INFO] train_vision.py:  668: Epoch: [8][30/367], lr: 2.82e-04, eta: 6:53:20	Time 3.030 (3.083)	Data 0.061 (0.137)	Mem 41.61GB	Prec@1 90.000 (76.452)	Loss 1.4307 (1.7350)
[02/28 12:09:38][INFO] train_vision.py:  668: Epoch: [8][40/367], lr: 2.82e-04, eta: 6:50:50	Time 3.051 (3.068)	Data 0.070 (0.120)	Mem 41.61GB	Prec@1 50.000 (76.341)	Loss 2.2793 (1.7159)
[02/28 12:10:08][INFO] train_vision.py:  668: Epoch: [8][50/367], lr: 2.82e-04, eta: 6:49:14	Time 3.040 (3.060)	Data 0.069 (0.109)	Mem 41.61GB	Prec@1 80.000 (76.863)	Loss 1.6055 (1.6883)
[02/28 12:10:39][INFO] train_vision.py:  668: Epoch: [8][60/367], lr: 2.82e-04, eta: 6:47:53	Time 3.012 (3.054)	Data 0.065 (0.101)	Mem 41.61GB	Prec@1 80.000 (76.885)	Loss 1.7568 (1.6788)
[02/28 12:11:09][INFO] train_vision.py:  668: Epoch: [8][70/367], lr: 2.81e-04, eta: 6:46:50	Time 3.057 (3.049)	Data 0.067 (0.094)	Mem 41.61GB	Prec@1 60.000 (75.915)	Loss 1.8273 (1.7081)
[02/28 12:11:39][INFO] train_vision.py:  668: Epoch: [8][80/367], lr: 2.81e-04, eta: 6:45:59	Time 3.037 (3.047)	Data 0.048 (0.090)	Mem 41.61GB	Prec@1 70.000 (75.432)	Loss 2.0420 (1.7111)
[02/28 12:12:09][INFO] train_vision.py:  668: Epoch: [8][90/367], lr: 2.81e-04, eta: 6:45:07	Time 3.042 (3.044)	Data 0.073 (0.087)	Mem 41.61GB	Prec@1 80.000 (75.055)	Loss 1.5113 (1.7252)
[02/28 12:12:40][INFO] train_vision.py:  668: Epoch: [8][100/367], lr: 2.81e-04, eta: 6:44:23	Time 3.045 (3.042)	Data 0.063 (0.084)	Mem 41.61GB	Prec@1 80.000 (75.149)	Loss 1.9950 (1.7304)
[02/28 12:13:10][INFO] train_vision.py:  668: Epoch: [8][110/367], lr: 2.80e-04, eta: 6:43:25	Time 3.002 (3.039)	Data 0.067 (0.081)	Mem 41.61GB	Prec@1 80.000 (74.775)	Loss 1.7869 (1.7450)
[02/28 12:13:40][INFO] train_vision.py:  668: Epoch: [8][120/367], lr: 2.80e-04, eta: 6:42:38	Time 3.005 (3.037)	Data 0.043 (0.079)	Mem 41.61GB	Prec@1 90.000 (74.876)	Loss 1.4476 (1.7365)
[02/28 12:14:10][INFO] train_vision.py:  668: Epoch: [8][130/367], lr: 2.80e-04, eta: 6:41:51	Time 3.010 (3.035)	Data 0.066 (0.078)	Mem 41.61GB	Prec@1 70.000 (74.885)	Loss 1.5932 (1.7339)
[02/28 12:14:40][INFO] train_vision.py:  668: Epoch: [8][140/367], lr: 2.80e-04, eta: 6:41:06	Time 2.974 (3.033)	Data 0.056 (0.076)	Mem 41.61GB	Prec@1 80.000 (74.681)	Loss 1.4757 (1.7366)
[02/28 12:15:10][INFO] train_vision.py:  668: Epoch: [8][150/367], lr: 2.79e-04, eta: 6:40:23	Time 3.008 (3.031)	Data 0.071 (0.075)	Mem 41.61GB	Prec@1 50.000 (74.967)	Loss 1.9789 (1.7239)
[02/28 12:15:40][INFO] train_vision.py:  668: Epoch: [8][160/367], lr: 2.79e-04, eta: 6:39:45	Time 3.005 (3.030)	Data 0.047 (0.074)	Mem 41.61GB	Prec@1 80.000 (74.348)	Loss 1.7893 (1.7455)
[02/28 12:16:10][INFO] train_vision.py:  668: Epoch: [8][170/367], lr: 2.79e-04, eta: 6:39:07	Time 3.035 (3.029)	Data 0.031 (0.073)	Mem 41.61GB	Prec@1 80.000 (73.977)	Loss 1.5514 (1.7531)
[02/28 12:16:40][INFO] train_vision.py:  668: Epoch: [8][180/367], lr: 2.79e-04, eta: 6:38:29	Time 3.015 (3.028)	Data 0.054 (0.072)	Mem 41.61GB	Prec@1 70.000 (73.481)	Loss 1.9128 (1.7583)
[02/28 12:17:11][INFO] train_vision.py:  668: Epoch: [8][190/367], lr: 2.78e-04, eta: 6:37:54	Time 3.015 (3.028)	Data 0.053 (0.072)	Mem 41.61GB	Prec@1 80.000 (73.403)	Loss 1.5217 (1.7652)
[02/28 12:17:41][INFO] train_vision.py:  668: Epoch: [8][200/367], lr: 2.78e-04, eta: 6:37:18	Time 3.010 (3.027)	Data 0.059 (0.071)	Mem 41.61GB	Prec@1 100.000 (73.781)	Loss 1.1242 (1.7596)
[02/28 12:18:11][INFO] train_vision.py:  668: Epoch: [8][210/367], lr: 2.78e-04, eta: 6:36:40	Time 3.008 (3.026)	Data 0.059 (0.070)	Mem 41.61GB	Prec@1 80.000 (73.934)	Loss 1.7112 (1.7601)
[02/28 12:18:41][INFO] train_vision.py:  668: Epoch: [8][220/367], lr: 2.78e-04, eta: 6:36:04	Time 3.007 (3.025)	Data 0.035 (0.069)	Mem 41.61GB	Prec@1 70.000 (73.575)	Loss 1.5768 (1.7727)
[02/28 12:19:11][INFO] train_vision.py:  668: Epoch: [8][230/367], lr: 2.77e-04, eta: 6:35:29	Time 3.009 (3.025)	Data 0.059 (0.068)	Mem 41.61GB	Prec@1 80.000 (73.636)	Loss 1.6584 (1.7748)
[02/28 12:19:41][INFO] train_vision.py:  668: Epoch: [8][240/367], lr: 2.77e-04, eta: 6:34:52	Time 2.972 (3.024)	Data 0.048 (0.068)	Mem 41.61GB	Prec@1 90.000 (73.402)	Loss 1.4039 (1.7835)
[02/28 12:20:11][INFO] train_vision.py:  668: Epoch: [8][250/367], lr: 2.77e-04, eta: 6:34:17	Time 2.989 (3.023)	Data 0.046 (0.067)	Mem 41.61GB	Prec@1 50.000 (73.386)	Loss 1.7642 (1.7827)
[02/28 12:20:41][INFO] train_vision.py:  668: Epoch: [8][260/367], lr: 2.77e-04, eta: 6:33:44	Time 3.008 (3.023)	Data 0.054 (0.066)	Mem 41.61GB	Prec@1 60.000 (73.602)	Loss 1.8887 (1.7765)
[02/28 12:21:11][INFO] train_vision.py:  668: Epoch: [8][270/367], lr: 2.76e-04, eta: 6:33:10	Time 3.008 (3.022)	Data 0.058 (0.066)	Mem 41.61GB	Prec@1 70.000 (73.801)	Loss 1.8279 (1.7679)
[02/28 12:21:41][INFO] train_vision.py:  668: Epoch: [8][280/367], lr: 2.76e-04, eta: 6:32:36	Time 3.012 (3.022)	Data 0.052 (0.066)	Mem 41.61GB	Prec@1 80.000 (73.737)	Loss 1.3624 (1.7694)
[02/28 12:22:12][INFO] train_vision.py:  668: Epoch: [8][290/367], lr: 2.76e-04, eta: 6:32:02	Time 3.046 (3.022)	Data 0.029 (0.065)	Mem 41.61GB	Prec@1 80.000 (74.021)	Loss 1.4070 (1.7618)
[02/28 12:22:42][INFO] train_vision.py:  668: Epoch: [8][300/367], lr: 2.76e-04, eta: 6:31:29	Time 3.017 (3.021)	Data 0.059 (0.065)	Mem 41.61GB	Prec@1 60.000 (74.053)	Loss 2.1526 (1.7616)
[02/28 12:23:12][INFO] train_vision.py:  668: Epoch: [8][310/367], lr: 2.75e-04, eta: 6:30:56	Time 3.027 (3.021)	Data 0.059 (0.065)	Mem 41.61GB	Prec@1 80.000 (74.212)	Loss 1.5329 (1.7578)
[02/28 12:23:42][INFO] train_vision.py:  668: Epoch: [8][320/367], lr: 2.75e-04, eta: 6:30:23	Time 3.014 (3.020)	Data 0.058 (0.064)	Mem 41.61GB	Prec@1 70.000 (74.361)	Loss 1.9070 (1.7550)
[02/28 12:24:12][INFO] train_vision.py:  668: Epoch: [8][330/367], lr: 2.75e-04, eta: 6:29:50	Time 3.015 (3.020)	Data 0.059 (0.064)	Mem 41.61GB	Prec@1 80.000 (74.411)	Loss 1.3933 (1.7535)
[02/28 12:24:42][INFO] train_vision.py:  668: Epoch: [8][340/367], lr: 2.74e-04, eta: 6:29:17	Time 2.982 (3.020)	Data 0.051 (0.064)	Mem 41.61GB	Prec@1 70.000 (74.516)	Loss 1.7608 (1.7498)
[02/28 12:25:12][INFO] train_vision.py:  668: Epoch: [8][350/367], lr: 2.74e-04, eta: 6:28:45	Time 3.007 (3.019)	Data 0.060 (0.064)	Mem 41.61GB	Prec@1 90.000 (74.558)	Loss 1.3920 (1.7485)
[02/28 12:25:42][INFO] train_vision.py:  668: Epoch: [8][360/367], lr: 2.74e-04, eta: 6:28:13	Time 3.014 (3.019)	Data 0.053 (0.063)	Mem 41.61GB	Prec@1 100.000 (74.681)	Loss 1.2059 (1.7460)
[02/28 12:26:06][INFO] train_vision.py:  668: Epoch: [9][0/367], lr: 2.74e-04, eta: 12:31:34	Time 5.850 (5.850)	Data 2.455 (2.455)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.5187 (1.5187)
[02/28 12:26:36][INFO] train_vision.py:  668: Epoch: [9][10/367], lr: 2.73e-04, eta: 6:58:49	Time 2.971 (3.264)	Data 0.045 (0.275)	Mem 41.61GB	Prec@1 100.000 (75.455)	Loss 1.1549 (1.7153)
[02/28 12:27:06][INFO] train_vision.py:  668: Epoch: [9][20/367], lr: 2.73e-04, eta: 6:43:09	Time 3.037 (3.146)	Data 0.066 (0.170)	Mem 41.61GB	Prec@1 60.000 (76.190)	Loss 2.1084 (1.7073)
[02/28 12:27:36][INFO] train_vision.py:  668: Epoch: [9][30/367], lr: 2.73e-04, eta: 6:36:47	Time 3.003 (3.101)	Data 0.022 (0.132)	Mem 41.61GB	Prec@1 80.000 (71.935)	Loss 1.5612 (1.8309)
[02/28 12:28:06][INFO] train_vision.py:  668: Epoch: [9][40/367], lr: 2.73e-04, eta: 6:33:18	Time 3.030 (3.078)	Data 0.026 (0.111)	Mem 41.61GB	Prec@1 40.000 (72.195)	Loss 2.1275 (1.7884)
[02/28 12:28:36][INFO] train_vision.py:  668: Epoch: [9][50/367], lr: 2.72e-04, eta: 6:30:58	Time 3.007 (3.063)	Data 0.053 (0.099)	Mem 41.61GB	Prec@1 80.000 (70.784)	Loss 1.7154 (1.8321)
[02/28 12:29:06][INFO] train_vision.py:  668: Epoch: [9][60/367], lr: 2.72e-04, eta: 6:29:11	Time 2.993 (3.053)	Data 0.061 (0.092)	Mem 41.61GB	Prec@1 70.000 (70.328)	Loss 1.5793 (1.8196)
[02/28 12:29:36][INFO] train_vision.py:  668: Epoch: [9][70/367], lr: 2.72e-04, eta: 6:27:47	Time 3.003 (3.046)	Data 0.056 (0.086)	Mem 41.61GB	Prec@1 70.000 (71.408)	Loss 2.1568 (1.7944)
[02/28 12:30:06][INFO] train_vision.py:  668: Epoch: [9][80/367], lr: 2.71e-04, eta: 6:26:32	Time 3.030 (3.040)	Data 0.019 (0.082)	Mem 41.61GB	Prec@1 40.000 (72.346)	Loss 2.2779 (1.7660)
[02/28 12:30:36][INFO] train_vision.py:  668: Epoch: [9][90/367], lr: 2.71e-04, eta: 6:25:28	Time 2.999 (3.036)	Data 0.049 (0.079)	Mem 41.61GB	Prec@1 70.000 (72.308)	Loss 2.0903 (1.7802)
[02/28 12:31:06][INFO] train_vision.py:  668: Epoch: [9][100/367], lr: 2.71e-04, eta: 6:24:29	Time 2.977 (3.032)	Data 0.066 (0.077)	Mem 41.61GB	Prec@1 60.000 (72.376)	Loss 1.9339 (1.7785)
[02/28 12:31:36][INFO] train_vision.py:  668: Epoch: [9][110/367], lr: 2.71e-04, eta: 6:23:41	Time 3.016 (3.030)	Data 0.067 (0.074)	Mem 41.61GB	Prec@1 40.000 (72.523)	Loss 1.9370 (1.7664)
[02/28 12:32:06][INFO] train_vision.py:  668: Epoch: [9][120/367], lr: 2.70e-04, eta: 6:22:49	Time 3.004 (3.027)	Data 0.063 (0.072)	Mem 41.61GB	Prec@1 70.000 (72.149)	Loss 1.6243 (1.7728)
[02/28 12:32:36][INFO] train_vision.py:  668: Epoch: [9][130/367], lr: 2.70e-04, eta: 6:22:04	Time 3.008 (3.025)	Data 0.057 (0.071)	Mem 41.61GB	Prec@1 80.000 (72.519)	Loss 1.7955 (1.7614)
[02/28 12:33:06][INFO] train_vision.py:  668: Epoch: [9][140/367], lr: 2.70e-04, eta: 6:21:19	Time 2.999 (3.023)	Data 0.049 (0.070)	Mem 41.61GB	Prec@1 60.000 (73.050)	Loss 2.0989 (1.7576)
[02/28 12:33:36][INFO] train_vision.py:  668: Epoch: [9][150/367], lr: 2.69e-04, eta: 6:20:37	Time 3.000 (3.022)	Data 0.054 (0.068)	Mem 41.61GB	Prec@1 70.000 (73.311)	Loss 2.0404 (1.7534)
[02/28 12:34:06][INFO] train_vision.py:  668: Epoch: [9][160/367], lr: 2.69e-04, eta: 6:19:56	Time 2.983 (3.020)	Data 0.052 (0.067)	Mem 41.61GB	Prec@1 70.000 (73.106)	Loss 1.7404 (1.7577)
[02/28 12:34:36][INFO] train_vision.py:  668: Epoch: [9][170/367], lr: 2.69e-04, eta: 6:19:16	Time 2.989 (3.019)	Data 0.052 (0.066)	Mem 41.61GB	Prec@1 80.000 (73.099)	Loss 1.9729 (1.7518)
[02/28 12:35:06][INFO] train_vision.py:  668: Epoch: [9][180/367], lr: 2.69e-04, eta: 6:18:37	Time 3.001 (3.018)	Data 0.043 (0.066)	Mem 41.61GB	Prec@1 70.000 (73.039)	Loss 1.6438 (1.7540)
[02/28 12:35:36][INFO] train_vision.py:  668: Epoch: [9][190/367], lr: 2.68e-04, eta: 6:17:59	Time 2.996 (3.017)	Data 0.054 (0.065)	Mem 41.61GB	Prec@1 90.000 (72.670)	Loss 1.5524 (1.7709)
[02/28 12:36:06][INFO] train_vision.py:  668: Epoch: [9][200/367], lr: 2.68e-04, eta: 6:17:22	Time 2.998 (3.016)	Data 0.042 (0.065)	Mem 41.61GB	Prec@1 80.000 (73.035)	Loss 1.5703 (1.7557)
[02/28 12:36:36][INFO] train_vision.py:  668: Epoch: [9][210/367], lr: 2.68e-04, eta: 6:16:49	Time 3.015 (3.015)	Data 0.067 (0.064)	Mem 41.61GB	Prec@1 60.000 (73.128)	Loss 2.0806 (1.7535)
[02/28 12:37:06][INFO] train_vision.py:  668: Epoch: [9][220/367], lr: 2.67e-04, eta: 6:16:12	Time 2.991 (3.015)	Data 0.047 (0.063)	Mem 41.61GB	Prec@1 70.000 (73.394)	Loss 1.7897 (1.7464)
[02/28 12:37:36][INFO] train_vision.py:  668: Epoch: [9][230/367], lr: 2.67e-04, eta: 6:15:36	Time 3.006 (3.014)	Data 0.054 (0.063)	Mem 41.61GB	Prec@1 70.000 (73.420)	Loss 1.7580 (1.7451)
[02/28 12:38:06][INFO] train_vision.py:  668: Epoch: [9][240/367], lr: 2.67e-04, eta: 6:15:02	Time 3.004 (3.013)	Data 0.024 (0.062)	Mem 41.61GB	Prec@1 90.000 (73.568)	Loss 1.3328 (1.7415)
[02/28 12:38:36][INFO] train_vision.py:  668: Epoch: [9][250/367], lr: 2.66e-04, eta: 6:14:30	Time 3.010 (3.013)	Data 0.055 (0.062)	Mem 41.61GB	Prec@1 60.000 (73.426)	Loss 1.8609 (1.7449)
[02/28 12:39:06][INFO] train_vision.py:  668: Epoch: [9][260/367], lr: 2.66e-04, eta: 6:13:55	Time 2.972 (3.012)	Data 0.042 (0.061)	Mem 41.61GB	Prec@1 80.000 (73.295)	Loss 1.3671 (1.7480)
[02/28 12:39:36][INFO] train_vision.py:  668: Epoch: [9][270/367], lr: 2.66e-04, eta: 6:13:21	Time 3.004 (3.012)	Data 0.055 (0.061)	Mem 41.61GB	Prec@1 80.000 (73.432)	Loss 1.5164 (1.7399)
[02/28 12:40:06][INFO] train_vision.py:  668: Epoch: [9][280/367], lr: 2.65e-04, eta: 6:12:48	Time 2.992 (3.011)	Data 0.049 (0.061)	Mem 41.61GB	Prec@1 60.000 (73.523)	Loss 2.2875 (1.7367)
[02/28 12:40:36][INFO] train_vision.py:  668: Epoch: [9][290/367], lr: 2.65e-04, eta: 6:12:17	Time 3.019 (3.011)	Data 0.031 (0.060)	Mem 41.61GB	Prec@1 80.000 (73.540)	Loss 1.7100 (1.7403)
[02/28 12:41:06][INFO] train_vision.py:  668: Epoch: [9][300/367], lr: 2.65e-04, eta: 6:11:46	Time 3.004 (3.011)	Data 0.049 (0.060)	Mem 41.61GB	Prec@1 90.000 (73.588)	Loss 1.2324 (1.7389)
[02/28 12:41:36][INFO] train_vision.py:  668: Epoch: [9][310/367], lr: 2.65e-04, eta: 6:11:14	Time 3.005 (3.011)	Data 0.054 (0.060)	Mem 41.61GB	Prec@1 70.000 (73.473)	Loss 2.0872 (1.7434)
[02/28 12:42:06][INFO] train_vision.py:  668: Epoch: [9][320/367], lr: 2.64e-04, eta: 6:10:43	Time 3.002 (3.011)	Data 0.042 (0.059)	Mem 41.61GB	Prec@1 50.000 (73.551)	Loss 2.0946 (1.7434)
[02/28 12:42:36][INFO] train_vision.py:  668: Epoch: [9][330/367], lr: 2.64e-04, eta: 6:10:10	Time 3.002 (3.010)	Data 0.051 (0.059)	Mem 41.61GB	Prec@1 50.000 (73.535)	Loss 2.2860 (1.7442)
[02/28 12:43:06][INFO] train_vision.py:  668: Epoch: [9][340/367], lr: 2.64e-04, eta: 6:09:39	Time 3.006 (3.010)	Data 0.050 (0.059)	Mem 41.61GB	Prec@1 60.000 (73.460)	Loss 1.9007 (1.7454)
[02/28 12:43:36][INFO] train_vision.py:  668: Epoch: [9][350/367], lr: 2.63e-04, eta: 6:09:07	Time 3.001 (3.010)	Data 0.053 (0.059)	Mem 41.61GB	Prec@1 80.000 (73.618)	Loss 1.5094 (1.7418)
[02/28 12:44:06][INFO] train_vision.py:  668: Epoch: [9][360/367], lr: 2.63e-04, eta: 6:08:35	Time 3.023 (3.010)	Data 0.040 (0.058)	Mem 41.61GB	Prec@1 90.000 (73.684)	Loss 1.1613 (1.7414)
[02/28 12:44:31][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 91.250 (91.250)	Prec@5 100.000 (100.000)	mPrec@1 (9.566)	mPrec@5 (11.458)
[02/28 12:45:13][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 93.750 (93.977)	Prec@5 100.000 (99.886)	mPrec@1 (27.485)	mPrec@5 (32.465)
[02/28 12:45:55][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 92.500 (91.131)	Prec@5 100.000 (98.988)	mPrec@1 (35.797)	mPrec@5 (44.191)
[02/28 12:46:37][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 87.500 (90.444)	Prec@5 98.750 (99.153)	mPrec@1 (40.397)	mPrec@5 (52.910)
[02/28 12:47:20][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 88.750 (90.244)	Prec@5 98.750 (99.085)	mPrec@1 (43.370)	mPrec@5 (59.917)
[02/28 12:48:02][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 82.500 (88.971)	Prec@5 97.500 (98.824)	mPrec@1 (43.659)	mPrec@5 (61.813)
[02/28 12:48:44][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 87.500 (88.975)	Prec@5 100.000 (98.914)	mPrec@1 (45.006)	mPrec@5 (65.013)
[02/28 12:49:26][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 93.750 (88.996)	Prec@5 100.000 (98.944)	mPrec@1 (45.456)	mPrec@5 (65.606)
[02/28 12:50:08][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 82.500 (89.059)	Prec@5 100.000 (98.935)	mPrec@1 (47.174)	mPrec@5 (69.757)
[02/28 12:50:50][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 83.750 (88.503)	Prec@5 98.750 (98.915)	mPrec@1 (48.115)	mPrec@5 (72.494)
[02/28 12:51:33][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 93.750 (87.512)	Prec@5 100.000 (98.725)	mPrec@1 (48.460)	mPrec@5 (78.734)
[02/28 12:52:15][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 67.500 (87.545)	Prec@5 96.250 (98.716)	mPrec@1 (48.431)	mPrec@5 (78.532)
[02/28 12:52:55][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 39.583 (86.049)	Prec@5 75.000 (98.342)	mPrec@1 (48.496)	mPrec@5 (83.742)
[02/28 12:52:55][INFO] train_vision.py:  847: Overall Prec@1 86.049% Prec@5 98.342% mPrec@1 (48.496) mPrec@5 (83.742)
[02/28 12:52:56][INFO] train_vision.py:  464: Testing: 48.49597930908203/48.49597930908203
[02/28 12:52:56][INFO] train_vision.py:  465: Saving:
[02/28 12:53:15][INFO] train_vision.py:  668: Epoch: [10][0/367], lr: 2.63e-04, eta: 10:31:44	Time 5.163 (5.163)	Data 2.277 (2.277)	Mem 41.61GB	Prec@1 50.000 (50.000)	Loss 2.0071 (2.0071)
[02/28 12:53:45][INFO] train_vision.py:  668: Epoch: [10][10/367], lr: 2.62e-04, eta: 6:29:14	Time 3.003 (3.186)	Data 0.078 (0.267)	Mem 41.61GB	Prec@1 80.000 (71.818)	Loss 1.2786 (1.7530)
[02/28 12:54:15][INFO] train_vision.py:  668: Epoch: [10][20/367], lr: 2.62e-04, eta: 6:18:36	Time 3.004 (3.103)	Data 0.054 (0.171)	Mem 41.61GB	Prec@1 90.000 (70.476)	Loss 1.3251 (1.7657)
[02/28 12:54:45][INFO] train_vision.py:  668: Epoch: [10][30/367], lr: 2.62e-04, eta: 6:14:33	Time 3.015 (3.074)	Data 0.055 (0.136)	Mem 41.61GB	Prec@1 80.000 (72.258)	Loss 1.7772 (1.7456)
[02/28 12:55:15][INFO] train_vision.py:  668: Epoch: [10][40/367], lr: 2.61e-04, eta: 6:12:12	Time 3.009 (3.059)	Data 0.055 (0.118)	Mem 41.61GB	Prec@1 90.000 (73.171)	Loss 1.3762 (1.7208)
[02/28 12:55:45][INFO] train_vision.py:  668: Epoch: [10][50/367], lr: 2.61e-04, eta: 6:10:42	Time 3.003 (3.051)	Data 0.073 (0.105)	Mem 41.61GB	Prec@1 90.000 (74.314)	Loss 1.4565 (1.6949)
[02/28 12:56:15][INFO] train_vision.py:  668: Epoch: [10][60/367], lr: 2.61e-04, eta: 6:09:35	Time 3.029 (3.046)	Data 0.044 (0.098)	Mem 41.61GB	Prec@1 80.000 (75.410)	Loss 1.8434 (1.6746)
[02/28 12:56:46][INFO] train_vision.py:  668: Epoch: [10][70/367], lr: 2.60e-04, eta: 6:08:35	Time 3.030 (3.042)	Data 0.052 (0.092)	Mem 41.61GB	Prec@1 90.000 (74.789)	Loss 1.5643 (1.6969)
[02/28 12:57:16][INFO] train_vision.py:  668: Epoch: [10][80/367], lr: 2.60e-04, eta: 6:07:38	Time 2.997 (3.038)	Data 0.048 (0.087)	Mem 41.61GB	Prec@1 70.000 (74.815)	Loss 1.6075 (1.7035)
[02/28 12:57:46][INFO] train_vision.py:  668: Epoch: [10][90/367], lr: 2.60e-04, eta: 6:06:43	Time 2.985 (3.035)	Data 0.058 (0.085)	Mem 41.61GB	Prec@1 70.000 (74.505)	Loss 1.4474 (1.7145)
[02/28 12:58:16][INFO] train_vision.py:  668: Epoch: [10][100/367], lr: 2.59e-04, eta: 6:05:56	Time 3.006 (3.032)	Data 0.045 (0.082)	Mem 41.61GB	Prec@1 90.000 (74.554)	Loss 1.2762 (1.7112)
[02/28 12:58:46][INFO] train_vision.py:  668: Epoch: [10][110/367], lr: 2.59e-04, eta: 6:05:08	Time 3.013 (3.030)	Data 0.053 (0.079)	Mem 41.61GB	Prec@1 50.000 (74.505)	Loss 2.5758 (1.7179)
[02/28 12:59:16][INFO] train_vision.py:  668: Epoch: [10][120/367], lr: 2.59e-04, eta: 6:04:22	Time 2.997 (3.028)	Data 0.056 (0.077)	Mem 41.61GB	Prec@1 80.000 (75.041)	Loss 1.6123 (1.7047)
[02/28 12:59:46][INFO] train_vision.py:  668: Epoch: [10][130/367], lr: 2.58e-04, eta: 6:03:35	Time 3.010 (3.025)	Data 0.069 (0.075)	Mem 41.61GB	Prec@1 70.000 (75.038)	Loss 1.5995 (1.7017)
[02/28 13:00:16][INFO] train_vision.py:  668: Epoch: [10][140/367], lr: 2.58e-04, eta: 6:02:53	Time 3.006 (3.024)	Data 0.055 (0.074)	Mem 41.61GB	Prec@1 70.000 (75.248)	Loss 1.4946 (1.6979)
[02/28 13:00:46][INFO] train_vision.py:  668: Epoch: [10][150/367], lr: 2.58e-04, eta: 6:02:11	Time 3.022 (3.022)	Data 0.082 (0.072)	Mem 41.61GB	Prec@1 60.000 (75.298)	Loss 2.0033 (1.6948)
[02/28 13:01:16][INFO] train_vision.py:  668: Epoch: [10][160/367], lr: 2.57e-04, eta: 6:01:32	Time 3.006 (3.021)	Data 0.058 (0.071)	Mem 41.61GB	Prec@1 70.000 (75.342)	Loss 1.6262 (1.6891)
[02/28 13:01:46][INFO] train_vision.py:  668: Epoch: [10][170/367], lr: 2.57e-04, eta: 6:00:52	Time 3.002 (3.020)	Data 0.046 (0.070)	Mem 41.61GB	Prec@1 70.000 (75.263)	Loss 1.5217 (1.6924)
[02/28 13:02:16][INFO] train_vision.py:  668: Epoch: [10][180/367], lr: 2.57e-04, eta: 6:00:15	Time 3.002 (3.018)	Data 0.056 (0.068)	Mem 41.61GB	Prec@1 80.000 (75.304)	Loss 1.5255 (1.6842)
[02/28 13:02:46][INFO] train_vision.py:  668: Epoch: [10][190/367], lr: 2.56e-04, eta: 5:59:40	Time 3.002 (3.018)	Data 0.051 (0.068)	Mem 41.61GB	Prec@1 80.000 (75.288)	Loss 1.4568 (1.6850)
[02/28 13:03:16][INFO] train_vision.py:  668: Epoch: [10][200/367], lr: 2.56e-04, eta: 5:59:04	Time 3.003 (3.017)	Data 0.039 (0.067)	Mem 41.61GB	Prec@1 50.000 (75.174)	Loss 2.6119 (1.6883)
[02/28 13:03:46][INFO] train_vision.py:  668: Epoch: [10][210/367], lr: 2.56e-04, eta: 5:58:31	Time 3.007 (3.017)	Data 0.053 (0.066)	Mem 41.61GB	Prec@1 90.000 (75.355)	Loss 1.4710 (1.6854)
[02/28 13:04:16][INFO] train_vision.py:  668: Epoch: [10][220/367], lr: 2.55e-04, eta: 5:57:57	Time 3.002 (3.016)	Data 0.061 (0.066)	Mem 41.61GB	Prec@1 90.000 (75.249)	Loss 1.3670 (1.6866)
[02/28 13:04:46][INFO] train_vision.py:  668: Epoch: [10][230/367], lr: 2.55e-04, eta: 5:57:23	Time 3.002 (3.015)	Data 0.052 (0.065)	Mem 41.61GB	Prec@1 70.000 (74.935)	Loss 2.1709 (1.6941)
[02/28 13:05:16][INFO] train_vision.py:  668: Epoch: [10][240/367], lr: 2.55e-04, eta: 5:56:49	Time 2.994 (3.015)	Data 0.060 (0.065)	Mem 41.61GB	Prec@1 50.000 (74.896)	Loss 2.5671 (1.6990)
[02/28 13:05:46][INFO] train_vision.py:  668: Epoch: [10][250/367], lr: 2.54e-04, eta: 5:56:16	Time 3.015 (3.015)	Data 0.046 (0.064)	Mem 41.61GB	Prec@1 50.000 (74.821)	Loss 2.2667 (1.7036)
[02/28 13:06:16][INFO] train_vision.py:  668: Epoch: [10][260/367], lr: 2.54e-04, eta: 5:55:42	Time 3.026 (3.014)	Data 0.026 (0.064)	Mem 41.61GB	Prec@1 90.000 (74.866)	Loss 1.3774 (1.6992)
[02/28 13:06:46][INFO] train_vision.py:  668: Epoch: [10][270/367], lr: 2.54e-04, eta: 5:55:09	Time 3.009 (3.014)	Data 0.048 (0.063)	Mem 41.61GB	Prec@1 70.000 (75.314)	Loss 1.4311 (1.6885)
[02/28 13:07:16][INFO] train_vision.py:  668: Epoch: [10][280/367], lr: 2.53e-04, eta: 5:54:36	Time 3.002 (3.013)	Data 0.037 (0.063)	Mem 41.61GB	Prec@1 100.000 (75.409)	Loss 1.1074 (1.6852)
[02/28 13:07:46][INFO] train_vision.py:  668: Epoch: [10][290/367], lr: 2.53e-04, eta: 5:54:03	Time 3.013 (3.013)	Data 0.022 (0.062)	Mem 41.61GB	Prec@1 80.000 (75.498)	Loss 2.0099 (1.6846)
[02/28 13:08:16][INFO] train_vision.py:  668: Epoch: [10][300/367], lr: 2.52e-04, eta: 5:53:30	Time 3.003 (3.012)	Data 0.057 (0.062)	Mem 41.61GB	Prec@1 90.000 (75.581)	Loss 1.2303 (1.6823)
[02/28 13:08:46][INFO] train_vision.py:  668: Epoch: [10][310/367], lr: 2.52e-04, eta: 5:52:57	Time 3.038 (3.012)	Data 0.021 (0.062)	Mem 41.61GB	Prec@1 80.000 (75.723)	Loss 1.3780 (1.6754)
[02/28 13:09:16][INFO] train_vision.py:  668: Epoch: [10][320/367], lr: 2.52e-04, eta: 5:52:25	Time 3.018 (3.012)	Data 0.059 (0.061)	Mem 41.61GB	Prec@1 90.000 (75.888)	Loss 1.3409 (1.6735)
[02/28 13:09:46][INFO] train_vision.py:  668: Epoch: [10][330/367], lr: 2.51e-04, eta: 5:51:54	Time 3.011 (3.012)	Data 0.058 (0.061)	Mem 41.61GB	Prec@1 70.000 (75.921)	Loss 1.6801 (1.6733)
[02/28 13:10:16][INFO] train_vision.py:  668: Epoch: [10][340/367], lr: 2.51e-04, eta: 5:51:22	Time 2.998 (3.011)	Data 0.059 (0.061)	Mem 41.61GB	Prec@1 70.000 (75.630)	Loss 1.9260 (1.6786)
[02/28 13:10:47][INFO] train_vision.py:  668: Epoch: [10][350/367], lr: 2.51e-04, eta: 5:50:51	Time 3.017 (3.011)	Data 0.021 (0.061)	Mem 41.61GB	Prec@1 50.000 (75.499)	Loss 2.5924 (1.6828)
[02/28 13:11:17][INFO] train_vision.py:  668: Epoch: [10][360/367], lr: 2.50e-04, eta: 5:50:20	Time 2.993 (3.011)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 80.000 (75.623)	Loss 1.6860 (1.6794)
[02/28 13:11:40][INFO] train_vision.py:  668: Epoch: [11][0/367], lr: 2.50e-04, eta: 10:43:21	Time 5.535 (5.535)	Data 2.508 (2.508)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.6112 (1.6112)
[02/28 13:12:10][INFO] train_vision.py:  668: Epoch: [11][10/367], lr: 2.50e-04, eta: 6:15:29	Time 2.982 (3.235)	Data 0.053 (0.284)	Mem 41.61GB	Prec@1 80.000 (71.818)	Loss 2.1912 (1.9126)
[02/28 13:12:40][INFO] train_vision.py:  668: Epoch: [11][20/367], lr: 2.49e-04, eta: 6:02:21	Time 3.002 (3.127)	Data 0.055 (0.179)	Mem 41.61GB	Prec@1 70.000 (73.810)	Loss 1.5664 (1.7869)
[02/28 13:13:10][INFO] train_vision.py:  668: Epoch: [11][30/367], lr: 2.49e-04, eta: 5:57:25	Time 3.046 (3.088)	Data 0.022 (0.140)	Mem 41.61GB	Prec@1 70.000 (74.516)	Loss 1.3185 (1.7257)
[02/28 13:13:40][INFO] train_vision.py:  668: Epoch: [11][40/367], lr: 2.49e-04, eta: 5:54:43	Time 2.988 (3.069)	Data 0.061 (0.121)	Mem 41.61GB	Prec@1 80.000 (75.610)	Loss 1.3782 (1.6848)
[02/28 13:14:10][INFO] train_vision.py:  668: Epoch: [11][50/367], lr: 2.48e-04, eta: 5:52:54	Time 3.005 (3.058)	Data 0.061 (0.108)	Mem 41.61GB	Prec@1 100.000 (75.294)	Loss 1.3389 (1.7086)
[02/28 13:14:40][INFO] train_vision.py:  668: Epoch: [11][60/367], lr: 2.48e-04, eta: 5:51:31	Time 3.022 (3.051)	Data 0.077 (0.101)	Mem 41.61GB	Prec@1 70.000 (75.246)	Loss 1.6827 (1.6922)
[02/28 13:15:10][INFO] train_vision.py:  668: Epoch: [11][70/367], lr: 2.47e-04, eta: 5:50:20	Time 3.011 (3.045)	Data 0.051 (0.094)	Mem 41.61GB	Prec@1 70.000 (74.648)	Loss 2.2297 (1.7244)
[02/28 13:15:40][INFO] train_vision.py:  668: Epoch: [11][80/367], lr: 2.47e-04, eta: 5:49:22	Time 3.010 (3.041)	Data 0.076 (0.090)	Mem 41.61GB	Prec@1 100.000 (74.815)	Loss 1.1018 (1.7154)
[02/28 13:16:10][INFO] train_vision.py:  668: Epoch: [11][90/367], lr: 2.47e-04, eta: 5:48:30	Time 3.012 (3.038)	Data 0.059 (0.087)	Mem 41.61GB	Prec@1 60.000 (74.286)	Loss 1.8762 (1.7247)
[02/28 13:16:41][INFO] train_vision.py:  668: Epoch: [11][100/367], lr: 2.46e-04, eta: 5:47:39	Time 3.006 (3.035)	Data 0.051 (0.084)	Mem 41.61GB	Prec@1 80.000 (74.653)	Loss 1.4240 (1.7118)
[02/28 13:17:11][INFO] train_vision.py:  668: Epoch: [11][110/367], lr: 2.46e-04, eta: 5:46:50	Time 3.002 (3.032)	Data 0.055 (0.081)	Mem 41.61GB	Prec@1 90.000 (75.315)	Loss 1.7157 (1.6952)
[02/28 13:17:41][INFO] train_vision.py:  668: Epoch: [11][120/367], lr: 2.46e-04, eta: 5:46:05	Time 3.011 (3.030)	Data 0.057 (0.079)	Mem 41.61GB	Prec@1 70.000 (74.876)	Loss 1.6495 (1.6966)
[02/28 13:18:11][INFO] train_vision.py:  668: Epoch: [11][130/367], lr: 2.45e-04, eta: 5:45:22	Time 3.008 (3.028)	Data 0.049 (0.077)	Mem 41.61GB	Prec@1 80.000 (74.656)	Loss 1.5908 (1.7153)
[02/28 13:18:41][INFO] train_vision.py:  668: Epoch: [11][140/367], lr: 2.45e-04, eta: 5:44:41	Time 3.021 (3.026)	Data 0.061 (0.075)	Mem 41.61GB	Prec@1 70.000 (74.823)	Loss 1.5841 (1.7144)
[02/28 13:19:11][INFO] train_vision.py:  668: Epoch: [11][150/367], lr: 2.44e-04, eta: 5:44:00	Time 2.985 (3.025)	Data 0.044 (0.074)	Mem 41.61GB	Prec@1 90.000 (74.768)	Loss 1.1823 (1.7127)
[02/28 13:19:41][INFO] train_vision.py:  668: Epoch: [11][160/367], lr: 2.44e-04, eta: 5:43:21	Time 3.012 (3.023)	Data 0.062 (0.072)	Mem 41.61GB	Prec@1 80.000 (75.280)	Loss 1.8229 (1.6980)
[02/28 13:20:11][INFO] train_vision.py:  668: Epoch: [11][170/367], lr: 2.44e-04, eta: 5:42:41	Time 2.989 (3.022)	Data 0.047 (0.071)	Mem 41.61GB	Prec@1 50.000 (74.971)	Loss 2.6383 (1.7069)
[02/28 13:20:41][INFO] train_vision.py:  668: Epoch: [11][180/367], lr: 2.43e-04, eta: 5:42:01	Time 3.003 (3.021)	Data 0.060 (0.069)	Mem 41.61GB	Prec@1 70.000 (75.138)	Loss 1.7009 (1.7032)
[02/28 13:21:11][INFO] train_vision.py:  668: Epoch: [11][190/367], lr: 2.43e-04, eta: 5:41:24	Time 3.015 (3.019)	Data 0.041 (0.068)	Mem 41.61GB	Prec@1 80.000 (75.445)	Loss 1.3071 (1.6920)
[02/28 13:21:41][INFO] train_vision.py:  668: Epoch: [11][200/367], lr: 2.42e-04, eta: 5:40:47	Time 3.021 (3.018)	Data 0.069 (0.067)	Mem 41.61GB	Prec@1 70.000 (75.672)	Loss 1.6866 (1.6865)
[02/28 13:22:11][INFO] train_vision.py:  668: Epoch: [11][210/367], lr: 2.42e-04, eta: 5:40:12	Time 3.002 (3.018)	Data 0.052 (0.067)	Mem 41.61GB	Prec@1 100.000 (75.877)	Loss 1.1684 (1.6794)
[02/28 13:22:41][INFO] train_vision.py:  668: Epoch: [11][220/367], lr: 2.42e-04, eta: 5:39:36	Time 3.004 (3.017)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 70.000 (75.792)	Loss 1.3545 (1.6836)
[02/28 13:23:11][INFO] train_vision.py:  668: Epoch: [11][230/367], lr: 2.41e-04, eta: 5:39:02	Time 3.003 (3.016)	Data 0.053 (0.066)	Mem 41.61GB	Prec@1 100.000 (76.061)	Loss 1.0651 (1.6811)
[02/28 13:23:41][INFO] train_vision.py:  668: Epoch: [11][240/367], lr: 2.41e-04, eta: 5:38:27	Time 3.010 (3.016)	Data 0.060 (0.065)	Mem 41.61GB	Prec@1 90.000 (76.058)	Loss 1.5401 (1.6790)
[02/28 13:24:11][INFO] train_vision.py:  668: Epoch: [11][250/367], lr: 2.41e-04, eta: 5:37:53	Time 3.003 (3.015)	Data 0.060 (0.065)	Mem 41.61GB	Prec@1 60.000 (76.016)	Loss 1.8385 (1.6755)
[02/28 13:24:41][INFO] train_vision.py:  668: Epoch: [11][260/367], lr: 2.40e-04, eta: 5:37:20	Time 3.001 (3.015)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 60.000 (76.054)	Loss 2.2859 (1.6743)
[02/28 13:25:11][INFO] train_vision.py:  668: Epoch: [11][270/367], lr: 2.40e-04, eta: 5:36:46	Time 3.007 (3.014)	Data 0.053 (0.064)	Mem 41.61GB	Prec@1 40.000 (75.683)	Loss 2.4994 (1.6854)
[02/28 13:25:41][INFO] train_vision.py:  668: Epoch: [11][280/367], lr: 2.39e-04, eta: 5:36:14	Time 3.012 (3.014)	Data 0.060 (0.063)	Mem 41.61GB	Prec@1 70.000 (75.730)	Loss 1.9125 (1.6816)
[02/28 13:26:11][INFO] train_vision.py:  668: Epoch: [11][290/367], lr: 2.39e-04, eta: 5:35:41	Time 3.028 (3.013)	Data 0.021 (0.063)	Mem 41.61GB	Prec@1 50.000 (75.808)	Loss 1.9635 (1.6819)
[02/28 13:26:41][INFO] train_vision.py:  668: Epoch: [11][300/367], lr: 2.39e-04, eta: 5:35:07	Time 3.028 (3.013)	Data 0.030 (0.062)	Mem 41.61GB	Prec@1 60.000 (75.781)	Loss 1.6210 (1.6804)
[02/28 13:27:11][INFO] train_vision.py:  668: Epoch: [11][310/367], lr: 2.38e-04, eta: 5:34:35	Time 3.002 (3.013)	Data 0.052 (0.062)	Mem 41.61GB	Prec@1 80.000 (75.884)	Loss 1.7310 (1.6790)
[02/28 13:27:41][INFO] train_vision.py:  668: Epoch: [11][320/367], lr: 2.38e-04, eta: 5:34:03	Time 3.010 (3.012)	Data 0.060 (0.061)	Mem 41.61GB	Prec@1 70.000 (75.826)	Loss 1.8989 (1.6802)
[02/28 13:28:11][INFO] train_vision.py:  668: Epoch: [11][330/367], lr: 2.37e-04, eta: 5:33:32	Time 3.039 (3.012)	Data 0.022 (0.061)	Mem 41.61GB	Prec@1 90.000 (76.103)	Loss 1.2972 (1.6727)
[02/28 13:28:41][INFO] train_vision.py:  668: Epoch: [11][340/367], lr: 2.37e-04, eta: 5:33:00	Time 3.022 (3.012)	Data 0.032 (0.061)	Mem 41.61GB	Prec@1 80.000 (76.070)	Loss 1.7664 (1.6760)
[02/28 13:29:11][INFO] train_vision.py:  668: Epoch: [11][350/367], lr: 2.37e-04, eta: 5:32:28	Time 3.001 (3.012)	Data 0.050 (0.060)	Mem 41.61GB	Prec@1 80.000 (76.040)	Loss 1.5114 (1.6753)
[02/28 13:29:41][INFO] train_vision.py:  668: Epoch: [11][360/367], lr: 2.36e-04, eta: 5:31:54	Time 2.968 (3.011)	Data 0.050 (0.060)	Mem 41.61GB	Prec@1 70.000 (76.039)	Loss 1.9296 (1.6730)
[02/28 13:30:05][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 93.750 (93.750)	Prec@5 100.000 (100.000)	mPrec@1 (10.666)	mPrec@5 (11.458)
[02/28 13:30:47][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 88.750 (92.500)	Prec@5 97.500 (99.773)	mPrec@1 (28.137)	mPrec@5 (32.485)
[02/28 13:31:30][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 95.000 (91.310)	Prec@5 100.000 (99.286)	mPrec@1 (37.715)	mPrec@5 (44.242)
[02/28 13:32:12][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 88.750 (90.645)	Prec@5 98.750 (99.355)	mPrec@1 (43.290)	mPrec@5 (53.537)
[02/28 13:32:54][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 87.500 (90.549)	Prec@5 98.750 (99.268)	mPrec@1 (46.879)	mPrec@5 (60.536)
[02/28 13:33:37][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 81.250 (89.338)	Prec@5 100.000 (98.750)	mPrec@1 (46.808)	mPrec@5 (62.425)
[02/28 13:34:19][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 93.750 (89.672)	Prec@5 100.000 (98.811)	mPrec@1 (49.710)	mPrec@5 (65.370)
[02/28 13:35:01][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 93.750 (89.806)	Prec@5 98.750 (98.838)	mPrec@1 (49.902)	mPrec@5 (66.121)
[02/28 13:35:43][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 92.500 (90.031)	Prec@5 100.000 (98.920)	mPrec@1 (50.981)	mPrec@5 (70.347)
[02/28 13:36:26][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 82.500 (89.821)	Prec@5 98.750 (98.942)	mPrec@1 (51.530)	mPrec@5 (73.034)
[02/28 13:37:08][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 86.250 (88.874)	Prec@5 100.000 (98.700)	mPrec@1 (53.291)	mPrec@5 (79.439)
[02/28 13:37:51][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 73.750 (89.077)	Prec@5 96.250 (98.716)	mPrec@1 (53.523)	mPrec@5 (79.360)
[02/28 13:38:31][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 47.917 (87.801)	Prec@5 81.250 (98.362)	mPrec@1 (54.501)	mPrec@5 (85.976)
[02/28 13:38:31][INFO] train_vision.py:  847: Overall Prec@1 87.801% Prec@5 98.362% mPrec@1 (54.501) mPrec@5 (85.976)
[02/28 13:38:31][INFO] train_vision.py:  464: Testing: 54.50132369995117/54.50132369995117
[02/28 13:38:31][INFO] train_vision.py:  465: Saving:
[02/28 13:38:50][INFO] train_vision.py:  668: Epoch: [12][0/367], lr: 2.36e-04, eta: 9:27:06	Time 5.150 (5.150)	Data 2.256 (2.256)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.2527 (1.2527)
[02/28 13:39:20][INFO] train_vision.py:  668: Epoch: [12][10/367], lr: 2.35e-04, eta: 5:51:15	Time 3.021 (3.195)	Data 0.070 (0.266)	Mem 41.61GB	Prec@1 50.000 (74.545)	Loss 2.1847 (1.6928)
[02/28 13:39:51][INFO] train_vision.py:  668: Epoch: [12][20/367], lr: 2.35e-04, eta: 5:41:31	Time 2.996 (3.111)	Data 0.068 (0.175)	Mem 41.61GB	Prec@1 60.000 (77.619)	Loss 1.8344 (1.6130)
[02/28 13:40:21][INFO] train_vision.py:  668: Epoch: [12][30/367], lr: 2.35e-04, eta: 5:37:59	Time 3.018 (3.083)	Data 0.072 (0.140)	Mem 41.61GB	Prec@1 60.000 (76.774)	Loss 2.3593 (1.6570)
[02/28 13:40:51][INFO] train_vision.py:  668: Epoch: [12][40/367], lr: 2.34e-04, eta: 5:35:47	Time 3.032 (3.068)	Data 0.067 (0.123)	Mem 41.61GB	Prec@1 90.000 (76.829)	Loss 1.4098 (1.6436)
[02/28 13:41:21][INFO] train_vision.py:  668: Epoch: [12][50/367], lr: 2.34e-04, eta: 5:34:20	Time 3.020 (3.059)	Data 0.088 (0.114)	Mem 41.61GB	Prec@1 70.000 (76.078)	Loss 1.5939 (1.6523)
[02/28 13:41:52][INFO] train_vision.py:  668: Epoch: [12][60/367], lr: 2.33e-04, eta: 5:33:17	Time 3.042 (3.054)	Data 0.067 (0.107)	Mem 41.61GB	Prec@1 70.000 (76.557)	Loss 2.0715 (1.6639)
[02/28 13:42:22][INFO] train_vision.py:  668: Epoch: [12][70/367], lr: 2.33e-04, eta: 5:32:16	Time 3.013 (3.050)	Data 0.070 (0.103)	Mem 41.61GB	Prec@1 50.000 (76.056)	Loss 2.6446 (1.6774)
[02/28 13:42:52][INFO] train_vision.py:  668: Epoch: [12][80/367], lr: 2.33e-04, eta: 5:31:20	Time 3.039 (3.046)	Data 0.072 (0.097)	Mem 41.61GB	Prec@1 70.000 (76.049)	Loss 1.9066 (1.6663)
[02/28 13:43:22][INFO] train_vision.py:  668: Epoch: [12][90/367], lr: 2.32e-04, eta: 5:30:33	Time 3.031 (3.043)	Data 0.070 (0.095)	Mem 41.61GB	Prec@1 90.000 (76.593)	Loss 1.3603 (1.6589)
[02/28 13:43:52][INFO] train_vision.py:  668: Epoch: [12][100/367], lr: 2.32e-04, eta: 5:29:54	Time 3.056 (3.042)	Data 0.103 (0.093)	Mem 41.61GB	Prec@1 90.000 (77.129)	Loss 1.3124 (1.6443)
[02/28 13:44:23][INFO] train_vision.py:  668: Epoch: [12][110/367], lr: 2.31e-04, eta: 5:29:07	Time 3.010 (3.040)	Data 0.083 (0.092)	Mem 41.61GB	Prec@1 70.000 (77.477)	Loss 1.5992 (1.6293)
[02/28 13:44:53][INFO] train_vision.py:  668: Epoch: [12][120/367], lr: 2.31e-04, eta: 5:28:25	Time 3.017 (3.038)	Data 0.057 (0.090)	Mem 41.61GB	Prec@1 90.000 (77.521)	Loss 1.2763 (1.6217)
[02/28 13:45:23][INFO] train_vision.py:  668: Epoch: [12][130/367], lr: 2.31e-04, eta: 5:27:41	Time 3.011 (3.036)	Data 0.028 (0.088)	Mem 41.61GB	Prec@1 60.000 (77.405)	Loss 1.7078 (1.6282)
[02/28 13:45:53][INFO] train_vision.py:  668: Epoch: [12][140/367], lr: 2.30e-04, eta: 5:27:02	Time 3.024 (3.034)	Data 0.090 (0.086)	Mem 41.61GB	Prec@1 80.000 (77.447)	Loss 1.6030 (1.6347)
[02/28 13:46:23][INFO] train_vision.py:  668: Epoch: [12][150/367], lr: 2.30e-04, eta: 5:26:22	Time 3.012 (3.033)	Data 0.082 (0.085)	Mem 41.61GB	Prec@1 90.000 (77.815)	Loss 1.2730 (1.6226)
[02/28 13:46:53][INFO] train_vision.py:  668: Epoch: [12][160/367], lr: 2.29e-04, eta: 5:25:47	Time 3.022 (3.032)	Data 0.035 (0.084)	Mem 41.61GB	Prec@1 90.000 (77.764)	Loss 1.3038 (1.6280)
[02/28 13:47:24][INFO] train_vision.py:  668: Epoch: [12][170/367], lr: 2.29e-04, eta: 5:25:11	Time 2.998 (3.031)	Data 0.055 (0.083)	Mem 41.61GB	Prec@1 60.000 (77.953)	Loss 1.9782 (1.6182)
[02/28 13:47:54][INFO] train_vision.py:  668: Epoch: [12][180/367], lr: 2.29e-04, eta: 5:24:35	Time 3.052 (3.030)	Data 0.060 (0.082)	Mem 41.61GB	Prec@1 70.000 (78.011)	Loss 2.0992 (1.6232)
[02/28 13:48:24][INFO] train_vision.py:  668: Epoch: [12][190/367], lr: 2.28e-04, eta: 5:23:59	Time 2.991 (3.029)	Data 0.065 (0.081)	Mem 41.61GB	Prec@1 100.000 (78.115)	Loss 1.1538 (1.6238)
[02/28 13:48:54][INFO] train_vision.py:  668: Epoch: [12][200/367], lr: 2.28e-04, eta: 5:23:26	Time 3.054 (3.029)	Data 0.060 (0.080)	Mem 41.61GB	Prec@1 90.000 (77.761)	Loss 1.7885 (1.6306)
[02/28 13:49:24][INFO] train_vision.py:  668: Epoch: [12][210/367], lr: 2.27e-04, eta: 5:22:51	Time 3.005 (3.028)	Data 0.060 (0.079)	Mem 41.61GB	Prec@1 50.000 (77.678)	Loss 1.9529 (1.6295)
[02/28 13:49:54][INFO] train_vision.py:  668: Epoch: [12][220/367], lr: 2.27e-04, eta: 5:22:15	Time 2.989 (3.027)	Data 0.058 (0.079)	Mem 41.61GB	Prec@1 60.000 (77.647)	Loss 2.0977 (1.6273)
[02/28 13:50:25][INFO] train_vision.py:  668: Epoch: [12][230/367], lr: 2.26e-04, eta: 5:21:43	Time 3.010 (3.027)	Data 0.085 (0.078)	Mem 41.61GB	Prec@1 70.000 (77.749)	Loss 2.1244 (1.6245)
[02/28 13:50:55][INFO] train_vision.py:  668: Epoch: [12][240/367], lr: 2.26e-04, eta: 5:21:10	Time 3.011 (3.027)	Data 0.057 (0.078)	Mem 41.61GB	Prec@1 100.000 (77.676)	Loss 1.1493 (1.6285)
[02/28 13:51:25][INFO] train_vision.py:  668: Epoch: [12][250/367], lr: 2.26e-04, eta: 5:20:35	Time 2.977 (3.026)	Data 0.058 (0.077)	Mem 41.61GB	Prec@1 90.000 (77.649)	Loss 1.2097 (1.6318)
[02/28 13:51:55][INFO] train_vision.py:  668: Epoch: [12][260/367], lr: 2.25e-04, eta: 5:20:01	Time 3.021 (3.025)	Data 0.083 (0.076)	Mem 41.61GB	Prec@1 70.000 (77.548)	Loss 1.7587 (1.6333)
[02/28 13:52:25][INFO] train_vision.py:  668: Epoch: [12][270/367], lr: 2.25e-04, eta: 5:19:25	Time 2.996 (3.024)	Data 0.059 (0.075)	Mem 41.61GB	Prec@1 80.000 (77.417)	Loss 1.2849 (1.6341)
[02/28 13:52:55][INFO] train_vision.py:  668: Epoch: [12][280/367], lr: 2.24e-04, eta: 5:18:50	Time 3.042 (3.024)	Data 0.055 (0.074)	Mem 41.61GB	Prec@1 90.000 (77.687)	Loss 1.2523 (1.6292)
[02/28 13:53:25][INFO] train_vision.py:  668: Epoch: [12][290/367], lr: 2.24e-04, eta: 5:18:15	Time 2.986 (3.023)	Data 0.065 (0.073)	Mem 41.61GB	Prec@1 80.000 (77.835)	Loss 1.4075 (1.6219)
[02/28 13:53:55][INFO] train_vision.py:  668: Epoch: [12][300/367], lr: 2.23e-04, eta: 5:17:42	Time 3.006 (3.022)	Data 0.062 (0.073)	Mem 41.61GB	Prec@1 90.000 (77.674)	Loss 1.5672 (1.6236)
[02/28 13:54:25][INFO] train_vision.py:  668: Epoch: [12][310/367], lr: 2.23e-04, eta: 5:17:07	Time 3.003 (3.022)	Data 0.051 (0.072)	Mem 41.61GB	Prec@1 80.000 (77.621)	Loss 1.5495 (1.6252)
[02/28 13:54:55][INFO] train_vision.py:  668: Epoch: [12][320/367], lr: 2.23e-04, eta: 5:16:33	Time 2.984 (3.021)	Data 0.061 (0.071)	Mem 41.61GB	Prec@1 90.000 (77.632)	Loss 1.3274 (1.6260)
[02/28 13:55:25][INFO] train_vision.py:  668: Epoch: [12][330/367], lr: 2.22e-04, eta: 5:15:58	Time 3.004 (3.020)	Data 0.057 (0.071)	Mem 41.61GB	Prec@1 50.000 (77.402)	Loss 2.5166 (1.6330)
[02/28 13:55:55][INFO] train_vision.py:  668: Epoch: [12][340/367], lr: 2.22e-04, eta: 5:15:24	Time 2.980 (3.020)	Data 0.044 (0.070)	Mem 41.61GB	Prec@1 50.000 (77.302)	Loss 1.8710 (1.6320)
[02/28 13:56:25][INFO] train_vision.py:  668: Epoch: [12][350/367], lr: 2.21e-04, eta: 5:14:52	Time 3.025 (3.019)	Data 0.031 (0.070)	Mem 41.61GB	Prec@1 100.000 (77.493)	Loss 1.0737 (1.6271)
[02/28 13:56:55][INFO] train_vision.py:  668: Epoch: [12][360/367], lr: 2.21e-04, eta: 5:14:19	Time 2.998 (3.019)	Data 0.059 (0.069)	Mem 41.61GB	Prec@1 70.000 (77.313)	Loss 1.6620 (1.6303)
[02/28 13:57:18][INFO] train_vision.py:  668: Epoch: [13][0/367], lr: 2.21e-04, eta: 9:34:39	Time 5.526 (5.526)	Data 2.558 (2.558)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.1721 (1.1721)
[02/28 13:57:48][INFO] train_vision.py:  668: Epoch: [13][10/367], lr: 2.20e-04, eta: 5:37:30	Time 3.008 (3.251)	Data 0.048 (0.278)	Mem 41.61GB	Prec@1 90.000 (83.636)	Loss 1.1796 (1.4018)
[02/28 13:58:19][INFO] train_vision.py:  668: Epoch: [13][20/367], lr: 2.20e-04, eta: 5:25:17	Time 3.022 (3.138)	Data 0.077 (0.175)	Mem 41.61GB	Prec@1 90.000 (79.524)	Loss 1.3975 (1.5274)
[02/28 13:58:49][INFO] train_vision.py:  668: Epoch: [13][30/367], lr: 2.19e-04, eta: 5:20:35	Time 3.008 (3.098)	Data 0.039 (0.139)	Mem 41.61GB	Prec@1 70.000 (76.129)	Loss 1.7871 (1.6187)
[02/28 13:59:19][INFO] train_vision.py:  668: Epoch: [13][40/367], lr: 2.19e-04, eta: 5:18:12	Time 3.036 (3.079)	Data 0.066 (0.122)	Mem 41.61GB	Prec@1 70.000 (76.829)	Loss 1.8883 (1.6154)
[02/28 13:59:49][INFO] train_vision.py:  668: Epoch: [13][50/367], lr: 2.18e-04, eta: 5:16:29	Time 3.007 (3.068)	Data 0.066 (0.109)	Mem 41.61GB	Prec@1 80.000 (77.647)	Loss 1.2270 (1.6115)
[02/28 14:00:19][INFO] train_vision.py:  668: Epoch: [13][60/367], lr: 2.18e-04, eta: 5:15:01	Time 3.020 (3.058)	Data 0.056 (0.101)	Mem 41.61GB	Prec@1 70.000 (76.393)	Loss 1.6650 (1.6451)
[02/28 14:00:49][INFO] train_vision.py:  668: Epoch: [13][70/367], lr: 2.18e-04, eta: 5:13:50	Time 3.002 (3.052)	Data 0.059 (0.095)	Mem 41.61GB	Prec@1 70.000 (76.338)	Loss 1.6418 (1.6463)
[02/28 14:01:19][INFO] train_vision.py:  668: Epoch: [13][80/367], lr: 2.17e-04, eta: 5:12:53	Time 3.037 (3.048)	Data 0.064 (0.091)	Mem 41.61GB	Prec@1 70.000 (75.926)	Loss 1.8792 (1.6644)
[02/28 14:01:50][INFO] train_vision.py:  668: Epoch: [13][90/367], lr: 2.17e-04, eta: 5:11:57	Time 3.003 (3.043)	Data 0.060 (0.088)	Mem 41.61GB	Prec@1 90.000 (76.484)	Loss 1.3974 (1.6443)
[02/28 14:02:20][INFO] train_vision.py:  668: Epoch: [13][100/367], lr: 2.16e-04, eta: 5:11:09	Time 3.036 (3.041)	Data 0.065 (0.086)	Mem 41.61GB	Prec@1 100.000 (76.733)	Loss 0.9649 (1.6422)
[02/28 14:02:50][INFO] train_vision.py:  668: Epoch: [13][110/367], lr: 2.16e-04, eta: 5:10:22	Time 3.002 (3.038)	Data 0.059 (0.083)	Mem 41.61GB	Prec@1 100.000 (76.667)	Loss 1.0919 (1.6428)
[02/28 14:03:20][INFO] train_vision.py:  668: Epoch: [13][120/367], lr: 2.15e-04, eta: 5:09:40	Time 3.023 (3.036)	Data 0.063 (0.082)	Mem 41.61GB	Prec@1 70.000 (76.942)	Loss 1.6034 (1.6332)
[02/28 14:03:50][INFO] train_vision.py:  668: Epoch: [13][130/367], lr: 2.15e-04, eta: 5:09:00	Time 3.003 (3.034)	Data 0.058 (0.081)	Mem 41.61GB	Prec@1 60.000 (77.405)	Loss 1.6825 (1.6241)
[02/28 14:04:20][INFO] train_vision.py:  668: Epoch: [13][140/367], lr: 2.14e-04, eta: 5:08:21	Time 3.039 (3.033)	Data 0.067 (0.079)	Mem 41.61GB	Prec@1 60.000 (77.660)	Loss 1.9626 (1.6226)
[02/28 14:04:50][INFO] train_vision.py:  668: Epoch: [13][150/367], lr: 2.14e-04, eta: 5:07:41	Time 2.992 (3.031)	Data 0.040 (0.078)	Mem 41.61GB	Prec@1 70.000 (78.146)	Loss 1.4474 (1.6089)
[02/28 14:05:20][INFO] train_vision.py:  668: Epoch: [13][160/367], lr: 2.14e-04, eta: 5:07:00	Time 3.025 (3.030)	Data 0.066 (0.076)	Mem 41.61GB	Prec@1 90.000 (78.199)	Loss 1.5157 (1.6076)
[02/28 14:05:51][INFO] train_vision.py:  668: Epoch: [13][170/367], lr: 2.13e-04, eta: 5:06:23	Time 3.013 (3.029)	Data 0.028 (0.076)	Mem 41.61GB	Prec@1 70.000 (77.953)	Loss 1.8955 (1.6108)
[02/28 14:06:21][INFO] train_vision.py:  668: Epoch: [13][180/367], lr: 2.13e-04, eta: 5:05:48	Time 2.994 (3.028)	Data 0.067 (0.075)	Mem 41.61GB	Prec@1 70.000 (78.177)	Loss 1.7463 (1.6020)
[02/28 14:06:51][INFO] train_vision.py:  668: Epoch: [13][190/367], lr: 2.12e-04, eta: 5:05:12	Time 2.995 (3.027)	Data 0.051 (0.074)	Mem 41.61GB	Prec@1 80.000 (78.377)	Loss 1.8243 (1.5983)
[02/28 14:07:21][INFO] train_vision.py:  668: Epoch: [13][200/367], lr: 2.12e-04, eta: 5:04:33	Time 3.001 (3.025)	Data 0.051 (0.073)	Mem 41.61GB	Prec@1 100.000 (78.557)	Loss 1.0243 (1.5945)
[02/28 14:07:51][INFO] train_vision.py:  668: Epoch: [13][210/367], lr: 2.11e-04, eta: 5:03:55	Time 2.972 (3.024)	Data 0.053 (0.072)	Mem 41.61GB	Prec@1 60.000 (78.626)	Loss 1.7840 (1.5893)
[02/28 14:08:21][INFO] train_vision.py:  668: Epoch: [13][220/367], lr: 2.11e-04, eta: 5:03:21	Time 3.038 (3.023)	Data 0.023 (0.071)	Mem 41.61GB	Prec@1 70.000 (78.597)	Loss 1.9886 (1.5909)
[02/28 14:08:51][INFO] train_vision.py:  668: Epoch: [13][230/367], lr: 2.10e-04, eta: 5:02:46	Time 3.000 (3.023)	Data 0.036 (0.070)	Mem 41.61GB	Prec@1 80.000 (78.442)	Loss 1.5328 (1.5916)
[02/28 14:09:21][INFO] train_vision.py:  668: Epoch: [13][240/367], lr: 2.10e-04, eta: 5:02:10	Time 3.010 (3.022)	Data 0.055 (0.069)	Mem 41.61GB	Prec@1 90.000 (78.465)	Loss 1.6114 (1.5940)
[02/28 14:09:51][INFO] train_vision.py:  668: Epoch: [13][250/367], lr: 2.10e-04, eta: 5:01:35	Time 2.991 (3.021)	Data 0.048 (0.069)	Mem 41.61GB	Prec@1 80.000 (78.486)	Loss 1.2398 (1.5955)
[02/28 14:10:21][INFO] train_vision.py:  668: Epoch: [13][260/367], lr: 2.09e-04, eta: 5:01:01	Time 3.005 (3.020)	Data 0.049 (0.068)	Mem 41.61GB	Prec@1 90.000 (78.621)	Loss 1.2041 (1.5889)
[02/28 14:10:51][INFO] train_vision.py:  668: Epoch: [13][270/367], lr: 2.09e-04, eta: 5:00:29	Time 3.009 (3.020)	Data 0.058 (0.068)	Mem 41.61GB	Prec@1 100.000 (78.524)	Loss 1.2545 (1.5915)
[02/28 14:11:21][INFO] train_vision.py:  668: Epoch: [13][280/367], lr: 2.08e-04, eta: 4:59:55	Time 3.012 (3.019)	Data 0.055 (0.067)	Mem 41.61GB	Prec@1 70.000 (78.719)	Loss 1.9178 (1.5906)
[02/28 14:11:51][INFO] train_vision.py:  668: Epoch: [13][290/367], lr: 2.08e-04, eta: 4:59:21	Time 2.992 (3.019)	Data 0.054 (0.067)	Mem 41.61GB	Prec@1 90.000 (78.625)	Loss 1.2341 (1.5930)
[02/28 14:12:21][INFO] train_vision.py:  668: Epoch: [13][300/367], lr: 2.07e-04, eta: 4:58:49	Time 3.016 (3.018)	Data 0.055 (0.067)	Mem 41.61GB	Prec@1 80.000 (78.538)	Loss 1.6989 (1.5923)
[02/28 14:12:51][INFO] train_vision.py:  668: Epoch: [13][310/367], lr: 2.07e-04, eta: 4:58:16	Time 3.008 (3.018)	Data 0.054 (0.066)	Mem 41.61GB	Prec@1 70.000 (78.360)	Loss 1.9558 (1.5966)
[02/28 14:13:21][INFO] train_vision.py:  668: Epoch: [13][320/367], lr: 2.06e-04, eta: 4:57:43	Time 3.007 (3.017)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 80.000 (78.380)	Loss 1.7462 (1.5966)
[02/28 14:13:51][INFO] train_vision.py:  668: Epoch: [13][330/367], lr: 2.06e-04, eta: 4:57:11	Time 3.002 (3.017)	Data 0.052 (0.065)	Mem 41.61GB	Prec@1 80.000 (78.459)	Loss 1.7251 (1.5960)
[02/28 14:14:21][INFO] train_vision.py:  668: Epoch: [13][340/367], lr: 2.05e-04, eta: 4:56:38	Time 3.012 (3.017)	Data 0.055 (0.065)	Mem 41.61GB	Prec@1 90.000 (78.387)	Loss 1.2876 (1.6009)
[02/28 14:14:51][INFO] train_vision.py:  668: Epoch: [13][350/367], lr: 2.05e-04, eta: 4:56:06	Time 2.995 (3.016)	Data 0.052 (0.065)	Mem 41.61GB	Prec@1 90.000 (78.490)	Loss 1.2716 (1.5991)
[02/28 14:15:21][INFO] train_vision.py:  668: Epoch: [13][360/367], lr: 2.05e-04, eta: 4:55:34	Time 3.001 (3.016)	Data 0.054 (0.065)	Mem 41.61GB	Prec@1 90.000 (78.670)	Loss 1.4093 (1.5977)
[02/28 14:15:46][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 95.000 (95.000)	Prec@5 100.000 (100.000)	mPrec@1 (10.839)	mPrec@5 (11.458)
[02/28 14:16:28][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 90.000 (93.864)	Prec@5 100.000 (99.773)	mPrec@1 (29.298)	mPrec@5 (32.465)
[02/28 14:17:10][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 96.250 (92.321)	Prec@5 100.000 (99.405)	mPrec@1 (38.344)	mPrec@5 (44.771)
[02/28 14:17:52][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 92.500 (92.540)	Prec@5 98.750 (99.395)	mPrec@1 (45.034)	mPrec@5 (53.882)
[02/28 14:18:35][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 88.750 (92.805)	Prec@5 100.000 (99.421)	mPrec@1 (49.872)	mPrec@5 (61.235)
[02/28 14:19:17][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 86.250 (91.838)	Prec@5 100.000 (99.167)	mPrec@1 (50.036)	mPrec@5 (64.086)
[02/28 14:19:59][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 92.500 (92.008)	Prec@5 100.000 (99.262)	mPrec@1 (53.021)	mPrec@5 (67.160)
[02/28 14:20:42][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 93.750 (92.077)	Prec@5 100.000 (99.313)	mPrec@1 (53.288)	mPrec@5 (68.567)
[02/28 14:21:24][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 91.250 (92.130)	Prec@5 100.000 (99.336)	mPrec@1 (54.826)	mPrec@5 (72.472)
[02/28 14:22:06][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 80.000 (91.731)	Prec@5 100.000 (99.341)	mPrec@1 (55.870)	mPrec@5 (76.077)
[02/28 14:22:49][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 96.250 (90.891)	Prec@5 100.000 (99.208)	mPrec@1 (58.028)	mPrec@5 (83.560)
[02/28 14:23:31][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 73.750 (90.935)	Prec@5 96.250 (99.189)	mPrec@1 (57.890)	mPrec@5 (83.725)
[02/28 14:24:12][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 52.083 (89.687)	Prec@5 81.250 (98.995)	mPrec@1 (58.581)	mPrec@5 (89.763)
[02/28 14:24:12][INFO] train_vision.py:  847: Overall Prec@1 89.687% Prec@5 98.995% mPrec@1 (58.581) mPrec@5 (89.763)
[02/28 14:24:12][INFO] train_vision.py:  464: Testing: 58.58109664916992/58.58109664916992
[02/28 14:24:12][INFO] train_vision.py:  465: Saving:
[02/28 14:24:31][INFO] train_vision.py:  668: Epoch: [14][0/367], lr: 2.04e-04, eta: 8:18:01	Time 5.088 (5.088)	Data 2.233 (2.233)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.4727 (1.4727)
[02/28 14:25:01][INFO] train_vision.py:  668: Epoch: [14][10/367], lr: 2.04e-04, eta: 5:11:35	Time 3.000 (3.189)	Data 0.062 (0.256)	Mem 41.61GB	Prec@1 80.000 (74.545)	Loss 1.2316 (1.6664)
[02/28 14:25:31][INFO] train_vision.py:  668: Epoch: [14][20/367], lr: 2.03e-04, eta: 5:02:38	Time 2.994 (3.102)	Data 0.051 (0.159)	Mem 41.61GB	Prec@1 70.000 (77.619)	Loss 1.5577 (1.5743)
[02/28 14:26:01][INFO] train_vision.py:  668: Epoch: [14][30/367], lr: 2.03e-04, eta: 4:59:06	Time 3.011 (3.071)	Data 0.056 (0.125)	Mem 41.61GB	Prec@1 70.000 (76.774)	Loss 2.1488 (1.6304)
[02/28 14:26:31][INFO] train_vision.py:  668: Epoch: [14][40/367], lr: 2.02e-04, eta: 4:57:13	Time 3.006 (3.057)	Data 0.050 (0.108)	Mem 41.61GB	Prec@1 40.000 (75.610)	Loss 2.4954 (1.6732)
[02/28 14:27:01][INFO] train_vision.py:  668: Epoch: [14][50/367], lr: 2.02e-04, eta: 4:55:40	Time 3.013 (3.047)	Data 0.032 (0.095)	Mem 41.61GB	Prec@1 70.000 (75.882)	Loss 1.7225 (1.6633)
[02/28 14:27:31][INFO] train_vision.py:  668: Epoch: [14][60/367], lr: 2.01e-04, eta: 4:54:36	Time 3.011 (3.041)	Data 0.065 (0.089)	Mem 41.61GB	Prec@1 40.000 (76.230)	Loss 3.0455 (1.6562)
[02/28 14:28:01][INFO] train_vision.py:  668: Epoch: [14][70/367], lr: 2.01e-04, eta: 4:53:32	Time 3.008 (3.035)	Data 0.053 (0.084)	Mem 41.61GB	Prec@1 80.000 (76.056)	Loss 1.3848 (1.6645)
[02/28 14:28:31][INFO] train_vision.py:  668: Epoch: [14][80/367], lr: 2.01e-04, eta: 4:52:37	Time 3.013 (3.031)	Data 0.067 (0.080)	Mem 41.61GB	Prec@1 60.000 (76.667)	Loss 2.1367 (1.6641)
[02/28 14:29:01][INFO] train_vision.py:  668: Epoch: [14][90/367], lr: 2.00e-04, eta: 4:51:46	Time 3.003 (3.027)	Data 0.061 (0.077)	Mem 41.61GB	Prec@1 80.000 (76.813)	Loss 1.6898 (1.6505)
[02/28 14:29:31][INFO] train_vision.py:  668: Epoch: [14][100/367], lr: 2.00e-04, eta: 4:50:58	Time 2.982 (3.024)	Data 0.056 (0.075)	Mem 41.61GB	Prec@1 90.000 (76.634)	Loss 1.2779 (1.6425)
[02/28 14:30:01][INFO] train_vision.py:  668: Epoch: [14][110/367], lr: 1.99e-04, eta: 4:50:19	Time 3.043 (3.023)	Data 0.071 (0.073)	Mem 41.61GB	Prec@1 80.000 (77.297)	Loss 1.7068 (1.6273)
[02/28 14:30:31][INFO] train_vision.py:  668: Epoch: [14][120/367], lr: 1.99e-04, eta: 4:49:38	Time 3.004 (3.021)	Data 0.058 (0.071)	Mem 41.61GB	Prec@1 100.000 (77.934)	Loss 1.0717 (1.6116)
[02/28 14:31:01][INFO] train_vision.py:  668: Epoch: [14][130/367], lr: 1.98e-04, eta: 4:48:59	Time 2.997 (3.019)	Data 0.046 (0.070)	Mem 41.61GB	Prec@1 80.000 (78.550)	Loss 1.6257 (1.6007)
[02/28 14:31:31][INFO] train_vision.py:  668: Epoch: [14][140/367], lr: 1.98e-04, eta: 4:48:22	Time 3.007 (3.018)	Data 0.057 (0.068)	Mem 41.61GB	Prec@1 60.000 (78.936)	Loss 2.3034 (1.5904)
[02/28 14:32:01][INFO] train_vision.py:  668: Epoch: [14][150/367], lr: 1.97e-04, eta: 4:47:47	Time 2.987 (3.017)	Data 0.049 (0.067)	Mem 41.61GB	Prec@1 90.000 (79.073)	Loss 1.3646 (1.5908)
[02/28 14:32:31][INFO] train_vision.py:  668: Epoch: [14][160/367], lr: 1.97e-04, eta: 4:47:12	Time 3.005 (3.016)	Data 0.057 (0.066)	Mem 41.61GB	Prec@1 90.000 (79.006)	Loss 1.3454 (1.5922)
[02/28 14:33:01][INFO] train_vision.py:  668: Epoch: [14][170/367], lr: 1.96e-04, eta: 4:46:36	Time 2.997 (3.015)	Data 0.040 (0.065)	Mem 41.61GB	Prec@1 70.000 (78.655)	Loss 1.6818 (1.5973)
[02/28 14:33:32][INFO] train_vision.py:  668: Epoch: [14][180/367], lr: 1.96e-04, eta: 4:46:02	Time 3.008 (3.015)	Data 0.063 (0.064)	Mem 41.61GB	Prec@1 90.000 (78.287)	Loss 1.4443 (1.5962)
[02/28 14:34:02][INFO] train_vision.py:  668: Epoch: [14][190/367], lr: 1.95e-04, eta: 4:45:28	Time 2.997 (3.014)	Data 0.050 (0.064)	Mem 41.61GB	Prec@1 70.000 (78.272)	Loss 1.8752 (1.5984)
[02/28 14:34:31][INFO] train_vision.py:  668: Epoch: [14][200/367], lr: 1.95e-04, eta: 4:44:53	Time 3.014 (3.013)	Data 0.085 (0.063)	Mem 41.61GB	Prec@1 60.000 (78.109)	Loss 2.5174 (1.6021)
[02/28 14:35:02][INFO] train_vision.py:  668: Epoch: [14][210/367], lr: 1.94e-04, eta: 4:44:23	Time 3.000 (3.013)	Data 0.047 (0.063)	Mem 41.61GB	Prec@1 90.000 (78.294)	Loss 1.3555 (1.5972)
[02/28 14:35:32][INFO] train_vision.py:  668: Epoch: [14][220/367], lr: 1.94e-04, eta: 4:43:48	Time 3.009 (3.012)	Data 0.057 (0.062)	Mem 41.61GB	Prec@1 80.000 (78.190)	Loss 1.5204 (1.5986)
[02/28 14:36:02][INFO] train_vision.py:  668: Epoch: [14][230/367], lr: 1.94e-04, eta: 4:43:14	Time 2.982 (3.012)	Data 0.047 (0.061)	Mem 41.61GB	Prec@1 60.000 (78.139)	Loss 2.3755 (1.6037)
[02/28 14:36:31][INFO] train_vision.py:  668: Epoch: [14][240/367], lr: 1.93e-04, eta: 4:42:39	Time 3.000 (3.011)	Data 0.052 (0.061)	Mem 41.61GB	Prec@1 80.000 (78.299)	Loss 1.3324 (1.5994)
[02/28 14:37:01][INFO] train_vision.py:  668: Epoch: [14][250/367], lr: 1.93e-04, eta: 4:42:06	Time 2.989 (3.010)	Data 0.043 (0.061)	Mem 41.61GB	Prec@1 100.000 (78.367)	Loss 1.0828 (1.6001)
[02/28 14:37:31][INFO] train_vision.py:  668: Epoch: [14][260/367], lr: 1.92e-04, eta: 4:41:33	Time 3.004 (3.010)	Data 0.069 (0.060)	Mem 41.61GB	Prec@1 80.000 (78.467)	Loss 1.4179 (1.5940)
[02/28 14:38:01][INFO] train_vision.py:  668: Epoch: [14][270/367], lr: 1.92e-04, eta: 4:41:01	Time 3.001 (3.009)	Data 0.047 (0.060)	Mem 41.61GB	Prec@1 80.000 (78.782)	Loss 1.5814 (1.5846)
[02/28 14:38:31][INFO] train_vision.py:  668: Epoch: [14][280/367], lr: 1.91e-04, eta: 4:40:27	Time 3.003 (3.009)	Data 0.056 (0.059)	Mem 41.61GB	Prec@1 100.000 (79.039)	Loss 1.2341 (1.5817)
[02/28 14:39:01][INFO] train_vision.py:  668: Epoch: [14][290/367], lr: 1.91e-04, eta: 4:39:54	Time 2.973 (3.008)	Data 0.048 (0.059)	Mem 41.61GB	Prec@1 100.000 (79.003)	Loss 1.1080 (1.5846)
[02/28 14:39:31][INFO] train_vision.py:  668: Epoch: [14][300/367], lr: 1.90e-04, eta: 4:39:23	Time 3.005 (3.008)	Data 0.033 (0.058)	Mem 41.61GB	Prec@1 80.000 (79.169)	Loss 1.5823 (1.5822)
[02/28 14:40:01][INFO] train_vision.py:  668: Epoch: [14][310/367], lr: 1.90e-04, eta: 4:38:50	Time 2.993 (3.007)	Data 0.039 (0.058)	Mem 41.61GB	Prec@1 90.000 (79.421)	Loss 1.1753 (1.5743)
[02/28 14:40:31][INFO] train_vision.py:  668: Epoch: [14][320/367], lr: 1.89e-04, eta: 4:38:17	Time 3.017 (3.007)	Data 0.036 (0.058)	Mem 41.61GB	Prec@1 70.000 (79.190)	Loss 1.9811 (1.5811)
[02/28 14:41:01][INFO] train_vision.py:  668: Epoch: [14][330/367], lr: 1.89e-04, eta: 4:37:45	Time 2.977 (3.007)	Data 0.043 (0.058)	Mem 41.61GB	Prec@1 80.000 (78.973)	Loss 1.8346 (1.5858)
[02/28 14:41:31][INFO] train_vision.py:  668: Epoch: [14][340/367], lr: 1.88e-04, eta: 4:37:13	Time 3.006 (3.006)	Data 0.055 (0.057)	Mem 41.61GB	Prec@1 60.000 (78.915)	Loss 2.1080 (1.5871)
[02/28 14:42:01][INFO] train_vision.py:  668: Epoch: [14][350/367], lr: 1.88e-04, eta: 4:36:40	Time 2.996 (3.006)	Data 0.021 (0.057)	Mem 41.61GB	Prec@1 80.000 (78.860)	Loss 1.9055 (1.5871)
[02/28 14:42:31][INFO] train_vision.py:  668: Epoch: [14][360/367], lr: 1.87e-04, eta: 4:36:09	Time 2.994 (3.005)	Data 0.055 (0.057)	Mem 41.61GB	Prec@1 80.000 (78.947)	Loss 1.4631 (1.5827)
[02/28 14:42:54][INFO] train_vision.py:  668: Epoch: [15][0/367], lr: 1.87e-04, eta: 8:25:20	Time 5.507 (5.507)	Data 2.444 (2.444)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.5682 (1.5682)
[02/28 14:43:24][INFO] train_vision.py:  668: Epoch: [15][10/367], lr: 1.87e-04, eta: 4:57:32	Time 3.035 (3.248)	Data 0.064 (0.285)	Mem 41.61GB	Prec@1 80.000 (83.636)	Loss 1.4223 (1.4462)
[02/28 14:43:54][INFO] train_vision.py:  668: Epoch: [15][20/367], lr: 1.86e-04, eta: 4:46:43	Time 2.976 (3.136)	Data 0.060 (0.177)	Mem 41.61GB	Prec@1 90.000 (80.000)	Loss 1.2399 (1.5511)
[02/28 14:44:24][INFO] train_vision.py:  668: Epoch: [15][30/367], lr: 1.86e-04, eta: 4:42:57	Time 3.039 (3.100)	Data 0.080 (0.139)	Mem 41.61GB	Prec@1 80.000 (79.677)	Loss 1.3430 (1.5738)
[02/28 14:44:55][INFO] train_vision.py:  668: Epoch: [15][40/367], lr: 1.85e-04, eta: 4:40:48	Time 3.013 (3.082)	Data 0.062 (0.120)	Mem 41.61GB	Prec@1 50.000 (76.829)	Loss 2.0535 (1.6118)
[02/28 14:45:25][INFO] train_vision.py:  668: Epoch: [15][50/367], lr: 1.85e-04, eta: 4:39:11	Time 3.024 (3.070)	Data 0.072 (0.109)	Mem 41.61GB	Prec@1 80.000 (77.647)	Loss 1.7363 (1.6095)
[02/28 14:45:55][INFO] train_vision.py:  668: Epoch: [15][60/367], lr: 1.84e-04, eta: 4:37:47	Time 3.016 (3.061)	Data 0.066 (0.099)	Mem 41.61GB	Prec@1 80.000 (77.541)	Loss 1.8895 (1.6108)
[02/28 14:46:25][INFO] train_vision.py:  668: Epoch: [15][70/367], lr: 1.84e-04, eta: 4:36:33	Time 3.018 (3.052)	Data 0.068 (0.092)	Mem 41.61GB	Prec@1 90.000 (77.887)	Loss 1.2243 (1.5889)
[02/28 14:46:55][INFO] train_vision.py:  668: Epoch: [15][80/367], lr: 1.83e-04, eta: 4:35:24	Time 2.995 (3.045)	Data 0.055 (0.087)	Mem 41.61GB	Prec@1 70.000 (79.136)	Loss 1.9117 (1.5651)
[02/28 14:47:25][INFO] train_vision.py:  668: Epoch: [15][90/367], lr: 1.83e-04, eta: 4:34:23	Time 2.988 (3.040)	Data 0.058 (0.083)	Mem 41.61GB	Prec@1 70.000 (79.670)	Loss 1.7866 (1.5507)
[02/28 14:47:55][INFO] train_vision.py:  668: Epoch: [15][100/367], lr: 1.82e-04, eta: 4:33:29	Time 2.994 (3.035)	Data 0.062 (0.080)	Mem 41.61GB	Prec@1 70.000 (79.802)	Loss 1.6554 (1.5461)
[02/28 14:48:25][INFO] train_vision.py:  668: Epoch: [15][110/367], lr: 1.82e-04, eta: 4:32:40	Time 2.990 (3.032)	Data 0.056 (0.077)	Mem 41.61GB	Prec@1 80.000 (79.730)	Loss 1.4852 (1.5442)
[02/28 14:48:55][INFO] train_vision.py:  668: Epoch: [15][120/367], lr: 1.81e-04, eta: 4:31:53	Time 2.990 (3.029)	Data 0.052 (0.075)	Mem 41.61GB	Prec@1 90.000 (79.421)	Loss 1.1569 (1.5489)
[02/28 14:49:25][INFO] train_vision.py:  668: Epoch: [15][130/367], lr: 1.81e-04, eta: 4:31:11	Time 3.021 (3.027)	Data 0.062 (0.072)	Mem 41.61GB	Prec@1 60.000 (79.160)	Loss 1.9024 (1.5559)
[02/28 14:49:55][INFO] train_vision.py:  668: Epoch: [15][140/367], lr: 1.80e-04, eta: 4:30:27	Time 3.025 (3.024)	Data 0.074 (0.071)	Mem 41.61GB	Prec@1 90.000 (79.007)	Loss 1.3514 (1.5665)
[02/28 14:50:25][INFO] train_vision.py:  668: Epoch: [15][150/367], lr: 1.80e-04, eta: 4:29:45	Time 2.991 (3.022)	Data 0.057 (0.070)	Mem 41.61GB	Prec@1 80.000 (78.808)	Loss 1.5363 (1.5666)
[02/28 14:50:54][INFO] train_vision.py:  668: Epoch: [15][160/367], lr: 1.79e-04, eta: 4:29:04	Time 2.983 (3.020)	Data 0.049 (0.068)	Mem 41.61GB	Prec@1 80.000 (79.068)	Loss 1.5205 (1.5612)
[02/28 14:51:24][INFO] train_vision.py:  668: Epoch: [15][170/367], lr: 1.79e-04, eta: 4:28:25	Time 2.996 (3.018)	Data 0.055 (0.067)	Mem 41.61GB	Prec@1 100.000 (79.064)	Loss 1.0008 (1.5611)
[02/28 14:51:54][INFO] train_vision.py:  668: Epoch: [15][180/367], lr: 1.78e-04, eta: 4:27:46	Time 2.992 (3.017)	Data 0.052 (0.066)	Mem 41.61GB	Prec@1 100.000 (79.227)	Loss 1.0280 (1.5604)
[02/28 14:52:24][INFO] train_vision.py:  668: Epoch: [15][190/367], lr: 1.78e-04, eta: 4:27:10	Time 2.999 (3.015)	Data 0.044 (0.065)	Mem 41.61GB	Prec@1 100.000 (79.529)	Loss 1.1568 (1.5523)
[02/28 14:52:54][INFO] train_vision.py:  668: Epoch: [15][200/367], lr: 1.78e-04, eta: 4:26:34	Time 2.990 (3.014)	Data 0.049 (0.065)	Mem 41.61GB	Prec@1 50.000 (79.303)	Loss 2.1854 (1.5568)
[02/28 14:53:24][INFO] train_vision.py:  668: Epoch: [15][210/367], lr: 1.77e-04, eta: 4:25:58	Time 2.993 (3.013)	Data 0.057 (0.064)	Mem 41.61GB	Prec@1 90.000 (79.005)	Loss 1.5466 (1.5667)
[02/28 14:53:54][INFO] train_vision.py:  668: Epoch: [15][220/367], lr: 1.77e-04, eta: 4:25:22	Time 2.989 (3.012)	Data 0.037 (0.063)	Mem 41.61GB	Prec@1 80.000 (79.231)	Loss 1.5545 (1.5583)
[02/28 14:54:24][INFO] train_vision.py:  668: Epoch: [15][230/367], lr: 1.76e-04, eta: 4:24:47	Time 2.999 (3.011)	Data 0.059 (0.063)	Mem 41.61GB	Prec@1 90.000 (79.048)	Loss 1.2707 (1.5645)
[02/28 14:54:54][INFO] train_vision.py:  668: Epoch: [15][240/367], lr: 1.76e-04, eta: 4:24:13	Time 2.981 (3.011)	Data 0.041 (0.062)	Mem 41.61GB	Prec@1 80.000 (78.921)	Loss 1.2022 (1.5615)
[02/28 14:55:24][INFO] train_vision.py:  668: Epoch: [15][250/367], lr: 1.75e-04, eta: 4:23:38	Time 3.010 (3.010)	Data 0.029 (0.062)	Mem 41.61GB	Prec@1 80.000 (79.084)	Loss 1.5763 (1.5575)
[02/28 14:55:54][INFO] train_vision.py:  668: Epoch: [15][260/367], lr: 1.75e-04, eta: 4:23:04	Time 3.000 (3.009)	Data 0.051 (0.061)	Mem 41.61GB	Prec@1 90.000 (79.157)	Loss 1.3017 (1.5551)
[02/28 14:56:23][INFO] train_vision.py:  668: Epoch: [15][270/367], lr: 1.74e-04, eta: 4:22:30	Time 2.994 (3.008)	Data 0.056 (0.061)	Mem 41.61GB	Prec@1 70.000 (79.004)	Loss 1.9008 (1.5609)
[02/28 14:56:53][INFO] train_vision.py:  668: Epoch: [15][280/367], lr: 1.74e-04, eta: 4:21:57	Time 2.991 (3.008)	Data 0.043 (0.060)	Mem 41.61GB	Prec@1 80.000 (78.932)	Loss 1.4784 (1.5622)
[02/28 14:57:23][INFO] train_vision.py:  668: Epoch: [15][290/367], lr: 1.73e-04, eta: 4:21:24	Time 2.995 (3.007)	Data 0.063 (0.060)	Mem 41.61GB	Prec@1 80.000 (79.141)	Loss 1.6116 (1.5597)
[02/28 14:57:53][INFO] train_vision.py:  668: Epoch: [15][300/367], lr: 1.73e-04, eta: 4:20:52	Time 2.992 (3.007)	Data 0.046 (0.060)	Mem 41.61GB	Prec@1 80.000 (79.402)	Loss 1.5520 (1.5522)
[02/28 14:58:23][INFO] train_vision.py:  668: Epoch: [15][310/367], lr: 1.72e-04, eta: 4:20:19	Time 2.989 (3.006)	Data 0.053 (0.059)	Mem 41.61GB	Prec@1 60.000 (79.196)	Loss 2.5209 (1.5580)
[02/28 14:58:53][INFO] train_vision.py:  668: Epoch: [15][320/367], lr: 1.72e-04, eta: 4:19:46	Time 3.002 (3.006)	Data 0.025 (0.059)	Mem 41.61GB	Prec@1 80.000 (79.346)	Loss 1.5042 (1.5545)
[02/28 14:59:23][INFO] train_vision.py:  668: Epoch: [15][330/367], lr: 1.71e-04, eta: 4:19:14	Time 2.982 (3.005)	Data 0.051 (0.058)	Mem 41.61GB	Prec@1 90.000 (79.426)	Loss 1.3398 (1.5528)
[02/28 14:59:53][INFO] train_vision.py:  668: Epoch: [15][340/367], lr: 1.71e-04, eta: 4:18:42	Time 2.971 (3.005)	Data 0.049 (0.058)	Mem 41.61GB	Prec@1 70.000 (79.355)	Loss 1.9133 (1.5539)
[02/28 15:00:23][INFO] train_vision.py:  668: Epoch: [15][350/367], lr: 1.70e-04, eta: 4:18:10	Time 2.994 (3.004)	Data 0.056 (0.058)	Mem 41.61GB	Prec@1 80.000 (79.430)	Loss 1.5403 (1.5517)
[02/28 15:00:53][INFO] train_vision.py:  668: Epoch: [15][360/367], lr: 1.70e-04, eta: 4:17:38	Time 2.991 (3.004)	Data 0.048 (0.058)	Mem 41.61GB	Prec@1 90.000 (79.612)	Loss 1.3460 (1.5464)
[02/28 15:01:17][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (11.389)	mPrec@5 (11.458)
[02/28 15:01:59][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 98.750 (94.205)	Prec@5 100.000 (99.773)	mPrec@1 (29.124)	mPrec@5 (32.465)
[02/28 15:02:41][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 95.000 (92.917)	Prec@5 100.000 (99.464)	mPrec@1 (38.388)	mPrec@5 (44.832)
[02/28 15:03:23][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 91.250 (92.702)	Prec@5 98.750 (99.435)	mPrec@1 (43.798)	mPrec@5 (53.708)
[02/28 15:04:05][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 96.250 (92.957)	Prec@5 98.750 (99.421)	mPrec@1 (48.557)	mPrec@5 (60.964)
[02/28 15:04:47][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 86.250 (92.377)	Prec@5 100.000 (99.314)	mPrec@1 (49.047)	mPrec@5 (63.397)
[02/28 15:05:29][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 95.000 (92.275)	Prec@5 100.000 (99.385)	mPrec@1 (50.769)	mPrec@5 (66.454)
[02/28 15:06:11][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 95.000 (92.465)	Prec@5 100.000 (99.437)	mPrec@1 (51.388)	mPrec@5 (68.184)
[02/28 15:06:53][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 95.000 (92.608)	Prec@5 100.000 (99.429)	mPrec@1 (53.687)	mPrec@5 (72.348)
[02/28 15:07:35][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 90.000 (92.418)	Prec@5 100.000 (99.396)	mPrec@1 (55.607)	mPrec@5 (75.925)
[02/28 15:08:17][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 97.500 (91.609)	Prec@5 100.000 (99.319)	mPrec@1 (57.392)	mPrec@5 (84.207)
[02/28 15:09:00][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 83.750 (91.757)	Prec@5 95.000 (99.279)	mPrec@1 (57.850)	mPrec@5 (84.316)
[02/28 15:09:40][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 47.917 (90.506)	Prec@5 81.250 (99.119)	mPrec@1 (58.056)	mPrec@5 (91.347)
[02/28 15:09:40][INFO] train_vision.py:  847: Overall Prec@1 90.506% Prec@5 99.119% mPrec@1 (58.056) mPrec@5 (91.347)
[02/28 15:09:41][INFO] train_vision.py:  464: Testing: 58.05647659301758/58.58109664916992
[02/28 15:09:41][INFO] train_vision.py:  465: Saving:
[02/28 15:09:53][INFO] train_vision.py:  668: Epoch: [16][0/367], lr: 1.69e-04, eta: 7:24:28	Time 5.189 (5.189)	Data 2.281 (2.281)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.4676 (1.4676)
[02/28 15:10:23][INFO] train_vision.py:  668: Epoch: [16][10/367], lr: 1.69e-04, eta: 4:33:54	Time 3.008 (3.204)	Data 0.047 (0.252)	Mem 41.61GB	Prec@1 100.000 (80.909)	Loss 0.9685 (1.5580)
[02/28 15:10:53][INFO] train_vision.py:  668: Epoch: [16][20/367], lr: 1.68e-04, eta: 4:25:12	Time 2.996 (3.109)	Data 0.069 (0.159)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.4222 (1.5952)
[02/28 15:11:23][INFO] train_vision.py:  668: Epoch: [16][30/367], lr: 1.68e-04, eta: 4:21:54	Time 3.007 (3.076)	Data 0.062 (0.126)	Mem 41.61GB	Prec@1 70.000 (80.968)	Loss 2.0747 (1.5818)
[02/28 15:11:53][INFO] train_vision.py:  668: Epoch: [16][40/367], lr: 1.68e-04, eta: 4:19:57	Time 3.006 (3.059)	Data 0.084 (0.110)	Mem 41.61GB	Prec@1 90.000 (81.707)	Loss 1.2606 (1.5439)
[02/28 15:12:23][INFO] train_vision.py:  668: Epoch: [16][50/367], lr: 1.67e-04, eta: 4:18:31	Time 3.008 (3.048)	Data 0.066 (0.098)	Mem 41.61GB	Prec@1 80.000 (80.784)	Loss 1.3279 (1.5386)
[02/28 15:12:53][INFO] train_vision.py:  668: Epoch: [16][60/367], lr: 1.67e-04, eta: 4:17:28	Time 3.036 (3.042)	Data 0.028 (0.090)	Mem 41.61GB	Prec@1 90.000 (80.820)	Loss 1.3111 (1.5436)
[02/28 15:13:23][INFO] train_vision.py:  668: Epoch: [16][70/367], lr: 1.66e-04, eta: 4:16:33	Time 3.028 (3.037)	Data 0.026 (0.085)	Mem 41.61GB	Prec@1 50.000 (79.577)	Loss 2.0727 (1.5697)
[02/28 15:13:53][INFO] train_vision.py:  668: Epoch: [16][80/367], lr: 1.66e-04, eta: 4:15:40	Time 3.007 (3.032)	Data 0.058 (0.082)	Mem 41.61GB	Prec@1 60.000 (79.506)	Loss 1.8242 (1.5670)
[02/28 15:14:23][INFO] train_vision.py:  668: Epoch: [16][90/367], lr: 1.65e-04, eta: 4:14:50	Time 2.988 (3.028)	Data 0.052 (0.079)	Mem 41.61GB	Prec@1 90.000 (80.110)	Loss 1.3409 (1.5551)
[02/28 15:14:53][INFO] train_vision.py:  668: Epoch: [16][100/367], lr: 1.65e-04, eta: 4:14:09	Time 3.024 (3.026)	Data 0.070 (0.077)	Mem 41.61GB	Prec@1 90.000 (80.594)	Loss 1.4155 (1.5436)
[02/28 15:15:23][INFO] train_vision.py:  668: Epoch: [16][110/367], lr: 1.64e-04, eta: 4:13:29	Time 2.981 (3.024)	Data 0.056 (0.075)	Mem 41.61GB	Prec@1 80.000 (80.180)	Loss 1.5577 (1.5516)
[02/28 15:15:53][INFO] train_vision.py:  668: Epoch: [16][120/367], lr: 1.64e-04, eta: 4:12:54	Time 3.018 (3.023)	Data 0.064 (0.073)	Mem 41.61GB	Prec@1 50.000 (80.331)	Loss 2.5905 (1.5577)
[02/28 15:16:24][INFO] train_vision.py:  668: Epoch: [16][130/367], lr: 1.63e-04, eta: 4:12:16	Time 2.996 (3.022)	Data 0.053 (0.072)	Mem 41.61GB	Prec@1 100.000 (80.763)	Loss 1.1149 (1.5450)
[02/28 15:16:54][INFO] train_vision.py:  668: Epoch: [16][140/367], lr: 1.63e-04, eta: 4:11:42	Time 3.029 (3.021)	Data 0.063 (0.071)	Mem 41.61GB	Prec@1 80.000 (80.993)	Loss 1.8999 (1.5472)
[02/28 15:17:24][INFO] train_vision.py:  668: Epoch: [16][150/367], lr: 1.62e-04, eta: 4:11:07	Time 3.031 (3.020)	Data 0.020 (0.070)	Mem 41.61GB	Prec@1 90.000 (81.126)	Loss 1.1335 (1.5385)
[02/28 15:17:54][INFO] train_vision.py:  668: Epoch: [16][160/367], lr: 1.62e-04, eta: 4:10:31	Time 3.025 (3.019)	Data 0.061 (0.069)	Mem 41.61GB	Prec@1 70.000 (81.242)	Loss 1.9535 (1.5383)
[02/28 15:18:24][INFO] train_vision.py:  668: Epoch: [16][170/367], lr: 1.61e-04, eta: 4:09:56	Time 2.989 (3.018)	Data 0.046 (0.069)	Mem 41.61GB	Prec@1 90.000 (81.170)	Loss 1.2093 (1.5395)
[02/28 15:18:54][INFO] train_vision.py:  668: Epoch: [16][180/367], lr: 1.61e-04, eta: 4:09:25	Time 3.000 (3.018)	Data 0.064 (0.068)	Mem 41.61GB	Prec@1 70.000 (80.773)	Loss 1.6733 (1.5496)
[02/28 15:19:24][INFO] train_vision.py:  668: Epoch: [16][190/367], lr: 1.60e-04, eta: 4:08:52	Time 2.971 (3.017)	Data 0.054 (0.067)	Mem 41.61GB	Prec@1 90.000 (80.681)	Loss 1.2517 (1.5536)
[02/28 15:19:54][INFO] train_vision.py:  668: Epoch: [16][200/367], lr: 1.60e-04, eta: 4:08:19	Time 3.034 (3.017)	Data 0.029 (0.066)	Mem 41.61GB	Prec@1 70.000 (80.299)	Loss 1.6831 (1.5571)
[02/28 15:20:24][INFO] train_vision.py:  668: Epoch: [16][210/367], lr: 1.59e-04, eta: 4:07:46	Time 3.005 (3.016)	Data 0.065 (0.066)	Mem 41.61GB	Prec@1 90.000 (80.332)	Loss 1.7389 (1.5551)
[02/28 15:20:54][INFO] train_vision.py:  668: Epoch: [16][220/367], lr: 1.59e-04, eta: 4:07:13	Time 3.028 (3.016)	Data 0.040 (0.065)	Mem 41.61GB	Prec@1 80.000 (80.588)	Loss 1.4770 (1.5473)
[02/28 15:21:24][INFO] train_vision.py:  668: Epoch: [16][230/367], lr: 1.58e-04, eta: 4:06:40	Time 3.008 (3.015)	Data 0.053 (0.065)	Mem 41.61GB	Prec@1 80.000 (80.476)	Loss 1.3591 (1.5508)
[02/28 15:21:54][INFO] train_vision.py:  668: Epoch: [16][240/367], lr: 1.58e-04, eta: 4:06:08	Time 2.994 (3.015)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 70.000 (80.456)	Loss 1.8193 (1.5476)
[02/28 15:22:24][INFO] train_vision.py:  668: Epoch: [16][250/367], lr: 1.57e-04, eta: 4:05:35	Time 2.999 (3.014)	Data 0.053 (0.064)	Mem 41.61GB	Prec@1 80.000 (80.558)	Loss 1.3190 (1.5421)
[02/28 15:22:54][INFO] train_vision.py:  668: Epoch: [16][260/367], lr: 1.57e-04, eta: 4:05:02	Time 3.002 (3.013)	Data 0.057 (0.063)	Mem 41.61GB	Prec@1 100.000 (81.034)	Loss 1.1319 (1.5295)
[02/28 15:23:24][INFO] train_vision.py:  668: Epoch: [16][270/367], lr: 1.56e-04, eta: 4:04:30	Time 2.989 (3.013)	Data 0.053 (0.063)	Mem 41.61GB	Prec@1 70.000 (80.812)	Loss 1.7165 (1.5302)
[02/28 15:23:54][INFO] train_vision.py:  668: Epoch: [16][280/367], lr: 1.56e-04, eta: 4:03:58	Time 2.983 (3.013)	Data 0.050 (0.063)	Mem 41.61GB	Prec@1 90.000 (80.676)	Loss 1.4281 (1.5357)
[02/28 15:24:24][INFO] train_vision.py:  668: Epoch: [16][290/367], lr: 1.55e-04, eta: 4:03:26	Time 2.972 (3.012)	Data 0.054 (0.062)	Mem 41.61GB	Prec@1 80.000 (80.722)	Loss 1.6544 (1.5359)
[02/28 15:24:54][INFO] train_vision.py:  668: Epoch: [16][300/367], lr: 1.55e-04, eta: 4:02:54	Time 2.991 (3.012)	Data 0.057 (0.062)	Mem 41.61GB	Prec@1 60.000 (80.664)	Loss 2.0477 (1.5362)
[02/28 15:25:24][INFO] train_vision.py:  668: Epoch: [16][310/367], lr: 1.54e-04, eta: 4:02:21	Time 3.000 (3.011)	Data 0.052 (0.062)	Mem 41.61GB	Prec@1 70.000 (80.836)	Loss 1.5579 (1.5329)
[02/28 15:25:54][INFO] train_vision.py:  668: Epoch: [16][320/367], lr: 1.54e-04, eta: 4:01:50	Time 2.999 (3.011)	Data 0.056 (0.061)	Mem 41.61GB	Prec@1 80.000 (80.903)	Loss 1.6797 (1.5297)
[02/28 15:26:24][INFO] train_vision.py:  668: Epoch: [16][330/367], lr: 1.53e-04, eta: 4:01:18	Time 3.003 (3.011)	Data 0.051 (0.061)	Mem 41.61GB	Prec@1 80.000 (80.997)	Loss 1.2960 (1.5287)
[02/28 15:26:54][INFO] train_vision.py:  668: Epoch: [16][340/367], lr: 1.53e-04, eta: 4:00:47	Time 2.998 (3.010)	Data 0.069 (0.061)	Mem 41.61GB	Prec@1 60.000 (80.909)	Loss 2.0509 (1.5321)
[02/28 15:27:24][INFO] train_vision.py:  668: Epoch: [16][350/367], lr: 1.52e-04, eta: 4:00:16	Time 3.003 (3.010)	Data 0.042 (0.061)	Mem 41.61GB	Prec@1 70.000 (80.997)	Loss 2.0497 (1.5289)
[02/28 15:27:54][INFO] train_vision.py:  668: Epoch: [16][360/367], lr: 1.52e-04, eta: 3:59:44	Time 2.990 (3.010)	Data 0.057 (0.060)	Mem 41.61GB	Prec@1 70.000 (81.108)	Loss 1.7846 (1.5253)
[02/28 15:28:17][INFO] train_vision.py:  668: Epoch: [17][0/367], lr: 1.51e-04, eta: 7:16:16	Time 5.485 (5.485)	Data 2.375 (2.375)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.2636 (1.2636)
[02/28 15:28:47][INFO] train_vision.py:  668: Epoch: [17][10/367], lr: 1.51e-04, eta: 4:16:43	Time 3.003 (3.235)	Data 0.063 (0.272)	Mem 41.61GB	Prec@1 60.000 (81.818)	Loss 2.0384 (1.4616)
[02/28 15:29:17][INFO] train_vision.py:  668: Epoch: [17][20/367], lr: 1.51e-04, eta: 4:07:49	Time 3.000 (3.129)	Data 0.060 (0.173)	Mem 41.61GB	Prec@1 90.000 (80.952)	Loss 1.1584 (1.5228)
[02/28 15:29:48][INFO] train_vision.py:  668: Epoch: [17][30/367], lr: 1.50e-04, eta: 4:04:09	Time 3.047 (3.089)	Data 0.074 (0.136)	Mem 41.61GB	Prec@1 100.000 (81.290)	Loss 1.0936 (1.4952)
[02/28 15:30:18][INFO] train_vision.py:  668: Epoch: [17][40/367], lr: 1.50e-04, eta: 4:02:06	Time 3.003 (3.070)	Data 0.054 (0.118)	Mem 41.61GB	Prec@1 80.000 (81.463)	Loss 1.5269 (1.5118)
[02/28 15:30:48][INFO] train_vision.py:  668: Epoch: [17][50/367], lr: 1.49e-04, eta: 4:00:25	Time 2.985 (3.055)	Data 0.041 (0.106)	Mem 41.61GB	Prec@1 70.000 (80.392)	Loss 1.9720 (1.5223)
[02/28 15:31:18][INFO] train_vision.py:  668: Epoch: [17][60/367], lr: 1.49e-04, eta: 3:59:14	Time 2.988 (3.046)	Data 0.061 (0.098)	Mem 41.61GB	Prec@1 80.000 (80.328)	Loss 1.7467 (1.5263)
[02/28 15:31:48][INFO] train_vision.py:  668: Epoch: [17][70/367], lr: 1.48e-04, eta: 3:58:12	Time 2.996 (3.040)	Data 0.051 (0.092)	Mem 41.61GB	Prec@1 70.000 (79.718)	Loss 1.5948 (1.5341)
[02/28 15:32:18][INFO] train_vision.py:  668: Epoch: [17][80/367], lr: 1.48e-04, eta: 3:57:16	Time 2.991 (3.034)	Data 0.045 (0.087)	Mem 41.61GB	Prec@1 80.000 (79.753)	Loss 1.3627 (1.5237)
[02/28 15:32:48][INFO] train_vision.py:  668: Epoch: [17][90/367], lr: 1.47e-04, eta: 3:56:29	Time 3.011 (3.031)	Data 0.058 (0.083)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.3011 (1.5202)
[02/28 15:33:18][INFO] train_vision.py:  668: Epoch: [17][100/367], lr: 1.47e-04, eta: 3:55:44	Time 2.999 (3.028)	Data 0.064 (0.081)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.4095 (1.5114)
[02/28 15:33:48][INFO] train_vision.py:  668: Epoch: [17][110/367], lr: 1.46e-04, eta: 3:55:04	Time 2.996 (3.025)	Data 0.056 (0.079)	Mem 41.61GB	Prec@1 90.000 (80.450)	Loss 1.3078 (1.5010)
[02/28 15:34:18][INFO] train_vision.py:  668: Epoch: [17][120/367], lr: 1.46e-04, eta: 3:54:23	Time 2.989 (3.023)	Data 0.058 (0.077)	Mem 41.61GB	Prec@1 70.000 (80.083)	Loss 1.5498 (1.5060)
[02/28 15:34:48][INFO] train_vision.py:  668: Epoch: [17][130/367], lr: 1.45e-04, eta: 3:53:44	Time 3.000 (3.021)	Data 0.058 (0.075)	Mem 41.61GB	Prec@1 80.000 (79.695)	Loss 1.3462 (1.5143)
[02/28 15:35:18][INFO] train_vision.py:  668: Epoch: [17][140/367], lr: 1.45e-04, eta: 3:53:06	Time 3.002 (3.020)	Data 0.049 (0.074)	Mem 41.61GB	Prec@1 70.000 (79.574)	Loss 1.6557 (1.5136)
[02/28 15:35:47][INFO] train_vision.py:  668: Epoch: [17][150/367], lr: 1.44e-04, eta: 3:52:29	Time 2.991 (3.018)	Data 0.048 (0.072)	Mem 41.61GB	Prec@1 60.000 (79.536)	Loss 1.6461 (1.5143)
[02/28 15:36:17][INFO] train_vision.py:  668: Epoch: [17][160/367], lr: 1.44e-04, eta: 3:51:53	Time 2.991 (3.017)	Data 0.051 (0.071)	Mem 41.61GB	Prec@1 40.000 (79.006)	Loss 2.0367 (1.5239)
[02/28 15:36:47][INFO] train_vision.py:  668: Epoch: [17][170/367], lr: 1.43e-04, eta: 3:51:17	Time 2.986 (3.015)	Data 0.033 (0.070)	Mem 41.61GB	Prec@1 70.000 (79.240)	Loss 1.7322 (1.5182)
[02/28 15:37:17][INFO] train_vision.py:  668: Epoch: [17][180/367], lr: 1.43e-04, eta: 3:50:43	Time 2.999 (3.015)	Data 0.041 (0.069)	Mem 41.61GB	Prec@1 100.000 (79.558)	Loss 1.0821 (1.5115)
[02/28 15:37:47][INFO] train_vision.py:  668: Epoch: [17][190/367], lr: 1.42e-04, eta: 3:50:10	Time 2.996 (3.014)	Data 0.052 (0.068)	Mem 41.61GB	Prec@1 70.000 (79.738)	Loss 2.0148 (1.5089)
[02/28 15:38:17][INFO] train_vision.py:  668: Epoch: [17][200/367], lr: 1.42e-04, eta: 3:49:35	Time 2.999 (3.013)	Data 0.053 (0.067)	Mem 41.61GB	Prec@1 70.000 (79.701)	Loss 1.7900 (1.5109)
[02/28 15:38:47][INFO] train_vision.py:  668: Epoch: [17][210/367], lr: 1.41e-04, eta: 3:49:01	Time 2.991 (3.012)	Data 0.052 (0.067)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.1875 (1.5057)
[02/28 15:39:17][INFO] train_vision.py:  668: Epoch: [17][220/367], lr: 1.41e-04, eta: 3:48:26	Time 2.982 (3.011)	Data 0.048 (0.066)	Mem 41.61GB	Prec@1 90.000 (80.045)	Loss 1.2905 (1.5021)
[02/28 15:39:47][INFO] train_vision.py:  668: Epoch: [17][230/367], lr: 1.40e-04, eta: 3:47:54	Time 3.025 (3.011)	Data 0.068 (0.066)	Mem 41.61GB	Prec@1 90.000 (80.303)	Loss 1.3329 (1.4944)
[02/28 15:40:17][INFO] train_vision.py:  668: Epoch: [17][240/367], lr: 1.40e-04, eta: 3:47:21	Time 2.988 (3.010)	Data 0.043 (0.065)	Mem 41.61GB	Prec@1 90.000 (80.332)	Loss 1.3959 (1.4952)
[02/28 15:40:47][INFO] train_vision.py:  668: Epoch: [17][250/367], lr: 1.39e-04, eta: 3:46:49	Time 2.997 (3.010)	Data 0.050 (0.064)	Mem 41.61GB	Prec@1 100.000 (80.518)	Loss 1.1323 (1.4905)
[02/28 15:41:17][INFO] train_vision.py:  668: Epoch: [17][260/367], lr: 1.39e-04, eta: 3:46:15	Time 2.971 (3.009)	Data 0.049 (0.064)	Mem 41.61GB	Prec@1 70.000 (80.651)	Loss 1.4760 (1.4875)
[02/28 15:41:47][INFO] train_vision.py:  668: Epoch: [17][270/367], lr: 1.38e-04, eta: 3:45:43	Time 2.981 (3.008)	Data 0.046 (0.063)	Mem 41.61GB	Prec@1 80.000 (80.849)	Loss 1.5244 (1.4851)
[02/28 15:42:17][INFO] train_vision.py:  668: Epoch: [17][280/367], lr: 1.38e-04, eta: 3:45:11	Time 2.971 (3.008)	Data 0.052 (0.063)	Mem 41.61GB	Prec@1 80.000 (80.819)	Loss 1.5667 (1.4864)
[02/28 15:42:47][INFO] train_vision.py:  668: Epoch: [17][290/367], lr: 1.37e-04, eta: 3:44:40	Time 2.987 (3.008)	Data 0.060 (0.062)	Mem 41.61GB	Prec@1 90.000 (80.481)	Loss 1.3092 (1.4932)
[02/28 15:43:17][INFO] train_vision.py:  668: Epoch: [17][300/367], lr: 1.37e-04, eta: 3:44:09	Time 3.022 (3.007)	Data 0.023 (0.062)	Mem 41.61GB	Prec@1 70.000 (80.532)	Loss 1.9285 (1.4949)
[02/28 15:43:47][INFO] train_vision.py:  668: Epoch: [17][310/367], lr: 1.36e-04, eta: 3:43:37	Time 3.000 (3.007)	Data 0.053 (0.062)	Mem 41.61GB	Prec@1 70.000 (80.161)	Loss 1.6076 (1.5013)
[02/28 15:44:17][INFO] train_vision.py:  668: Epoch: [17][320/367], lr: 1.36e-04, eta: 3:43:07	Time 3.024 (3.007)	Data 0.025 (0.061)	Mem 41.61GB	Prec@1 90.000 (80.125)	Loss 1.2234 (1.5038)
[02/28 15:44:47][INFO] train_vision.py:  668: Epoch: [17][330/367], lr: 1.35e-04, eta: 3:42:35	Time 3.013 (3.007)	Data 0.042 (0.061)	Mem 41.61GB	Prec@1 70.000 (80.121)	Loss 2.1243 (1.5034)
[02/28 15:45:17][INFO] train_vision.py:  668: Epoch: [17][340/367], lr: 1.35e-04, eta: 3:42:04	Time 3.000 (3.007)	Data 0.045 (0.061)	Mem 41.61GB	Prec@1 100.000 (79.912)	Loss 1.0957 (1.5067)
[02/28 15:45:47][INFO] train_vision.py:  668: Epoch: [17][350/367], lr: 1.34e-04, eta: 3:41:34	Time 2.978 (3.006)	Data 0.059 (0.060)	Mem 41.61GB	Prec@1 90.000 (80.085)	Loss 1.2531 (1.5048)
[02/28 15:46:17][INFO] train_vision.py:  668: Epoch: [17][360/367], lr: 1.34e-04, eta: 3:41:03	Time 2.991 (3.006)	Data 0.042 (0.060)	Mem 41.61GB	Prec@1 100.000 (80.360)	Loss 0.9770 (1.4993)
[02/28 15:46:41][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 97.500 (97.500)	Prec@5 100.000 (100.000)	mPrec@1 (11.042)	mPrec@5 (11.458)
[02/28 15:47:23][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 97.500 (95.341)	Prec@5 100.000 (99.773)	mPrec@1 (30.767)	mPrec@5 (32.602)
[02/28 15:48:06][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 100.000 (93.869)	Prec@5 100.000 (99.405)	mPrec@1 (40.335)	mPrec@5 (44.902)
[02/28 15:48:48][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 93.750 (93.750)	Prec@5 98.750 (99.395)	mPrec@1 (46.266)	mPrec@5 (54.084)
[02/28 15:49:30][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 90.000 (93.384)	Prec@5 98.750 (99.329)	mPrec@1 (50.498)	mPrec@5 (61.310)
[02/28 15:50:12][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 86.250 (92.181)	Prec@5 100.000 (99.167)	mPrec@1 (51.007)	mPrec@5 (64.324)
[02/28 15:50:55][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 95.000 (92.275)	Prec@5 100.000 (99.221)	mPrec@1 (52.854)	mPrec@5 (67.388)
[02/28 15:51:37][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 97.500 (92.694)	Prec@5 100.000 (99.313)	mPrec@1 (53.593)	mPrec@5 (69.048)
[02/28 15:52:19][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 93.750 (92.809)	Prec@5 100.000 (99.306)	mPrec@1 (56.448)	mPrec@5 (72.971)
[02/28 15:53:02][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 86.250 (92.679)	Prec@5 98.750 (99.299)	mPrec@1 (57.875)	mPrec@5 (76.547)
[02/28 15:53:44][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 97.500 (91.931)	Prec@5 100.000 (99.171)	mPrec@1 (59.920)	mPrec@5 (84.783)
[02/28 15:54:26][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 78.750 (92.016)	Prec@5 93.750 (99.133)	mPrec@1 (60.004)	mPrec@5 (85.345)
[02/28 15:55:07][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 58.333 (90.983)	Prec@5 83.333 (98.943)	mPrec@1 (61.730)	mPrec@5 (91.690)
[02/28 15:55:07][INFO] train_vision.py:  847: Overall Prec@1 90.983% Prec@5 98.943% mPrec@1 (61.730) mPrec@5 (91.690)
[02/28 15:55:07][INFO] train_vision.py:  464: Testing: 61.73037338256836/61.73037338256836
[02/28 15:55:07][INFO] train_vision.py:  465: Saving:
[02/28 15:55:27][INFO] train_vision.py:  668: Epoch: [18][0/367], lr: 1.34e-04, eta: 6:16:52	Time 5.133 (5.133)	Data 2.250 (2.250)	Mem 41.61GB	Prec@1 100.000 (100.000)	Loss 1.1987 (1.1987)
[02/28 15:55:57][INFO] train_vision.py:  668: Epoch: [18][10/367], lr: 1.33e-04, eta: 3:54:03	Time 2.993 (3.195)	Data 0.066 (0.264)	Mem 41.61GB	Prec@1 80.000 (88.182)	Loss 1.9007 (1.4434)
[02/28 15:56:27][INFO] train_vision.py:  668: Epoch: [18][20/367], lr: 1.33e-04, eta: 3:46:58	Time 3.031 (3.106)	Data 0.068 (0.170)	Mem 41.61GB	Prec@1 70.000 (85.238)	Loss 1.7940 (1.4504)
[02/28 15:56:57][INFO] train_vision.py:  668: Epoch: [18][30/367], lr: 1.32e-04, eta: 3:44:06	Time 3.003 (3.074)	Data 0.055 (0.134)	Mem 41.61GB	Prec@1 70.000 (84.839)	Loss 1.5706 (1.4240)
[02/28 15:57:27][INFO] train_vision.py:  668: Epoch: [18][40/367], lr: 1.32e-04, eta: 3:42:26	Time 3.009 (3.058)	Data 0.057 (0.115)	Mem 41.61GB	Prec@1 80.000 (84.634)	Loss 1.5523 (1.4575)
[02/28 15:57:57][INFO] train_vision.py:  668: Epoch: [18][50/367], lr: 1.31e-04, eta: 3:41:11	Time 2.984 (3.047)	Data 0.072 (0.104)	Mem 41.61GB	Prec@1 80.000 (83.922)	Loss 1.3860 (1.4602)
[02/28 15:58:27][INFO] train_vision.py:  668: Epoch: [18][60/367], lr: 1.31e-04, eta: 3:40:16	Time 2.999 (3.042)	Data 0.063 (0.098)	Mem 41.61GB	Prec@1 90.000 (82.951)	Loss 1.1302 (1.4822)
[02/28 15:58:57][INFO] train_vision.py:  668: Epoch: [18][70/367], lr: 1.30e-04, eta: 3:39:23	Time 3.003 (3.037)	Data 0.053 (0.092)	Mem 41.61GB	Prec@1 90.000 (82.817)	Loss 1.1172 (1.4775)
[02/28 15:59:27][INFO] train_vision.py:  668: Epoch: [18][80/367], lr: 1.30e-04, eta: 3:38:34	Time 3.022 (3.032)	Data 0.032 (0.086)	Mem 41.61GB	Prec@1 90.000 (82.840)	Loss 1.3194 (1.4721)
[02/28 15:59:57][INFO] train_vision.py:  668: Epoch: [18][90/367], lr: 1.29e-04, eta: 3:37:48	Time 2.970 (3.029)	Data 0.049 (0.083)	Mem 41.61GB	Prec@1 80.000 (82.637)	Loss 1.4245 (1.4707)
[02/28 16:00:27][INFO] train_vision.py:  668: Epoch: [18][100/367], lr: 1.29e-04, eta: 3:37:06	Time 2.998 (3.026)	Data 0.049 (0.079)	Mem 41.61GB	Prec@1 60.000 (82.673)	Loss 1.8774 (1.4701)
[02/28 16:00:57][INFO] train_vision.py:  668: Epoch: [18][110/367], lr: 1.28e-04, eta: 3:36:25	Time 2.992 (3.023)	Data 0.052 (0.076)	Mem 41.61GB	Prec@1 60.000 (82.252)	Loss 2.1936 (1.4845)
[02/28 16:01:27][INFO] train_vision.py:  668: Epoch: [18][120/367], lr: 1.28e-04, eta: 3:35:47	Time 3.015 (3.022)	Data 0.069 (0.075)	Mem 41.61GB	Prec@1 90.000 (82.479)	Loss 1.1640 (1.4768)
[02/28 16:01:57][INFO] train_vision.py:  668: Epoch: [18][130/367], lr: 1.27e-04, eta: 3:35:10	Time 2.991 (3.020)	Data 0.043 (0.073)	Mem 41.61GB	Prec@1 60.000 (82.290)	Loss 2.0619 (1.4879)
[02/28 16:02:27][INFO] train_vision.py:  668: Epoch: [18][140/367], lr: 1.27e-04, eta: 3:34:33	Time 2.976 (3.018)	Data 0.050 (0.071)	Mem 41.61GB	Prec@1 60.000 (82.128)	Loss 1.8913 (1.4950)
[02/28 16:02:57][INFO] train_vision.py:  668: Epoch: [18][150/367], lr: 1.26e-04, eta: 3:33:59	Time 2.997 (3.018)	Data 0.056 (0.070)	Mem 41.61GB	Prec@1 80.000 (82.450)	Loss 1.7267 (1.4872)
[02/28 16:03:27][INFO] train_vision.py:  668: Epoch: [18][160/367], lr: 1.26e-04, eta: 3:33:24	Time 3.008 (3.016)	Data 0.062 (0.069)	Mem 41.61GB	Prec@1 80.000 (82.671)	Loss 1.4400 (1.4898)
[02/28 16:03:57][INFO] train_vision.py:  668: Epoch: [18][170/367], lr: 1.25e-04, eta: 3:32:50	Time 3.006 (3.015)	Data 0.054 (0.068)	Mem 41.61GB	Prec@1 60.000 (82.632)	Loss 2.0616 (1.4890)
[02/28 16:04:27][INFO] train_vision.py:  668: Epoch: [18][180/367], lr: 1.25e-04, eta: 3:32:17	Time 3.010 (3.015)	Data 0.037 (0.067)	Mem 41.61GB	Prec@1 90.000 (82.707)	Loss 1.3722 (1.4854)
[02/28 16:04:57][INFO] train_vision.py:  668: Epoch: [18][190/367], lr: 1.24e-04, eta: 3:31:44	Time 3.001 (3.014)	Data 0.058 (0.067)	Mem 41.61GB	Prec@1 80.000 (82.827)	Loss 1.5905 (1.4819)
[02/28 16:05:27][INFO] train_vision.py:  668: Epoch: [18][200/367], lr: 1.24e-04, eta: 3:31:10	Time 3.004 (3.013)	Data 0.060 (0.066)	Mem 41.61GB	Prec@1 90.000 (82.537)	Loss 1.2133 (1.4871)
[02/28 16:05:57][INFO] train_vision.py:  668: Epoch: [18][210/367], lr: 1.24e-04, eta: 3:30:37	Time 2.996 (3.012)	Data 0.059 (0.065)	Mem 41.61GB	Prec@1 70.000 (82.464)	Loss 1.3994 (1.4848)
[02/28 16:06:27][INFO] train_vision.py:  668: Epoch: [18][220/367], lr: 1.23e-04, eta: 3:30:04	Time 3.006 (3.012)	Data 0.059 (0.064)	Mem 41.61GB	Prec@1 80.000 (82.579)	Loss 1.6160 (1.4840)
[02/28 16:06:57][INFO] train_vision.py:  668: Epoch: [18][230/367], lr: 1.23e-04, eta: 3:29:32	Time 3.000 (3.011)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 80.000 (82.554)	Loss 1.7093 (1.4868)
[02/28 16:07:27][INFO] train_vision.py:  668: Epoch: [18][240/367], lr: 1.22e-04, eta: 3:28:59	Time 2.998 (3.011)	Data 0.051 (0.063)	Mem 41.61GB	Prec@1 90.000 (82.614)	Loss 1.3683 (1.4847)
[02/28 16:07:57][INFO] train_vision.py:  668: Epoch: [18][250/367], lr: 1.22e-04, eta: 3:28:28	Time 3.002 (3.010)	Data 0.057 (0.063)	Mem 41.61GB	Prec@1 90.000 (82.669)	Loss 1.5911 (1.4836)
[02/28 16:08:27][INFO] train_vision.py:  668: Epoch: [18][260/367], lr: 1.21e-04, eta: 3:27:56	Time 3.000 (3.010)	Data 0.059 (0.062)	Mem 41.61GB	Prec@1 70.000 (82.644)	Loss 1.8048 (1.4827)
[02/28 16:08:57][INFO] train_vision.py:  668: Epoch: [18][270/367], lr: 1.21e-04, eta: 3:27:24	Time 3.003 (3.010)	Data 0.064 (0.061)	Mem 41.61GB	Prec@1 70.000 (82.620)	Loss 1.6856 (1.4848)
[02/28 16:09:27][INFO] train_vision.py:  668: Epoch: [18][280/367], lr: 1.20e-04, eta: 3:26:53	Time 3.015 (3.009)	Data 0.068 (0.061)	Mem 41.61GB	Prec@1 80.000 (82.527)	Loss 1.2756 (1.4827)
[02/28 16:09:57][INFO] train_vision.py:  668: Epoch: [18][290/367], lr: 1.20e-04, eta: 3:26:22	Time 3.002 (3.009)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 50.000 (82.337)	Loss 2.3516 (1.4860)
[02/28 16:10:27][INFO] train_vision.py:  668: Epoch: [18][300/367], lr: 1.19e-04, eta: 3:25:51	Time 3.003 (3.009)	Data 0.056 (0.061)	Mem 41.61GB	Prec@1 80.000 (82.425)	Loss 1.6455 (1.4820)
[02/28 16:10:57][INFO] train_vision.py:  668: Epoch: [18][310/367], lr: 1.19e-04, eta: 3:25:19	Time 2.991 (3.008)	Data 0.048 (0.060)	Mem 41.61GB	Prec@1 100.000 (82.540)	Loss 1.0369 (1.4793)
[02/28 16:11:27][INFO] train_vision.py:  668: Epoch: [18][320/367], lr: 1.18e-04, eta: 3:24:48	Time 2.992 (3.008)	Data 0.060 (0.060)	Mem 41.61GB	Prec@1 90.000 (82.430)	Loss 1.2456 (1.4822)
[02/28 16:11:57][INFO] train_vision.py:  668: Epoch: [18][330/367], lr: 1.18e-04, eta: 3:24:18	Time 2.999 (3.008)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 70.000 (82.236)	Loss 1.7955 (1.4840)
[02/28 16:12:27][INFO] train_vision.py:  668: Epoch: [18][340/367], lr: 1.17e-04, eta: 3:23:46	Time 3.015 (3.008)	Data 0.034 (0.059)	Mem 41.61GB	Prec@1 70.000 (82.287)	Loss 1.6687 (1.4827)
[02/28 16:12:57][INFO] train_vision.py:  668: Epoch: [18][350/367], lr: 1.17e-04, eta: 3:23:15	Time 2.995 (3.008)	Data 0.052 (0.059)	Mem 41.61GB	Prec@1 70.000 (82.194)	Loss 1.5126 (1.4829)
[02/28 16:13:27][INFO] train_vision.py:  668: Epoch: [18][360/367], lr: 1.16e-04, eta: 3:22:44	Time 3.003 (3.007)	Data 0.057 (0.059)	Mem 41.61GB	Prec@1 50.000 (82.244)	Loss 1.8091 (1.4806)
[02/28 16:13:50][INFO] train_vision.py:  668: Epoch: [19][0/367], lr: 1.16e-04, eta: 6:15:25	Time 5.578 (5.578)	Data 2.495 (2.495)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.4633 (1.4633)
[02/28 16:14:20][INFO] train_vision.py:  668: Epoch: [19][10/367], lr: 1.16e-04, eta: 3:37:14	Time 2.997 (3.236)	Data 0.055 (0.280)	Mem 41.61GB	Prec@1 80.000 (81.818)	Loss 1.6641 (1.4847)
[02/28 16:14:51][INFO] train_vision.py:  668: Epoch: [19][20/367], lr: 1.15e-04, eta: 3:29:51	Time 3.029 (3.134)	Data 0.087 (0.182)	Mem 41.61GB	Prec@1 100.000 (84.286)	Loss 1.0651 (1.4619)
[02/28 16:15:21][INFO] train_vision.py:  668: Epoch: [19][30/367], lr: 1.15e-04, eta: 3:26:56	Time 3.010 (3.098)	Data 0.055 (0.144)	Mem 41.61GB	Prec@1 80.000 (84.194)	Loss 1.6864 (1.4693)
[02/28 16:15:51][INFO] train_vision.py:  668: Epoch: [19][40/367], lr: 1.14e-04, eta: 3:25:08	Time 2.999 (3.079)	Data 0.075 (0.125)	Mem 41.61GB	Prec@1 100.000 (83.902)	Loss 1.1194 (1.4691)
[02/28 16:16:21][INFO] train_vision.py:  668: Epoch: [19][50/367], lr: 1.14e-04, eta: 3:23:55	Time 3.052 (3.068)	Data 0.024 (0.113)	Mem 41.61GB	Prec@1 100.000 (83.529)	Loss 1.0970 (1.4809)
[02/28 16:16:52][INFO] train_vision.py:  668: Epoch: [19][60/367], lr: 1.13e-04, eta: 3:22:58	Time 3.023 (3.061)	Data 0.051 (0.105)	Mem 41.61GB	Prec@1 80.000 (82.459)	Loss 1.8728 (1.5034)
[02/28 16:17:22][INFO] train_vision.py:  668: Epoch: [19][70/367], lr: 1.13e-04, eta: 3:22:05	Time 2.999 (3.056)	Data 0.052 (0.100)	Mem 41.61GB	Prec@1 100.000 (82.535)	Loss 1.0274 (1.4903)
[02/28 16:17:52][INFO] train_vision.py:  668: Epoch: [19][80/367], lr: 1.12e-04, eta: 3:21:20	Time 3.036 (3.052)	Data 0.089 (0.097)	Mem 41.61GB	Prec@1 40.000 (81.481)	Loss 2.0219 (1.4996)
[02/28 16:18:22][INFO] train_vision.py:  668: Epoch: [19][90/367], lr: 1.12e-04, eta: 3:20:32	Time 2.983 (3.048)	Data 0.052 (0.092)	Mem 41.61GB	Prec@1 80.000 (81.538)	Loss 1.7138 (1.5031)
[02/28 16:18:52][INFO] train_vision.py:  668: Epoch: [19][100/367], lr: 1.11e-04, eta: 3:19:48	Time 3.034 (3.044)	Data 0.067 (0.090)	Mem 41.61GB	Prec@1 100.000 (82.376)	Loss 1.0780 (1.4763)
[02/28 16:19:22][INFO] train_vision.py:  668: Epoch: [19][110/367], lr: 1.11e-04, eta: 3:19:06	Time 3.027 (3.041)	Data 0.079 (0.088)	Mem 41.61GB	Prec@1 100.000 (82.252)	Loss 1.0475 (1.4764)
[02/28 16:19:52][INFO] train_vision.py:  668: Epoch: [19][120/367], lr: 1.10e-04, eta: 3:18:22	Time 3.012 (3.038)	Data 0.036 (0.085)	Mem 41.61GB	Prec@1 90.000 (82.562)	Loss 1.2762 (1.4671)
[02/28 16:20:22][INFO] train_vision.py:  668: Epoch: [19][130/367], lr: 1.10e-04, eta: 3:17:42	Time 3.000 (3.035)	Data 0.058 (0.083)	Mem 41.61GB	Prec@1 80.000 (82.443)	Loss 1.3897 (1.4703)
[02/28 16:20:52][INFO] train_vision.py:  668: Epoch: [19][140/367], lr: 1.09e-04, eta: 3:17:02	Time 3.008 (3.033)	Data 0.059 (0.081)	Mem 41.61GB	Prec@1 80.000 (82.624)	Loss 1.6241 (1.4611)
[02/28 16:21:22][INFO] train_vision.py:  668: Epoch: [19][150/367], lr: 1.09e-04, eta: 3:16:25	Time 2.988 (3.031)	Data 0.053 (0.079)	Mem 41.61GB	Prec@1 60.000 (82.980)	Loss 2.4168 (1.4594)
[02/28 16:21:53][INFO] train_vision.py:  668: Epoch: [19][160/367], lr: 1.08e-04, eta: 3:15:48	Time 3.040 (3.030)	Data 0.066 (0.078)	Mem 41.61GB	Prec@1 100.000 (83.416)	Loss 1.1557 (1.4460)
[02/28 16:22:23][INFO] train_vision.py:  668: Epoch: [19][170/367], lr: 1.08e-04, eta: 3:15:12	Time 3.002 (3.028)	Data 0.037 (0.077)	Mem 41.61GB	Prec@1 80.000 (83.158)	Loss 1.2897 (1.4519)
[02/28 16:22:53][INFO] train_vision.py:  668: Epoch: [19][180/367], lr: 1.08e-04, eta: 3:14:38	Time 3.042 (3.027)	Data 0.068 (0.076)	Mem 41.61GB	Prec@1 100.000 (83.702)	Loss 1.2407 (1.4430)
[02/28 16:23:23][INFO] train_vision.py:  668: Epoch: [19][190/367], lr: 1.07e-04, eta: 3:14:04	Time 3.003 (3.026)	Data 0.060 (0.075)	Mem 41.61GB	Prec@1 100.000 (83.665)	Loss 1.0049 (1.4451)
[02/28 16:23:53][INFO] train_vision.py:  668: Epoch: [19][200/367], lr: 1.07e-04, eta: 3:13:31	Time 3.041 (3.026)	Data 0.061 (0.074)	Mem 41.61GB	Prec@1 100.000 (83.383)	Loss 1.0254 (1.4483)
[02/28 16:24:23][INFO] train_vision.py:  668: Epoch: [19][210/367], lr: 1.06e-04, eta: 3:12:57	Time 2.993 (3.024)	Data 0.052 (0.073)	Mem 41.61GB	Prec@1 70.000 (83.412)	Loss 1.8937 (1.4494)
[02/28 16:24:53][INFO] train_vision.py:  668: Epoch: [19][220/367], lr: 1.06e-04, eta: 3:12:24	Time 3.007 (3.024)	Data 0.054 (0.072)	Mem 41.61GB	Prec@1 70.000 (83.303)	Loss 1.4025 (1.4490)
[02/28 16:25:23][INFO] train_vision.py:  668: Epoch: [19][230/367], lr: 1.05e-04, eta: 3:11:51	Time 3.040 (3.023)	Data 0.020 (0.071)	Mem 41.61GB	Prec@1 90.000 (83.030)	Loss 1.5444 (1.4531)
[02/28 16:25:53][INFO] train_vision.py:  668: Epoch: [19][240/367], lr: 1.05e-04, eta: 3:11:17	Time 3.005 (3.022)	Data 0.051 (0.070)	Mem 41.61GB	Prec@1 90.000 (83.071)	Loss 1.2478 (1.4532)
[02/28 16:26:23][INFO] train_vision.py:  668: Epoch: [19][250/367], lr: 1.04e-04, eta: 3:10:43	Time 3.006 (3.021)	Data 0.050 (0.070)	Mem 41.61GB	Prec@1 70.000 (83.028)	Loss 1.6934 (1.4505)
[02/28 16:26:53][INFO] train_vision.py:  668: Epoch: [19][260/367], lr: 1.04e-04, eta: 3:10:10	Time 3.007 (3.020)	Data 0.058 (0.069)	Mem 41.61GB	Prec@1 80.000 (83.065)	Loss 1.5977 (1.4517)
[02/28 16:27:23][INFO] train_vision.py:  668: Epoch: [19][270/367], lr: 1.03e-04, eta: 3:09:37	Time 3.022 (3.019)	Data 0.055 (0.069)	Mem 41.61GB	Prec@1 80.000 (82.915)	Loss 1.7413 (1.4539)
[02/28 16:27:53][INFO] train_vision.py:  668: Epoch: [19][280/367], lr: 1.03e-04, eta: 3:09:04	Time 3.008 (3.019)	Data 0.050 (0.068)	Mem 41.61GB	Prec@1 90.000 (82.918)	Loss 1.3675 (1.4523)
[02/28 16:28:23][INFO] train_vision.py:  668: Epoch: [19][290/367], lr: 1.02e-04, eta: 3:08:31	Time 2.990 (3.018)	Data 0.051 (0.067)	Mem 41.61GB	Prec@1 100.000 (82.955)	Loss 1.0252 (1.4503)
[02/28 16:28:53][INFO] train_vision.py:  668: Epoch: [19][300/367], lr: 1.02e-04, eta: 3:07:59	Time 3.000 (3.018)	Data 0.086 (0.067)	Mem 41.61GB	Prec@1 90.000 (82.890)	Loss 1.0887 (1.4534)
[02/28 16:29:23][INFO] train_vision.py:  668: Epoch: [19][310/367], lr: 1.02e-04, eta: 3:07:28	Time 2.992 (3.017)	Data 0.060 (0.067)	Mem 41.61GB	Prec@1 60.000 (82.765)	Loss 1.9441 (1.4589)
[02/28 16:29:53][INFO] train_vision.py:  668: Epoch: [19][320/367], lr: 1.01e-04, eta: 3:06:56	Time 3.008 (3.017)	Data 0.058 (0.066)	Mem 41.61GB	Prec@1 70.000 (82.741)	Loss 1.9557 (1.4577)
[02/28 16:30:23][INFO] train_vision.py:  668: Epoch: [19][330/367], lr: 1.01e-04, eta: 3:06:24	Time 2.985 (3.016)	Data 0.053 (0.066)	Mem 41.61GB	Prec@1 80.000 (82.749)	Loss 1.6916 (1.4598)
[02/28 16:30:53][INFO] train_vision.py:  668: Epoch: [19][340/367], lr: 1.00e-04, eta: 3:05:53	Time 3.004 (3.016)	Data 0.067 (0.066)	Mem 41.61GB	Prec@1 80.000 (82.815)	Loss 1.4130 (1.4585)
[02/28 16:31:23][INFO] train_vision.py:  668: Epoch: [19][350/367], lr: 9.97e-05, eta: 3:05:22	Time 2.979 (3.016)	Data 0.068 (0.065)	Mem 41.61GB	Prec@1 80.000 (82.707)	Loss 1.5545 (1.4606)
[02/28 16:31:53][INFO] train_vision.py:  668: Epoch: [19][360/367], lr: 9.92e-05, eta: 3:04:51	Time 3.029 (3.016)	Data 0.023 (0.065)	Mem 41.61GB	Prec@1 70.000 (82.604)	Loss 1.7513 (1.4629)
[02/28 16:32:17][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 98.750 (98.750)	Prec@5 98.750 (98.750)	mPrec@1 (11.389)	mPrec@5 (11.389)
[02/28 16:32:59][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 96.250 (96.136)	Prec@5 100.000 (99.773)	mPrec@1 (30.275)	mPrec@5 (32.614)
[02/28 16:33:42][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 96.250 (94.107)	Prec@5 100.000 (99.583)	mPrec@1 (39.890)	mPrec@5 (45.301)
[02/28 16:34:24][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 93.750 (93.952)	Prec@5 98.750 (99.556)	mPrec@1 (45.858)	mPrec@5 (54.217)
[02/28 16:35:06][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 92.500 (93.994)	Prec@5 100.000 (99.512)	mPrec@1 (51.228)	mPrec@5 (61.483)
[02/28 16:35:48][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 91.250 (93.260)	Prec@5 100.000 (99.412)	mPrec@1 (52.216)	mPrec@5 (64.661)
[02/28 16:36:30][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 96.250 (93.443)	Prec@5 100.000 (99.467)	mPrec@1 (54.564)	mPrec@5 (67.789)
[02/28 16:37:12][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 93.750 (93.627)	Prec@5 100.000 (99.507)	mPrec@1 (55.316)	mPrec@5 (68.848)
[02/28 16:37:54][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 98.750 (93.843)	Prec@5 100.000 (99.491)	mPrec@1 (58.563)	mPrec@5 (73.045)
[02/28 16:38:37][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 88.750 (93.626)	Prec@5 100.000 (99.505)	mPrec@1 (60.559)	mPrec@5 (76.643)
[02/28 16:39:19][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 93.750 (92.921)	Prec@5 100.000 (99.443)	mPrec@1 (64.042)	mPrec@5 (85.584)
[02/28 16:40:01][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 86.250 (92.973)	Prec@5 98.750 (99.437)	mPrec@1 (64.132)	mPrec@5 (86.163)
[02/28 16:40:41][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 62.500 (91.947)	Prec@5 83.333 (99.326)	mPrec@1 (65.810)	mPrec@5 (93.188)
[02/28 16:40:42][INFO] train_vision.py:  847: Overall Prec@1 91.947% Prec@5 99.326% mPrec@1 (65.810) mPrec@5 (93.188)
[02/28 16:40:42][INFO] train_vision.py:  464: Testing: 65.81024932861328/65.81024932861328
[02/28 16:40:42][INFO] train_vision.py:  465: Saving:
[02/28 16:41:01][INFO] train_vision.py:  668: Epoch: [20][0/367], lr: 9.88e-05, eta: 5:12:33	Time 5.108 (5.108)	Data 2.247 (2.247)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.6219 (1.6219)
[02/28 16:41:31][INFO] train_vision.py:  668: Epoch: [20][10/367], lr: 9.84e-05, eta: 3:14:13	Time 3.032 (3.183)	Data 0.066 (0.252)	Mem 41.61GB	Prec@1 100.000 (83.636)	Loss 1.2616 (1.5414)
[02/28 16:42:01][INFO] train_vision.py:  668: Epoch: [20][20/367], lr: 9.80e-05, eta: 3:08:21	Time 3.001 (3.095)	Data 0.069 (0.156)	Mem 41.61GB	Prec@1 90.000 (81.429)	Loss 1.1282 (1.4814)
[02/28 16:42:31][INFO] train_vision.py:  668: Epoch: [20][30/367], lr: 9.75e-05, eta: 3:06:19	Time 3.021 (3.071)	Data 0.055 (0.125)	Mem 41.61GB	Prec@1 80.000 (81.935)	Loss 1.3251 (1.4868)
[02/28 16:43:01][INFO] train_vision.py:  668: Epoch: [20][40/367], lr: 9.71e-05, eta: 3:04:58	Time 2.999 (3.057)	Data 0.063 (0.107)	Mem 41.61GB	Prec@1 80.000 (81.707)	Loss 1.4482 (1.5129)
[02/28 16:43:31][INFO] train_vision.py:  668: Epoch: [20][50/367], lr: 9.66e-05, eta: 3:03:53	Time 2.977 (3.047)	Data 0.054 (0.096)	Mem 41.61GB	Prec@1 100.000 (82.745)	Loss 1.0698 (1.4882)
[02/28 16:44:01][INFO] train_vision.py:  668: Epoch: [20][60/367], lr: 9.62e-05, eta: 3:03:01	Time 3.006 (3.041)	Data 0.060 (0.089)	Mem 41.61GB	Prec@1 70.000 (82.459)	Loss 1.6418 (1.4731)
[02/28 16:44:31][INFO] train_vision.py:  668: Epoch: [20][70/367], lr: 9.57e-05, eta: 3:02:12	Time 3.032 (3.036)	Data 0.056 (0.083)	Mem 41.61GB	Prec@1 80.000 (82.958)	Loss 1.3865 (1.4532)
[02/28 16:45:01][INFO] train_vision.py:  668: Epoch: [20][80/367], lr: 9.52e-05, eta: 3:01:29	Time 3.005 (3.032)	Data 0.059 (0.080)	Mem 41.61GB	Prec@1 90.000 (82.840)	Loss 1.0947 (1.4590)
[02/28 16:45:31][INFO] train_vision.py:  668: Epoch: [20][90/367], lr: 9.48e-05, eta: 3:00:48	Time 3.033 (3.029)	Data 0.022 (0.077)	Mem 41.61GB	Prec@1 60.000 (82.747)	Loss 1.6137 (1.4509)
[02/28 16:46:01][INFO] train_vision.py:  668: Epoch: [20][100/367], lr: 9.43e-05, eta: 3:00:10	Time 3.012 (3.027)	Data 0.066 (0.075)	Mem 41.61GB	Prec@1 70.000 (82.673)	Loss 2.2929 (1.4475)
[02/28 16:46:31][INFO] train_vision.py:  668: Epoch: [20][110/367], lr: 9.39e-05, eta: 2:59:33	Time 2.996 (3.025)	Data 0.043 (0.073)	Mem 41.61GB	Prec@1 80.000 (82.793)	Loss 1.6025 (1.4435)
[02/28 16:47:01][INFO] train_vision.py:  668: Epoch: [20][120/367], lr: 9.34e-05, eta: 2:58:55	Time 2.996 (3.023)	Data 0.051 (0.070)	Mem 41.61GB	Prec@1 90.000 (83.140)	Loss 1.2611 (1.4313)
[02/28 16:47:32][INFO] train_vision.py:  668: Epoch: [20][130/367], lr: 9.30e-05, eta: 2:58:21	Time 3.043 (3.022)	Data 0.047 (0.069)	Mem 41.61GB	Prec@1 80.000 (82.748)	Loss 1.4476 (1.4431)
[02/28 16:48:02][INFO] train_vision.py:  668: Epoch: [20][140/367], lr: 9.25e-05, eta: 2:57:47	Time 3.011 (3.021)	Data 0.065 (0.068)	Mem 41.61GB	Prec@1 80.000 (82.624)	Loss 1.3270 (1.4516)
[02/28 16:48:32][INFO] train_vision.py:  668: Epoch: [20][150/367], lr: 9.21e-05, eta: 2:57:12	Time 2.997 (3.020)	Data 0.043 (0.067)	Mem 41.61GB	Prec@1 100.000 (82.848)	Loss 1.1913 (1.4491)
[02/28 16:49:02][INFO] train_vision.py:  668: Epoch: [20][160/367], lr: 9.16e-05, eta: 2:56:36	Time 2.971 (3.018)	Data 0.058 (0.066)	Mem 41.61GB	Prec@1 90.000 (82.609)	Loss 1.0650 (1.4497)
[02/28 16:49:32][INFO] train_vision.py:  668: Epoch: [20][170/367], lr: 9.12e-05, eta: 2:56:04	Time 2.996 (3.018)	Data 0.049 (0.065)	Mem 41.61GB	Prec@1 80.000 (82.807)	Loss 1.3435 (1.4417)
[02/28 16:50:02][INFO] train_vision.py:  668: Epoch: [20][180/367], lr: 9.08e-05, eta: 2:55:31	Time 3.013 (3.017)	Data 0.050 (0.064)	Mem 41.61GB	Prec@1 80.000 (82.928)	Loss 1.5885 (1.4387)
[02/28 16:50:32][INFO] train_vision.py:  668: Epoch: [20][190/367], lr: 9.03e-05, eta: 2:54:56	Time 3.007 (3.015)	Data 0.051 (0.063)	Mem 41.61GB	Prec@1 100.000 (83.246)	Loss 1.1314 (1.4323)
[02/28 16:51:02][INFO] train_vision.py:  668: Epoch: [20][200/367], lr: 8.99e-05, eta: 2:54:24	Time 3.006 (3.015)	Data 0.059 (0.063)	Mem 41.61GB	Prec@1 80.000 (83.333)	Loss 1.3916 (1.4303)
[02/28 16:51:32][INFO] train_vision.py:  668: Epoch: [20][210/367], lr: 8.94e-05, eta: 2:53:51	Time 3.000 (3.014)	Data 0.040 (0.062)	Mem 41.61GB	Prec@1 80.000 (83.270)	Loss 1.3876 (1.4303)
[02/28 16:52:02][INFO] train_vision.py:  668: Epoch: [20][220/367], lr: 8.90e-05, eta: 2:53:19	Time 3.003 (3.013)	Data 0.046 (0.061)	Mem 41.61GB	Prec@1 90.000 (83.439)	Loss 1.2522 (1.4246)
[02/28 16:52:32][INFO] train_vision.py:  668: Epoch: [20][230/367], lr: 8.85e-05, eta: 2:52:47	Time 2.966 (3.013)	Data 0.047 (0.061)	Mem 41.61GB	Prec@1 90.000 (83.420)	Loss 1.3256 (1.4221)
[02/28 16:53:02][INFO] train_vision.py:  668: Epoch: [20][240/367], lr: 8.81e-05, eta: 2:52:15	Time 3.009 (3.012)	Data 0.065 (0.060)	Mem 41.61GB	Prec@1 80.000 (83.568)	Loss 1.2409 (1.4163)
[02/28 16:53:32][INFO] train_vision.py:  668: Epoch: [20][250/367], lr: 8.76e-05, eta: 2:51:42	Time 3.003 (3.012)	Data 0.048 (0.060)	Mem 41.61GB	Prec@1 80.000 (83.466)	Loss 1.4508 (1.4226)
[02/28 16:54:02][INFO] train_vision.py:  668: Epoch: [20][260/367], lr: 8.72e-05, eta: 2:51:11	Time 3.005 (3.011)	Data 0.058 (0.060)	Mem 41.61GB	Prec@1 80.000 (83.487)	Loss 1.2988 (1.4206)
[02/28 16:54:32][INFO] train_vision.py:  668: Epoch: [20][270/367], lr: 8.68e-05, eta: 2:50:39	Time 2.994 (3.011)	Data 0.042 (0.059)	Mem 41.61GB	Prec@1 100.000 (83.506)	Loss 1.0191 (1.4186)
[02/28 16:55:01][INFO] train_vision.py:  668: Epoch: [20][280/367], lr: 8.63e-05, eta: 2:50:07	Time 3.010 (3.010)	Data 0.072 (0.059)	Mem 41.61GB	Prec@1 80.000 (83.630)	Loss 1.5318 (1.4172)
[02/28 16:55:31][INFO] train_vision.py:  668: Epoch: [20][290/367], lr: 8.59e-05, eta: 2:49:36	Time 3.002 (3.010)	Data 0.048 (0.059)	Mem 41.61GB	Prec@1 80.000 (83.540)	Loss 1.4357 (1.4201)
[02/28 16:56:01][INFO] train_vision.py:  668: Epoch: [20][300/367], lr: 8.54e-05, eta: 2:49:05	Time 3.004 (3.009)	Data 0.064 (0.059)	Mem 41.61GB	Prec@1 80.000 (83.422)	Loss 1.6424 (1.4202)
[02/28 16:56:31][INFO] train_vision.py:  668: Epoch: [20][310/367], lr: 8.50e-05, eta: 2:48:33	Time 2.965 (3.009)	Data 0.052 (0.058)	Mem 41.61GB	Prec@1 90.000 (83.441)	Loss 1.3722 (1.4206)
[02/28 16:57:01][INFO] train_vision.py:  668: Epoch: [20][320/367], lr: 8.46e-05, eta: 2:48:02	Time 3.007 (3.009)	Data 0.063 (0.058)	Mem 41.61GB	Prec@1 100.000 (83.333)	Loss 0.9408 (1.4274)
[02/28 16:57:31][INFO] train_vision.py:  668: Epoch: [20][330/367], lr: 8.41e-05, eta: 2:47:31	Time 2.999 (3.009)	Data 0.051 (0.058)	Mem 41.61GB	Prec@1 80.000 (83.263)	Loss 1.6432 (1.4314)
[02/28 16:58:01][INFO] train_vision.py:  668: Epoch: [20][340/367], lr: 8.37e-05, eta: 2:47:00	Time 2.973 (3.008)	Data 0.061 (0.058)	Mem 41.61GB	Prec@1 90.000 (83.196)	Loss 1.0936 (1.4306)
[02/28 16:58:31][INFO] train_vision.py:  668: Epoch: [20][350/367], lr: 8.33e-05, eta: 2:46:29	Time 2.992 (3.008)	Data 0.048 (0.058)	Mem 41.61GB	Prec@1 90.000 (83.191)	Loss 1.2832 (1.4345)
[02/28 16:59:01][INFO] train_vision.py:  668: Epoch: [20][360/367], lr: 8.28e-05, eta: 2:45:58	Time 3.005 (3.008)	Data 0.063 (0.057)	Mem 41.61GB	Prec@1 100.000 (83.296)	Loss 1.2137 (1.4326)
[02/28 16:59:24][INFO] train_vision.py:  668: Epoch: [21][0/367], lr: 8.25e-05, eta: 5:08:42	Time 5.606 (5.606)	Data 2.476 (2.476)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.3646 (1.3646)
[02/28 16:59:54][INFO] train_vision.py:  668: Epoch: [21][10/367], lr: 8.21e-05, eta: 2:57:53	Time 3.006 (3.240)	Data 0.064 (0.283)	Mem 41.61GB	Prec@1 80.000 (86.364)	Loss 1.4384 (1.3582)
[02/28 17:00:24][INFO] train_vision.py:  668: Epoch: [21][20/367], lr: 8.17e-05, eta: 2:51:09	Time 2.994 (3.127)	Data 0.064 (0.180)	Mem 41.61GB	Prec@1 80.000 (84.762)	Loss 1.3815 (1.4130)
[02/28 17:00:54][INFO] train_vision.py:  668: Epoch: [21][30/367], lr: 8.12e-05, eta: 2:48:26	Time 3.000 (3.087)	Data 0.060 (0.141)	Mem 41.61GB	Prec@1 70.000 (82.903)	Loss 1.5533 (1.4326)
[02/28 17:01:24][INFO] train_vision.py:  668: Epoch: [21][40/367], lr: 8.08e-05, eta: 2:46:49	Time 3.021 (3.067)	Data 0.077 (0.122)	Mem 41.61GB	Prec@1 100.000 (81.951)	Loss 1.1495 (1.4641)
[02/28 17:01:55][INFO] train_vision.py:  668: Epoch: [21][50/367], lr: 8.04e-05, eta: 2:45:39	Time 3.011 (3.054)	Data 0.087 (0.111)	Mem 41.61GB	Prec@1 90.000 (82.745)	Loss 1.2544 (1.4450)
[02/28 17:02:25][INFO] train_vision.py:  668: Epoch: [21][60/367], lr: 7.99e-05, eta: 2:44:41	Time 3.028 (3.046)	Data 0.089 (0.103)	Mem 41.61GB	Prec@1 70.000 (81.967)	Loss 1.6980 (1.4780)
[02/28 17:02:55][INFO] train_vision.py:  668: Epoch: [21][70/367], lr: 7.95e-05, eta: 2:43:50	Time 2.993 (3.040)	Data 0.054 (0.097)	Mem 41.61GB	Prec@1 90.000 (83.099)	Loss 1.5457 (1.4578)
[02/28 17:03:25][INFO] train_vision.py:  668: Epoch: [21][80/367], lr: 7.91e-05, eta: 2:43:05	Time 2.981 (3.035)	Data 0.080 (0.093)	Mem 41.61GB	Prec@1 80.000 (82.099)	Loss 1.4131 (1.4886)
[02/28 17:03:55][INFO] train_vision.py:  668: Epoch: [21][90/367], lr: 7.87e-05, eta: 2:42:24	Time 3.003 (3.032)	Data 0.053 (0.089)	Mem 41.61GB	Prec@1 70.000 (82.198)	Loss 1.7006 (1.4836)
[02/28 17:04:25][INFO] train_vision.py:  668: Epoch: [21][100/367], lr: 7.82e-05, eta: 2:41:43	Time 3.038 (3.029)	Data 0.081 (0.086)	Mem 41.61GB	Prec@1 90.000 (82.178)	Loss 1.3760 (1.4944)
[02/28 17:04:55][INFO] train_vision.py:  668: Epoch: [21][110/367], lr: 7.78e-05, eta: 2:41:04	Time 2.994 (3.026)	Data 0.020 (0.082)	Mem 41.61GB	Prec@1 70.000 (82.703)	Loss 1.4052 (1.4727)
[02/28 17:05:25][INFO] train_vision.py:  668: Epoch: [21][120/367], lr: 7.74e-05, eta: 2:40:28	Time 3.010 (3.024)	Data 0.074 (0.080)	Mem 41.61GB	Prec@1 90.000 (83.058)	Loss 1.1623 (1.4559)
[02/28 17:05:55][INFO] train_vision.py:  668: Epoch: [21][130/367], lr: 7.70e-05, eta: 2:39:51	Time 3.005 (3.022)	Data 0.052 (0.078)	Mem 41.61GB	Prec@1 80.000 (83.435)	Loss 1.6298 (1.4423)
[02/28 17:06:25][INFO] train_vision.py:  668: Epoch: [21][140/367], lr: 7.65e-05, eta: 2:39:15	Time 3.017 (3.020)	Data 0.024 (0.076)	Mem 41.61GB	Prec@1 80.000 (83.617)	Loss 1.3439 (1.4331)
[02/28 17:06:55][INFO] train_vision.py:  668: Epoch: [21][150/367], lr: 7.61e-05, eta: 2:38:42	Time 3.010 (3.019)	Data 0.055 (0.075)	Mem 41.61GB	Prec@1 50.000 (83.046)	Loss 1.5967 (1.4416)
[02/28 17:07:25][INFO] train_vision.py:  668: Epoch: [21][160/367], lr: 7.57e-05, eta: 2:38:09	Time 3.003 (3.018)	Data 0.053 (0.074)	Mem 41.61GB	Prec@1 90.000 (82.857)	Loss 1.3326 (1.4473)
[02/28 17:07:55][INFO] train_vision.py:  668: Epoch: [21][170/367], lr: 7.53e-05, eta: 2:37:36	Time 3.006 (3.017)	Data 0.072 (0.073)	Mem 41.61GB	Prec@1 100.000 (82.924)	Loss 1.2249 (1.4487)
[02/28 17:08:25][INFO] train_vision.py:  668: Epoch: [21][180/367], lr: 7.49e-05, eta: 2:37:03	Time 3.034 (3.017)	Data 0.074 (0.072)	Mem 41.61GB	Prec@1 80.000 (82.928)	Loss 1.5679 (1.4566)
[02/28 17:08:55][INFO] train_vision.py:  668: Epoch: [21][190/367], lr: 7.44e-05, eta: 2:36:30	Time 3.015 (3.015)	Data 0.031 (0.071)	Mem 41.61GB	Prec@1 100.000 (83.037)	Loss 1.1077 (1.4530)
[02/28 17:09:25][INFO] train_vision.py:  668: Epoch: [21][200/367], lr: 7.40e-05, eta: 2:35:58	Time 3.018 (3.015)	Data 0.060 (0.070)	Mem 41.61GB	Prec@1 80.000 (82.985)	Loss 1.4231 (1.4540)
[02/28 17:09:55][INFO] train_vision.py:  668: Epoch: [21][210/367], lr: 7.36e-05, eta: 2:35:25	Time 2.962 (3.014)	Data 0.055 (0.069)	Mem 41.61GB	Prec@1 70.000 (82.796)	Loss 1.9070 (1.4552)
[02/28 17:10:25][INFO] train_vision.py:  668: Epoch: [21][220/367], lr: 7.32e-05, eta: 2:34:53	Time 2.981 (3.013)	Data 0.056 (0.068)	Mem 41.61GB	Prec@1 80.000 (82.851)	Loss 1.7462 (1.4567)
[02/28 17:10:55][INFO] train_vision.py:  668: Epoch: [21][230/367], lr: 7.28e-05, eta: 2:34:20	Time 2.988 (3.012)	Data 0.054 (0.068)	Mem 41.61GB	Prec@1 100.000 (83.074)	Loss 0.9706 (1.4466)
[02/28 17:11:25][INFO] train_vision.py:  668: Epoch: [21][240/367], lr: 7.24e-05, eta: 2:33:48	Time 3.004 (3.012)	Data 0.056 (0.067)	Mem 41.61GB	Prec@1 70.000 (83.071)	Loss 1.8404 (1.4437)
[02/28 17:11:55][INFO] train_vision.py:  668: Epoch: [21][250/367], lr: 7.20e-05, eta: 2:33:16	Time 2.993 (3.011)	Data 0.053 (0.066)	Mem 41.61GB	Prec@1 90.000 (83.267)	Loss 1.5890 (1.4387)
[02/28 17:12:25][INFO] train_vision.py:  668: Epoch: [21][260/367], lr: 7.15e-05, eta: 2:32:44	Time 2.996 (3.011)	Data 0.054 (0.066)	Mem 41.61GB	Prec@1 80.000 (83.333)	Loss 1.6992 (1.4380)
[02/28 17:12:55][INFO] train_vision.py:  668: Epoch: [21][270/367], lr: 7.11e-05, eta: 2:32:13	Time 3.004 (3.010)	Data 0.055 (0.065)	Mem 41.61GB	Prec@1 90.000 (83.284)	Loss 1.2940 (1.4386)
[02/28 17:13:24][INFO] train_vision.py:  668: Epoch: [21][280/367], lr: 7.07e-05, eta: 2:31:41	Time 3.002 (3.010)	Data 0.047 (0.065)	Mem 41.61GB	Prec@1 100.000 (83.274)	Loss 1.1703 (1.4403)
[02/28 17:13:54][INFO] train_vision.py:  668: Epoch: [21][290/367], lr: 7.03e-05, eta: 2:31:10	Time 3.002 (3.009)	Data 0.053 (0.064)	Mem 41.61GB	Prec@1 70.000 (83.093)	Loss 2.1383 (1.4445)
[02/28 17:14:24][INFO] train_vision.py:  668: Epoch: [21][300/367], lr: 6.99e-05, eta: 2:30:38	Time 3.005 (3.009)	Data 0.052 (0.064)	Mem 41.61GB	Prec@1 90.000 (83.322)	Loss 1.3113 (1.4394)
[02/28 17:14:54][INFO] train_vision.py:  668: Epoch: [21][310/367], lr: 6.95e-05, eta: 2:30:08	Time 3.002 (3.009)	Data 0.053 (0.064)	Mem 41.61GB	Prec@1 100.000 (83.215)	Loss 1.1556 (1.4435)
[02/28 17:15:24][INFO] train_vision.py:  668: Epoch: [21][320/367], lr: 6.91e-05, eta: 2:29:36	Time 3.003 (3.008)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 90.000 (83.302)	Loss 1.0655 (1.4411)
[02/28 17:15:54][INFO] train_vision.py:  668: Epoch: [21][330/367], lr: 6.87e-05, eta: 2:29:05	Time 3.000 (3.008)	Data 0.056 (0.063)	Mem 41.61GB	Prec@1 90.000 (83.414)	Loss 1.1874 (1.4387)
[02/28 17:16:24][INFO] train_vision.py:  668: Epoch: [21][340/367], lr: 6.83e-05, eta: 2:28:34	Time 2.999 (3.008)	Data 0.038 (0.063)	Mem 41.61GB	Prec@1 90.000 (83.284)	Loss 1.4821 (1.4432)
[02/28 17:16:54][INFO] train_vision.py:  668: Epoch: [21][350/367], lr: 6.79e-05, eta: 2:28:03	Time 3.000 (3.007)	Data 0.047 (0.063)	Mem 41.61GB	Prec@1 80.000 (83.333)	Loss 1.8688 (1.4436)
[02/28 17:17:24][INFO] train_vision.py:  668: Epoch: [21][360/367], lr: 6.75e-05, eta: 2:27:32	Time 3.002 (3.007)	Data 0.052 (0.062)	Mem 41.61GB	Prec@1 90.000 (83.518)	Loss 1.3194 (1.4396)
[02/28 17:17:49][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 96.250 (96.250)	Prec@5 100.000 (100.000)	mPrec@1 (10.926)	mPrec@5 (11.458)
[02/28 17:18:31][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 100.000 (95.341)	Prec@5 100.000 (100.000)	mPrec@1 (30.481)	mPrec@5 (32.639)
[02/28 17:19:13][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 95.000 (94.286)	Prec@5 100.000 (99.643)	mPrec@1 (40.928)	mPrec@5 (44.992)
[02/28 17:19:55][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 95.000 (94.315)	Prec@5 98.750 (99.597)	mPrec@1 (47.717)	mPrec@5 (54.135)
[02/28 17:20:38][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 93.750 (94.146)	Prec@5 98.750 (99.512)	mPrec@1 (53.190)	mPrec@5 (61.344)
[02/28 17:21:20][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 87.500 (93.333)	Prec@5 100.000 (99.387)	mPrec@1 (53.957)	mPrec@5 (64.496)
[02/28 17:22:02][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 96.250 (93.586)	Prec@5 100.000 (99.467)	mPrec@1 (55.928)	mPrec@5 (67.559)
[02/28 17:22:44][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 95.000 (93.944)	Prec@5 100.000 (99.454)	mPrec@1 (56.454)	mPrec@5 (68.822)
[02/28 17:23:26][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 96.250 (94.074)	Prec@5 100.000 (99.460)	mPrec@1 (58.879)	mPrec@5 (73.081)
[02/28 17:24:09][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 88.750 (93.860)	Prec@5 100.000 (99.492)	mPrec@1 (60.013)	mPrec@5 (77.026)
[02/28 17:24:51][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 93.750 (93.119)	Prec@5 100.000 (99.418)	mPrec@1 (63.980)	mPrec@5 (85.629)
[02/28 17:25:33][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 83.750 (93.187)	Prec@5 98.750 (99.437)	mPrec@1 (64.367)	mPrec@5 (86.202)
[02/28 17:26:14][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 58.333 (92.216)	Prec@5 85.417 (99.295)	mPrec@1 (66.216)	mPrec@5 (93.073)
[02/28 17:26:14][INFO] train_vision.py:  847: Overall Prec@1 92.216% Prec@5 99.295% mPrec@1 (66.216) mPrec@5 (93.073)
[02/28 17:26:14][INFO] train_vision.py:  464: Testing: 66.21598815917969/66.21598815917969
[02/28 17:26:14][INFO] train_vision.py:  465: Saving:
[02/28 17:26:34][INFO] train_vision.py:  668: Epoch: [22][0/367], lr: 6.71e-05, eta: 4:21:07	Time 5.335 (5.335)	Data 2.450 (2.450)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.2697 (1.2697)
[02/28 17:27:04][INFO] train_vision.py:  668: Epoch: [22][10/367], lr: 6.68e-05, eta: 2:36:56	Time 3.004 (3.217)	Data 0.050 (0.277)	Mem 41.61GB	Prec@1 90.000 (91.818)	Loss 1.1002 (1.2134)
[02/28 17:27:34][INFO] train_vision.py:  668: Epoch: [22][20/367], lr: 6.64e-05, eta: 2:31:43	Time 3.041 (3.121)	Data 0.067 (0.169)	Mem 41.61GB	Prec@1 80.000 (87.619)	Loss 1.3995 (1.2968)
[02/28 17:28:04][INFO] train_vision.py:  668: Epoch: [22][30/367], lr: 6.60e-05, eta: 2:29:35	Time 3.019 (3.087)	Data 0.060 (0.133)	Mem 41.61GB	Prec@1 80.000 (86.129)	Loss 1.2887 (1.3465)
[02/28 17:28:34][INFO] train_vision.py:  668: Epoch: [22][40/367], lr: 6.56e-05, eta: 2:28:23	Time 3.088 (3.073)	Data 0.024 (0.113)	Mem 41.61GB	Prec@1 90.000 (85.122)	Loss 1.3191 (1.3650)
[02/28 17:29:05][INFO] train_vision.py:  668: Epoch: [22][50/367], lr: 6.52e-05, eta: 2:27:27	Time 3.044 (3.064)	Data 0.047 (0.101)	Mem 41.61GB	Prec@1 90.000 (84.902)	Loss 1.4621 (1.3716)
[02/28 17:29:35][INFO] train_vision.py:  668: Epoch: [22][60/367], lr: 6.48e-05, eta: 2:26:41	Time 3.039 (3.059)	Data 0.056 (0.093)	Mem 41.61GB	Prec@1 80.000 (85.082)	Loss 1.2394 (1.3671)
[02/28 17:30:05][INFO] train_vision.py:  668: Epoch: [22][70/367], lr: 6.44e-05, eta: 2:25:57	Time 3.048 (3.055)	Data 0.061 (0.087)	Mem 41.61GB	Prec@1 90.000 (85.070)	Loss 1.2238 (1.3636)
[02/28 17:30:35][INFO] train_vision.py:  668: Epoch: [22][80/367], lr: 6.40e-05, eta: 2:25:15	Time 3.022 (3.050)	Data 0.046 (0.083)	Mem 41.61GB	Prec@1 70.000 (83.333)	Loss 2.3059 (1.4050)
[02/28 17:31:06][INFO] train_vision.py:  668: Epoch: [22][90/367], lr: 6.36e-05, eta: 2:24:35	Time 3.034 (3.047)	Data 0.022 (0.079)	Mem 41.61GB	Prec@1 80.000 (83.407)	Loss 1.6579 (1.4045)
[02/28 17:31:36][INFO] train_vision.py:  668: Epoch: [22][100/367], lr: 6.32e-05, eta: 2:23:56	Time 3.038 (3.044)	Data 0.052 (0.077)	Mem 41.61GB	Prec@1 100.000 (84.158)	Loss 0.9440 (1.3836)
[02/28 17:32:06][INFO] train_vision.py:  668: Epoch: [22][110/367], lr: 6.28e-05, eta: 2:23:19	Time 3.033 (3.042)	Data 0.049 (0.074)	Mem 41.61GB	Prec@1 90.000 (83.784)	Loss 1.2922 (1.3926)
[02/28 17:32:36][INFO] train_vision.py:  668: Epoch: [22][120/367], lr: 6.24e-05, eta: 2:22:42	Time 3.028 (3.040)	Data 0.055 (0.073)	Mem 41.61GB	Prec@1 90.000 (83.884)	Loss 1.5209 (1.3946)
[02/28 17:33:06][INFO] train_vision.py:  668: Epoch: [22][130/367], lr: 6.20e-05, eta: 2:22:06	Time 3.006 (3.038)	Data 0.060 (0.071)	Mem 41.61GB	Prec@1 90.000 (84.122)	Loss 1.2362 (1.3839)
[02/28 17:33:36][INFO] train_vision.py:  668: Epoch: [22][140/367], lr: 6.16e-05, eta: 2:21:29	Time 2.986 (3.035)	Data 0.056 (0.070)	Mem 41.61GB	Prec@1 90.000 (84.255)	Loss 1.2197 (1.3851)
[02/28 17:34:06][INFO] train_vision.py:  668: Epoch: [22][150/367], lr: 6.13e-05, eta: 2:20:54	Time 3.000 (3.034)	Data 0.052 (0.068)	Mem 41.61GB	Prec@1 90.000 (84.371)	Loss 1.3959 (1.3822)
[02/28 17:34:36][INFO] train_vision.py:  668: Epoch: [22][160/367], lr: 6.09e-05, eta: 2:20:19	Time 3.004 (3.032)	Data 0.053 (0.068)	Mem 41.61GB	Prec@1 90.000 (84.348)	Loss 1.3384 (1.3895)
[02/28 17:35:07][INFO] train_vision.py:  668: Epoch: [22][170/367], lr: 6.05e-05, eta: 2:19:45	Time 3.007 (3.031)	Data 0.054 (0.067)	Mem 41.61GB	Prec@1 70.000 (84.211)	Loss 1.6730 (1.3907)
[02/28 17:35:37][INFO] train_vision.py:  668: Epoch: [22][180/367], lr: 6.01e-05, eta: 2:19:11	Time 2.985 (3.029)	Data 0.056 (0.066)	Mem 41.61GB	Prec@1 80.000 (84.420)	Loss 1.5987 (1.3934)
[02/28 17:36:07][INFO] train_vision.py:  668: Epoch: [22][190/367], lr: 5.97e-05, eta: 2:18:38	Time 3.008 (3.028)	Data 0.045 (0.065)	Mem 41.61GB	Prec@1 90.000 (84.503)	Loss 1.4291 (1.3929)
[02/28 17:36:37][INFO] train_vision.py:  668: Epoch: [22][200/367], lr: 5.93e-05, eta: 2:18:05	Time 3.000 (3.027)	Data 0.043 (0.064)	Mem 41.61GB	Prec@1 80.000 (84.726)	Loss 1.5432 (1.3902)
[02/28 17:37:07][INFO] train_vision.py:  668: Epoch: [22][210/367], lr: 5.89e-05, eta: 2:17:32	Time 3.016 (3.026)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 90.000 (84.502)	Loss 1.3895 (1.3947)
[02/28 17:37:37][INFO] train_vision.py:  668: Epoch: [22][220/367], lr: 5.86e-05, eta: 2:17:00	Time 3.018 (3.026)	Data 0.066 (0.063)	Mem 41.61GB	Prec@1 90.000 (84.525)	Loss 1.5328 (1.3920)
[02/28 17:38:07][INFO] train_vision.py:  668: Epoch: [22][230/367], lr: 5.82e-05, eta: 2:16:27	Time 3.005 (3.025)	Data 0.053 (0.063)	Mem 41.61GB	Prec@1 90.000 (84.459)	Loss 1.1362 (1.3892)
[02/28 17:38:37][INFO] train_vision.py:  668: Epoch: [22][240/367], lr: 5.78e-05, eta: 2:15:56	Time 3.017 (3.024)	Data 0.054 (0.063)	Mem 41.61GB	Prec@1 90.000 (84.398)	Loss 1.1597 (1.3918)
[02/28 17:39:07][INFO] train_vision.py:  668: Epoch: [22][250/367], lr: 5.74e-05, eta: 2:15:24	Time 3.015 (3.023)	Data 0.054 (0.062)	Mem 41.61GB	Prec@1 90.000 (84.582)	Loss 1.1493 (1.3906)
[02/28 17:39:37][INFO] train_vision.py:  668: Epoch: [22][260/367], lr: 5.70e-05, eta: 2:14:51	Time 3.009 (3.023)	Data 0.052 (0.062)	Mem 41.61GB	Prec@1 80.000 (84.713)	Loss 1.4242 (1.3876)
[02/28 17:40:07][INFO] train_vision.py:  668: Epoch: [22][270/367], lr: 5.67e-05, eta: 2:14:18	Time 2.994 (3.022)	Data 0.054 (0.062)	Mem 41.61GB	Prec@1 90.000 (84.686)	Loss 1.4104 (1.3920)
[02/28 17:40:37][INFO] train_vision.py:  668: Epoch: [22][280/367], lr: 5.63e-05, eta: 2:13:46	Time 3.005 (3.021)	Data 0.033 (0.061)	Mem 41.61GB	Prec@1 90.000 (84.769)	Loss 1.2697 (1.3900)
[02/28 17:41:07][INFO] train_vision.py:  668: Epoch: [22][290/367], lr: 5.59e-05, eta: 2:13:14	Time 3.016 (3.020)	Data 0.029 (0.061)	Mem 41.61GB	Prec@1 100.000 (84.845)	Loss 1.0065 (1.3859)
[02/28 17:41:37][INFO] train_vision.py:  668: Epoch: [22][300/367], lr: 5.55e-05, eta: 2:12:43	Time 3.012 (3.020)	Data 0.046 (0.061)	Mem 41.61GB	Prec@1 90.000 (84.884)	Loss 1.2511 (1.3865)
[02/28 17:42:07][INFO] train_vision.py:  668: Epoch: [22][310/367], lr: 5.52e-05, eta: 2:12:11	Time 3.002 (3.019)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 100.000 (84.855)	Loss 1.2230 (1.3880)
[02/28 17:42:37][INFO] train_vision.py:  668: Epoch: [22][320/367], lr: 5.48e-05, eta: 2:11:40	Time 3.017 (3.019)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 80.000 (84.891)	Loss 1.6778 (1.3853)
[02/28 17:43:07][INFO] train_vision.py:  668: Epoch: [22][330/367], lr: 5.44e-05, eta: 2:11:08	Time 3.009 (3.018)	Data 0.021 (0.060)	Mem 41.61GB	Prec@1 80.000 (84.773)	Loss 1.3579 (1.3876)
[02/28 17:43:37][INFO] train_vision.py:  668: Epoch: [22][340/367], lr: 5.41e-05, eta: 2:10:37	Time 3.011 (3.018)	Data 0.048 (0.060)	Mem 41.61GB	Prec@1 90.000 (84.633)	Loss 1.3096 (1.3926)
[02/28 17:44:07][INFO] train_vision.py:  668: Epoch: [22][350/367], lr: 5.37e-05, eta: 2:10:06	Time 3.010 (3.018)	Data 0.060 (0.059)	Mem 41.61GB	Prec@1 100.000 (84.701)	Loss 1.1020 (1.3902)
[02/28 17:44:38][INFO] train_vision.py:  668: Epoch: [22][360/367], lr: 5.33e-05, eta: 2:09:35	Time 3.025 (3.017)	Data 0.050 (0.059)	Mem 41.61GB	Prec@1 70.000 (84.654)	Loss 1.8739 (1.3925)
[02/28 17:45:01][INFO] train_vision.py:  668: Epoch: [23][0/367], lr: 5.30e-05, eta: 4:07:50	Time 5.786 (5.786)	Data 2.398 (2.398)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.4706 (1.4706)
[02/28 17:45:31][INFO] train_vision.py:  668: Epoch: [23][10/367], lr: 5.27e-05, eta: 2:19:12	Time 2.980 (3.263)	Data 0.050 (0.267)	Mem 41.61GB	Prec@1 90.000 (78.182)	Loss 1.1902 (1.4002)
[02/28 17:46:01][INFO] train_vision.py:  668: Epoch: [23][20/367], lr: 5.23e-05, eta: 2:13:26	Time 2.994 (3.140)	Data 0.052 (0.165)	Mem 41.61GB	Prec@1 60.000 (79.524)	Loss 1.7672 (1.4185)
[02/28 17:46:31][INFO] train_vision.py:  668: Epoch: [23][30/367], lr: 5.20e-05, eta: 2:10:54	Time 2.989 (3.092)	Data 0.055 (0.127)	Mem 41.61GB	Prec@1 90.000 (81.935)	Loss 1.3184 (1.3795)
[02/28 17:47:01][INFO] train_vision.py:  668: Epoch: [23][40/367], lr: 5.16e-05, eta: 2:09:21	Time 2.992 (3.068)	Data 0.025 (0.108)	Mem 41.61GB	Prec@1 70.000 (81.951)	Loss 1.6078 (1.3869)
[02/28 17:47:31][INFO] train_vision.py:  668: Epoch: [23][50/367], lr: 5.12e-05, eta: 2:08:13	Time 2.998 (3.053)	Data 0.051 (0.096)	Mem 41.61GB	Prec@1 90.000 (83.333)	Loss 1.4962 (1.3600)
[02/28 17:48:01][INFO] train_vision.py:  668: Epoch: [23][60/367], lr: 5.09e-05, eta: 2:07:17	Time 2.993 (3.043)	Data 0.049 (0.088)	Mem 41.61GB	Prec@1 70.000 (83.443)	Loss 1.5727 (1.3669)
[02/28 17:48:31][INFO] train_vision.py:  668: Epoch: [23][70/367], lr: 5.05e-05, eta: 2:06:28	Time 2.968 (3.036)	Data 0.046 (0.083)	Mem 41.61GB	Prec@1 60.000 (82.958)	Loss 1.7017 (1.3770)
[02/28 17:49:00][INFO] train_vision.py:  668: Epoch: [23][80/367], lr: 5.02e-05, eta: 2:05:45	Time 3.017 (3.030)	Data 0.022 (0.078)	Mem 41.61GB	Prec@1 100.000 (83.704)	Loss 0.9802 (1.3651)
[02/28 17:49:30][INFO] train_vision.py:  668: Epoch: [23][90/367], lr: 4.98e-05, eta: 2:05:04	Time 2.981 (3.026)	Data 0.049 (0.074)	Mem 41.61GB	Prec@1 80.000 (83.626)	Loss 1.2549 (1.3678)
[02/28 17:50:00][INFO] train_vision.py:  668: Epoch: [23][100/367], lr: 4.95e-05, eta: 2:04:26	Time 3.014 (3.023)	Data 0.022 (0.071)	Mem 41.61GB	Prec@1 90.000 (83.663)	Loss 1.1321 (1.3682)
[02/28 17:50:30][INFO] train_vision.py:  668: Epoch: [23][110/367], lr: 4.91e-05, eta: 2:03:48	Time 2.984 (3.020)	Data 0.051 (0.069)	Mem 41.61GB	Prec@1 70.000 (83.604)	Loss 1.7353 (1.3734)
[02/28 17:51:00][INFO] train_vision.py:  668: Epoch: [23][120/367], lr: 4.87e-05, eta: 2:03:14	Time 2.988 (3.018)	Data 0.048 (0.067)	Mem 41.61GB	Prec@1 90.000 (83.967)	Loss 1.2528 (1.3733)
[02/28 17:51:30][INFO] train_vision.py:  668: Epoch: [23][130/367], lr: 4.84e-05, eta: 2:02:40	Time 3.001 (3.017)	Data 0.040 (0.066)	Mem 41.61GB	Prec@1 80.000 (84.122)	Loss 1.3422 (1.3776)
[02/28 17:52:00][INFO] train_vision.py:  668: Epoch: [23][140/367], lr: 4.80e-05, eta: 2:02:05	Time 2.994 (3.015)	Data 0.053 (0.065)	Mem 41.61GB	Prec@1 80.000 (84.397)	Loss 1.6875 (1.3749)
[02/28 17:52:30][INFO] train_vision.py:  668: Epoch: [23][150/367], lr: 4.77e-05, eta: 2:01:32	Time 3.027 (3.014)	Data 0.020 (0.064)	Mem 41.61GB	Prec@1 70.000 (84.371)	Loss 1.4804 (1.3745)
[02/28 17:53:00][INFO] train_vision.py:  668: Epoch: [23][160/367], lr: 4.73e-05, eta: 2:00:59	Time 2.995 (3.012)	Data 0.059 (0.063)	Mem 41.61GB	Prec@1 80.000 (84.658)	Loss 1.2569 (1.3658)
[02/28 17:53:30][INFO] train_vision.py:  668: Epoch: [23][170/367], lr: 4.70e-05, eta: 2:00:26	Time 3.019 (3.011)	Data 0.020 (0.062)	Mem 41.61GB	Prec@1 80.000 (84.444)	Loss 1.5583 (1.3726)
[02/28 17:54:00][INFO] train_vision.py:  668: Epoch: [23][180/367], lr: 4.67e-05, eta: 1:59:54	Time 2.989 (3.010)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 90.000 (84.309)	Loss 1.2576 (1.3786)
[02/28 17:54:30][INFO] train_vision.py:  668: Epoch: [23][190/367], lr: 4.63e-05, eta: 1:59:22	Time 2.996 (3.009)	Data 0.045 (0.061)	Mem 41.61GB	Prec@1 70.000 (84.241)	Loss 2.0487 (1.3854)
[02/28 17:55:00][INFO] train_vision.py:  668: Epoch: [23][200/367], lr: 4.60e-05, eta: 1:58:50	Time 3.025 (3.009)	Data 0.020 (0.060)	Mem 41.61GB	Prec@1 90.000 (84.129)	Loss 1.3969 (1.3860)
[02/28 17:55:30][INFO] train_vision.py:  668: Epoch: [23][210/367], lr: 4.56e-05, eta: 1:58:18	Time 2.984 (3.008)	Data 0.053 (0.060)	Mem 41.61GB	Prec@1 70.000 (83.507)	Loss 1.7011 (1.3971)
[02/28 17:56:00][INFO] train_vision.py:  668: Epoch: [23][220/367], lr: 4.53e-05, eta: 1:57:47	Time 2.997 (3.008)	Data 0.045 (0.059)	Mem 41.61GB	Prec@1 90.000 (83.348)	Loss 1.2563 (1.4008)
[02/28 17:56:30][INFO] train_vision.py:  668: Epoch: [23][230/367], lr: 4.49e-05, eta: 1:57:17	Time 2.998 (3.007)	Data 0.061 (0.059)	Mem 41.61GB	Prec@1 80.000 (83.333)	Loss 1.6702 (1.4007)
[02/28 17:57:00][INFO] train_vision.py:  668: Epoch: [23][240/367], lr: 4.46e-05, eta: 1:56:45	Time 2.993 (3.007)	Data 0.055 (0.059)	Mem 41.61GB	Prec@1 90.000 (83.527)	Loss 1.2756 (1.3940)
[02/28 17:57:30][INFO] train_vision.py:  668: Epoch: [23][250/367], lr: 4.43e-05, eta: 1:56:14	Time 2.975 (3.006)	Data 0.043 (0.058)	Mem 41.61GB	Prec@1 100.000 (83.625)	Loss 0.9825 (1.3910)
[02/28 17:58:00][INFO] train_vision.py:  668: Epoch: [23][260/367], lr: 4.39e-05, eta: 1:55:43	Time 2.992 (3.006)	Data 0.046 (0.058)	Mem 41.61GB	Prec@1 90.000 (83.448)	Loss 1.1175 (1.3961)
[02/28 17:58:29][INFO] train_vision.py:  668: Epoch: [23][270/367], lr: 4.36e-05, eta: 1:55:12	Time 2.990 (3.005)	Data 0.050 (0.058)	Mem 41.61GB	Prec@1 90.000 (83.506)	Loss 1.3333 (1.3953)
[02/28 17:58:59][INFO] train_vision.py:  668: Epoch: [23][280/367], lr: 4.32e-05, eta: 1:54:40	Time 2.993 (3.005)	Data 0.051 (0.057)	Mem 41.61GB	Prec@1 90.000 (83.416)	Loss 1.5070 (1.4022)
[02/28 17:59:29][INFO] train_vision.py:  668: Epoch: [23][290/367], lr: 4.29e-05, eta: 1:54:09	Time 2.994 (3.004)	Data 0.029 (0.057)	Mem 41.61GB	Prec@1 80.000 (83.505)	Loss 1.5198 (1.4000)
[02/28 17:59:59][INFO] train_vision.py:  668: Epoch: [23][300/367], lr: 4.26e-05, eta: 1:53:38	Time 3.018 (3.004)	Data 0.025 (0.056)	Mem 41.61GB	Prec@1 90.000 (83.488)	Loss 1.2337 (1.4040)
[02/28 18:00:29][INFO] train_vision.py:  668: Epoch: [23][310/367], lr: 4.22e-05, eta: 1:53:07	Time 2.993 (3.004)	Data 0.048 (0.056)	Mem 41.61GB	Prec@1 80.000 (83.666)	Loss 1.6071 (1.4009)
[02/28 18:00:59][INFO] train_vision.py:  668: Epoch: [23][320/367], lr: 4.19e-05, eta: 1:52:37	Time 2.992 (3.003)	Data 0.052 (0.056)	Mem 41.61GB	Prec@1 90.000 (83.769)	Loss 1.1479 (1.3986)
[02/28 18:01:29][INFO] train_vision.py:  668: Epoch: [23][330/367], lr: 4.16e-05, eta: 1:52:07	Time 2.994 (3.003)	Data 0.039 (0.056)	Mem 41.61GB	Prec@1 80.000 (83.686)	Loss 1.3046 (1.4013)
[02/28 18:01:59][INFO] train_vision.py:  668: Epoch: [23][340/367], lr: 4.13e-05, eta: 1:51:36	Time 3.002 (3.003)	Data 0.060 (0.055)	Mem 41.61GB	Prec@1 90.000 (83.548)	Loss 1.1675 (1.4029)
[02/28 18:02:29][INFO] train_vision.py:  668: Epoch: [23][350/367], lr: 4.09e-05, eta: 1:51:05	Time 3.005 (3.003)	Data 0.023 (0.055)	Mem 41.61GB	Prec@1 80.000 (83.447)	Loss 1.2954 (1.4075)
[02/28 18:02:59][INFO] train_vision.py:  668: Epoch: [23][360/367], lr: 4.06e-05, eta: 1:50:34	Time 2.990 (3.002)	Data 0.052 (0.055)	Mem 41.61GB	Prec@1 90.000 (83.684)	Loss 1.3168 (1.4023)
[02/28 18:03:23][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 96.250 (96.250)	Prec@5 100.000 (100.000)	mPrec@1 (10.926)	mPrec@5 (11.458)
[02/28 18:04:05][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 97.500 (96.818)	Prec@5 100.000 (100.000)	mPrec@1 (31.292)	mPrec@5 (32.639)
[02/28 18:04:47][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 100.000 (95.417)	Prec@5 100.000 (99.583)	mPrec@1 (41.658)	mPrec@5 (44.913)
[02/28 18:05:29][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 93.750 (95.161)	Prec@5 98.750 (99.556)	mPrec@1 (48.419)	mPrec@5 (54.092)
[02/28 18:06:11][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 96.250 (95.274)	Prec@5 98.750 (99.512)	mPrec@1 (54.158)	mPrec@5 (61.315)
[02/28 18:06:53][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 92.500 (94.510)	Prec@5 100.000 (99.289)	mPrec@1 (54.964)	mPrec@5 (64.047)
[02/28 18:07:35][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 95.000 (94.590)	Prec@5 100.000 (99.406)	mPrec@1 (57.129)	mPrec@5 (67.236)
[02/28 18:08:18][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 96.250 (94.806)	Prec@5 100.000 (99.489)	mPrec@1 (57.912)	mPrec@5 (69.010)
[02/28 18:09:00][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 93.750 (94.907)	Prec@5 100.000 (99.491)	mPrec@1 (60.922)	mPrec@5 (73.224)
[02/28 18:09:42][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 92.500 (94.725)	Prec@5 100.000 (99.519)	mPrec@1 (63.094)	mPrec@5 (77.048)
[02/28 18:10:24][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 97.500 (94.196)	Prec@5 100.000 (99.455)	mPrec@1 (67.464)	mPrec@5 (85.968)
[02/28 18:11:06][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 91.250 (94.189)	Prec@5 100.000 (99.471)	mPrec@1 (67.510)	mPrec@5 (86.530)
[02/28 18:11:47][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 60.417 (93.335)	Prec@5 83.333 (99.326)	mPrec@1 (68.877)	mPrec@5 (93.201)
[02/28 18:11:47][INFO] train_vision.py:  847: Overall Prec@1 93.335% Prec@5 99.326% mPrec@1 (68.877) mPrec@5 (93.201)
[02/28 18:11:47][INFO] train_vision.py:  464: Testing: 68.87699127197266/68.87699127197266
[02/28 18:11:47][INFO] train_vision.py:  465: Saving:
[02/28 18:12:06][INFO] train_vision.py:  668: Epoch: [24][0/367], lr: 4.03e-05, eta: 3:09:48	Time 5.170 (5.170)	Data 2.284 (2.284)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.5295 (1.5295)
[02/28 18:12:36][INFO] train_vision.py:  668: Epoch: [24][10/367], lr: 4.01e-05, eta: 1:57:12	Time 3.024 (3.207)	Data 0.063 (0.263)	Mem 41.61GB	Prec@1 90.000 (89.091)	Loss 1.1490 (1.3267)
[02/28 18:13:07][INFO] train_vision.py:  668: Epoch: [24][20/367], lr: 3.97e-05, eta: 1:53:25	Time 3.039 (3.118)	Data 0.064 (0.166)	Mem 41.61GB	Prec@1 100.000 (88.095)	Loss 0.9601 (1.3603)
[02/28 18:13:37][INFO] train_vision.py:  668: Epoch: [24][30/367], lr: 3.94e-05, eta: 1:51:47	Time 3.004 (3.087)	Data 0.050 (0.131)	Mem 41.61GB	Prec@1 90.000 (89.355)	Loss 1.2764 (1.3205)
[02/28 18:14:07][INFO] train_vision.py:  668: Epoch: [24][40/367], lr: 3.91e-05, eta: 1:50:42	Time 3.037 (3.071)	Data 0.057 (0.114)	Mem 41.61GB	Prec@1 80.000 (88.780)	Loss 1.4750 (1.3417)
[02/28 18:14:37][INFO] train_vision.py:  668: Epoch: [24][50/367], lr: 3.88e-05, eta: 1:49:49	Time 3.032 (3.061)	Data 0.066 (0.103)	Mem 41.61GB	Prec@1 100.000 (89.020)	Loss 0.9853 (1.3299)
[02/28 18:15:07][INFO] train_vision.py:  668: Epoch: [24][60/367], lr: 3.85e-05, eta: 1:49:02	Time 3.029 (3.053)	Data 0.062 (0.095)	Mem 41.61GB	Prec@1 90.000 (88.033)	Loss 1.0784 (1.3400)
[02/28 18:15:37][INFO] train_vision.py:  668: Epoch: [24][70/367], lr: 3.81e-05, eta: 1:48:18	Time 3.007 (3.047)	Data 0.069 (0.090)	Mem 41.61GB	Prec@1 100.000 (88.451)	Loss 0.9652 (1.3255)
[02/28 18:16:08][INFO] train_vision.py:  668: Epoch: [24][80/367], lr: 3.78e-05, eta: 1:47:37	Time 3.013 (3.042)	Data 0.054 (0.086)	Mem 41.61GB	Prec@1 100.000 (88.148)	Loss 0.9810 (1.3185)
[02/28 18:16:38][INFO] train_vision.py:  668: Epoch: [24][90/367], lr: 3.75e-05, eta: 1:46:59	Time 3.012 (3.038)	Data 0.027 (0.081)	Mem 41.61GB	Prec@1 90.000 (88.352)	Loss 1.5127 (1.3163)
[02/28 18:17:08][INFO] train_vision.py:  668: Epoch: [24][100/367], lr: 3.72e-05, eta: 1:46:21	Time 3.019 (3.035)	Data 0.063 (0.079)	Mem 41.61GB	Prec@1 90.000 (88.317)	Loss 1.3061 (1.3114)
[02/28 18:17:38][INFO] train_vision.py:  668: Epoch: [24][110/367], lr: 3.69e-05, eta: 1:45:46	Time 3.003 (3.032)	Data 0.057 (0.076)	Mem 41.61GB	Prec@1 80.000 (87.477)	Loss 1.4887 (1.3169)
[02/28 18:18:08][INFO] train_vision.py:  668: Epoch: [24][120/367], lr: 3.66e-05, eta: 1:45:11	Time 3.024 (3.030)	Data 0.054 (0.074)	Mem 41.61GB	Prec@1 90.000 (87.438)	Loss 1.2824 (1.3191)
[02/28 18:18:38][INFO] train_vision.py:  668: Epoch: [24][130/367], lr: 3.63e-05, eta: 1:44:37	Time 2.995 (3.028)	Data 0.057 (0.072)	Mem 41.61GB	Prec@1 90.000 (87.328)	Loss 1.0797 (1.3231)
[02/28 18:19:08][INFO] train_vision.py:  668: Epoch: [24][140/367], lr: 3.60e-05, eta: 1:44:04	Time 3.020 (3.027)	Data 0.066 (0.071)	Mem 41.61GB	Prec@1 80.000 (87.305)	Loss 1.4235 (1.3218)
[02/28 18:19:38][INFO] train_vision.py:  668: Epoch: [24][150/367], lr: 3.57e-05, eta: 1:43:31	Time 2.992 (3.026)	Data 0.055 (0.070)	Mem 41.61GB	Prec@1 90.000 (87.351)	Loss 1.1275 (1.3212)
[02/28 18:20:08][INFO] train_vision.py:  668: Epoch: [24][160/367], lr: 3.53e-05, eta: 1:42:58	Time 3.003 (3.024)	Data 0.057 (0.069)	Mem 41.61GB	Prec@1 90.000 (87.453)	Loss 1.1605 (1.3201)
[02/28 18:20:38][INFO] train_vision.py:  668: Epoch: [24][170/367], lr: 3.50e-05, eta: 1:42:26	Time 3.026 (3.023)	Data 0.030 (0.068)	Mem 41.61GB	Prec@1 70.000 (87.135)	Loss 1.5620 (1.3261)
[02/28 18:21:08][INFO] train_vision.py:  668: Epoch: [24][180/367], lr: 3.47e-05, eta: 1:41:54	Time 3.021 (3.023)	Data 0.067 (0.068)	Mem 41.61GB	Prec@1 60.000 (86.961)	Loss 1.5740 (1.3323)
[02/28 18:21:38][INFO] train_vision.py:  668: Epoch: [24][190/367], lr: 3.44e-05, eta: 1:41:22	Time 2.992 (3.022)	Data 0.052 (0.067)	Mem 41.61GB	Prec@1 100.000 (87.120)	Loss 1.0901 (1.3253)
[02/28 18:22:08][INFO] train_vision.py:  668: Epoch: [24][200/367], lr: 3.41e-05, eta: 1:40:50	Time 3.016 (3.021)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 60.000 (86.617)	Loss 1.5583 (1.3370)
[02/28 18:22:38][INFO] train_vision.py:  668: Epoch: [24][210/367], lr: 3.38e-05, eta: 1:40:18	Time 2.984 (3.020)	Data 0.057 (0.066)	Mem 41.61GB	Prec@1 90.000 (86.540)	Loss 1.1697 (1.3359)
[02/28 18:23:08][INFO] train_vision.py:  668: Epoch: [24][220/367], lr: 3.35e-05, eta: 1:39:47	Time 3.029 (3.019)	Data 0.024 (0.065)	Mem 41.61GB	Prec@1 80.000 (86.561)	Loss 1.7612 (1.3373)
[02/28 18:23:38][INFO] train_vision.py:  668: Epoch: [24][230/367], lr: 3.32e-05, eta: 1:39:15	Time 2.997 (3.019)	Data 0.061 (0.064)	Mem 41.61GB	Prec@1 90.000 (86.234)	Loss 1.1282 (1.3454)
[02/28 18:24:09][INFO] train_vision.py:  668: Epoch: [24][240/367], lr: 3.29e-05, eta: 1:38:44	Time 3.002 (3.018)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 70.000 (86.224)	Loss 1.6180 (1.3435)
[02/28 18:24:39][INFO] train_vision.py:  668: Epoch: [24][250/367], lr: 3.27e-05, eta: 1:38:13	Time 2.995 (3.018)	Data 0.057 (0.064)	Mem 41.61GB	Prec@1 80.000 (86.096)	Loss 1.4535 (1.3449)
[02/28 18:25:09][INFO] train_vision.py:  668: Epoch: [24][260/367], lr: 3.24e-05, eta: 1:37:42	Time 3.015 (3.017)	Data 0.067 (0.063)	Mem 41.61GB	Prec@1 80.000 (86.015)	Loss 1.6567 (1.3460)
[02/28 18:25:39][INFO] train_vision.py:  668: Epoch: [24][270/367], lr: 3.21e-05, eta: 1:37:11	Time 2.983 (3.017)	Data 0.057 (0.063)	Mem 41.61GB	Prec@1 80.000 (86.162)	Loss 1.7338 (1.3439)
[02/28 18:26:09][INFO] train_vision.py:  668: Epoch: [24][280/367], lr: 3.18e-05, eta: 1:36:39	Time 3.014 (3.016)	Data 0.057 (0.062)	Mem 41.61GB	Prec@1 100.000 (85.836)	Loss 1.0954 (1.3532)
[02/28 18:26:39][INFO] train_vision.py:  668: Epoch: [24][290/367], lr: 3.15e-05, eta: 1:36:08	Time 3.031 (3.015)	Data 0.021 (0.062)	Mem 41.61GB	Prec@1 90.000 (85.739)	Loss 1.4127 (1.3549)
[02/28 18:27:09][INFO] train_vision.py:  668: Epoch: [24][300/367], lr: 3.12e-05, eta: 1:35:37	Time 3.016 (3.015)	Data 0.062 (0.062)	Mem 41.61GB	Prec@1 70.000 (85.415)	Loss 1.7852 (1.3638)
[02/28 18:27:39][INFO] train_vision.py:  668: Epoch: [24][310/367], lr: 3.09e-05, eta: 1:35:06	Time 2.999 (3.015)	Data 0.061 (0.061)	Mem 41.61GB	Prec@1 60.000 (85.338)	Loss 1.7654 (1.3648)
[02/28 18:28:09][INFO] train_vision.py:  668: Epoch: [24][320/367], lr: 3.06e-05, eta: 1:34:36	Time 3.043 (3.014)	Data 0.022 (0.061)	Mem 41.61GB	Prec@1 100.000 (85.265)	Loss 1.0106 (1.3681)
[02/28 18:28:39][INFO] train_vision.py:  668: Epoch: [24][330/367], lr: 3.04e-05, eta: 1:34:05	Time 2.999 (3.014)	Data 0.058 (0.061)	Mem 41.61GB	Prec@1 90.000 (85.317)	Loss 1.2566 (1.3687)
[02/28 18:29:09][INFO] train_vision.py:  668: Epoch: [24][340/367], lr: 3.01e-05, eta: 1:33:34	Time 2.986 (3.014)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 90.000 (85.279)	Loss 1.2234 (1.3684)
[02/28 18:29:39][INFO] train_vision.py:  668: Epoch: [24][350/367], lr: 2.98e-05, eta: 1:33:04	Time 3.002 (3.014)	Data 0.058 (0.060)	Mem 41.61GB	Prec@1 90.000 (85.185)	Loss 1.1621 (1.3709)
[02/28 18:30:09][INFO] train_vision.py:  668: Epoch: [24][360/367], lr: 2.95e-05, eta: 1:32:33	Time 3.003 (3.013)	Data 0.054 (0.060)	Mem 41.61GB	Prec@1 100.000 (85.208)	Loss 1.0241 (1.3710)
[02/28 18:30:32][INFO] train_vision.py:  668: Epoch: [25][0/367], lr: 2.93e-05, eta: 2:54:36	Time 5.706 (5.706)	Data 2.423 (2.423)	Mem 41.61GB	Prec@1 100.000 (100.000)	Loss 1.0407 (1.0407)
[02/28 18:31:02][INFO] train_vision.py:  668: Epoch: [25][10/367], lr: 2.90e-05, eta: 1:39:21	Time 3.055 (3.265)	Data 0.019 (0.266)	Mem 41.61GB	Prec@1 90.000 (80.000)	Loss 1.3167 (1.5043)
[02/28 18:31:33][INFO] train_vision.py:  668: Epoch: [25][20/367], lr: 2.88e-05, eta: 1:35:16	Time 3.033 (3.148)	Data 0.056 (0.166)	Mem 41.61GB	Prec@1 100.000 (82.857)	Loss 1.0669 (1.4123)
[02/28 18:32:03][INFO] train_vision.py:  668: Epoch: [25][30/367], lr: 2.85e-05, eta: 1:33:24	Time 3.057 (3.103)	Data 0.024 (0.127)	Mem 41.61GB	Prec@1 90.000 (85.484)	Loss 1.0769 (1.3450)
[02/28 18:32:33][INFO] train_vision.py:  668: Epoch: [25][40/367], lr: 2.82e-05, eta: 1:32:10	Time 2.999 (3.079)	Data 0.054 (0.109)	Mem 41.61GB	Prec@1 100.000 (84.878)	Loss 0.9276 (1.3594)
[02/28 18:33:03][INFO] train_vision.py:  668: Epoch: [25][50/367], lr: 2.79e-05, eta: 1:31:11	Time 2.954 (3.064)	Data 0.055 (0.097)	Mem 41.61GB	Prec@1 90.000 (85.882)	Loss 1.1261 (1.3437)
[02/28 18:33:33][INFO] train_vision.py:  668: Epoch: [25][60/367], lr: 2.77e-05, eta: 1:30:25	Time 2.985 (3.055)	Data 0.056 (0.090)	Mem 41.61GB	Prec@1 90.000 (85.902)	Loss 1.2143 (1.3450)
[02/28 18:34:03][INFO] train_vision.py:  668: Epoch: [25][70/367], lr: 2.74e-05, eta: 1:29:41	Time 3.035 (3.048)	Data 0.052 (0.084)	Mem 41.61GB	Prec@1 80.000 (85.775)	Loss 1.7627 (1.3627)
[02/28 18:34:33][INFO] train_vision.py:  668: Epoch: [25][80/367], lr: 2.71e-05, eta: 1:29:01	Time 2.979 (3.042)	Data 0.054 (0.080)	Mem 41.61GB	Prec@1 90.000 (84.815)	Loss 1.2436 (1.3831)
[02/28 18:35:03][INFO] train_vision.py:  668: Epoch: [25][90/367], lr: 2.69e-05, eta: 1:28:23	Time 3.060 (3.038)	Data 0.021 (0.076)	Mem 41.61GB	Prec@1 100.000 (85.934)	Loss 0.9615 (1.3720)
[02/28 18:35:33][INFO] train_vision.py:  668: Epoch: [25][100/367], lr: 2.66e-05, eta: 1:27:46	Time 2.994 (3.034)	Data 0.056 (0.074)	Mem 41.61GB	Prec@1 70.000 (85.941)	Loss 1.7118 (1.3698)
[02/28 18:36:03][INFO] train_vision.py:  668: Epoch: [25][110/367], lr: 2.63e-05, eta: 1:27:11	Time 3.003 (3.031)	Data 0.053 (0.072)	Mem 41.61GB	Prec@1 90.000 (85.856)	Loss 1.2018 (1.3722)
[02/28 18:36:33][INFO] train_vision.py:  668: Epoch: [25][120/367], lr: 2.61e-05, eta: 1:26:36	Time 3.005 (3.028)	Data 0.058 (0.070)	Mem 41.61GB	Prec@1 90.000 (85.785)	Loss 1.5310 (1.3843)
[02/28 18:37:03][INFO] train_vision.py:  668: Epoch: [25][130/367], lr: 2.58e-05, eta: 1:26:03	Time 3.027 (3.026)	Data 0.044 (0.069)	Mem 41.61GB	Prec@1 60.000 (85.267)	Loss 1.8703 (1.3963)
[02/28 18:37:33][INFO] train_vision.py:  668: Epoch: [25][140/367], lr: 2.56e-05, eta: 1:25:31	Time 3.002 (3.026)	Data 0.056 (0.067)	Mem 41.61GB	Prec@1 90.000 (85.319)	Loss 1.1261 (1.3913)
[02/28 18:38:03][INFO] train_vision.py:  668: Epoch: [25][150/367], lr: 2.53e-05, eta: 1:24:58	Time 2.992 (3.024)	Data 0.052 (0.066)	Mem 41.61GB	Prec@1 100.000 (85.629)	Loss 0.9556 (1.3846)
[02/28 18:38:33][INFO] train_vision.py:  668: Epoch: [25][160/367], lr: 2.50e-05, eta: 1:24:25	Time 3.004 (3.022)	Data 0.059 (0.065)	Mem 41.61GB	Prec@1 70.000 (85.466)	Loss 1.5667 (1.3863)
[02/28 18:39:03][INFO] train_vision.py:  668: Epoch: [25][170/367], lr: 2.48e-05, eta: 1:23:53	Time 3.030 (3.021)	Data 0.022 (0.064)	Mem 41.61GB	Prec@1 80.000 (85.439)	Loss 1.9767 (1.3902)
[02/28 18:39:33][INFO] train_vision.py:  668: Epoch: [25][180/367], lr: 2.45e-05, eta: 1:23:21	Time 3.006 (3.020)	Data 0.058 (0.064)	Mem 41.61GB	Prec@1 80.000 (85.525)	Loss 1.6436 (1.3885)
[02/28 18:40:03][INFO] train_vision.py:  668: Epoch: [25][190/367], lr: 2.43e-05, eta: 1:22:49	Time 2.989 (3.019)	Data 0.055 (0.063)	Mem 41.61GB	Prec@1 80.000 (85.707)	Loss 1.3686 (1.3850)
[02/28 18:40:33][INFO] train_vision.py:  668: Epoch: [25][200/367], lr: 2.40e-05, eta: 1:22:18	Time 2.999 (3.019)	Data 0.058 (0.062)	Mem 41.61GB	Prec@1 90.000 (85.771)	Loss 1.2869 (1.3822)
[02/28 18:41:03][INFO] train_vision.py:  668: Epoch: [25][210/367], lr: 2.38e-05, eta: 1:21:47	Time 2.995 (3.018)	Data 0.053 (0.062)	Mem 41.61GB	Prec@1 100.000 (85.829)	Loss 0.9536 (1.3797)
[02/28 18:41:33][INFO] train_vision.py:  668: Epoch: [25][220/367], lr: 2.35e-05, eta: 1:21:17	Time 3.024 (3.018)	Data 0.030 (0.061)	Mem 41.61GB	Prec@1 80.000 (85.475)	Loss 1.3441 (1.3819)
[02/28 18:42:03][INFO] train_vision.py:  668: Epoch: [25][230/367], lr: 2.33e-05, eta: 1:20:45	Time 2.994 (3.017)	Data 0.040 (0.061)	Mem 41.61GB	Prec@1 80.000 (85.498)	Loss 1.5182 (1.3819)
[02/28 18:42:34][INFO] train_vision.py:  668: Epoch: [25][240/367], lr: 2.30e-05, eta: 1:20:14	Time 3.009 (3.017)	Data 0.056 (0.060)	Mem 41.61GB	Prec@1 70.000 (85.560)	Loss 1.3619 (1.3784)
[02/28 18:43:04][INFO] train_vision.py:  668: Epoch: [25][250/367], lr: 2.28e-05, eta: 1:19:44	Time 3.025 (3.017)	Data 0.068 (0.060)	Mem 41.61GB	Prec@1 90.000 (85.578)	Loss 1.1511 (1.3805)
[02/28 18:43:34][INFO] train_vision.py:  668: Epoch: [25][260/367], lr: 2.25e-05, eta: 1:19:13	Time 3.005 (3.016)	Data 0.056 (0.060)	Mem 41.61GB	Prec@1 70.000 (85.594)	Loss 1.3835 (1.3787)
[02/28 18:44:04][INFO] train_vision.py:  668: Epoch: [25][270/367], lr: 2.23e-05, eta: 1:18:42	Time 2.996 (3.016)	Data 0.054 (0.060)	Mem 41.61GB	Prec@1 90.000 (85.314)	Loss 1.1950 (1.3863)
[02/28 18:44:34][INFO] train_vision.py:  668: Epoch: [25][280/367], lr: 2.21e-05, eta: 1:18:11	Time 3.001 (3.015)	Data 0.052 (0.059)	Mem 41.61GB	Prec@1 60.000 (85.231)	Loss 1.7763 (1.3860)
[02/28 18:45:04][INFO] train_vision.py:  668: Epoch: [25][290/367], lr: 2.18e-05, eta: 1:17:40	Time 2.999 (3.015)	Data 0.050 (0.059)	Mem 41.61GB	Prec@1 100.000 (85.430)	Loss 1.0942 (1.3816)
[02/28 18:45:34][INFO] train_vision.py:  668: Epoch: [25][300/367], lr: 2.16e-05, eta: 1:17:09	Time 3.006 (3.014)	Data 0.058 (0.059)	Mem 41.61GB	Prec@1 100.000 (85.615)	Loss 0.9663 (1.3785)
[02/28 18:46:04][INFO] train_vision.py:  668: Epoch: [25][310/367], lr: 2.14e-05, eta: 1:16:38	Time 3.006 (3.014)	Data 0.036 (0.059)	Mem 41.61GB	Prec@1 90.000 (85.723)	Loss 1.1363 (1.3755)
[02/28 18:46:34][INFO] train_vision.py:  668: Epoch: [25][320/367], lr: 2.11e-05, eta: 1:16:08	Time 3.004 (3.013)	Data 0.052 (0.058)	Mem 41.61GB	Prec@1 90.000 (85.794)	Loss 1.0820 (1.3718)
[02/28 18:47:04][INFO] train_vision.py:  668: Epoch: [25][330/367], lr: 2.09e-05, eta: 1:15:37	Time 2.985 (3.013)	Data 0.047 (0.058)	Mem 41.61GB	Prec@1 70.000 (85.650)	Loss 1.3976 (1.3759)
[02/28 18:47:34][INFO] train_vision.py:  668: Epoch: [25][340/367], lr: 2.07e-05, eta: 1:15:06	Time 3.013 (3.013)	Data 0.057 (0.058)	Mem 41.61GB	Prec@1 80.000 (85.689)	Loss 1.4887 (1.3750)
[02/28 18:48:04][INFO] train_vision.py:  668: Epoch: [25][350/367], lr: 2.04e-05, eta: 1:14:36	Time 3.005 (3.012)	Data 0.040 (0.057)	Mem 41.61GB	Prec@1 100.000 (85.670)	Loss 1.0222 (1.3777)
[02/28 18:48:34][INFO] train_vision.py:  668: Epoch: [25][360/367], lr: 2.02e-05, eta: 1:14:05	Time 2.997 (3.012)	Data 0.058 (0.057)	Mem 41.61GB	Prec@1 90.000 (85.789)	Loss 1.0766 (1.3737)
[02/28 18:48:58][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 96.250 (96.250)	Prec@5 100.000 (100.000)	mPrec@1 (10.926)	mPrec@5 (11.458)
[02/28 18:49:40][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 98.750 (95.682)	Prec@5 100.000 (100.000)	mPrec@1 (30.930)	mPrec@5 (32.639)
[02/28 18:50:22][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 98.750 (95.000)	Prec@5 100.000 (99.702)	mPrec@1 (41.654)	mPrec@5 (45.041)
[02/28 18:51:04][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 96.250 (95.081)	Prec@5 100.000 (99.677)	mPrec@1 (49.199)	mPrec@5 (54.188)
[02/28 18:51:46][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 93.750 (95.030)	Prec@5 98.750 (99.604)	mPrec@1 (54.944)	mPrec@5 (61.442)
[02/28 18:52:28][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 93.750 (94.412)	Prec@5 100.000 (99.412)	mPrec@1 (56.397)	mPrec@5 (64.462)
[02/28 18:53:10][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 96.250 (94.467)	Prec@5 100.000 (99.508)	mPrec@1 (58.397)	mPrec@5 (67.644)
[02/28 18:53:52][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 96.250 (94.701)	Prec@5 100.000 (99.507)	mPrec@1 (58.876)	mPrec@5 (69.268)
[02/28 18:54:34][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 96.250 (94.830)	Prec@5 100.000 (99.522)	mPrec@1 (62.005)	mPrec@5 (73.557)
[02/28 18:55:17][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 91.250 (94.615)	Prec@5 100.000 (99.533)	mPrec@1 (63.380)	mPrec@5 (77.328)
[02/28 18:55:59][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 96.250 (94.047)	Prec@5 100.000 (99.443)	mPrec@1 (67.321)	mPrec@5 (86.006)
[02/28 18:56:41][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 86.250 (94.054)	Prec@5 100.000 (99.471)	mPrec@1 (67.454)	mPrec@5 (86.707)
[02/28 18:57:21][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 66.667 (93.201)	Prec@5 83.333 (99.347)	mPrec@1 (69.955)	mPrec@5 (93.735)
[02/28 18:57:22][INFO] train_vision.py:  847: Overall Prec@1 93.201% Prec@5 99.347% mPrec@1 (69.955) mPrec@5 (93.735)
[02/28 18:57:22][INFO] train_vision.py:  464: Testing: 69.95521545410156/69.95521545410156
[02/28 18:57:22][INFO] train_vision.py:  465: Saving:
[02/28 18:57:42][INFO] train_vision.py:  668: Epoch: [26][0/367], lr: 2.00e-05, eta: 2:11:59	Time 5.391 (5.391)	Data 2.508 (2.508)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.7759 (1.7759)
[02/28 18:58:12][INFO] train_vision.py:  668: Epoch: [26][10/367], lr: 1.98e-05, eta: 1:18:12	Time 3.013 (3.216)	Data 0.044 (0.273)	Mem 41.61GB	Prec@1 100.000 (86.364)	Loss 0.9493 (1.3303)
[02/28 18:58:42][INFO] train_vision.py:  668: Epoch: [26][20/367], lr: 1.96e-05, eta: 1:15:24	Time 3.051 (3.122)	Data 0.025 (0.171)	Mem 41.61GB	Prec@1 100.000 (84.762)	Loss 1.1765 (1.4139)
[02/28 18:59:12][INFO] train_vision.py:  668: Epoch: [26][30/367], lr: 1.94e-05, eta: 1:14:05	Time 3.040 (3.089)	Data 0.060 (0.132)	Mem 41.61GB	Prec@1 90.000 (86.129)	Loss 1.2444 (1.3589)
[02/28 18:59:42][INFO] train_vision.py:  668: Epoch: [26][40/367], lr: 1.91e-05, eta: 1:13:11	Time 3.021 (3.073)	Data 0.054 (0.113)	Mem 41.61GB	Prec@1 80.000 (86.098)	Loss 1.5149 (1.3706)
[02/28 19:00:12][INFO] train_vision.py:  668: Epoch: [26][50/367], lr: 1.89e-05, eta: 1:12:24	Time 3.061 (3.062)	Data 0.020 (0.100)	Mem 41.61GB	Prec@1 50.000 (85.490)	Loss 1.8451 (1.3779)
[02/28 19:00:43][INFO] train_vision.py:  668: Epoch: [26][60/367], lr: 1.87e-05, eta: 1:11:46	Time 3.038 (3.056)	Data 0.063 (0.093)	Mem 41.61GB	Prec@1 90.000 (85.902)	Loss 1.1603 (1.3658)
[02/28 19:01:13][INFO] train_vision.py:  668: Epoch: [26][70/367], lr: 1.85e-05, eta: 1:11:07	Time 3.027 (3.050)	Data 0.042 (0.086)	Mem 41.61GB	Prec@1 80.000 (85.634)	Loss 1.4459 (1.3658)
[02/28 19:01:43][INFO] train_vision.py:  668: Epoch: [26][80/367], lr: 1.83e-05, eta: 1:10:29	Time 3.011 (3.045)	Data 0.067 (0.082)	Mem 41.61GB	Prec@1 90.000 (85.802)	Loss 1.4446 (1.3644)
[02/28 19:02:13][INFO] train_vision.py:  668: Epoch: [26][90/367], lr: 1.80e-05, eta: 1:09:55	Time 3.034 (3.042)	Data 0.022 (0.078)	Mem 41.61GB	Prec@1 80.000 (85.714)	Loss 1.3351 (1.3602)
[02/28 19:02:43][INFO] train_vision.py:  668: Epoch: [26][100/367], lr: 1.78e-05, eta: 1:09:22	Time 3.031 (3.041)	Data 0.056 (0.076)	Mem 41.61GB	Prec@1 80.000 (85.842)	Loss 1.5890 (1.3617)
[02/28 19:03:13][INFO] train_vision.py:  668: Epoch: [26][110/367], lr: 1.76e-05, eta: 1:08:48	Time 3.032 (3.038)	Data 0.021 (0.073)	Mem 41.61GB	Prec@1 100.000 (85.766)	Loss 0.9652 (1.3672)
[02/28 19:03:44][INFO] train_vision.py:  668: Epoch: [26][120/367], lr: 1.74e-05, eta: 1:08:15	Time 3.014 (3.036)	Data 0.047 (0.071)	Mem 41.61GB	Prec@1 100.000 (85.868)	Loss 0.9744 (1.3614)
[02/28 19:04:14][INFO] train_vision.py:  668: Epoch: [26][130/367], lr: 1.72e-05, eta: 1:07:43	Time 3.021 (3.035)	Data 0.049 (0.070)	Mem 41.61GB	Prec@1 90.000 (85.649)	Loss 1.3253 (1.3637)
[02/28 19:04:44][INFO] train_vision.py:  668: Epoch: [26][140/367], lr: 1.70e-05, eta: 1:07:11	Time 2.995 (3.034)	Data 0.051 (0.069)	Mem 41.61GB	Prec@1 80.000 (85.248)	Loss 1.2905 (1.3687)
[02/28 19:05:14][INFO] train_vision.py:  668: Epoch: [26][150/367], lr: 1.68e-05, eta: 1:06:39	Time 3.053 (3.033)	Data 0.048 (0.068)	Mem 41.61GB	Prec@1 90.000 (84.967)	Loss 1.1305 (1.3791)
[02/28 19:05:44][INFO] train_vision.py:  668: Epoch: [26][160/367], lr: 1.66e-05, eta: 1:06:07	Time 2.999 (3.031)	Data 0.044 (0.067)	Mem 41.61GB	Prec@1 80.000 (85.093)	Loss 1.5972 (1.3746)
[02/28 19:06:14][INFO] train_vision.py:  668: Epoch: [26][170/367], lr: 1.64e-05, eta: 1:05:35	Time 3.072 (3.030)	Data 0.023 (0.065)	Mem 41.61GB	Prec@1 80.000 (84.912)	Loss 1.1818 (1.3688)
[02/28 19:06:44][INFO] train_vision.py:  668: Epoch: [26][180/367], lr: 1.62e-05, eta: 1:05:03	Time 3.003 (3.029)	Data 0.054 (0.065)	Mem 41.61GB	Prec@1 70.000 (84.641)	Loss 1.4932 (1.3743)
[02/28 19:07:14][INFO] train_vision.py:  668: Epoch: [26][190/367], lr: 1.60e-05, eta: 1:04:31	Time 2.999 (3.027)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 70.000 (85.026)	Loss 1.5130 (1.3648)
[02/28 19:07:44][INFO] train_vision.py:  668: Epoch: [26][200/367], lr: 1.58e-05, eta: 1:04:00	Time 2.997 (3.026)	Data 0.050 (0.063)	Mem 41.61GB	Prec@1 70.000 (84.726)	Loss 1.4519 (1.3655)
[02/28 19:08:14][INFO] train_vision.py:  668: Epoch: [26][210/367], lr: 1.56e-05, eta: 1:03:28	Time 2.986 (3.025)	Data 0.046 (0.063)	Mem 41.61GB	Prec@1 80.000 (85.024)	Loss 1.4353 (1.3619)
[02/28 19:08:44][INFO] train_vision.py:  668: Epoch: [26][220/367], lr: 1.54e-05, eta: 1:02:56	Time 3.005 (3.024)	Data 0.063 (0.062)	Mem 41.61GB	Prec@1 70.000 (85.023)	Loss 1.5953 (1.3632)
[02/28 19:09:14][INFO] train_vision.py:  668: Epoch: [26][230/367], lr: 1.52e-05, eta: 1:02:25	Time 3.001 (3.023)	Data 0.053 (0.062)	Mem 41.61GB	Prec@1 90.000 (85.022)	Loss 1.1677 (1.3635)
[02/28 19:09:45][INFO] train_vision.py:  668: Epoch: [26][240/367], lr: 1.50e-05, eta: 1:01:54	Time 3.008 (3.022)	Data 0.046 (0.061)	Mem 41.61GB	Prec@1 80.000 (84.938)	Loss 1.5301 (1.3686)
[02/28 19:10:15][INFO] train_vision.py:  668: Epoch: [26][250/367], lr: 1.48e-05, eta: 1:01:23	Time 2.990 (3.021)	Data 0.067 (0.061)	Mem 41.61GB	Prec@1 90.000 (85.179)	Loss 1.2707 (1.3621)
[02/28 19:10:45][INFO] train_vision.py:  668: Epoch: [26][260/367], lr: 1.46e-05, eta: 1:00:52	Time 3.006 (3.021)	Data 0.063 (0.061)	Mem 41.61GB	Prec@1 100.000 (85.172)	Loss 1.0660 (1.3601)
[02/28 19:11:15][INFO] train_vision.py:  668: Epoch: [26][270/367], lr: 1.44e-05, eta: 1:00:21	Time 3.009 (3.021)	Data 0.050 (0.061)	Mem 41.61GB	Prec@1 90.000 (85.424)	Loss 1.1492 (1.3542)
[02/28 19:11:45][INFO] train_vision.py:  668: Epoch: [26][280/367], lr: 1.42e-05, eta: 0:59:51	Time 2.980 (3.020)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 70.000 (85.409)	Loss 1.9546 (1.3564)
[02/28 19:12:15][INFO] train_vision.py:  668: Epoch: [26][290/367], lr: 1.40e-05, eta: 0:59:20	Time 3.015 (3.020)	Data 0.052 (0.060)	Mem 41.61GB	Prec@1 100.000 (85.567)	Loss 1.0118 (1.3520)
[02/28 19:12:45][INFO] train_vision.py:  668: Epoch: [26][300/367], lr: 1.39e-05, eta: 0:58:49	Time 3.035 (3.020)	Data 0.041 (0.060)	Mem 41.61GB	Prec@1 90.000 (85.581)	Loss 1.4913 (1.3528)
[02/28 19:13:15][INFO] train_vision.py:  668: Epoch: [26][310/367], lr: 1.37e-05, eta: 0:58:19	Time 3.017 (3.019)	Data 0.057 (0.060)	Mem 41.61GB	Prec@1 90.000 (85.595)	Loss 1.1611 (1.3524)
[02/28 19:13:45][INFO] train_vision.py:  668: Epoch: [26][320/367], lr: 1.35e-05, eta: 0:57:48	Time 3.009 (3.019)	Data 0.054 (0.060)	Mem 41.61GB	Prec@1 70.000 (85.545)	Loss 1.9767 (1.3545)
[02/28 19:14:15][INFO] train_vision.py:  668: Epoch: [26][330/367], lr: 1.33e-05, eta: 0:57:18	Time 3.026 (3.019)	Data 0.051 (0.059)	Mem 41.61GB	Prec@1 90.000 (85.529)	Loss 1.2405 (1.3544)
[02/28 19:14:45][INFO] train_vision.py:  668: Epoch: [26][340/367], lr: 1.31e-05, eta: 0:56:47	Time 2.997 (3.018)	Data 0.066 (0.059)	Mem 41.61GB	Prec@1 90.000 (85.601)	Loss 1.6114 (1.3538)
[02/28 19:15:15][INFO] train_vision.py:  668: Epoch: [26][350/367], lr: 1.30e-05, eta: 0:56:16	Time 3.002 (3.018)	Data 0.061 (0.059)	Mem 41.61GB	Prec@1 90.000 (85.755)	Loss 1.1335 (1.3484)
[02/28 19:15:45][INFO] train_vision.py:  668: Epoch: [26][360/367], lr: 1.28e-05, eta: 0:55:46	Time 2.999 (3.017)	Data 0.023 (0.059)	Mem 41.61GB	Prec@1 100.000 (85.734)	Loss 0.9502 (1.3476)
[02/28 19:16:09][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 96.250 (96.250)	Prec@5 100.000 (100.000)	mPrec@1 (10.926)	mPrec@5 (11.458)
[02/28 19:16:51][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 97.500 (95.682)	Prec@5 100.000 (100.000)	mPrec@1 (30.893)	mPrec@5 (32.639)
[02/28 19:17:34][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 100.000 (94.940)	Prec@5 100.000 (99.762)	mPrec@1 (41.426)	mPrec@5 (45.066)
[02/28 19:18:16][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 95.000 (95.040)	Prec@5 100.000 (99.718)	mPrec@1 (48.447)	mPrec@5 (54.212)
[02/28 19:18:58][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 93.750 (95.061)	Prec@5 98.750 (99.665)	mPrec@1 (54.217)	mPrec@5 (61.630)
[02/28 19:19:40][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 92.500 (94.461)	Prec@5 100.000 (99.485)	mPrec@1 (55.677)	mPrec@5 (64.720)
[02/28 19:20:22][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 96.250 (94.570)	Prec@5 100.000 (99.570)	mPrec@1 (58.024)	mPrec@5 (67.901)
[02/28 19:21:04][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 96.250 (94.771)	Prec@5 100.000 (99.577)	mPrec@1 (58.346)	mPrec@5 (69.526)
[02/28 19:21:46][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 97.500 (94.877)	Prec@5 100.000 (99.568)	mPrec@1 (61.120)	mPrec@5 (73.786)
[02/28 19:22:29][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 95.000 (94.698)	Prec@5 100.000 (99.588)	mPrec@1 (63.396)	mPrec@5 (77.729)
[02/28 19:23:11][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 96.250 (94.158)	Prec@5 100.000 (99.517)	mPrec@1 (67.479)	mPrec@5 (86.841)
[02/28 19:23:53][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 87.500 (94.144)	Prec@5 100.000 (99.538)	mPrec@1 (67.829)	mPrec@5 (87.477)
[02/28 19:24:33][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 66.667 (93.284)	Prec@5 85.417 (99.430)	mPrec@1 (69.984)	mPrec@5 (94.976)
[02/28 19:24:34][INFO] train_vision.py:  847: Overall Prec@1 93.284% Prec@5 99.430% mPrec@1 (69.984) mPrec@5 (94.976)
[02/28 19:24:34][INFO] train_vision.py:  464: Testing: 69.98391723632812/69.98391723632812
[02/28 19:24:34][INFO] train_vision.py:  465: Saving:
[02/28 19:24:53][INFO] train_vision.py:  668: Epoch: [27][0/367], lr: 1.27e-05, eta: 1:35:14	Time 5.186 (5.186)	Data 2.309 (2.309)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.1745 (1.1745)
[02/28 19:25:23][INFO] train_vision.py:  668: Epoch: [27][10/367], lr: 1.25e-05, eta: 0:58:11	Time 3.035 (3.197)	Data 0.082 (0.265)	Mem 41.61GB	Prec@1 80.000 (84.545)	Loss 1.2567 (1.3418)
[02/28 19:25:53][INFO] train_vision.py:  668: Epoch: [27][20/367], lr: 1.23e-05, eta: 0:55:57	Time 3.015 (3.103)	Data 0.062 (0.168)	Mem 41.61GB	Prec@1 80.000 (86.190)	Loss 1.3965 (1.2930)
[02/28 19:26:23][INFO] train_vision.py:  668: Epoch: [27][30/367], lr: 1.22e-05, eta: 0:54:53	Time 3.027 (3.072)	Data 0.087 (0.134)	Mem 41.61GB	Prec@1 60.000 (85.806)	Loss 2.0355 (1.3037)
[02/28 19:26:53][INFO] train_vision.py:  668: Epoch: [27][40/367], lr: 1.20e-05, eta: 0:54:08	Time 3.023 (3.059)	Data 0.023 (0.117)	Mem 41.61GB	Prec@1 90.000 (86.829)	Loss 1.1162 (1.2905)
[02/28 19:27:23][INFO] train_vision.py:  668: Epoch: [27][50/367], lr: 1.18e-05, eta: 0:53:28	Time 3.007 (3.050)	Data 0.054 (0.107)	Mem 41.61GB	Prec@1 100.000 (87.059)	Loss 1.0427 (1.2971)
[02/28 19:27:54][INFO] train_vision.py:  668: Epoch: [27][60/367], lr: 1.17e-05, eta: 0:52:52	Time 2.967 (3.045)	Data 0.061 (0.099)	Mem 41.61GB	Prec@1 90.000 (86.885)	Loss 1.1484 (1.3015)
[02/28 19:28:24][INFO] train_vision.py:  668: Epoch: [27][70/367], lr: 1.15e-05, eta: 0:52:17	Time 2.980 (3.041)	Data 0.053 (0.092)	Mem 41.61GB	Prec@1 80.000 (86.901)	Loss 1.2625 (1.3064)
[02/28 19:28:54][INFO] train_vision.py:  668: Epoch: [27][80/367], lr: 1.13e-05, eta: 0:51:43	Time 3.025 (3.036)	Data 0.025 (0.087)	Mem 41.61GB	Prec@1 80.000 (86.173)	Loss 1.2544 (1.3287)
[02/28 19:29:24][INFO] train_vision.py:  668: Epoch: [27][90/367], lr: 1.12e-05, eta: 0:51:09	Time 2.992 (3.033)	Data 0.056 (0.083)	Mem 41.61GB	Prec@1 90.000 (86.154)	Loss 1.2029 (1.3373)
[02/28 19:29:54][INFO] train_vision.py:  668: Epoch: [27][100/367], lr: 1.10e-05, eta: 0:50:36	Time 2.972 (3.030)	Data 0.059 (0.080)	Mem 41.61GB	Prec@1 80.000 (86.139)	Loss 1.4918 (1.3408)
[02/28 19:30:24][INFO] train_vision.py:  668: Epoch: [27][110/367], lr: 1.09e-05, eta: 0:50:04	Time 3.007 (3.029)	Data 0.082 (0.079)	Mem 41.61GB	Prec@1 80.000 (86.667)	Loss 1.6229 (1.3334)
[02/28 19:30:54][INFO] train_vision.py:  668: Epoch: [27][120/367], lr: 1.07e-05, eta: 0:49:33	Time 2.994 (3.028)	Data 0.055 (0.077)	Mem 41.61GB	Prec@1 70.000 (86.364)	Loss 1.8738 (1.3437)
[02/28 19:31:24][INFO] train_vision.py:  668: Epoch: [27][130/367], lr: 1.05e-05, eta: 0:49:01	Time 3.000 (3.026)	Data 0.047 (0.075)	Mem 41.61GB	Prec@1 90.000 (86.565)	Loss 1.2720 (1.3388)
[02/28 19:31:55][INFO] train_vision.py:  668: Epoch: [27][140/367], lr: 1.04e-05, eta: 0:48:31	Time 3.033 (3.026)	Data 0.069 (0.075)	Mem 41.61GB	Prec@1 70.000 (86.383)	Loss 1.3728 (1.3379)
[02/28 19:32:25][INFO] train_vision.py:  668: Epoch: [27][150/367], lr: 1.02e-05, eta: 0:48:00	Time 3.036 (3.026)	Data 0.098 (0.074)	Mem 41.61GB	Prec@1 100.000 (86.490)	Loss 1.1916 (1.3380)
[02/28 19:32:55][INFO] train_vision.py:  668: Epoch: [27][160/367], lr: 1.01e-05, eta: 0:47:29	Time 2.964 (3.025)	Data 0.047 (0.073)	Mem 41.61GB	Prec@1 80.000 (86.708)	Loss 1.1470 (1.3280)
[02/28 19:33:25][INFO] train_vision.py:  668: Epoch: [27][170/367], lr: 9.94e-06, eta: 0:46:57	Time 2.989 (3.023)	Data 0.039 (0.071)	Mem 41.61GB	Prec@1 90.000 (86.550)	Loss 1.6607 (1.3344)
[02/28 19:33:55][INFO] train_vision.py:  668: Epoch: [27][180/367], lr: 9.79e-06, eta: 0:46:25	Time 2.977 (3.022)	Data 0.058 (0.070)	Mem 41.61GB	Prec@1 70.000 (86.354)	Loss 1.7131 (1.3360)
[02/28 19:34:25][INFO] train_vision.py:  668: Epoch: [27][190/367], lr: 9.64e-06, eta: 0:45:54	Time 3.025 (3.021)	Data 0.084 (0.069)	Mem 41.61GB	Prec@1 90.000 (86.230)	Loss 1.4773 (1.3449)
[02/28 19:34:55][INFO] train_vision.py:  668: Epoch: [27][200/367], lr: 9.50e-06, eta: 0:45:24	Time 3.010 (3.020)	Data 0.073 (0.069)	Mem 41.61GB	Prec@1 100.000 (86.418)	Loss 0.9740 (1.3405)
[02/28 19:35:25][INFO] train_vision.py:  668: Epoch: [27][210/367], lr: 9.36e-06, eta: 0:44:53	Time 3.025 (3.020)	Data 0.081 (0.069)	Mem 41.61GB	Prec@1 60.000 (85.972)	Loss 2.0716 (1.3472)
[02/28 19:35:55][INFO] train_vision.py:  668: Epoch: [27][220/367], lr: 9.22e-06, eta: 0:44:22	Time 2.984 (3.019)	Data 0.050 (0.068)	Mem 41.61GB	Prec@1 90.000 (86.063)	Loss 1.2217 (1.3460)
[02/28 19:36:25][INFO] train_vision.py:  668: Epoch: [27][230/367], lr: 9.08e-06, eta: 0:43:52	Time 3.045 (3.019)	Data 0.061 (0.067)	Mem 41.61GB	Prec@1 80.000 (85.801)	Loss 1.5356 (1.3539)
[02/28 19:36:55][INFO] train_vision.py:  668: Epoch: [27][240/367], lr: 8.94e-06, eta: 0:43:21	Time 3.011 (3.018)	Data 0.079 (0.067)	Mem 41.61GB	Prec@1 80.000 (85.685)	Loss 1.4941 (1.3570)
[02/28 19:37:25][INFO] train_vision.py:  668: Epoch: [27][250/367], lr: 8.80e-06, eta: 0:42:50	Time 2.995 (3.018)	Data 0.054 (0.066)	Mem 41.61GB	Prec@1 90.000 (85.697)	Loss 1.3408 (1.3578)
[02/28 19:37:55][INFO] train_vision.py:  668: Epoch: [27][260/367], lr: 8.67e-06, eta: 0:42:20	Time 2.993 (3.017)	Data 0.060 (0.066)	Mem 41.61GB	Prec@1 90.000 (85.709)	Loss 1.0865 (1.3576)
[02/28 19:38:25][INFO] train_vision.py:  668: Epoch: [27][270/367], lr: 8.54e-06, eta: 0:41:49	Time 3.007 (3.016)	Data 0.058 (0.065)	Mem 41.61GB	Prec@1 100.000 (85.793)	Loss 0.9354 (1.3568)
[02/28 19:38:55][INFO] train_vision.py:  668: Epoch: [27][280/367], lr: 8.41e-06, eta: 0:41:18	Time 2.999 (3.015)	Data 0.068 (0.065)	Mem 41.61GB	Prec@1 90.000 (86.014)	Loss 1.2737 (1.3542)
[02/28 19:39:25][INFO] train_vision.py:  668: Epoch: [27][290/367], lr: 8.28e-06, eta: 0:40:47	Time 2.987 (3.015)	Data 0.053 (0.064)	Mem 41.61GB	Prec@1 80.000 (85.876)	Loss 1.4629 (1.3615)
[02/28 19:39:55][INFO] train_vision.py:  668: Epoch: [27][300/367], lr: 8.15e-06, eta: 0:40:17	Time 2.987 (3.014)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 90.000 (85.880)	Loss 1.2388 (1.3623)
[02/28 19:40:25][INFO] train_vision.py:  668: Epoch: [27][310/367], lr: 8.02e-06, eta: 0:39:46	Time 2.998 (3.013)	Data 0.060 (0.064)	Mem 41.61GB	Prec@1 100.000 (85.884)	Loss 1.0420 (1.3603)
[02/28 19:40:55][INFO] train_vision.py:  668: Epoch: [27][320/367], lr: 7.89e-06, eta: 0:39:15	Time 2.982 (3.013)	Data 0.063 (0.063)	Mem 41.61GB	Prec@1 100.000 (85.919)	Loss 1.1308 (1.3601)
[02/28 19:41:25][INFO] train_vision.py:  668: Epoch: [27][330/367], lr: 7.77e-06, eta: 0:38:45	Time 2.997 (3.012)	Data 0.055 (0.063)	Mem 41.61GB	Prec@1 70.000 (85.770)	Loss 1.6380 (1.3627)
[02/28 19:41:55][INFO] train_vision.py:  668: Epoch: [27][340/367], lr: 7.65e-06, eta: 0:38:14	Time 3.002 (3.012)	Data 0.066 (0.063)	Mem 41.61GB	Prec@1 90.000 (85.865)	Loss 1.5151 (1.3591)
[02/28 19:42:25][INFO] train_vision.py:  668: Epoch: [27][350/367], lr: 7.53e-06, eta: 0:37:44	Time 2.984 (3.011)	Data 0.055 (0.063)	Mem 41.61GB	Prec@1 90.000 (85.783)	Loss 1.1973 (1.3585)
[02/28 19:42:55][INFO] train_vision.py:  668: Epoch: [27][360/367], lr: 7.41e-06, eta: 0:37:14	Time 2.998 (3.011)	Data 0.063 (0.062)	Mem 41.61GB	Prec@1 90.000 (85.734)	Loss 1.0528 (1.3595)
[02/28 19:43:19][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 96.250 (96.250)	Prec@5 100.000 (100.000)	mPrec@1 (10.926)	mPrec@5 (11.458)
[02/28 19:44:01][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 97.500 (96.136)	Prec@5 100.000 (100.000)	mPrec@1 (30.994)	mPrec@5 (32.639)
[02/28 19:44:43][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 100.000 (95.119)	Prec@5 100.000 (99.821)	mPrec@1 (41.350)	mPrec@5 (45.413)
[02/28 19:45:26][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 96.250 (95.202)	Prec@5 100.000 (99.758)	mPrec@1 (48.732)	mPrec@5 (54.560)
[02/28 19:46:08][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 95.000 (95.274)	Prec@5 98.750 (99.726)	mPrec@1 (54.626)	mPrec@5 (61.819)
[02/28 19:46:50][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 92.500 (94.583)	Prec@5 100.000 (99.534)	mPrec@1 (56.067)	mPrec@5 (64.943)
[02/28 19:47:32][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 96.250 (94.672)	Prec@5 100.000 (99.570)	mPrec@1 (58.352)	mPrec@5 (68.001)
[02/28 19:48:14][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 97.500 (94.930)	Prec@5 100.000 (99.560)	mPrec@1 (58.913)	mPrec@5 (69.616)
[02/28 19:48:56][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 97.500 (95.062)	Prec@5 100.000 (99.568)	mPrec@1 (62.034)	mPrec@5 (73.891)
[02/28 19:49:39][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 91.250 (94.918)	Prec@5 100.000 (99.588)	mPrec@1 (63.344)	mPrec@5 (77.830)
[02/28 19:50:21][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 96.250 (94.468)	Prec@5 100.000 (99.517)	mPrec@1 (68.416)	mPrec@5 (87.056)
[02/28 19:51:03][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 90.000 (94.493)	Prec@5 100.000 (99.527)	mPrec@1 (68.679)	mPrec@5 (87.574)
[02/28 19:51:43][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 66.667 (93.677)	Prec@5 85.417 (99.409)	mPrec@1 (70.704)	mPrec@5 (94.832)
[02/28 19:51:44][INFO] train_vision.py:  847: Overall Prec@1 93.677% Prec@5 99.409% mPrec@1 (70.704) mPrec@5 (94.832)
[02/28 19:51:44][INFO] train_vision.py:  464: Testing: 70.70352172851562/70.70352172851562
[02/28 19:51:44][INFO] train_vision.py:  465: Saving:
[02/28 19:52:03][INFO] train_vision.py:  668: Epoch: [28][0/367], lr: 7.32e-06, eta: 1:03:11	Time 5.158 (5.158)	Data 2.281 (2.281)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.2640 (1.2640)
[02/28 19:52:33][INFO] train_vision.py:  668: Epoch: [28][10/367], lr: 7.21e-06, eta: 0:38:30	Time 3.003 (3.188)	Data 0.048 (0.257)	Mem 41.61GB	Prec@1 100.000 (92.727)	Loss 1.0110 (1.1642)
[02/28 19:53:03][INFO] train_vision.py:  668: Epoch: [28][20/367], lr: 7.10e-06, eta: 0:36:56	Time 3.045 (3.100)	Data 0.076 (0.165)	Mem 41.61GB	Prec@1 100.000 (90.000)	Loss 1.0262 (1.2195)
[02/28 19:53:33][INFO] train_vision.py:  668: Epoch: [28][30/367], lr: 6.98e-06, eta: 0:36:04	Time 2.989 (3.070)	Data 0.062 (0.133)	Mem 41.61GB	Prec@1 100.000 (89.032)	Loss 1.0115 (1.2526)
[02/28 19:54:03][INFO] train_vision.py:  668: Epoch: [28][40/367], lr: 6.87e-06, eta: 0:35:22	Time 3.049 (3.054)	Data 0.079 (0.113)	Mem 41.61GB	Prec@1 80.000 (88.293)	Loss 1.3961 (1.2677)
[02/28 19:54:33][INFO] train_vision.py:  668: Epoch: [28][50/367], lr: 6.76e-06, eta: 0:34:46	Time 3.031 (3.046)	Data 0.079 (0.104)	Mem 41.61GB	Prec@1 100.000 (88.824)	Loss 1.0083 (1.2660)
[02/28 19:55:03][INFO] train_vision.py:  668: Epoch: [28][60/367], lr: 6.65e-06, eta: 0:34:13	Time 3.043 (3.042)	Data 0.077 (0.097)	Mem 41.61GB	Prec@1 80.000 (88.197)	Loss 1.2880 (1.2707)
[02/28 19:55:33][INFO] train_vision.py:  668: Epoch: [28][70/367], lr: 6.55e-06, eta: 0:33:39	Time 2.993 (3.037)	Data 0.070 (0.092)	Mem 41.61GB	Prec@1 100.000 (88.310)	Loss 1.0141 (1.2677)
[02/28 19:56:04][INFO] train_vision.py:  668: Epoch: [28][80/367], lr: 6.44e-06, eta: 0:33:07	Time 3.038 (3.034)	Data 0.048 (0.089)	Mem 41.61GB	Prec@1 100.000 (88.395)	Loss 1.0266 (1.2709)
[02/28 19:56:34][INFO] train_vision.py:  668: Epoch: [28][90/367], lr: 6.34e-06, eta: 0:32:35	Time 3.008 (3.031)	Data 0.060 (0.085)	Mem 41.61GB	Prec@1 90.000 (87.912)	Loss 1.2676 (1.2786)
[02/28 19:57:04][INFO] train_vision.py:  668: Epoch: [28][100/367], lr: 6.23e-06, eta: 0:32:03	Time 3.003 (3.029)	Data 0.043 (0.082)	Mem 41.61GB	Prec@1 70.000 (87.921)	Loss 1.8511 (1.2826)
[02/28 19:57:34][INFO] train_vision.py:  668: Epoch: [28][110/367], lr: 6.13e-06, eta: 0:31:31	Time 2.979 (3.026)	Data 0.060 (0.079)	Mem 41.61GB	Prec@1 70.000 (88.018)	Loss 1.6559 (1.2820)
[02/28 19:58:04][INFO] train_vision.py:  668: Epoch: [28][120/367], lr: 6.03e-06, eta: 0:31:00	Time 3.003 (3.025)	Data 0.048 (0.077)	Mem 41.61GB	Prec@1 80.000 (88.182)	Loss 1.8258 (1.2836)
[02/28 19:58:34][INFO] train_vision.py:  668: Epoch: [28][130/367], lr: 5.94e-06, eta: 0:30:28	Time 2.999 (3.023)	Data 0.055 (0.075)	Mem 41.61GB	Prec@1 70.000 (88.015)	Loss 1.9354 (1.2901)
[02/28 19:59:04][INFO] train_vision.py:  668: Epoch: [28][140/367], lr: 5.84e-06, eta: 0:29:57	Time 3.010 (3.021)	Data 0.055 (0.074)	Mem 41.61GB	Prec@1 90.000 (87.872)	Loss 1.1690 (1.3011)
[02/28 19:59:34][INFO] train_vision.py:  668: Epoch: [28][150/367], lr: 5.75e-06, eta: 0:29:26	Time 3.003 (3.020)	Data 0.051 (0.073)	Mem 41.61GB	Prec@1 100.000 (87.881)	Loss 1.1320 (1.3038)
[02/28 20:00:04][INFO] train_vision.py:  668: Epoch: [28][160/367], lr: 5.65e-06, eta: 0:28:55	Time 3.013 (3.019)	Data 0.022 (0.071)	Mem 41.61GB	Prec@1 70.000 (87.826)	Loss 1.3539 (1.3002)
[02/28 20:00:34][INFO] train_vision.py:  668: Epoch: [28][170/367], lr: 5.56e-06, eta: 0:28:24	Time 2.997 (3.017)	Data 0.055 (0.070)	Mem 41.61GB	Prec@1 80.000 (87.076)	Loss 1.5874 (1.3186)
[02/28 20:01:04][INFO] train_vision.py:  668: Epoch: [28][180/367], lr: 5.47e-06, eta: 0:27:54	Time 2.998 (3.016)	Data 0.050 (0.069)	Mem 41.61GB	Prec@1 50.000 (86.740)	Loss 1.7150 (1.3208)
[02/28 20:01:34][INFO] train_vision.py:  668: Epoch: [28][190/367], lr: 5.38e-06, eta: 0:27:23	Time 2.981 (3.015)	Data 0.058 (0.068)	Mem 41.61GB	Prec@1 90.000 (86.859)	Loss 1.3061 (1.3147)
[02/28 20:02:04][INFO] train_vision.py:  668: Epoch: [28][200/367], lr: 5.30e-06, eta: 0:26:52	Time 2.999 (3.014)	Data 0.048 (0.067)	Mem 41.61GB	Prec@1 70.000 (86.468)	Loss 1.8551 (1.3242)
[02/28 20:02:34][INFO] train_vision.py:  668: Epoch: [28][210/367], lr: 5.21e-06, eta: 0:26:22	Time 3.007 (3.014)	Data 0.062 (0.067)	Mem 41.61GB	Prec@1 70.000 (86.493)	Loss 1.8093 (1.3224)
[02/28 20:03:04][INFO] train_vision.py:  668: Epoch: [28][220/367], lr: 5.13e-06, eta: 0:25:51	Time 2.991 (3.013)	Data 0.048 (0.066)	Mem 41.61GB	Prec@1 90.000 (86.833)	Loss 1.2508 (1.3139)
[02/28 20:03:34][INFO] train_vision.py:  668: Epoch: [28][230/367], lr: 5.05e-06, eta: 0:25:20	Time 2.994 (3.012)	Data 0.051 (0.065)	Mem 41.61GB	Prec@1 100.000 (86.753)	Loss 1.2580 (1.3190)
[02/28 20:04:03][INFO] train_vision.py:  668: Epoch: [28][240/367], lr: 4.97e-06, eta: 0:24:50	Time 2.992 (3.011)	Data 0.041 (0.064)	Mem 41.61GB	Prec@1 90.000 (86.722)	Loss 1.3189 (1.3186)
[02/28 20:04:33][INFO] train_vision.py:  668: Epoch: [28][250/367], lr: 4.89e-06, eta: 0:24:20	Time 2.995 (3.011)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 80.000 (86.773)	Loss 1.1918 (1.3164)
[02/28 20:05:03][INFO] train_vision.py:  668: Epoch: [28][260/367], lr: 4.81e-06, eta: 0:23:49	Time 2.986 (3.010)	Data 0.040 (0.063)	Mem 41.61GB	Prec@1 90.000 (86.743)	Loss 1.3688 (1.3163)
[02/28 20:05:33][INFO] train_vision.py:  668: Epoch: [28][270/367], lr: 4.74e-06, eta: 0:23:19	Time 2.996 (3.009)	Data 0.052 (0.063)	Mem 41.61GB	Prec@1 90.000 (86.974)	Loss 1.2349 (1.3137)
[02/28 20:06:03][INFO] train_vision.py:  668: Epoch: [28][280/367], lr: 4.66e-06, eta: 0:22:49	Time 2.996 (3.009)	Data 0.048 (0.062)	Mem 41.61GB	Prec@1 70.000 (86.904)	Loss 1.7053 (1.3148)
[02/28 20:06:33][INFO] train_vision.py:  668: Epoch: [28][290/367], lr: 4.59e-06, eta: 0:22:18	Time 3.004 (3.009)	Data 0.060 (0.062)	Mem 41.61GB	Prec@1 90.000 (86.907)	Loss 1.0510 (1.3138)
[02/28 20:07:03][INFO] train_vision.py:  668: Epoch: [28][300/367], lr: 4.52e-06, eta: 0:21:48	Time 2.977 (3.008)	Data 0.049 (0.062)	Mem 41.61GB	Prec@1 80.000 (86.811)	Loss 1.6752 (1.3152)
[02/28 20:07:33][INFO] train_vision.py:  668: Epoch: [28][310/367], lr: 4.45e-06, eta: 0:21:18	Time 3.003 (3.008)	Data 0.060 (0.061)	Mem 41.61GB	Prec@1 80.000 (86.977)	Loss 2.0235 (1.3135)
[02/28 20:08:03][INFO] train_vision.py:  668: Epoch: [28][320/367], lr: 4.38e-06, eta: 0:20:48	Time 2.991 (3.007)	Data 0.055 (0.061)	Mem 41.61GB	Prec@1 90.000 (87.040)	Loss 1.1494 (1.3085)
[02/28 20:08:33][INFO] train_vision.py:  668: Epoch: [28][330/367], lr: 4.32e-06, eta: 0:20:17	Time 3.027 (3.007)	Data 0.029 (0.061)	Mem 41.61GB	Prec@1 60.000 (86.888)	Loss 1.7368 (1.3092)
[02/28 20:09:03][INFO] train_vision.py:  668: Epoch: [28][340/367], lr: 4.25e-06, eta: 0:19:47	Time 2.984 (3.007)	Data 0.051 (0.061)	Mem 41.61GB	Prec@1 80.000 (86.774)	Loss 1.5908 (1.3113)
[02/28 20:09:33][INFO] train_vision.py:  668: Epoch: [28][350/367], lr: 4.19e-06, eta: 0:19:17	Time 2.994 (3.006)	Data 0.056 (0.060)	Mem 41.61GB	Prec@1 90.000 (86.752)	Loss 1.5450 (1.3132)
[02/28 20:10:03][INFO] train_vision.py:  668: Epoch: [28][360/367], lr: 4.13e-06, eta: 0:18:47	Time 2.983 (3.006)	Data 0.050 (0.060)	Mem 41.61GB	Prec@1 100.000 (86.731)	Loss 0.9596 (1.3125)
[02/28 20:10:27][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 96.250 (96.250)	Prec@5 100.000 (100.000)	mPrec@1 (10.926)	mPrec@5 (11.458)
[02/28 20:11:09][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 97.500 (96.477)	Prec@5 100.000 (100.000)	mPrec@1 (31.055)	mPrec@5 (32.639)
[02/28 20:11:51][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 100.000 (95.238)	Prec@5 100.000 (99.762)	mPrec@1 (41.510)	mPrec@5 (45.000)
[02/28 20:12:33][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 95.000 (95.202)	Prec@5 100.000 (99.718)	mPrec@1 (48.607)	mPrec@5 (54.180)
[02/28 20:13:16][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 93.750 (95.213)	Prec@5 98.750 (99.665)	mPrec@1 (54.709)	mPrec@5 (61.445)
[02/28 20:13:58][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 93.750 (94.436)	Prec@5 100.000 (99.485)	mPrec@1 (56.281)	mPrec@5 (64.573)
[02/28 20:14:40][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 96.250 (94.549)	Prec@5 100.000 (99.570)	mPrec@1 (58.406)	mPrec@5 (67.747)
[02/28 20:15:22][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 97.500 (94.824)	Prec@5 100.000 (99.595)	mPrec@1 (58.933)	mPrec@5 (69.389)
[02/28 20:16:04][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 98.750 (94.969)	Prec@5 100.000 (99.583)	mPrec@1 (62.043)	mPrec@5 (73.648)
[02/28 20:16:46][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 91.250 (94.766)	Prec@5 100.000 (99.588)	mPrec@1 (63.626)	mPrec@5 (77.417)
[02/28 20:17:28][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 96.250 (94.307)	Prec@5 100.000 (99.530)	mPrec@1 (68.518)	mPrec@5 (86.701)
[02/28 20:18:11][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 87.500 (94.336)	Prec@5 100.000 (99.550)	mPrec@1 (68.453)	mPrec@5 (87.394)
[02/28 20:18:51][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 66.667 (93.522)	Prec@5 87.500 (99.440)	mPrec@1 (70.497)	mPrec@5 (94.833)
[02/28 20:18:51][INFO] train_vision.py:  847: Overall Prec@1 93.522% Prec@5 99.440% mPrec@1 (70.497) mPrec@5 (94.833)
[02/28 20:18:51][INFO] train_vision.py:  464: Testing: 70.49739837646484/70.70352172851562
[02/28 20:18:51][INFO] train_vision.py:  465: Saving:
[02/28 20:19:04][INFO] train_vision.py:  668: Epoch: [29][0/367], lr: 4.08e-06, eta: 0:32:04	Time 5.229 (5.229)	Data 2.337 (2.337)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.0628 (1.0628)
[02/28 20:19:34][INFO] train_vision.py:  668: Epoch: [29][10/367], lr: 4.03e-06, eta: 0:19:08	Time 2.998 (3.209)	Data 0.064 (0.269)	Mem 41.61GB	Prec@1 90.000 (87.273)	Loss 1.6220 (1.3250)
[02/28 20:20:04][INFO] train_vision.py:  668: Epoch: [29][20/367], lr: 3.97e-06, eta: 0:18:00	Time 2.963 (3.104)	Data 0.049 (0.162)	Mem 41.61GB	Prec@1 80.000 (87.143)	Loss 1.3412 (1.3089)
[02/28 20:20:34][INFO] train_vision.py:  668: Epoch: [29][30/367], lr: 3.92e-06, eta: 0:17:16	Time 2.993 (3.067)	Data 0.040 (0.123)	Mem 41.61GB	Prec@1 80.000 (87.097)	Loss 1.5052 (1.3249)
[02/28 20:21:04][INFO] train_vision.py:  668: Epoch: [29][40/367], lr: 3.87e-06, eta: 0:16:39	Time 3.013 (3.049)	Data 0.045 (0.104)	Mem 41.61GB	Prec@1 80.000 (87.073)	Loss 1.6114 (1.3243)
[02/28 20:21:33][INFO] train_vision.py:  668: Epoch: [29][50/367], lr: 3.81e-06, eta: 0:16:05	Time 2.994 (3.036)	Data 0.045 (0.092)	Mem 41.61GB	Prec@1 90.000 (87.843)	Loss 1.2146 (1.3007)
[02/28 20:22:03][INFO] train_vision.py:  668: Epoch: [29][60/367], lr: 3.76e-06, eta: 0:15:32	Time 2.983 (3.029)	Data 0.047 (0.084)	Mem 41.61GB	Prec@1 90.000 (87.213)	Loss 1.3651 (1.3220)
[02/28 20:22:33][INFO] train_vision.py:  668: Epoch: [29][70/367], lr: 3.71e-06, eta: 0:15:01	Time 3.016 (3.024)	Data 0.023 (0.078)	Mem 41.61GB	Prec@1 90.000 (87.183)	Loss 1.0336 (1.3225)
[02/28 20:23:03][INFO] train_vision.py:  668: Epoch: [29][80/367], lr: 3.67e-06, eta: 0:14:29	Time 2.990 (3.019)	Data 0.042 (0.073)	Mem 41.61GB	Prec@1 100.000 (87.531)	Loss 1.1302 (1.3128)
[02/28 20:23:33][INFO] train_vision.py:  668: Epoch: [29][90/367], lr: 3.62e-06, eta: 0:13:58	Time 2.999 (3.016)	Data 0.039 (0.070)	Mem 41.61GB	Prec@1 80.000 (87.143)	Loss 1.4496 (1.3168)
[02/28 20:24:03][INFO] train_vision.py:  668: Epoch: [29][100/367], lr: 3.58e-06, eta: 0:13:27	Time 2.998 (3.014)	Data 0.049 (0.068)	Mem 41.61GB	Prec@1 80.000 (86.832)	Loss 1.3101 (1.3256)
[02/28 20:24:33][INFO] train_vision.py:  668: Epoch: [29][110/367], lr: 3.54e-06, eta: 0:12:57	Time 2.998 (3.012)	Data 0.049 (0.066)	Mem 41.61GB	Prec@1 60.000 (86.847)	Loss 1.6460 (1.3203)
[02/28 20:25:03][INFO] train_vision.py:  668: Epoch: [29][120/367], lr: 3.49e-06, eta: 0:12:26	Time 2.964 (3.010)	Data 0.042 (0.064)	Mem 41.61GB	Prec@1 100.000 (87.025)	Loss 1.1378 (1.3130)
[02/28 20:25:33][INFO] train_vision.py:  668: Epoch: [29][130/367], lr: 3.46e-06, eta: 0:11:56	Time 2.969 (3.009)	Data 0.041 (0.063)	Mem 41.61GB	Prec@1 80.000 (86.641)	Loss 1.4110 (1.3296)
[02/28 20:26:03][INFO] train_vision.py:  668: Epoch: [29][140/367], lr: 3.42e-06, eta: 0:11:25	Time 2.996 (3.008)	Data 0.049 (0.061)	Mem 41.61GB	Prec@1 90.000 (86.809)	Loss 1.2271 (1.3275)
[02/28 20:26:33][INFO] train_vision.py:  668: Epoch: [29][150/367], lr: 3.38e-06, eta: 0:10:55	Time 2.993 (3.006)	Data 0.042 (0.060)	Mem 41.61GB	Prec@1 80.000 (86.689)	Loss 1.5094 (1.3336)
[02/28 20:27:02][INFO] train_vision.py:  668: Epoch: [29][160/367], lr: 3.35e-06, eta: 0:10:25	Time 2.980 (3.005)	Data 0.045 (0.059)	Mem 41.61GB	Prec@1 100.000 (86.957)	Loss 1.1864 (1.3268)
[02/28 20:27:32][INFO] train_vision.py:  668: Epoch: [29][170/367], lr: 3.32e-06, eta: 0:09:54	Time 2.998 (3.004)	Data 0.042 (0.058)	Mem 41.61GB	Prec@1 80.000 (86.901)	Loss 1.7295 (1.3276)
[02/28 20:28:02][INFO] train_vision.py:  668: Epoch: [29][180/367], lr: 3.28e-06, eta: 0:09:24	Time 2.983 (3.003)	Data 0.051 (0.057)	Mem 41.61GB	Prec@1 80.000 (86.519)	Loss 1.6290 (1.3347)
[02/28 20:28:32][INFO] train_vision.py:  668: Epoch: [29][190/367], lr: 3.25e-06, eta: 0:08:54	Time 2.998 (3.002)	Data 0.041 (0.056)	Mem 41.61GB	Prec@1 80.000 (86.440)	Loss 1.4154 (1.3342)
[02/28 20:29:02][INFO] train_vision.py:  668: Epoch: [29][200/367], lr: 3.23e-06, eta: 0:08:24	Time 2.994 (3.001)	Data 0.040 (0.056)	Mem 41.61GB	Prec@1 90.000 (86.418)	Loss 1.1463 (1.3321)
[02/28 20:29:32][INFO] train_vision.py:  668: Epoch: [29][210/367], lr: 3.20e-06, eta: 0:07:54	Time 2.966 (3.001)	Data 0.044 (0.055)	Mem 41.61GB	Prec@1 100.000 (86.540)	Loss 1.0325 (1.3318)
[02/28 20:30:02][INFO] train_vision.py:  668: Epoch: [29][220/367], lr: 3.18e-06, eta: 0:07:24	Time 2.993 (3.000)	Data 0.048 (0.055)	Mem 41.61GB	Prec@1 70.000 (86.380)	Loss 1.7621 (1.3332)
[02/28 20:30:32][INFO] train_vision.py:  668: Epoch: [29][230/367], lr: 3.15e-06, eta: 0:06:53	Time 2.994 (3.000)	Data 0.042 (0.054)	Mem 41.61GB	Prec@1 80.000 (86.494)	Loss 1.5255 (1.3298)
[02/28 20:31:02][INFO] train_vision.py:  668: Epoch: [29][240/367], lr: 3.13e-06, eta: 0:06:23	Time 2.984 (3.000)	Data 0.041 (0.054)	Mem 41.61GB	Prec@1 100.000 (86.680)	Loss 1.0462 (1.3242)
[02/28 20:31:31][INFO] train_vision.py:  668: Epoch: [29][250/367], lr: 3.11e-06, eta: 0:05:53	Time 3.011 (2.999)	Data 0.043 (0.053)	Mem 41.61GB	Prec@1 70.000 (86.614)	Loss 1.4033 (1.3232)
[02/28 20:32:01][INFO] train_vision.py:  668: Epoch: [29][260/367], lr: 3.09e-06, eta: 0:05:23	Time 2.985 (2.999)	Data 0.024 (0.053)	Mem 41.61GB	Prec@1 100.000 (86.667)	Loss 1.1487 (1.3225)
[02/28 20:32:31][INFO] train_vision.py:  668: Epoch: [29][270/367], lr: 3.08e-06, eta: 0:04:53	Time 3.011 (2.999)	Data 0.052 (0.053)	Mem 41.61GB	Prec@1 90.000 (86.458)	Loss 1.1881 (1.3255)
[02/28 20:33:01][INFO] train_vision.py:  668: Epoch: [29][280/367], lr: 3.06e-06, eta: 0:04:23	Time 2.962 (2.998)	Data 0.049 (0.052)	Mem 41.61GB	Prec@1 100.000 (86.370)	Loss 1.1334 (1.3286)
[02/28 20:33:31][INFO] train_vision.py:  668: Epoch: [29][290/367], lr: 3.05e-06, eta: 0:03:53	Time 2.986 (2.998)	Data 0.042 (0.052)	Mem 41.61GB	Prec@1 100.000 (86.564)	Loss 1.0907 (1.3251)
[02/28 20:34:01][INFO] train_vision.py:  668: Epoch: [29][300/367], lr: 3.04e-06, eta: 0:03:23	Time 2.983 (2.998)	Data 0.039 (0.052)	Mem 41.61GB	Prec@1 80.000 (86.777)	Loss 1.7374 (1.3202)
[02/28 20:34:31][INFO] train_vision.py:  668: Epoch: [29][310/367], lr: 3.03e-06, eta: 0:02:53	Time 2.988 (2.998)	Data 0.040 (0.051)	Mem 41.61GB	Prec@1 90.000 (86.785)	Loss 1.2308 (1.3202)
[02/28 20:35:01][INFO] train_vision.py:  668: Epoch: [29][320/367], lr: 3.02e-06, eta: 0:02:23	Time 3.009 (2.997)	Data 0.029 (0.051)	Mem 41.61GB	Prec@1 80.000 (86.729)	Loss 1.2238 (1.3222)
[02/28 20:35:31][INFO] train_vision.py:  668: Epoch: [29][330/367], lr: 3.01e-06, eta: 0:01:53	Time 3.029 (2.997)	Data 0.022 (0.051)	Mem 41.61GB	Prec@1 90.000 (86.888)	Loss 1.3544 (1.3173)
[02/28 20:36:01][INFO] train_vision.py:  668: Epoch: [29][340/367], lr: 3.01e-06, eta: 0:01:23	Time 2.996 (2.997)	Data 0.041 (0.051)	Mem 41.61GB	Prec@1 100.000 (86.979)	Loss 0.9454 (1.3150)
[02/28 20:36:31][INFO] train_vision.py:  668: Epoch: [29][350/367], lr: 3.00e-06, eta: 0:00:53	Time 2.999 (2.997)	Data 0.040 (0.050)	Mem 41.61GB	Prec@1 70.000 (86.952)	Loss 1.5588 (1.3141)
[02/28 20:37:01][INFO] train_vision.py:  668: Epoch: [29][360/367], lr: 3.00e-06, eta: 0:00:23	Time 2.985 (2.997)	Data 0.038 (0.050)	Mem 41.61GB	Prec@1 80.000 (87.064)	Loss 1.2905 (1.3117)
[02/28 20:37:24][INFO] train_vision.py:  840: Test: [0/121]	Prec@1 96.250 (96.250)	Prec@5 100.000 (100.000)	mPrec@1 (10.926)	mPrec@5 (11.458)
[02/28 20:38:06][INFO] train_vision.py:  840: Test: [10/121]	Prec@1 97.500 (96.136)	Prec@5 100.000 (100.000)	mPrec@1 (31.012)	mPrec@5 (32.639)
[02/28 20:38:49][INFO] train_vision.py:  840: Test: [20/121]	Prec@1 100.000 (95.119)	Prec@5 100.000 (99.762)	mPrec@1 (41.576)	mPrec@5 (45.066)
[02/28 20:39:31][INFO] train_vision.py:  840: Test: [30/121]	Prec@1 95.000 (95.161)	Prec@5 100.000 (99.718)	mPrec@1 (48.793)	mPrec@5 (54.212)
[02/28 20:40:13][INFO] train_vision.py:  840: Test: [40/121]	Prec@1 93.750 (95.213)	Prec@5 98.750 (99.665)	mPrec@1 (54.863)	mPrec@5 (61.472)
[02/28 20:40:55][INFO] train_vision.py:  840: Test: [50/121]	Prec@1 92.500 (94.534)	Prec@5 100.000 (99.510)	mPrec@1 (56.184)	mPrec@5 (64.622)
[02/28 20:41:37][INFO] train_vision.py:  840: Test: [60/121]	Prec@1 96.250 (94.570)	Prec@5 100.000 (99.590)	mPrec@1 (58.100)	mPrec@5 (67.794)
[02/28 20:42:19][INFO] train_vision.py:  840: Test: [70/121]	Prec@1 97.500 (94.789)	Prec@5 100.000 (99.595)	mPrec@1 (58.498)	mPrec@5 (69.192)
[02/28 20:43:01][INFO] train_vision.py:  840: Test: [80/121]	Prec@1 98.750 (94.938)	Prec@5 100.000 (99.599)	mPrec@1 (61.596)	mPrec@5 (73.416)
[02/28 20:43:43][INFO] train_vision.py:  840: Test: [90/121]	Prec@1 93.750 (94.794)	Prec@5 100.000 (99.602)	mPrec@1 (63.530)	mPrec@5 (77.180)
[02/28 20:44:25][INFO] train_vision.py:  840: Test: [100/121]	Prec@1 96.250 (94.319)	Prec@5 100.000 (99.542)	mPrec@1 (68.051)	mPrec@5 (86.327)
[02/28 20:45:07][INFO] train_vision.py:  840: Test: [110/121]	Prec@1 86.250 (94.313)	Prec@5 100.000 (99.561)	mPrec@1 (67.996)	mPrec@5 (86.995)
[02/28 20:45:48][INFO] train_vision.py:  840: Test: [120/121]	Prec@1 66.667 (93.522)	Prec@5 85.417 (99.451)	mPrec@1 (70.400)	mPrec@5 (94.419)
[02/28 20:45:48][INFO] train_vision.py:  847: Overall Prec@1 93.522% Prec@5 99.451% mPrec@1 (70.400) mPrec@5 (94.419)
[02/28 20:45:48][INFO] train_vision.py:  464: Testing: 70.39988708496094/70.70352172851562
[02/28 20:45:48][INFO] train_vision.py:  465: Saving:
[02/28 20:46:14][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/28 20:46:14][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/28 20:46:16][DEBUG] cmd.py: 1253: Popen(['git', 'cat-file', '--batch-check'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=<valid stream>, shell=False, universal_newlines=False)
[02/28 20:46:20][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/28 20:46:22][INFO] model.py:  444: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/28 20:46:22][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/28 20:46:24][INFO] model.py:  921: loading clip pretrained model!
[02/28 20:46:24][INFO] utils.py:  500: Model:
VideoCLIP(
  (visual): VisualTransformer(
    (conv1): Conv2d(3, 1024, kernel_size=(14, 14), stride=(14, 14), bias=False)
    (dropout): Dropout(p=0.0, inplace=False)
    (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
    (transformer): Transformer(
      (resblocks): ModuleList(
        (0): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (1): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (2): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (3): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (4): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (5): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (6): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (7): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (8): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (9): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (10): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (11): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (12): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (13): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (14): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (15): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (16): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (17): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (18): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (19): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (20): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (21): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (22): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (23): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
      )
    )
    (side_network): SideNetwork(
      (resblocks): ModuleList(
        (0): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (1): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (2): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (3): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (4): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (5): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (6): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (7): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (8): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (9): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (10): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (11): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (12): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (13): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (14): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (15): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (16): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (17): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (18): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (19): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (20): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (21): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (22): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (23): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
      )
      (adaptation): ModuleList(
        (0): Linear(in_features=1024, out_features=448, bias=True)
        (1): Linear(in_features=1024, out_features=448, bias=True)
        (2): Linear(in_features=1024, out_features=448, bias=True)
        (3): Linear(in_features=1024, out_features=448, bias=True)
        (4): Linear(in_features=1024, out_features=448, bias=True)
        (5): Linear(in_features=1024, out_features=448, bias=True)
        (6): Linear(in_features=1024, out_features=448, bias=True)
        (7): Linear(in_features=1024, out_features=448, bias=True)
        (8): Linear(in_features=1024, out_features=448, bias=True)
        (9): Linear(in_features=1024, out_features=448, bias=True)
        (10): Linear(in_features=1024, out_features=448, bias=True)
        (11): Linear(in_features=1024, out_features=448, bias=True)
        (12): Linear(in_features=1024, out_features=448, bias=True)
        (13): Linear(in_features=1024, out_features=448, bias=True)
        (14): Linear(in_features=1024, out_features=448, bias=True)
        (15): Linear(in_features=1024, out_features=448, bias=True)
        (16): Linear(in_features=1024, out_features=448, bias=True)
        (17): Linear(in_features=1024, out_features=448, bias=True)
        (18): Linear(in_features=1024, out_features=448, bias=True)
        (19): Linear(in_features=1024, out_features=448, bias=True)
        (20): Linear(in_features=1024, out_features=448, bias=True)
        (21): Linear(in_features=1024, out_features=448, bias=True)
        (22): Linear(in_features=1024, out_features=448, bias=True)
        (23): Linear(in_features=1024, out_features=448, bias=True)
      )
      (lns_pre): ModuleList(
        (0): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (4): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (5): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (6): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (7): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (8): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (9): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (10): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (11): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (12): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (13): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (14): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (15): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (16): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (17): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (18): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (19): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (20): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (21): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (22): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (23): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
      )
      (moss_layers): ModuleList(
        (0): MOSSBlock(
          (stss_encoders): ModuleList(
            (0): STSSEncoder(
              (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=1024, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
            (1): STSSEncoder(
              (ln_pre): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=448, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
          )
        )
      )
    )
    (side_post_bn): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    (side_conv1): Conv3d(3, 448, kernel_size=(3, 14, 14), stride=(1, 14, 14), padding=(1, 0, 0))
    (side_pre_bn3d): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
  )
  (fusion_model): video_header()
  (drop_out): Dropout(p=0, inplace=False)
  (fc): Linear(in_features=448, out_features=288, bias=True)
)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::div encountered 205 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::add_ encountered 58 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul encountered 463 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul_ encountered 90 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::add encountered 171 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::softmax encountered 24 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::sigmoid encountered 24 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator prim::PythonOp.CheckpointFunction encountered 72 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::pad encountered 38 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::unfold encountered 2 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::norm encountered 4 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::clamp_min encountered 4 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::expand_as encountered 4 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::diagonal encountered 36 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::gelu encountered 8 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::sum encountered 1 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  501: Unsupported operator aten::mean encountered 1 time(s)
[02/28 20:47:26][WARNING] jit_analysis.py:  513: The following submodules of the model were never called during the trace of the graph. They may be unused, or they were accessed by direct calls to .forward() or via other python methods. In the latter case they will have zeros for statistics, though their statistics will still contribute to their parent calling module.
fusion_model, visual.dropout, visual.side_network.resblocks.0.attn, visual.side_network.resblocks.0.attn.out_proj, visual.side_network.resblocks.0.conv, visual.side_network.resblocks.0.conv.0, visual.side_network.resblocks.0.conv.1, visual.side_network.resblocks.0.conv.2, visual.side_network.resblocks.0.mlp, visual.side_network.resblocks.0.mlp.act, visual.side_network.resblocks.0.mlp.drop, visual.side_network.resblocks.0.mlp.fc1, visual.side_network.resblocks.0.mlp.fc2, visual.side_network.resblocks.1.attn, visual.side_network.resblocks.1.attn.out_proj, visual.side_network.resblocks.1.conv, visual.side_network.resblocks.1.conv.0, visual.side_network.resblocks.1.conv.1, visual.side_network.resblocks.1.conv.2, visual.side_network.resblocks.1.mlp, visual.side_network.resblocks.1.mlp.act, visual.side_network.resblocks.1.mlp.drop, visual.side_network.resblocks.1.mlp.fc1, visual.side_network.resblocks.1.mlp.fc2, visual.side_network.resblocks.10.attn, visual.side_network.resblocks.10.attn.out_proj, visual.side_network.resblocks.10.conv, visual.side_network.resblocks.10.conv.0, visual.side_network.resblocks.10.conv.1, visual.side_network.resblocks.10.conv.2, visual.side_network.resblocks.10.mlp, visual.side_network.resblocks.10.mlp.act, visual.side_network.resblocks.10.mlp.drop, visual.side_network.resblocks.10.mlp.fc1, visual.side_network.resblocks.10.mlp.fc2, visual.side_network.resblocks.11.attn, visual.side_network.resblocks.11.attn.out_proj, visual.side_network.resblocks.11.conv, visual.side_network.resblocks.11.conv.0, visual.side_network.resblocks.11.conv.1, visual.side_network.resblocks.11.conv.2, visual.side_network.resblocks.11.mlp, visual.side_network.resblocks.11.mlp.act, visual.side_network.resblocks.11.mlp.drop, visual.side_network.resblocks.11.mlp.fc1, visual.side_network.resblocks.11.mlp.fc2, visual.side_network.resblocks.12.attn, visual.side_network.resblocks.12.attn.out_proj, visual.side_network.resblocks.12.conv, visual.side_network.resblocks.12.conv.0, visual.side_network.resblocks.12.conv.1, visual.side_network.resblocks.12.conv.2, visual.side_network.resblocks.12.mlp, visual.side_network.resblocks.12.mlp.act, visual.side_network.resblocks.12.mlp.drop, visual.side_network.resblocks.12.mlp.fc1, visual.side_network.resblocks.12.mlp.fc2, visual.side_network.resblocks.13.attn, visual.side_network.resblocks.13.attn.out_proj, visual.side_network.resblocks.13.conv, visual.side_network.resblocks.13.conv.0, visual.side_network.resblocks.13.conv.1, visual.side_network.resblocks.13.conv.2, visual.side_network.resblocks.13.mlp, visual.side_network.resblocks.13.mlp.act, visual.side_network.resblocks.13.mlp.drop, visual.side_network.resblocks.13.mlp.fc1, visual.side_network.resblocks.13.mlp.fc2, visual.side_network.resblocks.14.attn, visual.side_network.resblocks.14.attn.out_proj, visual.side_network.resblocks.14.conv, visual.side_network.resblocks.14.conv.0, visual.side_network.resblocks.14.conv.1, visual.side_network.resblocks.14.conv.2, visual.side_network.resblocks.14.mlp, visual.side_network.resblocks.14.mlp.act, visual.side_network.resblocks.14.mlp.drop, visual.side_network.resblocks.14.mlp.fc1, visual.side_network.resblocks.14.mlp.fc2, visual.side_network.resblocks.15.attn, visual.side_network.resblocks.15.attn.out_proj, visual.side_network.resblocks.15.conv, visual.side_network.resblocks.15.conv.0, visual.side_network.resblocks.15.conv.1, visual.side_network.resblocks.15.conv.2, visual.side_network.resblocks.15.mlp, visual.side_network.resblocks.15.mlp.act, visual.side_network.resblocks.15.mlp.drop, visual.side_network.resblocks.15.mlp.fc1, visual.side_network.resblocks.15.mlp.fc2, visual.side_network.resblocks.16.attn, visual.side_network.resblocks.16.attn.out_proj, visual.side_network.resblocks.16.conv, visual.side_network.resblocks.16.conv.0, visual.side_network.resblocks.16.conv.1, visual.side_network.resblocks.16.conv.2, visual.side_network.resblocks.16.mlp, visual.side_network.resblocks.16.mlp.act, visual.side_network.resblocks.16.mlp.drop, visual.side_network.resblocks.16.mlp.fc1, visual.side_network.resblocks.16.mlp.fc2, visual.side_network.resblocks.17.attn, visual.side_network.resblocks.17.attn.out_proj, visual.side_network.resblocks.17.conv, visual.side_network.resblocks.17.conv.0, visual.side_network.resblocks.17.conv.1, visual.side_network.resblocks.17.conv.2, visual.side_network.resblocks.17.mlp, visual.side_network.resblocks.17.mlp.act, visual.side_network.resblocks.17.mlp.drop, visual.side_network.resblocks.17.mlp.fc1, visual.side_network.resblocks.17.mlp.fc2, visual.side_network.resblocks.18.attn, visual.side_network.resblocks.18.attn.out_proj, visual.side_network.resblocks.18.conv, visual.side_network.resblocks.18.conv.0, visual.side_network.resblocks.18.conv.1, visual.side_network.resblocks.18.conv.2, visual.side_network.resblocks.18.mlp, visual.side_network.resblocks.18.mlp.act, visual.side_network.resblocks.18.mlp.drop, visual.side_network.resblocks.18.mlp.fc1, visual.side_network.resblocks.18.mlp.fc2, visual.side_network.resblocks.19.attn, visual.side_network.resblocks.19.attn.out_proj, visual.side_network.resblocks.19.conv, visual.side_network.resblocks.19.conv.0, visual.side_network.resblocks.19.conv.1, visual.side_network.resblocks.19.conv.2, visual.side_network.resblocks.19.mlp, visual.side_network.resblocks.19.mlp.act, visual.side_network.resblocks.19.mlp.drop, visual.side_network.resblocks.19.mlp.fc1, visual.side_network.resblocks.19.mlp.fc2, visual.side_network.resblocks.2.attn, visual.side_network.resblocks.2.attn.out_proj, visual.side_network.resblocks.2.conv, visual.side_network.resblocks.2.conv.0, visual.side_network.resblocks.2.conv.1, visual.side_network.resblocks.2.conv.2, visual.side_network.resblocks.2.mlp, visual.side_network.resblocks.2.mlp.act, visual.side_network.resblocks.2.mlp.drop, visual.side_network.resblocks.2.mlp.fc1, visual.side_network.resblocks.2.mlp.fc2, visual.side_network.resblocks.20.attn, visual.side_network.resblocks.20.attn.out_proj, visual.side_network.resblocks.20.conv, visual.side_network.resblocks.20.conv.0, visual.side_network.resblocks.20.conv.1, visual.side_network.resblocks.20.conv.2, visual.side_network.resblocks.20.mlp, visual.side_network.resblocks.20.mlp.act, visual.side_network.resblocks.20.mlp.drop, visual.side_network.resblocks.20.mlp.fc1, visual.side_network.resblocks.20.mlp.fc2, visual.side_network.resblocks.21.attn, visual.side_network.resblocks.21.attn.out_proj, visual.side_network.resblocks.21.conv, visual.side_network.resblocks.21.conv.0, visual.side_network.resblocks.21.conv.1, visual.side_network.resblocks.21.conv.2, visual.side_network.resblocks.21.mlp, visual.side_network.resblocks.21.mlp.act, visual.side_network.resblocks.21.mlp.drop, visual.side_network.resblocks.21.mlp.fc1, visual.side_network.resblocks.21.mlp.fc2, visual.side_network.resblocks.22.attn, visual.side_network.resblocks.22.attn.out_proj, visual.side_network.resblocks.22.conv, visual.side_network.resblocks.22.conv.0, visual.side_network.resblocks.22.conv.1, visual.side_network.resblocks.22.conv.2, visual.side_network.resblocks.22.mlp, visual.side_network.resblocks.22.mlp.act, visual.side_network.resblocks.22.mlp.drop, visual.side_network.resblocks.22.mlp.fc1, visual.side_network.resblocks.22.mlp.fc2, visual.side_network.resblocks.23.attn, visual.side_network.resblocks.23.attn.out_proj, visual.side_network.resblocks.23.conv, visual.side_network.resblocks.23.conv.0, visual.side_network.resblocks.23.conv.1, visual.side_network.resblocks.23.conv.2, visual.side_network.resblocks.23.mlp, visual.side_network.resblocks.23.mlp.act, visual.side_network.resblocks.23.mlp.drop, visual.side_network.resblocks.23.mlp.fc1, visual.side_network.resblocks.23.mlp.fc2, visual.side_network.resblocks.3.attn, visual.side_network.resblocks.3.attn.out_proj, visual.side_network.resblocks.3.conv, visual.side_network.resblocks.3.conv.0, visual.side_network.resblocks.3.conv.1, visual.side_network.resblocks.3.conv.2, visual.side_network.resblocks.3.mlp, visual.side_network.resblocks.3.mlp.act, visual.side_network.resblocks.3.mlp.drop, visual.side_network.resblocks.3.mlp.fc1, visual.side_network.resblocks.3.mlp.fc2, visual.side_network.resblocks.4.attn, visual.side_network.resblocks.4.attn.out_proj, visual.side_network.resblocks.4.conv, visual.side_network.resblocks.4.conv.0, visual.side_network.resblocks.4.conv.1, visual.side_network.resblocks.4.conv.2, visual.side_network.resblocks.4.mlp, visual.side_network.resblocks.4.mlp.act, visual.side_network.resblocks.4.mlp.drop, visual.side_network.resblocks.4.mlp.fc1, visual.side_network.resblocks.4.mlp.fc2, visual.side_network.resblocks.5.attn, visual.side_network.resblocks.5.attn.out_proj, visual.side_network.resblocks.5.conv, visual.side_network.resblocks.5.conv.0, visual.side_network.resblocks.5.conv.1, visual.side_network.resblocks.5.conv.2, visual.side_network.resblocks.5.mlp, visual.side_network.resblocks.5.mlp.act, visual.side_network.resblocks.5.mlp.drop, visual.side_network.resblocks.5.mlp.fc1, visual.side_network.resblocks.5.mlp.fc2, visual.side_network.resblocks.6.attn, visual.side_network.resblocks.6.attn.out_proj, visual.side_network.resblocks.6.conv, visual.side_network.resblocks.6.conv.0, visual.side_network.resblocks.6.conv.1, visual.side_network.resblocks.6.conv.2, visual.side_network.resblocks.6.mlp, visual.side_network.resblocks.6.mlp.act, visual.side_network.resblocks.6.mlp.drop, visual.side_network.resblocks.6.mlp.fc1, visual.side_network.resblocks.6.mlp.fc2, visual.side_network.resblocks.7.attn, visual.side_network.resblocks.7.attn.out_proj, visual.side_network.resblocks.7.conv, visual.side_network.resblocks.7.conv.0, visual.side_network.resblocks.7.conv.1, visual.side_network.resblocks.7.conv.2, visual.side_network.resblocks.7.mlp, visual.side_network.resblocks.7.mlp.act, visual.side_network.resblocks.7.mlp.drop, visual.side_network.resblocks.7.mlp.fc1, visual.side_network.resblocks.7.mlp.fc2, visual.side_network.resblocks.8.attn, visual.side_network.resblocks.8.attn.out_proj, visual.side_network.resblocks.8.conv, visual.side_network.resblocks.8.conv.0, visual.side_network.resblocks.8.conv.1, visual.side_network.resblocks.8.conv.2, visual.side_network.resblocks.8.mlp, visual.side_network.resblocks.8.mlp.act, visual.side_network.resblocks.8.mlp.drop, visual.side_network.resblocks.8.mlp.fc1, visual.side_network.resblocks.8.mlp.fc2, visual.side_network.resblocks.9.attn, visual.side_network.resblocks.9.attn.out_proj, visual.side_network.resblocks.9.conv, visual.side_network.resblocks.9.conv.0, visual.side_network.resblocks.9.conv.1, visual.side_network.resblocks.9.conv.2, visual.side_network.resblocks.9.mlp, visual.side_network.resblocks.9.mlp.act, visual.side_network.resblocks.9.mlp.drop, visual.side_network.resblocks.9.mlp.fc1, visual.side_network.resblocks.9.mlp.fc2, visual.transformer.resblocks.0.attn.out_proj, visual.transformer.resblocks.1.attn.out_proj, visual.transformer.resblocks.10.attn.out_proj, visual.transformer.resblocks.11.attn.out_proj, visual.transformer.resblocks.12.attn.out_proj, visual.transformer.resblocks.13.attn.out_proj, visual.transformer.resblocks.14.attn.out_proj, visual.transformer.resblocks.15.attn.out_proj, visual.transformer.resblocks.16.attn.out_proj, visual.transformer.resblocks.17.attn.out_proj, visual.transformer.resblocks.18.attn.out_proj, visual.transformer.resblocks.19.attn.out_proj, visual.transformer.resblocks.2.attn.out_proj, visual.transformer.resblocks.20.attn.out_proj, visual.transformer.resblocks.21.attn.out_proj, visual.transformer.resblocks.22.attn.out_proj, visual.transformer.resblocks.23.attn.out_proj, visual.transformer.resblocks.3.attn.out_proj, visual.transformer.resblocks.4.attn.out_proj, visual.transformer.resblocks.5.attn.out_proj, visual.transformer.resblocks.6.attn.out_proj, visual.transformer.resblocks.7.attn.out_proj, visual.transformer.resblocks.8.attn.out_proj, visual.transformer.resblocks.9.attn.out_proj
[02/28 20:47:26][INFO] utils.py:  502: Flops: 2.732T
[02/28 20:47:26][INFO] utils.py:  504: Params: 385.508M, tunable Params: 385.508M
[02/28 20:47:27][INFO] test_vision.py:  303: load model: epoch 27
[02/28 20:47:36][INFO] test_vision.py:  602: Test: [0/402], average 0.3139 sec/video 	Prec@1 100.000 (100.000)	Prec@5 100.000 (100.000)	mPrec@1 7.944	mPrec@5 6.944
[02/28 20:48:00][INFO] test_vision.py:  602: Test: [10/402], average 0.1184 sec/video 	Prec@1 95.833 (97.348)	Prec@5 100.000 (100.000)	mPrec@1 19.949	mPrec@5 19.792
[02/28 20:48:24][INFO] test_vision.py:  602: Test: [20/402], average 0.1101 sec/video 	Prec@1 95.833 (97.024)	Prec@5 100.000 (100.000)	mPrec@1 24.193	mPrec@5 24.306
[02/28 20:48:48][INFO] test_vision.py:  602: Test: [30/402], average 0.1075 sec/video 	Prec@1 100.000 (96.237)	Prec@5 100.000 (100.000)	mPrec@1 29.010	mPrec@5 29.514
[02/28 20:49:13][INFO] test_vision.py:  602: Test: [40/402], average 0.1063 sec/video 	Prec@1 79.167 (95.427)	Prec@5 100.000 (100.000)	mPrec@1 35.328	mPrec@5 36.806
[02/28 20:49:38][INFO] test_vision.py:  602: Test: [50/402], average 0.1056 sec/video 	Prec@1 95.833 (95.261)	Prec@5 100.000 (100.000)	mPrec@1 37.595	mPrec@5 39.931
[02/28 20:50:02][INFO] test_vision.py:  602: Test: [60/402], average 0.1052 sec/video 	Prec@1 91.667 (95.150)	Prec@5 100.000 (99.863)	mPrec@1 39.600	mPrec@5 42.622
[02/28 20:50:27][INFO] test_vision.py:  602: Test: [70/402], average 0.1050 sec/video 	Prec@1 91.667 (95.423)	Prec@5 100.000 (99.824)	mPrec@1 42.793	mPrec@5 45.761
[02/28 20:50:52][INFO] test_vision.py:  602: Test: [80/402], average 0.1048 sec/video 	Prec@1 95.833 (95.473)	Prec@5 100.000 (99.794)	mPrec@1 45.485	mPrec@5 49.841
[02/28 20:51:17][INFO] test_vision.py:  602: Test: [90/402], average 0.1047 sec/video 	Prec@1 95.833 (95.467)	Prec@5 100.000 (99.817)	mPrec@1 48.199	mPrec@5 52.457
[02/28 20:51:42][INFO] test_vision.py:  602: Test: [100/402], average 0.1045 sec/video 	Prec@1 100.000 (95.462)	Prec@5 100.000 (99.752)	mPrec@1 49.471	mPrec@5 53.860
[02/28 20:52:07][INFO] test_vision.py:  602: Test: [110/402], average 0.1045 sec/video 	Prec@1 91.667 (95.608)	Prec@5 95.833 (99.737)	mPrec@1 51.843	mPrec@5 56.633
[02/28 20:52:31][INFO] test_vision.py:  602: Test: [120/402], average 0.1044 sec/video 	Prec@1 100.000 (95.455)	Prec@5 100.000 (99.690)	mPrec@1 54.257	mPrec@5 59.527
[02/28 20:52:56][INFO] test_vision.py:  602: Test: [130/402], average 0.1043 sec/video 	Prec@1 87.500 (95.611)	Prec@5 100.000 (99.714)	mPrec@1 55.284	mPrec@5 60.936
[02/28 20:53:21][INFO] test_vision.py:  602: Test: [140/402], average 0.1042 sec/video 	Prec@1 100.000 (95.567)	Prec@5 100.000 (99.675)	mPrec@1 56.898	mPrec@5 62.638
[02/28 20:53:46][INFO] test_vision.py:  602: Test: [150/402], average 0.1041 sec/video 	Prec@1 87.500 (95.061)	Prec@5 100.000 (99.531)	mPrec@1 56.864	mPrec@5 63.374
[02/28 20:54:10][INFO] test_vision.py:  602: Test: [160/402], average 0.1040 sec/video 	Prec@1 95.833 (94.902)	Prec@5 100.000 (99.482)	mPrec@1 57.093	mPrec@5 64.696
[02/28 20:54:35][INFO] test_vision.py:  602: Test: [170/402], average 0.1040 sec/video 	Prec@1 100.000 (94.956)	Prec@5 100.000 (99.488)	mPrec@1 57.293	mPrec@5 64.716
[02/28 20:55:00][INFO] test_vision.py:  602: Test: [180/402], average 0.1039 sec/video 	Prec@1 100.000 (95.051)	Prec@5 100.000 (99.517)	mPrec@1 57.734	mPrec@5 65.414
[02/28 20:55:25][INFO] test_vision.py:  602: Test: [190/402], average 0.1039 sec/video 	Prec@1 91.667 (94.961)	Prec@5 100.000 (99.520)	mPrec@1 58.134	mPrec@5 66.114
[02/28 20:55:49][INFO] test_vision.py:  602: Test: [200/402], average 0.1038 sec/video 	Prec@1 95.833 (94.983)	Prec@5 100.000 (99.523)	mPrec@1 59.285	mPrec@5 67.074
[02/28 20:56:14][INFO] test_vision.py:  602: Test: [210/402], average 0.1038 sec/video 	Prec@1 95.833 (95.043)	Prec@5 100.000 (99.506)	mPrec@1 59.623	mPrec@5 68.113
[02/28 20:56:39][INFO] test_vision.py:  602: Test: [220/402], average 0.1038 sec/video 	Prec@1 87.500 (95.155)	Prec@5 95.833 (99.510)	mPrec@1 59.523	mPrec@5 68.448
[02/28 20:57:04][INFO] test_vision.py:  602: Test: [230/402], average 0.1038 sec/video 	Prec@1 100.000 (95.130)	Prec@5 100.000 (99.531)	mPrec@1 59.152	mPrec@5 68.803
[02/28 20:57:29][INFO] test_vision.py:  602: Test: [240/402], average 0.1038 sec/video 	Prec@1 100.000 (95.211)	Prec@5 100.000 (99.550)	mPrec@1 60.385	mPrec@5 70.195
[02/28 20:57:53][INFO] test_vision.py:  602: Test: [250/402], average 0.1037 sec/video 	Prec@1 87.500 (95.219)	Prec@5 95.833 (99.552)	mPrec@1 60.484	mPrec@5 70.529
[02/28 20:58:18][INFO] test_vision.py:  602: Test: [260/402], average 0.1037 sec/video 	Prec@1 100.000 (95.307)	Prec@5 100.000 (99.569)	mPrec@1 62.174	mPrec@5 72.272
[02/28 20:58:43][INFO] test_vision.py:  602: Test: [270/402], average 0.1037 sec/video 	Prec@1 91.667 (95.341)	Prec@5 100.000 (99.554)	mPrec@1 62.795	mPrec@5 73.729
[02/28 20:59:08][INFO] test_vision.py:  602: Test: [280/402], average 0.1037 sec/video 	Prec@1 87.500 (95.314)	Prec@5 100.000 (99.570)	mPrec@1 62.880	mPrec@5 74.774
[02/28 20:59:33][INFO] test_vision.py:  602: Test: [290/402], average 0.1037 sec/video 	Prec@1 100.000 (95.275)	Prec@5 100.000 (99.556)	mPrec@1 63.450	mPrec@5 75.931
[02/28 20:59:57][INFO] test_vision.py:  602: Test: [300/402], average 0.1037 sec/video 	Prec@1 95.833 (95.266)	Prec@5 100.000 (99.571)	mPrec@1 64.300	mPrec@5 76.974
[02/28 21:00:22][INFO] test_vision.py:  602: Test: [310/402], average 0.1037 sec/video 	Prec@1 91.667 (95.190)	Prec@5 100.000 (99.571)	mPrec@1 64.552	mPrec@5 78.378
[02/28 21:00:47][INFO] test_vision.py:  602: Test: [320/402], average 0.1037 sec/video 	Prec@1 91.667 (94.678)	Prec@5 100.000 (99.507)	mPrec@1 69.317	mPrec@5 86.714
[02/28 21:01:12][INFO] test_vision.py:  602: Test: [330/402], average 0.1037 sec/video 	Prec@1 95.833 (94.763)	Prec@5 100.000 (99.522)	mPrec@1 69.396	mPrec@5 86.717
[02/28 21:01:37][INFO] test_vision.py:  602: Test: [340/402], average 0.1036 sec/video 	Prec@1 95.833 (94.807)	Prec@5 100.000 (99.536)	mPrec@1 69.398	mPrec@5 86.723
[02/28 21:02:01][INFO] test_vision.py:  602: Test: [350/402], average 0.1036 sec/video 	Prec@1 100.000 (94.860)	Prec@5 100.000 (99.549)	mPrec@1 69.252	mPrec@5 86.733
[02/28 21:02:26][INFO] test_vision.py:  602: Test: [360/402], average 0.1036 sec/video 	Prec@1 91.667 (94.818)	Prec@5 100.000 (99.561)	mPrec@1 69.118	mPrec@5 86.968
[02/28 21:02:51][INFO] test_vision.py:  602: Test: [370/402], average 0.1036 sec/video 	Prec@1 100.000 (94.811)	Prec@5 100.000 (99.551)	mPrec@1 69.465	mPrec@5 87.325
[02/28 21:03:16][INFO] test_vision.py:  602: Test: [380/402], average 0.1036 sec/video 	Prec@1 87.500 (94.794)	Prec@5 100.000 (99.563)	mPrec@1 69.360	mPrec@5 87.676
[02/28 21:03:41][INFO] test_vision.py:  602: Test: [390/402], average 0.1036 sec/video 	Prec@1 79.167 (94.672)	Prec@5 95.833 (99.552)	mPrec@1 69.142	mPrec@5 88.190
[02/28 21:04:05][INFO] test_vision.py:  602: Test: [400/402], average 0.1036 sec/video 	Prec@1 58.333 (94.067)	Prec@5 83.333 (99.460)	mPrec@1 70.828	mPrec@5 93.221
[02/28 21:04:08][INFO] test_vision.py:  615: -----Evaluation is finished------
[02/28 21:04:08][INFO] test_vision.py:  621: Overall Prec@1 94.009% Prec@5 99.451%	mPrec@1 (71.088)	mPrec@5 (94.957)
[02/28 21:04:08][INFO] test_vision.py:  338: Per-class accuracies saved to ./exp/s4v_selfy_vitl14_32x224_finegym288_run2/per_class_accuracies.txt
[02/28 21:04:08][INFO] test_vision.py:  371: Per-sample results saved to ./exp/s4v_selfy_vitl14_32x224_finegym288_run2/per_sample_results.txt
