[02/27 08:39:06][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/27 08:39:06][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/27 08:39:08][DEBUG] cmd.py: 1253: Popen(['git', 'cat-file', '--batch-check'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=<valid stream>, shell=False, universal_newlines=False)
[02/27 08:39:11][INFO] train_vision.py:  207: ------------------------------------
[02/27 08:39:11][INFO] train_vision.py:  208: Environment Versions:
[02/27 08:39:11][INFO] train_vision.py:  209: - Python: 3.8.19 (default, Mar 20 2024, 19:58:24) 
[GCC 11.2.0]
[02/27 08:39:11][INFO] train_vision.py:  210: - PyTorch: 1.12.1
[02/27 08:39:11][INFO] train_vision.py:  211: - TorchVison: 0.13.1
[02/27 08:39:11][INFO] train_vision.py:  212: ------------------------------------
[02/27 08:39:11][INFO] train_vision.py:  214: {   'data': {   'batch_size': 10,
                'dataset': 'finegym',
                'image_tmpl': 'img_{:05d}.jpg',
                'input_size': 224,
                'label_list': 'lists/finegym99_labels.csv',
                'modality': 'RGB',
                'num_classes': 99,
                'num_sample': 1,
                'num_segments': 32,
                'rand_aug': False,
                'rand_erase': False,
                'random_shift': True,
                'seg_length': 1,
                'test_batch_size': 3,
                'train_list': 'lists/finegym/train_gym99_rgb_320px_60fps.txt',
                'train_root': '/home/anonymous/datasets/finegym',
                'val_list': 'lists/finegym/val_gym99_rgb_320px_60fps.txt',
                'val_root': '/home/anonymous/datasets/finegym',
                'workers': 4},
    'logging': {   'acc_per_class': True,
                   'correct_per_sample': True,
                   'eval_freq': 2,
                   'print_freq': 10,
                   'skip_epoch': []},
    'network': {   'arch': 'ViT-L/14',
                   'corr_dim': 256,
                   'corr_ext_chnls': [96],
                   'corr_func': 'cosine',
                   'corr_int_chnls': [96, 96, 192],
                   'corr_layer_index': [7],
                   'corr_num_encoders': 2,
                   'corr_window': [5, 9, 9],
                   'drop_fc': 0,
                   'dropout': 0.0,
                   'emb_dropout': 0.0,
                   'fix_clip': False,
                   'init': True,
                   'joint_st': False,
                   'my_fix_clip': True,
                   'n_emb': 448,
                   'num_checkpoints': 24,
                   'side_dim': 448,
                   'sim_header': 'None',
                   'sync_bn': False,
                   'tm': False,
                   'type': 'clip_k400'},
    'pretrain': 'exp/s4v_selfy_vitl14_16x224_k400_run3/model_best.pt',
    'resume': None,
    'seed': 2048,
    'solver': {   'betas': [0.9, 0.999],
                  'clip_ratio': 1,
                  'epoch_offset': 0,
                  'epochs': 30,
                  'evaluate': False,
                  'final_factor': 0.01,
                  'grad_accumulation_steps': 1,
                  'layer_decay': 1.0,
                  'loss_type': 'CE',
                  'lr': 0.0002,
                  'lr_warmup_step': 4,
                  'optim': 'adamw',
                  'smoothing': 0.1,
                  'start_epoch': 0,
                  'type': 'cosine',
                  'warmup_lr': 2e-07,
                  'weight_decay': 0.15},
    'wandb': {   'entity': 'anonymous',
                 'exp_name': 'exp/s4v_selfy_vitl14_32x224_finegym99_run1/train',
                 'group_name': 'exp/s4v_selfy_vitl14_32x224_finegym99_run1',
                 'key': '1234',
                 'project_name': 'corr_adapter_finegym99',
                 'use_wandb': True}}
[02/27 08:39:11][INFO] train_vision.py:  215: ------------------------------------
[02/27 08:39:11][INFO] train_vision.py:  216: storing name: ./exp/exp/s4v_selfy_vitl14_32x224_finegym99_run1
[02/27 08:39:13][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/27 08:39:14][INFO] model.py:  444: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/27 08:39:15][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/27 08:39:16][INFO] model.py:  921: loading clip pretrained model!
[02/27 08:39:16][INFO] train_vision.py:  284: visual.class_embedding False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.positional_embedding False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.conv1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.ln_pre.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.ln_pre.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.0.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.1.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.2.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.3.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.4.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.5.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.6.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.7.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.8.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.9.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.10.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.11.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.12.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.13.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.14.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.15.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.16.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.17.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.18.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.19.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.20.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.21.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.22.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.attn.in_proj_weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.attn.in_proj_bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.attn.out_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.attn.out_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.ln_1.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.ln_1.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.mlp.c_fc.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.mlp.c_fc.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.mlp.c_proj.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.mlp.c_proj.bias False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.ln_2.weight False
[02/27 08:39:16][INFO] train_vision.py:  284: visual.transformer.resblocks.23.ln_2.bias False
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.side_spatial_position_embeddings True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.0.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.1.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.2.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.3.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.4.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.5.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.6.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.7.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.8.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.9.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.10.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.11.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.12.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.13.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.14.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.15.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.16.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.17.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.18.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.19.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.20.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.21.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.22.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.bn_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.bn_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.conv.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.bn_2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.bn_2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.mlp.fc1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.mlp.fc1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.mlp.fc2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.mlp.fc2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.attn.in_proj_weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.attn.in_proj_bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.attn.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.attn.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.ln_1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.resblocks.23.ln_1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.3.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.3.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.4.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.4.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.5.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.5.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.6.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.6.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.7.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.7.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.8.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.8.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.9.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.9.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.10.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.10.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.11.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.11.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.12.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.12.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.13.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.13.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.14.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.14.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.15.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.15.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.16.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.16.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.17.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.17.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.18.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.18.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.19.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.19.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.20.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.20.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.21.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.21.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.22.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.22.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.23.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.adaptation.23.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.0.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.3.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.3.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.4.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.4.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.5.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.5.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.6.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.6.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.7.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.7.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.8.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.8.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.9.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.9.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.10.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.10.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.11.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.11.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.12.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.12.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.13.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.13.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.14.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.14.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.15.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.15.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.16.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.16.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.17.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.17.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.18.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.18.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.19.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.19.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.20.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.20.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.21.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.21.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.22.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.22.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.23.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.lns_pre.23.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.ln_pre.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.ln_pre.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.in_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.in_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.0.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.ln_pre.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.ln_pre.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.in_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.in_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.0.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.2.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.2.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.out_proj.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_network.moss_layers.0.stss_encoders.1.out_proj.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_post_bn.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_post_bn.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_conv1.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_conv1.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_pre_bn3d.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: visual.side_pre_bn3d.bias True
[02/27 08:39:16][INFO] train_vision.py:  287: fc.weight True
[02/27 08:39:16][INFO] train_vision.py:  287: fc.bias True
[02/27 08:39:16][INFO] utils.py:  500: Model:
VideoCLIP(
  (visual): VisualTransformer(
    (conv1): Conv2d(3, 1024, kernel_size=(14, 14), stride=(14, 14), bias=False)
    (dropout): Dropout(p=0.0, inplace=False)
    (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
    (transformer): Transformer(
      (resblocks): ModuleList(
        (0): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (1): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (2): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (3): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (4): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (5): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (6): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (7): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (8): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (9): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (10): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (11): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (12): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (13): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (14): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (15): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (16): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (17): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (18): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (19): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (20): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (21): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (22): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (23): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
      )
    )
    (side_network): SideNetwork(
      (resblocks): ModuleList(
        (0): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (1): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (2): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (3): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (4): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (5): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (6): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (7): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (8): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (9): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (10): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (11): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (12): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (13): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (14): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (15): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (16): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (17): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (18): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (19): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (20): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (21): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (22): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (23): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
      )
      (adaptation): ModuleList(
        (0): Linear(in_features=1024, out_features=448, bias=True)
        (1): Linear(in_features=1024, out_features=448, bias=True)
        (2): Linear(in_features=1024, out_features=448, bias=True)
        (3): Linear(in_features=1024, out_features=448, bias=True)
        (4): Linear(in_features=1024, out_features=448, bias=True)
        (5): Linear(in_features=1024, out_features=448, bias=True)
        (6): Linear(in_features=1024, out_features=448, bias=True)
        (7): Linear(in_features=1024, out_features=448, bias=True)
        (8): Linear(in_features=1024, out_features=448, bias=True)
        (9): Linear(in_features=1024, out_features=448, bias=True)
        (10): Linear(in_features=1024, out_features=448, bias=True)
        (11): Linear(in_features=1024, out_features=448, bias=True)
        (12): Linear(in_features=1024, out_features=448, bias=True)
        (13): Linear(in_features=1024, out_features=448, bias=True)
        (14): Linear(in_features=1024, out_features=448, bias=True)
        (15): Linear(in_features=1024, out_features=448, bias=True)
        (16): Linear(in_features=1024, out_features=448, bias=True)
        (17): Linear(in_features=1024, out_features=448, bias=True)
        (18): Linear(in_features=1024, out_features=448, bias=True)
        (19): Linear(in_features=1024, out_features=448, bias=True)
        (20): Linear(in_features=1024, out_features=448, bias=True)
        (21): Linear(in_features=1024, out_features=448, bias=True)
        (22): Linear(in_features=1024, out_features=448, bias=True)
        (23): Linear(in_features=1024, out_features=448, bias=True)
      )
      (lns_pre): ModuleList(
        (0): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (4): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (5): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (6): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (7): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (8): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (9): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (10): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (11): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (12): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (13): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (14): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (15): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (16): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (17): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (18): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (19): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (20): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (21): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (22): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (23): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
      )
      (moss_layers): ModuleList(
        (0): MOSSBlock(
          (stss_encoders): ModuleList(
            (0): STSSEncoder(
              (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=1024, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
            (1): STSSEncoder(
              (ln_pre): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=448, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
          )
        )
      )
    )
    (side_post_bn): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    (side_conv1): Conv3d(3, 448, kernel_size=(3, 14, 14), stride=(1, 14, 14), padding=(1, 0, 0))
    (side_pre_bn3d): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
  )
  (fusion_model): video_header()
  (drop_out): Dropout(p=0, inplace=False)
  (fc): Linear(in_features=448, out_features=99, bias=True)
)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::div encountered 205 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::add_ encountered 58 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul encountered 463 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul_ encountered 90 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::add encountered 171 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::softmax encountered 24 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::sigmoid encountered 24 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator prim::PythonOp.CheckpointFunction encountered 72 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::pad encountered 38 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::unfold encountered 2 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::norm encountered 4 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::clamp_min encountered 4 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::expand_as encountered 4 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::diagonal encountered 36 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::gelu encountered 8 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::sum encountered 1 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  501: Unsupported operator aten::mean encountered 1 time(s)
[02/27 08:40:18][WARNING] jit_analysis.py:  513: The following submodules of the model were never called during the trace of the graph. They may be unused, or they were accessed by direct calls to .forward() or via other python methods. In the latter case they will have zeros for statistics, though their statistics will still contribute to their parent calling module.
fusion_model, visual.dropout, visual.side_network.resblocks.0.attn, visual.side_network.resblocks.0.attn.out_proj, visual.side_network.resblocks.0.conv, visual.side_network.resblocks.0.conv.0, visual.side_network.resblocks.0.conv.1, visual.side_network.resblocks.0.conv.2, visual.side_network.resblocks.0.mlp, visual.side_network.resblocks.0.mlp.act, visual.side_network.resblocks.0.mlp.drop, visual.side_network.resblocks.0.mlp.fc1, visual.side_network.resblocks.0.mlp.fc2, visual.side_network.resblocks.1.attn, visual.side_network.resblocks.1.attn.out_proj, visual.side_network.resblocks.1.conv, visual.side_network.resblocks.1.conv.0, visual.side_network.resblocks.1.conv.1, visual.side_network.resblocks.1.conv.2, visual.side_network.resblocks.1.mlp, visual.side_network.resblocks.1.mlp.act, visual.side_network.resblocks.1.mlp.drop, visual.side_network.resblocks.1.mlp.fc1, visual.side_network.resblocks.1.mlp.fc2, visual.side_network.resblocks.10.attn, visual.side_network.resblocks.10.attn.out_proj, visual.side_network.resblocks.10.conv, visual.side_network.resblocks.10.conv.0, visual.side_network.resblocks.10.conv.1, visual.side_network.resblocks.10.conv.2, visual.side_network.resblocks.10.mlp, visual.side_network.resblocks.10.mlp.act, visual.side_network.resblocks.10.mlp.drop, visual.side_network.resblocks.10.mlp.fc1, visual.side_network.resblocks.10.mlp.fc2, visual.side_network.resblocks.11.attn, visual.side_network.resblocks.11.attn.out_proj, visual.side_network.resblocks.11.conv, visual.side_network.resblocks.11.conv.0, visual.side_network.resblocks.11.conv.1, visual.side_network.resblocks.11.conv.2, visual.side_network.resblocks.11.mlp, visual.side_network.resblocks.11.mlp.act, visual.side_network.resblocks.11.mlp.drop, visual.side_network.resblocks.11.mlp.fc1, visual.side_network.resblocks.11.mlp.fc2, visual.side_network.resblocks.12.attn, visual.side_network.resblocks.12.attn.out_proj, visual.side_network.resblocks.12.conv, visual.side_network.resblocks.12.conv.0, visual.side_network.resblocks.12.conv.1, visual.side_network.resblocks.12.conv.2, visual.side_network.resblocks.12.mlp, visual.side_network.resblocks.12.mlp.act, visual.side_network.resblocks.12.mlp.drop, visual.side_network.resblocks.12.mlp.fc1, visual.side_network.resblocks.12.mlp.fc2, visual.side_network.resblocks.13.attn, visual.side_network.resblocks.13.attn.out_proj, visual.side_network.resblocks.13.conv, visual.side_network.resblocks.13.conv.0, visual.side_network.resblocks.13.conv.1, visual.side_network.resblocks.13.conv.2, visual.side_network.resblocks.13.mlp, visual.side_network.resblocks.13.mlp.act, visual.side_network.resblocks.13.mlp.drop, visual.side_network.resblocks.13.mlp.fc1, visual.side_network.resblocks.13.mlp.fc2, visual.side_network.resblocks.14.attn, visual.side_network.resblocks.14.attn.out_proj, visual.side_network.resblocks.14.conv, visual.side_network.resblocks.14.conv.0, visual.side_network.resblocks.14.conv.1, visual.side_network.resblocks.14.conv.2, visual.side_network.resblocks.14.mlp, visual.side_network.resblocks.14.mlp.act, visual.side_network.resblocks.14.mlp.drop, visual.side_network.resblocks.14.mlp.fc1, visual.side_network.resblocks.14.mlp.fc2, visual.side_network.resblocks.15.attn, visual.side_network.resblocks.15.attn.out_proj, visual.side_network.resblocks.15.conv, visual.side_network.resblocks.15.conv.0, visual.side_network.resblocks.15.conv.1, visual.side_network.resblocks.15.conv.2, visual.side_network.resblocks.15.mlp, visual.side_network.resblocks.15.mlp.act, visual.side_network.resblocks.15.mlp.drop, visual.side_network.resblocks.15.mlp.fc1, visual.side_network.resblocks.15.mlp.fc2, visual.side_network.resblocks.16.attn, visual.side_network.resblocks.16.attn.out_proj, visual.side_network.resblocks.16.conv, visual.side_network.resblocks.16.conv.0, visual.side_network.resblocks.16.conv.1, visual.side_network.resblocks.16.conv.2, visual.side_network.resblocks.16.mlp, visual.side_network.resblocks.16.mlp.act, visual.side_network.resblocks.16.mlp.drop, visual.side_network.resblocks.16.mlp.fc1, visual.side_network.resblocks.16.mlp.fc2, visual.side_network.resblocks.17.attn, visual.side_network.resblocks.17.attn.out_proj, visual.side_network.resblocks.17.conv, visual.side_network.resblocks.17.conv.0, visual.side_network.resblocks.17.conv.1, visual.side_network.resblocks.17.conv.2, visual.side_network.resblocks.17.mlp, visual.side_network.resblocks.17.mlp.act, visual.side_network.resblocks.17.mlp.drop, visual.side_network.resblocks.17.mlp.fc1, visual.side_network.resblocks.17.mlp.fc2, visual.side_network.resblocks.18.attn, visual.side_network.resblocks.18.attn.out_proj, visual.side_network.resblocks.18.conv, visual.side_network.resblocks.18.conv.0, visual.side_network.resblocks.18.conv.1, visual.side_network.resblocks.18.conv.2, visual.side_network.resblocks.18.mlp, visual.side_network.resblocks.18.mlp.act, visual.side_network.resblocks.18.mlp.drop, visual.side_network.resblocks.18.mlp.fc1, visual.side_network.resblocks.18.mlp.fc2, visual.side_network.resblocks.19.attn, visual.side_network.resblocks.19.attn.out_proj, visual.side_network.resblocks.19.conv, visual.side_network.resblocks.19.conv.0, visual.side_network.resblocks.19.conv.1, visual.side_network.resblocks.19.conv.2, visual.side_network.resblocks.19.mlp, visual.side_network.resblocks.19.mlp.act, visual.side_network.resblocks.19.mlp.drop, visual.side_network.resblocks.19.mlp.fc1, visual.side_network.resblocks.19.mlp.fc2, visual.side_network.resblocks.2.attn, visual.side_network.resblocks.2.attn.out_proj, visual.side_network.resblocks.2.conv, visual.side_network.resblocks.2.conv.0, visual.side_network.resblocks.2.conv.1, visual.side_network.resblocks.2.conv.2, visual.side_network.resblocks.2.mlp, visual.side_network.resblocks.2.mlp.act, visual.side_network.resblocks.2.mlp.drop, visual.side_network.resblocks.2.mlp.fc1, visual.side_network.resblocks.2.mlp.fc2, visual.side_network.resblocks.20.attn, visual.side_network.resblocks.20.attn.out_proj, visual.side_network.resblocks.20.conv, visual.side_network.resblocks.20.conv.0, visual.side_network.resblocks.20.conv.1, visual.side_network.resblocks.20.conv.2, visual.side_network.resblocks.20.mlp, visual.side_network.resblocks.20.mlp.act, visual.side_network.resblocks.20.mlp.drop, visual.side_network.resblocks.20.mlp.fc1, visual.side_network.resblocks.20.mlp.fc2, visual.side_network.resblocks.21.attn, visual.side_network.resblocks.21.attn.out_proj, visual.side_network.resblocks.21.conv, visual.side_network.resblocks.21.conv.0, visual.side_network.resblocks.21.conv.1, visual.side_network.resblocks.21.conv.2, visual.side_network.resblocks.21.mlp, visual.side_network.resblocks.21.mlp.act, visual.side_network.resblocks.21.mlp.drop, visual.side_network.resblocks.21.mlp.fc1, visual.side_network.resblocks.21.mlp.fc2, visual.side_network.resblocks.22.attn, visual.side_network.resblocks.22.attn.out_proj, visual.side_network.resblocks.22.conv, visual.side_network.resblocks.22.conv.0, visual.side_network.resblocks.22.conv.1, visual.side_network.resblocks.22.conv.2, visual.side_network.resblocks.22.mlp, visual.side_network.resblocks.22.mlp.act, visual.side_network.resblocks.22.mlp.drop, visual.side_network.resblocks.22.mlp.fc1, visual.side_network.resblocks.22.mlp.fc2, visual.side_network.resblocks.23.attn, visual.side_network.resblocks.23.attn.out_proj, visual.side_network.resblocks.23.conv, visual.side_network.resblocks.23.conv.0, visual.side_network.resblocks.23.conv.1, visual.side_network.resblocks.23.conv.2, visual.side_network.resblocks.23.mlp, visual.side_network.resblocks.23.mlp.act, visual.side_network.resblocks.23.mlp.drop, visual.side_network.resblocks.23.mlp.fc1, visual.side_network.resblocks.23.mlp.fc2, visual.side_network.resblocks.3.attn, visual.side_network.resblocks.3.attn.out_proj, visual.side_network.resblocks.3.conv, visual.side_network.resblocks.3.conv.0, visual.side_network.resblocks.3.conv.1, visual.side_network.resblocks.3.conv.2, visual.side_network.resblocks.3.mlp, visual.side_network.resblocks.3.mlp.act, visual.side_network.resblocks.3.mlp.drop, visual.side_network.resblocks.3.mlp.fc1, visual.side_network.resblocks.3.mlp.fc2, visual.side_network.resblocks.4.attn, visual.side_network.resblocks.4.attn.out_proj, visual.side_network.resblocks.4.conv, visual.side_network.resblocks.4.conv.0, visual.side_network.resblocks.4.conv.1, visual.side_network.resblocks.4.conv.2, visual.side_network.resblocks.4.mlp, visual.side_network.resblocks.4.mlp.act, visual.side_network.resblocks.4.mlp.drop, visual.side_network.resblocks.4.mlp.fc1, visual.side_network.resblocks.4.mlp.fc2, visual.side_network.resblocks.5.attn, visual.side_network.resblocks.5.attn.out_proj, visual.side_network.resblocks.5.conv, visual.side_network.resblocks.5.conv.0, visual.side_network.resblocks.5.conv.1, visual.side_network.resblocks.5.conv.2, visual.side_network.resblocks.5.mlp, visual.side_network.resblocks.5.mlp.act, visual.side_network.resblocks.5.mlp.drop, visual.side_network.resblocks.5.mlp.fc1, visual.side_network.resblocks.5.mlp.fc2, visual.side_network.resblocks.6.attn, visual.side_network.resblocks.6.attn.out_proj, visual.side_network.resblocks.6.conv, visual.side_network.resblocks.6.conv.0, visual.side_network.resblocks.6.conv.1, visual.side_network.resblocks.6.conv.2, visual.side_network.resblocks.6.mlp, visual.side_network.resblocks.6.mlp.act, visual.side_network.resblocks.6.mlp.drop, visual.side_network.resblocks.6.mlp.fc1, visual.side_network.resblocks.6.mlp.fc2, visual.side_network.resblocks.7.attn, visual.side_network.resblocks.7.attn.out_proj, visual.side_network.resblocks.7.conv, visual.side_network.resblocks.7.conv.0, visual.side_network.resblocks.7.conv.1, visual.side_network.resblocks.7.conv.2, visual.side_network.resblocks.7.mlp, visual.side_network.resblocks.7.mlp.act, visual.side_network.resblocks.7.mlp.drop, visual.side_network.resblocks.7.mlp.fc1, visual.side_network.resblocks.7.mlp.fc2, visual.side_network.resblocks.8.attn, visual.side_network.resblocks.8.attn.out_proj, visual.side_network.resblocks.8.conv, visual.side_network.resblocks.8.conv.0, visual.side_network.resblocks.8.conv.1, visual.side_network.resblocks.8.conv.2, visual.side_network.resblocks.8.mlp, visual.side_network.resblocks.8.mlp.act, visual.side_network.resblocks.8.mlp.drop, visual.side_network.resblocks.8.mlp.fc1, visual.side_network.resblocks.8.mlp.fc2, visual.side_network.resblocks.9.attn, visual.side_network.resblocks.9.attn.out_proj, visual.side_network.resblocks.9.conv, visual.side_network.resblocks.9.conv.0, visual.side_network.resblocks.9.conv.1, visual.side_network.resblocks.9.conv.2, visual.side_network.resblocks.9.mlp, visual.side_network.resblocks.9.mlp.act, visual.side_network.resblocks.9.mlp.drop, visual.side_network.resblocks.9.mlp.fc1, visual.side_network.resblocks.9.mlp.fc2, visual.transformer.resblocks.0.attn.out_proj, visual.transformer.resblocks.1.attn.out_proj, visual.transformer.resblocks.10.attn.out_proj, visual.transformer.resblocks.11.attn.out_proj, visual.transformer.resblocks.12.attn.out_proj, visual.transformer.resblocks.13.attn.out_proj, visual.transformer.resblocks.14.attn.out_proj, visual.transformer.resblocks.15.attn.out_proj, visual.transformer.resblocks.16.attn.out_proj, visual.transformer.resblocks.17.attn.out_proj, visual.transformer.resblocks.18.attn.out_proj, visual.transformer.resblocks.19.attn.out_proj, visual.transformer.resblocks.2.attn.out_proj, visual.transformer.resblocks.20.attn.out_proj, visual.transformer.resblocks.21.attn.out_proj, visual.transformer.resblocks.22.attn.out_proj, visual.transformer.resblocks.23.attn.out_proj, visual.transformer.resblocks.3.attn.out_proj, visual.transformer.resblocks.4.attn.out_proj, visual.transformer.resblocks.5.attn.out_proj, visual.transformer.resblocks.6.attn.out_proj, visual.transformer.resblocks.7.attn.out_proj, visual.transformer.resblocks.8.attn.out_proj, visual.transformer.resblocks.9.attn.out_proj
[02/27 08:40:18][INFO] utils.py:  502: Flops: 2.732T
[02/27 08:40:18][INFO] utils.py:  504: Params: 385.423M, tunable Params: 82.245M
[02/27 08:40:18][INFO] train_vision.py:  297: train transforms: [Compose(
    <datasets.transforms.GroupScale object at 0x145f27d51250>
    Compose(
    <datasets.transforms.GroupRandomSizedCrop object at 0x145f27d51700>
    <datasets.transforms.GroupRandomHorizontalFlip object at 0x145f27d51730>
)
    <datasets.transforms.GroupRandomGrayscale object at 0x145f27d51a00>
), Compose(
    <datasets.transforms.Stack object at 0x145f27d511c0>
    <datasets.transforms.ToTorchFormatTensor object at 0x145f27d51670>
    <datasets.transforms.GroupNormalize object at 0x145f27d51100>
)]
[02/27 08:40:18][INFO] train_vision.py:  298: val transforms: [Compose(
    <datasets.transforms.GroupScale object at 0x145f27d51e50>
    <datasets.transforms.GroupCenterCrop object at 0x145f278ea1c0>
), Compose(
    <datasets.transforms.Stack object at 0x145f278ea820>
    <datasets.transforms.ToTorchFormatTensor object at 0x145f278ea070>
    <datasets.transforms.GroupNormalize object at 0x145f27d51e20>
)]
[02/27 08:40:18][INFO] train_vision.py:  361: => Using label smoothing: 0.1
[02/27 08:40:18][INFO] train_vision.py:  372: => loading checkpoint 'exp/s4v_selfy_vitl14_16x224_k400_run3/model_best.pt'
[02/27 08:40:19][INFO] train_vision.py:  384: => pop last fc layer
[02/27 08:40:32][INFO] train_vision.py:  668: Epoch: [0][0/329], lr: 2.00e-07, eta: 1 day, 10:20:15	Time 12.523 (12.523)	Data 2.749 (2.749)	Mem 40.67GB	Prec@1 0.000 (0.000)	Loss 4.5993 (4.5993)
[02/27 08:40:32][INFO] distributed.py:  995: Reducer buckets have been rebuilt in this iteration.
[02/27 08:41:03][INFO] train_vision.py:  668: Epoch: [0][10/329], lr: 1.57e-06, eta: 10:44:05	Time 2.947 (3.919)	Data 0.048 (0.290)	Mem 41.61GB	Prec@1 0.000 (0.909)	Loss 4.7417 (4.6383)
[02/27 08:41:32][INFO] train_vision.py:  668: Epoch: [0][20/329], lr: 3.08e-06, eta: 9:28:50	Time 2.997 (3.465)	Data 0.089 (0.183)	Mem 41.61GB	Prec@1 10.000 (2.381)	Loss 4.5012 (4.6052)
[02/27 08:42:02][INFO] train_vision.py:  668: Epoch: [0][30/329], lr: 4.60e-06, eta: 9:02:52	Time 2.977 (3.310)	Data 0.057 (0.143)	Mem 41.61GB	Prec@1 0.000 (1.613)	Loss 4.5718 (4.6243)
[02/27 08:42:32][INFO] train_vision.py:  668: Epoch: [0][40/329], lr: 6.12e-06, eta: 8:50:01	Time 2.990 (3.235)	Data 0.052 (0.122)	Mem 41.61GB	Prec@1 0.000 (1.463)	Loss 4.5147 (4.6206)
[02/27 08:43:02][INFO] train_vision.py:  668: Epoch: [0][50/329], lr: 7.64e-06, eta: 8:41:57	Time 2.996 (3.189)	Data 0.048 (0.108)	Mem 41.61GB	Prec@1 10.000 (2.941)	Loss 4.3807 (4.6006)
[02/27 08:43:32][INFO] train_vision.py:  668: Epoch: [0][60/329], lr: 9.16e-06, eta: 8:36:24	Time 2.990 (3.158)	Data 0.076 (0.100)	Mem 41.61GB	Prec@1 0.000 (3.607)	Loss 4.5109 (4.5822)
[02/27 08:44:02][INFO] train_vision.py:  668: Epoch: [0][70/329], lr: 1.07e-05, eta: 8:32:31	Time 3.003 (3.138)	Data 0.054 (0.094)	Mem 41.61GB	Prec@1 20.000 (4.225)	Loss 4.0933 (4.5579)
[02/27 08:44:32][INFO] train_vision.py:  668: Epoch: [0][80/329], lr: 1.22e-05, eta: 8:29:17	Time 2.998 (3.121)	Data 0.051 (0.089)	Mem 41.61GB	Prec@1 10.000 (5.556)	Loss 4.2192 (4.5138)
[02/27 08:45:03][INFO] train_vision.py:  668: Epoch: [0][90/329], lr: 1.37e-05, eta: 8:26:49	Time 3.005 (3.109)	Data 0.053 (0.086)	Mem 41.61GB	Prec@1 10.000 (6.374)	Loss 4.0061 (4.4691)
[02/27 08:45:33][INFO] train_vision.py:  668: Epoch: [0][100/329], lr: 1.52e-05, eta: 8:24:42	Time 3.015 (3.099)	Data 0.059 (0.083)	Mem 41.61GB	Prec@1 10.000 (7.525)	Loss 4.0912 (4.4240)
[02/27 08:46:03][INFO] train_vision.py:  668: Epoch: [0][110/329], lr: 1.67e-05, eta: 8:22:57	Time 3.000 (3.092)	Data 0.045 (0.081)	Mem 41.61GB	Prec@1 20.000 (8.198)	Loss 3.7242 (4.3772)
[02/27 08:46:33][INFO] train_vision.py:  668: Epoch: [0][120/329], lr: 1.83e-05, eta: 8:21:17	Time 3.012 (3.085)	Data 0.059 (0.080)	Mem 41.61GB	Prec@1 20.000 (8.512)	Loss 3.9950 (4.3438)
[02/27 08:47:03][INFO] train_vision.py:  668: Epoch: [0][130/329], lr: 1.98e-05, eta: 8:19:47	Time 3.005 (3.079)	Data 0.052 (0.078)	Mem 41.61GB	Prec@1 20.000 (8.855)	Loss 3.7587 (4.3143)
[02/27 08:47:33][INFO] train_vision.py:  668: Epoch: [0][140/329], lr: 2.13e-05, eta: 8:18:30	Time 3.011 (3.074)	Data 0.053 (0.077)	Mem 41.61GB	Prec@1 30.000 (9.007)	Loss 3.4947 (4.2738)
[02/27 08:48:03][INFO] train_vision.py:  668: Epoch: [0][150/329], lr: 2.28e-05, eta: 8:17:14	Time 3.001 (3.069)	Data 0.041 (0.075)	Mem 41.61GB	Prec@1 20.000 (9.735)	Loss 3.3390 (4.2289)
[02/27 08:48:33][INFO] train_vision.py:  668: Epoch: [0][160/329], lr: 2.43e-05, eta: 8:16:08	Time 3.013 (3.065)	Data 0.080 (0.075)	Mem 41.61GB	Prec@1 20.000 (9.814)	Loss 3.6243 (4.2022)
[02/27 08:49:03][INFO] train_vision.py:  668: Epoch: [0][170/329], lr: 2.59e-05, eta: 8:15:00	Time 2.991 (3.062)	Data 0.044 (0.073)	Mem 41.61GB	Prec@1 30.000 (10.585)	Loss 3.5868 (4.1599)
[02/27 08:49:33][INFO] train_vision.py:  668: Epoch: [0][180/329], lr: 2.74e-05, eta: 8:14:01	Time 3.050 (3.059)	Data 0.076 (0.073)	Mem 41.61GB	Prec@1 40.000 (11.215)	Loss 3.3950 (4.1155)
[02/27 08:50:03][INFO] train_vision.py:  668: Epoch: [0][190/329], lr: 2.89e-05, eta: 8:13:06	Time 3.008 (3.056)	Data 0.056 (0.072)	Mem 41.61GB	Prec@1 20.000 (11.832)	Loss 3.5216 (4.0757)
[02/27 08:50:33][INFO] train_vision.py:  668: Epoch: [0][200/329], lr: 3.04e-05, eta: 8:12:14	Time 3.017 (3.054)	Data 0.035 (0.071)	Mem 41.61GB	Prec@1 0.000 (12.139)	Loss 3.7479 (4.0427)
[02/27 08:51:03][INFO] train_vision.py:  668: Epoch: [0][210/329], lr: 3.19e-05, eta: 8:11:20	Time 2.970 (3.052)	Data 0.053 (0.071)	Mem 41.61GB	Prec@1 30.000 (12.464)	Loss 3.0343 (4.0150)
[02/27 08:51:34][INFO] train_vision.py:  668: Epoch: [0][220/329], lr: 3.34e-05, eta: 8:10:31	Time 2.998 (3.050)	Data 0.087 (0.070)	Mem 41.61GB	Prec@1 40.000 (13.122)	Loss 3.1783 (3.9832)
[02/27 08:52:04][INFO] train_vision.py:  668: Epoch: [0][230/329], lr: 3.50e-05, eta: 8:09:44	Time 3.001 (3.048)	Data 0.050 (0.069)	Mem 41.61GB	Prec@1 30.000 (14.113)	Loss 3.3878 (3.9447)
[02/27 08:52:34][INFO] train_vision.py:  668: Epoch: [0][240/329], lr: 3.65e-05, eta: 8:08:59	Time 3.043 (3.046)	Data 0.091 (0.069)	Mem 41.61GB	Prec@1 40.000 (14.938)	Loss 2.6385 (3.9090)
[02/27 08:53:04][INFO] train_vision.py:  668: Epoch: [0][250/329], lr: 3.80e-05, eta: 8:08:15	Time 3.012 (3.045)	Data 0.054 (0.068)	Mem 41.61GB	Prec@1 40.000 (15.777)	Loss 3.0009 (3.8677)
[02/27 08:53:34][INFO] train_vision.py:  668: Epoch: [0][260/329], lr: 3.95e-05, eta: 8:07:33	Time 3.013 (3.044)	Data 0.050 (0.068)	Mem 41.61GB	Prec@1 60.000 (16.743)	Loss 2.2328 (3.8248)
[02/27 08:54:04][INFO] train_vision.py:  668: Epoch: [0][270/329], lr: 4.10e-05, eta: 8:06:47	Time 2.997 (3.042)	Data 0.048 (0.068)	Mem 41.61GB	Prec@1 30.000 (17.380)	Loss 3.0084 (3.7958)
[02/27 08:54:34][INFO] train_vision.py:  668: Epoch: [0][280/329], lr: 4.26e-05, eta: 8:06:06	Time 3.027 (3.041)	Data 0.090 (0.068)	Mem 41.61GB	Prec@1 30.000 (17.936)	Loss 3.0639 (3.7632)
[02/27 08:55:04][INFO] train_vision.py:  668: Epoch: [0][290/329], lr: 4.41e-05, eta: 8:05:27	Time 2.998 (3.040)	Data 0.055 (0.068)	Mem 41.61GB	Prec@1 30.000 (18.625)	Loss 2.8573 (3.7306)
[02/27 08:55:34][INFO] train_vision.py:  668: Epoch: [0][300/329], lr: 4.56e-05, eta: 8:04:45	Time 2.978 (3.039)	Data 0.062 (0.067)	Mem 41.61GB	Prec@1 20.000 (19.136)	Loss 3.5429 (3.7033)
[02/27 08:56:04][INFO] train_vision.py:  668: Epoch: [0][310/329], lr: 4.71e-05, eta: 8:04:07	Time 2.996 (3.038)	Data 0.050 (0.067)	Mem 41.61GB	Prec@1 20.000 (19.614)	Loss 3.2358 (3.6790)
[02/27 08:56:34][INFO] train_vision.py:  668: Epoch: [0][320/329], lr: 4.86e-05, eta: 8:03:27	Time 3.007 (3.037)	Data 0.058 (0.067)	Mem 41.61GB	Prec@1 10.000 (19.875)	Loss 3.5789 (3.6553)
[02/27 08:57:04][INFO] train_vision.py:  668: Epoch: [1][0/329], lr: 5.01e-05, eta: 14:06:10	Time 5.321 (5.321)	Data 2.380 (2.380)	Mem 41.61GB	Prec@1 30.000 (30.000)	Loss 3.0322 (3.0322)
[02/27 08:57:35][INFO] train_vision.py:  668: Epoch: [1][10/329], lr: 5.15e-05, eta: 8:32:15	Time 3.015 (3.224)	Data 0.059 (0.277)	Mem 41.61GB	Prec@1 20.000 (40.000)	Loss 3.0117 (2.7324)
[02/27 08:58:05][INFO] train_vision.py:  668: Epoch: [1][20/329], lr: 5.30e-05, eta: 8:17:05	Time 3.038 (3.132)	Data 0.087 (0.182)	Mem 41.61GB	Prec@1 40.000 (42.381)	Loss 2.4240 (2.6735)
[02/27 08:58:35][INFO] train_vision.py:  668: Epoch: [1][30/329], lr: 5.46e-05, eta: 8:11:01	Time 3.036 (3.097)	Data 0.046 (0.143)	Mem 41.61GB	Prec@1 40.000 (43.226)	Loss 2.8287 (2.6373)
[02/27 08:59:05][INFO] train_vision.py:  668: Epoch: [1][40/329], lr: 5.61e-05, eta: 8:07:50	Time 3.046 (3.080)	Data 0.075 (0.124)	Mem 41.61GB	Prec@1 60.000 (42.195)	Loss 2.2203 (2.6477)
[02/27 08:59:36][INFO] train_vision.py:  668: Epoch: [1][50/329], lr: 5.76e-05, eta: 8:05:45	Time 3.027 (3.071)	Data 0.055 (0.111)	Mem 41.61GB	Prec@1 50.000 (40.196)	Loss 2.5427 (2.6916)
[02/27 09:00:06][INFO] train_vision.py:  668: Epoch: [1][60/329], lr: 5.91e-05, eta: 8:04:15	Time 3.006 (3.064)	Data 0.060 (0.103)	Mem 41.61GB	Prec@1 50.000 (41.803)	Loss 2.5095 (2.6546)
[02/27 09:00:36][INFO] train_vision.py:  668: Epoch: [1][70/329], lr: 6.06e-05, eta: 8:02:58	Time 3.031 (3.059)	Data 0.035 (0.096)	Mem 41.61GB	Prec@1 50.000 (41.972)	Loss 2.4899 (2.6424)
[02/27 09:01:07][INFO] train_vision.py:  668: Epoch: [1][80/329], lr: 6.21e-05, eta: 8:01:49	Time 3.053 (3.055)	Data 0.054 (0.091)	Mem 41.61GB	Prec@1 20.000 (41.605)	Loss 2.6060 (2.6361)
[02/27 09:01:37][INFO] train_vision.py:  668: Epoch: [1][90/329], lr: 6.37e-05, eta: 8:00:59	Time 3.062 (3.053)	Data 0.068 (0.088)	Mem 41.61GB	Prec@1 80.000 (41.758)	Loss 1.9281 (2.6386)
[02/27 09:02:07][INFO] train_vision.py:  668: Epoch: [1][100/329], lr: 6.52e-05, eta: 8:00:00	Time 3.055 (3.050)	Data 0.042 (0.085)	Mem 41.61GB	Prec@1 50.000 (41.584)	Loss 2.8580 (2.6340)
[02/27 09:02:37][INFO] train_vision.py:  668: Epoch: [1][110/329], lr: 6.67e-05, eta: 7:59:05	Time 2.973 (3.048)	Data 0.040 (0.082)	Mem 41.61GB	Prec@1 60.000 (42.432)	Loss 2.2193 (2.6102)
[02/27 09:03:08][INFO] train_vision.py:  668: Epoch: [1][120/329], lr: 6.82e-05, eta: 7:58:12	Time 3.005 (3.045)	Data 0.049 (0.080)	Mem 41.61GB	Prec@1 40.000 (42.314)	Loss 2.9259 (2.6079)
[02/27 09:03:38][INFO] train_vision.py:  668: Epoch: [1][130/329], lr: 6.97e-05, eta: 7:57:16	Time 2.989 (3.043)	Data 0.054 (0.078)	Mem 41.61GB	Prec@1 30.000 (42.595)	Loss 2.6724 (2.5957)
[02/27 09:04:08][INFO] train_vision.py:  668: Epoch: [1][140/329], lr: 7.13e-05, eta: 7:56:27	Time 3.012 (3.041)	Data 0.052 (0.077)	Mem 41.61GB	Prec@1 60.000 (42.766)	Loss 2.2884 (2.5832)
[02/27 09:04:38][INFO] train_vision.py:  668: Epoch: [1][150/329], lr: 7.28e-05, eta: 7:55:42	Time 3.006 (3.039)	Data 0.038 (0.075)	Mem 41.61GB	Prec@1 40.000 (42.980)	Loss 2.4190 (2.5785)
[02/27 09:05:08][INFO] train_vision.py:  668: Epoch: [1][160/329], lr: 7.43e-05, eta: 7:54:53	Time 3.028 (3.037)	Data 0.035 (0.074)	Mem 41.61GB	Prec@1 30.000 (42.422)	Loss 2.7935 (2.5850)
[02/27 09:05:38][INFO] train_vision.py:  668: Epoch: [1][170/329], lr: 7.58e-05, eta: 7:54:13	Time 2.995 (3.036)	Data 0.047 (0.073)	Mem 41.61GB	Prec@1 60.000 (42.456)	Loss 2.4963 (2.5866)
[02/27 09:06:08][INFO] train_vision.py:  668: Epoch: [1][180/329], lr: 7.73e-05, eta: 7:53:28	Time 2.984 (3.034)	Data 0.047 (0.072)	Mem 41.61GB	Prec@1 60.000 (42.983)	Loss 2.2857 (2.5613)
[02/27 09:06:38][INFO] train_vision.py:  668: Epoch: [1][190/329], lr: 7.88e-05, eta: 7:52:45	Time 3.002 (3.033)	Data 0.050 (0.071)	Mem 41.61GB	Prec@1 50.000 (43.403)	Loss 2.2461 (2.5466)
[02/27 09:07:09][INFO] train_vision.py:  668: Epoch: [1][200/329], lr: 8.04e-05, eta: 7:52:02	Time 3.016 (3.032)	Data 0.054 (0.070)	Mem 41.61GB	Prec@1 60.000 (43.731)	Loss 1.7777 (2.5352)
[02/27 09:07:39][INFO] train_vision.py:  668: Epoch: [1][210/329], lr: 8.19e-05, eta: 7:51:22	Time 3.013 (3.031)	Data 0.056 (0.069)	Mem 41.61GB	Prec@1 40.000 (44.076)	Loss 2.1745 (2.5197)
[02/27 09:08:09][INFO] train_vision.py:  668: Epoch: [1][220/329], lr: 8.34e-05, eta: 7:50:41	Time 3.016 (3.030)	Data 0.064 (0.068)	Mem 41.61GB	Prec@1 40.000 (44.389)	Loss 2.2013 (2.5089)
[02/27 09:08:39][INFO] train_vision.py:  668: Epoch: [1][230/329], lr: 8.49e-05, eta: 7:50:01	Time 3.011 (3.029)	Data 0.058 (0.068)	Mem 41.61GB	Prec@1 50.000 (44.675)	Loss 2.3000 (2.4975)
[02/27 09:09:09][INFO] train_vision.py:  668: Epoch: [1][240/329], lr: 8.64e-05, eta: 7:49:23	Time 3.036 (3.028)	Data 0.057 (0.067)	Mem 41.61GB	Prec@1 70.000 (45.021)	Loss 2.1693 (2.4832)
[02/27 09:09:39][INFO] train_vision.py:  668: Epoch: [1][250/329], lr: 8.80e-05, eta: 7:48:47	Time 3.013 (3.027)	Data 0.056 (0.066)	Mem 41.61GB	Prec@1 40.000 (45.179)	Loss 2.4262 (2.4738)
[02/27 09:10:09][INFO] train_vision.py:  668: Epoch: [1][260/329], lr: 8.95e-05, eta: 7:48:08	Time 3.011 (3.026)	Data 0.052 (0.066)	Mem 41.61GB	Prec@1 40.000 (45.249)	Loss 2.1168 (2.4663)
[02/27 09:10:39][INFO] train_vision.py:  668: Epoch: [1][270/329], lr: 9.10e-05, eta: 7:47:31	Time 3.011 (3.025)	Data 0.045 (0.066)	Mem 41.61GB	Prec@1 40.000 (45.166)	Loss 2.5992 (2.4612)
[02/27 09:11:09][INFO] train_vision.py:  668: Epoch: [1][280/329], lr: 9.25e-05, eta: 7:46:55	Time 3.005 (3.025)	Data 0.055 (0.065)	Mem 41.61GB	Prec@1 60.000 (45.623)	Loss 2.0555 (2.4508)
[02/27 09:11:39][INFO] train_vision.py:  668: Epoch: [1][290/329], lr: 9.40e-05, eta: 7:46:20	Time 3.008 (3.024)	Data 0.056 (0.065)	Mem 41.61GB	Prec@1 80.000 (45.979)	Loss 1.5685 (2.4380)
[02/27 09:12:09][INFO] train_vision.py:  668: Epoch: [1][300/329], lr: 9.55e-05, eta: 7:45:44	Time 3.013 (3.024)	Data 0.043 (0.064)	Mem 41.61GB	Prec@1 50.000 (46.047)	Loss 2.3325 (2.4349)
[02/27 09:12:39][INFO] train_vision.py:  668: Epoch: [1][310/329], lr: 9.71e-05, eta: 7:45:10	Time 3.035 (3.023)	Data 0.023 (0.064)	Mem 41.61GB	Prec@1 80.000 (46.399)	Loss 1.8962 (2.4207)
[02/27 09:13:09][INFO] train_vision.py:  668: Epoch: [1][320/329], lr: 9.86e-05, eta: 7:44:35	Time 3.013 (3.023)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 70.000 (46.417)	Loss 1.8078 (2.4145)
[02/27 09:13:42][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 66.250 (66.250)	Prec@5 95.000 (95.000)	mPrec@1 (17.273)	mPrec@5 (30.101)
[02/27 09:14:24][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 71.250 (66.250)	Prec@5 100.000 (95.682)	mPrec@1 (36.463)	mPrec@5 (76.752)
[02/27 09:15:07][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 71.250 (64.524)	Prec@5 100.000 (95.000)	mPrec@1 (41.404)	mPrec@5 (83.768)
[02/27 09:15:49][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 63.750 (64.234)	Prec@5 98.750 (94.435)	mPrec@1 (40.968)	mPrec@5 (84.784)
[02/27 09:16:32][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 62.500 (63.811)	Prec@5 91.250 (94.299)	mPrec@1 (40.717)	mPrec@5 (86.336)
[02/27 09:17:15][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 65.000 (64.118)	Prec@5 96.250 (94.485)	mPrec@1 (40.269)	mPrec@5 (85.872)
[02/27 09:17:58][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 67.500 (63.607)	Prec@5 96.250 (94.303)	mPrec@1 (40.164)	mPrec@5 (85.656)
[02/27 09:18:41][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 50.000 (63.204)	Prec@5 83.750 (94.313)	mPrec@1 (40.030)	mPrec@5 (85.741)
[02/27 09:19:23][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 51.250 (62.176)	Prec@5 88.750 (93.935)	mPrec@1 (39.967)	mPrec@5 (85.774)
[02/27 09:20:06][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 66.250 (61.236)	Prec@5 95.000 (93.832)	mPrec@1 (40.285)	mPrec@5 (85.945)
[02/27 09:20:48][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 55.000 (61.621)	Prec@5 88.750 (93.775)	mPrec@1 (40.577)	mPrec@5 (85.875)
[02/27 09:21:13][INFO] train_vision.py:  847: Overall Prec@1 60.763% Prec@5 93.732% mPrec@1 (40.665) mPrec@5 (85.933)
[02/27 09:21:13][INFO] train_vision.py:  464: Testing: 40.665008544921875/40.665008544921875
[02/27 09:21:13][INFO] train_vision.py:  465: Saving:
[02/27 09:21:23][INFO] train_vision.py:  668: Epoch: [2][0/329], lr: 1.00e-04, eta: 13:22:26	Time 5.226 (5.226)	Data 2.163 (2.163)	Mem 41.61GB	Prec@1 60.000 (60.000)	Loss 1.8956 (1.8956)
[02/27 09:21:53][INFO] train_vision.py:  668: Epoch: [2][10/329], lr: 1.01e-04, eta: 8:12:58	Time 3.020 (3.214)	Data 0.050 (0.249)	Mem 41.61GB	Prec@1 70.000 (49.091)	Loss 1.6421 (2.2567)
[02/27 09:22:23][INFO] train_vision.py:  668: Epoch: [2][20/329], lr: 1.03e-04, eta: 7:56:56	Time 3.039 (3.113)	Data 0.029 (0.156)	Mem 41.61GB	Prec@1 50.000 (55.238)	Loss 1.9298 (2.1271)
[02/27 09:22:53][INFO] train_vision.py:  668: Epoch: [2][30/329], lr: 1.05e-04, eta: 7:51:19	Time 3.035 (3.079)	Data 0.085 (0.125)	Mem 41.61GB	Prec@1 50.000 (56.774)	Loss 2.0856 (2.0488)
[02/27 09:23:23][INFO] train_vision.py:  668: Epoch: [2][40/329], lr: 1.06e-04, eta: 7:47:51	Time 3.008 (3.060)	Data 0.059 (0.107)	Mem 41.61GB	Prec@1 60.000 (56.341)	Loss 2.1299 (2.0772)
[02/27 09:23:53][INFO] train_vision.py:  668: Epoch: [2][50/329], lr: 1.08e-04, eta: 7:45:43	Time 2.996 (3.050)	Data 0.049 (0.098)	Mem 41.61GB	Prec@1 70.000 (57.059)	Loss 1.8622 (2.0660)
[02/27 09:24:23][INFO] train_vision.py:  668: Epoch: [2][60/329], lr: 1.09e-04, eta: 7:44:01	Time 3.009 (3.042)	Data 0.053 (0.091)	Mem 41.61GB	Prec@1 30.000 (57.377)	Loss 2.7577 (2.0484)
[02/27 09:24:53][INFO] train_vision.py:  668: Epoch: [2][70/329], lr: 1.11e-04, eta: 7:42:43	Time 3.001 (3.037)	Data 0.047 (0.086)	Mem 41.61GB	Prec@1 80.000 (58.592)	Loss 1.5445 (2.0306)
[02/27 09:25:23][INFO] train_vision.py:  668: Epoch: [2][80/329], lr: 1.12e-04, eta: 7:41:37	Time 3.000 (3.033)	Data 0.049 (0.083)	Mem 41.61GB	Prec@1 60.000 (58.272)	Loss 2.0102 (2.0398)
[02/27 09:25:53][INFO] train_vision.py:  668: Epoch: [2][90/329], lr: 1.14e-04, eta: 7:40:42	Time 3.014 (3.030)	Data 0.078 (0.080)	Mem 41.61GB	Prec@1 50.000 (58.462)	Loss 2.2657 (2.0394)
[02/27 09:26:23][INFO] train_vision.py:  668: Epoch: [2][100/329], lr: 1.15e-04, eta: 7:39:51	Time 2.994 (3.028)	Data 0.064 (0.077)	Mem 41.61GB	Prec@1 30.000 (58.812)	Loss 2.7315 (2.0394)
[02/27 09:26:53][INFO] train_vision.py:  668: Epoch: [2][110/329], lr: 1.17e-04, eta: 7:39:00	Time 3.019 (3.025)	Data 0.021 (0.075)	Mem 41.61GB	Prec@1 60.000 (58.378)	Loss 2.1011 (2.0438)
[02/27 09:27:23][INFO] train_vision.py:  668: Epoch: [2][120/329], lr: 1.18e-04, eta: 7:38:16	Time 3.015 (3.024)	Data 0.061 (0.073)	Mem 41.61GB	Prec@1 80.000 (58.843)	Loss 1.5069 (2.0321)
[02/27 09:27:53][INFO] train_vision.py:  668: Epoch: [2][130/329], lr: 1.20e-04, eta: 7:37:34	Time 2.996 (3.023)	Data 0.045 (0.072)	Mem 41.61GB	Prec@1 80.000 (59.160)	Loss 2.1263 (2.0319)
[02/27 09:28:24][INFO] train_vision.py:  668: Epoch: [2][140/329], lr: 1.21e-04, eta: 7:36:52	Time 2.990 (3.021)	Data 0.061 (0.071)	Mem 41.61GB	Prec@1 50.000 (59.433)	Loss 2.2245 (2.0255)
[02/27 09:28:53][INFO] train_vision.py:  668: Epoch: [2][150/329], lr: 1.23e-04, eta: 7:36:09	Time 2.995 (3.020)	Data 0.042 (0.069)	Mem 41.61GB	Prec@1 70.000 (59.338)	Loss 1.7975 (2.0230)
[02/27 09:29:24][INFO] train_vision.py:  668: Epoch: [2][160/329], lr: 1.24e-04, eta: 7:35:31	Time 3.017 (3.019)	Data 0.063 (0.068)	Mem 41.61GB	Prec@1 60.000 (59.379)	Loss 1.9003 (2.0187)
[02/27 09:29:54][INFO] train_vision.py:  668: Epoch: [2][170/329], lr: 1.26e-04, eta: 7:34:49	Time 3.002 (3.018)	Data 0.048 (0.067)	Mem 41.61GB	Prec@1 70.000 (59.708)	Loss 1.9245 (2.0115)
[02/27 09:30:24][INFO] train_vision.py:  668: Epoch: [2][180/329], lr: 1.27e-04, eta: 7:34:10	Time 3.006 (3.017)	Data 0.050 (0.066)	Mem 41.61GB	Prec@1 50.000 (59.613)	Loss 2.5916 (2.0099)
[02/27 09:30:54][INFO] train_vision.py:  668: Epoch: [2][190/329], lr: 1.29e-04, eta: 7:33:33	Time 3.015 (3.016)	Data 0.062 (0.066)	Mem 41.61GB	Prec@1 70.000 (59.791)	Loss 1.8582 (2.0082)
[02/27 09:31:24][INFO] train_vision.py:  668: Epoch: [2][200/329], lr: 1.30e-04, eta: 7:32:58	Time 2.990 (3.016)	Data 0.062 (0.065)	Mem 41.61GB	Prec@1 70.000 (60.000)	Loss 2.2022 (2.0062)
[02/27 09:31:54][INFO] train_vision.py:  668: Epoch: [2][210/329], lr: 1.32e-04, eta: 7:32:23	Time 3.000 (3.015)	Data 0.049 (0.065)	Mem 41.61GB	Prec@1 70.000 (60.379)	Loss 1.7804 (1.9931)
[02/27 09:32:24][INFO] train_vision.py:  668: Epoch: [2][220/329], lr: 1.33e-04, eta: 7:31:47	Time 3.011 (3.014)	Data 0.053 (0.064)	Mem 41.61GB	Prec@1 60.000 (60.136)	Loss 2.0320 (1.9943)
[02/27 09:32:54][INFO] train_vision.py:  668: Epoch: [2][230/329], lr: 1.35e-04, eta: 7:31:12	Time 2.995 (3.014)	Data 0.044 (0.063)	Mem 41.61GB	Prec@1 50.000 (60.303)	Loss 2.0378 (1.9869)
[02/27 09:33:24][INFO] train_vision.py:  668: Epoch: [2][240/329], lr: 1.36e-04, eta: 7:30:38	Time 2.990 (3.013)	Data 0.057 (0.063)	Mem 41.61GB	Prec@1 80.000 (60.539)	Loss 1.4787 (1.9804)
[02/27 09:33:54][INFO] train_vision.py:  668: Epoch: [2][250/329], lr: 1.38e-04, eta: 7:30:03	Time 2.993 (3.013)	Data 0.050 (0.062)	Mem 41.61GB	Prec@1 90.000 (60.797)	Loss 1.5569 (1.9750)
[02/27 09:34:24][INFO] train_vision.py:  668: Epoch: [2][260/329], lr: 1.39e-04, eta: 7:29:30	Time 3.010 (3.013)	Data 0.054 (0.062)	Mem 41.61GB	Prec@1 70.000 (60.690)	Loss 1.7820 (1.9706)
[02/27 09:34:54][INFO] train_vision.py:  668: Epoch: [2][270/329], lr: 1.41e-04, eta: 7:28:57	Time 3.004 (3.012)	Data 0.051 (0.061)	Mem 41.61GB	Prec@1 70.000 (60.590)	Loss 1.8860 (1.9694)
[02/27 09:35:24][INFO] train_vision.py:  668: Epoch: [2][280/329], lr: 1.42e-04, eta: 7:28:23	Time 2.999 (3.012)	Data 0.050 (0.061)	Mem 41.61GB	Prec@1 80.000 (60.463)	Loss 1.6953 (1.9701)
[02/27 09:35:54][INFO] train_vision.py:  668: Epoch: [2][290/329], lr: 1.44e-04, eta: 7:27:49	Time 2.992 (3.011)	Data 0.022 (0.061)	Mem 41.61GB	Prec@1 60.000 (60.722)	Loss 1.9368 (1.9654)
[02/27 09:36:24][INFO] train_vision.py:  668: Epoch: [2][300/329], lr: 1.45e-04, eta: 7:27:16	Time 3.000 (3.011)	Data 0.053 (0.060)	Mem 41.61GB	Prec@1 100.000 (60.930)	Loss 1.2515 (1.9580)
[02/27 09:36:54][INFO] train_vision.py:  668: Epoch: [2][310/329], lr: 1.47e-04, eta: 7:26:42	Time 2.995 (3.011)	Data 0.042 (0.060)	Mem 41.61GB	Prec@1 40.000 (61.318)	Loss 2.4341 (1.9487)
[02/27 09:37:24][INFO] train_vision.py:  668: Epoch: [2][320/329], lr: 1.49e-04, eta: 7:26:10	Time 2.990 (3.010)	Data 0.060 (0.060)	Mem 41.61GB	Prec@1 50.000 (61.121)	Loss 1.8488 (1.9489)
[02/27 09:37:54][INFO] train_vision.py:  668: Epoch: [3][0/329], lr: 1.50e-04, eta: 14:00:08	Time 5.674 (5.674)	Data 2.422 (2.422)	Mem 41.61GB	Prec@1 50.000 (50.000)	Loss 1.9481 (1.9481)
[02/27 09:38:24][INFO] train_vision.py:  668: Epoch: [3][10/329], lr: 1.51e-04, eta: 8:01:06	Time 2.988 (3.253)	Data 0.049 (0.262)	Mem 41.61GB	Prec@1 50.000 (64.545)	Loss 2.5457 (1.8975)
[02/27 09:38:54][INFO] train_vision.py:  668: Epoch: [3][20/329], lr: 1.53e-04, eta: 7:44:10	Time 3.014 (3.142)	Data 0.045 (0.163)	Mem 41.61GB	Prec@1 80.000 (64.762)	Loss 1.9745 (1.8604)
[02/27 09:39:24][INFO] train_vision.py:  668: Epoch: [3][30/329], lr: 1.54e-04, eta: 7:37:42	Time 2.997 (3.102)	Data 0.050 (0.126)	Mem 41.61GB	Prec@1 30.000 (64.839)	Loss 2.5600 (1.8775)
[02/27 09:39:55][INFO] train_vision.py:  668: Epoch: [3][40/329], lr: 1.56e-04, eta: 7:34:24	Time 2.990 (3.083)	Data 0.054 (0.109)	Mem 41.61GB	Prec@1 70.000 (64.146)	Loss 1.8163 (1.8626)
[02/27 09:40:25][INFO] train_vision.py:  668: Epoch: [3][50/329], lr: 1.57e-04, eta: 7:32:10	Time 3.016 (3.071)	Data 0.068 (0.100)	Mem 41.61GB	Prec@1 90.000 (66.275)	Loss 1.7276 (1.8262)
[02/27 09:40:55][INFO] train_vision.py:  668: Epoch: [3][60/329], lr: 1.59e-04, eta: 7:30:22	Time 2.977 (3.062)	Data 0.060 (0.094)	Mem 41.61GB	Prec@1 60.000 (66.230)	Loss 2.1970 (1.8326)
[02/27 09:41:25][INFO] train_vision.py:  668: Epoch: [3][70/329], lr: 1.61e-04, eta: 7:29:01	Time 3.009 (3.057)	Data 0.068 (0.087)	Mem 41.61GB	Prec@1 90.000 (67.042)	Loss 1.1821 (1.7956)
[02/27 09:41:55][INFO] train_vision.py:  668: Epoch: [3][80/329], lr: 1.62e-04, eta: 7:27:42	Time 2.996 (3.051)	Data 0.022 (0.082)	Mem 41.61GB	Prec@1 70.000 (66.420)	Loss 1.6494 (1.7977)
[02/27 09:42:25][INFO] train_vision.py:  668: Epoch: [3][90/329], lr: 1.64e-04, eta: 7:26:30	Time 3.004 (3.046)	Data 0.057 (0.078)	Mem 41.61GB	Prec@1 60.000 (65.275)	Loss 1.7853 (1.8121)
[02/27 09:42:55][INFO] train_vision.py:  668: Epoch: [3][100/329], lr: 1.65e-04, eta: 7:25:28	Time 3.007 (3.043)	Data 0.052 (0.076)	Mem 41.61GB	Prec@1 70.000 (65.446)	Loss 1.5830 (1.8023)
[02/27 09:43:26][INFO] train_vision.py:  668: Epoch: [3][110/329], lr: 1.67e-04, eta: 7:24:31	Time 2.999 (3.040)	Data 0.048 (0.074)	Mem 41.61GB	Prec@1 90.000 (65.045)	Loss 1.1300 (1.8152)
[02/27 09:43:56][INFO] train_vision.py:  668: Epoch: [3][120/329], lr: 1.68e-04, eta: 7:23:47	Time 3.004 (3.038)	Data 0.046 (0.072)	Mem 41.61GB	Prec@1 50.000 (65.537)	Loss 1.8413 (1.8049)
[02/27 09:44:26][INFO] train_vision.py:  668: Epoch: [3][130/329], lr: 1.70e-04, eta: 7:22:57	Time 2.999 (3.036)	Data 0.045 (0.070)	Mem 41.61GB	Prec@1 70.000 (65.420)	Loss 1.3945 (1.8080)
[02/27 09:44:56][INFO] train_vision.py:  668: Epoch: [3][140/329], lr: 1.71e-04, eta: 7:22:16	Time 2.985 (3.035)	Data 0.045 (0.068)	Mem 41.61GB	Prec@1 80.000 (65.461)	Loss 1.3367 (1.8036)
[02/27 09:45:26][INFO] train_vision.py:  668: Epoch: [3][150/329], lr: 1.73e-04, eta: 7:21:33	Time 3.007 (3.033)	Data 0.056 (0.068)	Mem 41.61GB	Prec@1 70.000 (65.033)	Loss 1.5214 (1.8106)
[02/27 09:45:56][INFO] train_vision.py:  668: Epoch: [3][160/329], lr: 1.74e-04, eta: 7:20:51	Time 3.055 (3.032)	Data 0.047 (0.066)	Mem 41.61GB	Prec@1 70.000 (65.031)	Loss 1.7811 (1.8145)
[02/27 09:46:27][INFO] train_vision.py:  668: Epoch: [3][170/329], lr: 1.76e-04, eta: 7:20:14	Time 3.016 (3.031)	Data 0.060 (0.066)	Mem 41.61GB	Prec@1 40.000 (64.737)	Loss 2.1667 (1.8213)
[02/27 09:46:57][INFO] train_vision.py:  668: Epoch: [3][180/329], lr: 1.77e-04, eta: 7:19:36	Time 3.013 (3.030)	Data 0.054 (0.065)	Mem 41.61GB	Prec@1 90.000 (64.586)	Loss 1.4178 (1.8236)
[02/27 09:47:27][INFO] train_vision.py:  668: Epoch: [3][190/329], lr: 1.79e-04, eta: 7:18:58	Time 3.048 (3.029)	Data 0.046 (0.065)	Mem 41.61GB	Prec@1 70.000 (64.712)	Loss 1.6365 (1.8199)
[02/27 09:47:57][INFO] train_vision.py:  668: Epoch: [3][200/329], lr: 1.80e-04, eta: 7:18:14	Time 3.015 (3.028)	Data 0.028 (0.064)	Mem 41.61GB	Prec@1 80.000 (65.075)	Loss 1.2985 (1.8097)
[02/27 09:48:27][INFO] train_vision.py:  668: Epoch: [3][210/329], lr: 1.82e-04, eta: 7:17:33	Time 3.030 (3.027)	Data 0.051 (0.063)	Mem 41.61GB	Prec@1 90.000 (65.118)	Loss 1.2358 (1.8134)
[02/27 09:48:57][INFO] train_vision.py:  668: Epoch: [3][220/329], lr: 1.83e-04, eta: 7:16:54	Time 3.006 (3.026)	Data 0.044 (0.063)	Mem 41.61GB	Prec@1 40.000 (64.751)	Loss 2.3037 (1.8204)
[02/27 09:49:27][INFO] train_vision.py:  668: Epoch: [3][230/329], lr: 1.85e-04, eta: 7:16:14	Time 2.996 (3.025)	Data 0.057 (0.062)	Mem 41.61GB	Prec@1 80.000 (64.545)	Loss 1.5925 (1.8256)
[02/27 09:49:57][INFO] train_vision.py:  668: Epoch: [3][240/329], lr: 1.86e-04, eta: 7:15:36	Time 2.997 (3.024)	Data 0.048 (0.062)	Mem 41.61GB	Prec@1 70.000 (64.523)	Loss 1.7226 (1.8249)
[02/27 09:50:27][INFO] train_vision.py:  668: Epoch: [3][250/329], lr: 1.88e-04, eta: 7:14:57	Time 3.008 (3.023)	Data 0.057 (0.061)	Mem 41.61GB	Prec@1 60.000 (64.622)	Loss 1.9689 (1.8189)
[02/27 09:50:57][INFO] train_vision.py:  668: Epoch: [3][260/329], lr: 1.89e-04, eta: 7:14:22	Time 2.996 (3.022)	Data 0.047 (0.061)	Mem 41.61GB	Prec@1 50.000 (64.674)	Loss 1.9848 (1.8174)
[02/27 09:51:27][INFO] train_vision.py:  668: Epoch: [3][270/329], lr: 1.91e-04, eta: 7:13:47	Time 3.022 (3.022)	Data 0.031 (0.060)	Mem 41.61GB	Prec@1 40.000 (64.576)	Loss 2.7080 (1.8182)
[02/27 09:51:57][INFO] train_vision.py:  668: Epoch: [3][280/329], lr: 1.92e-04, eta: 7:13:11	Time 3.001 (3.021)	Data 0.049 (0.060)	Mem 41.61GB	Prec@1 40.000 (64.769)	Loss 2.3970 (1.8152)
[02/27 09:52:27][INFO] train_vision.py:  668: Epoch: [3][290/329], lr: 1.94e-04, eta: 7:12:35	Time 3.006 (3.020)	Data 0.057 (0.060)	Mem 41.61GB	Prec@1 80.000 (65.052)	Loss 1.3670 (1.8071)
[02/27 09:52:57][INFO] train_vision.py:  668: Epoch: [3][300/329], lr: 1.95e-04, eta: 7:12:00	Time 3.006 (3.020)	Data 0.049 (0.059)	Mem 41.61GB	Prec@1 70.000 (64.718)	Loss 1.7940 (1.8123)
[02/27 09:53:27][INFO] train_vision.py:  668: Epoch: [3][310/329], lr: 1.97e-04, eta: 7:11:27	Time 2.984 (3.019)	Data 0.066 (0.059)	Mem 41.61GB	Prec@1 70.000 (64.695)	Loss 1.5491 (1.8136)
[02/27 09:53:57][INFO] train_vision.py:  668: Epoch: [3][320/329], lr: 1.98e-04, eta: 7:10:53	Time 3.002 (3.019)	Data 0.050 (0.059)	Mem 41.61GB	Prec@1 70.000 (64.673)	Loss 1.7903 (1.8140)
[02/27 09:54:29][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 90.000 (90.000)	Prec@5 98.750 (98.750)	mPrec@1 (27.912)	mPrec@5 (32.121)
[02/27 09:55:12][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 91.250 (86.705)	Prec@5 100.000 (99.659)	mPrec@1 (64.719)	mPrec@5 (86.471)
[02/27 09:55:54][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 90.000 (83.750)	Prec@5 100.000 (99.643)	mPrec@1 (67.884)	mPrec@5 (97.365)
[02/27 09:56:37][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 87.500 (83.266)	Prec@5 100.000 (99.677)	mPrec@1 (67.992)	mPrec@5 (98.422)
[02/27 09:57:19][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 77.500 (83.598)	Prec@5 97.500 (99.451)	mPrec@1 (69.380)	mPrec@5 (99.016)
[02/27 09:58:01][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 81.250 (82.083)	Prec@5 98.750 (98.799)	mPrec@1 (67.913)	mPrec@5 (98.393)
[02/27 09:58:44][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 88.750 (81.844)	Prec@5 100.000 (98.811)	mPrec@1 (67.858)	mPrec@5 (97.993)
[02/27 09:59:26][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 71.250 (81.778)	Prec@5 96.250 (98.856)	mPrec@1 (67.965)	mPrec@5 (98.146)
[02/27 10:00:09][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 71.250 (81.682)	Prec@5 100.000 (98.935)	mPrec@1 (68.024)	mPrec@5 (98.230)
[02/27 10:00:51][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 76.250 (80.838)	Prec@5 100.000 (99.025)	mPrec@1 (68.492)	mPrec@5 (98.312)
[02/27 10:01:33][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 70.000 (81.040)	Prec@5 96.250 (99.072)	mPrec@1 (68.641)	mPrec@5 (98.198)
[02/27 10:01:57][INFO] train_vision.py:  847: Overall Prec@1 80.646% Prec@5 99.061% mPrec@1 (68.686) mPrec@5 (98.220)
[02/27 10:01:57][INFO] train_vision.py:  464: Testing: 68.6862564086914/68.6862564086914
[02/27 10:01:57][INFO] train_vision.py:  465: Saving:
[02/27 10:02:16][INFO] train_vision.py:  668: Epoch: [4][0/329], lr: 2.00e-04, eta: 13:05:05	Time 5.506 (5.506)	Data 2.413 (2.413)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.3932 (1.3932)
[02/27 10:02:46][INFO] train_vision.py:  668: Epoch: [4][10/329], lr: 2.00e-04, eta: 7:37:45	Time 3.011 (3.214)	Data 0.088 (0.274)	Mem 41.61GB	Prec@1 90.000 (73.636)	Loss 1.3140 (1.6164)
[02/27 10:03:16][INFO] train_vision.py:  668: Epoch: [4][20/329], lr: 2.00e-04, eta: 7:22:08	Time 3.000 (3.108)	Data 0.061 (0.168)	Mem 41.61GB	Prec@1 90.000 (69.048)	Loss 1.4314 (1.7032)
[02/27 10:03:46][INFO] train_vision.py:  668: Epoch: [4][30/329], lr: 2.00e-04, eta: 7:16:42	Time 3.006 (3.074)	Data 0.080 (0.136)	Mem 41.61GB	Prec@1 70.000 (69.677)	Loss 2.1310 (1.7120)
[02/27 10:04:16][INFO] train_vision.py:  668: Epoch: [4][40/329], lr: 2.00e-04, eta: 7:13:53	Time 3.015 (3.057)	Data 0.085 (0.118)	Mem 41.61GB	Prec@1 80.000 (69.512)	Loss 1.4229 (1.7114)
[02/27 10:04:46][INFO] train_vision.py:  668: Epoch: [4][50/329], lr: 2.00e-04, eta: 7:11:49	Time 2.996 (3.046)	Data 0.053 (0.106)	Mem 41.61GB	Prec@1 70.000 (67.059)	Loss 1.7058 (1.7349)
[02/27 10:05:16][INFO] train_vision.py:  668: Epoch: [4][60/329], lr: 2.00e-04, eta: 7:10:20	Time 3.004 (3.039)	Data 0.059 (0.098)	Mem 41.61GB	Prec@1 80.000 (68.852)	Loss 1.5654 (1.6871)
[02/27 10:05:46][INFO] train_vision.py:  668: Epoch: [4][70/329], lr: 2.00e-04, eta: 7:09:08	Time 3.014 (3.035)	Data 0.078 (0.092)	Mem 41.61GB	Prec@1 70.000 (68.732)	Loss 1.4909 (1.6778)
[02/27 10:06:16][INFO] train_vision.py:  668: Epoch: [4][80/329], lr: 2.00e-04, eta: 7:08:16	Time 2.996 (3.032)	Data 0.056 (0.089)	Mem 41.61GB	Prec@1 60.000 (68.519)	Loss 1.8372 (1.6762)
[02/27 10:06:47][INFO] train_vision.py:  668: Epoch: [4][90/329], lr: 2.00e-04, eta: 7:07:22	Time 3.037 (3.029)	Data 0.084 (0.086)	Mem 41.61GB	Prec@1 80.000 (69.560)	Loss 1.4856 (1.6685)
[02/27 10:07:17][INFO] train_vision.py:  668: Epoch: [4][100/329], lr: 2.00e-04, eta: 7:06:34	Time 3.005 (3.027)	Data 0.057 (0.084)	Mem 41.61GB	Prec@1 80.000 (70.198)	Loss 1.4235 (1.6525)
[02/27 10:07:47][INFO] train_vision.py:  668: Epoch: [4][110/329], lr: 2.00e-04, eta: 7:05:44	Time 3.006 (3.025)	Data 0.026 (0.082)	Mem 41.61GB	Prec@1 70.000 (70.270)	Loss 1.5673 (1.6482)
[02/27 10:08:17][INFO] train_vision.py:  668: Epoch: [4][120/329], lr: 2.00e-04, eta: 7:04:59	Time 3.001 (3.023)	Data 0.057 (0.080)	Mem 41.61GB	Prec@1 80.000 (70.000)	Loss 1.2549 (1.6541)
[02/27 10:08:47][INFO] train_vision.py:  668: Epoch: [4][130/329], lr: 2.00e-04, eta: 7:04:11	Time 2.963 (3.021)	Data 0.048 (0.078)	Mem 41.61GB	Prec@1 60.000 (70.305)	Loss 1.8678 (1.6534)
[02/27 10:09:17][INFO] train_vision.py:  668: Epoch: [4][140/329], lr: 2.00e-04, eta: 7:03:31	Time 3.026 (3.020)	Data 0.030 (0.076)	Mem 41.61GB	Prec@1 80.000 (69.645)	Loss 1.3349 (1.6611)
[02/27 10:09:47][INFO] train_vision.py:  668: Epoch: [4][150/329], lr: 2.00e-04, eta: 7:02:54	Time 3.027 (3.019)	Data 0.028 (0.075)	Mem 41.61GB	Prec@1 90.000 (69.536)	Loss 1.1935 (1.6738)
[02/27 10:10:17][INFO] train_vision.py:  668: Epoch: [4][160/329], lr: 2.00e-04, eta: 7:02:13	Time 3.025 (3.018)	Data 0.024 (0.073)	Mem 41.61GB	Prec@1 60.000 (69.689)	Loss 1.6738 (1.6726)
[02/27 10:10:47][INFO] train_vision.py:  668: Epoch: [4][170/329], lr: 2.00e-04, eta: 7:01:38	Time 3.007 (3.017)	Data 0.051 (0.073)	Mem 41.61GB	Prec@1 80.000 (69.064)	Loss 1.4069 (1.6865)
[02/27 10:11:17][INFO] train_vision.py:  668: Epoch: [4][180/329], lr: 2.00e-04, eta: 7:01:02	Time 2.989 (3.016)	Data 0.057 (0.072)	Mem 41.61GB	Prec@1 50.000 (68.619)	Loss 1.9665 (1.6916)
[02/27 10:11:47][INFO] train_vision.py:  668: Epoch: [4][190/329], lr: 2.00e-04, eta: 7:00:28	Time 2.998 (3.016)	Data 0.049 (0.071)	Mem 41.61GB	Prec@1 80.000 (69.215)	Loss 1.5764 (1.6815)
[02/27 10:12:17][INFO] train_vision.py:  668: Epoch: [4][200/329], lr: 2.00e-04, eta: 6:59:53	Time 3.004 (3.015)	Data 0.053 (0.070)	Mem 41.61GB	Prec@1 80.000 (69.104)	Loss 1.4233 (1.6793)
[02/27 10:12:47][INFO] train_vision.py:  668: Epoch: [4][210/329], lr: 2.00e-04, eta: 6:59:17	Time 3.017 (3.015)	Data 0.050 (0.069)	Mem 41.61GB	Prec@1 90.000 (68.720)	Loss 1.4037 (1.6816)
[02/27 10:13:17][INFO] train_vision.py:  668: Epoch: [4][220/329], lr: 2.00e-04, eta: 6:58:42	Time 3.024 (3.014)	Data 0.089 (0.069)	Mem 41.61GB	Prec@1 80.000 (68.733)	Loss 1.4147 (1.6780)
[02/27 10:13:47][INFO] train_vision.py:  668: Epoch: [4][230/329], lr: 2.00e-04, eta: 6:58:08	Time 3.008 (3.014)	Data 0.026 (0.068)	Mem 41.61GB	Prec@1 50.000 (68.701)	Loss 1.8359 (1.6777)
[02/27 10:14:17][INFO] train_vision.py:  668: Epoch: [4][240/329], lr: 2.00e-04, eta: 6:57:33	Time 3.013 (3.013)	Data 0.038 (0.068)	Mem 41.61GB	Prec@1 70.000 (68.548)	Loss 1.6147 (1.6809)
[02/27 10:14:47][INFO] train_vision.py:  668: Epoch: [4][250/329], lr: 2.00e-04, eta: 6:56:57	Time 2.995 (3.012)	Data 0.047 (0.067)	Mem 41.61GB	Prec@1 60.000 (68.884)	Loss 1.5050 (1.6734)
[02/27 10:15:17][INFO] train_vision.py:  668: Epoch: [4][260/329], lr: 2.00e-04, eta: 6:56:22	Time 3.010 (3.012)	Data 0.055 (0.067)	Mem 41.61GB	Prec@1 70.000 (69.004)	Loss 1.4765 (1.6696)
[02/27 10:15:47][INFO] train_vision.py:  668: Epoch: [4][270/329], lr: 2.00e-04, eta: 6:55:49	Time 2.969 (3.011)	Data 0.049 (0.067)	Mem 41.61GB	Prec@1 70.000 (69.041)	Loss 1.5554 (1.6699)
[02/27 10:16:17][INFO] train_vision.py:  668: Epoch: [4][280/329], lr: 1.99e-04, eta: 6:55:16	Time 3.003 (3.011)	Data 0.057 (0.066)	Mem 41.61GB	Prec@1 90.000 (69.075)	Loss 1.3723 (1.6701)
[02/27 10:16:47][INFO] train_vision.py:  668: Epoch: [4][290/329], lr: 1.99e-04, eta: 6:54:43	Time 2.991 (3.011)	Data 0.050 (0.066)	Mem 41.61GB	Prec@1 80.000 (69.175)	Loss 1.4660 (1.6655)
[02/27 10:17:17][INFO] train_vision.py:  668: Epoch: [4][300/329], lr: 1.99e-04, eta: 6:54:11	Time 3.000 (3.010)	Data 0.049 (0.066)	Mem 41.61GB	Prec@1 70.000 (68.937)	Loss 1.8623 (1.6693)
[02/27 10:17:47][INFO] train_vision.py:  668: Epoch: [4][310/329], lr: 1.99e-04, eta: 6:53:38	Time 2.995 (3.010)	Data 0.048 (0.065)	Mem 41.61GB	Prec@1 90.000 (69.196)	Loss 1.4281 (1.6654)
[02/27 10:18:17][INFO] train_vision.py:  668: Epoch: [4][320/329], lr: 1.99e-04, eta: 6:53:05	Time 2.990 (3.010)	Data 0.087 (0.065)	Mem 41.61GB	Prec@1 80.000 (69.439)	Loss 1.2974 (1.6589)
[02/27 10:18:47][INFO] train_vision.py:  668: Epoch: [5][0/329], lr: 1.99e-04, eta: 12:24:51	Time 5.433 (5.433)	Data 2.468 (2.468)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.6233 (1.6233)
[02/27 10:19:17][INFO] train_vision.py:  668: Epoch: [5][10/329], lr: 1.99e-04, eta: 7:21:46	Time 3.000 (3.226)	Data 0.052 (0.277)	Mem 41.61GB	Prec@1 80.000 (73.636)	Loss 1.5055 (1.5507)
[02/27 10:19:47][INFO] train_vision.py:  668: Epoch: [5][20/329], lr: 1.99e-04, eta: 7:07:39	Time 3.023 (3.127)	Data 0.090 (0.175)	Mem 41.61GB	Prec@1 80.000 (72.857)	Loss 1.7882 (1.6277)
[02/27 10:20:17][INFO] train_vision.py:  668: Epoch: [5][30/329], lr: 1.99e-04, eta: 7:01:54	Time 2.993 (3.089)	Data 0.053 (0.136)	Mem 41.61GB	Prec@1 60.000 (70.323)	Loss 1.6168 (1.6344)
[02/27 10:20:47][INFO] train_vision.py:  668: Epoch: [5][40/329], lr: 1.99e-04, eta: 6:58:49	Time 3.014 (3.070)	Data 0.083 (0.119)	Mem 41.61GB	Prec@1 60.000 (70.000)	Loss 1.7729 (1.6313)
[02/27 10:21:17][INFO] train_vision.py:  668: Epoch: [5][50/329], lr: 1.99e-04, eta: 6:56:40	Time 3.000 (3.058)	Data 0.050 (0.106)	Mem 41.61GB	Prec@1 70.000 (69.608)	Loss 1.6992 (1.6317)
[02/27 10:21:47][INFO] train_vision.py:  668: Epoch: [5][60/329], lr: 1.99e-04, eta: 6:55:04	Time 3.030 (3.050)	Data 0.077 (0.099)	Mem 41.61GB	Prec@1 80.000 (70.492)	Loss 1.4469 (1.6069)
[02/27 10:22:18][INFO] train_vision.py:  668: Epoch: [5][70/329], lr: 1.99e-04, eta: 6:53:47	Time 3.012 (3.044)	Data 0.032 (0.092)	Mem 41.61GB	Prec@1 90.000 (71.408)	Loss 1.3599 (1.5822)
[02/27 10:22:48][INFO] train_vision.py:  668: Epoch: [5][80/329], lr: 1.99e-04, eta: 6:52:40	Time 3.002 (3.040)	Data 0.054 (0.087)	Mem 41.61GB	Prec@1 70.000 (71.358)	Loss 1.3751 (1.5848)
[02/27 10:23:18][INFO] train_vision.py:  668: Epoch: [5][90/329], lr: 1.99e-04, eta: 6:51:38	Time 3.013 (3.036)	Data 0.038 (0.084)	Mem 41.61GB	Prec@1 70.000 (71.978)	Loss 1.5444 (1.5729)
[02/27 10:23:48][INFO] train_vision.py:  668: Epoch: [5][100/329], lr: 1.99e-04, eta: 6:50:43	Time 2.989 (3.033)	Data 0.042 (0.081)	Mem 41.61GB	Prec@1 60.000 (71.683)	Loss 1.7701 (1.5991)
[02/27 10:24:18][INFO] train_vision.py:  668: Epoch: [5][110/329], lr: 1.99e-04, eta: 6:49:52	Time 3.006 (3.030)	Data 0.056 (0.079)	Mem 41.61GB	Prec@1 60.000 (71.802)	Loss 2.2451 (1.6051)
[02/27 10:24:48][INFO] train_vision.py:  668: Epoch: [5][120/329], lr: 1.99e-04, eta: 6:49:04	Time 2.998 (3.028)	Data 0.049 (0.076)	Mem 41.61GB	Prec@1 80.000 (71.653)	Loss 1.5051 (1.6164)
[02/27 10:25:18][INFO] train_vision.py:  668: Epoch: [5][130/329], lr: 1.99e-04, eta: 6:48:18	Time 3.013 (3.026)	Data 0.059 (0.074)	Mem 41.61GB	Prec@1 80.000 (71.985)	Loss 1.4397 (1.6060)
[02/27 10:25:48][INFO] train_vision.py:  668: Epoch: [5][140/329], lr: 1.99e-04, eta: 6:47:35	Time 2.997 (3.024)	Data 0.048 (0.073)	Mem 41.61GB	Prec@1 80.000 (72.411)	Loss 1.1042 (1.5977)
[02/27 10:26:18][INFO] train_vision.py:  668: Epoch: [5][150/329], lr: 1.98e-04, eta: 6:46:51	Time 2.986 (3.023)	Data 0.054 (0.071)	Mem 41.61GB	Prec@1 100.000 (72.715)	Loss 1.0133 (1.5951)
[02/27 10:26:48][INFO] train_vision.py:  668: Epoch: [5][160/329], lr: 1.98e-04, eta: 6:46:10	Time 3.004 (3.021)	Data 0.058 (0.069)	Mem 41.61GB	Prec@1 70.000 (72.857)	Loss 1.7525 (1.5932)
[02/27 10:27:18][INFO] train_vision.py:  668: Epoch: [5][170/329], lr: 1.98e-04, eta: 6:45:31	Time 3.011 (3.020)	Data 0.035 (0.068)	Mem 41.61GB	Prec@1 70.000 (72.865)	Loss 1.6558 (1.5929)
[02/27 10:27:48][INFO] train_vision.py:  668: Epoch: [5][180/329], lr: 1.98e-04, eta: 6:44:54	Time 3.005 (3.019)	Data 0.033 (0.067)	Mem 41.61GB	Prec@1 70.000 (72.873)	Loss 1.6622 (1.5972)
[02/27 10:28:18][INFO] train_vision.py:  668: Epoch: [5][190/329], lr: 1.98e-04, eta: 6:44:16	Time 3.000 (3.018)	Data 0.054 (0.066)	Mem 41.61GB	Prec@1 60.000 (73.298)	Loss 1.6729 (1.5921)
[02/27 10:28:48][INFO] train_vision.py:  668: Epoch: [5][200/329], lr: 1.98e-04, eta: 6:43:39	Time 3.009 (3.018)	Data 0.064 (0.065)	Mem 41.61GB	Prec@1 100.000 (73.532)	Loss 1.0104 (1.5847)
[02/27 10:29:18][INFO] train_vision.py:  668: Epoch: [5][210/329], lr: 1.98e-04, eta: 6:43:04	Time 3.027 (3.017)	Data 0.026 (0.064)	Mem 41.61GB	Prec@1 90.000 (73.649)	Loss 1.1771 (1.5767)
[02/27 10:29:48][INFO] train_vision.py:  668: Epoch: [5][220/329], lr: 1.98e-04, eta: 6:42:29	Time 3.006 (3.016)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 70.000 (73.937)	Loss 1.7062 (1.5710)
[02/27 10:30:18][INFO] train_vision.py:  668: Epoch: [5][230/329], lr: 1.98e-04, eta: 6:41:55	Time 3.003 (3.016)	Data 0.056 (0.063)	Mem 41.61GB	Prec@1 80.000 (73.896)	Loss 1.7381 (1.5773)
[02/27 10:30:48][INFO] train_vision.py:  668: Epoch: [5][240/329], lr: 1.98e-04, eta: 6:41:20	Time 3.006 (3.015)	Data 0.038 (0.062)	Mem 41.61GB	Prec@1 40.000 (73.693)	Loss 2.2301 (1.5792)
[02/27 10:31:18][INFO] train_vision.py:  668: Epoch: [5][250/329], lr: 1.98e-04, eta: 6:40:46	Time 3.001 (3.015)	Data 0.058 (0.062)	Mem 41.61GB	Prec@1 80.000 (73.546)	Loss 1.5874 (1.5817)
[02/27 10:31:48][INFO] train_vision.py:  668: Epoch: [5][260/329], lr: 1.98e-04, eta: 6:40:11	Time 3.001 (3.014)	Data 0.064 (0.061)	Mem 41.61GB	Prec@1 90.000 (73.602)	Loss 1.1714 (1.5787)
[02/27 10:32:18][INFO] train_vision.py:  668: Epoch: [5][270/329], lr: 1.98e-04, eta: 6:39:37	Time 3.009 (3.014)	Data 0.051 (0.061)	Mem 41.61GB	Prec@1 70.000 (73.948)	Loss 1.6699 (1.5697)
[02/27 10:32:48][INFO] train_vision.py:  668: Epoch: [5][280/329], lr: 1.98e-04, eta: 6:39:03	Time 2.991 (3.013)	Data 0.049 (0.061)	Mem 41.61GB	Prec@1 90.000 (74.270)	Loss 1.3420 (1.5645)
[02/27 10:33:18][INFO] train_vision.py:  668: Epoch: [5][290/329], lr: 1.97e-04, eta: 6:38:26	Time 2.975 (3.012)	Data 0.050 (0.060)	Mem 41.61GB	Prec@1 90.000 (74.296)	Loss 1.3254 (1.5630)
[02/27 10:33:48][INFO] train_vision.py:  668: Epoch: [5][300/329], lr: 1.97e-04, eta: 6:37:53	Time 2.997 (3.012)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 60.000 (74.286)	Loss 1.5843 (1.5613)
[02/27 10:34:18][INFO] train_vision.py:  668: Epoch: [5][310/329], lr: 1.97e-04, eta: 6:37:19	Time 2.998 (3.012)	Data 0.055 (0.059)	Mem 41.61GB	Prec@1 60.000 (74.180)	Loss 1.8158 (1.5650)
[02/27 10:34:48][INFO] train_vision.py:  668: Epoch: [5][320/329], lr: 1.97e-04, eta: 6:36:44	Time 3.017 (3.011)	Data 0.025 (0.059)	Mem 41.61GB	Prec@1 60.000 (74.174)	Loss 2.2157 (1.5636)
[02/27 10:35:19][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 96.250 (96.250)	Prec@5 98.750 (98.750)	mPrec@1 (30.859)	mPrec@5 (32.121)
[02/27 10:36:01][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 95.000 (93.750)	Prec@5 100.000 (99.773)	mPrec@1 (76.377)	mPrec@5 (86.771)
[02/27 10:36:44][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 95.000 (89.821)	Prec@5 100.000 (99.405)	mPrec@1 (79.035)	mPrec@5 (96.469)
[02/27 10:37:26][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 96.250 (90.121)	Prec@5 100.000 (99.556)	mPrec@1 (79.759)	mPrec@5 (97.655)
[02/27 10:38:09][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 86.250 (89.207)	Prec@5 100.000 (99.573)	mPrec@1 (80.108)	mPrec@5 (98.688)
[02/27 10:38:51][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 87.500 (87.990)	Prec@5 100.000 (99.461)	mPrec@1 (78.241)	mPrec@5 (98.610)
[02/27 10:39:34][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 95.000 (88.033)	Prec@5 100.000 (99.529)	mPrec@1 (78.449)	mPrec@5 (98.675)
[02/27 10:40:16][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 80.000 (88.257)	Prec@5 98.750 (99.577)	mPrec@1 (79.020)	mPrec@5 (98.835)
[02/27 10:40:59][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 87.500 (88.318)	Prec@5 98.750 (99.568)	mPrec@1 (79.434)	mPrec@5 (98.749)
[02/27 10:41:41][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 90.000 (88.022)	Prec@5 100.000 (99.574)	mPrec@1 (79.795)	mPrec@5 (98.700)
[02/27 10:42:24][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 85.000 (88.317)	Prec@5 98.750 (99.579)	mPrec@1 (80.068)	mPrec@5 (98.844)
[02/27 10:42:47][INFO] train_vision.py:  847: Overall Prec@1 88.016% Prec@5 99.507% mPrec@1 (80.055) mPrec@5 (98.836)
[02/27 10:42:47][INFO] train_vision.py:  464: Testing: 80.05497741699219/80.05497741699219
[02/27 10:42:47][INFO] train_vision.py:  465: Saving:
[02/27 10:43:07][INFO] train_vision.py:  668: Epoch: [6][0/329], lr: 1.97e-04, eta: 11:06:58	Time 5.068 (5.068)	Data 2.185 (2.185)	Mem 41.61GB	Prec@1 50.000 (50.000)	Loss 1.8584 (1.8584)
[02/27 10:43:37][INFO] train_vision.py:  668: Epoch: [6][10/329], lr: 1.97e-04, eta: 6:56:47	Time 2.996 (3.171)	Data 0.068 (0.255)	Mem 41.61GB	Prec@1 70.000 (77.273)	Loss 1.5038 (1.4214)
[02/27 10:44:07][INFO] train_vision.py:  668: Epoch: [6][20/329], lr: 1.97e-04, eta: 6:45:32	Time 2.996 (3.089)	Data 0.058 (0.163)	Mem 41.61GB	Prec@1 80.000 (73.333)	Loss 1.4418 (1.5215)
[02/27 10:44:37][INFO] train_vision.py:  668: Epoch: [6][30/329], lr: 1.97e-04, eta: 6:41:28	Time 3.011 (3.062)	Data 0.052 (0.130)	Mem 41.61GB	Prec@1 80.000 (74.194)	Loss 1.3105 (1.5339)
[02/27 10:45:07][INFO] train_vision.py:  668: Epoch: [6][40/329], lr: 1.97e-04, eta: 6:39:08	Time 3.001 (3.048)	Data 0.064 (0.113)	Mem 41.61GB	Prec@1 100.000 (74.634)	Loss 1.0287 (1.5447)
[02/27 10:45:37][INFO] train_vision.py:  668: Epoch: [6][50/329], lr: 1.97e-04, eta: 6:37:37	Time 3.009 (3.040)	Data 0.061 (0.103)	Mem 41.61GB	Prec@1 70.000 (74.510)	Loss 1.7091 (1.5393)
[02/27 10:46:07][INFO] train_vision.py:  668: Epoch: [6][60/329], lr: 1.97e-04, eta: 6:36:19	Time 3.014 (3.034)	Data 0.070 (0.096)	Mem 41.61GB	Prec@1 80.000 (74.918)	Loss 1.7275 (1.5310)
[02/27 10:46:37][INFO] train_vision.py:  668: Epoch: [6][70/329], lr: 1.96e-04, eta: 6:35:21	Time 3.025 (3.031)	Data 0.060 (0.091)	Mem 41.61GB	Prec@1 70.000 (76.479)	Loss 1.8347 (1.5049)
[02/27 10:47:07][INFO] train_vision.py:  668: Epoch: [6][80/329], lr: 1.96e-04, eta: 6:34:27	Time 2.994 (3.028)	Data 0.047 (0.087)	Mem 41.61GB	Prec@1 80.000 (75.802)	Loss 1.6114 (1.5208)
[02/27 10:47:37][INFO] train_vision.py:  668: Epoch: [6][90/329], lr: 1.96e-04, eta: 6:33:31	Time 2.975 (3.024)	Data 0.057 (0.083)	Mem 41.61GB	Prec@1 70.000 (76.154)	Loss 1.6638 (1.5190)
[02/27 10:48:07][INFO] train_vision.py:  668: Epoch: [6][100/329], lr: 1.96e-04, eta: 6:32:46	Time 2.998 (3.023)	Data 0.051 (0.080)	Mem 41.61GB	Prec@1 60.000 (76.634)	Loss 1.9570 (1.5108)
[02/27 10:48:37][INFO] train_vision.py:  668: Epoch: [6][110/329], lr: 1.96e-04, eta: 6:32:02	Time 3.014 (3.021)	Data 0.056 (0.078)	Mem 41.61GB	Prec@1 90.000 (76.396)	Loss 1.1886 (1.5107)
[02/27 10:49:07][INFO] train_vision.py:  668: Epoch: [6][120/329], lr: 1.96e-04, eta: 6:31:20	Time 2.974 (3.019)	Data 0.050 (0.077)	Mem 41.61GB	Prec@1 80.000 (76.694)	Loss 1.5563 (1.5118)
[02/27 10:49:37][INFO] train_vision.py:  668: Epoch: [6][130/329], lr: 1.96e-04, eta: 6:30:43	Time 3.004 (3.018)	Data 0.060 (0.075)	Mem 41.61GB	Prec@1 90.000 (77.023)	Loss 1.2603 (1.5053)
[02/27 10:50:07][INFO] train_vision.py:  668: Epoch: [6][140/329], lr: 1.96e-04, eta: 6:30:06	Time 3.001 (3.018)	Data 0.049 (0.074)	Mem 41.61GB	Prec@1 40.000 (77.092)	Loss 2.4734 (1.4992)
[02/27 10:50:37][INFO] train_vision.py:  668: Epoch: [6][150/329], lr: 1.96e-04, eta: 6:29:28	Time 2.998 (3.017)	Data 0.053 (0.073)	Mem 41.61GB	Prec@1 80.000 (76.755)	Loss 1.4480 (1.5059)
[02/27 10:51:07][INFO] train_vision.py:  668: Epoch: [6][160/329], lr: 1.96e-04, eta: 6:28:52	Time 2.995 (3.016)	Data 0.048 (0.072)	Mem 41.61GB	Prec@1 80.000 (77.081)	Loss 1.3574 (1.4952)
[02/27 10:51:37][INFO] train_vision.py:  668: Epoch: [6][170/329], lr: 1.95e-04, eta: 6:28:16	Time 3.004 (3.015)	Data 0.058 (0.071)	Mem 41.61GB	Prec@1 40.000 (77.018)	Loss 1.6674 (1.4899)
[02/27 10:52:07][INFO] train_vision.py:  668: Epoch: [6][180/329], lr: 1.95e-04, eta: 6:27:36	Time 2.937 (3.014)	Data 0.049 (0.069)	Mem 41.61GB	Prec@1 60.000 (76.851)	Loss 1.7678 (1.5019)
[02/27 10:52:37][INFO] train_vision.py:  668: Epoch: [6][190/329], lr: 1.95e-04, eta: 6:27:02	Time 2.995 (3.013)	Data 0.045 (0.069)	Mem 41.61GB	Prec@1 90.000 (76.911)	Loss 1.0139 (1.5028)
[02/27 10:53:07][INFO] train_vision.py:  668: Epoch: [6][200/329], lr: 1.95e-04, eta: 6:26:28	Time 2.998 (3.013)	Data 0.051 (0.068)	Mem 41.61GB	Prec@1 80.000 (77.363)	Loss 1.1905 (1.4947)
[02/27 10:53:37][INFO] train_vision.py:  668: Epoch: [6][210/329], lr: 1.95e-04, eta: 6:25:54	Time 3.002 (3.012)	Data 0.058 (0.068)	Mem 41.61GB	Prec@1 80.000 (76.919)	Loss 1.2928 (1.5089)
[02/27 10:54:07][INFO] train_vision.py:  668: Epoch: [6][220/329], lr: 1.95e-04, eta: 6:25:18	Time 2.986 (3.011)	Data 0.045 (0.067)	Mem 41.61GB	Prec@1 70.000 (77.195)	Loss 1.6899 (1.5001)
[02/27 10:54:37][INFO] train_vision.py:  668: Epoch: [6][230/329], lr: 1.95e-04, eta: 6:24:44	Time 3.027 (3.011)	Data 0.029 (0.066)	Mem 41.61GB	Prec@1 60.000 (76.883)	Loss 1.9000 (1.5056)
[02/27 10:55:07][INFO] train_vision.py:  668: Epoch: [6][240/329], lr: 1.95e-04, eta: 6:24:08	Time 3.021 (3.010)	Data 0.024 (0.065)	Mem 41.61GB	Prec@1 80.000 (76.805)	Loss 1.4439 (1.5064)
[02/27 10:55:37][INFO] train_vision.py:  668: Epoch: [6][250/329], lr: 1.95e-04, eta: 6:23:33	Time 3.003 (3.009)	Data 0.064 (0.064)	Mem 41.61GB	Prec@1 70.000 (76.853)	Loss 1.7747 (1.5119)
[02/27 10:56:07][INFO] train_vision.py:  668: Epoch: [6][260/329], lr: 1.94e-04, eta: 6:22:58	Time 3.000 (3.009)	Data 0.032 (0.064)	Mem 41.61GB	Prec@1 70.000 (77.241)	Loss 1.8550 (1.5059)
[02/27 10:56:37][INFO] train_vision.py:  668: Epoch: [6][270/329], lr: 1.94e-04, eta: 6:22:25	Time 3.004 (3.008)	Data 0.059 (0.063)	Mem 41.61GB	Prec@1 90.000 (77.343)	Loss 1.0563 (1.5028)
[02/27 10:57:07][INFO] train_vision.py:  668: Epoch: [6][280/329], lr: 1.94e-04, eta: 6:21:50	Time 2.985 (3.008)	Data 0.043 (0.062)	Mem 41.61GB	Prec@1 90.000 (77.509)	Loss 1.4170 (1.4975)
[02/27 10:57:37][INFO] train_vision.py:  668: Epoch: [6][290/329], lr: 1.94e-04, eta: 6:21:17	Time 3.000 (3.007)	Data 0.060 (0.062)	Mem 41.61GB	Prec@1 90.000 (77.560)	Loss 1.2848 (1.4946)
[02/27 10:58:07][INFO] train_vision.py:  668: Epoch: [6][300/329], lr: 1.94e-04, eta: 6:20:42	Time 2.990 (3.007)	Data 0.046 (0.061)	Mem 41.61GB	Prec@1 90.000 (77.608)	Loss 1.1858 (1.4934)
[02/27 10:58:37][INFO] train_vision.py:  668: Epoch: [6][310/329], lr: 1.94e-04, eta: 6:20:08	Time 3.000 (3.006)	Data 0.060 (0.061)	Mem 41.61GB	Prec@1 60.000 (77.846)	Loss 1.7154 (1.4869)
[02/27 10:59:07][INFO] train_vision.py:  668: Epoch: [6][320/329], lr: 1.94e-04, eta: 6:19:35	Time 2.987 (3.006)	Data 0.045 (0.061)	Mem 41.61GB	Prec@1 60.000 (77.850)	Loss 2.0104 (1.4851)
[02/27 10:59:36][INFO] train_vision.py:  668: Epoch: [7][0/329], lr: 1.94e-04, eta: 11:35:57	Time 5.518 (5.518)	Data 2.529 (2.529)	Mem 41.61GB	Prec@1 40.000 (40.000)	Loss 2.0645 (2.0645)
[02/27 11:00:06][INFO] train_vision.py:  668: Epoch: [7][10/329], lr: 1.93e-04, eta: 6:47:02	Time 2.976 (3.231)	Data 0.045 (0.279)	Mem 41.61GB	Prec@1 90.000 (73.636)	Loss 1.3725 (1.4920)
[02/27 11:00:36][INFO] train_vision.py:  668: Epoch: [7][20/329], lr: 1.93e-04, eta: 6:33:05	Time 2.993 (3.125)	Data 0.052 (0.171)	Mem 41.61GB	Prec@1 80.000 (78.095)	Loss 1.3302 (1.4012)
[02/27 11:01:07][INFO] train_vision.py:  668: Epoch: [7][30/329], lr: 1.93e-04, eta: 6:28:26	Time 3.039 (3.092)	Data 0.051 (0.131)	Mem 41.61GB	Prec@1 80.000 (80.645)	Loss 1.6474 (1.3722)
[02/27 11:01:37][INFO] train_vision.py:  668: Epoch: [7][40/329], lr: 1.93e-04, eta: 6:25:15	Time 3.012 (3.071)	Data 0.065 (0.112)	Mem 41.61GB	Prec@1 80.000 (80.488)	Loss 1.4161 (1.3705)
[02/27 11:02:07][INFO] train_vision.py:  668: Epoch: [7][50/329], lr: 1.93e-04, eta: 6:23:06	Time 2.992 (3.058)	Data 0.042 (0.100)	Mem 41.61GB	Prec@1 60.000 (77.843)	Loss 2.0674 (1.4179)
[02/27 11:02:37][INFO] train_vision.py:  668: Epoch: [7][60/329], lr: 1.93e-04, eta: 6:21:38	Time 3.002 (3.050)	Data 0.058 (0.092)	Mem 41.61GB	Prec@1 90.000 (78.033)	Loss 1.0279 (1.4245)
[02/27 11:03:07][INFO] train_vision.py:  668: Epoch: [7][70/329], lr: 1.93e-04, eta: 6:20:19	Time 3.032 (3.043)	Data 0.049 (0.086)	Mem 41.61GB	Prec@1 60.000 (78.592)	Loss 1.5127 (1.4161)
[02/27 11:03:37][INFO] train_vision.py:  668: Epoch: [7][80/329], lr: 1.93e-04, eta: 6:19:13	Time 3.001 (3.039)	Data 0.057 (0.082)	Mem 41.61GB	Prec@1 60.000 (78.272)	Loss 1.9232 (1.4310)
[02/27 11:04:07][INFO] train_vision.py:  668: Epoch: [7][90/329], lr: 1.92e-04, eta: 6:18:17	Time 3.028 (3.035)	Data 0.060 (0.079)	Mem 41.61GB	Prec@1 70.000 (77.692)	Loss 1.5029 (1.4270)
[02/27 11:04:37][INFO] train_vision.py:  668: Epoch: [7][100/329], lr: 1.92e-04, eta: 6:17:27	Time 3.007 (3.033)	Data 0.052 (0.076)	Mem 41.61GB	Prec@1 80.000 (77.525)	Loss 1.4258 (1.4367)
[02/27 11:05:07][INFO] train_vision.py:  668: Epoch: [7][110/329], lr: 1.92e-04, eta: 6:16:31	Time 2.970 (3.029)	Data 0.053 (0.074)	Mem 41.61GB	Prec@1 70.000 (77.387)	Loss 1.1927 (1.4405)
[02/27 11:05:37][INFO] train_vision.py:  668: Epoch: [7][120/329], lr: 1.92e-04, eta: 6:15:44	Time 2.999 (3.027)	Data 0.061 (0.072)	Mem 41.61GB	Prec@1 90.000 (77.521)	Loss 1.1407 (1.4399)
[02/27 11:06:07][INFO] train_vision.py:  668: Epoch: [7][130/329], lr: 1.92e-04, eta: 6:14:58	Time 2.991 (3.025)	Data 0.041 (0.070)	Mem 41.61GB	Prec@1 80.000 (77.328)	Loss 1.3529 (1.4453)
[02/27 11:06:37][INFO] train_vision.py:  668: Epoch: [7][140/329], lr: 1.92e-04, eta: 6:14:15	Time 3.001 (3.023)	Data 0.056 (0.069)	Mem 41.61GB	Prec@1 60.000 (77.376)	Loss 2.1050 (1.4467)
[02/27 11:07:07][INFO] train_vision.py:  668: Epoch: [7][150/329], lr: 1.92e-04, eta: 6:13:34	Time 3.005 (3.022)	Data 0.036 (0.067)	Mem 41.61GB	Prec@1 40.000 (77.748)	Loss 2.7536 (1.4493)
[02/27 11:07:37][INFO] train_vision.py:  668: Epoch: [7][160/329], lr: 1.91e-04, eta: 6:12:53	Time 3.000 (3.020)	Data 0.057 (0.066)	Mem 41.61GB	Prec@1 70.000 (77.702)	Loss 1.4679 (1.4507)
[02/27 11:08:07][INFO] train_vision.py:  668: Epoch: [7][170/329], lr: 1.91e-04, eta: 6:12:14	Time 3.003 (3.019)	Data 0.054 (0.065)	Mem 41.61GB	Prec@1 70.000 (77.778)	Loss 1.7204 (1.4499)
[02/27 11:08:37][INFO] train_vision.py:  668: Epoch: [7][180/329], lr: 1.91e-04, eta: 6:11:36	Time 3.016 (3.018)	Data 0.068 (0.065)	Mem 41.61GB	Prec@1 90.000 (77.403)	Loss 1.4037 (1.4619)
[02/27 11:09:07][INFO] train_vision.py:  668: Epoch: [7][190/329], lr: 1.91e-04, eta: 6:11:00	Time 3.007 (3.017)	Data 0.056 (0.064)	Mem 41.61GB	Prec@1 70.000 (77.696)	Loss 1.3412 (1.4514)
[02/27 11:09:37][INFO] train_vision.py:  668: Epoch: [7][200/329], lr: 1.91e-04, eta: 6:10:22	Time 2.993 (3.016)	Data 0.057 (0.063)	Mem 41.61GB	Prec@1 80.000 (77.811)	Loss 1.1804 (1.4491)
[02/27 11:10:07][INFO] train_vision.py:  668: Epoch: [7][210/329], lr: 1.91e-04, eta: 6:09:46	Time 3.013 (3.015)	Data 0.050 (0.063)	Mem 41.61GB	Prec@1 70.000 (77.678)	Loss 1.7641 (1.4572)
[02/27 11:10:37][INFO] train_vision.py:  668: Epoch: [7][220/329], lr: 1.90e-04, eta: 6:09:11	Time 2.991 (3.015)	Data 0.056 (0.063)	Mem 41.61GB	Prec@1 80.000 (77.828)	Loss 1.4545 (1.4556)
[02/27 11:11:07][INFO] train_vision.py:  668: Epoch: [7][230/329], lr: 1.90e-04, eta: 6:08:36	Time 3.024 (3.014)	Data 0.025 (0.062)	Mem 41.61GB	Prec@1 90.000 (77.835)	Loss 1.1668 (1.4569)
[02/27 11:11:37][INFO] train_vision.py:  668: Epoch: [7][240/329], lr: 1.90e-04, eta: 6:08:00	Time 2.995 (3.013)	Data 0.053 (0.062)	Mem 41.61GB	Prec@1 80.000 (77.967)	Loss 1.2452 (1.4547)
[02/27 11:12:07][INFO] train_vision.py:  668: Epoch: [7][250/329], lr: 1.90e-04, eta: 6:07:24	Time 2.994 (3.012)	Data 0.056 (0.061)	Mem 41.61GB	Prec@1 80.000 (78.048)	Loss 1.3391 (1.4570)
[02/27 11:12:37][INFO] train_vision.py:  668: Epoch: [7][260/329], lr: 1.90e-04, eta: 6:06:50	Time 2.998 (3.012)	Data 0.057 (0.061)	Mem 41.61GB	Prec@1 100.000 (78.161)	Loss 1.0694 (1.4523)
[02/27 11:13:07][INFO] train_vision.py:  668: Epoch: [7][270/329], lr: 1.90e-04, eta: 6:06:17	Time 2.996 (3.012)	Data 0.055 (0.061)	Mem 41.61GB	Prec@1 90.000 (78.229)	Loss 1.3484 (1.4515)
[02/27 11:13:37][INFO] train_vision.py:  668: Epoch: [7][280/329], lr: 1.89e-04, eta: 6:05:43	Time 2.998 (3.011)	Data 0.058 (0.060)	Mem 41.61GB	Prec@1 50.000 (78.043)	Loss 1.8551 (1.4566)
[02/27 11:14:07][INFO] train_vision.py:  668: Epoch: [7][290/329], lr: 1.89e-04, eta: 6:05:11	Time 3.006 (3.011)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 90.000 (78.041)	Loss 1.1635 (1.4586)
[02/27 11:14:37][INFO] train_vision.py:  668: Epoch: [7][300/329], lr: 1.89e-04, eta: 6:04:38	Time 3.002 (3.010)	Data 0.058 (0.059)	Mem 41.61GB	Prec@1 70.000 (77.907)	Loss 1.4697 (1.4588)
[02/27 11:15:07][INFO] train_vision.py:  668: Epoch: [7][310/329], lr: 1.89e-04, eta: 6:04:04	Time 2.995 (3.010)	Data 0.057 (0.059)	Mem 41.61GB	Prec@1 70.000 (77.910)	Loss 1.6390 (1.4556)
[02/27 11:15:37][INFO] train_vision.py:  668: Epoch: [7][320/329], lr: 1.89e-04, eta: 6:03:32	Time 3.002 (3.009)	Data 0.057 (0.059)	Mem 41.61GB	Prec@1 70.000 (77.850)	Loss 1.7560 (1.4599)
[02/27 11:16:08][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 96.250 (96.250)	Prec@5 98.750 (98.750)	mPrec@1 (30.606)	mPrec@5 (32.121)
[02/27 11:16:50][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 93.750 (93.068)	Prec@5 100.000 (99.773)	mPrec@1 (75.793)	mPrec@5 (86.512)
[02/27 11:17:32][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 96.250 (92.083)	Prec@5 100.000 (99.643)	mPrec@1 (82.576)	mPrec@5 (97.210)
[02/27 11:18:15][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (92.782)	Prec@5 100.000 (99.718)	mPrec@1 (84.466)	mPrec@5 (98.554)
[02/27 11:18:57][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 90.000 (92.591)	Prec@5 100.000 (99.756)	mPrec@1 (86.181)	mPrec@5 (99.680)
[02/27 11:19:40][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 93.750 (91.667)	Prec@5 100.000 (99.730)	mPrec@1 (85.334)	mPrec@5 (99.615)
[02/27 11:20:22][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 93.750 (91.598)	Prec@5 100.000 (99.754)	mPrec@1 (85.411)	mPrec@5 (99.683)
[02/27 11:21:05][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 85.000 (91.496)	Prec@5 98.750 (99.754)	mPrec@1 (85.429)	mPrec@5 (99.648)
[02/27 11:21:47][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 95.000 (91.651)	Prec@5 100.000 (99.769)	mPrec@1 (85.681)	mPrec@5 (99.658)
[02/27 11:22:29][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 96.250 (91.538)	Prec@5 100.000 (99.780)	mPrec@1 (86.123)	mPrec@5 (99.633)
[02/27 11:23:12][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 80.000 (91.559)	Prec@5 100.000 (99.790)	mPrec@1 (86.050)	mPrec@5 (99.658)
[02/27 11:23:36][INFO] train_vision.py:  847: Overall Prec@1 91.350% Prec@5 99.789% mPrec@1 (86.016) mPrec@5 (99.654)
[02/27 11:23:36][INFO] train_vision.py:  464: Testing: 86.01566314697266/86.01566314697266
[02/27 11:23:36][INFO] train_vision.py:  465: Saving:
[02/27 11:23:55][INFO] train_vision.py:  668: Epoch: [8][0/329], lr: 1.89e-04, eta: 10:24:28	Time 5.176 (5.176)	Data 2.299 (2.299)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.5299 (1.5299)
[02/27 11:24:25][INFO] train_vision.py:  668: Epoch: [8][10/329], lr: 1.89e-04, eta: 6:24:18	Time 2.991 (3.190)	Data 0.053 (0.268)	Mem 41.61GB	Prec@1 90.000 (88.182)	Loss 1.2897 (1.2912)
[02/27 11:24:55][INFO] train_vision.py:  668: Epoch: [8][20/329], lr: 1.88e-04, eta: 6:13:12	Time 3.021 (3.102)	Data 0.053 (0.170)	Mem 41.61GB	Prec@1 60.000 (84.762)	Loss 2.1720 (1.3154)
[02/27 11:25:25][INFO] train_vision.py:  668: Epoch: [8][30/329], lr: 1.88e-04, eta: 6:08:47	Time 3.037 (3.069)	Data 0.054 (0.133)	Mem 41.61GB	Prec@1 80.000 (83.226)	Loss 1.3183 (1.3511)
[02/27 11:25:55][INFO] train_vision.py:  668: Epoch: [8][40/329], lr: 1.88e-04, eta: 6:06:18	Time 3.014 (3.053)	Data 0.069 (0.114)	Mem 41.61GB	Prec@1 80.000 (80.488)	Loss 1.3736 (1.3986)
[02/27 11:26:25][INFO] train_vision.py:  668: Epoch: [8][50/329], lr: 1.88e-04, eta: 6:04:41	Time 3.026 (3.044)	Data 0.064 (0.101)	Mem 41.61GB	Prec@1 80.000 (79.020)	Loss 1.2278 (1.4398)
[02/27 11:26:55][INFO] train_vision.py:  668: Epoch: [8][60/329], lr: 1.88e-04, eta: 6:03:32	Time 3.037 (3.038)	Data 0.068 (0.094)	Mem 41.61GB	Prec@1 80.000 (80.328)	Loss 1.4713 (1.4184)
[02/27 11:27:25][INFO] train_vision.py:  668: Epoch: [8][70/329], lr: 1.87e-04, eta: 6:02:32	Time 3.033 (3.034)	Data 0.063 (0.089)	Mem 41.61GB	Prec@1 90.000 (80.563)	Loss 1.3576 (1.4100)
[02/27 11:27:55][INFO] train_vision.py:  668: Epoch: [8][80/329], lr: 1.87e-04, eta: 6:01:39	Time 3.006 (3.031)	Data 0.047 (0.084)	Mem 41.61GB	Prec@1 70.000 (80.247)	Loss 1.5324 (1.4147)
[02/27 11:28:25][INFO] train_vision.py:  668: Epoch: [8][90/329], lr: 1.87e-04, eta: 6:00:51	Time 3.053 (3.029)	Data 0.049 (0.081)	Mem 41.61GB	Prec@1 70.000 (80.220)	Loss 1.6951 (1.4230)
[02/27 11:28:55][INFO] train_vision.py:  668: Epoch: [8][100/329], lr: 1.87e-04, eta: 6:00:00	Time 2.997 (3.026)	Data 0.062 (0.078)	Mem 41.61GB	Prec@1 80.000 (80.099)	Loss 1.5922 (1.4202)
[02/27 11:29:25][INFO] train_vision.py:  668: Epoch: [8][110/329], lr: 1.87e-04, eta: 5:59:19	Time 3.027 (3.024)	Data 0.051 (0.075)	Mem 41.61GB	Prec@1 60.000 (79.279)	Loss 1.6306 (1.4283)
[02/27 11:29:55][INFO] train_vision.py:  668: Epoch: [8][120/329], lr: 1.87e-04, eta: 5:58:36	Time 3.007 (3.022)	Data 0.062 (0.073)	Mem 41.61GB	Prec@1 90.000 (79.587)	Loss 1.6218 (1.4171)
[02/27 11:30:25][INFO] train_vision.py:  668: Epoch: [8][130/329], lr: 1.86e-04, eta: 5:57:57	Time 3.025 (3.021)	Data 0.044 (0.071)	Mem 41.61GB	Prec@1 80.000 (79.847)	Loss 1.2944 (1.4087)
[02/27 11:30:55][INFO] train_vision.py:  668: Epoch: [8][140/329], lr: 1.86e-04, eta: 5:57:21	Time 3.029 (3.020)	Data 0.065 (0.070)	Mem 41.61GB	Prec@1 90.000 (79.574)	Loss 1.1053 (1.4071)
[02/27 11:31:25][INFO] train_vision.py:  668: Epoch: [8][150/329], lr: 1.86e-04, eta: 5:56:42	Time 3.004 (3.019)	Data 0.051 (0.069)	Mem 41.61GB	Prec@1 60.000 (79.272)	Loss 1.7079 (1.4127)
[02/27 11:31:55][INFO] train_vision.py:  668: Epoch: [8][160/329], lr: 1.86e-04, eta: 5:56:04	Time 3.003 (3.018)	Data 0.058 (0.068)	Mem 41.61GB	Prec@1 90.000 (79.193)	Loss 1.2860 (1.4105)
[02/27 11:32:25][INFO] train_vision.py:  668: Epoch: [8][170/329], lr: 1.86e-04, eta: 5:55:27	Time 3.001 (3.017)	Data 0.051 (0.067)	Mem 41.61GB	Prec@1 70.000 (79.064)	Loss 1.3576 (1.4166)
[02/27 11:32:56][INFO] train_vision.py:  668: Epoch: [8][180/329], lr: 1.85e-04, eta: 5:54:55	Time 3.025 (3.017)	Data 0.087 (0.066)	Mem 41.61GB	Prec@1 80.000 (79.171)	Loss 1.4997 (1.4141)
[02/27 11:33:25][INFO] train_vision.py:  668: Epoch: [8][190/329], lr: 1.85e-04, eta: 5:54:16	Time 2.954 (3.016)	Data 0.049 (0.066)	Mem 41.61GB	Prec@1 80.000 (79.634)	Loss 1.6006 (1.4069)
[02/27 11:33:56][INFO] train_vision.py:  668: Epoch: [8][200/329], lr: 1.85e-04, eta: 5:53:43	Time 3.046 (3.015)	Data 0.066 (0.065)	Mem 41.61GB	Prec@1 100.000 (79.701)	Loss 0.9213 (1.4048)
[02/27 11:34:26][INFO] train_vision.py:  668: Epoch: [8][210/329], lr: 1.85e-04, eta: 5:53:08	Time 2.988 (3.014)	Data 0.045 (0.064)	Mem 41.61GB	Prec@1 70.000 (79.716)	Loss 1.7041 (1.4044)
[02/27 11:34:56][INFO] train_vision.py:  668: Epoch: [8][220/329], lr: 1.85e-04, eta: 5:52:32	Time 2.999 (3.014)	Data 0.051 (0.064)	Mem 41.61GB	Prec@1 90.000 (79.729)	Loss 1.0585 (1.4022)
[02/27 11:35:26][INFO] train_vision.py:  668: Epoch: [8][230/329], lr: 1.84e-04, eta: 5:51:57	Time 3.005 (3.013)	Data 0.049 (0.063)	Mem 41.61GB	Prec@1 100.000 (79.827)	Loss 1.0493 (1.4013)
[02/27 11:35:56][INFO] train_vision.py:  668: Epoch: [8][240/329], lr: 1.84e-04, eta: 5:51:24	Time 3.042 (3.013)	Data 0.066 (0.063)	Mem 41.61GB	Prec@1 90.000 (79.917)	Loss 1.1702 (1.3970)
[02/27 11:36:26][INFO] train_vision.py:  668: Epoch: [8][250/329], lr: 1.84e-04, eta: 5:50:49	Time 2.973 (3.012)	Data 0.042 (0.062)	Mem 41.61GB	Prec@1 90.000 (80.080)	Loss 1.2983 (1.3949)
[02/27 11:36:55][INFO] train_vision.py:  668: Epoch: [8][260/329], lr: 1.84e-04, eta: 5:50:16	Time 2.983 (3.011)	Data 0.062 (0.062)	Mem 41.61GB	Prec@1 70.000 (80.115)	Loss 1.9580 (1.3950)
[02/27 11:37:25][INFO] train_vision.py:  668: Epoch: [8][270/329], lr: 1.84e-04, eta: 5:49:40	Time 2.993 (3.011)	Data 0.049 (0.061)	Mem 41.61GB	Prec@1 70.000 (80.037)	Loss 1.4527 (1.3974)
[02/27 11:37:55][INFO] train_vision.py:  668: Epoch: [8][280/329], lr: 1.83e-04, eta: 5:49:07	Time 2.993 (3.010)	Data 0.055 (0.061)	Mem 41.61GB	Prec@1 90.000 (79.858)	Loss 1.1895 (1.4035)
[02/27 11:38:25][INFO] train_vision.py:  668: Epoch: [8][290/329], lr: 1.83e-04, eta: 5:48:35	Time 2.991 (3.010)	Data 0.042 (0.060)	Mem 41.61GB	Prec@1 80.000 (79.897)	Loss 1.3406 (1.4010)
[02/27 11:38:55][INFO] train_vision.py:  668: Epoch: [8][300/329], lr: 1.83e-04, eta: 5:48:02	Time 3.026 (3.009)	Data 0.068 (0.060)	Mem 41.61GB	Prec@1 100.000 (79.867)	Loss 1.0355 (1.4021)
[02/27 11:39:25][INFO] train_vision.py:  668: Epoch: [8][310/329], lr: 1.83e-04, eta: 5:47:30	Time 2.998 (3.009)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 60.000 (79.646)	Loss 1.9131 (1.4062)
[02/27 11:39:55][INFO] train_vision.py:  668: Epoch: [8][320/329], lr: 1.83e-04, eta: 5:46:56	Time 2.998 (3.009)	Data 0.057 (0.059)	Mem 41.61GB	Prec@1 70.000 (79.439)	Loss 1.6554 (1.4102)
[02/27 11:40:25][INFO] train_vision.py:  668: Epoch: [9][0/329], lr: 1.82e-04, eta: 10:41:57	Time 5.574 (5.574)	Data 2.316 (2.316)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.2884 (1.2884)
[02/27 11:40:55][INFO] train_vision.py:  668: Epoch: [9][10/329], lr: 1.82e-04, eta: 6:12:53	Time 3.044 (3.243)	Data 0.052 (0.260)	Mem 41.61GB	Prec@1 80.000 (72.727)	Loss 1.2702 (1.4269)
[02/27 11:41:25][INFO] train_vision.py:  668: Epoch: [9][20/329], lr: 1.82e-04, eta: 5:59:54	Time 2.995 (3.134)	Data 0.046 (0.164)	Mem 41.61GB	Prec@1 60.000 (76.667)	Loss 1.7182 (1.4031)
[02/27 11:41:56][INFO] train_vision.py:  668: Epoch: [9][30/329], lr: 1.82e-04, eta: 5:54:45	Time 3.036 (3.094)	Data 0.020 (0.128)	Mem 41.61GB	Prec@1 90.000 (75.161)	Loss 1.2658 (1.4368)
[02/27 11:42:26][INFO] train_vision.py:  668: Epoch: [9][40/329], lr: 1.82e-04, eta: 5:51:38	Time 2.974 (3.071)	Data 0.041 (0.109)	Mem 41.61GB	Prec@1 80.000 (75.366)	Loss 1.2441 (1.4353)
[02/27 11:42:56][INFO] train_vision.py:  668: Epoch: [9][50/329], lr: 1.81e-04, eta: 5:49:35	Time 2.977 (3.058)	Data 0.044 (0.099)	Mem 41.61GB	Prec@1 70.000 (76.667)	Loss 1.5106 (1.4267)
[02/27 11:43:26][INFO] train_vision.py:  668: Epoch: [9][60/329], lr: 1.81e-04, eta: 5:47:55	Time 2.988 (3.047)	Data 0.041 (0.091)	Mem 41.61GB	Prec@1 70.000 (77.869)	Loss 1.7115 (1.4242)
[02/27 11:43:56][INFO] train_vision.py:  668: Epoch: [9][70/329], lr: 1.81e-04, eta: 5:46:50	Time 3.027 (3.042)	Data 0.068 (0.086)	Mem 41.61GB	Prec@1 50.000 (77.887)	Loss 2.3839 (1.4450)
[02/27 11:44:26][INFO] train_vision.py:  668: Epoch: [9][80/329], lr: 1.81e-04, eta: 5:45:44	Time 2.983 (3.037)	Data 0.047 (0.082)	Mem 41.61GB	Prec@1 70.000 (78.025)	Loss 1.8229 (1.4368)
[02/27 11:44:56][INFO] train_vision.py:  668: Epoch: [9][90/329], lr: 1.81e-04, eta: 5:44:42	Time 3.002 (3.033)	Data 0.065 (0.079)	Mem 41.61GB	Prec@1 60.000 (78.242)	Loss 1.8685 (1.4337)
[02/27 11:45:26][INFO] train_vision.py:  668: Epoch: [9][100/329], lr: 1.80e-04, eta: 5:43:53	Time 2.985 (3.030)	Data 0.045 (0.076)	Mem 41.61GB	Prec@1 80.000 (78.515)	Loss 1.5363 (1.4246)
[02/27 11:45:56][INFO] train_vision.py:  668: Epoch: [9][110/329], lr: 1.80e-04, eta: 5:43:05	Time 3.036 (3.027)	Data 0.072 (0.074)	Mem 41.61GB	Prec@1 70.000 (78.198)	Loss 1.5507 (1.4287)
[02/27 11:46:26][INFO] train_vision.py:  668: Epoch: [9][120/329], lr: 1.80e-04, eta: 5:42:21	Time 2.992 (3.025)	Data 0.049 (0.072)	Mem 41.61GB	Prec@1 80.000 (78.182)	Loss 1.5774 (1.4278)
[02/27 11:46:56][INFO] train_vision.py:  668: Epoch: [9][130/329], lr: 1.80e-04, eta: 5:41:37	Time 3.032 (3.023)	Data 0.060 (0.070)	Mem 41.61GB	Prec@1 90.000 (78.473)	Loss 1.3175 (1.4216)
[02/27 11:47:26][INFO] train_vision.py:  668: Epoch: [9][140/329], lr: 1.79e-04, eta: 5:40:52	Time 2.993 (3.021)	Data 0.049 (0.068)	Mem 41.61GB	Prec@1 90.000 (78.582)	Loss 1.5287 (1.4173)
[02/27 11:47:56][INFO] train_vision.py:  668: Epoch: [9][150/329], lr: 1.79e-04, eta: 5:40:14	Time 3.020 (3.020)	Data 0.068 (0.067)	Mem 41.61GB	Prec@1 70.000 (78.477)	Loss 1.9901 (1.4228)
[02/27 11:48:26][INFO] train_vision.py:  668: Epoch: [9][160/329], lr: 1.79e-04, eta: 5:39:33	Time 2.990 (3.018)	Data 0.048 (0.067)	Mem 41.61GB	Prec@1 60.000 (78.571)	Loss 1.9591 (1.4214)
[02/27 11:48:56][INFO] train_vision.py:  668: Epoch: [9][170/329], lr: 1.79e-04, eta: 5:38:56	Time 3.036 (3.017)	Data 0.065 (0.066)	Mem 41.61GB	Prec@1 80.000 (78.538)	Loss 1.6397 (1.4217)
[02/27 11:49:26][INFO] train_vision.py:  668: Epoch: [9][180/329], lr: 1.79e-04, eta: 5:38:18	Time 2.988 (3.016)	Data 0.044 (0.065)	Mem 41.61GB	Prec@1 80.000 (78.840)	Loss 1.2843 (1.4132)
[02/27 11:49:55][INFO] train_vision.py:  668: Epoch: [9][190/329], lr: 1.78e-04, eta: 5:37:40	Time 2.973 (3.015)	Data 0.046 (0.064)	Mem 41.61GB	Prec@1 100.000 (79.215)	Loss 0.9950 (1.4087)
[02/27 11:50:25][INFO] train_vision.py:  668: Epoch: [9][200/329], lr: 1.78e-04, eta: 5:37:05	Time 2.983 (3.014)	Data 0.048 (0.064)	Mem 41.61GB	Prec@1 90.000 (79.154)	Loss 1.1605 (1.4090)
[02/27 11:50:55][INFO] train_vision.py:  668: Epoch: [9][210/329], lr: 1.78e-04, eta: 5:36:27	Time 2.978 (3.013)	Data 0.036 (0.063)	Mem 41.61GB	Prec@1 90.000 (78.957)	Loss 1.1179 (1.4085)
[02/27 11:51:25][INFO] train_vision.py:  668: Epoch: [9][220/329], lr: 1.78e-04, eta: 5:35:51	Time 2.980 (3.012)	Data 0.041 (0.062)	Mem 41.61GB	Prec@1 100.000 (78.914)	Loss 0.8578 (1.4103)
[02/27 11:51:55][INFO] train_vision.py:  668: Epoch: [9][230/329], lr: 1.77e-04, eta: 5:35:15	Time 3.006 (3.011)	Data 0.066 (0.061)	Mem 41.61GB	Prec@1 100.000 (78.615)	Loss 0.8398 (1.4133)
[02/27 11:52:25][INFO] train_vision.py:  668: Epoch: [9][240/329], lr: 1.77e-04, eta: 5:34:40	Time 2.975 (3.011)	Data 0.044 (0.061)	Mem 41.61GB	Prec@1 80.000 (78.714)	Loss 1.1629 (1.4110)
[02/27 11:52:55][INFO] train_vision.py:  668: Epoch: [9][250/329], lr: 1.77e-04, eta: 5:34:04	Time 2.985 (3.010)	Data 0.049 (0.060)	Mem 41.61GB	Prec@1 90.000 (78.486)	Loss 1.0269 (1.4145)
[02/27 11:53:25][INFO] train_vision.py:  668: Epoch: [9][260/329], lr: 1.77e-04, eta: 5:33:29	Time 2.984 (3.009)	Data 0.046 (0.060)	Mem 41.61GB	Prec@1 80.000 (78.659)	Loss 1.2570 (1.4100)
[02/27 11:53:55][INFO] train_vision.py:  668: Epoch: [9][270/329], lr: 1.77e-04, eta: 5:32:54	Time 2.998 (3.008)	Data 0.049 (0.059)	Mem 41.61GB	Prec@1 80.000 (78.672)	Loss 1.6194 (1.4103)
[02/27 11:54:25][INFO] train_vision.py:  668: Epoch: [9][280/329], lr: 1.76e-04, eta: 5:32:20	Time 3.002 (3.008)	Data 0.049 (0.059)	Mem 41.61GB	Prec@1 60.000 (78.683)	Loss 2.1828 (1.4122)
[02/27 11:54:55][INFO] train_vision.py:  668: Epoch: [9][290/329], lr: 1.76e-04, eta: 5:31:46	Time 2.981 (3.007)	Data 0.041 (0.059)	Mem 41.61GB	Prec@1 100.000 (79.003)	Loss 0.8522 (1.4049)
[02/27 11:55:25][INFO] train_vision.py:  668: Epoch: [9][300/329], lr: 1.76e-04, eta: 5:31:12	Time 2.980 (3.006)	Data 0.041 (0.058)	Mem 41.61GB	Prec@1 70.000 (79.136)	Loss 1.4876 (1.4004)
[02/27 11:55:54][INFO] train_vision.py:  668: Epoch: [9][310/329], lr: 1.76e-04, eta: 5:30:37	Time 2.990 (3.006)	Data 0.049 (0.058)	Mem 41.61GB	Prec@1 70.000 (79.196)	Loss 1.4315 (1.3999)
[02/27 11:56:24][INFO] train_vision.py:  668: Epoch: [9][320/329], lr: 1.75e-04, eta: 5:30:03	Time 2.987 (3.005)	Data 0.048 (0.058)	Mem 41.61GB	Prec@1 70.000 (79.377)	Loss 1.6141 (1.3939)
[02/27 11:56:55][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 96.250 (96.250)	Prec@5 98.750 (98.750)	mPrec@1 (30.859)	mPrec@5 (32.121)
[02/27 11:57:37][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 95.000 (93.864)	Prec@5 100.000 (99.659)	mPrec@1 (77.112)	mPrec@5 (86.760)
[02/27 11:58:20][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 98.750 (92.202)	Prec@5 100.000 (99.702)	mPrec@1 (84.997)	mPrec@5 (97.562)
[02/27 11:59:03][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 97.500 (93.427)	Prec@5 100.000 (99.758)	mPrec@1 (87.770)	mPrec@5 (98.725)
[02/27 11:59:45][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 85.000 (93.110)	Prec@5 100.000 (99.756)	mPrec@1 (88.820)	mPrec@5 (99.718)
[02/27 12:00:28][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 95.000 (91.863)	Prec@5 100.000 (99.657)	mPrec@1 (87.447)	mPrec@5 (99.576)
[02/27 12:01:10][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 95.000 (92.070)	Prec@5 100.000 (99.693)	mPrec@1 (87.851)	mPrec@5 (99.590)
[02/27 12:01:53][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 85.000 (92.060)	Prec@5 100.000 (99.736)	mPrec@1 (87.868)	mPrec@5 (99.658)
[02/27 12:02:35][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 92.500 (92.160)	Prec@5 100.000 (99.753)	mPrec@1 (87.963)	mPrec@5 (99.665)
[02/27 12:03:18][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (91.882)	Prec@5 100.000 (99.780)	mPrec@1 (87.877)	mPrec@5 (99.682)
[02/27 12:04:00][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (92.104)	Prec@5 100.000 (99.802)	mPrec@1 (88.103)	mPrec@5 (99.695)
[02/27 12:04:24][INFO] train_vision.py:  847: Overall Prec@1 91.960% Prec@5 99.812% mPrec@1 (88.008) mPrec@5 (99.698)
[02/27 12:04:24][INFO] train_vision.py:  464: Testing: 88.00768280029297/88.00768280029297
[02/27 12:04:24][INFO] train_vision.py:  465: Saving:
[02/27 12:04:43][INFO] train_vision.py:  668: Epoch: [10][0/329], lr: 1.75e-04, eta: 9:34:17	Time 5.236 (5.236)	Data 2.357 (2.357)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.5868 (1.5868)
[02/27 12:05:13][INFO] train_vision.py:  668: Epoch: [10][10/329], lr: 1.75e-04, eta: 5:49:56	Time 2.979 (3.195)	Data 0.072 (0.269)	Mem 41.61GB	Prec@1 90.000 (78.182)	Loss 1.4961 (1.4469)
[02/27 12:05:43][INFO] train_vision.py:  668: Epoch: [10][20/329], lr: 1.75e-04, eta: 5:40:06	Time 3.019 (3.110)	Data 0.075 (0.168)	Mem 41.61GB	Prec@1 90.000 (80.000)	Loss 1.1484 (1.4079)
[02/27 12:06:13][INFO] train_vision.py:  668: Epoch: [10][30/329], lr: 1.74e-04, eta: 5:36:02	Time 3.029 (3.078)	Data 0.046 (0.133)	Mem 41.61GB	Prec@1 90.000 (78.710)	Loss 1.0463 (1.4194)
[02/27 12:06:44][INFO] train_vision.py:  668: Epoch: [10][40/329], lr: 1.74e-04, eta: 5:33:42	Time 3.018 (3.061)	Data 0.043 (0.115)	Mem 41.61GB	Prec@1 90.000 (81.220)	Loss 1.1765 (1.3619)
[02/27 12:07:14][INFO] train_vision.py:  668: Epoch: [10][50/329], lr: 1.74e-04, eta: 5:32:16	Time 3.038 (3.053)	Data 0.064 (0.101)	Mem 41.61GB	Prec@1 80.000 (80.196)	Loss 1.3255 (1.3832)
[02/27 12:07:44][INFO] train_vision.py:  668: Epoch: [10][60/329], lr: 1.74e-04, eta: 5:31:01	Time 3.017 (3.046)	Data 0.054 (0.093)	Mem 41.61GB	Prec@1 100.000 (80.164)	Loss 0.9672 (1.3905)
[02/27 12:08:14][INFO] train_vision.py:  668: Epoch: [10][70/329], lr: 1.73e-04, eta: 5:29:51	Time 2.971 (3.040)	Data 0.054 (0.088)	Mem 41.61GB	Prec@1 80.000 (79.014)	Loss 1.3474 (1.4126)
[02/27 12:08:44][INFO] train_vision.py:  668: Epoch: [10][80/329], lr: 1.73e-04, eta: 5:28:50	Time 3.002 (3.035)	Data 0.068 (0.084)	Mem 41.61GB	Prec@1 90.000 (78.889)	Loss 1.2137 (1.4189)
[02/27 12:09:14][INFO] train_vision.py:  668: Epoch: [10][90/329], lr: 1.73e-04, eta: 5:27:57	Time 3.003 (3.032)	Data 0.053 (0.081)	Mem 41.61GB	Prec@1 60.000 (78.242)	Loss 1.6040 (1.4306)
[02/27 12:09:44][INFO] train_vision.py:  668: Epoch: [10][100/329], lr: 1.73e-04, eta: 5:27:07	Time 3.008 (3.029)	Data 0.056 (0.078)	Mem 41.61GB	Prec@1 90.000 (78.317)	Loss 1.2219 (1.4318)
[02/27 12:10:14][INFO] train_vision.py:  668: Epoch: [10][110/329], lr: 1.72e-04, eta: 5:26:21	Time 2.994 (3.026)	Data 0.053 (0.076)	Mem 41.61GB	Prec@1 80.000 (78.468)	Loss 1.7338 (1.4262)
[02/27 12:10:44][INFO] train_vision.py:  668: Epoch: [10][120/329], lr: 1.72e-04, eta: 5:25:34	Time 2.988 (3.023)	Data 0.047 (0.074)	Mem 41.61GB	Prec@1 80.000 (78.926)	Loss 1.2126 (1.4201)
[02/27 12:11:14][INFO] train_vision.py:  668: Epoch: [10][130/329], lr: 1.72e-04, eta: 5:24:50	Time 2.999 (3.021)	Data 0.054 (0.072)	Mem 41.61GB	Prec@1 60.000 (78.702)	Loss 1.4349 (1.4150)
[02/27 12:11:44][INFO] train_vision.py:  668: Epoch: [10][140/329], lr: 1.72e-04, eta: 5:24:09	Time 2.986 (3.020)	Data 0.048 (0.071)	Mem 41.61GB	Prec@1 80.000 (79.149)	Loss 1.3178 (1.4064)
[02/27 12:12:14][INFO] train_vision.py:  668: Epoch: [10][150/329], lr: 1.71e-04, eta: 5:23:28	Time 3.033 (3.018)	Data 0.025 (0.070)	Mem 41.61GB	Prec@1 80.000 (79.338)	Loss 1.4712 (1.4065)
[02/27 12:12:44][INFO] train_vision.py:  668: Epoch: [10][160/329], lr: 1.71e-04, eta: 5:22:48	Time 2.993 (3.016)	Data 0.058 (0.069)	Mem 41.61GB	Prec@1 70.000 (79.317)	Loss 1.7336 (1.4093)
[02/27 12:13:14][INFO] train_vision.py:  668: Epoch: [10][170/329], lr: 1.71e-04, eta: 5:22:13	Time 3.040 (3.016)	Data 0.022 (0.068)	Mem 41.61GB	Prec@1 80.000 (79.532)	Loss 1.3123 (1.4034)
[02/27 12:13:44][INFO] train_vision.py:  668: Epoch: [10][180/329], lr: 1.71e-04, eta: 5:21:35	Time 2.989 (3.014)	Data 0.059 (0.067)	Mem 41.61GB	Prec@1 90.000 (79.613)	Loss 1.1938 (1.4038)
[02/27 12:14:14][INFO] train_vision.py:  668: Epoch: [10][190/329], lr: 1.70e-04, eta: 5:21:00	Time 2.987 (3.014)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 70.000 (79.895)	Loss 1.6411 (1.3976)
[02/27 12:14:44][INFO] train_vision.py:  668: Epoch: [10][200/329], lr: 1.70e-04, eta: 5:20:24	Time 2.988 (3.013)	Data 0.051 (0.066)	Mem 41.61GB	Prec@1 70.000 (79.701)	Loss 1.6933 (1.3979)
[02/27 12:15:14][INFO] train_vision.py:  668: Epoch: [10][210/329], lr: 1.70e-04, eta: 5:19:50	Time 3.032 (3.012)	Data 0.022 (0.065)	Mem 41.61GB	Prec@1 80.000 (79.716)	Loss 1.4923 (1.3968)
[02/27 12:15:44][INFO] train_vision.py:  668: Epoch: [10][220/329], lr: 1.70e-04, eta: 5:19:15	Time 3.005 (3.011)	Data 0.062 (0.065)	Mem 41.61GB	Prec@1 30.000 (79.457)	Loss 2.7208 (1.4025)
[02/27 12:16:14][INFO] train_vision.py:  668: Epoch: [10][230/329], lr: 1.69e-04, eta: 5:18:42	Time 3.006 (3.011)	Data 0.058 (0.064)	Mem 41.61GB	Prec@1 80.000 (79.784)	Loss 1.2161 (1.3946)
[02/27 12:16:44][INFO] train_vision.py:  668: Epoch: [10][240/329], lr: 1.69e-04, eta: 5:18:09	Time 2.994 (3.010)	Data 0.051 (0.064)	Mem 41.61GB	Prec@1 70.000 (79.751)	Loss 1.4446 (1.3936)
[02/27 12:17:14][INFO] train_vision.py:  668: Epoch: [10][250/329], lr: 1.69e-04, eta: 5:17:36	Time 3.015 (3.010)	Data 0.039 (0.064)	Mem 41.61GB	Prec@1 100.000 (79.681)	Loss 0.9821 (1.4003)
[02/27 12:17:44][INFO] train_vision.py:  668: Epoch: [10][260/329], lr: 1.69e-04, eta: 5:17:04	Time 3.002 (3.010)	Data 0.058 (0.063)	Mem 41.61GB	Prec@1 70.000 (79.579)	Loss 1.6945 (1.3994)
[02/27 12:18:14][INFO] train_vision.py:  668: Epoch: [10][270/329], lr: 1.68e-04, eta: 5:16:31	Time 3.002 (3.009)	Data 0.054 (0.063)	Mem 41.61GB	Prec@1 50.000 (79.114)	Loss 1.6432 (1.4095)
[02/27 12:18:44][INFO] train_vision.py:  668: Epoch: [10][280/329], lr: 1.68e-04, eta: 5:15:59	Time 3.000 (3.009)	Data 0.040 (0.063)	Mem 41.61GB	Prec@1 90.000 (79.075)	Loss 1.1595 (1.4064)
[02/27 12:19:14][INFO] train_vision.py:  668: Epoch: [10][290/329], lr: 1.68e-04, eta: 5:15:28	Time 2.996 (3.009)	Data 0.056 (0.062)	Mem 41.61GB	Prec@1 90.000 (79.038)	Loss 1.0913 (1.4064)
[02/27 12:19:44][INFO] train_vision.py:  668: Epoch: [10][300/329], lr: 1.67e-04, eta: 5:14:55	Time 2.996 (3.008)	Data 0.056 (0.062)	Mem 41.61GB	Prec@1 80.000 (78.870)	Loss 2.0013 (1.4099)
[02/27 12:20:14][INFO] train_vision.py:  668: Epoch: [10][310/329], lr: 1.67e-04, eta: 5:14:24	Time 3.002 (3.008)	Data 0.050 (0.062)	Mem 41.61GB	Prec@1 90.000 (79.068)	Loss 1.1910 (1.4064)
[02/27 12:20:44][INFO] train_vision.py:  668: Epoch: [10][320/329], lr: 1.67e-04, eta: 5:13:52	Time 2.975 (3.008)	Data 0.065 (0.062)	Mem 41.61GB	Prec@1 80.000 (78.910)	Loss 1.1306 (1.4075)
[02/27 12:21:13][INFO] train_vision.py:  668: Epoch: [11][0/329], lr: 1.67e-04, eta: 9:34:35	Time 5.514 (5.514)	Data 2.606 (2.606)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.6554 (1.6554)
[02/27 12:21:43][INFO] train_vision.py:  668: Epoch: [11][10/329], lr: 1.66e-04, eta: 5:36:07	Time 3.011 (3.231)	Data 0.075 (0.292)	Mem 41.61GB	Prec@1 90.000 (84.545)	Loss 0.9725 (1.2227)
[02/27 12:22:13][INFO] train_vision.py:  668: Epoch: [11][20/329], lr: 1.66e-04, eta: 5:24:23	Time 3.013 (3.123)	Data 0.054 (0.177)	Mem 41.61GB	Prec@1 90.000 (80.952)	Loss 1.0632 (1.2809)
[02/27 12:22:44][INFO] train_vision.py:  668: Epoch: [11][30/329], lr: 1.66e-04, eta: 5:20:07	Time 3.030 (3.087)	Data 0.072 (0.137)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.1829 (1.3076)
[02/27 12:23:14][INFO] train_vision.py:  668: Epoch: [11][40/329], lr: 1.66e-04, eta: 5:17:25	Time 2.985 (3.066)	Data 0.050 (0.117)	Mem 41.61GB	Prec@1 90.000 (79.756)	Loss 1.2844 (1.3231)
[02/27 12:23:44][INFO] train_vision.py:  668: Epoch: [11][50/329], lr: 1.65e-04, eta: 5:15:51	Time 3.024 (3.056)	Data 0.053 (0.103)	Mem 41.61GB	Prec@1 90.000 (80.980)	Loss 1.2303 (1.3088)
[02/27 12:24:14][INFO] train_vision.py:  668: Epoch: [11][60/329], lr: 1.65e-04, eta: 5:14:31	Time 3.041 (3.048)	Data 0.062 (0.094)	Mem 41.61GB	Prec@1 100.000 (80.492)	Loss 1.0555 (1.3249)
[02/27 12:24:44][INFO] train_vision.py:  668: Epoch: [11][70/329], lr: 1.65e-04, eta: 5:13:28	Time 2.995 (3.042)	Data 0.054 (0.088)	Mem 41.61GB	Prec@1 80.000 (79.437)	Loss 1.3237 (1.3518)
[02/27 12:25:14][INFO] train_vision.py:  668: Epoch: [11][80/329], lr: 1.64e-04, eta: 5:12:32	Time 3.025 (3.038)	Data 0.061 (0.084)	Mem 41.61GB	Prec@1 80.000 (79.012)	Loss 1.3256 (1.3625)
[02/27 12:25:44][INFO] train_vision.py:  668: Epoch: [11][90/329], lr: 1.64e-04, eta: 5:11:41	Time 3.009 (3.035)	Data 0.046 (0.080)	Mem 41.61GB	Prec@1 70.000 (79.451)	Loss 1.6306 (1.3628)
[02/27 12:26:14][INFO] train_vision.py:  668: Epoch: [11][100/329], lr: 1.64e-04, eta: 5:10:52	Time 3.025 (3.032)	Data 0.058 (0.077)	Mem 41.61GB	Prec@1 50.000 (79.505)	Loss 2.1403 (1.3600)
[02/27 12:26:44][INFO] train_vision.py:  668: Epoch: [11][110/329], lr: 1.64e-04, eta: 5:10:07	Time 2.975 (3.030)	Data 0.056 (0.075)	Mem 41.61GB	Prec@1 70.000 (79.369)	Loss 1.6658 (1.3650)
[02/27 12:27:14][INFO] train_vision.py:  668: Epoch: [11][120/329], lr: 1.63e-04, eta: 5:09:29	Time 3.020 (3.028)	Data 0.056 (0.072)	Mem 41.61GB	Prec@1 80.000 (79.587)	Loss 1.4454 (1.3553)
[02/27 12:27:44][INFO] train_vision.py:  668: Epoch: [11][130/329], lr: 1.63e-04, eta: 5:08:49	Time 3.019 (3.027)	Data 0.051 (0.070)	Mem 41.61GB	Prec@1 80.000 (79.313)	Loss 1.4959 (1.3656)
[02/27 12:28:15][INFO] train_vision.py:  668: Epoch: [11][140/329], lr: 1.63e-04, eta: 5:08:13	Time 3.044 (3.026)	Data 0.021 (0.069)	Mem 41.61GB	Prec@1 60.000 (79.433)	Loss 1.8301 (1.3729)
[02/27 12:28:45][INFO] train_vision.py:  668: Epoch: [11][150/329], lr: 1.62e-04, eta: 5:07:38	Time 3.020 (3.025)	Data 0.056 (0.068)	Mem 41.61GB	Prec@1 60.000 (78.808)	Loss 1.7133 (1.3868)
[02/27 12:29:15][INFO] train_vision.py:  668: Epoch: [11][160/329], lr: 1.62e-04, eta: 5:07:02	Time 3.014 (3.024)	Data 0.062 (0.067)	Mem 41.61GB	Prec@1 70.000 (78.820)	Loss 1.6773 (1.3875)
[02/27 12:29:45][INFO] train_vision.py:  668: Epoch: [11][170/329], lr: 1.62e-04, eta: 5:06:26	Time 2.996 (3.023)	Data 0.053 (0.066)	Mem 41.61GB	Prec@1 80.000 (79.181)	Loss 1.3698 (1.3820)
[02/27 12:30:15][INFO] train_vision.py:  668: Epoch: [11][180/329], lr: 1.62e-04, eta: 5:05:50	Time 2.986 (3.022)	Data 0.050 (0.065)	Mem 41.61GB	Prec@1 80.000 (79.116)	Loss 1.1739 (1.3881)
[02/27 12:30:45][INFO] train_vision.py:  668: Epoch: [11][190/329], lr: 1.61e-04, eta: 5:05:13	Time 2.996 (3.021)	Data 0.056 (0.065)	Mem 41.61GB	Prec@1 60.000 (79.319)	Loss 1.4002 (1.3777)
[02/27 12:31:15][INFO] train_vision.py:  668: Epoch: [11][200/329], lr: 1.61e-04, eta: 5:04:36	Time 2.993 (3.020)	Data 0.043 (0.064)	Mem 41.61GB	Prec@1 90.000 (79.652)	Loss 1.0768 (1.3719)
[02/27 12:31:45][INFO] train_vision.py:  668: Epoch: [11][210/329], lr: 1.61e-04, eta: 5:04:03	Time 3.004 (3.019)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.7277 (1.3657)
[02/27 12:32:15][INFO] train_vision.py:  668: Epoch: [11][220/329], lr: 1.60e-04, eta: 5:03:25	Time 2.997 (3.018)	Data 0.052 (0.063)	Mem 41.61GB	Prec@1 50.000 (79.819)	Loss 2.3111 (1.3703)
[02/27 12:32:45][INFO] train_vision.py:  668: Epoch: [11][230/329], lr: 1.60e-04, eta: 5:02:48	Time 2.995 (3.017)	Data 0.055 (0.063)	Mem 41.61GB	Prec@1 60.000 (79.913)	Loss 1.8109 (1.3710)
[02/27 12:33:15][INFO] train_vision.py:  668: Epoch: [11][240/329], lr: 1.60e-04, eta: 5:02:12	Time 2.993 (3.016)	Data 0.055 (0.062)	Mem 41.61GB	Prec@1 90.000 (79.959)	Loss 1.0955 (1.3710)
[02/27 12:33:45][INFO] train_vision.py:  668: Epoch: [11][250/329], lr: 1.60e-04, eta: 5:01:37	Time 3.017 (3.015)	Data 0.022 (0.062)	Mem 41.61GB	Prec@1 90.000 (79.880)	Loss 1.1251 (1.3718)
[02/27 12:34:15][INFO] train_vision.py:  668: Epoch: [11][260/329], lr: 1.59e-04, eta: 5:01:02	Time 2.980 (3.015)	Data 0.033 (0.062)	Mem 41.61GB	Prec@1 60.000 (79.732)	Loss 1.8535 (1.3780)
[02/27 12:34:45][INFO] train_vision.py:  668: Epoch: [11][270/329], lr: 1.59e-04, eta: 5:00:28	Time 2.993 (3.014)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 90.000 (79.889)	Loss 1.3489 (1.3764)
[02/27 12:35:15][INFO] train_vision.py:  668: Epoch: [11][280/329], lr: 1.59e-04, eta: 4:59:56	Time 2.994 (3.013)	Data 0.052 (0.061)	Mem 41.61GB	Prec@1 70.000 (79.858)	Loss 1.7754 (1.3774)
[02/27 12:35:45][INFO] train_vision.py:  668: Epoch: [11][290/329], lr: 1.58e-04, eta: 4:59:22	Time 2.990 (3.013)	Data 0.055 (0.061)	Mem 41.61GB	Prec@1 70.000 (79.931)	Loss 1.5502 (1.3759)
[02/27 12:36:15][INFO] train_vision.py:  668: Epoch: [11][300/329], lr: 1.58e-04, eta: 4:58:48	Time 3.005 (3.012)	Data 0.081 (0.061)	Mem 41.61GB	Prec@1 100.000 (80.100)	Loss 0.9571 (1.3727)
[02/27 12:36:45][INFO] train_vision.py:  668: Epoch: [11][310/329], lr: 1.58e-04, eta: 4:58:17	Time 2.960 (3.012)	Data 0.060 (0.060)	Mem 41.61GB	Prec@1 80.000 (80.096)	Loss 1.4075 (1.3715)
[02/27 12:37:15][INFO] train_vision.py:  668: Epoch: [11][320/329], lr: 1.58e-04, eta: 4:57:45	Time 2.988 (3.012)	Data 0.051 (0.060)	Mem 41.61GB	Prec@1 100.000 (80.436)	Loss 0.9263 (1.3663)
[02/27 12:37:46][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 97.500 (97.500)	Prec@5 100.000 (100.000)	mPrec@1 (31.111)	mPrec@5 (32.323)
[02/27 12:38:28][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 98.750 (95.227)	Prec@5 100.000 (100.000)	mPrec@1 (78.115)	mPrec@5 (86.869)
[02/27 12:39:10][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 96.250 (93.810)	Prec@5 100.000 (99.821)	mPrec@1 (85.234)	mPrec@5 (97.634)
[02/27 12:39:52][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 97.500 (94.194)	Prec@5 100.000 (99.839)	mPrec@1 (88.242)	mPrec@5 (98.783)
[02/27 12:40:35][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 87.500 (94.146)	Prec@5 100.000 (99.878)	mPrec@1 (88.988)	mPrec@5 (99.868)
[02/27 12:41:17][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 93.750 (93.480)	Prec@5 100.000 (99.853)	mPrec@1 (88.352)	mPrec@5 (99.850)
[02/27 12:41:59][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 97.500 (93.627)	Prec@5 100.000 (99.877)	mPrec@1 (88.717)	mPrec@5 (99.871)
[02/27 12:42:41][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 86.250 (93.715)	Prec@5 98.750 (99.877)	mPrec@1 (88.983)	mPrec@5 (99.838)
[02/27 12:43:24][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 95.000 (93.920)	Prec@5 100.000 (99.861)	mPrec@1 (89.267)	mPrec@5 (99.792)
[02/27 12:44:06][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 97.500 (93.709)	Prec@5 100.000 (99.876)	mPrec@1 (89.344)	mPrec@5 (99.811)
[02/27 12:44:49][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (93.762)	Prec@5 100.000 (99.876)	mPrec@1 (89.456)	mPrec@5 (99.793)
[02/27 12:45:12][INFO] train_vision.py:  847: Overall Prec@1 93.509% Prec@5 99.871% mPrec@1 (89.362) mPrec@5 (99.782)
[02/27 12:45:12][INFO] train_vision.py:  464: Testing: 89.36155700683594/89.36155700683594
[02/27 12:45:12][INFO] train_vision.py:  465: Saving:
[02/27 12:45:31][INFO] train_vision.py:  668: Epoch: [12][0/329], lr: 1.57e-04, eta: 8:32:38	Time 5.193 (5.193)	Data 2.335 (2.335)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.4323 (1.4323)
[02/27 12:46:01][INFO] train_vision.py:  668: Epoch: [12][10/329], lr: 1.57e-04, eta: 5:15:21	Time 3.002 (3.200)	Data 0.089 (0.273)	Mem 41.61GB	Prec@1 80.000 (85.455)	Loss 1.2471 (1.2075)
[02/27 12:46:32][INFO] train_vision.py:  668: Epoch: [12][20/329], lr: 1.57e-04, eta: 5:05:58	Time 2.994 (3.110)	Data 0.056 (0.171)	Mem 41.61GB	Prec@1 70.000 (80.000)	Loss 1.2046 (1.3144)
[02/27 12:47:02][INFO] train_vision.py:  668: Epoch: [12][30/329], lr: 1.56e-04, eta: 5:02:21	Time 3.019 (3.078)	Data 0.038 (0.136)	Mem 41.61GB	Prec@1 80.000 (80.645)	Loss 1.4320 (1.3207)
[02/27 12:47:32][INFO] train_vision.py:  668: Epoch: [12][40/329], lr: 1.56e-04, eta: 5:00:02	Time 2.982 (3.060)	Data 0.057 (0.117)	Mem 41.61GB	Prec@1 100.000 (81.220)	Loss 0.9581 (1.2946)
[02/27 12:48:02][INFO] train_vision.py:  668: Epoch: [12][50/329], lr: 1.56e-04, eta: 4:58:35	Time 3.035 (3.051)	Data 0.033 (0.104)	Mem 41.61GB	Prec@1 90.000 (82.157)	Loss 1.0035 (1.2731)
[02/27 12:48:32][INFO] train_vision.py:  668: Epoch: [12][60/329], lr: 1.55e-04, eta: 4:57:35	Time 2.995 (3.045)	Data 0.048 (0.095)	Mem 41.61GB	Prec@1 70.000 (81.148)	Loss 1.3709 (1.3042)
[02/27 12:49:02][INFO] train_vision.py:  668: Epoch: [12][70/329], lr: 1.55e-04, eta: 4:56:34	Time 3.021 (3.040)	Data 0.079 (0.089)	Mem 41.61GB	Prec@1 70.000 (81.268)	Loss 1.2012 (1.3101)
[02/27 12:49:32][INFO] train_vision.py:  668: Epoch: [12][80/329], lr: 1.55e-04, eta: 4:55:40	Time 3.009 (3.036)	Data 0.063 (0.084)	Mem 41.61GB	Prec@1 100.000 (81.111)	Loss 0.9152 (1.3215)
[02/27 12:50:02][INFO] train_vision.py:  668: Epoch: [12][90/329], lr: 1.55e-04, eta: 4:54:51	Time 3.040 (3.033)	Data 0.066 (0.082)	Mem 41.61GB	Prec@1 90.000 (81.538)	Loss 1.2093 (1.3181)
[02/27 12:50:32][INFO] train_vision.py:  668: Epoch: [12][100/329], lr: 1.54e-04, eta: 4:54:05	Time 2.979 (3.030)	Data 0.057 (0.079)	Mem 41.61GB	Prec@1 80.000 (81.386)	Loss 1.3065 (1.3296)
[02/27 12:51:02][INFO] train_vision.py:  668: Epoch: [12][110/329], lr: 1.54e-04, eta: 4:53:25	Time 3.044 (3.029)	Data 0.063 (0.077)	Mem 41.61GB	Prec@1 70.000 (80.991)	Loss 1.7242 (1.3361)
[02/27 12:51:32][INFO] train_vision.py:  668: Epoch: [12][120/329], lr: 1.54e-04, eta: 4:52:42	Time 2.992 (3.026)	Data 0.046 (0.074)	Mem 41.61GB	Prec@1 70.000 (80.909)	Loss 1.5223 (1.3284)
[02/27 12:52:03][INFO] train_vision.py:  668: Epoch: [12][130/329], lr: 1.53e-04, eta: 4:52:04	Time 3.041 (3.025)	Data 0.065 (0.073)	Mem 41.61GB	Prec@1 90.000 (80.687)	Loss 1.1698 (1.3287)
[02/27 12:52:33][INFO] train_vision.py:  668: Epoch: [12][140/329], lr: 1.53e-04, eta: 4:51:27	Time 3.009 (3.024)	Data 0.037 (0.071)	Mem 41.61GB	Prec@1 70.000 (80.284)	Loss 1.5512 (1.3378)
[02/27 12:53:03][INFO] train_vision.py:  668: Epoch: [12][150/329], lr: 1.53e-04, eta: 4:50:52	Time 3.023 (3.023)	Data 0.066 (0.070)	Mem 41.61GB	Prec@1 80.000 (80.397)	Loss 1.5804 (1.3354)
[02/27 12:53:33][INFO] train_vision.py:  668: Epoch: [12][160/329], lr: 1.52e-04, eta: 4:50:14	Time 2.991 (3.022)	Data 0.058 (0.069)	Mem 41.61GB	Prec@1 70.000 (80.559)	Loss 1.3909 (1.3288)
[02/27 12:54:03][INFO] train_vision.py:  668: Epoch: [12][170/329], lr: 1.52e-04, eta: 4:49:37	Time 3.007 (3.021)	Data 0.061 (0.068)	Mem 41.61GB	Prec@1 70.000 (80.409)	Loss 1.7736 (1.3305)
[02/27 12:54:33][INFO] train_vision.py:  668: Epoch: [12][180/329], lr: 1.52e-04, eta: 4:49:02	Time 3.031 (3.020)	Data 0.023 (0.067)	Mem 41.61GB	Prec@1 90.000 (80.552)	Loss 1.3011 (1.3347)
[02/27 12:55:03][INFO] train_vision.py:  668: Epoch: [12][190/329], lr: 1.51e-04, eta: 4:48:27	Time 3.034 (3.019)	Data 0.051 (0.066)	Mem 41.61GB	Prec@1 100.000 (80.628)	Loss 1.0660 (1.3319)
[02/27 12:55:33][INFO] train_vision.py:  668: Epoch: [12][200/329], lr: 1.51e-04, eta: 4:47:52	Time 3.008 (3.018)	Data 0.056 (0.066)	Mem 41.61GB	Prec@1 60.000 (80.846)	Loss 1.8166 (1.3295)
[02/27 12:56:03][INFO] train_vision.py:  668: Epoch: [12][210/329], lr: 1.51e-04, eta: 4:47:18	Time 3.002 (3.017)	Data 0.057 (0.065)	Mem 41.61GB	Prec@1 70.000 (80.806)	Loss 1.6913 (1.3307)
[02/27 12:56:33][INFO] train_vision.py:  668: Epoch: [12][220/329], lr: 1.51e-04, eta: 4:46:43	Time 2.996 (3.017)	Data 0.060 (0.065)	Mem 41.61GB	Prec@1 80.000 (80.905)	Loss 1.4786 (1.3273)
[02/27 12:57:03][INFO] train_vision.py:  668: Epoch: [12][230/329], lr: 1.50e-04, eta: 4:46:10	Time 3.006 (3.016)	Data 0.059 (0.064)	Mem 41.61GB	Prec@1 70.000 (80.476)	Loss 1.5475 (1.3355)
[02/27 12:57:33][INFO] train_vision.py:  668: Epoch: [12][240/329], lr: 1.50e-04, eta: 4:45:38	Time 3.002 (3.016)	Data 0.064 (0.064)	Mem 41.61GB	Prec@1 90.000 (80.415)	Loss 1.0178 (1.3363)
[02/27 12:58:03][INFO] train_vision.py:  668: Epoch: [12][250/329], lr: 1.50e-04, eta: 4:45:04	Time 3.015 (3.015)	Data 0.058 (0.064)	Mem 41.61GB	Prec@1 100.000 (80.398)	Loss 0.9770 (1.3392)
[02/27 12:58:33][INFO] train_vision.py:  668: Epoch: [12][260/329], lr: 1.49e-04, eta: 4:44:31	Time 3.003 (3.015)	Data 0.048 (0.063)	Mem 41.61GB	Prec@1 90.000 (80.690)	Loss 0.9518 (1.3347)
[02/27 12:59:03][INFO] train_vision.py:  668: Epoch: [12][270/329], lr: 1.49e-04, eta: 4:43:57	Time 2.998 (3.014)	Data 0.056 (0.063)	Mem 41.61GB	Prec@1 80.000 (80.517)	Loss 1.2278 (1.3414)
[02/27 12:59:33][INFO] train_vision.py:  668: Epoch: [12][280/329], lr: 1.49e-04, eta: 4:43:24	Time 2.990 (3.013)	Data 0.049 (0.063)	Mem 41.61GB	Prec@1 80.000 (80.463)	Loss 1.1365 (1.3403)
[02/27 13:00:03][INFO] train_vision.py:  668: Epoch: [12][290/329], lr: 1.48e-04, eta: 4:42:53	Time 3.006 (3.013)	Data 0.060 (0.062)	Mem 41.61GB	Prec@1 80.000 (80.344)	Loss 1.5281 (1.3436)
[02/27 13:00:33][INFO] train_vision.py:  668: Epoch: [12][300/329], lr: 1.48e-04, eta: 4:42:22	Time 2.997 (3.013)	Data 0.055 (0.062)	Mem 41.61GB	Prec@1 80.000 (80.365)	Loss 1.1767 (1.3432)
[02/27 13:01:03][INFO] train_vision.py:  668: Epoch: [12][310/329], lr: 1.48e-04, eta: 4:41:49	Time 3.006 (3.013)	Data 0.056 (0.062)	Mem 41.61GB	Prec@1 70.000 (80.289)	Loss 1.4566 (1.3453)
[02/27 13:01:33][INFO] train_vision.py:  668: Epoch: [12][320/329], lr: 1.47e-04, eta: 4:41:18	Time 3.009 (3.012)	Data 0.042 (0.062)	Mem 41.61GB	Prec@1 70.000 (80.156)	Loss 1.5147 (1.3489)
[02/27 13:02:03][INFO] train_vision.py:  668: Epoch: [13][0/329], lr: 1.47e-04, eta: 8:36:11	Time 5.537 (5.537)	Data 2.416 (2.416)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 0.9961 (0.9961)
[02/27 13:02:33][INFO] train_vision.py:  668: Epoch: [13][10/329], lr: 1.47e-04, eta: 5:01:51	Time 3.037 (3.243)	Data 0.025 (0.272)	Mem 41.61GB	Prec@1 90.000 (90.909)	Loss 1.1068 (1.0655)
[02/27 13:03:04][INFO] train_vision.py:  668: Epoch: [13][20/329], lr: 1.46e-04, eta: 4:51:59	Time 3.042 (3.143)	Data 0.072 (0.175)	Mem 41.61GB	Prec@1 80.000 (88.571)	Loss 1.2720 (1.1535)
[02/27 13:03:34][INFO] train_vision.py:  668: Epoch: [13][30/329], lr: 1.46e-04, eta: 4:47:53	Time 3.058 (3.105)	Data 0.080 (0.140)	Mem 41.61GB	Prec@1 70.000 (86.129)	Loss 1.6435 (1.2176)
[02/27 13:04:04][INFO] train_vision.py:  668: Epoch: [13][40/329], lr: 1.46e-04, eta: 4:45:25	Time 3.031 (3.084)	Data 0.047 (0.122)	Mem 41.61GB	Prec@1 80.000 (83.415)	Loss 1.2440 (1.2675)
[02/27 13:04:34][INFO] train_vision.py:  668: Epoch: [13][50/329], lr: 1.45e-04, eta: 4:43:44	Time 3.001 (3.071)	Data 0.051 (0.111)	Mem 41.61GB	Prec@1 80.000 (82.353)	Loss 1.5432 (1.2937)
[02/27 13:05:04][INFO] train_vision.py:  668: Epoch: [13][60/329], lr: 1.45e-04, eta: 4:42:23	Time 3.003 (3.062)	Data 0.078 (0.104)	Mem 41.61GB	Prec@1 80.000 (81.967)	Loss 1.2859 (1.3007)
[02/27 13:05:35][INFO] train_vision.py:  668: Epoch: [13][70/329], lr: 1.45e-04, eta: 4:41:22	Time 3.053 (3.056)	Data 0.022 (0.098)	Mem 41.61GB	Prec@1 100.000 (82.535)	Loss 0.9322 (1.2882)
[02/27 13:06:05][INFO] train_vision.py:  668: Epoch: [13][80/329], lr: 1.44e-04, eta: 4:40:28	Time 3.015 (3.052)	Data 0.042 (0.093)	Mem 41.61GB	Prec@1 80.000 (82.469)	Loss 1.3114 (1.2947)
[02/27 13:06:35][INFO] train_vision.py:  668: Epoch: [13][90/329], lr: 1.44e-04, eta: 4:39:33	Time 2.994 (3.047)	Data 0.058 (0.090)	Mem 41.61GB	Prec@1 60.000 (82.088)	Loss 1.6834 (1.3071)
[02/27 13:07:05][INFO] train_vision.py:  668: Epoch: [13][100/329], lr: 1.44e-04, eta: 4:38:42	Time 2.992 (3.044)	Data 0.049 (0.087)	Mem 41.61GB	Prec@1 90.000 (82.772)	Loss 1.1202 (1.3012)
[02/27 13:07:35][INFO] train_vision.py:  668: Epoch: [13][110/329], lr: 1.43e-04, eta: 4:37:52	Time 3.021 (3.040)	Data 0.092 (0.084)	Mem 41.61GB	Prec@1 100.000 (82.523)	Loss 1.0316 (1.3088)
[02/27 13:08:05][INFO] train_vision.py:  668: Epoch: [13][120/329], lr: 1.43e-04, eta: 4:37:05	Time 3.006 (3.037)	Data 0.072 (0.082)	Mem 41.61GB	Prec@1 80.000 (82.066)	Loss 1.1752 (1.3183)
[02/27 13:08:35][INFO] train_vision.py:  668: Epoch: [13][130/329], lr: 1.43e-04, eta: 4:36:22	Time 3.010 (3.035)	Data 0.052 (0.080)	Mem 41.61GB	Prec@1 100.000 (82.290)	Loss 1.0982 (1.3080)
[02/27 13:09:05][INFO] train_vision.py:  668: Epoch: [13][140/329], lr: 1.42e-04, eta: 4:35:39	Time 2.971 (3.032)	Data 0.050 (0.079)	Mem 41.61GB	Prec@1 60.000 (82.411)	Loss 1.6477 (1.3096)
[02/27 13:09:35][INFO] train_vision.py:  668: Epoch: [13][150/329], lr: 1.42e-04, eta: 4:34:59	Time 3.003 (3.031)	Data 0.052 (0.077)	Mem 41.61GB	Prec@1 80.000 (82.252)	Loss 1.3545 (1.3094)
[02/27 13:10:05][INFO] train_vision.py:  668: Epoch: [13][160/329], lr: 1.42e-04, eta: 4:34:19	Time 2.996 (3.029)	Data 0.050 (0.076)	Mem 41.61GB	Prec@1 100.000 (82.609)	Loss 1.0830 (1.3061)
[02/27 13:10:35][INFO] train_vision.py:  668: Epoch: [13][170/329], lr: 1.41e-04, eta: 4:33:41	Time 3.013 (3.028)	Data 0.058 (0.075)	Mem 41.61GB	Prec@1 100.000 (82.749)	Loss 0.9228 (1.2990)
[02/27 13:11:05][INFO] train_vision.py:  668: Epoch: [13][180/329], lr: 1.41e-04, eta: 4:33:04	Time 3.032 (3.026)	Data 0.022 (0.073)	Mem 41.61GB	Prec@1 70.000 (82.652)	Loss 1.2791 (1.3010)
[02/27 13:11:35][INFO] train_vision.py:  668: Epoch: [13][190/329], lr: 1.41e-04, eta: 4:32:27	Time 3.000 (3.025)	Data 0.043 (0.072)	Mem 41.61GB	Prec@1 80.000 (82.565)	Loss 1.2217 (1.3037)
[02/27 13:12:06][INFO] train_vision.py:  668: Epoch: [13][200/329], lr: 1.40e-04, eta: 4:31:51	Time 3.004 (3.024)	Data 0.047 (0.071)	Mem 41.61GB	Prec@1 70.000 (82.239)	Loss 1.5296 (1.3103)
[02/27 13:12:36][INFO] train_vision.py:  668: Epoch: [13][210/329], lr: 1.40e-04, eta: 4:31:14	Time 2.969 (3.023)	Data 0.051 (0.070)	Mem 41.61GB	Prec@1 100.000 (82.275)	Loss 1.0143 (1.3125)
[02/27 13:13:06][INFO] train_vision.py:  668: Epoch: [13][220/329], lr: 1.40e-04, eta: 4:30:39	Time 2.997 (3.022)	Data 0.049 (0.069)	Mem 41.61GB	Prec@1 90.000 (82.262)	Loss 1.0695 (1.3112)
[02/27 13:13:35][INFO] train_vision.py:  668: Epoch: [13][230/329], lr: 1.39e-04, eta: 4:30:03	Time 2.994 (3.021)	Data 0.043 (0.068)	Mem 41.61GB	Prec@1 70.000 (82.208)	Loss 1.5878 (1.3162)
[02/27 13:14:05][INFO] train_vision.py:  668: Epoch: [13][240/329], lr: 1.39e-04, eta: 4:29:28	Time 3.009 (3.020)	Data 0.053 (0.068)	Mem 41.61GB	Prec@1 100.000 (82.490)	Loss 0.9260 (1.3080)
[02/27 13:14:35][INFO] train_vision.py:  668: Epoch: [13][250/329], lr: 1.39e-04, eta: 4:28:53	Time 2.993 (3.019)	Data 0.048 (0.067)	Mem 41.61GB	Prec@1 80.000 (82.590)	Loss 1.1668 (1.3047)
[02/27 13:15:05][INFO] train_vision.py:  668: Epoch: [13][260/329], lr: 1.38e-04, eta: 4:28:18	Time 2.997 (3.018)	Data 0.051 (0.067)	Mem 41.61GB	Prec@1 70.000 (82.529)	Loss 1.5558 (1.3050)
[02/27 13:15:35][INFO] train_vision.py:  668: Epoch: [13][270/329], lr: 1.38e-04, eta: 4:27:44	Time 3.003 (3.017)	Data 0.077 (0.066)	Mem 41.61GB	Prec@1 70.000 (82.694)	Loss 1.4625 (1.3016)
[02/27 13:16:05][INFO] train_vision.py:  668: Epoch: [13][280/329], lr: 1.38e-04, eta: 4:27:10	Time 2.996 (3.017)	Data 0.032 (0.066)	Mem 41.61GB	Prec@1 90.000 (82.811)	Loss 1.1928 (1.2982)
[02/27 13:16:35][INFO] train_vision.py:  668: Epoch: [13][290/329], lr: 1.37e-04, eta: 4:26:35	Time 2.991 (3.016)	Data 0.052 (0.065)	Mem 41.61GB	Prec@1 80.000 (82.612)	Loss 1.4285 (1.3012)
[02/27 13:17:05][INFO] train_vision.py:  668: Epoch: [13][300/329], lr: 1.37e-04, eta: 4:26:02	Time 2.997 (3.015)	Data 0.049 (0.065)	Mem 41.61GB	Prec@1 60.000 (82.425)	Loss 1.9965 (1.3053)
[02/27 13:17:35][INFO] train_vision.py:  668: Epoch: [13][310/329], lr: 1.37e-04, eta: 4:25:29	Time 3.019 (3.015)	Data 0.058 (0.064)	Mem 41.61GB	Prec@1 70.000 (82.572)	Loss 1.7816 (1.3018)
[02/27 13:18:05][INFO] train_vision.py:  668: Epoch: [13][320/329], lr: 1.36e-04, eta: 4:24:56	Time 2.996 (3.014)	Data 0.052 (0.064)	Mem 41.61GB	Prec@1 80.000 (82.648)	Loss 1.1040 (1.2989)
[02/27 13:18:36][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 97.500 (97.500)	Prec@5 100.000 (100.000)	mPrec@1 (31.111)	mPrec@5 (32.323)
[02/27 13:19:19][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 98.750 (96.250)	Prec@5 100.000 (100.000)	mPrec@1 (79.624)	mPrec@5 (86.869)
[02/27 13:20:01][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 95.000 (94.762)	Prec@5 100.000 (99.821)	mPrec@1 (87.576)	mPrec@5 (97.441)
[02/27 13:20:44][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 97.500 (94.919)	Prec@5 100.000 (99.839)	mPrec@1 (89.733)	mPrec@5 (98.678)
[02/27 13:21:27][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 92.500 (94.787)	Prec@5 100.000 (99.787)	mPrec@1 (91.006)	mPrec@5 (99.617)
[02/27 13:22:09][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 97.500 (94.020)	Prec@5 100.000 (99.804)	mPrec@1 (89.734)	mPrec@5 (99.625)
[02/27 13:22:52][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 98.750 (94.283)	Prec@5 100.000 (99.836)	mPrec@1 (90.076)	mPrec@5 (99.688)
[02/27 13:23:34][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 90.000 (94.261)	Prec@5 100.000 (99.859)	mPrec@1 (90.232)	mPrec@5 (99.737)
[02/27 13:24:17][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 96.250 (94.352)	Prec@5 100.000 (99.861)	mPrec@1 (90.255)	mPrec@5 (99.734)
[02/27 13:25:00][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 98.750 (93.997)	Prec@5 100.000 (99.876)	mPrec@1 (90.135)	mPrec@5 (99.753)
[02/27 13:25:42][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (93.998)	Prec@5 100.000 (99.889)	mPrec@1 (90.140)	mPrec@5 (99.774)
[02/27 13:26:06][INFO] train_vision.py:  847: Overall Prec@1 93.826% Prec@5 99.883% mPrec@1 (90.126) mPrec@5 (99.766)
[02/27 13:26:06][INFO] train_vision.py:  464: Testing: 90.12641143798828/90.12641143798828
[02/27 13:26:06][INFO] train_vision.py:  465: Saving:
[02/27 13:26:25][INFO] train_vision.py:  668: Epoch: [14][0/329], lr: 1.36e-04, eta: 7:28:57	Time 5.116 (5.116)	Data 2.232 (2.232)	Mem 41.61GB	Prec@1 100.000 (100.000)	Loss 1.0152 (1.0152)
[02/27 13:26:55][INFO] train_vision.py:  668: Epoch: [14][10/329], lr: 1.36e-04, eta: 4:39:20	Time 2.992 (3.189)	Data 0.043 (0.266)	Mem 41.61GB	Prec@1 70.000 (81.818)	Loss 1.4870 (1.2293)
[02/27 13:27:25][INFO] train_vision.py:  668: Epoch: [14][20/329], lr: 1.35e-04, eta: 4:31:34	Time 3.032 (3.107)	Data 0.056 (0.169)	Mem 41.61GB	Prec@1 90.000 (82.381)	Loss 1.2672 (1.2608)
[02/27 13:27:55][INFO] train_vision.py:  668: Epoch: [14][30/329], lr: 1.35e-04, eta: 4:28:09	Time 2.993 (3.073)	Data 0.054 (0.130)	Mem 41.61GB	Prec@1 80.000 (81.935)	Loss 1.4798 (1.2846)
[02/27 13:28:25][INFO] train_vision.py:  668: Epoch: [14][40/329], lr: 1.35e-04, eta: 4:26:24	Time 3.012 (3.059)	Data 0.038 (0.110)	Mem 41.61GB	Prec@1 100.000 (82.439)	Loss 0.8848 (1.2745)
[02/27 13:28:56][INFO] train_vision.py:  668: Epoch: [14][50/329], lr: 1.34e-04, eta: 4:24:58	Time 3.019 (3.049)	Data 0.023 (0.098)	Mem 41.61GB	Prec@1 70.000 (81.765)	Loss 1.8494 (1.3046)
[02/27 13:29:26][INFO] train_vision.py:  668: Epoch: [14][60/329], lr: 1.34e-04, eta: 4:23:57	Time 3.000 (3.043)	Data 0.058 (0.091)	Mem 41.61GB	Prec@1 100.000 (82.131)	Loss 0.9254 (1.3026)
[02/27 13:29:56][INFO] train_vision.py:  668: Epoch: [14][70/329], lr: 1.34e-04, eta: 4:23:02	Time 2.997 (3.038)	Data 0.048 (0.085)	Mem 41.61GB	Prec@1 80.000 (81.831)	Loss 1.1730 (1.2975)
[02/27 13:30:26][INFO] train_vision.py:  668: Epoch: [14][80/329], lr: 1.33e-04, eta: 4:22:16	Time 3.011 (3.035)	Data 0.057 (0.081)	Mem 41.61GB	Prec@1 80.000 (81.728)	Loss 1.4110 (1.2952)
[02/27 13:30:56][INFO] train_vision.py:  668: Epoch: [14][90/329], lr: 1.33e-04, eta: 4:21:28	Time 2.993 (3.032)	Data 0.048 (0.078)	Mem 41.61GB	Prec@1 90.000 (81.868)	Loss 1.0175 (1.2984)
[02/27 13:31:26][INFO] train_vision.py:  668: Epoch: [14][100/329], lr: 1.33e-04, eta: 4:20:43	Time 3.024 (3.029)	Data 0.021 (0.075)	Mem 41.61GB	Prec@1 90.000 (82.574)	Loss 1.3568 (1.2913)
[02/27 13:31:56][INFO] train_vision.py:  668: Epoch: [14][110/329], lr: 1.32e-04, eta: 4:20:01	Time 2.983 (3.026)	Data 0.057 (0.073)	Mem 41.61GB	Prec@1 100.000 (83.243)	Loss 1.0577 (1.2813)
[02/27 13:32:26][INFO] train_vision.py:  668: Epoch: [14][120/329], lr: 1.32e-04, eta: 4:19:23	Time 3.009 (3.025)	Data 0.024 (0.071)	Mem 41.61GB	Prec@1 80.000 (83.306)	Loss 1.2902 (1.2884)
[02/27 13:32:56][INFO] train_vision.py:  668: Epoch: [14][130/329], lr: 1.32e-04, eta: 4:18:44	Time 2.990 (3.023)	Data 0.051 (0.069)	Mem 41.61GB	Prec@1 100.000 (83.282)	Loss 0.9324 (1.2845)
[02/27 13:33:26][INFO] train_vision.py:  668: Epoch: [14][140/329], lr: 1.31e-04, eta: 4:18:08	Time 3.027 (3.022)	Data 0.030 (0.068)	Mem 41.61GB	Prec@1 90.000 (83.191)	Loss 1.0920 (1.2811)
[02/27 13:33:56][INFO] train_vision.py:  668: Epoch: [14][150/329], lr: 1.31e-04, eta: 4:17:31	Time 3.001 (3.021)	Data 0.056 (0.067)	Mem 41.61GB	Prec@1 80.000 (82.914)	Loss 1.1064 (1.2858)
[02/27 13:34:26][INFO] train_vision.py:  668: Epoch: [14][160/329], lr: 1.31e-04, eta: 4:16:57	Time 3.019 (3.020)	Data 0.065 (0.066)	Mem 41.61GB	Prec@1 60.000 (82.981)	Loss 1.4873 (1.2839)
[02/27 13:34:56][INFO] train_vision.py:  668: Epoch: [14][170/329], lr: 1.30e-04, eta: 4:16:21	Time 2.969 (3.019)	Data 0.050 (0.066)	Mem 41.61GB	Prec@1 90.000 (82.865)	Loss 1.0617 (1.2847)
[02/27 13:35:26][INFO] train_vision.py:  668: Epoch: [14][180/329], lr: 1.30e-04, eta: 4:15:49	Time 3.017 (3.019)	Data 0.052 (0.065)	Mem 41.61GB	Prec@1 70.000 (82.762)	Loss 1.6255 (1.2896)
[02/27 13:35:57][INFO] train_vision.py:  668: Epoch: [14][190/329], lr: 1.30e-04, eta: 4:15:17	Time 2.996 (3.018)	Data 0.057 (0.064)	Mem 41.61GB	Prec@1 80.000 (82.932)	Loss 1.9419 (1.2961)
[02/27 13:36:27][INFO] train_vision.py:  668: Epoch: [14][200/329], lr: 1.29e-04, eta: 4:14:43	Time 2.999 (3.017)	Data 0.058 (0.064)	Mem 41.61GB	Prec@1 80.000 (82.935)	Loss 1.1768 (1.2938)
[02/27 13:36:57][INFO] train_vision.py:  668: Epoch: [14][210/329], lr: 1.29e-04, eta: 4:14:09	Time 2.965 (3.017)	Data 0.062 (0.064)	Mem 41.61GB	Prec@1 80.000 (82.938)	Loss 1.0975 (1.2943)
[02/27 13:37:27][INFO] train_vision.py:  668: Epoch: [14][220/329], lr: 1.29e-04, eta: 4:13:38	Time 3.023 (3.016)	Data 0.061 (0.063)	Mem 41.61GB	Prec@1 60.000 (82.715)	Loss 1.4141 (1.2959)
[02/27 13:37:57][INFO] train_vision.py:  668: Epoch: [14][230/329], lr: 1.28e-04, eta: 4:13:06	Time 3.007 (3.016)	Data 0.055 (0.063)	Mem 41.61GB	Prec@1 70.000 (82.727)	Loss 1.6522 (1.2959)
[02/27 13:38:27][INFO] train_vision.py:  668: Epoch: [14][240/329], lr: 1.28e-04, eta: 4:12:33	Time 3.022 (3.016)	Data 0.023 (0.062)	Mem 41.61GB	Prec@1 90.000 (82.780)	Loss 1.2564 (1.2957)
[02/27 13:38:57][INFO] train_vision.py:  668: Epoch: [14][250/329], lr: 1.28e-04, eta: 4:12:00	Time 3.022 (3.015)	Data 0.024 (0.062)	Mem 41.61GB	Prec@1 80.000 (82.988)	Loss 1.0467 (1.2896)
[02/27 13:39:27][INFO] train_vision.py:  668: Epoch: [14][260/329], lr: 1.27e-04, eta: 4:11:26	Time 2.997 (3.014)	Data 0.058 (0.061)	Mem 41.61GB	Prec@1 80.000 (83.027)	Loss 1.0384 (1.2867)
[02/27 13:39:57][INFO] train_vision.py:  668: Epoch: [14][270/329], lr: 1.27e-04, eta: 4:10:52	Time 3.012 (3.013)	Data 0.021 (0.061)	Mem 41.61GB	Prec@1 80.000 (83.173)	Loss 1.1406 (1.2827)
[02/27 13:40:27][INFO] train_vision.py:  668: Epoch: [14][280/329], lr: 1.26e-04, eta: 4:10:19	Time 3.005 (3.013)	Data 0.059 (0.061)	Mem 41.61GB	Prec@1 70.000 (83.203)	Loss 1.5302 (1.2806)
[02/27 13:40:57][INFO] train_vision.py:  668: Epoch: [14][290/329], lr: 1.26e-04, eta: 4:09:46	Time 2.997 (3.012)	Data 0.060 (0.060)	Mem 41.61GB	Prec@1 80.000 (83.058)	Loss 1.3013 (1.2829)
[02/27 13:41:27][INFO] train_vision.py:  668: Epoch: [14][300/329], lr: 1.26e-04, eta: 4:09:13	Time 2.988 (3.012)	Data 0.046 (0.060)	Mem 41.61GB	Prec@1 90.000 (83.189)	Loss 1.1715 (1.2823)
[02/27 13:41:57][INFO] train_vision.py:  668: Epoch: [14][310/329], lr: 1.25e-04, eta: 4:08:40	Time 3.013 (3.011)	Data 0.023 (0.060)	Mem 41.61GB	Prec@1 100.000 (83.183)	Loss 0.8664 (1.2832)
[02/27 13:42:27][INFO] train_vision.py:  668: Epoch: [14][320/329], lr: 1.25e-04, eta: 4:08:09	Time 3.000 (3.011)	Data 0.063 (0.060)	Mem 41.61GB	Prec@1 80.000 (83.146)	Loss 1.3467 (1.2824)
[02/27 13:42:56][INFO] train_vision.py:  668: Epoch: [15][0/329], lr: 1.25e-04, eta: 7:37:45	Time 5.564 (5.564)	Data 2.476 (2.476)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.1621 (1.1621)
[02/27 13:43:27][INFO] train_vision.py:  668: Epoch: [15][10/329], lr: 1.24e-04, eta: 4:26:38	Time 2.999 (3.248)	Data 0.070 (0.292)	Mem 41.61GB	Prec@1 70.000 (78.182)	Loss 1.6097 (1.3516)
[02/27 13:43:57][INFO] train_vision.py:  668: Epoch: [15][20/329], lr: 1.24e-04, eta: 4:16:40	Time 3.028 (3.133)	Data 0.071 (0.181)	Mem 41.61GB	Prec@1 100.000 (82.857)	Loss 0.9127 (1.2910)
[02/27 13:44:27][INFO] train_vision.py:  668: Epoch: [15][30/329], lr: 1.24e-04, eta: 4:12:46	Time 3.016 (3.091)	Data 0.033 (0.142)	Mem 41.61GB	Prec@1 90.000 (83.871)	Loss 1.1680 (1.2658)
[02/27 13:44:57][INFO] train_vision.py:  668: Epoch: [15][40/329], lr: 1.23e-04, eta: 4:10:37	Time 3.018 (3.071)	Data 0.073 (0.122)	Mem 41.61GB	Prec@1 90.000 (84.878)	Loss 1.1597 (1.2420)
[02/27 13:45:27][INFO] train_vision.py:  668: Epoch: [15][50/329], lr: 1.23e-04, eta: 4:09:08	Time 2.988 (3.059)	Data 0.068 (0.111)	Mem 41.61GB	Prec@1 60.000 (84.706)	Loss 1.6809 (1.2528)
[02/27 13:45:57][INFO] train_vision.py:  668: Epoch: [15][60/329], lr: 1.23e-04, eta: 4:07:46	Time 2.995 (3.049)	Data 0.043 (0.100)	Mem 41.61GB	Prec@1 90.000 (84.262)	Loss 0.9959 (1.2547)
[02/27 13:46:27][INFO] train_vision.py:  668: Epoch: [15][70/329], lr: 1.22e-04, eta: 4:06:45	Time 3.027 (3.043)	Data 0.027 (0.095)	Mem 41.61GB	Prec@1 100.000 (83.662)	Loss 0.8429 (1.2739)
[02/27 13:46:57][INFO] train_vision.py:  668: Epoch: [15][80/329], lr: 1.22e-04, eta: 4:05:52	Time 3.035 (3.038)	Data 0.068 (0.090)	Mem 41.61GB	Prec@1 70.000 (83.457)	Loss 1.6361 (1.2741)
[02/27 13:47:27][INFO] train_vision.py:  668: Epoch: [15][90/329], lr: 1.22e-04, eta: 4:05:03	Time 2.997 (3.034)	Data 0.055 (0.086)	Mem 41.61GB	Prec@1 80.000 (83.626)	Loss 1.5597 (1.2782)
[02/27 13:47:57][INFO] train_vision.py:  668: Epoch: [15][100/329], lr: 1.21e-04, eta: 4:04:17	Time 3.019 (3.031)	Data 0.052 (0.083)	Mem 41.61GB	Prec@1 90.000 (83.861)	Loss 1.2781 (1.2804)
[02/27 13:48:27][INFO] train_vision.py:  668: Epoch: [15][110/329], lr: 1.21e-04, eta: 4:03:33	Time 3.015 (3.028)	Data 0.025 (0.080)	Mem 41.61GB	Prec@1 80.000 (83.604)	Loss 1.5000 (1.2876)
[02/27 13:48:57][INFO] train_vision.py:  668: Epoch: [15][120/329], lr: 1.20e-04, eta: 4:02:49	Time 3.026 (3.025)	Data 0.067 (0.078)	Mem 41.61GB	Prec@1 90.000 (84.050)	Loss 1.1266 (1.2788)
[02/27 13:49:27][INFO] train_vision.py:  668: Epoch: [15][130/329], lr: 1.20e-04, eta: 4:02:09	Time 2.992 (3.023)	Data 0.053 (0.076)	Mem 41.61GB	Prec@1 80.000 (84.351)	Loss 1.3775 (1.2723)
[02/27 13:49:57][INFO] train_vision.py:  668: Epoch: [15][140/329], lr: 1.20e-04, eta: 4:01:28	Time 3.039 (3.021)	Data 0.018 (0.073)	Mem 41.61GB	Prec@1 60.000 (84.468)	Loss 1.4109 (1.2742)
[02/27 13:50:27][INFO] train_vision.py:  668: Epoch: [15][150/329], lr: 1.19e-04, eta: 4:00:50	Time 2.992 (3.019)	Data 0.036 (0.072)	Mem 41.61GB	Prec@1 80.000 (84.570)	Loss 1.4046 (1.2713)
[02/27 13:50:57][INFO] train_vision.py:  668: Epoch: [15][160/329], lr: 1.19e-04, eta: 4:00:13	Time 3.026 (3.018)	Data 0.022 (0.071)	Mem 41.61GB	Prec@1 70.000 (84.286)	Loss 1.4248 (1.2719)
[02/27 13:51:27][INFO] train_vision.py:  668: Epoch: [15][170/329], lr: 1.19e-04, eta: 3:59:38	Time 2.997 (3.017)	Data 0.055 (0.070)	Mem 41.61GB	Prec@1 50.000 (84.211)	Loss 2.0562 (1.2769)
[02/27 13:51:57][INFO] train_vision.py:  668: Epoch: [15][180/329], lr: 1.18e-04, eta: 3:59:03	Time 3.013 (3.016)	Data 0.061 (0.069)	Mem 41.61GB	Prec@1 100.000 (84.365)	Loss 0.8641 (1.2699)
[02/27 13:52:27][INFO] train_vision.py:  668: Epoch: [15][190/329], lr: 1.18e-04, eta: 3:58:29	Time 3.002 (3.015)	Data 0.020 (0.068)	Mem 41.61GB	Prec@1 80.000 (84.084)	Loss 1.4769 (1.2751)
[02/27 13:52:57][INFO] train_vision.py:  668: Epoch: [15][200/329], lr: 1.18e-04, eta: 3:57:54	Time 3.007 (3.014)	Data 0.055 (0.067)	Mem 41.61GB	Prec@1 100.000 (84.378)	Loss 0.8977 (1.2674)
[02/27 13:53:27][INFO] train_vision.py:  668: Epoch: [15][210/329], lr: 1.17e-04, eta: 3:57:21	Time 2.997 (3.013)	Data 0.049 (0.066)	Mem 41.61GB	Prec@1 80.000 (84.313)	Loss 1.6488 (1.2685)
[02/27 13:53:57][INFO] train_vision.py:  668: Epoch: [15][220/329], lr: 1.17e-04, eta: 3:56:47	Time 3.003 (3.013)	Data 0.058 (0.066)	Mem 41.61GB	Prec@1 80.000 (84.118)	Loss 1.2566 (1.2729)
[02/27 13:54:27][INFO] train_vision.py:  668: Epoch: [15][230/329], lr: 1.17e-04, eta: 3:56:13	Time 3.012 (3.012)	Data 0.023 (0.065)	Mem 41.61GB	Prec@1 100.000 (84.242)	Loss 0.8256 (1.2703)
[02/27 13:54:57][INFO] train_vision.py:  668: Epoch: [15][240/329], lr: 1.16e-04, eta: 3:55:41	Time 3.042 (3.011)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 90.000 (84.357)	Loss 1.2147 (1.2708)
[02/27 13:55:27][INFO] train_vision.py:  668: Epoch: [15][250/329], lr: 1.16e-04, eta: 3:55:09	Time 2.993 (3.011)	Data 0.047 (0.064)	Mem 41.61GB	Prec@1 100.000 (84.661)	Loss 0.8787 (1.2645)
[02/27 13:55:57][INFO] train_vision.py:  668: Epoch: [15][260/329], lr: 1.15e-04, eta: 3:54:36	Time 2.997 (3.010)	Data 0.056 (0.063)	Mem 41.61GB	Prec@1 100.000 (84.559)	Loss 0.9269 (1.2677)
[02/27 13:56:26][INFO] train_vision.py:  668: Epoch: [15][270/329], lr: 1.15e-04, eta: 3:54:02	Time 2.988 (3.010)	Data 0.048 (0.063)	Mem 41.61GB	Prec@1 70.000 (84.539)	Loss 1.5693 (1.2707)
[02/27 13:56:56][INFO] train_vision.py:  668: Epoch: [15][280/329], lr: 1.15e-04, eta: 3:53:29	Time 3.015 (3.009)	Data 0.022 (0.062)	Mem 41.61GB	Prec@1 100.000 (84.448)	Loss 0.8470 (1.2705)
[02/27 13:57:26][INFO] train_vision.py:  668: Epoch: [15][290/329], lr: 1.14e-04, eta: 3:52:56	Time 3.016 (3.008)	Data 0.027 (0.062)	Mem 41.61GB	Prec@1 80.000 (84.502)	Loss 1.2664 (1.2677)
[02/27 13:57:56][INFO] train_vision.py:  668: Epoch: [15][300/329], lr: 1.14e-04, eta: 3:52:25	Time 3.003 (3.008)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 70.000 (84.585)	Loss 1.3548 (1.2648)
[02/27 13:58:26][INFO] train_vision.py:  668: Epoch: [15][310/329], lr: 1.14e-04, eta: 3:51:53	Time 2.987 (3.008)	Data 0.048 (0.061)	Mem 41.61GB	Prec@1 100.000 (84.437)	Loss 0.9012 (1.2680)
[02/27 13:58:56][INFO] train_vision.py:  668: Epoch: [15][320/329], lr: 1.13e-04, eta: 3:51:21	Time 2.993 (3.007)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 90.000 (84.548)	Loss 1.2262 (1.2682)
[02/27 13:59:27][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (32.121)	mPrec@5 (32.323)
[02/27 14:00:09][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 98.750 (96.364)	Prec@5 100.000 (100.000)	mPrec@1 (80.183)	mPrec@5 (86.869)
[02/27 14:00:52][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 98.750 (94.345)	Prec@5 100.000 (99.762)	mPrec@1 (88.656)	mPrec@5 (97.556)
[02/27 14:01:34][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 97.500 (95.040)	Prec@5 100.000 (99.839)	mPrec@1 (90.649)	mPrec@5 (98.742)
[02/27 14:02:16][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 91.250 (95.152)	Prec@5 100.000 (99.787)	mPrec@1 (91.837)	mPrec@5 (99.661)
[02/27 14:02:59][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 96.250 (94.314)	Prec@5 100.000 (99.779)	mPrec@1 (90.811)	mPrec@5 (99.693)
[02/27 14:03:41][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 96.250 (94.508)	Prec@5 100.000 (99.775)	mPrec@1 (91.089)	mPrec@5 (99.680)
[02/27 14:04:23][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 93.750 (94.577)	Prec@5 100.000 (99.806)	mPrec@1 (91.439)	mPrec@5 (99.720)
[02/27 14:05:05][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 95.000 (94.614)	Prec@5 100.000 (99.830)	mPrec@1 (91.495)	mPrec@5 (99.759)
[02/27 14:05:48][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 98.750 (94.409)	Prec@5 100.000 (99.835)	mPrec@1 (91.564)	mPrec@5 (99.739)
[02/27 14:06:31][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 85.000 (94.505)	Prec@5 98.750 (99.827)	mPrec@1 (91.574)	mPrec@5 (99.729)
[02/27 14:06:55][INFO] train_vision.py:  847: Overall Prec@1 94.261% Prec@5 99.824% mPrec@1 (91.714) mPrec@5 (99.730)
[02/27 14:06:55][INFO] train_vision.py:  464: Testing: 91.7143783569336/91.7143783569336
[02/27 14:06:55][INFO] train_vision.py:  465: Saving:
[02/27 14:07:14][INFO] train_vision.py:  668: Epoch: [16][0/329], lr: 1.13e-04, eta: 6:43:21	Time 5.253 (5.253)	Data 2.370 (2.370)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.5723 (1.5723)
[02/27 14:07:44][INFO] train_vision.py:  668: Epoch: [16][10/329], lr: 1.13e-04, eta: 4:05:17	Time 2.996 (3.202)	Data 0.071 (0.275)	Mem 41.61GB	Prec@1 80.000 (78.182)	Loss 1.4389 (1.4006)
[02/27 14:08:14][INFO] train_vision.py:  668: Epoch: [16][20/329], lr: 1.12e-04, eta: 3:57:59	Time 2.999 (3.113)	Data 0.080 (0.178)	Mem 41.61GB	Prec@1 100.000 (80.000)	Loss 0.8453 (1.3595)
[02/27 14:08:45][INFO] train_vision.py:  668: Epoch: [16][30/329], lr: 1.12e-04, eta: 3:55:03	Time 3.054 (3.081)	Data 0.075 (0.141)	Mem 41.61GB	Prec@1 80.000 (81.613)	Loss 1.3952 (1.3285)
[02/27 14:09:15][INFO] train_vision.py:  668: Epoch: [16][40/329], lr: 1.12e-04, eta: 3:53:23	Time 3.006 (3.066)	Data 0.067 (0.123)	Mem 41.61GB	Prec@1 80.000 (82.439)	Loss 1.3413 (1.3069)
[02/27 14:09:45][INFO] train_vision.py:  668: Epoch: [16][50/329], lr: 1.11e-04, eta: 3:52:13	Time 3.035 (3.058)	Data 0.056 (0.110)	Mem 41.61GB	Prec@1 90.000 (83.529)	Loss 1.0979 (1.2793)
[02/27 14:10:15][INFO] train_vision.py:  668: Epoch: [16][60/329], lr: 1.11e-04, eta: 3:51:10	Time 2.965 (3.050)	Data 0.050 (0.102)	Mem 41.61GB	Prec@1 90.000 (83.279)	Loss 1.2350 (1.2797)
[02/27 14:10:45][INFO] train_vision.py:  668: Epoch: [16][70/329], lr: 1.10e-04, eta: 3:50:17	Time 2.982 (3.046)	Data 0.052 (0.097)	Mem 41.61GB	Prec@1 80.000 (84.085)	Loss 1.6576 (1.2785)
[02/27 14:11:16][INFO] train_vision.py:  668: Epoch: [16][80/329], lr: 1.10e-04, eta: 3:49:33	Time 3.008 (3.043)	Data 0.061 (0.093)	Mem 41.61GB	Prec@1 90.000 (84.198)	Loss 1.0374 (1.2773)
[02/27 14:11:46][INFO] train_vision.py:  668: Epoch: [16][90/329], lr: 1.10e-04, eta: 3:48:51	Time 3.032 (3.040)	Data 0.071 (0.090)	Mem 41.61GB	Prec@1 100.000 (84.615)	Loss 0.9954 (1.2614)
[02/27 14:12:16][INFO] train_vision.py:  668: Epoch: [16][100/329], lr: 1.09e-04, eta: 3:48:09	Time 3.002 (3.037)	Data 0.050 (0.088)	Mem 41.61GB	Prec@1 80.000 (83.960)	Loss 1.3095 (1.2735)
[02/27 14:12:46][INFO] train_vision.py:  668: Epoch: [16][110/329], lr: 1.09e-04, eta: 3:47:28	Time 3.007 (3.035)	Data 0.077 (0.086)	Mem 41.61GB	Prec@1 70.000 (83.514)	Loss 1.9098 (1.2855)
[02/27 14:13:16][INFO] train_vision.py:  668: Epoch: [16][120/329], lr: 1.09e-04, eta: 3:46:49	Time 2.964 (3.033)	Data 0.047 (0.085)	Mem 41.61GB	Prec@1 80.000 (83.884)	Loss 1.4131 (1.2783)
[02/27 14:13:46][INFO] train_vision.py:  668: Epoch: [16][130/329], lr: 1.08e-04, eta: 3:46:12	Time 3.012 (3.032)	Data 0.058 (0.084)	Mem 41.61GB	Prec@1 80.000 (84.504)	Loss 1.4288 (1.2637)
[02/27 14:14:16][INFO] train_vision.py:  668: Epoch: [16][140/329], lr: 1.08e-04, eta: 3:45:36	Time 3.003 (3.030)	Data 0.061 (0.083)	Mem 41.61GB	Prec@1 80.000 (84.468)	Loss 1.2060 (1.2601)
[02/27 14:14:46][INFO] train_vision.py:  668: Epoch: [16][150/329], lr: 1.08e-04, eta: 3:45:00	Time 3.014 (3.029)	Data 0.079 (0.081)	Mem 41.61GB	Prec@1 90.000 (84.570)	Loss 1.1142 (1.2571)
[02/27 14:15:17][INFO] train_vision.py:  668: Epoch: [16][160/329], lr: 1.07e-04, eta: 3:44:24	Time 3.020 (3.028)	Data 0.078 (0.081)	Mem 41.61GB	Prec@1 100.000 (84.410)	Loss 0.9911 (1.2589)
[02/27 14:15:47][INFO] train_vision.py:  668: Epoch: [16][170/329], lr: 1.07e-04, eta: 3:43:50	Time 3.008 (3.027)	Data 0.051 (0.080)	Mem 41.61GB	Prec@1 90.000 (84.795)	Loss 1.1900 (1.2484)
[02/27 14:16:17][INFO] train_vision.py:  668: Epoch: [16][180/329], lr: 1.06e-04, eta: 3:43:14	Time 3.007 (3.026)	Data 0.056 (0.079)	Mem 41.61GB	Prec@1 90.000 (84.696)	Loss 1.0899 (1.2523)
[02/27 14:16:47][INFO] train_vision.py:  668: Epoch: [16][190/329], lr: 1.06e-04, eta: 3:42:40	Time 3.012 (3.025)	Data 0.078 (0.078)	Mem 41.61GB	Prec@1 90.000 (84.660)	Loss 1.0787 (1.2550)
[02/27 14:17:17][INFO] train_vision.py:  668: Epoch: [16][200/329], lr: 1.06e-04, eta: 3:42:09	Time 2.992 (3.025)	Data 0.056 (0.078)	Mem 41.61GB	Prec@1 100.000 (84.577)	Loss 0.8912 (1.2581)
[02/27 14:17:47][INFO] train_vision.py:  668: Epoch: [16][210/329], lr: 1.05e-04, eta: 3:41:36	Time 3.015 (3.024)	Data 0.038 (0.077)	Mem 41.61GB	Prec@1 70.000 (84.028)	Loss 1.9102 (1.2672)
[02/27 14:18:17][INFO] train_vision.py:  668: Epoch: [16][220/329], lr: 1.05e-04, eta: 3:41:03	Time 3.012 (3.023)	Data 0.080 (0.077)	Mem 41.61GB	Prec@1 90.000 (84.027)	Loss 1.0290 (1.2679)
[02/27 14:18:47][INFO] train_vision.py:  668: Epoch: [16][230/329], lr: 1.05e-04, eta: 3:40:31	Time 3.009 (3.023)	Data 0.052 (0.076)	Mem 41.61GB	Prec@1 60.000 (83.983)	Loss 1.7824 (1.2686)
[02/27 14:19:17][INFO] train_vision.py:  668: Epoch: [16][240/329], lr: 1.04e-04, eta: 3:39:58	Time 2.977 (3.022)	Data 0.060 (0.076)	Mem 41.61GB	Prec@1 90.000 (83.983)	Loss 1.0698 (1.2663)
[02/27 14:19:48][INFO] train_vision.py:  668: Epoch: [16][250/329], lr: 1.04e-04, eta: 3:39:25	Time 3.006 (3.022)	Data 0.061 (0.075)	Mem 41.61GB	Prec@1 70.000 (84.104)	Loss 1.5766 (1.2629)
[02/27 14:20:18][INFO] train_vision.py:  668: Epoch: [16][260/329], lr: 1.04e-04, eta: 3:38:53	Time 2.983 (3.021)	Data 0.046 (0.074)	Mem 41.61GB	Prec@1 90.000 (84.291)	Loss 1.1885 (1.2601)
[02/27 14:20:48][INFO] train_vision.py:  668: Epoch: [16][270/329], lr: 1.03e-04, eta: 3:38:19	Time 2.980 (3.020)	Data 0.048 (0.073)	Mem 41.61GB	Prec@1 80.000 (84.317)	Loss 1.2578 (1.2594)
[02/27 14:21:18][INFO] train_vision.py:  668: Epoch: [16][280/329], lr: 1.03e-04, eta: 3:37:46	Time 3.000 (3.020)	Data 0.057 (0.073)	Mem 41.61GB	Prec@1 90.000 (84.306)	Loss 1.2008 (1.2591)
[02/27 14:21:48][INFO] train_vision.py:  668: Epoch: [16][290/329], lr: 1.02e-04, eta: 3:37:13	Time 2.998 (3.019)	Data 0.057 (0.072)	Mem 41.61GB	Prec@1 70.000 (84.261)	Loss 1.4876 (1.2598)
[02/27 14:22:18][INFO] train_vision.py:  668: Epoch: [16][300/329], lr: 1.02e-04, eta: 3:36:42	Time 2.999 (3.019)	Data 0.057 (0.072)	Mem 41.61GB	Prec@1 100.000 (84.286)	Loss 0.9082 (1.2583)
[02/27 14:22:48][INFO] train_vision.py:  668: Epoch: [16][310/329], lr: 1.02e-04, eta: 3:36:10	Time 3.002 (3.018)	Data 0.056 (0.071)	Mem 41.61GB	Prec@1 90.000 (84.309)	Loss 0.9449 (1.2592)
[02/27 14:23:18][INFO] train_vision.py:  668: Epoch: [16][320/329], lr: 1.01e-04, eta: 3:35:38	Time 3.017 (3.018)	Data 0.066 (0.071)	Mem 41.61GB	Prec@1 80.000 (84.081)	Loss 1.1541 (1.2655)
[02/27 14:23:48][INFO] train_vision.py:  668: Epoch: [17][0/329], lr: 1.01e-04, eta: 6:44:43	Time 5.676 (5.676)	Data 2.344 (2.344)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.8633 (1.8633)
[02/27 14:24:18][INFO] train_vision.py:  668: Epoch: [17][10/329], lr: 1.01e-04, eta: 3:50:59	Time 3.020 (3.247)	Data 0.072 (0.248)	Mem 41.61GB	Prec@1 100.000 (87.273)	Loss 0.9276 (1.2736)
[02/27 14:24:48][INFO] train_vision.py:  668: Epoch: [17][20/329], lr: 1.00e-04, eta: 3:42:12	Time 2.997 (3.131)	Data 0.057 (0.155)	Mem 41.61GB	Prec@1 80.000 (84.762)	Loss 1.5547 (1.3096)
[02/27 14:25:18][INFO] train_vision.py:  668: Epoch: [17][30/329], lr: 9.99e-05, eta: 3:38:53	Time 3.035 (3.092)	Data 0.026 (0.120)	Mem 41.61GB	Prec@1 90.000 (84.194)	Loss 1.3175 (1.3160)
[02/27 14:25:48][INFO] train_vision.py:  668: Epoch: [17][40/329], lr: 9.96e-05, eta: 3:36:50	Time 3.007 (3.070)	Data 0.026 (0.102)	Mem 41.61GB	Prec@1 80.000 (84.146)	Loss 1.1797 (1.2792)
[02/27 14:26:18][INFO] train_vision.py:  668: Epoch: [17][50/329], lr: 9.92e-05, eta: 3:35:24	Time 2.991 (3.057)	Data 0.048 (0.090)	Mem 41.61GB	Prec@1 70.000 (83.333)	Loss 1.4368 (1.2870)
[02/27 14:26:48][INFO] train_vision.py:  668: Epoch: [17][60/329], lr: 9.89e-05, eta: 3:34:16	Time 2.986 (3.048)	Data 0.051 (0.083)	Mem 41.61GB	Prec@1 100.000 (84.590)	Loss 0.8488 (1.2615)
[02/27 14:27:18][INFO] train_vision.py:  668: Epoch: [17][70/329], lr: 9.85e-05, eta: 3:33:21	Time 3.008 (3.042)	Data 0.049 (0.078)	Mem 41.61GB	Prec@1 80.000 (84.930)	Loss 1.1493 (1.2466)
[02/27 14:27:48][INFO] train_vision.py:  668: Epoch: [17][80/329], lr: 9.81e-05, eta: 3:32:35	Time 2.994 (3.038)	Data 0.052 (0.075)	Mem 41.61GB	Prec@1 70.000 (84.815)	Loss 1.5249 (1.2546)
[02/27 14:28:18][INFO] train_vision.py:  668: Epoch: [17][90/329], lr: 9.78e-05, eta: 3:31:48	Time 2.999 (3.034)	Data 0.050 (0.072)	Mem 41.61GB	Prec@1 80.000 (85.385)	Loss 1.3355 (1.2444)
[02/27 14:28:48][INFO] train_vision.py:  668: Epoch: [17][100/329], lr: 9.74e-05, eta: 3:31:03	Time 2.993 (3.031)	Data 0.055 (0.069)	Mem 41.61GB	Prec@1 70.000 (84.851)	Loss 1.7200 (1.2554)
[02/27 14:29:18][INFO] train_vision.py:  668: Epoch: [17][110/329], lr: 9.70e-05, eta: 3:30:21	Time 2.991 (3.028)	Data 0.057 (0.067)	Mem 41.61GB	Prec@1 80.000 (84.505)	Loss 1.4440 (1.2547)
[02/27 14:29:48][INFO] train_vision.py:  668: Epoch: [17][120/329], lr: 9.67e-05, eta: 3:29:41	Time 3.010 (3.026)	Data 0.056 (0.066)	Mem 41.61GB	Prec@1 70.000 (84.628)	Loss 1.6106 (1.2495)
[02/27 14:30:18][INFO] train_vision.py:  668: Epoch: [17][130/329], lr: 9.63e-05, eta: 3:29:01	Time 2.999 (3.024)	Data 0.054 (0.065)	Mem 41.61GB	Prec@1 100.000 (85.115)	Loss 0.8617 (1.2339)
[02/27 14:30:48][INFO] train_vision.py:  668: Epoch: [17][140/329], lr: 9.59e-05, eta: 3:28:23	Time 2.985 (3.022)	Data 0.047 (0.065)	Mem 41.61GB	Prec@1 90.000 (85.248)	Loss 1.1902 (1.2316)
[02/27 14:31:18][INFO] train_vision.py:  668: Epoch: [17][150/329], lr: 9.56e-05, eta: 3:27:44	Time 2.989 (3.020)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 70.000 (85.298)	Loss 1.6894 (1.2342)
[02/27 14:31:48][INFO] train_vision.py:  668: Epoch: [17][160/329], lr: 9.52e-05, eta: 3:27:07	Time 3.024 (3.018)	Data 0.023 (0.063)	Mem 41.61GB	Prec@1 70.000 (85.342)	Loss 1.4094 (1.2336)
[02/27 14:32:18][INFO] train_vision.py:  668: Epoch: [17][170/329], lr: 9.49e-05, eta: 3:26:32	Time 2.991 (3.017)	Data 0.056 (0.063)	Mem 41.61GB	Prec@1 90.000 (85.614)	Loss 1.1290 (1.2288)
[02/27 14:32:48][INFO] train_vision.py:  668: Epoch: [17][180/329], lr: 9.45e-05, eta: 3:25:56	Time 2.982 (3.015)	Data 0.084 (0.062)	Mem 41.61GB	Prec@1 70.000 (85.746)	Loss 1.5944 (1.2277)
[02/27 14:33:18][INFO] train_vision.py:  668: Epoch: [17][190/329], lr: 9.41e-05, eta: 3:25:22	Time 3.007 (3.014)	Data 0.024 (0.062)	Mem 41.61GB	Prec@1 90.000 (85.340)	Loss 1.1724 (1.2410)
[02/27 14:33:48][INFO] train_vision.py:  668: Epoch: [17][200/329], lr: 9.38e-05, eta: 3:24:47	Time 2.990 (3.013)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 80.000 (84.677)	Loss 1.5493 (1.2555)
[02/27 14:34:18][INFO] train_vision.py:  668: Epoch: [17][210/329], lr: 9.34e-05, eta: 3:24:13	Time 2.996 (3.012)	Data 0.055 (0.061)	Mem 41.61GB	Prec@1 70.000 (84.645)	Loss 1.6185 (1.2589)
[02/27 14:34:48][INFO] train_vision.py:  668: Epoch: [17][220/329], lr: 9.30e-05, eta: 3:23:39	Time 2.997 (3.011)	Data 0.054 (0.060)	Mem 41.61GB	Prec@1 80.000 (84.751)	Loss 1.4176 (1.2580)
[02/27 14:35:18][INFO] train_vision.py:  668: Epoch: [17][230/329], lr: 9.27e-05, eta: 3:23:06	Time 2.997 (3.010)	Data 0.056 (0.060)	Mem 41.61GB	Prec@1 80.000 (84.416)	Loss 1.2815 (1.2631)
[02/27 14:35:48][INFO] train_vision.py:  668: Epoch: [17][240/329], lr: 9.23e-05, eta: 3:22:32	Time 2.953 (3.010)	Data 0.054 (0.059)	Mem 41.61GB	Prec@1 100.000 (84.564)	Loss 0.8707 (1.2564)
[02/27 14:36:18][INFO] train_vision.py:  668: Epoch: [17][250/329], lr: 9.20e-05, eta: 3:22:00	Time 2.980 (3.009)	Data 0.058 (0.059)	Mem 41.61GB	Prec@1 80.000 (84.781)	Loss 1.1322 (1.2498)
[02/27 14:36:47][INFO] train_vision.py:  668: Epoch: [17][260/329], lr: 9.16e-05, eta: 3:21:28	Time 2.994 (3.009)	Data 0.061 (0.059)	Mem 41.61GB	Prec@1 90.000 (84.674)	Loss 1.2368 (1.2489)
[02/27 14:37:17][INFO] train_vision.py:  668: Epoch: [17][270/329], lr: 9.12e-05, eta: 3:20:55	Time 2.996 (3.008)	Data 0.039 (0.058)	Mem 41.61GB	Prec@1 80.000 (84.723)	Loss 1.2108 (1.2438)
[02/27 14:37:47][INFO] train_vision.py:  668: Epoch: [17][280/329], lr: 9.09e-05, eta: 3:20:23	Time 2.992 (3.007)	Data 0.048 (0.058)	Mem 41.61GB	Prec@1 90.000 (84.911)	Loss 0.9692 (1.2389)
[02/27 14:38:17][INFO] train_vision.py:  668: Epoch: [17][290/329], lr: 9.05e-05, eta: 3:19:51	Time 3.021 (3.007)	Data 0.026 (0.058)	Mem 41.61GB	Prec@1 60.000 (84.708)	Loss 1.6661 (1.2439)
[02/27 14:38:47][INFO] train_vision.py:  668: Epoch: [17][300/329], lr: 9.02e-05, eta: 3:19:19	Time 2.993 (3.006)	Data 0.055 (0.057)	Mem 41.61GB	Prec@1 80.000 (84.718)	Loss 1.1286 (1.2412)
[02/27 14:39:17][INFO] train_vision.py:  668: Epoch: [17][310/329], lr: 8.98e-05, eta: 3:18:47	Time 2.988 (3.006)	Data 0.050 (0.057)	Mem 41.61GB	Prec@1 90.000 (84.823)	Loss 1.0267 (1.2379)
[02/27 14:39:47][INFO] train_vision.py:  668: Epoch: [17][320/329], lr: 8.94e-05, eta: 3:18:16	Time 2.996 (3.006)	Data 0.049 (0.057)	Mem 41.61GB	Prec@1 70.000 (84.798)	Loss 1.8485 (1.2405)
[02/27 14:40:18][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 96.250 (96.250)	Prec@5 98.750 (98.750)	mPrec@1 (30.774)	mPrec@5 (32.121)
[02/27 14:41:00][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 96.250 (96.705)	Prec@5 100.000 (99.886)	mPrec@1 (82.231)	mPrec@5 (86.848)
[02/27 14:41:43][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 96.250 (95.417)	Prec@5 100.000 (99.821)	mPrec@1 (90.901)	mPrec@5 (97.597)
[02/27 14:42:25][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (95.887)	Prec@5 100.000 (99.839)	mPrec@1 (93.454)	mPrec@5 (98.760)
[02/27 14:43:08][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 87.500 (95.701)	Prec@5 100.000 (99.817)	mPrec@1 (94.286)	mPrec@5 (99.767)
[02/27 14:43:50][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 97.500 (95.319)	Prec@5 100.000 (99.828)	mPrec@1 (93.481)	mPrec@5 (99.715)
[02/27 14:44:32][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (95.533)	Prec@5 100.000 (99.857)	mPrec@1 (93.577)	mPrec@5 (99.765)
[02/27 14:45:15][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 93.750 (95.563)	Prec@5 98.750 (99.859)	mPrec@1 (93.690)	mPrec@5 (99.764)
[02/27 14:45:57][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 92.500 (95.478)	Prec@5 100.000 (99.830)	mPrec@1 (93.192)	mPrec@5 (99.678)
[02/27 14:46:40][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (95.343)	Prec@5 100.000 (99.849)	mPrec@1 (93.056)	mPrec@5 (99.694)
[02/27 14:47:22][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (95.347)	Prec@5 100.000 (99.864)	mPrec@1 (93.095)	mPrec@5 (99.728)
[02/27 14:47:45][INFO] train_vision.py:  847: Overall Prec@1 95.293% Prec@5 99.871% mPrec@1 (93.225) mPrec@5 (99.731)
[02/27 14:47:45][INFO] train_vision.py:  464: Testing: 93.22480773925781/93.22480773925781
[02/27 14:47:45][INFO] train_vision.py:  465: Saving:
[02/27 14:48:05][INFO] train_vision.py:  668: Epoch: [18][0/329], lr: 8.91e-05, eta: 5:43:13	Time 5.215 (5.215)	Data 2.332 (2.332)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.1574 (1.1574)
[02/27 14:48:34][INFO] train_vision.py:  668: Epoch: [18][10/329], lr: 8.87e-05, eta: 3:29:15	Time 2.983 (3.187)	Data 0.062 (0.260)	Mem 41.61GB	Prec@1 80.000 (81.818)	Loss 1.1385 (1.2853)
[02/27 14:49:04][INFO] train_vision.py:  668: Epoch: [18][20/329], lr: 8.84e-05, eta: 3:22:45	Time 2.988 (3.096)	Data 0.052 (0.163)	Mem 41.61GB	Prec@1 80.000 (80.952)	Loss 1.2599 (1.2615)
[02/27 14:49:34][INFO] train_vision.py:  668: Epoch: [18][30/329], lr: 8.80e-05, eta: 3:20:06	Time 3.000 (3.064)	Data 0.065 (0.126)	Mem 41.61GB	Prec@1 70.000 (82.581)	Loss 1.4239 (1.2433)
[02/27 14:50:04][INFO] train_vision.py:  668: Epoch: [18][40/329], lr: 8.77e-05, eta: 3:18:35	Time 3.012 (3.048)	Data 0.047 (0.109)	Mem 41.61GB	Prec@1 90.000 (81.707)	Loss 1.1918 (1.2621)
[02/27 14:50:34][INFO] train_vision.py:  668: Epoch: [18][50/329], lr: 8.73e-05, eta: 3:17:27	Time 2.972 (3.039)	Data 0.060 (0.099)	Mem 41.61GB	Prec@1 70.000 (82.157)	Loss 1.2375 (1.2611)
[02/27 14:51:04][INFO] train_vision.py:  668: Epoch: [18][60/329], lr: 8.69e-05, eta: 3:16:38	Time 2.994 (3.034)	Data 0.059 (0.092)	Mem 41.61GB	Prec@1 90.000 (81.803)	Loss 1.1695 (1.2654)
[02/27 14:51:35][INFO] train_vision.py:  668: Epoch: [18][70/329], lr: 8.66e-05, eta: 3:15:52	Time 3.028 (3.030)	Data 0.021 (0.086)	Mem 41.61GB	Prec@1 90.000 (82.535)	Loss 1.1309 (1.2521)
[02/27 14:52:05][INFO] train_vision.py:  668: Epoch: [18][80/329], lr: 8.62e-05, eta: 3:15:07	Time 2.992 (3.026)	Data 0.056 (0.082)	Mem 41.61GB	Prec@1 90.000 (83.333)	Loss 1.0382 (1.2384)
[02/27 14:52:35][INFO] train_vision.py:  668: Epoch: [18][90/329], lr: 8.59e-05, eta: 3:14:30	Time 3.012 (3.024)	Data 0.063 (0.080)	Mem 41.61GB	Prec@1 80.000 (82.967)	Loss 1.4339 (1.2510)
[02/27 14:53:05][INFO] train_vision.py:  668: Epoch: [18][100/329], lr: 8.55e-05, eta: 3:13:54	Time 3.009 (3.023)	Data 0.055 (0.078)	Mem 41.61GB	Prec@1 80.000 (83.564)	Loss 1.2405 (1.2464)
[02/27 14:53:35][INFO] train_vision.py:  668: Epoch: [18][110/329], lr: 8.51e-05, eta: 3:13:17	Time 3.038 (3.021)	Data 0.034 (0.075)	Mem 41.61GB	Prec@1 90.000 (83.694)	Loss 1.0771 (1.2400)
[02/27 14:54:05][INFO] train_vision.py:  668: Epoch: [18][120/329], lr: 8.48e-05, eta: 3:12:39	Time 2.986 (3.019)	Data 0.053 (0.074)	Mem 41.61GB	Prec@1 70.000 (83.140)	Loss 1.4218 (1.2527)
[02/27 14:54:35][INFO] train_vision.py:  668: Epoch: [18][130/329], lr: 8.44e-05, eta: 3:12:02	Time 2.992 (3.017)	Data 0.056 (0.072)	Mem 41.61GB	Prec@1 90.000 (83.359)	Loss 0.9993 (1.2438)
[02/27 14:55:05][INFO] train_vision.py:  668: Epoch: [18][140/329], lr: 8.41e-05, eta: 3:11:28	Time 3.021 (3.016)	Data 0.054 (0.071)	Mem 41.61GB	Prec@1 80.000 (83.617)	Loss 1.3353 (1.2381)
[02/27 14:55:35][INFO] train_vision.py:  668: Epoch: [18][150/329], lr: 8.37e-05, eta: 3:10:54	Time 3.007 (3.015)	Data 0.054 (0.070)	Mem 41.61GB	Prec@1 100.000 (84.238)	Loss 0.8882 (1.2249)
[02/27 14:56:05][INFO] train_vision.py:  668: Epoch: [18][160/329], lr: 8.34e-05, eta: 3:10:19	Time 3.025 (3.014)	Data 0.056 (0.069)	Mem 41.61GB	Prec@1 70.000 (83.727)	Loss 1.3180 (1.2389)
[02/27 14:56:35][INFO] train_vision.py:  668: Epoch: [18][170/329], lr: 8.30e-05, eta: 3:09:46	Time 2.968 (3.013)	Data 0.065 (0.068)	Mem 41.61GB	Prec@1 100.000 (83.860)	Loss 0.9828 (1.2378)
[02/27 14:57:05][INFO] train_vision.py:  668: Epoch: [18][180/329], lr: 8.26e-05, eta: 3:09:16	Time 3.006 (3.013)	Data 0.049 (0.067)	Mem 41.61GB	Prec@1 90.000 (84.144)	Loss 1.4475 (1.2335)
[02/27 14:57:35][INFO] train_vision.py:  668: Epoch: [18][190/329], lr: 8.23e-05, eta: 3:08:42	Time 3.017 (3.012)	Data 0.064 (0.066)	Mem 41.61GB	Prec@1 70.000 (84.084)	Loss 1.7536 (1.2399)
[02/27 14:58:05][INFO] train_vision.py:  668: Epoch: [18][200/329], lr: 8.19e-05, eta: 3:08:09	Time 2.984 (3.011)	Data 0.032 (0.066)	Mem 41.61GB	Prec@1 80.000 (84.030)	Loss 1.2382 (1.2431)
[02/27 14:58:35][INFO] train_vision.py:  668: Epoch: [18][210/329], lr: 8.16e-05, eta: 3:07:37	Time 2.987 (3.011)	Data 0.052 (0.065)	Mem 41.61GB	Prec@1 80.000 (83.934)	Loss 1.2779 (1.2458)
[02/27 14:59:05][INFO] train_vision.py:  668: Epoch: [18][220/329], lr: 8.12e-05, eta: 3:07:05	Time 3.008 (3.010)	Data 0.036 (0.065)	Mem 41.61GB	Prec@1 90.000 (84.163)	Loss 1.4599 (1.2438)
[02/27 14:59:35][INFO] train_vision.py:  668: Epoch: [18][230/329], lr: 8.09e-05, eta: 3:06:33	Time 3.027 (3.010)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 90.000 (84.069)	Loss 1.1008 (1.2476)
[02/27 15:00:05][INFO] train_vision.py:  668: Epoch: [18][240/329], lr: 8.05e-05, eta: 3:06:01	Time 2.998 (3.009)	Data 0.059 (0.064)	Mem 41.61GB	Prec@1 80.000 (84.274)	Loss 1.3407 (1.2450)
[02/27 15:00:35][INFO] train_vision.py:  668: Epoch: [18][250/329], lr: 8.01e-05, eta: 3:05:29	Time 3.006 (3.009)	Data 0.050 (0.063)	Mem 41.61GB	Prec@1 80.000 (84.343)	Loss 1.3965 (1.2445)
[02/27 15:01:05][INFO] train_vision.py:  668: Epoch: [18][260/329], lr: 7.98e-05, eta: 3:04:57	Time 3.004 (3.008)	Data 0.053 (0.063)	Mem 41.61GB	Prec@1 60.000 (84.330)	Loss 1.7317 (1.2427)
[02/27 15:01:35][INFO] train_vision.py:  668: Epoch: [18][270/329], lr: 7.94e-05, eta: 3:04:25	Time 3.015 (3.008)	Data 0.053 (0.062)	Mem 41.61GB	Prec@1 80.000 (84.539)	Loss 1.3375 (1.2391)
[02/27 15:02:04][INFO] train_vision.py:  668: Epoch: [18][280/329], lr: 7.91e-05, eta: 3:03:54	Time 3.020 (3.007)	Data 0.024 (0.062)	Mem 41.61GB	Prec@1 100.000 (84.484)	Loss 0.9466 (1.2439)
[02/27 15:02:34][INFO] train_vision.py:  668: Epoch: [18][290/329], lr: 7.87e-05, eta: 3:03:22	Time 2.988 (3.007)	Data 0.053 (0.062)	Mem 41.61GB	Prec@1 80.000 (84.502)	Loss 1.6075 (1.2435)
[02/27 15:03:04][INFO] train_vision.py:  668: Epoch: [18][300/329], lr: 7.84e-05, eta: 3:02:52	Time 2.996 (3.007)	Data 0.053 (0.061)	Mem 41.61GB	Prec@1 90.000 (84.518)	Loss 0.9463 (1.2408)
[02/27 15:03:34][INFO] train_vision.py:  668: Epoch: [18][310/329], lr: 7.80e-05, eta: 3:02:21	Time 2.996 (3.007)	Data 0.057 (0.061)	Mem 41.61GB	Prec@1 80.000 (84.630)	Loss 1.3493 (1.2377)
[02/27 15:04:04][INFO] train_vision.py:  668: Epoch: [18][320/329], lr: 7.77e-05, eta: 3:01:49	Time 2.992 (3.006)	Data 0.057 (0.060)	Mem 41.61GB	Prec@1 70.000 (84.735)	Loss 1.6217 (1.2364)
[02/27 15:04:34][INFO] train_vision.py:  668: Epoch: [19][0/329], lr: 7.73e-05, eta: 5:24:49	Time 5.384 (5.384)	Data 2.442 (2.442)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.0994 (1.0994)
[02/27 15:05:04][INFO] train_vision.py:  668: Epoch: [19][10/329], lr: 7.70e-05, eta: 3:13:27	Time 2.990 (3.215)	Data 0.038 (0.270)	Mem 41.61GB	Prec@1 70.000 (75.455)	Loss 1.9020 (1.4133)
[02/27 15:05:34][INFO] train_vision.py:  668: Epoch: [19][20/329], lr: 7.66e-05, eta: 3:06:58	Time 3.013 (3.116)	Data 0.031 (0.165)	Mem 41.61GB	Prec@1 100.000 (82.381)	Loss 0.8353 (1.2895)
[02/27 15:06:04][INFO] train_vision.py:  668: Epoch: [19][30/329], lr: 7.63e-05, eta: 3:04:15	Time 3.002 (3.080)	Data 0.056 (0.128)	Mem 41.61GB	Prec@1 60.000 (83.871)	Loss 1.9477 (1.2512)
[02/27 15:06:34][INFO] train_vision.py:  668: Epoch: [19][40/329], lr: 7.59e-05, eta: 3:02:32	Time 3.010 (3.059)	Data 0.067 (0.111)	Mem 41.61GB	Prec@1 80.000 (85.122)	Loss 1.1081 (1.2239)
[02/27 15:07:04][INFO] train_vision.py:  668: Epoch: [19][50/329], lr: 7.56e-05, eta: 3:01:19	Time 2.993 (3.047)	Data 0.056 (0.101)	Mem 41.61GB	Prec@1 80.000 (83.922)	Loss 1.2379 (1.2632)
[02/27 15:07:34][INFO] train_vision.py:  668: Epoch: [19][60/329], lr: 7.52e-05, eta: 3:00:18	Time 2.999 (3.039)	Data 0.054 (0.093)	Mem 41.61GB	Prec@1 80.000 (84.426)	Loss 1.3331 (1.2454)
[02/27 15:08:04][INFO] train_vision.py:  668: Epoch: [19][70/329], lr: 7.49e-05, eta: 2:59:24	Time 2.979 (3.032)	Data 0.054 (0.086)	Mem 41.61GB	Prec@1 80.000 (84.225)	Loss 1.2356 (1.2573)
[02/27 15:08:34][INFO] train_vision.py:  668: Epoch: [19][80/329], lr: 7.45e-05, eta: 2:58:38	Time 3.012 (3.028)	Data 0.027 (0.082)	Mem 41.61GB	Prec@1 50.000 (83.827)	Loss 2.4501 (1.2653)
[02/27 15:09:04][INFO] train_vision.py:  668: Epoch: [19][90/329], lr: 7.42e-05, eta: 2:57:54	Time 2.997 (3.024)	Data 0.047 (0.079)	Mem 41.61GB	Prec@1 90.000 (83.956)	Loss 1.1319 (1.2601)
[02/27 15:09:34][INFO] train_vision.py:  668: Epoch: [19][100/329], lr: 7.38e-05, eta: 2:57:13	Time 2.990 (3.021)	Data 0.056 (0.077)	Mem 41.61GB	Prec@1 100.000 (84.851)	Loss 0.8376 (1.2371)
[02/27 15:10:04][INFO] train_vision.py:  668: Epoch: [19][110/329], lr: 7.35e-05, eta: 2:56:34	Time 3.003 (3.018)	Data 0.027 (0.075)	Mem 41.61GB	Prec@1 100.000 (84.775)	Loss 1.2140 (1.2397)
[02/27 15:10:34][INFO] train_vision.py:  668: Epoch: [19][120/329], lr: 7.31e-05, eta: 2:55:56	Time 2.992 (3.016)	Data 0.056 (0.073)	Mem 41.61GB	Prec@1 80.000 (85.207)	Loss 1.2455 (1.2328)
[02/27 15:11:04][INFO] train_vision.py:  668: Epoch: [19][130/329], lr: 7.28e-05, eta: 2:55:19	Time 2.986 (3.014)	Data 0.056 (0.072)	Mem 41.61GB	Prec@1 80.000 (85.267)	Loss 1.1020 (1.2274)
[02/27 15:11:34][INFO] train_vision.py:  668: Epoch: [19][140/329], lr: 7.24e-05, eta: 2:54:44	Time 2.994 (3.013)	Data 0.056 (0.070)	Mem 41.61GB	Prec@1 100.000 (85.106)	Loss 0.8218 (1.2301)
[02/27 15:12:03][INFO] train_vision.py:  668: Epoch: [19][150/329], lr: 7.21e-05, eta: 2:54:10	Time 2.999 (3.012)	Data 0.058 (0.069)	Mem 41.61GB	Prec@1 90.000 (85.430)	Loss 1.0335 (1.2229)
[02/27 15:12:33][INFO] train_vision.py:  668: Epoch: [19][160/329], lr: 7.17e-05, eta: 2:53:37	Time 3.024 (3.011)	Data 0.023 (0.068)	Mem 41.61GB	Prec@1 100.000 (85.280)	Loss 0.8884 (1.2216)
[02/27 15:13:03][INFO] train_vision.py:  668: Epoch: [19][170/329], lr: 7.14e-05, eta: 2:53:04	Time 2.999 (3.010)	Data 0.057 (0.067)	Mem 41.61GB	Prec@1 80.000 (85.088)	Loss 1.4361 (1.2247)
[02/27 15:13:33][INFO] train_vision.py:  668: Epoch: [19][180/329], lr: 7.10e-05, eta: 2:52:30	Time 2.997 (3.009)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 80.000 (84.917)	Loss 1.3172 (1.2284)
[02/27 15:14:03][INFO] train_vision.py:  668: Epoch: [19][190/329], lr: 7.07e-05, eta: 2:51:58	Time 2.997 (3.008)	Data 0.057 (0.066)	Mem 41.61GB	Prec@1 80.000 (84.555)	Loss 1.2207 (1.2329)
[02/27 15:14:33][INFO] train_vision.py:  668: Epoch: [19][200/329], lr: 7.04e-05, eta: 2:51:26	Time 2.996 (3.008)	Data 0.051 (0.065)	Mem 41.61GB	Prec@1 90.000 (84.577)	Loss 1.0656 (1.2334)
[02/27 15:15:03][INFO] train_vision.py:  668: Epoch: [19][210/329], lr: 7.00e-05, eta: 2:50:54	Time 2.992 (3.007)	Data 0.058 (0.065)	Mem 41.61GB	Prec@1 90.000 (84.692)	Loss 1.0124 (1.2304)
[02/27 15:15:33][INFO] train_vision.py:  668: Epoch: [19][220/329], lr: 6.97e-05, eta: 2:50:22	Time 3.000 (3.007)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 60.000 (84.570)	Loss 1.7152 (1.2317)
[02/27 15:16:03][INFO] train_vision.py:  668: Epoch: [19][230/329], lr: 6.93e-05, eta: 2:49:51	Time 2.985 (3.006)	Data 0.057 (0.064)	Mem 41.61GB	Prec@1 100.000 (84.675)	Loss 0.8643 (1.2264)
[02/27 15:16:33][INFO] train_vision.py:  668: Epoch: [19][240/329], lr: 6.90e-05, eta: 2:49:19	Time 2.994 (3.006)	Data 0.050 (0.063)	Mem 41.61GB	Prec@1 100.000 (84.896)	Loss 1.1282 (1.2213)
[02/27 15:17:03][INFO] train_vision.py:  668: Epoch: [19][250/329], lr: 6.86e-05, eta: 2:48:48	Time 2.997 (3.005)	Data 0.055 (0.063)	Mem 41.61GB	Prec@1 80.000 (84.661)	Loss 1.3085 (1.2231)
[02/27 15:17:33][INFO] train_vision.py:  668: Epoch: [19][260/329], lr: 6.83e-05, eta: 2:48:16	Time 2.967 (3.005)	Data 0.053 (0.063)	Mem 41.61GB	Prec@1 90.000 (84.904)	Loss 1.3257 (1.2191)
[02/27 15:18:03][INFO] train_vision.py:  668: Epoch: [19][270/329], lr: 6.79e-05, eta: 2:47:45	Time 2.999 (3.005)	Data 0.060 (0.062)	Mem 41.61GB	Prec@1 70.000 (84.649)	Loss 1.4821 (1.2249)
[02/27 15:18:33][INFO] train_vision.py:  668: Epoch: [19][280/329], lr: 6.76e-05, eta: 2:47:14	Time 2.992 (3.004)	Data 0.060 (0.062)	Mem 41.61GB	Prec@1 70.000 (84.520)	Loss 1.7379 (1.2279)
[02/27 15:19:03][INFO] train_vision.py:  668: Epoch: [19][290/329], lr: 6.73e-05, eta: 2:46:43	Time 3.000 (3.004)	Data 0.058 (0.062)	Mem 41.61GB	Prec@1 80.000 (84.536)	Loss 1.3341 (1.2293)
[02/27 15:19:33][INFO] train_vision.py:  668: Epoch: [19][300/329], lr: 6.69e-05, eta: 2:46:11	Time 2.969 (3.003)	Data 0.038 (0.061)	Mem 41.61GB	Prec@1 100.000 (84.718)	Loss 0.9100 (1.2242)
[02/27 15:20:03][INFO] train_vision.py:  668: Epoch: [19][310/329], lr: 6.66e-05, eta: 2:45:41	Time 3.013 (3.003)	Data 0.066 (0.061)	Mem 41.61GB	Prec@1 90.000 (84.695)	Loss 1.0435 (1.2243)
[02/27 15:20:33][INFO] train_vision.py:  668: Epoch: [19][320/329], lr: 6.62e-05, eta: 2:45:10	Time 2.988 (3.003)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 100.000 (84.891)	Loss 1.0493 (1.2225)
[02/27 15:21:04][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 95.000 (95.000)	Prec@5 100.000 (100.000)	mPrec@1 (30.522)	mPrec@5 (32.323)
[02/27 15:21:46][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 96.250 (95.682)	Prec@5 100.000 (100.000)	mPrec@1 (81.719)	mPrec@5 (86.869)
[02/27 15:22:28][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 97.500 (95.298)	Prec@5 100.000 (99.881)	mPrec@1 (90.429)	mPrec@5 (97.643)
[02/27 15:23:10][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (95.685)	Prec@5 100.000 (99.919)	mPrec@1 (92.756)	mPrec@5 (98.797)
[02/27 15:23:53][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 92.500 (95.762)	Prec@5 100.000 (99.909)	mPrec@1 (93.832)	mPrec@5 (99.820)
[02/27 15:24:35][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 98.750 (95.270)	Prec@5 100.000 (99.853)	mPrec@1 (92.990)	mPrec@5 (99.730)
[02/27 15:25:17][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (95.492)	Prec@5 100.000 (99.857)	mPrec@1 (93.286)	mPrec@5 (99.783)
[02/27 15:25:59][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 91.250 (95.493)	Prec@5 100.000 (99.877)	mPrec@1 (93.094)	mPrec@5 (99.816)
[02/27 15:26:42][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 95.000 (95.509)	Prec@5 100.000 (99.877)	mPrec@1 (92.846)	mPrec@5 (99.808)
[02/27 15:27:24][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (95.275)	Prec@5 100.000 (99.890)	mPrec@1 (92.756)	mPrec@5 (99.827)
[02/27 15:28:06][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (95.371)	Prec@5 100.000 (99.901)	mPrec@1 (92.991)	mPrec@5 (99.840)
[02/27 15:28:30][INFO] train_vision.py:  847: Overall Prec@1 95.258% Prec@5 99.906% mPrec@1 (93.057) mPrec@5 (99.845)
[02/27 15:28:30][INFO] train_vision.py:  464: Testing: 93.05740356445312/93.22480773925781
[02/27 15:28:30][INFO] train_vision.py:  465: Saving:
[02/27 15:28:42][INFO] train_vision.py:  668: Epoch: [20][0/329], lr: 6.59e-05, eta: 4:45:26	Time 5.204 (5.204)	Data 2.308 (2.308)	Mem 41.61GB	Prec@1 60.000 (60.000)	Loss 1.6741 (1.6741)
[02/27 15:29:12][INFO] train_vision.py:  668: Epoch: [20][10/329], lr: 6.56e-05, eta: 2:53:58	Time 2.964 (3.182)	Data 0.034 (0.255)	Mem 41.61GB	Prec@1 60.000 (78.182)	Loss 1.7240 (1.3744)
[02/27 15:29:42][INFO] train_vision.py:  668: Epoch: [20][20/329], lr: 6.52e-05, eta: 2:48:36	Time 2.992 (3.093)	Data 0.049 (0.158)	Mem 41.61GB	Prec@1 90.000 (82.381)	Loss 1.1040 (1.2873)
[02/27 15:30:12][INFO] train_vision.py:  668: Epoch: [20][30/329], lr: 6.49e-05, eta: 2:46:25	Time 2.995 (3.062)	Data 0.055 (0.123)	Mem 41.61GB	Prec@1 90.000 (84.839)	Loss 0.9833 (1.2286)
[02/27 15:30:42][INFO] train_vision.py:  668: Epoch: [20][40/329], lr: 6.46e-05, eta: 2:45:02	Time 3.000 (3.046)	Data 0.047 (0.105)	Mem 41.61GB	Prec@1 90.000 (82.439)	Loss 1.1719 (1.3018)
[02/27 15:31:12][INFO] train_vision.py:  668: Epoch: [20][50/329], lr: 6.42e-05, eta: 2:44:00	Time 2.988 (3.036)	Data 0.053 (0.094)	Mem 41.61GB	Prec@1 70.000 (82.549)	Loss 1.3542 (1.2790)
[02/27 15:31:42][INFO] train_vision.py:  668: Epoch: [20][60/329], lr: 6.39e-05, eta: 2:43:10	Time 2.984 (3.030)	Data 0.039 (0.087)	Mem 41.61GB	Prec@1 80.000 (83.115)	Loss 1.0629 (1.2576)
[02/27 15:32:12][INFO] train_vision.py:  668: Epoch: [20][70/329], lr: 6.36e-05, eta: 2:42:22	Time 2.981 (3.025)	Data 0.049 (0.081)	Mem 41.61GB	Prec@1 90.000 (83.662)	Loss 1.0823 (1.2493)
[02/27 15:32:42][INFO] train_vision.py:  668: Epoch: [20][80/329], lr: 6.32e-05, eta: 2:41:40	Time 2.988 (3.021)	Data 0.049 (0.077)	Mem 41.61GB	Prec@1 100.000 (84.074)	Loss 0.8631 (1.2434)
[02/27 15:33:12][INFO] train_vision.py:  668: Epoch: [20][90/329], lr: 6.29e-05, eta: 2:41:01	Time 2.963 (3.018)	Data 0.054 (0.074)	Mem 41.61GB	Prec@1 80.000 (84.615)	Loss 1.2854 (1.2314)
[02/27 15:33:42][INFO] train_vision.py:  668: Epoch: [20][100/329], lr: 6.26e-05, eta: 2:40:27	Time 2.994 (3.017)	Data 0.053 (0.071)	Mem 41.61GB	Prec@1 100.000 (85.050)	Loss 0.9402 (1.2215)
[02/27 15:34:12][INFO] train_vision.py:  668: Epoch: [20][110/329], lr: 6.22e-05, eta: 2:39:50	Time 2.988 (3.015)	Data 0.056 (0.069)	Mem 41.61GB	Prec@1 100.000 (85.135)	Loss 1.0358 (1.2197)
[02/27 15:34:42][INFO] train_vision.py:  668: Epoch: [20][120/329], lr: 6.19e-05, eta: 2:39:15	Time 2.991 (3.013)	Data 0.048 (0.068)	Mem 41.61GB	Prec@1 100.000 (85.289)	Loss 1.0053 (1.2158)
[02/27 15:35:11][INFO] train_vision.py:  668: Epoch: [20][130/329], lr: 6.15e-05, eta: 2:38:38	Time 2.983 (3.011)	Data 0.041 (0.066)	Mem 41.61GB	Prec@1 70.000 (85.038)	Loss 1.3298 (1.2220)
[02/27 15:35:41][INFO] train_vision.py:  668: Epoch: [20][140/329], lr: 6.12e-05, eta: 2:38:03	Time 2.985 (3.010)	Data 0.048 (0.065)	Mem 41.61GB	Prec@1 80.000 (85.106)	Loss 1.3452 (1.2218)
[02/27 15:36:11][INFO] train_vision.py:  668: Epoch: [20][150/329], lr: 6.09e-05, eta: 2:37:29	Time 2.991 (3.008)	Data 0.050 (0.064)	Mem 41.61GB	Prec@1 90.000 (85.430)	Loss 1.1277 (1.2120)
[02/27 15:36:41][INFO] train_vision.py:  668: Epoch: [20][160/329], lr: 6.06e-05, eta: 2:36:54	Time 2.990 (3.007)	Data 0.049 (0.063)	Mem 41.61GB	Prec@1 90.000 (85.093)	Loss 1.2002 (1.2187)
[02/27 15:37:11][INFO] train_vision.py:  668: Epoch: [20][170/329], lr: 6.02e-05, eta: 2:36:21	Time 2.988 (3.006)	Data 0.052 (0.062)	Mem 41.61GB	Prec@1 90.000 (85.322)	Loss 1.0558 (1.2134)
[02/27 15:37:41][INFO] train_vision.py:  668: Epoch: [20][180/329], lr: 5.99e-05, eta: 2:35:48	Time 2.992 (3.005)	Data 0.050 (0.061)	Mem 41.61GB	Prec@1 80.000 (85.414)	Loss 1.1746 (1.2093)
[02/27 15:38:11][INFO] train_vision.py:  668: Epoch: [20][190/329], lr: 5.96e-05, eta: 2:35:17	Time 2.999 (3.005)	Data 0.054 (0.060)	Mem 41.61GB	Prec@1 90.000 (85.288)	Loss 1.0044 (1.2099)
[02/27 15:38:41][INFO] train_vision.py:  668: Epoch: [20][200/329], lr: 5.92e-05, eta: 2:34:44	Time 2.993 (3.004)	Data 0.053 (0.060)	Mem 41.61GB	Prec@1 80.000 (85.323)	Loss 1.0980 (1.2058)
[02/27 15:39:11][INFO] train_vision.py:  668: Epoch: [20][210/329], lr: 5.89e-05, eta: 2:34:13	Time 2.993 (3.003)	Data 0.055 (0.059)	Mem 41.61GB	Prec@1 80.000 (85.450)	Loss 1.5861 (1.2023)
[02/27 15:39:41][INFO] train_vision.py:  668: Epoch: [20][220/329], lr: 5.86e-05, eta: 2:33:41	Time 2.995 (3.003)	Data 0.050 (0.058)	Mem 41.61GB	Prec@1 70.000 (85.339)	Loss 1.3922 (1.2058)
[02/27 15:40:10][INFO] train_vision.py:  668: Epoch: [20][230/329], lr: 5.82e-05, eta: 2:33:09	Time 2.998 (3.002)	Data 0.056 (0.058)	Mem 41.61GB	Prec@1 90.000 (85.758)	Loss 1.1512 (1.1973)
[02/27 15:40:40][INFO] train_vision.py:  668: Epoch: [20][240/329], lr: 5.79e-05, eta: 2:32:37	Time 2.958 (3.001)	Data 0.042 (0.058)	Mem 41.61GB	Prec@1 80.000 (85.975)	Loss 1.5299 (1.1932)
[02/27 15:41:10][INFO] train_vision.py:  668: Epoch: [20][250/329], lr: 5.76e-05, eta: 2:32:06	Time 3.004 (3.001)	Data 0.056 (0.057)	Mem 41.61GB	Prec@1 70.000 (86.096)	Loss 1.4300 (1.1886)
[02/27 15:41:40][INFO] train_vision.py:  668: Epoch: [20][260/329], lr: 5.73e-05, eta: 2:31:35	Time 2.990 (3.001)	Data 0.048 (0.057)	Mem 41.61GB	Prec@1 100.000 (85.977)	Loss 0.8254 (1.1925)
[02/27 15:42:10][INFO] train_vision.py:  668: Epoch: [20][270/329], lr: 5.69e-05, eta: 2:31:04	Time 2.983 (3.001)	Data 0.041 (0.057)	Mem 41.61GB	Prec@1 90.000 (85.867)	Loss 1.2219 (1.1940)
[02/27 15:42:40][INFO] train_vision.py:  668: Epoch: [20][280/329], lr: 5.66e-05, eta: 2:30:33	Time 2.996 (3.000)	Data 0.049 (0.056)	Mem 41.61GB	Prec@1 90.000 (86.014)	Loss 1.2319 (1.1924)
[02/27 15:43:10][INFO] train_vision.py:  668: Epoch: [20][290/329], lr: 5.63e-05, eta: 2:30:02	Time 3.003 (3.000)	Data 0.057 (0.056)	Mem 41.61GB	Prec@1 80.000 (86.082)	Loss 1.6002 (1.1913)
[02/27 15:43:40][INFO] train_vision.py:  668: Epoch: [20][300/329], lr: 5.60e-05, eta: 2:29:31	Time 2.990 (3.000)	Data 0.048 (0.056)	Mem 41.61GB	Prec@1 90.000 (86.246)	Loss 1.1968 (1.1874)
[02/27 15:44:10][INFO] train_vision.py:  668: Epoch: [20][310/329], lr: 5.56e-05, eta: 2:29:01	Time 2.984 (2.999)	Data 0.044 (0.055)	Mem 41.61GB	Prec@1 60.000 (86.206)	Loss 1.5432 (1.1859)
[02/27 15:44:40][INFO] train_vision.py:  668: Epoch: [20][320/329], lr: 5.53e-05, eta: 2:28:30	Time 2.996 (2.999)	Data 0.049 (0.055)	Mem 41.61GB	Prec@1 90.000 (86.137)	Loss 0.9030 (1.1856)
[02/27 15:45:10][INFO] train_vision.py:  668: Epoch: [21][0/329], lr: 5.50e-05, eta: 4:47:06	Time 5.816 (5.816)	Data 2.572 (2.572)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.3932 (1.3932)
[02/27 15:45:40][INFO] train_vision.py:  668: Epoch: [21][10/329], lr: 5.47e-05, eta: 2:39:57	Time 2.987 (3.251)	Data 0.053 (0.280)	Mem 41.61GB	Prec@1 90.000 (84.545)	Loss 1.0011 (1.2496)
[02/27 15:46:10][INFO] train_vision.py:  668: Epoch: [21][20/329], lr: 5.44e-05, eta: 2:33:41	Time 2.987 (3.134)	Data 0.053 (0.175)	Mem 41.61GB	Prec@1 70.000 (85.238)	Loss 1.3842 (1.2270)
[02/27 15:46:40][INFO] train_vision.py:  668: Epoch: [21][30/329], lr: 5.41e-05, eta: 2:31:07	Time 2.989 (3.092)	Data 0.057 (0.138)	Mem 41.61GB	Prec@1 100.000 (87.742)	Loss 0.9741 (1.1744)
[02/27 15:47:10][INFO] train_vision.py:  668: Epoch: [21][40/329], lr: 5.37e-05, eta: 2:29:40	Time 3.036 (3.073)	Data 0.054 (0.119)	Mem 41.61GB	Prec@1 90.000 (87.317)	Loss 1.1168 (1.1637)
[02/27 15:47:40][INFO] train_vision.py:  668: Epoch: [21][50/329], lr: 5.34e-05, eta: 2:28:31	Time 2.999 (3.060)	Data 0.056 (0.105)	Mem 41.61GB	Prec@1 80.000 (86.863)	Loss 1.3969 (1.1637)
[02/27 15:48:10][INFO] train_vision.py:  668: Epoch: [21][60/329], lr: 5.31e-05, eta: 2:27:35	Time 3.024 (3.051)	Data 0.022 (0.097)	Mem 41.61GB	Prec@1 70.000 (86.885)	Loss 1.7699 (1.1735)
[02/27 15:48:40][INFO] train_vision.py:  668: Epoch: [21][70/329], lr: 5.28e-05, eta: 2:26:44	Time 2.995 (3.044)	Data 0.038 (0.091)	Mem 41.61GB	Prec@1 100.000 (86.901)	Loss 0.8148 (1.1741)
[02/27 15:49:10][INFO] train_vision.py:  668: Epoch: [21][80/329], lr: 5.25e-05, eta: 2:25:58	Time 3.001 (3.039)	Data 0.051 (0.086)	Mem 41.61GB	Prec@1 100.000 (87.160)	Loss 0.8139 (1.1719)
[02/27 15:49:40][INFO] train_vision.py:  668: Epoch: [21][90/329], lr: 5.22e-05, eta: 2:25:16	Time 3.015 (3.035)	Data 0.045 (0.082)	Mem 41.61GB	Prec@1 80.000 (86.703)	Loss 1.4350 (1.1821)
[02/27 15:50:10][INFO] train_vision.py:  668: Epoch: [21][100/329], lr: 5.18e-05, eta: 2:24:33	Time 2.995 (3.031)	Data 0.054 (0.078)	Mem 41.61GB	Prec@1 90.000 (85.743)	Loss 1.0249 (1.1981)
[02/27 15:50:40][INFO] train_vision.py:  668: Epoch: [21][110/329], lr: 5.15e-05, eta: 2:23:53	Time 2.994 (3.027)	Data 0.045 (0.075)	Mem 41.61GB	Prec@1 100.000 (85.676)	Loss 0.8624 (1.1933)
[02/27 15:51:10][INFO] train_vision.py:  668: Epoch: [21][120/329], lr: 5.12e-05, eta: 2:23:16	Time 3.000 (3.025)	Data 0.055 (0.073)	Mem 41.61GB	Prec@1 90.000 (85.207)	Loss 1.1988 (1.2096)
[02/27 15:51:40][INFO] train_vision.py:  668: Epoch: [21][130/329], lr: 5.09e-05, eta: 2:22:39	Time 2.973 (3.022)	Data 0.045 (0.071)	Mem 41.61GB	Prec@1 100.000 (85.115)	Loss 0.8874 (1.2201)
[02/27 15:52:10][INFO] train_vision.py:  668: Epoch: [21][140/329], lr: 5.06e-05, eta: 2:22:03	Time 3.004 (3.020)	Data 0.056 (0.070)	Mem 41.61GB	Prec@1 100.000 (85.461)	Loss 0.8201 (1.2120)
[02/27 15:52:40][INFO] train_vision.py:  668: Epoch: [21][150/329], lr: 5.03e-05, eta: 2:21:27	Time 2.976 (3.018)	Data 0.044 (0.068)	Mem 41.61GB	Prec@1 80.000 (85.232)	Loss 1.1096 (1.2164)
[02/27 15:53:10][INFO] train_vision.py:  668: Epoch: [21][160/329], lr: 5.00e-05, eta: 2:20:53	Time 2.982 (3.017)	Data 0.053 (0.067)	Mem 41.61GB	Prec@1 90.000 (85.093)	Loss 0.9424 (1.2178)
[02/27 15:53:40][INFO] train_vision.py:  668: Epoch: [21][170/329], lr: 4.96e-05, eta: 2:20:20	Time 2.994 (3.016)	Data 0.048 (0.066)	Mem 41.61GB	Prec@1 90.000 (84.971)	Loss 1.2084 (1.2176)
[02/27 15:54:10][INFO] train_vision.py:  668: Epoch: [21][180/329], lr: 4.93e-05, eta: 2:19:48	Time 2.997 (3.015)	Data 0.038 (0.065)	Mem 41.61GB	Prec@1 90.000 (85.193)	Loss 1.0767 (1.2128)
[02/27 15:54:40][INFO] train_vision.py:  668: Epoch: [21][190/329], lr: 4.90e-05, eta: 2:19:15	Time 2.999 (3.014)	Data 0.032 (0.064)	Mem 41.61GB	Prec@1 90.000 (85.288)	Loss 1.3007 (1.2115)
[02/27 15:55:10][INFO] train_vision.py:  668: Epoch: [21][200/329], lr: 4.87e-05, eta: 2:18:42	Time 3.000 (3.013)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 100.000 (85.373)	Loss 0.8929 (1.2083)
[02/27 15:55:40][INFO] train_vision.py:  668: Epoch: [21][210/329], lr: 4.84e-05, eta: 2:18:10	Time 3.001 (3.013)	Data 0.052 (0.063)	Mem 41.61GB	Prec@1 80.000 (85.450)	Loss 1.4877 (1.2095)
[02/27 15:56:10][INFO] train_vision.py:  668: Epoch: [21][220/329], lr: 4.81e-05, eta: 2:17:38	Time 2.979 (3.012)	Data 0.043 (0.062)	Mem 41.61GB	Prec@1 40.000 (85.385)	Loss 1.9790 (1.2068)
[02/27 15:56:39][INFO] train_vision.py:  668: Epoch: [21][230/329], lr: 4.78e-05, eta: 2:17:06	Time 2.995 (3.011)	Data 0.052 (0.062)	Mem 41.61GB	Prec@1 80.000 (85.498)	Loss 1.2466 (1.2048)
[02/27 15:57:09][INFO] train_vision.py:  668: Epoch: [21][240/329], lr: 4.75e-05, eta: 2:16:34	Time 2.999 (3.011)	Data 0.056 (0.061)	Mem 41.61GB	Prec@1 90.000 (85.311)	Loss 1.2025 (1.2082)
[02/27 15:57:39][INFO] train_vision.py:  668: Epoch: [21][250/329], lr: 4.72e-05, eta: 2:16:01	Time 2.975 (3.010)	Data 0.040 (0.061)	Mem 41.61GB	Prec@1 90.000 (85.578)	Loss 0.9700 (1.2020)
[02/27 15:58:09][INFO] train_vision.py:  668: Epoch: [21][260/329], lr: 4.69e-05, eta: 2:15:30	Time 3.003 (3.009)	Data 0.054 (0.060)	Mem 41.61GB	Prec@1 90.000 (85.479)	Loss 1.1385 (1.2027)
[02/27 15:58:39][INFO] train_vision.py:  668: Epoch: [21][270/329], lr: 4.66e-05, eta: 2:14:59	Time 2.994 (3.009)	Data 0.049 (0.060)	Mem 41.61GB	Prec@1 80.000 (85.424)	Loss 1.4591 (1.2021)
[02/27 15:59:09][INFO] train_vision.py:  668: Epoch: [21][280/329], lr: 4.63e-05, eta: 2:14:28	Time 3.000 (3.008)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 80.000 (85.694)	Loss 1.2987 (1.1972)
[02/27 15:59:39][INFO] train_vision.py:  668: Epoch: [21][290/329], lr: 4.60e-05, eta: 2:13:57	Time 2.999 (3.008)	Data 0.049 (0.059)	Mem 41.61GB	Prec@1 90.000 (85.876)	Loss 1.1991 (1.1945)
[02/27 16:00:09][INFO] train_vision.py:  668: Epoch: [21][300/329], lr: 4.57e-05, eta: 2:13:26	Time 3.007 (3.008)	Data 0.060 (0.059)	Mem 41.61GB	Prec@1 80.000 (85.880)	Loss 1.2345 (1.1937)
[02/27 16:00:39][INFO] train_vision.py:  668: Epoch: [21][310/329], lr: 4.54e-05, eta: 2:12:54	Time 2.997 (3.007)	Data 0.048 (0.059)	Mem 41.61GB	Prec@1 70.000 (85.627)	Loss 1.6313 (1.1980)
[02/27 16:01:09][INFO] train_vision.py:  668: Epoch: [21][320/329], lr: 4.51e-05, eta: 2:12:24	Time 3.006 (3.007)	Data 0.057 (0.059)	Mem 41.61GB	Prec@1 100.000 (85.763)	Loss 0.8918 (1.1951)
[02/27 16:01:40][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (32.121)	mPrec@5 (32.323)
[02/27 16:02:22][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 98.750 (97.159)	Prec@5 100.000 (100.000)	mPrec@1 (82.930)	mPrec@5 (86.869)
[02/27 16:03:05][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 96.250 (96.012)	Prec@5 100.000 (99.821)	mPrec@1 (90.962)	mPrec@5 (97.608)
[02/27 16:03:47][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (96.290)	Prec@5 100.000 (99.839)	mPrec@1 (93.480)	mPrec@5 (98.759)
[02/27 16:04:30][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 92.500 (96.189)	Prec@5 100.000 (99.817)	mPrec@1 (94.577)	mPrec@5 (99.768)
[02/27 16:05:13][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 98.750 (95.956)	Prec@5 100.000 (99.828)	mPrec@1 (93.650)	mPrec@5 (99.778)
[02/27 16:05:55][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (96.107)	Prec@5 100.000 (99.857)	mPrec@1 (93.727)	mPrec@5 (99.814)
[02/27 16:06:38][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 93.750 (96.127)	Prec@5 100.000 (99.877)	mPrec@1 (93.802)	mPrec@5 (99.839)
[02/27 16:07:20][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 97.500 (96.235)	Prec@5 100.000 (99.892)	mPrec@1 (93.722)	mPrec@5 (99.867)
[02/27 16:08:03][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (96.154)	Prec@5 100.000 (99.904)	mPrec@1 (93.763)	mPrec@5 (99.882)
[02/27 16:08:45][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (96.101)	Prec@5 100.000 (99.913)	mPrec@1 (93.690)	mPrec@5 (99.892)
[02/27 16:09:08][INFO] train_vision.py:  847: Overall Prec@1 95.962% Prec@5 99.894% mPrec@1 (93.724) mPrec@5 (99.874)
[02/27 16:09:08][INFO] train_vision.py:  464: Testing: 93.72360229492188/93.72360229492188
[02/27 16:09:08][INFO] train_vision.py:  465: Saving:
[02/27 16:09:28][INFO] train_vision.py:  668: Epoch: [22][0/329], lr: 4.48e-05, eta: 3:57:18	Time 5.408 (5.408)	Data 2.520 (2.520)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.3690 (1.3690)
[02/27 16:09:58][INFO] train_vision.py:  668: Epoch: [22][10/329], lr: 4.45e-05, eta: 2:20:35	Time 3.025 (3.216)	Data 0.021 (0.284)	Mem 41.61GB	Prec@1 100.000 (85.455)	Loss 0.8366 (1.1170)
[02/27 16:10:28][INFO] train_vision.py:  668: Epoch: [22][20/329], lr: 4.42e-05, eta: 2:15:44	Time 3.036 (3.117)	Data 0.049 (0.175)	Mem 41.61GB	Prec@1 80.000 (85.238)	Loss 1.3587 (1.1611)
[02/27 16:10:58][INFO] train_vision.py:  668: Epoch: [22][30/329], lr: 4.39e-05, eta: 2:13:42	Time 3.002 (3.082)	Data 0.057 (0.137)	Mem 41.61GB	Prec@1 100.000 (83.871)	Loss 0.8746 (1.1808)
[02/27 16:11:28][INFO] train_vision.py:  668: Epoch: [22][40/329], lr: 4.36e-05, eta: 2:12:28	Time 3.020 (3.065)	Data 0.078 (0.118)	Mem 41.61GB	Prec@1 90.000 (83.171)	Loss 1.1656 (1.2248)
[02/27 16:11:58][INFO] train_vision.py:  668: Epoch: [22][50/329], lr: 4.33e-05, eta: 2:11:32	Time 2.993 (3.056)	Data 0.054 (0.107)	Mem 41.61GB	Prec@1 90.000 (82.353)	Loss 1.2241 (1.2499)
[02/27 16:12:28][INFO] train_vision.py:  668: Epoch: [22][60/329], lr: 4.30e-05, eta: 2:10:43	Time 3.056 (3.048)	Data 0.041 (0.098)	Mem 41.61GB	Prec@1 80.000 (83.443)	Loss 1.4551 (1.2287)
[02/27 16:12:58][INFO] train_vision.py:  668: Epoch: [22][70/329], lr: 4.27e-05, eta: 2:09:57	Time 2.973 (3.043)	Data 0.053 (0.092)	Mem 41.61GB	Prec@1 90.000 (84.366)	Loss 1.1089 (1.2146)
[02/27 16:13:28][INFO] train_vision.py:  668: Epoch: [22][80/329], lr: 4.24e-05, eta: 2:09:17	Time 3.025 (3.039)	Data 0.056 (0.087)	Mem 41.61GB	Prec@1 80.000 (84.321)	Loss 1.3803 (1.2096)
[02/27 16:13:59][INFO] train_vision.py:  668: Epoch: [22][90/329], lr: 4.21e-05, eta: 2:08:39	Time 3.011 (3.036)	Data 0.062 (0.083)	Mem 41.61GB	Prec@1 90.000 (84.396)	Loss 0.9376 (1.2085)
[02/27 16:14:29][INFO] train_vision.py:  668: Epoch: [22][100/329], lr: 4.18e-05, eta: 2:08:02	Time 3.017 (3.033)	Data 0.066 (0.081)	Mem 41.61GB	Prec@1 80.000 (84.851)	Loss 1.2821 (1.1983)
[02/27 16:14:59][INFO] train_vision.py:  668: Epoch: [22][110/329], lr: 4.15e-05, eta: 2:07:28	Time 3.021 (3.032)	Data 0.056 (0.078)	Mem 41.61GB	Prec@1 100.000 (84.685)	Loss 0.9461 (1.2010)
[02/27 16:15:29][INFO] train_vision.py:  668: Epoch: [22][120/329], lr: 4.13e-05, eta: 2:06:55	Time 3.047 (3.031)	Data 0.064 (0.077)	Mem 41.61GB	Prec@1 80.000 (84.793)	Loss 1.0480 (1.1989)
[02/27 16:15:59][INFO] train_vision.py:  668: Epoch: [22][130/329], lr: 4.10e-05, eta: 2:06:22	Time 3.011 (3.029)	Data 0.060 (0.075)	Mem 41.61GB	Prec@1 90.000 (84.427)	Loss 1.0867 (1.2118)
[02/27 16:16:29][INFO] train_vision.py:  668: Epoch: [22][140/329], lr: 4.07e-05, eta: 2:05:51	Time 3.014 (3.029)	Data 0.051 (0.074)	Mem 41.61GB	Prec@1 100.000 (84.468)	Loss 0.8132 (1.2183)
[02/27 16:16:59][INFO] train_vision.py:  668: Epoch: [22][150/329], lr: 4.04e-05, eta: 2:05:18	Time 3.005 (3.028)	Data 0.054 (0.073)	Mem 41.61GB	Prec@1 90.000 (84.636)	Loss 1.0387 (1.2123)
[02/27 16:17:30][INFO] train_vision.py:  668: Epoch: [22][160/329], lr: 4.01e-05, eta: 2:04:46	Time 3.032 (3.027)	Data 0.055 (0.072)	Mem 41.61GB	Prec@1 80.000 (84.783)	Loss 1.2933 (1.2103)
[02/27 16:18:00][INFO] train_vision.py:  668: Epoch: [22][170/329], lr: 3.98e-05, eta: 2:04:13	Time 3.005 (3.026)	Data 0.056 (0.070)	Mem 41.61GB	Prec@1 100.000 (84.795)	Loss 0.9530 (1.2088)
[02/27 16:18:30][INFO] train_vision.py:  668: Epoch: [22][180/329], lr: 3.95e-05, eta: 2:03:40	Time 3.010 (3.025)	Data 0.050 (0.070)	Mem 41.61GB	Prec@1 80.000 (84.807)	Loss 1.3004 (1.2112)
[02/27 16:19:00][INFO] train_vision.py:  668: Epoch: [22][190/329], lr: 3.92e-05, eta: 2:03:09	Time 3.000 (3.025)	Data 0.055 (0.069)	Mem 41.61GB	Prec@1 100.000 (84.974)	Loss 1.0398 (1.2076)
[02/27 16:19:30][INFO] train_vision.py:  668: Epoch: [22][200/329], lr: 3.90e-05, eta: 2:02:35	Time 3.004 (3.023)	Data 0.052 (0.068)	Mem 41.61GB	Prec@1 90.000 (84.925)	Loss 1.0103 (1.2062)
[02/27 16:20:00][INFO] train_vision.py:  668: Epoch: [22][210/329], lr: 3.87e-05, eta: 2:02:03	Time 2.985 (3.023)	Data 0.058 (0.067)	Mem 41.61GB	Prec@1 90.000 (85.071)	Loss 1.1851 (1.2039)
[02/27 16:20:30][INFO] train_vision.py:  668: Epoch: [22][220/329], lr: 3.84e-05, eta: 2:01:31	Time 2.980 (3.022)	Data 0.050 (0.067)	Mem 41.61GB	Prec@1 90.000 (85.204)	Loss 1.0769 (1.2012)
[02/27 16:21:00][INFO] train_vision.py:  668: Epoch: [22][230/329], lr: 3.81e-05, eta: 2:00:58	Time 3.010 (3.021)	Data 0.027 (0.066)	Mem 41.61GB	Prec@1 90.000 (85.541)	Loss 1.0411 (1.1945)
[02/27 16:21:30][INFO] train_vision.py:  668: Epoch: [22][240/329], lr: 3.78e-05, eta: 2:00:26	Time 3.009 (3.020)	Data 0.054 (0.066)	Mem 41.61GB	Prec@1 100.000 (85.560)	Loss 0.8536 (1.1937)
[02/27 16:22:00][INFO] train_vision.py:  668: Epoch: [22][250/329], lr: 3.76e-05, eta: 1:59:55	Time 3.013 (3.020)	Data 0.028 (0.065)	Mem 41.61GB	Prec@1 90.000 (85.498)	Loss 0.9519 (1.1946)
[02/27 16:22:30][INFO] train_vision.py:  668: Epoch: [22][260/329], lr: 3.73e-05, eta: 1:59:23	Time 3.023 (3.019)	Data 0.037 (0.065)	Mem 41.61GB	Prec@1 80.000 (85.632)	Loss 1.4062 (1.1932)
[02/27 16:23:00][INFO] train_vision.py:  668: Epoch: [22][270/329], lr: 3.70e-05, eta: 1:58:52	Time 3.001 (3.018)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 90.000 (85.793)	Loss 1.1785 (1.1902)
[02/27 16:23:30][INFO] train_vision.py:  668: Epoch: [22][280/329], lr: 3.67e-05, eta: 1:58:20	Time 3.000 (3.018)	Data 0.061 (0.064)	Mem 41.61GB	Prec@1 80.000 (85.730)	Loss 1.2702 (1.1917)
[02/27 16:24:00][INFO] train_vision.py:  668: Epoch: [22][290/329], lr: 3.64e-05, eta: 1:57:49	Time 3.000 (3.017)	Data 0.056 (0.064)	Mem 41.61GB	Prec@1 80.000 (85.739)	Loss 1.4290 (1.1905)
[02/27 16:24:30][INFO] train_vision.py:  668: Epoch: [22][300/329], lr: 3.62e-05, eta: 1:57:18	Time 2.982 (3.017)	Data 0.049 (0.063)	Mem 41.61GB	Prec@1 90.000 (85.648)	Loss 1.0177 (1.1915)
[02/27 16:25:00][INFO] train_vision.py:  668: Epoch: [22][310/329], lr: 3.59e-05, eta: 1:56:46	Time 2.995 (3.016)	Data 0.052 (0.063)	Mem 41.61GB	Prec@1 100.000 (85.659)	Loss 0.8254 (1.1922)
[02/27 16:25:30][INFO] train_vision.py:  668: Epoch: [22][320/329], lr: 3.56e-05, eta: 1:56:15	Time 3.001 (3.016)	Data 0.058 (0.063)	Mem 41.61GB	Prec@1 90.000 (85.826)	Loss 1.0904 (1.1891)
[02/27 16:26:00][INFO] train_vision.py:  668: Epoch: [23][0/329], lr: 3.54e-05, eta: 3:33:17	Time 5.554 (5.554)	Data 2.445 (2.445)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.4775 (1.4775)
[02/27 16:26:30][INFO] train_vision.py:  668: Epoch: [23][10/329], lr: 3.51e-05, eta: 2:03:40	Time 3.038 (3.235)	Data 0.018 (0.269)	Mem 41.61GB	Prec@1 100.000 (90.000)	Loss 0.8962 (1.1274)
[02/27 16:27:00][INFO] train_vision.py:  668: Epoch: [23][20/329], lr: 3.48e-05, eta: 1:59:06	Time 3.036 (3.129)	Data 0.064 (0.168)	Mem 41.61GB	Prec@1 90.000 (87.143)	Loss 1.1789 (1.1430)
[02/27 16:27:30][INFO] train_vision.py:  668: Epoch: [23][30/329], lr: 3.46e-05, eta: 1:57:08	Time 3.001 (3.091)	Data 0.059 (0.131)	Mem 41.61GB	Prec@1 100.000 (86.774)	Loss 0.9129 (1.1495)
[02/27 16:28:01][INFO] train_vision.py:  668: Epoch: [23][40/329], lr: 3.43e-05, eta: 1:55:55	Time 3.004 (3.072)	Data 0.068 (0.112)	Mem 41.61GB	Prec@1 80.000 (86.829)	Loss 1.3135 (1.1575)
[02/27 16:28:31][INFO] train_vision.py:  668: Epoch: [23][50/329], lr: 3.40e-05, eta: 1:54:59	Time 3.003 (3.061)	Data 0.071 (0.100)	Mem 41.61GB	Prec@1 100.000 (86.863)	Loss 0.9986 (1.1714)
[02/27 16:29:01][INFO] train_vision.py:  668: Epoch: [23][60/329], lr: 3.38e-05, eta: 1:54:08	Time 3.026 (3.052)	Data 0.020 (0.092)	Mem 41.61GB	Prec@1 90.000 (86.721)	Loss 1.1957 (1.1606)
[02/27 16:29:31][INFO] train_vision.py:  668: Epoch: [23][70/329], lr: 3.35e-05, eta: 1:53:25	Time 3.015 (3.047)	Data 0.063 (0.086)	Mem 41.61GB	Prec@1 100.000 (87.042)	Loss 0.8790 (1.1474)
[02/27 16:30:01][INFO] train_vision.py:  668: Epoch: [23][80/329], lr: 3.32e-05, eta: 1:52:45	Time 3.036 (3.042)	Data 0.056 (0.083)	Mem 41.61GB	Prec@1 90.000 (87.037)	Loss 1.0946 (1.1468)
[02/27 16:30:31][INFO] train_vision.py:  668: Epoch: [23][90/329], lr: 3.30e-05, eta: 1:52:05	Time 2.993 (3.038)	Data 0.059 (0.079)	Mem 41.61GB	Prec@1 90.000 (86.593)	Loss 1.0102 (1.1556)
[02/27 16:31:01][INFO] train_vision.py:  668: Epoch: [23][100/329], lr: 3.27e-05, eta: 1:51:27	Time 2.992 (3.034)	Data 0.064 (0.077)	Mem 41.61GB	Prec@1 90.000 (87.129)	Loss 1.0997 (1.1474)
[02/27 16:31:31][INFO] train_vision.py:  668: Epoch: [23][110/329], lr: 3.24e-05, eta: 1:50:49	Time 2.988 (3.031)	Data 0.054 (0.075)	Mem 41.61GB	Prec@1 90.000 (86.937)	Loss 1.0923 (1.1640)
[02/27 16:32:01][INFO] train_vision.py:  668: Epoch: [23][120/329], lr: 3.22e-05, eta: 1:50:13	Time 2.996 (3.028)	Data 0.051 (0.073)	Mem 41.61GB	Prec@1 100.000 (86.694)	Loss 0.8129 (1.1656)
[02/27 16:32:31][INFO] train_vision.py:  668: Epoch: [23][130/329], lr: 3.19e-05, eta: 1:49:38	Time 2.990 (3.026)	Data 0.057 (0.071)	Mem 41.61GB	Prec@1 90.000 (86.565)	Loss 1.2258 (1.1642)
[02/27 16:33:01][INFO] train_vision.py:  668: Epoch: [23][140/329], lr: 3.17e-05, eta: 1:49:04	Time 2.990 (3.024)	Data 0.047 (0.070)	Mem 41.61GB	Prec@1 70.000 (86.596)	Loss 1.5367 (1.1632)
[02/27 16:33:31][INFO] train_vision.py:  668: Epoch: [23][150/329], lr: 3.14e-05, eta: 1:48:31	Time 3.023 (3.023)	Data 0.053 (0.069)	Mem 41.61GB	Prec@1 90.000 (86.689)	Loss 1.1324 (1.1581)
[02/27 16:34:01][INFO] train_vision.py:  668: Epoch: [23][160/329], lr: 3.11e-05, eta: 1:47:58	Time 2.981 (3.021)	Data 0.052 (0.068)	Mem 41.61GB	Prec@1 90.000 (86.708)	Loss 1.0581 (1.1540)
[02/27 16:34:31][INFO] train_vision.py:  668: Epoch: [23][170/329], lr: 3.09e-05, eta: 1:47:26	Time 2.994 (3.021)	Data 0.054 (0.067)	Mem 41.61GB	Prec@1 90.000 (86.784)	Loss 0.9574 (1.1512)
[02/27 16:35:01][INFO] train_vision.py:  668: Epoch: [23][180/329], lr: 3.06e-05, eta: 1:46:53	Time 2.988 (3.020)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 90.000 (86.851)	Loss 0.9768 (1.1503)
[02/27 16:35:31][INFO] train_vision.py:  668: Epoch: [23][190/329], lr: 3.04e-05, eta: 1:46:21	Time 2.989 (3.019)	Data 0.053 (0.066)	Mem 41.61GB	Prec@1 100.000 (86.859)	Loss 0.8657 (1.1507)
[02/27 16:36:01][INFO] train_vision.py:  668: Epoch: [23][200/329], lr: 3.01e-05, eta: 1:45:49	Time 3.000 (3.018)	Data 0.057 (0.065)	Mem 41.61GB	Prec@1 100.000 (87.015)	Loss 0.8103 (1.1482)
[02/27 16:36:31][INFO] train_vision.py:  668: Epoch: [23][210/329], lr: 2.99e-05, eta: 1:45:16	Time 2.966 (3.017)	Data 0.053 (0.065)	Mem 41.61GB	Prec@1 80.000 (87.062)	Loss 1.2046 (1.1475)
[02/27 16:37:01][INFO] train_vision.py:  668: Epoch: [23][220/329], lr: 2.96e-05, eta: 1:44:44	Time 2.989 (3.016)	Data 0.052 (0.064)	Mem 41.61GB	Prec@1 80.000 (87.104)	Loss 1.2885 (1.1466)
[02/27 16:37:31][INFO] train_vision.py:  668: Epoch: [23][230/329], lr: 2.94e-05, eta: 1:44:12	Time 2.992 (3.015)	Data 0.049 (0.063)	Mem 41.61GB	Prec@1 100.000 (87.186)	Loss 0.8584 (1.1433)
[02/27 16:38:01][INFO] train_vision.py:  668: Epoch: [23][240/329], lr: 2.91e-05, eta: 1:43:39	Time 2.987 (3.013)	Data 0.049 (0.063)	Mem 41.61GB	Prec@1 100.000 (87.054)	Loss 1.1312 (1.1474)
[02/27 16:38:31][INFO] train_vision.py:  668: Epoch: [23][250/329], lr: 2.89e-05, eta: 1:43:08	Time 2.994 (3.013)	Data 0.050 (0.062)	Mem 41.61GB	Prec@1 80.000 (86.972)	Loss 1.4582 (1.1496)
[02/27 16:39:01][INFO] train_vision.py:  668: Epoch: [23][260/329], lr: 2.86e-05, eta: 1:42:36	Time 3.024 (3.012)	Data 0.031 (0.062)	Mem 41.61GB	Prec@1 80.000 (86.935)	Loss 1.1639 (1.1514)
[02/27 16:39:31][INFO] train_vision.py:  668: Epoch: [23][270/329], lr: 2.84e-05, eta: 1:42:04	Time 2.954 (3.011)	Data 0.046 (0.061)	Mem 41.61GB	Prec@1 90.000 (87.011)	Loss 0.9223 (1.1485)
[02/27 16:40:01][INFO] train_vision.py:  668: Epoch: [23][280/329], lr: 2.81e-05, eta: 1:41:33	Time 2.982 (3.011)	Data 0.058 (0.061)	Mem 41.61GB	Prec@1 70.000 (87.117)	Loss 1.4959 (1.1466)
[02/27 16:40:31][INFO] train_vision.py:  668: Epoch: [23][290/329], lr: 2.79e-05, eta: 1:41:02	Time 2.994 (3.010)	Data 0.059 (0.061)	Mem 41.61GB	Prec@1 90.000 (86.976)	Loss 1.2487 (1.1513)
[02/27 16:41:01][INFO] train_vision.py:  668: Epoch: [23][300/329], lr: 2.76e-05, eta: 1:40:31	Time 2.999 (3.010)	Data 0.061 (0.061)	Mem 41.61GB	Prec@1 90.000 (87.010)	Loss 1.2603 (1.1526)
[02/27 16:41:31][INFO] train_vision.py:  668: Epoch: [23][310/329], lr: 2.74e-05, eta: 1:40:00	Time 2.994 (3.009)	Data 0.055 (0.060)	Mem 41.61GB	Prec@1 90.000 (87.138)	Loss 1.0435 (1.1509)
[02/27 16:42:00][INFO] train_vision.py:  668: Epoch: [23][320/329], lr: 2.71e-05, eta: 1:39:29	Time 3.001 (3.009)	Data 0.058 (0.060)	Mem 41.61GB	Prec@1 90.000 (87.103)	Loss 1.3375 (1.1522)
[02/27 16:42:31][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (32.121)	mPrec@5 (32.323)
[02/27 16:43:14][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 95.000 (96.818)	Prec@5 100.000 (100.000)	mPrec@1 (83.367)	mPrec@5 (86.869)
[02/27 16:43:56][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 95.000 (96.190)	Prec@5 100.000 (99.821)	mPrec@1 (91.410)	mPrec@5 (97.634)
[02/27 16:44:38][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (96.250)	Prec@5 100.000 (99.879)	mPrec@1 (93.503)	mPrec@5 (98.790)
[02/27 16:45:21][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 92.500 (96.372)	Prec@5 100.000 (99.848)	mPrec@1 (94.993)	mPrec@5 (99.784)
[02/27 16:46:03][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 98.750 (95.980)	Prec@5 100.000 (99.877)	mPrec@1 (94.248)	mPrec@5 (99.799)
[02/27 16:46:45][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (96.209)	Prec@5 100.000 (99.877)	mPrec@1 (94.506)	mPrec@5 (99.822)
[02/27 16:47:28][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 93.750 (96.232)	Prec@5 98.750 (99.877)	mPrec@1 (94.497)	mPrec@5 (99.798)
[02/27 16:48:11][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 95.000 (96.250)	Prec@5 100.000 (99.892)	mPrec@1 (94.177)	mPrec@5 (99.831)
[02/27 16:48:53][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (96.085)	Prec@5 100.000 (99.904)	mPrec@1 (93.872)	mPrec@5 (99.845)
[02/27 16:49:35][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (96.126)	Prec@5 100.000 (99.913)	mPrec@1 (93.921)	mPrec@5 (99.859)
[02/27 16:49:59][INFO] train_vision.py:  847: Overall Prec@1 95.962% Prec@5 99.894% mPrec@1 (93.861) mPrec@5 (99.801)
[02/27 16:49:59][INFO] train_vision.py:  464: Testing: 93.86101531982422/93.86101531982422
[02/27 16:49:59][INFO] train_vision.py:  465: Saving:
[02/27 16:50:18][INFO] train_vision.py:  668: Epoch: [24][0/329], lr: 2.69e-05, eta: 2:51:07	Time 5.199 (5.199)	Data 2.320 (2.320)	Mem 41.61GB	Prec@1 70.000 (70.000)	Loss 1.4279 (1.4279)
[02/27 16:50:48][INFO] train_vision.py:  668: Epoch: [24][10/329], lr: 2.67e-05, eta: 1:44:30	Time 3.028 (3.191)	Data 0.054 (0.255)	Mem 41.61GB	Prec@1 90.000 (82.727)	Loss 0.9458 (1.2232)
[02/27 16:51:18][INFO] train_vision.py:  668: Epoch: [24][20/329], lr: 2.64e-05, eta: 1:41:11	Time 3.002 (3.105)	Data 0.051 (0.159)	Mem 41.61GB	Prec@1 70.000 (81.905)	Loss 1.4243 (1.2497)
[02/27 16:51:48][INFO] train_vision.py:  668: Epoch: [24][30/329], lr: 2.62e-05, eta: 1:39:36	Time 3.017 (3.073)	Data 0.032 (0.121)	Mem 41.61GB	Prec@1 100.000 (84.194)	Loss 0.9457 (1.2021)
[02/27 16:52:18][INFO] train_vision.py:  668: Epoch: [24][40/329], lr: 2.60e-05, eta: 1:38:34	Time 3.003 (3.056)	Data 0.052 (0.104)	Mem 41.61GB	Prec@1 90.000 (85.122)	Loss 1.1884 (1.1835)
[02/27 16:52:48][INFO] train_vision.py:  668: Epoch: [24][50/329], lr: 2.57e-05, eta: 1:37:44	Time 3.031 (3.047)	Data 0.020 (0.094)	Mem 41.61GB	Prec@1 90.000 (86.078)	Loss 1.0709 (1.1717)
[02/27 16:53:18][INFO] train_vision.py:  668: Epoch: [24][60/329], lr: 2.55e-05, eta: 1:36:57	Time 2.990 (3.038)	Data 0.057 (0.087)	Mem 41.61GB	Prec@1 80.000 (86.393)	Loss 1.3023 (1.1634)
[02/27 16:53:48][INFO] train_vision.py:  668: Epoch: [24][70/329], lr: 2.53e-05, eta: 1:36:18	Time 3.011 (3.033)	Data 0.050 (0.082)	Mem 41.61GB	Prec@1 90.000 (86.056)	Loss 0.9495 (1.1816)
[02/27 16:54:18][INFO] train_vision.py:  668: Epoch: [24][80/329], lr: 2.50e-05, eta: 1:35:40	Time 2.999 (3.029)	Data 0.048 (0.078)	Mem 41.61GB	Prec@1 80.000 (86.914)	Loss 1.3161 (1.1614)
[02/27 16:54:48][INFO] train_vision.py:  668: Epoch: [24][90/329], lr: 2.48e-05, eta: 1:35:04	Time 2.990 (3.026)	Data 0.061 (0.075)	Mem 41.61GB	Prec@1 100.000 (87.473)	Loss 0.8328 (1.1476)
[02/27 16:55:18][INFO] train_vision.py:  668: Epoch: [24][100/329], lr: 2.46e-05, eta: 1:34:29	Time 3.004 (3.024)	Data 0.067 (0.073)	Mem 41.61GB	Prec@1 80.000 (87.525)	Loss 1.1661 (1.1484)
[02/27 16:55:48][INFO] train_vision.py:  668: Epoch: [24][110/329], lr: 2.43e-05, eta: 1:33:54	Time 2.965 (3.021)	Data 0.060 (0.071)	Mem 41.61GB	Prec@1 90.000 (87.568)	Loss 1.0842 (1.1483)
[02/27 16:56:18][INFO] train_vision.py:  668: Epoch: [24][120/329], lr: 2.41e-05, eta: 1:33:22	Time 3.001 (3.020)	Data 0.054 (0.070)	Mem 41.61GB	Prec@1 90.000 (87.273)	Loss 1.0951 (1.1513)
[02/27 16:56:48][INFO] train_vision.py:  668: Epoch: [24][130/329], lr: 2.39e-05, eta: 1:32:49	Time 3.021 (3.019)	Data 0.037 (0.068)	Mem 41.61GB	Prec@1 100.000 (87.786)	Loss 0.9629 (1.1447)
[02/27 16:57:18][INFO] train_vision.py:  668: Epoch: [24][140/329], lr: 2.36e-05, eta: 1:32:16	Time 3.001 (3.017)	Data 0.063 (0.067)	Mem 41.61GB	Prec@1 90.000 (87.660)	Loss 1.0875 (1.1476)
[02/27 16:57:48][INFO] train_vision.py:  668: Epoch: [24][150/329], lr: 2.34e-05, eta: 1:31:44	Time 3.009 (3.016)	Data 0.056 (0.066)	Mem 41.61GB	Prec@1 70.000 (87.483)	Loss 1.4052 (1.1496)
[02/27 16:58:18][INFO] train_vision.py:  668: Epoch: [24][160/329], lr: 2.32e-05, eta: 1:31:12	Time 2.999 (3.015)	Data 0.059 (0.066)	Mem 41.61GB	Prec@1 90.000 (87.640)	Loss 1.2399 (1.1463)
[02/27 16:58:48][INFO] train_vision.py:  668: Epoch: [24][170/329], lr: 2.30e-05, eta: 1:30:40	Time 2.993 (3.014)	Data 0.057 (0.065)	Mem 41.61GB	Prec@1 90.000 (87.544)	Loss 1.1338 (1.1502)
[02/27 16:59:18][INFO] train_vision.py:  668: Epoch: [24][180/329], lr: 2.27e-05, eta: 1:30:08	Time 3.004 (3.013)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 100.000 (87.790)	Loss 0.8100 (1.1428)
[02/27 16:59:48][INFO] train_vision.py:  668: Epoch: [24][190/329], lr: 2.25e-05, eta: 1:29:37	Time 3.003 (3.012)	Data 0.054 (0.063)	Mem 41.61GB	Prec@1 90.000 (87.435)	Loss 1.0287 (1.1517)
[02/27 17:00:18][INFO] train_vision.py:  668: Epoch: [24][200/329], lr: 2.23e-05, eta: 1:29:05	Time 3.021 (3.012)	Data 0.021 (0.063)	Mem 41.61GB	Prec@1 90.000 (87.363)	Loss 1.1144 (1.1542)
[02/27 17:00:48][INFO] train_vision.py:  668: Epoch: [24][210/329], lr: 2.21e-05, eta: 1:28:34	Time 3.010 (3.011)	Data 0.060 (0.062)	Mem 41.61GB	Prec@1 80.000 (87.299)	Loss 1.1415 (1.1573)
[02/27 17:01:18][INFO] train_vision.py:  668: Epoch: [24][220/329], lr: 2.19e-05, eta: 1:28:03	Time 3.007 (3.011)	Data 0.031 (0.062)	Mem 41.61GB	Prec@1 90.000 (87.330)	Loss 1.2214 (1.1569)
[02/27 17:01:48][INFO] train_vision.py:  668: Epoch: [24][230/329], lr: 2.16e-05, eta: 1:27:32	Time 2.995 (3.010)	Data 0.052 (0.061)	Mem 41.61GB	Prec@1 70.000 (87.316)	Loss 1.7137 (1.1602)
[02/27 17:02:18][INFO] train_vision.py:  668: Epoch: [24][240/329], lr: 2.14e-05, eta: 1:27:01	Time 3.002 (3.010)	Data 0.055 (0.061)	Mem 41.61GB	Prec@1 80.000 (87.427)	Loss 1.3305 (1.1575)
[02/27 17:02:48][INFO] train_vision.py:  668: Epoch: [24][250/329], lr: 2.12e-05, eta: 1:26:30	Time 3.004 (3.009)	Data 0.051 (0.060)	Mem 41.61GB	Prec@1 90.000 (87.610)	Loss 0.9686 (1.1519)
[02/27 17:03:18][INFO] train_vision.py:  668: Epoch: [24][260/329], lr: 2.10e-05, eta: 1:25:59	Time 2.993 (3.009)	Data 0.057 (0.060)	Mem 41.61GB	Prec@1 90.000 (87.586)	Loss 1.2865 (1.1510)
[02/27 17:03:48][INFO] train_vision.py:  668: Epoch: [24][270/329], lr: 2.08e-05, eta: 1:25:28	Time 2.983 (3.008)	Data 0.056 (0.060)	Mem 41.61GB	Prec@1 90.000 (87.601)	Loss 1.0954 (1.1520)
[02/27 17:04:18][INFO] train_vision.py:  668: Epoch: [24][280/329], lr: 2.06e-05, eta: 1:24:58	Time 3.007 (3.008)	Data 0.056 (0.060)	Mem 41.61GB	Prec@1 90.000 (87.651)	Loss 1.0353 (1.1500)
[02/27 17:04:48][INFO] train_vision.py:  668: Epoch: [24][290/329], lr: 2.04e-05, eta: 1:24:27	Time 2.997 (3.007)	Data 0.055 (0.059)	Mem 41.61GB	Prec@1 80.000 (87.732)	Loss 1.4102 (1.1470)
[02/27 17:05:18][INFO] train_vision.py:  668: Epoch: [24][300/329], lr: 2.01e-05, eta: 1:23:57	Time 3.004 (3.007)	Data 0.056 (0.059)	Mem 41.61GB	Prec@1 100.000 (87.708)	Loss 0.8450 (1.1476)
[02/27 17:05:48][INFO] train_vision.py:  668: Epoch: [24][310/329], lr: 1.99e-05, eta: 1:23:26	Time 2.965 (3.007)	Data 0.053 (0.059)	Mem 41.61GB	Prec@1 80.000 (87.781)	Loss 1.2765 (1.1450)
[02/27 17:06:18][INFO] train_vision.py:  668: Epoch: [24][320/329], lr: 1.97e-05, eta: 1:22:55	Time 3.004 (3.007)	Data 0.053 (0.059)	Mem 41.61GB	Prec@1 80.000 (87.414)	Loss 1.5149 (1.1502)
[02/27 17:06:48][INFO] train_vision.py:  668: Epoch: [25][0/329], lr: 1.95e-05, eta: 2:35:01	Time 5.651 (5.651)	Data 2.440 (2.440)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.2066 (1.2066)
[02/27 17:07:18][INFO] train_vision.py:  668: Epoch: [25][10/329], lr: 1.93e-05, eta: 1:28:14	Time 2.984 (3.236)	Data 0.055 (0.273)	Mem 41.61GB	Prec@1 100.000 (87.273)	Loss 1.0132 (1.1400)
[02/27 17:07:48][INFO] train_vision.py:  668: Epoch: [25][20/329], lr: 1.91e-05, eta: 1:24:33	Time 2.984 (3.120)	Data 0.089 (0.170)	Mem 41.61GB	Prec@1 90.000 (85.714)	Loss 1.0490 (1.1669)
[02/27 17:08:18][INFO] train_vision.py:  668: Epoch: [25][30/329], lr: 1.89e-05, eta: 1:22:56	Time 2.994 (3.079)	Data 0.048 (0.131)	Mem 41.61GB	Prec@1 100.000 (86.129)	Loss 0.8081 (1.1644)
[02/27 17:08:48][INFO] train_vision.py:  668: Epoch: [25][40/329], lr: 1.87e-05, eta: 1:21:52	Time 3.008 (3.059)	Data 0.064 (0.110)	Mem 41.61GB	Prec@1 80.000 (88.049)	Loss 1.3127 (1.1221)
[02/27 17:09:18][INFO] train_vision.py:  668: Epoch: [25][50/329], lr: 1.85e-05, eta: 1:21:02	Time 2.997 (3.047)	Data 0.048 (0.098)	Mem 41.61GB	Prec@1 90.000 (88.824)	Loss 0.9822 (1.1111)
[02/27 17:09:48][INFO] train_vision.py:  668: Epoch: [25][60/329], lr: 1.83e-05, eta: 1:20:17	Time 2.991 (3.038)	Data 0.062 (0.091)	Mem 41.61GB	Prec@1 70.000 (88.033)	Loss 1.1938 (1.1209)
[02/27 17:10:18][INFO] train_vision.py:  668: Epoch: [25][70/329], lr: 1.81e-05, eta: 1:19:38	Time 2.990 (3.032)	Data 0.040 (0.086)	Mem 41.61GB	Prec@1 100.000 (88.310)	Loss 0.8693 (1.1097)
[02/27 17:10:48][INFO] train_vision.py:  668: Epoch: [25][80/329], lr: 1.79e-05, eta: 1:19:01	Time 3.007 (3.028)	Data 0.082 (0.082)	Mem 41.61GB	Prec@1 100.000 (88.148)	Loss 0.9281 (1.1160)
[02/27 17:11:18][INFO] train_vision.py:  668: Epoch: [25][90/329], lr: 1.77e-05, eta: 1:18:25	Time 2.990 (3.024)	Data 0.047 (0.079)	Mem 41.61GB	Prec@1 100.000 (88.352)	Loss 0.8686 (1.1150)
[02/27 17:11:48][INFO] train_vision.py:  668: Epoch: [25][100/329], lr: 1.75e-05, eta: 1:17:51	Time 3.003 (3.022)	Data 0.077 (0.076)	Mem 41.61GB	Prec@1 100.000 (88.812)	Loss 0.8211 (1.1060)
[02/27 17:12:17][INFO] train_vision.py:  668: Epoch: [25][110/329], lr: 1.73e-05, eta: 1:17:17	Time 2.989 (3.019)	Data 0.043 (0.074)	Mem 41.61GB	Prec@1 80.000 (88.829)	Loss 1.4746 (1.1038)
[02/27 17:12:47][INFO] train_vision.py:  668: Epoch: [25][120/329], lr: 1.71e-05, eta: 1:16:44	Time 2.995 (3.017)	Data 0.061 (0.073)	Mem 41.61GB	Prec@1 80.000 (88.760)	Loss 1.1364 (1.1062)
[02/27 17:13:17][INFO] train_vision.py:  668: Epoch: [25][130/329], lr: 1.70e-05, eta: 1:16:11	Time 2.984 (3.016)	Data 0.045 (0.072)	Mem 41.61GB	Prec@1 100.000 (88.626)	Loss 0.8397 (1.1170)
[02/27 17:13:47][INFO] train_vision.py:  668: Epoch: [25][140/329], lr: 1.68e-05, eta: 1:15:38	Time 2.982 (3.014)	Data 0.056 (0.070)	Mem 41.61GB	Prec@1 80.000 (88.723)	Loss 1.2368 (1.1154)
[02/27 17:14:17][INFO] train_vision.py:  668: Epoch: [25][150/329], lr: 1.66e-05, eta: 1:15:06	Time 2.988 (3.012)	Data 0.048 (0.069)	Mem 41.61GB	Prec@1 90.000 (88.344)	Loss 1.0878 (1.1248)
[02/27 17:14:47][INFO] train_vision.py:  668: Epoch: [25][160/329], lr: 1.64e-05, eta: 1:14:33	Time 2.981 (3.010)	Data 0.057 (0.067)	Mem 41.61GB	Prec@1 90.000 (88.199)	Loss 1.1297 (1.1328)
[02/27 17:15:17][INFO] train_vision.py:  668: Epoch: [25][170/329], lr: 1.62e-05, eta: 1:14:01	Time 2.977 (3.009)	Data 0.042 (0.066)	Mem 41.61GB	Prec@1 100.000 (88.129)	Loss 0.9246 (1.1345)
[02/27 17:15:47][INFO] train_vision.py:  668: Epoch: [25][180/329], lr: 1.60e-05, eta: 1:13:29	Time 2.997 (3.008)	Data 0.058 (0.065)	Mem 41.61GB	Prec@1 80.000 (88.343)	Loss 1.4451 (1.1316)
[02/27 17:16:17][INFO] train_vision.py:  668: Epoch: [25][190/329], lr: 1.58e-05, eta: 1:12:57	Time 3.006 (3.007)	Data 0.027 (0.064)	Mem 41.61GB	Prec@1 100.000 (88.272)	Loss 0.8057 (1.1300)
[02/27 17:16:47][INFO] train_vision.py:  668: Epoch: [25][200/329], lr: 1.56e-05, eta: 1:12:26	Time 3.004 (3.006)	Data 0.058 (0.064)	Mem 41.61GB	Prec@1 100.000 (88.358)	Loss 0.8505 (1.1252)
[02/27 17:17:16][INFO] train_vision.py:  668: Epoch: [25][210/329], lr: 1.55e-05, eta: 1:11:55	Time 3.006 (3.005)	Data 0.024 (0.063)	Mem 41.61GB	Prec@1 90.000 (88.531)	Loss 0.9936 (1.1221)
[02/27 17:17:46][INFO] train_vision.py:  668: Epoch: [25][220/329], lr: 1.53e-05, eta: 1:11:24	Time 2.986 (3.005)	Data 0.052 (0.063)	Mem 41.61GB	Prec@1 80.000 (88.507)	Loss 1.3075 (1.1193)
[02/27 17:18:16][INFO] train_vision.py:  668: Epoch: [25][230/329], lr: 1.51e-05, eta: 1:10:53	Time 2.981 (3.004)	Data 0.043 (0.062)	Mem 41.61GB	Prec@1 100.000 (88.615)	Loss 0.9687 (1.1190)
[02/27 17:18:46][INFO] train_vision.py:  668: Epoch: [25][240/329], lr: 1.49e-05, eta: 1:10:23	Time 2.998 (3.004)	Data 0.052 (0.061)	Mem 41.61GB	Prec@1 90.000 (88.631)	Loss 1.0575 (1.1204)
[02/27 17:19:16][INFO] train_vision.py:  668: Epoch: [25][250/329], lr: 1.47e-05, eta: 1:09:52	Time 2.993 (3.003)	Data 0.048 (0.061)	Mem 41.61GB	Prec@1 100.000 (88.566)	Loss 0.8464 (1.1221)
[02/27 17:19:46][INFO] train_vision.py:  668: Epoch: [25][260/329], lr: 1.46e-05, eta: 1:09:22	Time 2.996 (3.003)	Data 0.058 (0.060)	Mem 41.61GB	Prec@1 100.000 (88.736)	Loss 0.9548 (1.1179)
[02/27 17:20:16][INFO] train_vision.py:  668: Epoch: [25][270/329], lr: 1.44e-05, eta: 1:08:51	Time 3.000 (3.003)	Data 0.077 (0.060)	Mem 41.61GB	Prec@1 80.000 (88.893)	Loss 1.2368 (1.1126)
[02/27 17:20:46][INFO] train_vision.py:  668: Epoch: [25][280/329], lr: 1.42e-05, eta: 1:08:21	Time 3.003 (3.002)	Data 0.058 (0.060)	Mem 41.61GB	Prec@1 90.000 (88.932)	Loss 1.0939 (1.1131)
[02/27 17:21:16][INFO] train_vision.py:  668: Epoch: [25][290/329], lr: 1.40e-05, eta: 1:07:50	Time 2.992 (3.002)	Data 0.049 (0.059)	Mem 41.61GB	Prec@1 90.000 (88.763)	Loss 1.0209 (1.1164)
[02/27 17:21:46][INFO] train_vision.py:  668: Epoch: [25][300/329], lr: 1.39e-05, eta: 1:07:20	Time 2.996 (3.002)	Data 0.051 (0.059)	Mem 41.61GB	Prec@1 80.000 (88.671)	Loss 1.2014 (1.1165)
[02/27 17:22:16][INFO] train_vision.py:  668: Epoch: [25][310/329], lr: 1.37e-05, eta: 1:06:49	Time 2.978 (3.001)	Data 0.039 (0.058)	Mem 41.61GB	Prec@1 90.000 (88.457)	Loss 1.0656 (1.1219)
[02/27 17:22:46][INFO] train_vision.py:  668: Epoch: [25][320/329], lr: 1.35e-05, eta: 1:06:19	Time 2.983 (3.001)	Data 0.058 (0.058)	Mem 41.61GB	Prec@1 90.000 (88.411)	Loss 1.0253 (1.1223)
[02/27 17:23:17][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (32.121)	mPrec@5 (32.323)
[02/27 17:23:59][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 98.750 (97.841)	Prec@5 100.000 (100.000)	mPrec@1 (83.907)	mPrec@5 (86.869)
[02/27 17:24:41][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 98.750 (96.786)	Prec@5 100.000 (99.762)	mPrec@1 (91.899)	mPrec@5 (97.432)
[02/27 17:25:23][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (96.976)	Prec@5 100.000 (99.839)	mPrec@1 (94.198)	mPrec@5 (98.678)
[02/27 17:26:05][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 92.500 (96.799)	Prec@5 100.000 (99.848)	mPrec@1 (95.440)	mPrec@5 (99.736)
[02/27 17:26:47][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 98.750 (96.422)	Prec@5 100.000 (99.828)	mPrec@1 (94.732)	mPrec@5 (99.732)
[02/27 17:27:30][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (96.537)	Prec@5 100.000 (99.857)	mPrec@1 (94.682)	mPrec@5 (99.773)
[02/27 17:28:12][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 95.000 (96.567)	Prec@5 100.000 (99.877)	mPrec@1 (94.730)	mPrec@5 (99.812)
[02/27 17:28:55][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 93.750 (96.528)	Prec@5 100.000 (99.877)	mPrec@1 (94.455)	mPrec@5 (99.804)
[02/27 17:29:37][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 98.750 (96.415)	Prec@5 100.000 (99.890)	mPrec@1 (94.407)	mPrec@5 (99.819)
[02/27 17:30:19][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (96.436)	Prec@5 100.000 (99.901)	mPrec@1 (94.360)	mPrec@5 (99.835)
[02/27 17:30:43][INFO] train_vision.py:  847: Overall Prec@1 96.209% Prec@5 99.894% mPrec@1 (94.275) mPrec@5 (99.823)
[02/27 17:30:43][INFO] train_vision.py:  464: Testing: 94.27516174316406/94.27516174316406
[02/27 17:30:43][INFO] train_vision.py:  465: Saving:
[02/27 17:31:02][INFO] train_vision.py:  668: Epoch: [26][0/329], lr: 1.33e-05, eta: 1:52:59	Time 5.148 (5.148)	Data 2.259 (2.259)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 0.9741 (0.9741)
[02/27 17:31:32][INFO] train_vision.py:  668: Epoch: [26][10/329], lr: 1.32e-05, eta: 1:09:13	Time 3.010 (3.178)	Data 0.045 (0.259)	Mem 41.61GB	Prec@1 90.000 (88.182)	Loss 1.3325 (1.1327)
[02/27 17:32:02][INFO] train_vision.py:  668: Epoch: [26][20/329], lr: 1.30e-05, eta: 1:06:52	Time 2.990 (3.093)	Data 0.066 (0.166)	Mem 41.61GB	Prec@1 90.000 (86.190)	Loss 1.0937 (1.1640)
[02/27 17:32:32][INFO] train_vision.py:  668: Epoch: [26][30/329], lr: 1.29e-05, eta: 1:05:43	Time 3.021 (3.064)	Data 0.059 (0.132)	Mem 41.61GB	Prec@1 70.000 (87.097)	Loss 1.3984 (1.1301)
[02/27 17:33:02][INFO] train_vision.py:  668: Epoch: [26][40/329], lr: 1.27e-05, eta: 1:04:53	Time 2.994 (3.049)	Data 0.053 (0.113)	Mem 41.61GB	Prec@1 80.000 (88.537)	Loss 1.5088 (1.1077)
[02/27 17:33:32][INFO] train_vision.py:  668: Epoch: [26][50/329], lr: 1.25e-05, eta: 1:04:09	Time 3.010 (3.038)	Data 0.024 (0.100)	Mem 41.61GB	Prec@1 90.000 (88.431)	Loss 0.9477 (1.1127)
[02/27 17:34:02][INFO] train_vision.py:  668: Epoch: [26][60/329], lr: 1.24e-05, eta: 1:03:31	Time 2.984 (3.032)	Data 0.055 (0.092)	Mem 41.61GB	Prec@1 80.000 (87.869)	Loss 1.4360 (1.1283)
[02/27 17:34:32][INFO] train_vision.py:  668: Epoch: [26][70/329], lr: 1.22e-05, eta: 1:02:55	Time 3.036 (3.028)	Data 0.042 (0.085)	Mem 41.61GB	Prec@1 70.000 (88.310)	Loss 1.4812 (1.1149)
[02/27 17:35:02][INFO] train_vision.py:  668: Epoch: [26][80/329], lr: 1.20e-05, eta: 1:02:20	Time 2.956 (3.024)	Data 0.042 (0.081)	Mem 41.61GB	Prec@1 100.000 (88.272)	Loss 0.8057 (1.1093)
[02/27 17:35:32][INFO] train_vision.py:  668: Epoch: [26][90/329], lr: 1.19e-05, eta: 1:01:46	Time 3.001 (3.021)	Data 0.051 (0.078)	Mem 41.61GB	Prec@1 100.000 (88.352)	Loss 0.9392 (1.1154)
[02/27 17:36:01][INFO] train_vision.py:  668: Epoch: [26][100/329], lr: 1.17e-05, eta: 1:01:12	Time 2.967 (3.018)	Data 0.052 (0.075)	Mem 41.61GB	Prec@1 70.000 (88.119)	Loss 1.4794 (1.1192)
[02/27 17:36:31][INFO] train_vision.py:  668: Epoch: [26][110/329], lr: 1.16e-05, eta: 1:00:40	Time 3.025 (3.016)	Data 0.064 (0.073)	Mem 41.61GB	Prec@1 80.000 (87.838)	Loss 1.2596 (1.1216)
[02/27 17:37:01][INFO] train_vision.py:  668: Epoch: [26][120/329], lr: 1.14e-05, eta: 1:00:08	Time 2.988 (3.014)	Data 0.047 (0.071)	Mem 41.61GB	Prec@1 80.000 (87.438)	Loss 1.9600 (1.1468)
[02/27 17:37:31][INFO] train_vision.py:  668: Epoch: [26][130/329], lr: 1.13e-05, eta: 0:59:36	Time 2.976 (3.013)	Data 0.058 (0.069)	Mem 41.61GB	Prec@1 100.000 (87.405)	Loss 0.9426 (1.1492)
[02/27 17:38:01][INFO] train_vision.py:  668: Epoch: [26][140/329], lr: 1.11e-05, eta: 0:59:05	Time 3.007 (3.013)	Data 0.081 (0.068)	Mem 41.61GB	Prec@1 100.000 (87.660)	Loss 1.1498 (1.1480)
[02/27 17:38:31][INFO] train_vision.py:  668: Epoch: [26][150/329], lr: 1.10e-05, eta: 0:58:33	Time 2.995 (3.011)	Data 0.049 (0.066)	Mem 41.61GB	Prec@1 80.000 (88.013)	Loss 1.3276 (1.1382)
[02/27 17:39:01][INFO] train_vision.py:  668: Epoch: [26][160/329], lr: 1.08e-05, eta: 0:58:02	Time 2.997 (3.010)	Data 0.057 (0.065)	Mem 41.61GB	Prec@1 100.000 (88.137)	Loss 1.0206 (1.1347)
[02/27 17:39:31][INFO] train_vision.py:  668: Epoch: [26][170/329], lr: 1.07e-05, eta: 0:57:31	Time 2.982 (3.009)	Data 0.049 (0.065)	Mem 41.61GB	Prec@1 90.000 (88.246)	Loss 1.1552 (1.1328)
[02/27 17:40:01][INFO] train_vision.py:  668: Epoch: [26][180/329], lr: 1.05e-05, eta: 0:57:00	Time 3.000 (3.008)	Data 0.054 (0.064)	Mem 41.61GB	Prec@1 90.000 (88.122)	Loss 1.2896 (1.1368)
[02/27 17:40:31][INFO] train_vision.py:  668: Epoch: [26][190/329], lr: 1.04e-05, eta: 0:56:29	Time 2.991 (3.008)	Data 0.049 (0.063)	Mem 41.61GB	Prec@1 90.000 (87.958)	Loss 1.1199 (1.1434)
[02/27 17:41:01][INFO] train_vision.py:  668: Epoch: [26][200/329], lr: 1.02e-05, eta: 0:55:58	Time 2.971 (3.007)	Data 0.051 (0.063)	Mem 41.61GB	Prec@1 90.000 (87.861)	Loss 1.0994 (1.1409)
[02/27 17:41:31][INFO] train_vision.py:  668: Epoch: [26][210/329], lr: 1.01e-05, eta: 0:55:28	Time 3.012 (3.006)	Data 0.022 (0.062)	Mem 41.61GB	Prec@1 80.000 (88.009)	Loss 1.1996 (1.1377)
[02/27 17:42:01][INFO] train_vision.py:  668: Epoch: [26][220/329], lr: 9.93e-06, eta: 0:54:57	Time 2.997 (3.006)	Data 0.054 (0.061)	Mem 41.61GB	Prec@1 80.000 (88.100)	Loss 1.1421 (1.1360)
[02/27 17:42:31][INFO] train_vision.py:  668: Epoch: [26][230/329], lr: 9.78e-06, eta: 0:54:26	Time 3.007 (3.005)	Data 0.023 (0.061)	Mem 41.61GB	Prec@1 100.000 (88.095)	Loss 0.9873 (1.1355)
[02/27 17:43:01][INFO] train_vision.py:  668: Epoch: [26][240/329], lr: 9.64e-06, eta: 0:53:55	Time 2.992 (3.004)	Data 0.048 (0.060)	Mem 41.61GB	Prec@1 90.000 (88.008)	Loss 1.0381 (1.1367)
[02/27 17:43:31][INFO] train_vision.py:  668: Epoch: [26][250/329], lr: 9.50e-06, eta: 0:53:25	Time 2.997 (3.004)	Data 0.048 (0.060)	Mem 41.61GB	Prec@1 70.000 (87.968)	Loss 1.1324 (1.1339)
[02/27 17:44:00][INFO] train_vision.py:  668: Epoch: [26][260/329], lr: 9.37e-06, eta: 0:52:54	Time 2.996 (3.003)	Data 0.054 (0.059)	Mem 41.61GB	Prec@1 90.000 (87.969)	Loss 1.2107 (1.1348)
[02/27 17:44:30][INFO] train_vision.py:  668: Epoch: [26][270/329], lr: 9.23e-06, eta: 0:52:23	Time 3.003 (3.003)	Data 0.026 (0.059)	Mem 41.61GB	Prec@1 60.000 (88.044)	Loss 1.5520 (1.1329)
[02/27 17:45:00][INFO] train_vision.py:  668: Epoch: [26][280/329], lr: 9.09e-06, eta: 0:51:53	Time 3.004 (3.002)	Data 0.066 (0.058)	Mem 41.61GB	Prec@1 90.000 (87.758)	Loss 0.9922 (1.1382)
[02/27 17:45:30][INFO] train_vision.py:  668: Epoch: [26][290/329], lr: 8.96e-06, eta: 0:51:23	Time 2.989 (3.002)	Data 0.049 (0.058)	Mem 41.61GB	Prec@1 100.000 (87.869)	Loss 0.9278 (1.1373)
[02/27 17:46:00][INFO] train_vision.py:  668: Epoch: [26][300/329], lr: 8.83e-06, eta: 0:50:52	Time 3.002 (3.002)	Data 0.064 (0.058)	Mem 41.61GB	Prec@1 100.000 (87.973)	Loss 0.8766 (1.1375)
[02/27 17:46:30][INFO] train_vision.py:  668: Epoch: [26][310/329], lr: 8.69e-06, eta: 0:50:22	Time 2.969 (3.001)	Data 0.052 (0.058)	Mem 41.61GB	Prec@1 100.000 (88.006)	Loss 0.8137 (1.1374)
[02/27 17:47:00][INFO] train_vision.py:  668: Epoch: [26][320/329], lr: 8.56e-06, eta: 0:49:52	Time 3.006 (3.001)	Data 0.065 (0.057)	Mem 41.61GB	Prec@1 80.000 (88.100)	Loss 1.2402 (1.1352)
[02/27 17:47:31][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (32.121)	mPrec@5 (32.323)
[02/27 17:48:13][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 96.250 (97.273)	Prec@5 100.000 (100.000)	mPrec@1 (83.592)	mPrec@5 (86.869)
[02/27 17:48:56][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 97.500 (96.667)	Prec@5 100.000 (99.762)	mPrec@1 (92.057)	mPrec@5 (97.432)
[02/27 17:49:38][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (96.895)	Prec@5 100.000 (99.839)	mPrec@1 (94.324)	mPrec@5 (98.678)
[02/27 17:50:20][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 93.750 (96.951)	Prec@5 100.000 (99.817)	mPrec@1 (95.704)	mPrec@5 (99.707)
[02/27 17:51:02][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 98.750 (96.544)	Prec@5 100.000 (99.804)	mPrec@1 (94.932)	mPrec@5 (99.704)
[02/27 17:51:45][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (96.660)	Prec@5 100.000 (99.836)	mPrec@1 (94.908)	mPrec@5 (99.752)
[02/27 17:52:27][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 95.000 (96.690)	Prec@5 100.000 (99.859)	mPrec@1 (94.957)	mPrec@5 (99.796)
[02/27 17:53:09][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 95.000 (96.667)	Prec@5 100.000 (99.877)	mPrec@1 (94.671)	mPrec@5 (99.830)
[02/27 17:53:51][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (96.484)	Prec@5 100.000 (99.890)	mPrec@1 (94.329)	mPrec@5 (99.844)
[02/27 17:54:34][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 88.750 (96.485)	Prec@5 100.000 (99.901)	mPrec@1 (94.319)	mPrec@5 (99.858)
[02/27 17:54:57][INFO] train_vision.py:  847: Overall Prec@1 96.397% Prec@5 99.894% mPrec@1 (94.419) mPrec@5 (99.847)
[02/27 17:54:57][INFO] train_vision.py:  464: Testing: 94.41860961914062/94.41860961914062
[02/27 17:54:57][INFO] train_vision.py:  465: Saving:
[02/27 17:55:16][INFO] train_vision.py:  668: Epoch: [27][0/329], lr: 8.43e-06, eta: 1:26:00	Time 5.223 (5.223)	Data 2.340 (2.340)	Mem 41.61GB	Prec@1 80.000 (80.000)	Loss 1.1421 (1.1421)
[02/27 17:55:46][INFO] train_vision.py:  668: Epoch: [27][10/329], lr: 8.32e-06, eta: 0:52:01	Time 2.998 (3.192)	Data 0.077 (0.276)	Mem 41.61GB	Prec@1 80.000 (87.273)	Loss 1.0704 (1.1488)
[02/27 17:56:16][INFO] train_vision.py:  668: Epoch: [27][20/329], lr: 8.19e-06, eta: 0:49:59	Time 3.012 (3.099)	Data 0.095 (0.181)	Mem 41.61GB	Prec@1 90.000 (88.095)	Loss 1.0041 (1.0847)
[02/27 17:56:46][INFO] train_vision.py:  668: Epoch: [27][30/329], lr: 8.06e-06, eta: 0:48:58	Time 2.995 (3.067)	Data 0.061 (0.144)	Mem 41.61GB	Prec@1 100.000 (89.032)	Loss 0.9289 (1.0657)
[02/27 17:57:16][INFO] train_vision.py:  668: Epoch: [27][40/329], lr: 7.94e-06, eta: 0:48:13	Time 2.992 (3.052)	Data 0.052 (0.124)	Mem 41.61GB	Prec@1 100.000 (88.780)	Loss 0.8318 (1.0733)
[02/27 17:57:46][INFO] train_vision.py:  668: Epoch: [27][50/329], lr: 7.82e-06, eta: 0:47:33	Time 3.002 (3.042)	Data 0.056 (0.112)	Mem 41.61GB	Prec@1 100.000 (88.431)	Loss 0.9644 (1.0823)
[02/27 17:58:16][INFO] train_vision.py:  668: Epoch: [27][60/329], lr: 7.69e-06, eta: 0:46:56	Time 3.000 (3.035)	Data 0.082 (0.104)	Mem 41.61GB	Prec@1 80.000 (89.180)	Loss 1.3184 (1.0802)
[02/27 17:58:46][INFO] train_vision.py:  668: Epoch: [27][70/329], lr: 7.57e-06, eta: 0:46:21	Time 3.005 (3.030)	Data 0.067 (0.098)	Mem 41.61GB	Prec@1 90.000 (89.437)	Loss 1.2203 (1.0802)
[02/27 17:59:16][INFO] train_vision.py:  668: Epoch: [27][80/329], lr: 7.45e-06, eta: 0:45:47	Time 3.000 (3.026)	Data 0.058 (0.093)	Mem 41.61GB	Prec@1 90.000 (89.753)	Loss 1.0781 (1.0780)
[02/27 17:59:46][INFO] train_vision.py:  668: Epoch: [27][90/329], lr: 7.34e-06, eta: 0:45:14	Time 2.957 (3.023)	Data 0.051 (0.089)	Mem 41.61GB	Prec@1 90.000 (90.110)	Loss 0.9898 (1.0708)
[02/27 18:00:16][INFO] train_vision.py:  668: Epoch: [27][100/329], lr: 7.22e-06, eta: 0:44:41	Time 2.983 (3.019)	Data 0.050 (0.085)	Mem 41.61GB	Prec@1 90.000 (89.901)	Loss 1.0350 (1.0697)
[02/27 18:00:46][INFO] train_vision.py:  668: Epoch: [27][110/329], lr: 7.10e-06, eta: 0:44:09	Time 3.012 (3.017)	Data 0.066 (0.083)	Mem 41.61GB	Prec@1 80.000 (89.550)	Loss 1.3320 (1.0815)
[02/27 18:01:16][INFO] train_vision.py:  668: Epoch: [27][120/329], lr: 6.99e-06, eta: 0:43:38	Time 3.005 (3.016)	Data 0.052 (0.081)	Mem 41.61GB	Prec@1 80.000 (89.421)	Loss 1.2583 (1.0850)
[02/27 18:01:46][INFO] train_vision.py:  668: Epoch: [27][130/329], lr: 6.87e-06, eta: 0:43:06	Time 2.984 (3.015)	Data 0.047 (0.079)	Mem 41.61GB	Prec@1 80.000 (89.160)	Loss 1.2107 (1.0902)
[02/27 18:02:16][INFO] train_vision.py:  668: Epoch: [27][140/329], lr: 6.76e-06, eta: 0:42:35	Time 2.959 (3.013)	Data 0.061 (0.078)	Mem 41.61GB	Prec@1 100.000 (89.362)	Loss 0.9679 (1.0858)
[02/27 18:02:46][INFO] train_vision.py:  668: Epoch: [27][150/329], lr: 6.65e-06, eta: 0:42:04	Time 2.981 (3.012)	Data 0.044 (0.076)	Mem 41.61GB	Prec@1 100.000 (89.536)	Loss 0.8841 (1.0795)
[02/27 18:03:16][INFO] train_vision.py:  668: Epoch: [27][160/329], lr: 6.54e-06, eta: 0:41:33	Time 2.997 (3.011)	Data 0.057 (0.074)	Mem 41.61GB	Prec@1 70.000 (89.130)	Loss 1.4782 (1.0880)
[02/27 18:03:46][INFO] train_vision.py:  668: Epoch: [27][170/329], lr: 6.43e-06, eta: 0:41:02	Time 2.985 (3.011)	Data 0.078 (0.074)	Mem 41.61GB	Prec@1 70.000 (89.064)	Loss 1.5613 (1.0899)
[02/27 18:04:16][INFO] train_vision.py:  668: Epoch: [27][180/329], lr: 6.33e-06, eta: 0:40:32	Time 2.991 (3.010)	Data 0.045 (0.072)	Mem 41.61GB	Prec@1 90.000 (89.006)	Loss 1.0767 (1.0881)
[02/27 18:04:46][INFO] train_vision.py:  668: Epoch: [27][190/329], lr: 6.22e-06, eta: 0:40:01	Time 2.985 (3.009)	Data 0.081 (0.072)	Mem 41.61GB	Prec@1 100.000 (89.162)	Loss 0.9296 (1.0848)
[02/27 18:05:16][INFO] train_vision.py:  668: Epoch: [27][200/329], lr: 6.12e-06, eta: 0:39:30	Time 3.008 (3.009)	Data 0.058 (0.071)	Mem 41.61GB	Prec@1 80.000 (89.154)	Loss 1.1837 (1.0850)
[02/27 18:05:46][INFO] train_vision.py:  668: Epoch: [27][210/329], lr: 6.01e-06, eta: 0:39:00	Time 2.990 (3.008)	Data 0.047 (0.070)	Mem 41.61GB	Prec@1 80.000 (89.100)	Loss 1.0724 (1.0833)
[02/27 18:06:16][INFO] train_vision.py:  668: Epoch: [27][220/329], lr: 5.91e-06, eta: 0:38:29	Time 2.991 (3.008)	Data 0.058 (0.069)	Mem 41.61GB	Prec@1 100.000 (89.186)	Loss 0.8447 (1.0840)
[02/27 18:06:46][INFO] train_vision.py:  668: Epoch: [27][230/329], lr: 5.81e-06, eta: 0:37:59	Time 2.991 (3.007)	Data 0.050 (0.069)	Mem 41.61GB	Prec@1 90.000 (89.048)	Loss 1.0096 (1.0921)
[02/27 18:07:16][INFO] train_vision.py:  668: Epoch: [27][240/329], lr: 5.71e-06, eta: 0:37:29	Time 3.000 (3.007)	Data 0.056 (0.068)	Mem 41.61GB	Prec@1 70.000 (88.838)	Loss 1.5144 (1.0988)
[02/27 18:07:46][INFO] train_vision.py:  668: Epoch: [27][250/329], lr: 5.61e-06, eta: 0:36:58	Time 3.013 (3.006)	Data 0.082 (0.067)	Mem 41.61GB	Prec@1 80.000 (88.645)	Loss 1.3970 (1.1015)
[02/27 18:08:15][INFO] train_vision.py:  668: Epoch: [27][260/329], lr: 5.52e-06, eta: 0:36:28	Time 2.985 (3.006)	Data 0.058 (0.067)	Mem 41.61GB	Prec@1 90.000 (88.544)	Loss 0.9506 (1.1030)
[02/27 18:08:45][INFO] train_vision.py:  668: Epoch: [27][270/329], lr: 5.42e-06, eta: 0:35:57	Time 2.997 (3.006)	Data 0.049 (0.066)	Mem 41.61GB	Prec@1 70.000 (88.524)	Loss 1.2443 (1.1037)
[02/27 18:09:15][INFO] train_vision.py:  668: Epoch: [27][280/329], lr: 5.33e-06, eta: 0:35:27	Time 3.002 (3.005)	Data 0.059 (0.066)	Mem 41.61GB	Prec@1 100.000 (88.612)	Loss 0.8320 (1.1039)
[02/27 18:09:45][INFO] train_vision.py:  668: Epoch: [27][290/329], lr: 5.24e-06, eta: 0:34:57	Time 2.999 (3.005)	Data 0.055 (0.065)	Mem 41.61GB	Prec@1 90.000 (88.694)	Loss 1.0690 (1.1040)
[02/27 18:10:15][INFO] train_vision.py:  668: Epoch: [27][300/329], lr: 5.14e-06, eta: 0:34:27	Time 3.029 (3.005)	Data 0.023 (0.065)	Mem 41.61GB	Prec@1 90.000 (88.571)	Loss 1.1173 (1.1082)
[02/27 18:10:45][INFO] train_vision.py:  668: Epoch: [27][310/329], lr: 5.05e-06, eta: 0:33:56	Time 2.993 (3.004)	Data 0.049 (0.064)	Mem 41.61GB	Prec@1 100.000 (88.617)	Loss 0.9274 (1.1076)
[02/27 18:11:15][INFO] train_vision.py:  668: Epoch: [27][320/329], lr: 4.96e-06, eta: 0:33:26	Time 3.019 (3.004)	Data 0.025 (0.064)	Mem 41.61GB	Prec@1 80.000 (88.536)	Loss 1.2821 (1.1105)
[02/27 18:11:46][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (32.121)	mPrec@5 (32.323)
[02/27 18:12:28][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 98.750 (97.841)	Prec@5 100.000 (100.000)	mPrec@1 (83.906)	mPrec@5 (86.869)
[02/27 18:13:11][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 98.750 (96.905)	Prec@5 100.000 (99.702)	mPrec@1 (92.030)	mPrec@5 (97.397)
[02/27 18:13:53][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (96.935)	Prec@5 100.000 (99.798)	mPrec@1 (94.124)	mPrec@5 (98.647)
[02/27 18:14:35][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 93.750 (96.829)	Prec@5 100.000 (99.787)	mPrec@1 (95.233)	mPrec@5 (99.690)
[02/27 18:15:17][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 98.750 (96.446)	Prec@5 100.000 (99.804)	mPrec@1 (94.268)	mPrec@5 (99.700)
[02/27 18:15:59][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (96.578)	Prec@5 100.000 (99.836)	mPrec@1 (94.368)	mPrec@5 (99.747)
[02/27 18:16:42][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 95.000 (96.585)	Prec@5 98.750 (99.842)	mPrec@1 (94.403)	mPrec@5 (99.743)
[02/27 18:17:24][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 95.000 (96.620)	Prec@5 100.000 (99.846)	mPrec@1 (94.250)	mPrec@5 (99.740)
[02/27 18:18:07][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (96.415)	Prec@5 100.000 (99.863)	mPrec@1 (94.043)	mPrec@5 (99.761)
[02/27 18:18:49][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 90.000 (96.448)	Prec@5 100.000 (99.876)	mPrec@1 (94.071)	mPrec@5 (99.784)
[02/27 18:19:12][INFO] train_vision.py:  847: Overall Prec@1 96.350% Prec@5 99.871% mPrec@1 (94.173) mPrec@5 (99.773)
[02/27 18:19:12][INFO] train_vision.py:  464: Testing: 94.17326354980469/94.41860961914062
[02/27 18:19:12][INFO] train_vision.py:  465: Saving:
[02/27 18:19:25][INFO] train_vision.py:  668: Epoch: [28][0/329], lr: 4.88e-06, eta: 0:57:13	Time 5.210 (5.210)	Data 2.307 (2.307)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.0605 (1.0605)
[02/27 18:19:55][INFO] train_vision.py:  668: Epoch: [28][10/329], lr: 4.80e-06, eta: 0:34:38	Time 2.975 (3.203)	Data 0.048 (0.271)	Mem 41.61GB	Prec@1 70.000 (87.273)	Loss 1.4442 (1.1404)
[02/27 18:20:25][INFO] train_vision.py:  668: Epoch: [28][20/329], lr: 4.71e-06, eta: 0:33:07	Time 3.006 (3.111)	Data 0.096 (0.168)	Mem 41.61GB	Prec@1 90.000 (87.143)	Loss 0.9977 (1.1430)
[02/27 18:20:55][INFO] train_vision.py:  668: Epoch: [28][30/329], lr: 4.63e-06, eta: 0:32:15	Time 2.976 (3.077)	Data 0.054 (0.131)	Mem 41.61GB	Prec@1 100.000 (88.387)	Loss 0.8822 (1.1324)
[02/27 18:21:25][INFO] train_vision.py:  668: Epoch: [28][40/329], lr: 4.55e-06, eta: 0:31:34	Time 3.052 (3.060)	Data 0.073 (0.116)	Mem 41.61GB	Prec@1 70.000 (87.561)	Loss 1.3551 (1.1386)
[02/27 18:21:55][INFO] train_vision.py:  668: Epoch: [28][50/329], lr: 4.47e-06, eta: 0:30:56	Time 3.007 (3.048)	Data 0.024 (0.102)	Mem 41.61GB	Prec@1 70.000 (86.275)	Loss 1.5144 (1.1714)
[02/27 18:22:25][INFO] train_vision.py:  668: Epoch: [28][60/329], lr: 4.39e-06, eta: 0:30:22	Time 3.023 (3.042)	Data 0.089 (0.095)	Mem 41.61GB	Prec@1 70.000 (86.557)	Loss 1.6644 (1.1791)
[02/27 18:22:55][INFO] train_vision.py:  668: Epoch: [28][70/329], lr: 4.31e-06, eta: 0:29:48	Time 2.992 (3.037)	Data 0.046 (0.090)	Mem 41.61GB	Prec@1 80.000 (86.901)	Loss 1.3538 (1.1747)
[02/27 18:23:25][INFO] train_vision.py:  668: Epoch: [28][80/329], lr: 4.23e-06, eta: 0:29:15	Time 3.016 (3.032)	Data 0.058 (0.085)	Mem 41.61GB	Prec@1 90.000 (86.790)	Loss 0.9366 (1.1779)
[02/27 18:23:55][INFO] train_vision.py:  668: Epoch: [28][90/329], lr: 4.15e-06, eta: 0:28:43	Time 2.980 (3.029)	Data 0.046 (0.082)	Mem 41.61GB	Prec@1 100.000 (87.033)	Loss 0.9226 (1.1757)
[02/27 18:24:25][INFO] train_vision.py:  668: Epoch: [28][100/329], lr: 4.08e-06, eta: 0:28:11	Time 3.046 (3.026)	Data 0.022 (0.079)	Mem 41.61GB	Prec@1 100.000 (87.129)	Loss 0.8061 (1.1657)
[02/27 18:24:55][INFO] train_vision.py:  668: Epoch: [28][110/329], lr: 4.01e-06, eta: 0:27:40	Time 3.012 (3.024)	Data 0.064 (0.077)	Mem 41.61GB	Prec@1 80.000 (86.937)	Loss 1.3152 (1.1722)
[02/27 18:25:25][INFO] train_vision.py:  668: Epoch: [28][120/329], lr: 3.93e-06, eta: 0:27:09	Time 3.019 (3.023)	Data 0.074 (0.075)	Mem 41.61GB	Prec@1 100.000 (87.107)	Loss 0.8790 (1.1635)
[02/27 18:25:55][INFO] train_vision.py:  668: Epoch: [28][130/329], lr: 3.86e-06, eta: 0:26:38	Time 3.010 (3.022)	Data 0.046 (0.074)	Mem 41.61GB	Prec@1 80.000 (87.023)	Loss 1.2366 (1.1630)
[02/27 18:26:25][INFO] train_vision.py:  668: Epoch: [28][140/329], lr: 3.79e-06, eta: 0:26:07	Time 3.031 (3.021)	Data 0.084 (0.072)	Mem 41.61GB	Prec@1 90.000 (87.376)	Loss 1.0884 (1.1550)
[02/27 18:26:56][INFO] train_vision.py:  668: Epoch: [28][150/329], lr: 3.72e-06, eta: 0:25:36	Time 2.990 (3.020)	Data 0.055 (0.072)	Mem 41.61GB	Prec@1 100.000 (87.417)	Loss 0.8218 (1.1530)
[02/27 18:27:26][INFO] train_vision.py:  668: Epoch: [28][160/329], lr: 3.66e-06, eta: 0:25:06	Time 3.013 (3.019)	Data 0.070 (0.071)	Mem 41.61GB	Prec@1 80.000 (87.391)	Loss 1.0377 (1.1498)
[02/27 18:27:56][INFO] train_vision.py:  668: Epoch: [28][170/329], lr: 3.59e-06, eta: 0:24:35	Time 2.981 (3.017)	Data 0.057 (0.069)	Mem 41.61GB	Prec@1 90.000 (87.368)	Loss 1.1394 (1.1476)
[02/27 18:28:26][INFO] train_vision.py:  668: Epoch: [28][180/329], lr: 3.53e-06, eta: 0:24:04	Time 3.018 (3.016)	Data 0.093 (0.069)	Mem 41.61GB	Prec@1 100.000 (87.182)	Loss 0.9431 (1.1508)
[02/27 18:28:56][INFO] train_vision.py:  668: Epoch: [28][190/329], lr: 3.47e-06, eta: 0:23:34	Time 2.990 (3.016)	Data 0.054 (0.068)	Mem 41.61GB	Prec@1 90.000 (86.963)	Loss 1.0541 (1.1528)
[02/27 18:29:26][INFO] train_vision.py:  668: Epoch: [28][200/329], lr: 3.40e-06, eta: 0:23:03	Time 3.003 (3.015)	Data 0.054 (0.067)	Mem 41.61GB	Prec@1 100.000 (87.015)	Loss 0.9040 (1.1538)
[02/27 18:29:56][INFO] train_vision.py:  668: Epoch: [28][210/329], lr: 3.34e-06, eta: 0:22:33	Time 2.999 (3.014)	Data 0.052 (0.067)	Mem 41.61GB	Prec@1 90.000 (87.156)	Loss 1.3764 (1.1527)
[02/27 18:30:26][INFO] train_vision.py:  668: Epoch: [28][220/329], lr: 3.28e-06, eta: 0:22:02	Time 3.014 (3.014)	Data 0.080 (0.066)	Mem 41.61GB	Prec@1 100.000 (87.149)	Loss 0.9023 (1.1543)
[02/27 18:30:56][INFO] train_vision.py:  668: Epoch: [28][230/329], lr: 3.23e-06, eta: 0:21:32	Time 2.958 (3.013)	Data 0.055 (0.066)	Mem 41.61GB	Prec@1 100.000 (87.489)	Loss 0.9062 (1.1472)
[02/27 18:31:26][INFO] train_vision.py:  668: Epoch: [28][240/329], lr: 3.17e-06, eta: 0:21:02	Time 2.998 (3.012)	Data 0.050 (0.066)	Mem 41.61GB	Prec@1 70.000 (87.510)	Loss 1.4155 (1.1472)
[02/27 18:31:55][INFO] train_vision.py:  668: Epoch: [28][250/329], lr: 3.11e-06, eta: 0:20:31	Time 2.995 (3.012)	Data 0.054 (0.065)	Mem 41.61GB	Prec@1 80.000 (87.570)	Loss 1.1996 (1.1448)
[02/27 18:32:25][INFO] train_vision.py:  668: Epoch: [28][260/329], lr: 3.06e-06, eta: 0:20:01	Time 2.994 (3.011)	Data 0.051 (0.065)	Mem 41.61GB	Prec@1 90.000 (87.816)	Loss 1.0483 (1.1383)
[02/27 18:32:55][INFO] train_vision.py:  668: Epoch: [28][270/329], lr: 3.01e-06, eta: 0:19:31	Time 2.999 (3.011)	Data 0.055 (0.064)	Mem 41.61GB	Prec@1 70.000 (87.712)	Loss 1.5930 (1.1401)
[02/27 18:33:25][INFO] train_vision.py:  668: Epoch: [28][280/329], lr: 2.96e-06, eta: 0:19:00	Time 3.030 (3.010)	Data 0.024 (0.064)	Mem 41.61GB	Prec@1 90.000 (87.687)	Loss 1.1231 (1.1426)
[02/27 18:33:55][INFO] train_vision.py:  668: Epoch: [28][290/329], lr: 2.91e-06, eta: 0:18:30	Time 3.041 (3.010)	Data 0.028 (0.063)	Mem 41.61GB	Prec@1 70.000 (87.732)	Loss 1.4637 (1.1409)
[02/27 18:34:25][INFO] train_vision.py:  668: Epoch: [28][300/329], lr: 2.86e-06, eta: 0:18:00	Time 3.002 (3.009)	Data 0.057 (0.063)	Mem 41.61GB	Prec@1 80.000 (87.508)	Loss 1.2631 (1.1429)
[02/27 18:34:55][INFO] train_vision.py:  668: Epoch: [28][310/329], lr: 2.81e-06, eta: 0:17:30	Time 2.985 (3.009)	Data 0.052 (0.062)	Mem 41.61GB	Prec@1 70.000 (87.621)	Loss 1.7195 (1.1414)
[02/27 18:35:25][INFO] train_vision.py:  668: Epoch: [28][320/329], lr: 2.77e-06, eta: 0:16:59	Time 2.990 (3.009)	Data 0.046 (0.062)	Mem 41.61GB	Prec@1 90.000 (87.695)	Loss 1.0035 (1.1382)
[02/27 18:35:56][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (32.121)	mPrec@5 (32.323)
[02/27 18:36:39][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 97.500 (97.386)	Prec@5 100.000 (100.000)	mPrec@1 (83.574)	mPrec@5 (86.869)
[02/27 18:37:21][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 98.750 (96.548)	Prec@5 100.000 (99.762)	mPrec@1 (91.727)	mPrec@5 (97.432)
[02/27 18:38:04][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (96.734)	Prec@5 100.000 (99.839)	mPrec@1 (94.034)	mPrec@5 (98.678)
[02/27 18:38:46][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 93.750 (96.677)	Prec@5 100.000 (99.848)	mPrec@1 (95.209)	mPrec@5 (99.736)
[02/27 18:39:29][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 98.750 (96.446)	Prec@5 100.000 (99.853)	mPrec@1 (94.449)	mPrec@5 (99.742)
[02/27 18:40:11][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (96.598)	Prec@5 100.000 (99.857)	mPrec@1 (94.515)	mPrec@5 (99.770)
[02/27 18:40:54][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 93.750 (96.602)	Prec@5 98.750 (99.859)	mPrec@1 (94.510)	mPrec@5 (99.761)
[02/27 18:41:37][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 95.000 (96.590)	Prec@5 100.000 (99.877)	mPrec@1 (94.269)	mPrec@5 (99.800)
[02/27 18:42:19][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (96.442)	Prec@5 100.000 (99.890)	mPrec@1 (94.201)	mPrec@5 (99.815)
[02/27 18:43:01][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 90.000 (96.460)	Prec@5 100.000 (99.901)	mPrec@1 (94.168)	mPrec@5 (99.831)
[02/27 18:43:25][INFO] train_vision.py:  847: Overall Prec@1 96.350% Prec@5 99.894% mPrec@1 (94.169) mPrec@5 (99.819)
[02/27 18:43:25][INFO] train_vision.py:  464: Testing: 94.16935729980469/94.41860961914062
[02/27 18:43:25][INFO] train_vision.py:  465: Saving:
[02/27 18:43:37][INFO] train_vision.py:  668: Epoch: [29][0/329], lr: 2.72e-06, eta: 0:28:17	Time 5.145 (5.145)	Data 2.246 (2.246)	Mem 41.61GB	Prec@1 90.000 (90.000)	Loss 1.3376 (1.3376)
[02/27 18:44:07][INFO] train_vision.py:  668: Epoch: [29][10/329], lr: 2.68e-06, eta: 0:17:00	Time 3.039 (3.189)	Data 0.022 (0.256)	Mem 41.61GB	Prec@1 70.000 (87.273)	Loss 1.2706 (1.1202)
[02/27 18:44:37][INFO] train_vision.py:  668: Epoch: [29][20/329], lr: 2.64e-06, eta: 0:16:02	Time 3.014 (3.105)	Data 0.077 (0.162)	Mem 41.61GB	Prec@1 90.000 (89.048)	Loss 1.1511 (1.1341)
[02/27 18:45:07][INFO] train_vision.py:  668: Epoch: [29][30/329], lr: 2.60e-06, eta: 0:15:21	Time 3.001 (3.073)	Data 0.040 (0.130)	Mem 41.61GB	Prec@1 70.000 (87.419)	Loss 1.3834 (1.1580)
[02/27 18:45:38][INFO] train_vision.py:  668: Epoch: [29][40/329], lr: 2.56e-06, eta: 0:14:47	Time 3.034 (3.059)	Data 0.062 (0.112)	Mem 41.61GB	Prec@1 90.000 (87.561)	Loss 1.0490 (1.1291)
[02/27 18:46:08][INFO] train_vision.py:  668: Epoch: [29][50/329], lr: 2.52e-06, eta: 0:14:13	Time 3.003 (3.050)	Data 0.054 (0.102)	Mem 41.61GB	Prec@1 100.000 (87.843)	Loss 0.8112 (1.1312)
[02/27 18:46:38][INFO] train_vision.py:  668: Epoch: [29][60/329], lr: 2.49e-06, eta: 0:13:41	Time 3.012 (3.043)	Data 0.069 (0.095)	Mem 41.61GB	Prec@1 90.000 (86.557)	Loss 1.1320 (1.1648)
[02/27 18:47:08][INFO] train_vision.py:  668: Epoch: [29][70/329], lr: 2.45e-06, eta: 0:13:09	Time 2.984 (3.038)	Data 0.046 (0.089)	Mem 41.61GB	Prec@1 90.000 (87.324)	Loss 1.2019 (1.1430)
[02/27 18:47:38][INFO] train_vision.py:  668: Epoch: [29][80/329], lr: 2.42e-06, eta: 0:12:38	Time 3.009 (3.035)	Data 0.082 (0.086)	Mem 41.61GB	Prec@1 90.000 (87.531)	Loss 0.9643 (1.1320)
[02/27 18:48:08][INFO] train_vision.py:  668: Epoch: [29][90/329], lr: 2.38e-06, eta: 0:12:07	Time 2.993 (3.032)	Data 0.053 (0.083)	Mem 41.61GB	Prec@1 90.000 (87.912)	Loss 1.0395 (1.1178)
[02/27 18:48:38][INFO] train_vision.py:  668: Epoch: [29][100/329], lr: 2.35e-06, eta: 0:11:36	Time 3.032 (3.029)	Data 0.087 (0.081)	Mem 41.61GB	Prec@1 90.000 (87.822)	Loss 1.0330 (1.1195)
[02/27 18:49:08][INFO] train_vision.py:  668: Epoch: [29][110/329], lr: 2.32e-06, eta: 0:11:06	Time 3.011 (3.028)	Data 0.045 (0.078)	Mem 41.61GB	Prec@1 90.000 (88.018)	Loss 1.2234 (1.1275)
[02/27 18:49:38][INFO] train_vision.py:  668: Epoch: [29][120/329], lr: 2.29e-06, eta: 0:10:35	Time 2.993 (3.026)	Data 0.058 (0.077)	Mem 41.61GB	Prec@1 90.000 (87.686)	Loss 1.0259 (1.1331)
[02/27 18:50:08][INFO] train_vision.py:  668: Epoch: [29][130/329], lr: 2.27e-06, eta: 0:10:04	Time 2.988 (3.024)	Data 0.048 (0.075)	Mem 41.61GB	Prec@1 80.000 (87.710)	Loss 1.3270 (1.1324)
[02/27 18:50:38][INFO] train_vision.py:  668: Epoch: [29][140/329], lr: 2.24e-06, eta: 0:09:34	Time 3.001 (3.023)	Data 0.069 (0.074)	Mem 41.61GB	Prec@1 90.000 (87.730)	Loss 1.2680 (1.1359)
[02/27 18:51:08][INFO] train_vision.py:  668: Epoch: [29][150/329], lr: 2.22e-06, eta: 0:09:03	Time 3.000 (3.021)	Data 0.056 (0.072)	Mem 41.61GB	Prec@1 60.000 (87.483)	Loss 1.4305 (1.1413)
[02/27 18:51:38][INFO] train_vision.py:  668: Epoch: [29][160/329], lr: 2.19e-06, eta: 0:08:33	Time 3.000 (3.020)	Data 0.052 (0.072)	Mem 41.61GB	Prec@1 100.000 (87.640)	Loss 0.9957 (1.1393)
[02/27 18:52:08][INFO] train_vision.py:  668: Epoch: [29][170/329], lr: 2.17e-06, eta: 0:08:03	Time 2.984 (3.019)	Data 0.055 (0.070)	Mem 41.61GB	Prec@1 80.000 (87.836)	Loss 1.5067 (1.1392)
[02/27 18:52:38][INFO] train_vision.py:  668: Epoch: [29][180/329], lr: 2.15e-06, eta: 0:07:32	Time 2.992 (3.019)	Data 0.061 (0.070)	Mem 41.61GB	Prec@1 80.000 (87.569)	Loss 1.0946 (1.1438)
[02/27 18:53:09][INFO] train_vision.py:  668: Epoch: [29][190/329], lr: 2.13e-06, eta: 0:07:02	Time 3.004 (3.018)	Data 0.052 (0.069)	Mem 41.61GB	Prec@1 100.000 (87.801)	Loss 0.9185 (1.1397)
[02/27 18:53:39][INFO] train_vision.py:  668: Epoch: [29][200/329], lr: 2.11e-06, eta: 0:06:32	Time 3.002 (3.017)	Data 0.055 (0.068)	Mem 41.61GB	Prec@1 80.000 (87.711)	Loss 1.1586 (1.1432)
[02/27 18:54:09][INFO] train_vision.py:  668: Epoch: [29][210/329], lr: 2.10e-06, eta: 0:06:01	Time 2.965 (3.016)	Data 0.054 (0.067)	Mem 41.61GB	Prec@1 80.000 (87.630)	Loss 1.3023 (1.1457)
[02/27 18:54:39][INFO] train_vision.py:  668: Epoch: [29][220/329], lr: 2.08e-06, eta: 0:05:31	Time 3.004 (3.016)	Data 0.061 (0.067)	Mem 41.61GB	Prec@1 100.000 (87.873)	Loss 0.9978 (1.1426)
[02/27 18:55:09][INFO] train_vision.py:  668: Epoch: [29][230/329], lr: 2.07e-06, eta: 0:05:01	Time 3.011 (3.015)	Data 0.040 (0.066)	Mem 41.61GB	Prec@1 80.000 (87.792)	Loss 1.4964 (1.1441)
[02/27 18:55:39][INFO] train_vision.py:  668: Epoch: [29][240/329], lr: 2.05e-06, eta: 0:04:31	Time 3.004 (3.014)	Data 0.058 (0.066)	Mem 41.61GB	Prec@1 80.000 (87.676)	Loss 1.4210 (1.1456)
[02/27 18:56:09][INFO] train_vision.py:  668: Epoch: [29][250/329], lr: 2.04e-06, eta: 0:04:01	Time 3.005 (3.014)	Data 0.052 (0.065)	Mem 41.61GB	Prec@1 90.000 (87.809)	Loss 1.0861 (1.1422)
[02/27 18:56:39][INFO] train_vision.py:  668: Epoch: [29][260/329], lr: 2.03e-06, eta: 0:03:30	Time 2.998 (3.013)	Data 0.058 (0.065)	Mem 41.61GB	Prec@1 100.000 (87.931)	Loss 0.8926 (1.1362)
[02/27 18:57:09][INFO] train_vision.py:  668: Epoch: [29][270/329], lr: 2.02e-06, eta: 0:03:00	Time 2.997 (3.013)	Data 0.023 (0.064)	Mem 41.61GB	Prec@1 90.000 (87.970)	Loss 1.0987 (1.1344)
[02/27 18:57:39][INFO] train_vision.py:  668: Epoch: [29][280/329], lr: 2.02e-06, eta: 0:02:30	Time 2.995 (3.012)	Data 0.064 (0.064)	Mem 41.61GB	Prec@1 100.000 (88.256)	Loss 0.8889 (1.1290)
[02/27 18:58:09][INFO] train_vision.py:  668: Epoch: [29][290/329], lr: 2.01e-06, eta: 0:02:00	Time 3.002 (3.012)	Data 0.055 (0.063)	Mem 41.61GB	Prec@1 90.000 (88.351)	Loss 1.0047 (1.1285)
[02/27 18:58:39][INFO] train_vision.py:  668: Epoch: [29][300/329], lr: 2.01e-06, eta: 0:01:30	Time 3.007 (3.012)	Data 0.065 (0.063)	Mem 41.61GB	Prec@1 100.000 (88.439)	Loss 0.9283 (1.1277)
[02/27 18:59:09][INFO] train_vision.py:  668: Epoch: [29][310/329], lr: 2.00e-06, eta: 0:01:00	Time 3.003 (3.012)	Data 0.059 (0.063)	Mem 41.61GB	Prec@1 80.000 (88.360)	Loss 1.1977 (1.1286)
[02/27 18:59:39][INFO] train_vision.py:  668: Epoch: [29][320/329], lr: 2.00e-06, eta: 0:00:30	Time 3.004 (3.012)	Data 0.062 (0.063)	Mem 41.61GB	Prec@1 60.000 (88.224)	Loss 1.7249 (1.1319)
[02/27 19:00:10][INFO] train_vision.py:  840: Test: [0/107]	Prec@1 98.750 (98.750)	Prec@5 100.000 (100.000)	mPrec@1 (32.121)	mPrec@5 (32.323)
[02/27 19:00:52][INFO] train_vision.py:  840: Test: [10/107]	Prec@1 96.250 (97.273)	Prec@5 100.000 (100.000)	mPrec@1 (83.535)	mPrec@5 (86.869)
[02/27 19:01:35][INFO] train_vision.py:  840: Test: [20/107]	Prec@1 98.750 (96.607)	Prec@5 100.000 (99.762)	mPrec@1 (91.976)	mPrec@5 (97.432)
[02/27 19:02:17][INFO] train_vision.py:  840: Test: [30/107]	Prec@1 98.750 (96.774)	Prec@5 100.000 (99.839)	mPrec@1 (94.217)	mPrec@5 (98.678)
[02/27 19:02:59][INFO] train_vision.py:  840: Test: [40/107]	Prec@1 93.750 (96.768)	Prec@5 100.000 (99.817)	mPrec@1 (95.501)	mPrec@5 (99.707)
[02/27 19:03:41][INFO] train_vision.py:  840: Test: [50/107]	Prec@1 98.750 (96.471)	Prec@5 100.000 (99.828)	mPrec@1 (94.628)	mPrec@5 (99.715)
[02/27 19:04:23][INFO] train_vision.py:  840: Test: [60/107]	Prec@1 100.000 (96.619)	Prec@5 100.000 (99.836)	mPrec@1 (94.656)	mPrec@5 (99.749)
[02/27 19:05:06][INFO] train_vision.py:  840: Test: [70/107]	Prec@1 95.000 (96.655)	Prec@5 100.000 (99.859)	mPrec@1 (94.678)	mPrec@5 (99.793)
[02/27 19:05:48][INFO] train_vision.py:  840: Test: [80/107]	Prec@1 93.750 (96.651)	Prec@5 100.000 (99.877)	mPrec@1 (94.480)	mPrec@5 (99.826)
[02/27 19:06:30][INFO] train_vision.py:  840: Test: [90/107]	Prec@1 100.000 (96.484)	Prec@5 100.000 (99.890)	mPrec@1 (94.349)	mPrec@5 (99.840)
[02/27 19:07:13][INFO] train_vision.py:  840: Test: [100/107]	Prec@1 90.000 (96.522)	Prec@5 100.000 (99.901)	mPrec@1 (94.346)	mPrec@5 (99.854)
[02/27 19:07:36][INFO] train_vision.py:  847: Overall Prec@1 96.408% Prec@5 99.894% mPrec@1 (94.365) mPrec@5 (99.843)
[02/27 19:07:36][INFO] train_vision.py:  464: Testing: 94.36515808105469/94.41860961914062
[02/27 19:07:36][INFO] train_vision.py:  465: Saving:
[02/27 19:08:08][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/27 19:08:08][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/27 19:08:10][DEBUG] cmd.py: 1253: Popen(['git', 'cat-file', '--batch-check'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=<valid stream>, shell=False, universal_newlines=False)
[02/27 19:08:15][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/27 19:08:16][INFO] model.py:  444: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/27 19:08:17][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/27 19:08:18][INFO] model.py:  921: loading clip pretrained model!
[02/27 19:08:18][INFO] utils.py:  500: Model:
VideoCLIP(
  (visual): VisualTransformer(
    (conv1): Conv2d(3, 1024, kernel_size=(14, 14), stride=(14, 14), bias=False)
    (dropout): Dropout(p=0.0, inplace=False)
    (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
    (transformer): Transformer(
      (resblocks): ModuleList(
        (0): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (1): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (2): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (3): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (4): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (5): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (6): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (7): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (8): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (9): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (10): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (11): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (12): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (13): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (14): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (15): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (16): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (17): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (18): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (19): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (20): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (21): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (22): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (23): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
      )
    )
    (side_network): SideNetwork(
      (resblocks): ModuleList(
        (0): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (1): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (2): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (3): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (4): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (5): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (6): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (7): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (8): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (9): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (10): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (11): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (12): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (13): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (14): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (15): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (16): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (17): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (18): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (19): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (20): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (21): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (22): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (23): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
      )
      (adaptation): ModuleList(
        (0): Linear(in_features=1024, out_features=448, bias=True)
        (1): Linear(in_features=1024, out_features=448, bias=True)
        (2): Linear(in_features=1024, out_features=448, bias=True)
        (3): Linear(in_features=1024, out_features=448, bias=True)
        (4): Linear(in_features=1024, out_features=448, bias=True)
        (5): Linear(in_features=1024, out_features=448, bias=True)
        (6): Linear(in_features=1024, out_features=448, bias=True)
        (7): Linear(in_features=1024, out_features=448, bias=True)
        (8): Linear(in_features=1024, out_features=448, bias=True)
        (9): Linear(in_features=1024, out_features=448, bias=True)
        (10): Linear(in_features=1024, out_features=448, bias=True)
        (11): Linear(in_features=1024, out_features=448, bias=True)
        (12): Linear(in_features=1024, out_features=448, bias=True)
        (13): Linear(in_features=1024, out_features=448, bias=True)
        (14): Linear(in_features=1024, out_features=448, bias=True)
        (15): Linear(in_features=1024, out_features=448, bias=True)
        (16): Linear(in_features=1024, out_features=448, bias=True)
        (17): Linear(in_features=1024, out_features=448, bias=True)
        (18): Linear(in_features=1024, out_features=448, bias=True)
        (19): Linear(in_features=1024, out_features=448, bias=True)
        (20): Linear(in_features=1024, out_features=448, bias=True)
        (21): Linear(in_features=1024, out_features=448, bias=True)
        (22): Linear(in_features=1024, out_features=448, bias=True)
        (23): Linear(in_features=1024, out_features=448, bias=True)
      )
      (lns_pre): ModuleList(
        (0): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (4): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (5): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (6): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (7): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (8): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (9): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (10): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (11): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (12): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (13): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (14): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (15): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (16): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (17): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (18): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (19): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (20): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (21): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (22): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (23): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
      )
      (moss_layers): ModuleList(
        (0): MOSSBlock(
          (stss_encoders): ModuleList(
            (0): STSSEncoder(
              (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=1024, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
            (1): STSSEncoder(
              (ln_pre): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=448, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
          )
        )
      )
    )
    (side_post_bn): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    (side_conv1): Conv3d(3, 448, kernel_size=(3, 14, 14), stride=(1, 14, 14), padding=(1, 0, 0))
    (side_pre_bn3d): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
  )
  (fusion_model): video_header()
  (drop_out): Dropout(p=0, inplace=False)
  (fc): Linear(in_features=448, out_features=99, bias=True)
)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::div encountered 205 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::add_ encountered 58 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul encountered 463 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul_ encountered 90 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::add encountered 171 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::softmax encountered 24 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::sigmoid encountered 24 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator prim::PythonOp.CheckpointFunction encountered 72 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::pad encountered 38 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::unfold encountered 2 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::norm encountered 4 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::clamp_min encountered 4 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::expand_as encountered 4 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::diagonal encountered 36 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::gelu encountered 8 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::sum encountered 1 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  501: Unsupported operator aten::mean encountered 1 time(s)
[02/27 19:09:20][WARNING] jit_analysis.py:  513: The following submodules of the model were never called during the trace of the graph. They may be unused, or they were accessed by direct calls to .forward() or via other python methods. In the latter case they will have zeros for statistics, though their statistics will still contribute to their parent calling module.
fusion_model, visual.dropout, visual.side_network.resblocks.0.attn, visual.side_network.resblocks.0.attn.out_proj, visual.side_network.resblocks.0.conv, visual.side_network.resblocks.0.conv.0, visual.side_network.resblocks.0.conv.1, visual.side_network.resblocks.0.conv.2, visual.side_network.resblocks.0.mlp, visual.side_network.resblocks.0.mlp.act, visual.side_network.resblocks.0.mlp.drop, visual.side_network.resblocks.0.mlp.fc1, visual.side_network.resblocks.0.mlp.fc2, visual.side_network.resblocks.1.attn, visual.side_network.resblocks.1.attn.out_proj, visual.side_network.resblocks.1.conv, visual.side_network.resblocks.1.conv.0, visual.side_network.resblocks.1.conv.1, visual.side_network.resblocks.1.conv.2, visual.side_network.resblocks.1.mlp, visual.side_network.resblocks.1.mlp.act, visual.side_network.resblocks.1.mlp.drop, visual.side_network.resblocks.1.mlp.fc1, visual.side_network.resblocks.1.mlp.fc2, visual.side_network.resblocks.10.attn, visual.side_network.resblocks.10.attn.out_proj, visual.side_network.resblocks.10.conv, visual.side_network.resblocks.10.conv.0, visual.side_network.resblocks.10.conv.1, visual.side_network.resblocks.10.conv.2, visual.side_network.resblocks.10.mlp, visual.side_network.resblocks.10.mlp.act, visual.side_network.resblocks.10.mlp.drop, visual.side_network.resblocks.10.mlp.fc1, visual.side_network.resblocks.10.mlp.fc2, visual.side_network.resblocks.11.attn, visual.side_network.resblocks.11.attn.out_proj, visual.side_network.resblocks.11.conv, visual.side_network.resblocks.11.conv.0, visual.side_network.resblocks.11.conv.1, visual.side_network.resblocks.11.conv.2, visual.side_network.resblocks.11.mlp, visual.side_network.resblocks.11.mlp.act, visual.side_network.resblocks.11.mlp.drop, visual.side_network.resblocks.11.mlp.fc1, visual.side_network.resblocks.11.mlp.fc2, visual.side_network.resblocks.12.attn, visual.side_network.resblocks.12.attn.out_proj, visual.side_network.resblocks.12.conv, visual.side_network.resblocks.12.conv.0, visual.side_network.resblocks.12.conv.1, visual.side_network.resblocks.12.conv.2, visual.side_network.resblocks.12.mlp, visual.side_network.resblocks.12.mlp.act, visual.side_network.resblocks.12.mlp.drop, visual.side_network.resblocks.12.mlp.fc1, visual.side_network.resblocks.12.mlp.fc2, visual.side_network.resblocks.13.attn, visual.side_network.resblocks.13.attn.out_proj, visual.side_network.resblocks.13.conv, visual.side_network.resblocks.13.conv.0, visual.side_network.resblocks.13.conv.1, visual.side_network.resblocks.13.conv.2, visual.side_network.resblocks.13.mlp, visual.side_network.resblocks.13.mlp.act, visual.side_network.resblocks.13.mlp.drop, visual.side_network.resblocks.13.mlp.fc1, visual.side_network.resblocks.13.mlp.fc2, visual.side_network.resblocks.14.attn, visual.side_network.resblocks.14.attn.out_proj, visual.side_network.resblocks.14.conv, visual.side_network.resblocks.14.conv.0, visual.side_network.resblocks.14.conv.1, visual.side_network.resblocks.14.conv.2, visual.side_network.resblocks.14.mlp, visual.side_network.resblocks.14.mlp.act, visual.side_network.resblocks.14.mlp.drop, visual.side_network.resblocks.14.mlp.fc1, visual.side_network.resblocks.14.mlp.fc2, visual.side_network.resblocks.15.attn, visual.side_network.resblocks.15.attn.out_proj, visual.side_network.resblocks.15.conv, visual.side_network.resblocks.15.conv.0, visual.side_network.resblocks.15.conv.1, visual.side_network.resblocks.15.conv.2, visual.side_network.resblocks.15.mlp, visual.side_network.resblocks.15.mlp.act, visual.side_network.resblocks.15.mlp.drop, visual.side_network.resblocks.15.mlp.fc1, visual.side_network.resblocks.15.mlp.fc2, visual.side_network.resblocks.16.attn, visual.side_network.resblocks.16.attn.out_proj, visual.side_network.resblocks.16.conv, visual.side_network.resblocks.16.conv.0, visual.side_network.resblocks.16.conv.1, visual.side_network.resblocks.16.conv.2, visual.side_network.resblocks.16.mlp, visual.side_network.resblocks.16.mlp.act, visual.side_network.resblocks.16.mlp.drop, visual.side_network.resblocks.16.mlp.fc1, visual.side_network.resblocks.16.mlp.fc2, visual.side_network.resblocks.17.attn, visual.side_network.resblocks.17.attn.out_proj, visual.side_network.resblocks.17.conv, visual.side_network.resblocks.17.conv.0, visual.side_network.resblocks.17.conv.1, visual.side_network.resblocks.17.conv.2, visual.side_network.resblocks.17.mlp, visual.side_network.resblocks.17.mlp.act, visual.side_network.resblocks.17.mlp.drop, visual.side_network.resblocks.17.mlp.fc1, visual.side_network.resblocks.17.mlp.fc2, visual.side_network.resblocks.18.attn, visual.side_network.resblocks.18.attn.out_proj, visual.side_network.resblocks.18.conv, visual.side_network.resblocks.18.conv.0, visual.side_network.resblocks.18.conv.1, visual.side_network.resblocks.18.conv.2, visual.side_network.resblocks.18.mlp, visual.side_network.resblocks.18.mlp.act, visual.side_network.resblocks.18.mlp.drop, visual.side_network.resblocks.18.mlp.fc1, visual.side_network.resblocks.18.mlp.fc2, visual.side_network.resblocks.19.attn, visual.side_network.resblocks.19.attn.out_proj, visual.side_network.resblocks.19.conv, visual.side_network.resblocks.19.conv.0, visual.side_network.resblocks.19.conv.1, visual.side_network.resblocks.19.conv.2, visual.side_network.resblocks.19.mlp, visual.side_network.resblocks.19.mlp.act, visual.side_network.resblocks.19.mlp.drop, visual.side_network.resblocks.19.mlp.fc1, visual.side_network.resblocks.19.mlp.fc2, visual.side_network.resblocks.2.attn, visual.side_network.resblocks.2.attn.out_proj, visual.side_network.resblocks.2.conv, visual.side_network.resblocks.2.conv.0, visual.side_network.resblocks.2.conv.1, visual.side_network.resblocks.2.conv.2, visual.side_network.resblocks.2.mlp, visual.side_network.resblocks.2.mlp.act, visual.side_network.resblocks.2.mlp.drop, visual.side_network.resblocks.2.mlp.fc1, visual.side_network.resblocks.2.mlp.fc2, visual.side_network.resblocks.20.attn, visual.side_network.resblocks.20.attn.out_proj, visual.side_network.resblocks.20.conv, visual.side_network.resblocks.20.conv.0, visual.side_network.resblocks.20.conv.1, visual.side_network.resblocks.20.conv.2, visual.side_network.resblocks.20.mlp, visual.side_network.resblocks.20.mlp.act, visual.side_network.resblocks.20.mlp.drop, visual.side_network.resblocks.20.mlp.fc1, visual.side_network.resblocks.20.mlp.fc2, visual.side_network.resblocks.21.attn, visual.side_network.resblocks.21.attn.out_proj, visual.side_network.resblocks.21.conv, visual.side_network.resblocks.21.conv.0, visual.side_network.resblocks.21.conv.1, visual.side_network.resblocks.21.conv.2, visual.side_network.resblocks.21.mlp, visual.side_network.resblocks.21.mlp.act, visual.side_network.resblocks.21.mlp.drop, visual.side_network.resblocks.21.mlp.fc1, visual.side_network.resblocks.21.mlp.fc2, visual.side_network.resblocks.22.attn, visual.side_network.resblocks.22.attn.out_proj, visual.side_network.resblocks.22.conv, visual.side_network.resblocks.22.conv.0, visual.side_network.resblocks.22.conv.1, visual.side_network.resblocks.22.conv.2, visual.side_network.resblocks.22.mlp, visual.side_network.resblocks.22.mlp.act, visual.side_network.resblocks.22.mlp.drop, visual.side_network.resblocks.22.mlp.fc1, visual.side_network.resblocks.22.mlp.fc2, visual.side_network.resblocks.23.attn, visual.side_network.resblocks.23.attn.out_proj, visual.side_network.resblocks.23.conv, visual.side_network.resblocks.23.conv.0, visual.side_network.resblocks.23.conv.1, visual.side_network.resblocks.23.conv.2, visual.side_network.resblocks.23.mlp, visual.side_network.resblocks.23.mlp.act, visual.side_network.resblocks.23.mlp.drop, visual.side_network.resblocks.23.mlp.fc1, visual.side_network.resblocks.23.mlp.fc2, visual.side_network.resblocks.3.attn, visual.side_network.resblocks.3.attn.out_proj, visual.side_network.resblocks.3.conv, visual.side_network.resblocks.3.conv.0, visual.side_network.resblocks.3.conv.1, visual.side_network.resblocks.3.conv.2, visual.side_network.resblocks.3.mlp, visual.side_network.resblocks.3.mlp.act, visual.side_network.resblocks.3.mlp.drop, visual.side_network.resblocks.3.mlp.fc1, visual.side_network.resblocks.3.mlp.fc2, visual.side_network.resblocks.4.attn, visual.side_network.resblocks.4.attn.out_proj, visual.side_network.resblocks.4.conv, visual.side_network.resblocks.4.conv.0, visual.side_network.resblocks.4.conv.1, visual.side_network.resblocks.4.conv.2, visual.side_network.resblocks.4.mlp, visual.side_network.resblocks.4.mlp.act, visual.side_network.resblocks.4.mlp.drop, visual.side_network.resblocks.4.mlp.fc1, visual.side_network.resblocks.4.mlp.fc2, visual.side_network.resblocks.5.attn, visual.side_network.resblocks.5.attn.out_proj, visual.side_network.resblocks.5.conv, visual.side_network.resblocks.5.conv.0, visual.side_network.resblocks.5.conv.1, visual.side_network.resblocks.5.conv.2, visual.side_network.resblocks.5.mlp, visual.side_network.resblocks.5.mlp.act, visual.side_network.resblocks.5.mlp.drop, visual.side_network.resblocks.5.mlp.fc1, visual.side_network.resblocks.5.mlp.fc2, visual.side_network.resblocks.6.attn, visual.side_network.resblocks.6.attn.out_proj, visual.side_network.resblocks.6.conv, visual.side_network.resblocks.6.conv.0, visual.side_network.resblocks.6.conv.1, visual.side_network.resblocks.6.conv.2, visual.side_network.resblocks.6.mlp, visual.side_network.resblocks.6.mlp.act, visual.side_network.resblocks.6.mlp.drop, visual.side_network.resblocks.6.mlp.fc1, visual.side_network.resblocks.6.mlp.fc2, visual.side_network.resblocks.7.attn, visual.side_network.resblocks.7.attn.out_proj, visual.side_network.resblocks.7.conv, visual.side_network.resblocks.7.conv.0, visual.side_network.resblocks.7.conv.1, visual.side_network.resblocks.7.conv.2, visual.side_network.resblocks.7.mlp, visual.side_network.resblocks.7.mlp.act, visual.side_network.resblocks.7.mlp.drop, visual.side_network.resblocks.7.mlp.fc1, visual.side_network.resblocks.7.mlp.fc2, visual.side_network.resblocks.8.attn, visual.side_network.resblocks.8.attn.out_proj, visual.side_network.resblocks.8.conv, visual.side_network.resblocks.8.conv.0, visual.side_network.resblocks.8.conv.1, visual.side_network.resblocks.8.conv.2, visual.side_network.resblocks.8.mlp, visual.side_network.resblocks.8.mlp.act, visual.side_network.resblocks.8.mlp.drop, visual.side_network.resblocks.8.mlp.fc1, visual.side_network.resblocks.8.mlp.fc2, visual.side_network.resblocks.9.attn, visual.side_network.resblocks.9.attn.out_proj, visual.side_network.resblocks.9.conv, visual.side_network.resblocks.9.conv.0, visual.side_network.resblocks.9.conv.1, visual.side_network.resblocks.9.conv.2, visual.side_network.resblocks.9.mlp, visual.side_network.resblocks.9.mlp.act, visual.side_network.resblocks.9.mlp.drop, visual.side_network.resblocks.9.mlp.fc1, visual.side_network.resblocks.9.mlp.fc2, visual.transformer.resblocks.0.attn.out_proj, visual.transformer.resblocks.1.attn.out_proj, visual.transformer.resblocks.10.attn.out_proj, visual.transformer.resblocks.11.attn.out_proj, visual.transformer.resblocks.12.attn.out_proj, visual.transformer.resblocks.13.attn.out_proj, visual.transformer.resblocks.14.attn.out_proj, visual.transformer.resblocks.15.attn.out_proj, visual.transformer.resblocks.16.attn.out_proj, visual.transformer.resblocks.17.attn.out_proj, visual.transformer.resblocks.18.attn.out_proj, visual.transformer.resblocks.19.attn.out_proj, visual.transformer.resblocks.2.attn.out_proj, visual.transformer.resblocks.20.attn.out_proj, visual.transformer.resblocks.21.attn.out_proj, visual.transformer.resblocks.22.attn.out_proj, visual.transformer.resblocks.23.attn.out_proj, visual.transformer.resblocks.3.attn.out_proj, visual.transformer.resblocks.4.attn.out_proj, visual.transformer.resblocks.5.attn.out_proj, visual.transformer.resblocks.6.attn.out_proj, visual.transformer.resblocks.7.attn.out_proj, visual.transformer.resblocks.8.attn.out_proj, visual.transformer.resblocks.9.attn.out_proj
[02/27 19:09:20][INFO] utils.py:  502: Flops: 2.732T
[02/27 19:09:20][INFO] utils.py:  504: Params: 385.423M, tunable Params: 385.423M
[02/27 19:09:21][INFO] test_vision.py:  303: load model: epoch 26
[02/27 19:09:31][INFO] test_vision.py:  602: Test: [0/356], average 0.3584 sec/video 	Prec@1 100.000 (100.000)	Prec@5 100.000 (100.000)	mPrec@1 21.212	mPrec@5 21.212
[02/27 19:09:54][INFO] test_vision.py:  602: Test: [10/356], average 0.1210 sec/video 	Prec@1 100.000 (98.485)	Prec@5 100.000 (100.000)	mPrec@1 50.077	mPrec@5 51.515
[02/27 19:10:19][INFO] test_vision.py:  602: Test: [20/356], average 0.1114 sec/video 	Prec@1 95.833 (97.817)	Prec@5 100.000 (100.000)	mPrec@1 62.936	mPrec@5 65.657
[02/27 19:10:43][INFO] test_vision.py:  602: Test: [30/356], average 0.1083 sec/video 	Prec@1 91.667 (97.043)	Prec@5 100.000 (100.000)	mPrec@1 79.111	mPrec@5 82.828
[02/27 19:11:08][INFO] test_vision.py:  602: Test: [40/356], average 0.1070 sec/video 	Prec@1 95.833 (97.053)	Prec@5 100.000 (100.000)	mPrec@1 87.527	mPrec@5 91.919
[02/27 19:11:32][INFO] test_vision.py:  602: Test: [50/356], average 0.1061 sec/video 	Prec@1 95.833 (96.487)	Prec@5 100.000 (99.918)	mPrec@1 88.328	mPrec@5 93.603
[02/27 19:11:57][INFO] test_vision.py:  602: Test: [60/356], average 0.1057 sec/video 	Prec@1 95.833 (96.448)	Prec@5 100.000 (99.863)	mPrec@1 89.848	mPrec@5 95.589
[02/27 19:12:22][INFO] test_vision.py:  602: Test: [70/356], average 0.1053 sec/video 	Prec@1 83.333 (96.538)	Prec@5 100.000 (99.824)	mPrec@1 91.914	mPrec@5 97.634
[02/27 19:12:47][INFO] test_vision.py:  602: Test: [80/356], average 0.1051 sec/video 	Prec@1 95.833 (96.708)	Prec@5 100.000 (99.846)	mPrec@1 94.032	mPrec@5 98.687
[02/27 19:13:12][INFO] test_vision.py:  602: Test: [90/356], average 0.1049 sec/video 	Prec@1 91.667 (96.703)	Prec@5 100.000 (99.863)	mPrec@1 94.124	mPrec@5 98.743
[02/27 19:13:36][INFO] test_vision.py:  602: Test: [100/356], average 0.1048 sec/video 	Prec@1 100.000 (96.906)	Prec@5 100.000 (99.876)	mPrec@1 94.301	mPrec@5 98.790
[02/27 19:14:01][INFO] test_vision.py:  602: Test: [110/356], average 0.1046 sec/video 	Prec@1 100.000 (96.959)	Prec@5 100.000 (99.850)	mPrec@1 94.360	mPrec@5 98.762
[02/27 19:14:26][INFO] test_vision.py:  602: Test: [120/356], average 0.1045 sec/video 	Prec@1 100.000 (97.176)	Prec@5 100.000 (99.862)	mPrec@1 94.658	mPrec@5 98.770
[02/27 19:14:51][INFO] test_vision.py:  602: Test: [130/356], average 0.1044 sec/video 	Prec@1 95.833 (97.074)	Prec@5 100.000 (99.841)	mPrec@1 94.757	mPrec@5 98.745
[02/27 19:15:15][INFO] test_vision.py:  602: Test: [140/356], average 0.1043 sec/video 	Prec@1 79.167 (96.661)	Prec@5 100.000 (99.823)	mPrec@1 95.376	mPrec@5 99.775
[02/27 19:15:40][INFO] test_vision.py:  602: Test: [150/356], average 0.1042 sec/video 	Prec@1 100.000 (96.606)	Prec@5 100.000 (99.834)	mPrec@1 95.255	mPrec@5 99.779
[02/27 19:16:05][INFO] test_vision.py:  602: Test: [160/356], average 0.1042 sec/video 	Prec@1 100.000 (96.558)	Prec@5 100.000 (99.819)	mPrec@1 94.948	mPrec@5 99.778
[02/27 19:16:30][INFO] test_vision.py:  602: Test: [170/356], average 0.1041 sec/video 	Prec@1 100.000 (96.662)	Prec@5 100.000 (99.829)	mPrec@1 94.939	mPrec@5 99.785
[02/27 19:16:55][INFO] test_vision.py:  602: Test: [180/356], average 0.1041 sec/video 	Prec@1 100.000 (96.593)	Prec@5 100.000 (99.839)	mPrec@1 94.677	mPrec@5 99.796
[02/27 19:17:19][INFO] test_vision.py:  602: Test: [190/356], average 0.1040 sec/video 	Prec@1 100.000 (96.640)	Prec@5 100.000 (99.847)	mPrec@1 94.780	mPrec@5 99.810
[02/27 19:17:44][INFO] test_vision.py:  602: Test: [200/356], average 0.1040 sec/video 	Prec@1 100.000 (96.725)	Prec@5 100.000 (99.855)	mPrec@1 94.913	mPrec@5 99.819
[02/27 19:18:09][INFO] test_vision.py:  602: Test: [210/356], average 0.1040 sec/video 	Prec@1 91.667 (96.781)	Prec@5 100.000 (99.862)	mPrec@1 94.994	mPrec@5 99.828
[02/27 19:18:34][INFO] test_vision.py:  602: Test: [220/356], average 0.1040 sec/video 	Prec@1 100.000 (96.738)	Prec@5 100.000 (99.868)	mPrec@1 95.038	mPrec@5 99.837
[02/27 19:18:59][INFO] test_vision.py:  602: Test: [230/356], average 0.1039 sec/video 	Prec@1 100.000 (96.807)	Prec@5 100.000 (99.874)	mPrec@1 95.062	mPrec@5 99.841
[02/27 19:19:23][INFO] test_vision.py:  602: Test: [240/356], average 0.1039 sec/video 	Prec@1 100.000 (96.784)	Prec@5 100.000 (99.879)	mPrec@1 94.953	mPrec@5 99.851
[02/27 19:19:48][INFO] test_vision.py:  602: Test: [250/356], average 0.1039 sec/video 	Prec@1 100.000 (96.813)	Prec@5 100.000 (99.884)	mPrec@1 94.881	mPrec@5 99.859
[02/27 19:20:13][INFO] test_vision.py:  602: Test: [260/356], average 0.1039 sec/video 	Prec@1 100.000 (96.823)	Prec@5 100.000 (99.888)	mPrec@1 94.863	mPrec@5 99.864
[02/27 19:20:38][INFO] test_vision.py:  602: Test: [270/356], average 0.1039 sec/video 	Prec@1 95.833 (96.787)	Prec@5 100.000 (99.892)	mPrec@1 94.800	mPrec@5 99.872
[02/27 19:21:03][INFO] test_vision.py:  602: Test: [280/356], average 0.1039 sec/video 	Prec@1 95.833 (96.664)	Prec@5 100.000 (99.896)	mPrec@1 94.607	mPrec@5 99.877
[02/27 19:21:28][INFO] test_vision.py:  602: Test: [290/356], average 0.1038 sec/video 	Prec@1 100.000 (96.492)	Prec@5 100.000 (99.900)	mPrec@1 94.573	mPrec@5 99.882
[02/27 19:21:52][INFO] test_vision.py:  602: Test: [300/356], average 0.1038 sec/video 	Prec@1 100.000 (96.567)	Prec@5 100.000 (99.903)	mPrec@1 94.621	mPrec@5 99.885
[02/27 19:22:17][INFO] test_vision.py:  602: Test: [310/356], average 0.1038 sec/video 	Prec@1 100.000 (96.651)	Prec@5 100.000 (99.906)	mPrec@1 94.722	mPrec@5 99.888
[02/27 19:22:42][INFO] test_vision.py:  602: Test: [320/356], average 0.1038 sec/video 	Prec@1 87.500 (96.664)	Prec@5 100.000 (99.909)	mPrec@1 94.692	mPrec@5 99.892
[02/27 19:23:07][INFO] test_vision.py:  602: Test: [330/356], average 0.1038 sec/video 	Prec@1 100.000 (96.689)	Prec@5 100.000 (99.912)	mPrec@1 94.714	mPrec@5 99.894
[02/27 19:23:32][INFO] test_vision.py:  602: Test: [340/356], average 0.1038 sec/video 	Prec@1 100.000 (96.615)	Prec@5 100.000 (99.914)	mPrec@1 94.636	mPrec@5 99.894
[02/27 19:23:56][INFO] test_vision.py:  602: Test: [350/356], average 0.1038 sec/video 	Prec@1 100.000 (96.617)	Prec@5 100.000 (99.905)	mPrec@1 94.749	mPrec@5 99.872
[02/27 19:24:09][INFO] test_vision.py:  615: -----Evaluation is finished------
[02/27 19:24:09][INFO] test_vision.py:  621: Overall Prec@1 96.506% Prec@5 99.894%	mPrec@1 (94.701)	mPrec@5 (99.860)
[02/27 19:24:09][INFO] test_vision.py:  338: Per-class accuracies saved to ./exp/exp/s4v_selfy_vitl14_32x224_finegym99_run1/per_class_accuracies.txt
[02/27 19:24:09][INFO] test_vision.py:  371: Per-sample results saved to ./exp/exp/s4v_selfy_vitl14_32x224_finegym99_run1/per_sample_results.txt
