[02/24 01:17:04][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/24 01:17:04][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/24 01:17:05][DEBUG] cmd.py: 1253: Popen(['git', 'cat-file', '--batch-check'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=<valid stream>, shell=False, universal_newlines=False)
[02/24 01:17:08][INFO] train_vision.py:  194: ------------------------------------
[02/24 01:17:08][INFO] train_vision.py:  195: Environment Versions:
[02/24 01:17:08][INFO] train_vision.py:  196: - Python: 3.8.19 (default, Mar 20 2024, 19:58:24) 
[GCC 11.2.0]
[02/24 01:17:08][INFO] train_vision.py:  197: - PyTorch: 1.12.1
[02/24 01:17:08][INFO] train_vision.py:  198: - TorchVison: 0.13.1
[02/24 01:17:08][INFO] train_vision.py:  199: ------------------------------------
[02/24 01:17:08][INFO] train_vision.py:  201: {   'data': {   'batch_size': 9,
                'dataset': 'diving48',
                'image_tmpl': 'img_{:05d}.jpg',
                'input_size': 224,
                'label_list': 'lists/diving48_labels.csv',
                'modality': 'RGB',
                'num_classes': 48,
                'num_sample': 1,
                'num_segments': 32,
                'rand_aug': False,
                'rand_erase': False,
                'random_shift': True,
                'seg_length': 1,
                'test_batch_size': 3,
                'train_list': 'lists/diving48/train_rgb_320px_60fps_v2.txt',
                'train_root': '/home/anonymous/datasets/diving48',
                'val_list': 'lists/diving48/val_rgb_320px_60fps_v2.txt',
                'val_root': '/home/anonymous/datasets/diving48',
                'workers': 4},
    'logging': {   'acc_per_class': True,
                   'correct_per_sample': True,
                   'eval_freq': 2,
                   'print_freq': 10,
                   'skip_epoch': []},
    'network': {   'arch': 'ViT-L/14',
                   'corr_dim': 256,
                   'corr_ext_chnls': [96],
                   'corr_func': 'cosine',
                   'corr_int_chnls': [96, 96, 192],
                   'corr_layer_index': [7],
                   'corr_num_encoders': 2,
                   'corr_window': [5, 9, 9],
                   'drop_fc': 0,
                   'dropout': 0.0,
                   'emb_dropout': 0.0,
                   'fix_clip': False,
                   'init': True,
                   'joint_st': False,
                   'my_fix_clip': True,
                   'n_emb': 448,
                   'num_checkpoints': 24,
                   'side_dim': 448,
                   'sim_header': 'None',
                   'sync_bn': False,
                   'tm': False,
                   'type': 'clip_k400'},
    'pretrain': 'exp/s4v_selfy_vitl14_16x224_k400_run3/model_best.pt',
    'resume': None,
    'seed': 1024,
    'solver': {   'betas': [0.9, 0.999],
                  'clip_ratio': 1,
                  'epoch_offset': 0,
                  'epochs': 30,
                  'evaluate': False,
                  'final_factor': 0.01,
                  'grad_accumulation_steps': 1,
                  'layer_decay': 1.0,
                  'loss_type': 'CE',
                  'lr': 0.0002,
                  'lr_warmup_step': 4,
                  'optim': 'adamw',
                  'smoothing': 0.1,
                  'start_epoch': 0,
                  'type': 'cosine',
                  'warmup_lr': 2e-07,
                  'weight_decay': 0.15},
    'wandb': {   'entity': 'anonymous',
                 'exp_name': 's4v_selfy_vitl14_32x224_diving48_run4/train',
                 'group_name': 's4v_selfy_vitl14_32x224_diving48_run4',
                 'key': '1234',
                 'project_name': 'corr_adapter_diving48',
                 'use_wandb': True}}
[02/24 01:17:08][INFO] train_vision.py:  202: ------------------------------------
[02/24 01:17:08][INFO] train_vision.py:  203: storing name: ./exp/s4v_selfy_vitl14_32x224_diving48_run4
[02/24 01:17:11][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/24 01:17:12][INFO] model.py:  444: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/24 01:17:13][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/24 01:17:15][INFO] model.py:  921: loading clip pretrained model!
[02/24 01:17:15][INFO] train_vision.py:  271: visual.class_embedding False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.positional_embedding False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.conv1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.ln_pre.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.ln_pre.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.0.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.1.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.2.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.3.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.4.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.5.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.6.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.7.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.8.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.9.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.10.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.11.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.12.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.13.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.14.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.15.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.16.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.17.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.18.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.19.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.20.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.21.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.22.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.attn.in_proj_weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.attn.in_proj_bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.attn.out_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.attn.out_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.ln_1.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.ln_1.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.mlp.c_fc.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.mlp.c_fc.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.mlp.c_proj.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.mlp.c_proj.bias False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.ln_2.weight False
[02/24 01:17:15][INFO] train_vision.py:  271: visual.transformer.resblocks.23.ln_2.bias False
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.side_spatial_position_embeddings True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.0.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.1.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.2.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.3.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.4.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.5.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.6.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.7.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.8.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.9.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.10.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.11.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.12.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.13.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.14.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.15.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.16.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.17.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.18.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.19.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.20.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.21.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.22.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.bn_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.bn_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.conv.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.conv.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.conv.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.conv.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.conv.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.conv.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.bn_2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.bn_2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.mlp.fc1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.mlp.fc1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.mlp.fc2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.mlp.fc2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.attn.in_proj_weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.attn.in_proj_bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.attn.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.attn.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.ln_1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.resblocks.23.ln_1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.3.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.3.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.4.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.4.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.5.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.5.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.6.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.6.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.7.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.7.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.8.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.8.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.9.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.9.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.10.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.10.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.11.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.11.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.12.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.12.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.13.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.13.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.14.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.14.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.15.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.15.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.16.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.16.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.17.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.17.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.18.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.18.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.19.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.19.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.20.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.20.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.21.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.21.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.22.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.22.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.23.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.adaptation.23.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.0.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.3.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.3.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.4.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.4.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.5.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.5.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.6.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.6.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.7.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.7.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.8.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.8.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.9.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.9.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.10.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.10.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.11.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.11.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.12.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.12.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.13.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.13.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.14.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.14.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.15.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.15.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.16.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.16.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.17.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.17.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.18.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.18.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.19.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.19.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.20.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.20.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.21.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.21.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.22.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.22.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.23.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.lns_pre.23.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.ln_pre.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.ln_pre.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.in_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.in_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_extraction.conv0.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv0.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv1.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.stss_integration.conv2_fuse.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.0.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.ln_pre.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.ln_pre.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.in_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.in_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_extraction.conv0.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv0.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.0.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv1.1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.2.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.stss_integration.conv2_fuse.2.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.out_proj.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_network.moss_layers.0.stss_encoders.1.out_proj.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_post_bn.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_post_bn.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_conv1.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_conv1.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_pre_bn3d.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: visual.side_pre_bn3d.bias True
[02/24 01:17:15][INFO] train_vision.py:  274: fc.weight True
[02/24 01:17:15][INFO] train_vision.py:  274: fc.bias True
[02/24 01:17:15][INFO] utils.py:  456: Model:
VideoCLIP(
  (visual): VisualTransformer(
    (conv1): Conv2d(3, 1024, kernel_size=(14, 14), stride=(14, 14), bias=False)
    (dropout): Dropout(p=0.0, inplace=False)
    (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
    (transformer): Transformer(
      (resblocks): ModuleList(
        (0): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (1): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (2): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (3): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (4): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (5): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (6): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (7): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (8): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (9): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (10): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (11): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (12): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (13): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (14): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (15): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (16): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (17): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (18): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (19): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (20): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (21): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (22): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (23): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
      )
    )
    (side_network): SideNetwork(
      (resblocks): ModuleList(
        (0): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (1): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (2): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (3): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (4): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (5): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (6): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (7): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (8): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (9): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (10): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (11): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (12): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (13): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (14): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (15): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (16): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (17): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (18): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (19): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (20): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (21): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (22): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (23): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
      )
      (adaptation): ModuleList(
        (0): Linear(in_features=1024, out_features=448, bias=True)
        (1): Linear(in_features=1024, out_features=448, bias=True)
        (2): Linear(in_features=1024, out_features=448, bias=True)
        (3): Linear(in_features=1024, out_features=448, bias=True)
        (4): Linear(in_features=1024, out_features=448, bias=True)
        (5): Linear(in_features=1024, out_features=448, bias=True)
        (6): Linear(in_features=1024, out_features=448, bias=True)
        (7): Linear(in_features=1024, out_features=448, bias=True)
        (8): Linear(in_features=1024, out_features=448, bias=True)
        (9): Linear(in_features=1024, out_features=448, bias=True)
        (10): Linear(in_features=1024, out_features=448, bias=True)
        (11): Linear(in_features=1024, out_features=448, bias=True)
        (12): Linear(in_features=1024, out_features=448, bias=True)
        (13): Linear(in_features=1024, out_features=448, bias=True)
        (14): Linear(in_features=1024, out_features=448, bias=True)
        (15): Linear(in_features=1024, out_features=448, bias=True)
        (16): Linear(in_features=1024, out_features=448, bias=True)
        (17): Linear(in_features=1024, out_features=448, bias=True)
        (18): Linear(in_features=1024, out_features=448, bias=True)
        (19): Linear(in_features=1024, out_features=448, bias=True)
        (20): Linear(in_features=1024, out_features=448, bias=True)
        (21): Linear(in_features=1024, out_features=448, bias=True)
        (22): Linear(in_features=1024, out_features=448, bias=True)
        (23): Linear(in_features=1024, out_features=448, bias=True)
      )
      (lns_pre): ModuleList(
        (0): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (4): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (5): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (6): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (7): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (8): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (9): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (10): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (11): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (12): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (13): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (14): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (15): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (16): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (17): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (18): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (19): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (20): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (21): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (22): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (23): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
      )
      (moss_layers): ModuleList(
        (0): MOSSBlock(
          (stss_encoders): ModuleList(
            (0): STSSEncoder(
              (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=1024, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
            (1): STSSEncoder(
              (ln_pre): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=448, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
          )
        )
      )
    )
    (side_post_bn): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    (side_conv1): Conv3d(3, 448, kernel_size=(3, 14, 14), stride=(1, 14, 14), padding=(1, 0, 0))
    (side_pre_bn3d): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
  )
  (fusion_model): video_header()
  (drop_out): Dropout(p=0, inplace=False)
  (fc): Linear(in_features=448, out_features=48, bias=True)
)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::div encountered 205 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::add_ encountered 58 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul encountered 463 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul_ encountered 90 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::add encountered 171 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::softmax encountered 24 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::sigmoid encountered 24 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator prim::PythonOp.CheckpointFunction encountered 72 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::pad encountered 38 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::unfold encountered 2 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::norm encountered 4 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::clamp_min encountered 4 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::expand_as encountered 4 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::diagonal encountered 36 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::gelu encountered 8 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::sum encountered 1 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  501: Unsupported operator aten::mean encountered 1 time(s)
[02/24 01:19:25][WARNING] jit_analysis.py:  513: The following submodules of the model were never called during the trace of the graph. They may be unused, or they were accessed by direct calls to .forward() or via other python methods. In the latter case they will have zeros for statistics, though their statistics will still contribute to their parent calling module.
fusion_model, visual.dropout, visual.side_network.resblocks.0.attn, visual.side_network.resblocks.0.attn.out_proj, visual.side_network.resblocks.0.conv, visual.side_network.resblocks.0.conv.0, visual.side_network.resblocks.0.conv.1, visual.side_network.resblocks.0.conv.2, visual.side_network.resblocks.0.mlp, visual.side_network.resblocks.0.mlp.act, visual.side_network.resblocks.0.mlp.drop, visual.side_network.resblocks.0.mlp.fc1, visual.side_network.resblocks.0.mlp.fc2, visual.side_network.resblocks.1.attn, visual.side_network.resblocks.1.attn.out_proj, visual.side_network.resblocks.1.conv, visual.side_network.resblocks.1.conv.0, visual.side_network.resblocks.1.conv.1, visual.side_network.resblocks.1.conv.2, visual.side_network.resblocks.1.mlp, visual.side_network.resblocks.1.mlp.act, visual.side_network.resblocks.1.mlp.drop, visual.side_network.resblocks.1.mlp.fc1, visual.side_network.resblocks.1.mlp.fc2, visual.side_network.resblocks.10.attn, visual.side_network.resblocks.10.attn.out_proj, visual.side_network.resblocks.10.conv, visual.side_network.resblocks.10.conv.0, visual.side_network.resblocks.10.conv.1, visual.side_network.resblocks.10.conv.2, visual.side_network.resblocks.10.mlp, visual.side_network.resblocks.10.mlp.act, visual.side_network.resblocks.10.mlp.drop, visual.side_network.resblocks.10.mlp.fc1, visual.side_network.resblocks.10.mlp.fc2, visual.side_network.resblocks.11.attn, visual.side_network.resblocks.11.attn.out_proj, visual.side_network.resblocks.11.conv, visual.side_network.resblocks.11.conv.0, visual.side_network.resblocks.11.conv.1, visual.side_network.resblocks.11.conv.2, visual.side_network.resblocks.11.mlp, visual.side_network.resblocks.11.mlp.act, visual.side_network.resblocks.11.mlp.drop, visual.side_network.resblocks.11.mlp.fc1, visual.side_network.resblocks.11.mlp.fc2, visual.side_network.resblocks.12.attn, visual.side_network.resblocks.12.attn.out_proj, visual.side_network.resblocks.12.conv, visual.side_network.resblocks.12.conv.0, visual.side_network.resblocks.12.conv.1, visual.side_network.resblocks.12.conv.2, visual.side_network.resblocks.12.mlp, visual.side_network.resblocks.12.mlp.act, visual.side_network.resblocks.12.mlp.drop, visual.side_network.resblocks.12.mlp.fc1, visual.side_network.resblocks.12.mlp.fc2, visual.side_network.resblocks.13.attn, visual.side_network.resblocks.13.attn.out_proj, visual.side_network.resblocks.13.conv, visual.side_network.resblocks.13.conv.0, visual.side_network.resblocks.13.conv.1, visual.side_network.resblocks.13.conv.2, visual.side_network.resblocks.13.mlp, visual.side_network.resblocks.13.mlp.act, visual.side_network.resblocks.13.mlp.drop, visual.side_network.resblocks.13.mlp.fc1, visual.side_network.resblocks.13.mlp.fc2, visual.side_network.resblocks.14.attn, visual.side_network.resblocks.14.attn.out_proj, visual.side_network.resblocks.14.conv, visual.side_network.resblocks.14.conv.0, visual.side_network.resblocks.14.conv.1, visual.side_network.resblocks.14.conv.2, visual.side_network.resblocks.14.mlp, visual.side_network.resblocks.14.mlp.act, visual.side_network.resblocks.14.mlp.drop, visual.side_network.resblocks.14.mlp.fc1, visual.side_network.resblocks.14.mlp.fc2, visual.side_network.resblocks.15.attn, visual.side_network.resblocks.15.attn.out_proj, visual.side_network.resblocks.15.conv, visual.side_network.resblocks.15.conv.0, visual.side_network.resblocks.15.conv.1, visual.side_network.resblocks.15.conv.2, visual.side_network.resblocks.15.mlp, visual.side_network.resblocks.15.mlp.act, visual.side_network.resblocks.15.mlp.drop, visual.side_network.resblocks.15.mlp.fc1, visual.side_network.resblocks.15.mlp.fc2, visual.side_network.resblocks.16.attn, visual.side_network.resblocks.16.attn.out_proj, visual.side_network.resblocks.16.conv, visual.side_network.resblocks.16.conv.0, visual.side_network.resblocks.16.conv.1, visual.side_network.resblocks.16.conv.2, visual.side_network.resblocks.16.mlp, visual.side_network.resblocks.16.mlp.act, visual.side_network.resblocks.16.mlp.drop, visual.side_network.resblocks.16.mlp.fc1, visual.side_network.resblocks.16.mlp.fc2, visual.side_network.resblocks.17.attn, visual.side_network.resblocks.17.attn.out_proj, visual.side_network.resblocks.17.conv, visual.side_network.resblocks.17.conv.0, visual.side_network.resblocks.17.conv.1, visual.side_network.resblocks.17.conv.2, visual.side_network.resblocks.17.mlp, visual.side_network.resblocks.17.mlp.act, visual.side_network.resblocks.17.mlp.drop, visual.side_network.resblocks.17.mlp.fc1, visual.side_network.resblocks.17.mlp.fc2, visual.side_network.resblocks.18.attn, visual.side_network.resblocks.18.attn.out_proj, visual.side_network.resblocks.18.conv, visual.side_network.resblocks.18.conv.0, visual.side_network.resblocks.18.conv.1, visual.side_network.resblocks.18.conv.2, visual.side_network.resblocks.18.mlp, visual.side_network.resblocks.18.mlp.act, visual.side_network.resblocks.18.mlp.drop, visual.side_network.resblocks.18.mlp.fc1, visual.side_network.resblocks.18.mlp.fc2, visual.side_network.resblocks.19.attn, visual.side_network.resblocks.19.attn.out_proj, visual.side_network.resblocks.19.conv, visual.side_network.resblocks.19.conv.0, visual.side_network.resblocks.19.conv.1, visual.side_network.resblocks.19.conv.2, visual.side_network.resblocks.19.mlp, visual.side_network.resblocks.19.mlp.act, visual.side_network.resblocks.19.mlp.drop, visual.side_network.resblocks.19.mlp.fc1, visual.side_network.resblocks.19.mlp.fc2, visual.side_network.resblocks.2.attn, visual.side_network.resblocks.2.attn.out_proj, visual.side_network.resblocks.2.conv, visual.side_network.resblocks.2.conv.0, visual.side_network.resblocks.2.conv.1, visual.side_network.resblocks.2.conv.2, visual.side_network.resblocks.2.mlp, visual.side_network.resblocks.2.mlp.act, visual.side_network.resblocks.2.mlp.drop, visual.side_network.resblocks.2.mlp.fc1, visual.side_network.resblocks.2.mlp.fc2, visual.side_network.resblocks.20.attn, visual.side_network.resblocks.20.attn.out_proj, visual.side_network.resblocks.20.conv, visual.side_network.resblocks.20.conv.0, visual.side_network.resblocks.20.conv.1, visual.side_network.resblocks.20.conv.2, visual.side_network.resblocks.20.mlp, visual.side_network.resblocks.20.mlp.act, visual.side_network.resblocks.20.mlp.drop, visual.side_network.resblocks.20.mlp.fc1, visual.side_network.resblocks.20.mlp.fc2, visual.side_network.resblocks.21.attn, visual.side_network.resblocks.21.attn.out_proj, visual.side_network.resblocks.21.conv, visual.side_network.resblocks.21.conv.0, visual.side_network.resblocks.21.conv.1, visual.side_network.resblocks.21.conv.2, visual.side_network.resblocks.21.mlp, visual.side_network.resblocks.21.mlp.act, visual.side_network.resblocks.21.mlp.drop, visual.side_network.resblocks.21.mlp.fc1, visual.side_network.resblocks.21.mlp.fc2, visual.side_network.resblocks.22.attn, visual.side_network.resblocks.22.attn.out_proj, visual.side_network.resblocks.22.conv, visual.side_network.resblocks.22.conv.0, visual.side_network.resblocks.22.conv.1, visual.side_network.resblocks.22.conv.2, visual.side_network.resblocks.22.mlp, visual.side_network.resblocks.22.mlp.act, visual.side_network.resblocks.22.mlp.drop, visual.side_network.resblocks.22.mlp.fc1, visual.side_network.resblocks.22.mlp.fc2, visual.side_network.resblocks.23.attn, visual.side_network.resblocks.23.attn.out_proj, visual.side_network.resblocks.23.conv, visual.side_network.resblocks.23.conv.0, visual.side_network.resblocks.23.conv.1, visual.side_network.resblocks.23.conv.2, visual.side_network.resblocks.23.mlp, visual.side_network.resblocks.23.mlp.act, visual.side_network.resblocks.23.mlp.drop, visual.side_network.resblocks.23.mlp.fc1, visual.side_network.resblocks.23.mlp.fc2, visual.side_network.resblocks.3.attn, visual.side_network.resblocks.3.attn.out_proj, visual.side_network.resblocks.3.conv, visual.side_network.resblocks.3.conv.0, visual.side_network.resblocks.3.conv.1, visual.side_network.resblocks.3.conv.2, visual.side_network.resblocks.3.mlp, visual.side_network.resblocks.3.mlp.act, visual.side_network.resblocks.3.mlp.drop, visual.side_network.resblocks.3.mlp.fc1, visual.side_network.resblocks.3.mlp.fc2, visual.side_network.resblocks.4.attn, visual.side_network.resblocks.4.attn.out_proj, visual.side_network.resblocks.4.conv, visual.side_network.resblocks.4.conv.0, visual.side_network.resblocks.4.conv.1, visual.side_network.resblocks.4.conv.2, visual.side_network.resblocks.4.mlp, visual.side_network.resblocks.4.mlp.act, visual.side_network.resblocks.4.mlp.drop, visual.side_network.resblocks.4.mlp.fc1, visual.side_network.resblocks.4.mlp.fc2, visual.side_network.resblocks.5.attn, visual.side_network.resblocks.5.attn.out_proj, visual.side_network.resblocks.5.conv, visual.side_network.resblocks.5.conv.0, visual.side_network.resblocks.5.conv.1, visual.side_network.resblocks.5.conv.2, visual.side_network.resblocks.5.mlp, visual.side_network.resblocks.5.mlp.act, visual.side_network.resblocks.5.mlp.drop, visual.side_network.resblocks.5.mlp.fc1, visual.side_network.resblocks.5.mlp.fc2, visual.side_network.resblocks.6.attn, visual.side_network.resblocks.6.attn.out_proj, visual.side_network.resblocks.6.conv, visual.side_network.resblocks.6.conv.0, visual.side_network.resblocks.6.conv.1, visual.side_network.resblocks.6.conv.2, visual.side_network.resblocks.6.mlp, visual.side_network.resblocks.6.mlp.act, visual.side_network.resblocks.6.mlp.drop, visual.side_network.resblocks.6.mlp.fc1, visual.side_network.resblocks.6.mlp.fc2, visual.side_network.resblocks.7.attn, visual.side_network.resblocks.7.attn.out_proj, visual.side_network.resblocks.7.conv, visual.side_network.resblocks.7.conv.0, visual.side_network.resblocks.7.conv.1, visual.side_network.resblocks.7.conv.2, visual.side_network.resblocks.7.mlp, visual.side_network.resblocks.7.mlp.act, visual.side_network.resblocks.7.mlp.drop, visual.side_network.resblocks.7.mlp.fc1, visual.side_network.resblocks.7.mlp.fc2, visual.side_network.resblocks.8.attn, visual.side_network.resblocks.8.attn.out_proj, visual.side_network.resblocks.8.conv, visual.side_network.resblocks.8.conv.0, visual.side_network.resblocks.8.conv.1, visual.side_network.resblocks.8.conv.2, visual.side_network.resblocks.8.mlp, visual.side_network.resblocks.8.mlp.act, visual.side_network.resblocks.8.mlp.drop, visual.side_network.resblocks.8.mlp.fc1, visual.side_network.resblocks.8.mlp.fc2, visual.side_network.resblocks.9.attn, visual.side_network.resblocks.9.attn.out_proj, visual.side_network.resblocks.9.conv, visual.side_network.resblocks.9.conv.0, visual.side_network.resblocks.9.conv.1, visual.side_network.resblocks.9.conv.2, visual.side_network.resblocks.9.mlp, visual.side_network.resblocks.9.mlp.act, visual.side_network.resblocks.9.mlp.drop, visual.side_network.resblocks.9.mlp.fc1, visual.side_network.resblocks.9.mlp.fc2, visual.transformer.resblocks.0.attn.out_proj, visual.transformer.resblocks.1.attn.out_proj, visual.transformer.resblocks.10.attn.out_proj, visual.transformer.resblocks.11.attn.out_proj, visual.transformer.resblocks.12.attn.out_proj, visual.transformer.resblocks.13.attn.out_proj, visual.transformer.resblocks.14.attn.out_proj, visual.transformer.resblocks.15.attn.out_proj, visual.transformer.resblocks.16.attn.out_proj, visual.transformer.resblocks.17.attn.out_proj, visual.transformer.resblocks.18.attn.out_proj, visual.transformer.resblocks.19.attn.out_proj, visual.transformer.resblocks.2.attn.out_proj, visual.transformer.resblocks.20.attn.out_proj, visual.transformer.resblocks.21.attn.out_proj, visual.transformer.resblocks.22.attn.out_proj, visual.transformer.resblocks.23.attn.out_proj, visual.transformer.resblocks.3.attn.out_proj, visual.transformer.resblocks.4.attn.out_proj, visual.transformer.resblocks.5.attn.out_proj, visual.transformer.resblocks.6.attn.out_proj, visual.transformer.resblocks.7.attn.out_proj, visual.transformer.resblocks.8.attn.out_proj, visual.transformer.resblocks.9.attn.out_proj
[02/24 01:19:25][INFO] utils.py:  458: Flops: 2.732T
[02/24 01:19:25][INFO] utils.py:  460: Params: 385.400M, tunable Params: 82.222M
[02/24 01:19:25][INFO] train_vision.py:  284: train transforms: [Compose(
    <datasets.transforms.GroupScale object at 0x150262e4c2b0>
    Compose(
    <datasets.transforms.GroupRandomSizedCrop object at 0x150262e4cf10>
    <datasets.transforms.GroupRandomHorizontalFlip object at 0x150262e4c400>
)
    <datasets.transforms.GroupRandomGrayscale object at 0x150262e4c220>
), Compose(
    <datasets.transforms.Stack object at 0x150262e4c130>
    <datasets.transforms.ToTorchFormatTensor object at 0x150262e4c3d0>
    <datasets.transforms.GroupNormalize object at 0x150262e4c670>
)]
[02/24 01:19:25][INFO] train_vision.py:  285: val transforms: [Compose(
    <datasets.transforms.GroupScale object at 0x150262e4c5e0>
    <datasets.transforms.GroupCenterCrop object at 0x150262e4ca00>
), Compose(
    <datasets.transforms.Stack object at 0x150262e4c7f0>
    <datasets.transforms.ToTorchFormatTensor object at 0x150262e4c820>
    <datasets.transforms.GroupNormalize object at 0x150262e4cfd0>
)]
[02/24 01:19:25][INFO] train_vision.py:  355: => Using label smoothing: 0.1
[02/24 01:19:25][INFO] train_vision.py:  366: => loading checkpoint 'exp/s4v_selfy_vitl14_16x224_k400_run3/model_best.pt'
[02/24 01:19:25][INFO] train_vision.py:  373: => pop last fc layer
[02/24 01:19:38][INFO] train_vision.py:  544: Epoch: [0][0/209], lr: 2.00e-07, eta: 20:46:59	Time 11.931 (11.931)	Data 2.397 (2.397)	Mem 36.80GB	Prec@1 0.000 (0.000)	Loss 3.9426 (3.9426)
[02/24 01:19:38][INFO] distributed.py:  995: Reducer buckets have been rebuilt in this iteration.
[02/24 01:20:18][INFO] train_vision.py:  544: Epoch: [0][10/209], lr: 2.35e-06, eta: 8:14:17	Time 4.022 (4.737)	Data 0.057 (0.268)	Mem 37.74GB	Prec@1 0.000 (2.020)	Loss 3.9619 (3.9546)
[02/24 01:20:59][INFO] train_vision.py:  544: Epoch: [0][20/209], lr: 4.74e-06, eta: 7:38:25	Time 4.035 (4.400)	Data 0.046 (0.167)	Mem 37.74GB	Prec@1 11.111 (2.646)	Loss 3.8421 (3.9302)
[02/24 01:21:39][INFO] train_vision.py:  544: Epoch: [0][30/209], lr: 7.13e-06, eta: 7:25:33	Time 4.036 (4.284)	Data 0.063 (0.132)	Mem 37.74GB	Prec@1 0.000 (1.792)	Loss 3.8920 (3.9329)
[02/24 01:22:19][INFO] train_vision.py:  544: Epoch: [0][40/209], lr: 9.52e-06, eta: 7:18:37	Time 4.035 (4.224)	Data 0.047 (0.113)	Mem 37.74GB	Prec@1 11.111 (2.168)	Loss 3.8645 (3.9192)
[02/24 01:23:00][INFO] train_vision.py:  544: Epoch: [0][50/209], lr: 1.19e-05, eta: 7:14:09	Time 4.041 (4.187)	Data 0.060 (0.102)	Mem 37.74GB	Prec@1 0.000 (1.961)	Loss 3.9030 (3.9090)
[02/24 01:23:40][INFO] train_vision.py:  544: Epoch: [0][60/209], lr: 1.43e-05, eta: 7:10:55	Time 4.034 (4.163)	Data 0.047 (0.094)	Mem 37.74GB	Prec@1 0.000 (1.821)	Loss 3.8322 (3.8977)
[02/24 01:24:21][INFO] train_vision.py:  544: Epoch: [0][70/209], lr: 1.67e-05, eta: 7:08:27	Time 4.041 (4.146)	Data 0.061 (0.089)	Mem 37.74GB	Prec@1 0.000 (3.130)	Loss 3.7517 (3.8789)
[02/24 01:25:01][INFO] train_vision.py:  544: Epoch: [0][80/209], lr: 1.91e-05, eta: 7:06:24	Time 4.033 (4.133)	Data 0.049 (0.084)	Mem 37.74GB	Prec@1 11.111 (3.841)	Loss 3.5625 (3.8576)
[02/24 01:25:41][INFO] train_vision.py:  544: Epoch: [0][90/209], lr: 2.15e-05, eta: 7:04:40	Time 4.048 (4.122)	Data 0.061 (0.081)	Mem 37.74GB	Prec@1 0.000 (4.151)	Loss 3.7818 (3.8367)
[02/24 01:26:22][INFO] train_vision.py:  544: Epoch: [0][100/209], lr: 2.39e-05, eta: 7:03:08	Time 4.040 (4.114)	Data 0.047 (0.078)	Mem 37.74GB	Prec@1 11.111 (5.831)	Loss 3.5912 (3.7996)
[02/24 01:27:02][INFO] train_vision.py:  544: Epoch: [0][110/209], lr: 2.63e-05, eta: 7:01:47	Time 4.043 (4.108)	Data 0.059 (0.076)	Mem 37.74GB	Prec@1 0.000 (7.107)	Loss 3.4302 (3.7648)
[02/24 01:27:43][INFO] train_vision.py:  544: Epoch: [0][120/209], lr: 2.86e-05, eta: 7:00:31	Time 4.032 (4.102)	Data 0.051 (0.074)	Mem 37.74GB	Prec@1 22.222 (8.448)	Loss 3.5388 (3.7262)
[02/24 01:28:23][INFO] train_vision.py:  544: Epoch: [0][130/209], lr: 3.10e-05, eta: 6:59:21	Time 4.047 (4.097)	Data 0.058 (0.073)	Mem 37.74GB	Prec@1 33.333 (10.093)	Loss 3.0579 (3.6842)
[02/24 01:29:03][INFO] train_vision.py:  544: Epoch: [0][140/209], lr: 3.34e-05, eta: 6:58:14	Time 4.034 (4.093)	Data 0.049 (0.071)	Mem 37.74GB	Prec@1 22.222 (10.875)	Loss 2.9070 (3.6448)
[02/24 01:29:44][INFO] train_vision.py:  544: Epoch: [0][150/209], lr: 3.58e-05, eta: 6:57:12	Time 4.043 (4.090)	Data 0.060 (0.070)	Mem 37.74GB	Prec@1 44.444 (11.332)	Loss 2.7034 (3.6134)
[02/24 01:30:24][INFO] train_vision.py:  544: Epoch: [0][160/209], lr: 3.82e-05, eta: 6:56:13	Time 4.037 (4.087)	Data 0.049 (0.069)	Mem 37.74GB	Prec@1 33.333 (12.629)	Loss 2.6935 (3.5689)
[02/24 01:31:05][INFO] train_vision.py:  544: Epoch: [0][170/209], lr: 4.06e-05, eta: 6:55:15	Time 4.045 (4.084)	Data 0.061 (0.068)	Mem 37.74GB	Prec@1 11.111 (13.060)	Loss 3.3314 (3.5427)
[02/24 01:31:45][INFO] train_vision.py:  544: Epoch: [0][180/209], lr: 4.30e-05, eta: 6:54:19	Time 4.033 (4.081)	Data 0.049 (0.067)	Mem 37.74GB	Prec@1 44.444 (14.487)	Loss 2.3526 (3.5010)
[02/24 01:32:25][INFO] train_vision.py:  544: Epoch: [0][190/209], lr: 4.54e-05, eta: 6:53:26	Time 4.046 (4.079)	Data 0.062 (0.067)	Mem 37.74GB	Prec@1 22.222 (15.416)	Loss 2.6687 (3.4554)
[02/24 01:33:06][INFO] train_vision.py:  544: Epoch: [0][200/209], lr: 4.78e-05, eta: 6:52:30	Time 4.033 (4.077)	Data 0.048 (0.066)	Mem 37.74GB	Prec@1 33.333 (16.473)	Loss 2.9411 (3.4172)
[02/24 01:33:46][INFO] train_vision.py:  544: Epoch: [1][0/209], lr: 5.01e-05, eta: 10:24:30	Time 6.181 (6.181)	Data 2.204 (2.204)	Mem 37.74GB	Prec@1 33.333 (33.333)	Loss 2.7253 (2.7253)
[02/24 01:34:27][INFO] train_vision.py:  544: Epoch: [1][10/209], lr: 5.23e-05, eta: 7:08:59	Time 4.058 (4.253)	Data 0.054 (0.258)	Mem 37.74GB	Prec@1 0.000 (39.394)	Loss 3.3049 (2.6351)
[02/24 01:35:07][INFO] train_vision.py:  544: Epoch: [1][20/209], lr: 5.47e-05, eta: 6:58:39	Time 4.046 (4.158)	Data 0.066 (0.165)	Mem 37.74GB	Prec@1 44.444 (41.799)	Loss 2.2993 (2.5337)
[02/24 01:35:48][INFO] train_vision.py:  544: Epoch: [1][30/209], lr: 5.71e-05, eta: 6:54:33	Time 4.052 (4.124)	Data 0.051 (0.132)	Mem 37.74GB	Prec@1 33.333 (44.444)	Loss 2.6147 (2.4700)
[02/24 01:36:28][INFO] train_vision.py:  544: Epoch: [1][40/209], lr: 5.95e-05, eta: 6:52:08	Time 4.045 (4.106)	Data 0.064 (0.115)	Mem 37.74GB	Prec@1 100.000 (46.612)	Loss 1.6413 (2.4279)
[02/24 01:37:09][INFO] train_vision.py:  544: Epoch: [1][50/209], lr: 6.19e-05, eta: 6:50:26	Time 4.073 (4.096)	Data 0.064 (0.104)	Mem 37.74GB	Prec@1 11.111 (44.880)	Loss 3.0517 (2.4054)
[02/24 01:37:49][INFO] train_vision.py:  544: Epoch: [1][60/209], lr: 6.43e-05, eta: 6:49:02	Time 4.044 (4.089)	Data 0.053 (0.097)	Mem 37.74GB	Prec@1 44.444 (45.902)	Loss 2.4760 (2.3806)
[02/24 01:38:30][INFO] train_vision.py:  544: Epoch: [1][70/209], lr: 6.66e-05, eta: 6:47:51	Time 4.056 (4.084)	Data 0.063 (0.092)	Mem 37.74GB	Prec@1 77.778 (46.479)	Loss 1.9319 (2.3678)
[02/24 01:39:10][INFO] train_vision.py:  544: Epoch: [1][80/209], lr: 6.90e-05, eta: 6:46:43	Time 4.045 (4.080)	Data 0.057 (0.087)	Mem 37.74GB	Prec@1 55.556 (47.462)	Loss 2.1793 (2.3382)
[02/24 01:39:51][INFO] train_vision.py:  544: Epoch: [1][90/209], lr: 7.14e-05, eta: 6:45:42	Time 4.051 (4.076)	Data 0.062 (0.084)	Mem 37.74GB	Prec@1 33.333 (47.985)	Loss 2.4782 (2.3187)
[02/24 01:40:31][INFO] train_vision.py:  544: Epoch: [1][100/209], lr: 7.38e-05, eta: 6:44:44	Time 4.043 (4.073)	Data 0.058 (0.082)	Mem 37.74GB	Prec@1 33.333 (48.295)	Loss 2.6845 (2.2982)
[02/24 01:41:12][INFO] train_vision.py:  544: Epoch: [1][110/209], lr: 7.62e-05, eta: 6:43:51	Time 4.055 (4.071)	Data 0.056 (0.079)	Mem 37.74GB	Prec@1 33.333 (48.549)	Loss 2.1847 (2.2803)
[02/24 01:41:52][INFO] train_vision.py:  544: Epoch: [1][120/209], lr: 7.86e-05, eta: 6:42:59	Time 4.045 (4.069)	Data 0.060 (0.078)	Mem 37.74GB	Prec@1 88.889 (49.679)	Loss 1.7461 (2.2531)
[02/24 01:42:33][INFO] train_vision.py:  544: Epoch: [1][130/209], lr: 8.10e-05, eta: 6:42:07	Time 4.043 (4.067)	Data 0.057 (0.076)	Mem 37.74GB	Prec@1 22.222 (50.127)	Loss 2.5044 (2.2342)
[02/24 01:43:13][INFO] train_vision.py:  544: Epoch: [1][140/209], lr: 8.34e-05, eta: 6:41:19	Time 4.047 (4.066)	Data 0.060 (0.075)	Mem 37.74GB	Prec@1 33.333 (50.670)	Loss 2.4385 (2.2248)
[02/24 01:43:54][INFO] train_vision.py:  544: Epoch: [1][150/209], lr: 8.58e-05, eta: 6:40:31	Time 4.058 (4.065)	Data 0.073 (0.075)	Mem 37.74GB	Prec@1 44.444 (51.141)	Loss 1.9431 (2.2092)
[02/24 01:44:34][INFO] train_vision.py:  544: Epoch: [1][160/209], lr: 8.82e-05, eta: 6:39:44	Time 4.046 (4.064)	Data 0.060 (0.074)	Mem 37.74GB	Prec@1 66.667 (51.553)	Loss 1.5876 (2.1955)
[02/24 01:45:15][INFO] train_vision.py:  544: Epoch: [1][170/209], lr: 9.05e-05, eta: 6:38:58	Time 4.042 (4.063)	Data 0.061 (0.073)	Mem 37.74GB	Prec@1 44.444 (51.917)	Loss 2.5592 (2.1862)
[02/24 01:45:55][INFO] train_vision.py:  544: Epoch: [1][180/209], lr: 9.29e-05, eta: 6:38:11	Time 4.044 (4.062)	Data 0.060 (0.072)	Mem 37.74GB	Prec@1 66.667 (51.934)	Loss 1.4992 (2.1810)
[02/24 01:46:36][INFO] train_vision.py:  544: Epoch: [1][190/209], lr: 9.53e-05, eta: 6:37:25	Time 4.049 (4.061)	Data 0.058 (0.072)	Mem 37.74GB	Prec@1 55.556 (52.123)	Loss 2.0655 (2.1680)
[02/24 01:47:16][INFO] train_vision.py:  544: Epoch: [1][200/209], lr: 9.77e-05, eta: 6:36:39	Time 4.046 (4.060)	Data 0.060 (0.071)	Mem 37.74GB	Prec@1 66.667 (52.792)	Loss 1.6288 (2.1484)
[02/24 01:47:56][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 58.333 (58.333)	Prec@5 94.444 (94.444)	mPrec@1 (33.715)	mPrec@5 (55.556)
[02/24 01:48:43][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 47.222 (56.818)	Prec@5 93.056 (94.444)	mPrec@1 (35.329)	mPrec@5 (75.787)
[02/24 01:49:30][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 54.167 (57.804)	Prec@5 94.444 (94.511)	mPrec@1 (36.218)	mPrec@5 (78.222)
[02/24 01:50:01][INFO] train_vision.py:  609: Overall Prec@1 56.984% Prec@5 94.028% mPrec@1 (37.166) mPrec@5 (79.436)
[02/24 01:50:01][INFO] train_vision.py:  454: Testing: 56.98380172976598/56.98380172976598
[02/24 01:50:01][INFO] train_vision.py:  455: Saving:
[02/24 01:50:13][INFO] train_vision.py:  544: Epoch: [2][0/209], lr: 1.00e-04, eta: 9:42:17	Time 5.969 (5.969)	Data 1.992 (1.992)	Mem 37.74GB	Prec@1 66.667 (66.667)	Loss 1.8707 (1.8707)
[02/24 01:50:53][INFO] train_vision.py:  544: Epoch: [2][10/209], lr: 1.02e-04, eta: 6:50:44	Time 4.029 (4.218)	Data 0.042 (0.232)	Mem 37.74GB	Prec@1 66.667 (57.576)	Loss 1.5091 (1.8249)
[02/24 01:51:34][INFO] train_vision.py:  544: Epoch: [2][20/209], lr: 1.05e-04, eta: 6:42:05	Time 4.041 (4.136)	Data 0.056 (0.150)	Mem 37.74GB	Prec@1 66.667 (60.317)	Loss 1.9709 (1.8040)
[02/24 01:52:14][INFO] train_vision.py:  544: Epoch: [2][30/209], lr: 1.07e-04, eta: 6:38:27	Time 4.029 (4.106)	Data 0.044 (0.120)	Mem 37.74GB	Prec@1 77.778 (61.290)	Loss 1.5916 (1.8336)
[02/24 01:52:55][INFO] train_vision.py:  544: Epoch: [2][40/209], lr: 1.09e-04, eta: 6:36:20	Time 4.052 (4.091)	Data 0.064 (0.105)	Mem 37.74GB	Prec@1 55.556 (61.247)	Loss 1.9407 (1.8386)
[02/24 01:53:35][INFO] train_vision.py:  544: Epoch: [2][50/209], lr: 1.12e-04, eta: 6:34:45	Time 4.045 (4.082)	Data 0.059 (0.095)	Mem 37.74GB	Prec@1 77.778 (63.617)	Loss 1.6777 (1.8057)
[02/24 01:54:16][INFO] train_vision.py:  544: Epoch: [2][60/209], lr: 1.14e-04, eta: 6:33:22	Time 4.040 (4.074)	Data 0.060 (0.088)	Mem 37.74GB	Prec@1 77.778 (63.388)	Loss 1.2932 (1.7960)
[02/24 01:54:56][INFO] train_vision.py:  544: Epoch: [2][70/209], lr: 1.17e-04, eta: 6:32:14	Time 4.046 (4.070)	Data 0.059 (0.084)	Mem 37.74GB	Prec@1 44.444 (64.319)	Loss 1.8230 (1.7881)
[02/24 01:55:36][INFO] train_vision.py:  544: Epoch: [2][80/209], lr: 1.19e-04, eta: 6:31:12	Time 4.044 (4.066)	Data 0.059 (0.080)	Mem 37.74GB	Prec@1 77.778 (64.198)	Loss 1.5550 (1.7778)
[02/24 01:56:17][INFO] train_vision.py:  544: Epoch: [2][90/209], lr: 1.21e-04, eta: 6:30:13	Time 4.039 (4.063)	Data 0.056 (0.077)	Mem 37.74GB	Prec@1 33.333 (64.591)	Loss 2.3517 (1.7702)
[02/24 01:56:57][INFO] train_vision.py:  544: Epoch: [2][100/209], lr: 1.24e-04, eta: 6:29:20	Time 4.046 (4.061)	Data 0.061 (0.075)	Mem 37.74GB	Prec@1 77.778 (64.686)	Loss 1.5587 (1.7698)
[02/24 01:57:38][INFO] train_vision.py:  544: Epoch: [2][110/209], lr: 1.26e-04, eta: 6:28:29	Time 4.043 (4.059)	Data 0.061 (0.073)	Mem 37.74GB	Prec@1 66.667 (64.965)	Loss 1.6516 (1.7573)
[02/24 01:58:18][INFO] train_vision.py:  544: Epoch: [2][120/209], lr: 1.29e-04, eta: 6:27:39	Time 4.043 (4.057)	Data 0.060 (0.071)	Mem 37.74GB	Prec@1 77.778 (65.289)	Loss 1.8494 (1.7513)
[02/24 01:58:58][INFO] train_vision.py:  544: Epoch: [2][130/209], lr: 1.31e-04, eta: 6:26:52	Time 4.049 (4.056)	Data 0.061 (0.070)	Mem 37.74GB	Prec@1 66.667 (65.394)	Loss 1.5491 (1.7500)
[02/24 01:59:39][INFO] train_vision.py:  544: Epoch: [2][140/209], lr: 1.33e-04, eta: 6:26:06	Time 4.042 (4.055)	Data 0.060 (0.069)	Mem 37.74GB	Prec@1 55.556 (65.406)	Loss 1.9262 (1.7453)
[02/24 02:00:19][INFO] train_vision.py:  544: Epoch: [2][150/209], lr: 1.36e-04, eta: 6:25:22	Time 4.051 (4.054)	Data 0.060 (0.068)	Mem 37.74GB	Prec@1 77.778 (65.269)	Loss 1.6230 (1.7499)
[02/24 02:01:00][INFO] train_vision.py:  544: Epoch: [2][160/209], lr: 1.38e-04, eta: 6:24:37	Time 4.053 (4.054)	Data 0.059 (0.067)	Mem 37.74GB	Prec@1 55.556 (65.217)	Loss 2.0609 (1.7552)
[02/24 02:01:40][INFO] train_vision.py:  544: Epoch: [2][170/209], lr: 1.40e-04, eta: 6:23:52	Time 4.041 (4.053)	Data 0.058 (0.067)	Mem 37.74GB	Prec@1 77.778 (65.172)	Loss 1.7376 (1.7506)
[02/24 02:02:20][INFO] train_vision.py:  544: Epoch: [2][180/209], lr: 1.43e-04, eta: 6:23:08	Time 4.039 (4.052)	Data 0.060 (0.066)	Mem 37.74GB	Prec@1 55.556 (64.948)	Loss 2.0284 (1.7570)
[02/24 02:03:01][INFO] train_vision.py:  544: Epoch: [2][190/209], lr: 1.45e-04, eta: 6:22:24	Time 4.040 (4.052)	Data 0.057 (0.066)	Mem 37.74GB	Prec@1 66.667 (65.212)	Loss 1.7916 (1.7496)
[02/24 02:03:41][INFO] train_vision.py:  544: Epoch: [2][200/209], lr: 1.48e-04, eta: 6:21:41	Time 4.041 (4.051)	Data 0.057 (0.065)	Mem 37.74GB	Prec@1 88.889 (65.616)	Loss 1.1633 (1.7423)
[02/24 02:04:19][INFO] train_vision.py:  544: Epoch: [3][0/209], lr: 1.50e-04, eta: 10:05:21	Time 6.435 (6.435)	Data 2.445 (2.445)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.5348 (1.5348)
[02/24 02:05:00][INFO] train_vision.py:  544: Epoch: [3][10/209], lr: 1.52e-04, eta: 6:40:35	Time 4.063 (4.266)	Data 0.077 (0.280)	Mem 37.74GB	Prec@1 77.778 (77.778)	Loss 1.5080 (1.4498)
[02/24 02:05:40][INFO] train_vision.py:  544: Epoch: [3][20/209], lr: 1.55e-04, eta: 6:30:24	Time 4.068 (4.165)	Data 0.067 (0.176)	Mem 37.74GB	Prec@1 55.556 (71.958)	Loss 2.4385 (1.6041)
[02/24 02:06:21][INFO] train_vision.py:  544: Epoch: [3][30/209], lr: 1.57e-04, eta: 6:26:14	Time 4.041 (4.128)	Data 0.048 (0.137)	Mem 37.74GB	Prec@1 77.778 (71.685)	Loss 1.0640 (1.5785)
[02/24 02:07:02][INFO] train_vision.py:  544: Epoch: [3][40/209], lr: 1.59e-04, eta: 6:23:51	Time 4.058 (4.110)	Data 0.063 (0.118)	Mem 37.74GB	Prec@1 66.667 (71.003)	Loss 1.6276 (1.5811)
[02/24 02:07:42][INFO] train_vision.py:  544: Epoch: [3][50/209], lr: 1.62e-04, eta: 6:22:04	Time 4.042 (4.098)	Data 0.048 (0.107)	Mem 37.74GB	Prec@1 77.778 (68.845)	Loss 1.2542 (1.6209)
[02/24 02:08:23][INFO] train_vision.py:  544: Epoch: [3][60/209], lr: 1.64e-04, eta: 6:20:44	Time 4.062 (4.091)	Data 0.063 (0.099)	Mem 37.74GB	Prec@1 44.444 (69.035)	Loss 1.7182 (1.6243)
[02/24 02:09:03][INFO] train_vision.py:  544: Epoch: [3][70/209], lr: 1.67e-04, eta: 6:19:33	Time 4.040 (4.086)	Data 0.048 (0.093)	Mem 37.74GB	Prec@1 44.444 (70.423)	Loss 2.6426 (1.6072)
[02/24 02:09:44][INFO] train_vision.py:  544: Epoch: [3][80/209], lr: 1.69e-04, eta: 6:18:31	Time 4.051 (4.082)	Data 0.066 (0.089)	Mem 37.74GB	Prec@1 66.667 (69.684)	Loss 1.4493 (1.6143)
[02/24 02:10:24][INFO] train_vision.py:  544: Epoch: [3][90/209], lr: 1.71e-04, eta: 6:17:34	Time 4.049 (4.079)	Data 0.050 (0.087)	Mem 37.74GB	Prec@1 100.000 (70.085)	Loss 1.0002 (1.6139)
[02/24 02:11:05][INFO] train_vision.py:  544: Epoch: [3][100/209], lr: 1.74e-04, eta: 6:16:41	Time 4.044 (4.077)	Data 0.062 (0.084)	Mem 37.74GB	Prec@1 77.778 (70.737)	Loss 1.6092 (1.5933)
[02/24 02:11:45][INFO] train_vision.py:  544: Epoch: [3][110/209], lr: 1.76e-04, eta: 6:15:48	Time 4.042 (4.075)	Data 0.048 (0.082)	Mem 37.74GB	Prec@1 66.667 (70.370)	Loss 1.3470 (1.5901)
[02/24 02:12:26][INFO] train_vision.py:  544: Epoch: [3][120/209], lr: 1.78e-04, eta: 6:14:59	Time 4.064 (4.073)	Data 0.064 (0.080)	Mem 37.74GB	Prec@1 77.778 (70.983)	Loss 2.0657 (1.5804)
[02/24 02:13:06][INFO] train_vision.py:  544: Epoch: [3][130/209], lr: 1.81e-04, eta: 6:14:10	Time 4.045 (4.071)	Data 0.048 (0.079)	Mem 37.74GB	Prec@1 77.778 (71.841)	Loss 1.5536 (1.5639)
[02/24 02:13:47][INFO] train_vision.py:  544: Epoch: [3][140/209], lr: 1.83e-04, eta: 6:13:23	Time 4.064 (4.070)	Data 0.067 (0.078)	Mem 37.74GB	Prec@1 77.778 (71.395)	Loss 1.1937 (1.5706)
[02/24 02:14:27][INFO] train_vision.py:  544: Epoch: [3][150/209], lr: 1.86e-04, eta: 6:12:35	Time 4.039 (4.069)	Data 0.049 (0.076)	Mem 37.74GB	Prec@1 88.889 (71.597)	Loss 1.1766 (1.5655)
[02/24 02:15:08][INFO] train_vision.py:  544: Epoch: [3][160/209], lr: 1.88e-04, eta: 6:11:48	Time 4.058 (4.068)	Data 0.064 (0.076)	Mem 37.74GB	Prec@1 88.889 (72.188)	Loss 1.3564 (1.5549)
[02/24 02:15:48][INFO] train_vision.py:  544: Epoch: [3][170/209], lr: 1.90e-04, eta: 6:11:01	Time 4.035 (4.067)	Data 0.051 (0.075)	Mem 37.74GB	Prec@1 66.667 (72.125)	Loss 1.9613 (1.5602)
[02/24 02:16:29][INFO] train_vision.py:  544: Epoch: [3][180/209], lr: 1.93e-04, eta: 6:10:16	Time 4.069 (4.066)	Data 0.083 (0.075)	Mem 37.74GB	Prec@1 88.889 (72.437)	Loss 1.1357 (1.5532)
[02/24 02:17:09][INFO] train_vision.py:  544: Epoch: [3][190/209], lr: 1.95e-04, eta: 6:09:31	Time 4.042 (4.065)	Data 0.049 (0.074)	Mem 37.74GB	Prec@1 77.778 (72.542)	Loss 1.5347 (1.5491)
[02/24 02:17:50][INFO] train_vision.py:  544: Epoch: [3][200/209], lr: 1.98e-04, eta: 6:08:46	Time 4.057 (4.064)	Data 0.063 (0.074)	Mem 37.74GB	Prec@1 77.778 (72.803)	Loss 1.5088 (1.5445)
[02/24 02:18:29][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 81.944 (81.944)	Prec@5 98.611 (98.611)	mPrec@1 (51.493)	mPrec@5 (61.806)
[02/24 02:19:17][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 66.667 (75.000)	Prec@5 93.056 (97.096)	mPrec@1 (57.444)	mPrec@5 (88.714)
[02/24 02:20:04][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 81.944 (76.455)	Prec@5 98.611 (97.553)	mPrec@1 (59.367)	mPrec@5 (89.531)
[02/24 02:20:34][INFO] train_vision.py:  609: Overall Prec@1 77.126% Prec@5 97.621% mPrec@1 (62.439) mPrec@5 (92.293)
[02/24 02:20:34][INFO] train_vision.py:  454: Testing: 77.12550264428019/77.12550264428019
[02/24 02:20:34][INFO] train_vision.py:  455: Saving:
[02/24 02:20:48][INFO] train_vision.py:  544: Epoch: [4][0/209], lr: 2.00e-04, eta: 9:08:55	Time 6.060 (6.060)	Data 2.087 (2.087)	Mem 37.74GB	Prec@1 66.667 (66.667)	Loss 1.6011 (1.6011)
[02/24 02:21:28][INFO] train_vision.py:  544: Epoch: [4][10/209], lr: 2.00e-04, eta: 6:23:35	Time 4.053 (4.243)	Data 0.050 (0.251)	Mem 37.74GB	Prec@1 66.667 (82.828)	Loss 1.7356 (1.2952)
[02/24 02:22:09][INFO] train_vision.py:  544: Epoch: [4][20/209], lr: 2.00e-04, eta: 6:15:00	Time 4.062 (4.155)	Data 0.063 (0.160)	Mem 37.74GB	Prec@1 77.778 (79.365)	Loss 1.4871 (1.4516)
[02/24 02:22:49][INFO] train_vision.py:  544: Epoch: [4][30/209], lr: 2.00e-04, eta: 6:11:23	Time 4.061 (4.123)	Data 0.065 (0.129)	Mem 37.74GB	Prec@1 77.778 (76.703)	Loss 1.3097 (1.4760)
[02/24 02:23:30][INFO] train_vision.py:  544: Epoch: [4][40/209], lr: 2.00e-04, eta: 6:09:15	Time 4.045 (4.107)	Data 0.057 (0.114)	Mem 37.74GB	Prec@1 66.667 (76.423)	Loss 1.4368 (1.4456)
[02/24 02:24:11][INFO] train_vision.py:  544: Epoch: [4][50/209], lr: 2.00e-04, eta: 6:07:38	Time 4.054 (4.096)	Data 0.070 (0.104)	Mem 37.74GB	Prec@1 77.778 (76.253)	Loss 1.7696 (1.4498)
[02/24 02:24:51][INFO] train_vision.py:  544: Epoch: [4][60/209], lr: 2.00e-04, eta: 6:06:22	Time 4.065 (4.090)	Data 0.079 (0.098)	Mem 37.74GB	Prec@1 55.556 (75.774)	Loss 1.6612 (1.4512)
[02/24 02:25:32][INFO] train_vision.py:  544: Epoch: [4][70/209], lr: 2.00e-04, eta: 6:05:16	Time 4.055 (4.085)	Data 0.058 (0.093)	Mem 37.74GB	Prec@1 77.778 (76.213)	Loss 1.2828 (1.4408)
[02/24 02:26:12][INFO] train_vision.py:  544: Epoch: [4][80/209], lr: 2.00e-04, eta: 6:04:19	Time 4.063 (4.082)	Data 0.062 (0.089)	Mem 37.74GB	Prec@1 100.000 (76.680)	Loss 1.2486 (1.4349)
[02/24 02:26:53][INFO] train_vision.py:  544: Epoch: [4][90/209], lr: 2.00e-04, eta: 6:03:25	Time 4.061 (4.080)	Data 0.061 (0.086)	Mem 37.74GB	Prec@1 77.778 (76.313)	Loss 1.5057 (1.4442)
[02/24 02:27:34][INFO] train_vision.py:  544: Epoch: [4][100/209], lr: 2.00e-04, eta: 6:02:35	Time 4.067 (4.078)	Data 0.068 (0.084)	Mem 37.74GB	Prec@1 77.778 (76.128)	Loss 1.4221 (1.4561)
[02/24 02:28:14][INFO] train_vision.py:  544: Epoch: [4][110/209], lr: 2.00e-04, eta: 6:01:46	Time 4.059 (4.076)	Data 0.059 (0.082)	Mem 37.74GB	Prec@1 66.667 (76.076)	Loss 1.6323 (1.4578)
[02/24 02:28:55][INFO] train_vision.py:  544: Epoch: [4][120/209], lr: 2.00e-04, eta: 6:00:58	Time 4.068 (4.075)	Data 0.065 (0.081)	Mem 37.74GB	Prec@1 66.667 (76.676)	Loss 1.6166 (1.4412)
[02/24 02:29:35][INFO] train_vision.py:  544: Epoch: [4][130/209], lr: 2.00e-04, eta: 6:00:11	Time 4.058 (4.074)	Data 0.061 (0.079)	Mem 37.74GB	Prec@1 77.778 (76.675)	Loss 1.3883 (1.4422)
[02/24 02:30:16][INFO] train_vision.py:  544: Epoch: [4][140/209], lr: 2.00e-04, eta: 5:59:25	Time 4.057 (4.073)	Data 0.062 (0.078)	Mem 37.74GB	Prec@1 66.667 (77.069)	Loss 1.4407 (1.4346)
[02/24 02:30:57][INFO] train_vision.py:  544: Epoch: [4][150/209], lr: 2.00e-04, eta: 5:58:39	Time 4.059 (4.072)	Data 0.055 (0.077)	Mem 37.74GB	Prec@1 66.667 (77.189)	Loss 1.6047 (1.4337)
[02/24 02:31:37][INFO] train_vision.py:  544: Epoch: [4][160/209], lr: 2.00e-04, eta: 5:57:55	Time 4.062 (4.071)	Data 0.067 (0.076)	Mem 37.74GB	Prec@1 88.889 (77.226)	Loss 1.3230 (1.4355)
[02/24 02:32:18][INFO] train_vision.py:  544: Epoch: [4][170/209], lr: 2.00e-04, eta: 5:57:10	Time 4.055 (4.070)	Data 0.069 (0.075)	Mem 37.74GB	Prec@1 77.778 (77.258)	Loss 1.3482 (1.4346)
[02/24 02:32:58][INFO] train_vision.py:  544: Epoch: [4][180/209], lr: 1.99e-04, eta: 5:56:25	Time 4.062 (4.070)	Data 0.052 (0.074)	Mem 37.74GB	Prec@1 77.778 (77.532)	Loss 1.1053 (1.4241)
[02/24 02:33:39][INFO] train_vision.py:  544: Epoch: [4][190/209], lr: 1.99e-04, eta: 5:55:40	Time 4.056 (4.069)	Data 0.061 (0.073)	Mem 37.74GB	Prec@1 77.778 (77.254)	Loss 1.7395 (1.4321)
[02/24 02:34:19][INFO] train_vision.py:  544: Epoch: [4][200/209], lr: 1.99e-04, eta: 5:54:56	Time 4.059 (4.068)	Data 0.058 (0.073)	Mem 37.74GB	Prec@1 77.778 (77.225)	Loss 1.2345 (1.4348)
[02/24 02:34:58][INFO] train_vision.py:  544: Epoch: [5][0/209], lr: 1.99e-04, eta: 9:19:23	Time 6.422 (6.422)	Data 2.364 (2.364)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.5249 (1.5249)
[02/24 02:35:38][INFO] train_vision.py:  544: Epoch: [5][10/209], lr: 1.99e-04, eta: 6:11:54	Time 4.066 (4.278)	Data 0.076 (0.284)	Mem 37.74GB	Prec@1 66.667 (76.768)	Loss 1.6775 (1.5084)
[02/24 02:36:19][INFO] train_vision.py:  544: Epoch: [5][20/209], lr: 1.99e-04, eta: 6:01:55	Time 4.073 (4.171)	Data 0.091 (0.181)	Mem 37.74GB	Prec@1 66.667 (75.132)	Loss 1.5938 (1.5394)
[02/24 02:36:59][INFO] train_vision.py:  544: Epoch: [5][30/209], lr: 1.99e-04, eta: 5:57:35	Time 4.044 (4.129)	Data 0.043 (0.139)	Mem 37.74GB	Prec@1 77.778 (78.136)	Loss 1.3697 (1.4351)
[02/24 02:37:40][INFO] train_vision.py:  544: Epoch: [5][40/209], lr: 1.99e-04, eta: 5:55:07	Time 4.044 (4.109)	Data 0.049 (0.119)	Mem 37.74GB	Prec@1 77.778 (77.778)	Loss 1.5419 (1.4433)
[02/24 02:38:20][INFO] train_vision.py:  544: Epoch: [5][50/209], lr: 1.99e-04, eta: 5:53:21	Time 4.054 (4.096)	Data 0.057 (0.106)	Mem 37.74GB	Prec@1 77.778 (76.906)	Loss 1.1478 (1.4476)
[02/24 02:39:01][INFO] train_vision.py:  544: Epoch: [5][60/209], lr: 1.99e-04, eta: 5:51:54	Time 4.036 (4.087)	Data 0.049 (0.097)	Mem 37.74GB	Prec@1 66.667 (77.231)	Loss 1.4726 (1.4357)
[02/24 02:39:41][INFO] train_vision.py:  544: Epoch: [5][70/209], lr: 1.99e-04, eta: 5:50:43	Time 4.052 (4.081)	Data 0.057 (0.091)	Mem 37.74GB	Prec@1 88.889 (78.091)	Loss 1.3215 (1.4111)
[02/24 02:40:22][INFO] train_vision.py:  544: Epoch: [5][80/209], lr: 1.99e-04, eta: 5:49:40	Time 4.038 (4.077)	Data 0.049 (0.086)	Mem 37.74GB	Prec@1 77.778 (78.601)	Loss 1.0427 (1.3980)
[02/24 02:41:02][INFO] train_vision.py:  544: Epoch: [5][90/209], lr: 1.99e-04, eta: 5:48:42	Time 4.060 (4.074)	Data 0.059 (0.083)	Mem 37.74GB	Prec@1 66.667 (78.999)	Loss 1.5392 (1.3793)
[02/24 02:41:42][INFO] train_vision.py:  544: Epoch: [5][100/209], lr: 1.98e-04, eta: 5:47:47	Time 4.038 (4.071)	Data 0.048 (0.080)	Mem 37.74GB	Prec@1 77.778 (78.108)	Loss 1.3081 (1.3976)
[02/24 02:42:23][INFO] train_vision.py:  544: Epoch: [5][110/209], lr: 1.98e-04, eta: 5:46:54	Time 4.056 (4.069)	Data 0.057 (0.078)	Mem 37.74GB	Prec@1 100.000 (78.579)	Loss 0.8236 (1.3839)
[02/24 02:43:03][INFO] train_vision.py:  544: Epoch: [5][120/209], lr: 1.98e-04, eta: 5:46:04	Time 4.032 (4.067)	Data 0.048 (0.076)	Mem 37.74GB	Prec@1 66.667 (79.063)	Loss 1.5879 (1.3707)
[02/24 02:43:44][INFO] train_vision.py:  544: Epoch: [5][130/209], lr: 1.98e-04, eta: 5:45:15	Time 4.055 (4.065)	Data 0.057 (0.074)	Mem 37.74GB	Prec@1 77.778 (79.644)	Loss 1.3711 (1.3604)
[02/24 02:44:24][INFO] train_vision.py:  544: Epoch: [5][140/209], lr: 1.98e-04, eta: 5:44:27	Time 4.035 (4.064)	Data 0.047 (0.073)	Mem 37.74GB	Prec@1 88.889 (79.433)	Loss 1.1337 (1.3678)
[02/24 02:45:05][INFO] train_vision.py:  544: Epoch: [5][150/209], lr: 1.98e-04, eta: 5:43:41	Time 4.058 (4.063)	Data 0.059 (0.071)	Mem 37.74GB	Prec@1 100.000 (79.617)	Loss 0.9826 (1.3680)
[02/24 02:45:45][INFO] train_vision.py:  544: Epoch: [5][160/209], lr: 1.98e-04, eta: 5:42:55	Time 4.035 (4.061)	Data 0.050 (0.070)	Mem 37.74GB	Prec@1 88.889 (79.986)	Loss 1.0215 (1.3590)
[02/24 02:46:26][INFO] train_vision.py:  544: Epoch: [5][170/209], lr: 1.98e-04, eta: 5:42:10	Time 4.057 (4.061)	Data 0.057 (0.069)	Mem 37.74GB	Prec@1 55.556 (79.402)	Loss 2.0914 (1.3739)
[02/24 02:47:06][INFO] train_vision.py:  544: Epoch: [5][180/209], lr: 1.98e-04, eta: 5:41:26	Time 4.042 (4.060)	Data 0.049 (0.069)	Mem 37.74GB	Prec@1 77.778 (79.312)	Loss 1.2079 (1.3744)
[02/24 02:47:47][INFO] train_vision.py:  544: Epoch: [5][190/209], lr: 1.97e-04, eta: 5:40:42	Time 4.058 (4.059)	Data 0.061 (0.068)	Mem 37.74GB	Prec@1 77.778 (79.116)	Loss 1.2411 (1.3829)
[02/24 02:48:27][INFO] train_vision.py:  544: Epoch: [5][200/209], lr: 1.97e-04, eta: 5:39:58	Time 4.037 (4.059)	Data 0.049 (0.067)	Mem 37.74GB	Prec@1 100.000 (79.270)	Loss 0.8104 (1.3790)
[02/24 02:49:06][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 77.778 (77.778)	Prec@5 95.833 (95.833)	mPrec@1 (45.590)	mPrec@5 (61.389)
[02/24 02:49:53][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 79.167 (79.545)	Prec@5 97.222 (96.843)	mPrec@1 (67.095)	mPrec@5 (88.899)
[02/24 02:50:41][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 76.389 (79.034)	Prec@5 98.611 (97.288)	mPrec@1 (68.062)	mPrec@5 (90.655)
[02/24 02:51:12][INFO] train_vision.py:  609: Overall Prec@1 78.846% Prec@5 97.267% mPrec@1 (72.278) mPrec@5 (95.143)
[02/24 02:51:12][INFO] train_vision.py:  454: Testing: 78.84614961446538/78.84614961446538
[02/24 02:51:12][INFO] train_vision.py:  455: Saving:
[02/24 02:51:25][INFO] train_vision.py:  544: Epoch: [6][0/209], lr: 1.97e-04, eta: 8:18:02	Time 5.956 (5.956)	Data 1.991 (1.991)	Mem 37.74GB	Prec@1 66.667 (66.667)	Loss 1.5782 (1.5782)
[02/24 02:52:05][INFO] train_vision.py:  544: Epoch: [6][10/209], lr: 1.97e-04, eta: 5:52:40	Time 4.064 (4.226)	Data 0.073 (0.248)	Mem 37.74GB	Prec@1 88.889 (80.808)	Loss 1.0510 (1.3637)
[02/24 02:52:46][INFO] train_vision.py:  544: Epoch: [6][20/209], lr: 1.97e-04, eta: 5:45:07	Time 4.059 (4.144)	Data 0.080 (0.164)	Mem 37.74GB	Prec@1 66.667 (83.069)	Loss 1.9531 (1.3262)
[02/24 02:53:26][INFO] train_vision.py:  544: Epoch: [6][30/209], lr: 1.97e-04, eta: 5:41:48	Time 4.035 (4.112)	Data 0.049 (0.128)	Mem 37.74GB	Prec@1 77.778 (83.154)	Loss 1.2289 (1.2735)
[02/24 02:54:07][INFO] train_vision.py:  544: Epoch: [6][40/209], lr: 1.97e-04, eta: 5:39:43	Time 4.049 (4.096)	Data 0.061 (0.110)	Mem 37.74GB	Prec@1 88.889 (81.572)	Loss 1.1782 (1.3088)
[02/24 02:54:47][INFO] train_vision.py:  544: Epoch: [6][50/209], lr: 1.96e-04, eta: 5:38:08	Time 4.033 (4.085)	Data 0.050 (0.098)	Mem 37.74GB	Prec@1 88.889 (83.007)	Loss 1.2207 (1.2723)
[02/24 02:55:28][INFO] train_vision.py:  544: Epoch: [6][60/209], lr: 1.96e-04, eta: 5:36:54	Time 4.049 (4.078)	Data 0.061 (0.091)	Mem 37.74GB	Prec@1 88.889 (82.149)	Loss 0.9091 (1.2852)
[02/24 02:56:08][INFO] train_vision.py:  544: Epoch: [6][70/209], lr: 1.96e-04, eta: 5:35:46	Time 4.033 (4.072)	Data 0.049 (0.085)	Mem 37.74GB	Prec@1 77.778 (81.690)	Loss 1.6648 (1.3038)
[02/24 02:56:49][INFO] train_vision.py:  544: Epoch: [6][80/209], lr: 1.96e-04, eta: 5:34:46	Time 4.045 (4.069)	Data 0.060 (0.081)	Mem 37.74GB	Prec@1 100.000 (81.893)	Loss 1.0956 (1.2954)
[02/24 02:57:29][INFO] train_vision.py:  544: Epoch: [6][90/209], lr: 1.96e-04, eta: 5:33:50	Time 4.037 (4.065)	Data 0.049 (0.078)	Mem 37.74GB	Prec@1 100.000 (82.418)	Loss 1.0513 (1.2829)
[02/24 02:58:09][INFO] train_vision.py:  544: Epoch: [6][100/209], lr: 1.96e-04, eta: 5:32:58	Time 4.042 (4.063)	Data 0.062 (0.075)	Mem 37.74GB	Prec@1 88.889 (81.738)	Loss 1.0908 (1.2989)
[02/24 02:58:50][INFO] train_vision.py:  544: Epoch: [6][110/209], lr: 1.95e-04, eta: 5:32:07	Time 4.036 (4.061)	Data 0.048 (0.073)	Mem 37.74GB	Prec@1 77.778 (81.782)	Loss 1.5614 (1.3010)
[02/24 02:59:30][INFO] train_vision.py:  544: Epoch: [6][120/209], lr: 1.95e-04, eta: 5:31:18	Time 4.047 (4.059)	Data 0.058 (0.071)	Mem 37.74GB	Prec@1 88.889 (82.002)	Loss 1.0435 (1.2978)
[02/24 03:00:11][INFO] train_vision.py:  544: Epoch: [6][130/209], lr: 1.95e-04, eta: 5:30:29	Time 4.035 (4.058)	Data 0.049 (0.070)	Mem 37.74GB	Prec@1 66.667 (81.849)	Loss 1.6578 (1.2960)
[02/24 03:00:51][INFO] train_vision.py:  544: Epoch: [6][140/209], lr: 1.95e-04, eta: 5:29:43	Time 4.048 (4.056)	Data 0.059 (0.069)	Mem 37.74GB	Prec@1 77.778 (81.954)	Loss 1.6450 (1.2895)
[02/24 03:01:31][INFO] train_vision.py:  544: Epoch: [6][150/209], lr: 1.95e-04, eta: 5:28:56	Time 4.038 (4.055)	Data 0.047 (0.067)	Mem 37.74GB	Prec@1 77.778 (81.604)	Loss 1.9702 (1.3048)
[02/24 03:02:12][INFO] train_vision.py:  544: Epoch: [6][160/209], lr: 1.95e-04, eta: 5:28:11	Time 4.045 (4.054)	Data 0.058 (0.066)	Mem 37.74GB	Prec@1 88.889 (81.366)	Loss 0.9610 (1.3004)
[02/24 03:02:52][INFO] train_vision.py:  544: Epoch: [6][170/209], lr: 1.94e-04, eta: 5:27:28	Time 4.058 (4.054)	Data 0.068 (0.066)	Mem 37.74GB	Prec@1 88.889 (81.546)	Loss 1.0042 (1.2992)
[02/24 03:03:33][INFO] train_vision.py:  544: Epoch: [6][180/209], lr: 1.94e-04, eta: 5:26:45	Time 4.042 (4.053)	Data 0.056 (0.065)	Mem 37.74GB	Prec@1 88.889 (81.338)	Loss 1.0682 (1.3000)
[02/24 03:04:13][INFO] train_vision.py:  544: Epoch: [6][190/209], lr: 1.94e-04, eta: 5:26:02	Time 4.057 (4.053)	Data 0.072 (0.065)	Mem 37.74GB	Prec@1 77.778 (81.501)	Loss 1.0973 (1.2983)
[02/24 03:04:53][INFO] train_vision.py:  544: Epoch: [6][200/209], lr: 1.94e-04, eta: 5:25:20	Time 4.044 (4.052)	Data 0.057 (0.064)	Mem 37.74GB	Prec@1 77.778 (81.426)	Loss 1.4529 (1.2969)
[02/24 03:05:32][INFO] train_vision.py:  544: Epoch: [7][0/209], lr: 1.94e-04, eta: 8:43:27	Time 6.532 (6.532)	Data 2.264 (2.264)	Mem 37.74GB	Prec@1 77.778 (77.778)	Loss 1.4289 (1.4289)
[02/24 03:06:12][INFO] train_vision.py:  544: Epoch: [7][10/209], lr: 1.93e-04, eta: 5:41:48	Time 4.036 (4.274)	Data 0.043 (0.259)	Mem 37.74GB	Prec@1 88.889 (83.838)	Loss 1.0718 (1.2467)
[02/24 03:06:53][INFO] train_vision.py:  544: Epoch: [7][20/209], lr: 1.93e-04, eta: 5:32:40	Time 4.056 (4.169)	Data 0.059 (0.163)	Mem 37.74GB	Prec@1 88.889 (81.481)	Loss 1.0376 (1.2813)
[02/24 03:07:33][INFO] train_vision.py:  544: Epoch: [7][30/209], lr: 1.93e-04, eta: 5:28:50	Time 4.050 (4.130)	Data 0.059 (0.128)	Mem 37.74GB	Prec@1 88.889 (81.004)	Loss 1.6017 (1.3078)
[02/24 03:08:14][INFO] train_vision.py:  544: Epoch: [7][40/209], lr: 1.93e-04, eta: 5:26:34	Time 4.059 (4.110)	Data 0.058 (0.110)	Mem 37.74GB	Prec@1 88.889 (82.656)	Loss 1.1944 (1.2611)
[02/24 03:08:54][INFO] train_vision.py:  544: Epoch: [7][50/209], lr: 1.93e-04, eta: 5:24:54	Time 4.038 (4.097)	Data 0.056 (0.100)	Mem 37.74GB	Prec@1 66.667 (81.699)	Loss 1.4903 (1.2798)
[02/24 03:09:35][INFO] train_vision.py:  544: Epoch: [7][60/209], lr: 1.92e-04, eta: 5:23:34	Time 4.058 (4.089)	Data 0.063 (0.093)	Mem 37.74GB	Prec@1 88.889 (81.239)	Loss 1.0601 (1.2915)
[02/24 03:10:15][INFO] train_vision.py:  544: Epoch: [7][70/209], lr: 1.92e-04, eta: 5:22:24	Time 4.043 (4.083)	Data 0.045 (0.088)	Mem 37.74GB	Prec@1 55.556 (81.690)	Loss 2.0616 (1.2831)
[02/24 03:10:56][INFO] train_vision.py:  544: Epoch: [7][80/209], lr: 1.92e-04, eta: 5:21:23	Time 4.057 (4.078)	Data 0.061 (0.084)	Mem 37.74GB	Prec@1 77.778 (81.893)	Loss 1.2565 (1.2808)
[02/24 03:11:36][INFO] train_vision.py:  544: Epoch: [7][90/209], lr: 1.92e-04, eta: 5:20:26	Time 4.041 (4.075)	Data 0.047 (0.081)	Mem 37.74GB	Prec@1 66.667 (81.441)	Loss 1.1576 (1.2827)
[02/24 03:12:17][INFO] train_vision.py:  544: Epoch: [7][100/209], lr: 1.91e-04, eta: 5:19:33	Time 4.060 (4.073)	Data 0.060 (0.079)	Mem 37.74GB	Prec@1 88.889 (81.408)	Loss 1.2058 (1.2943)
[02/24 03:12:57][INFO] train_vision.py:  544: Epoch: [7][110/209], lr: 1.91e-04, eta: 5:18:43	Time 4.053 (4.071)	Data 0.057 (0.077)	Mem 37.74GB	Prec@1 77.778 (81.181)	Loss 1.1438 (1.2897)
[02/24 03:13:38][INFO] train_vision.py:  544: Epoch: [7][120/209], lr: 1.91e-04, eta: 5:17:53	Time 4.058 (4.069)	Data 0.062 (0.075)	Mem 37.74GB	Prec@1 100.000 (81.084)	Loss 0.9552 (1.2959)
[02/24 03:14:18][INFO] train_vision.py:  544: Epoch: [7][130/209], lr: 1.91e-04, eta: 5:17:04	Time 4.042 (4.067)	Data 0.047 (0.074)	Mem 37.74GB	Prec@1 77.778 (81.510)	Loss 1.2380 (1.2845)
[02/24 03:14:58][INFO] train_vision.py:  544: Epoch: [7][140/209], lr: 1.90e-04, eta: 5:16:17	Time 4.045 (4.065)	Data 0.059 (0.072)	Mem 37.74GB	Prec@1 88.889 (81.639)	Loss 1.0043 (1.2732)
[02/24 03:15:39][INFO] train_vision.py:  544: Epoch: [7][150/209], lr: 1.90e-04, eta: 5:15:31	Time 4.052 (4.064)	Data 0.059 (0.072)	Mem 37.74GB	Prec@1 77.778 (81.678)	Loss 1.6097 (1.2728)
[02/24 03:16:19][INFO] train_vision.py:  544: Epoch: [7][160/209], lr: 1.90e-04, eta: 5:14:47	Time 4.056 (4.063)	Data 0.058 (0.071)	Mem 37.74GB	Prec@1 88.889 (81.988)	Loss 1.0270 (1.2610)
[02/24 03:17:00][INFO] train_vision.py:  544: Epoch: [7][170/209], lr: 1.90e-04, eta: 5:14:01	Time 4.045 (4.063)	Data 0.060 (0.070)	Mem 37.74GB	Prec@1 55.556 (82.196)	Loss 1.9408 (1.2571)
[02/24 03:17:40][INFO] train_vision.py:  544: Epoch: [7][180/209], lr: 1.89e-04, eta: 5:13:17	Time 4.057 (4.062)	Data 0.061 (0.069)	Mem 37.74GB	Prec@1 66.667 (81.952)	Loss 1.7207 (1.2610)
[02/24 03:18:21][INFO] train_vision.py:  544: Epoch: [7][190/209], lr: 1.89e-04, eta: 5:12:33	Time 4.049 (4.061)	Data 0.060 (0.069)	Mem 37.74GB	Prec@1 88.889 (82.315)	Loss 1.0786 (1.2554)
[02/24 03:19:01][INFO] train_vision.py:  544: Epoch: [7][200/209], lr: 1.89e-04, eta: 5:11:50	Time 4.058 (4.060)	Data 0.063 (0.068)	Mem 37.74GB	Prec@1 88.889 (82.477)	Loss 1.0555 (1.2494)
[02/24 03:19:40][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 91.667 (91.667)	Prec@5 97.222 (97.222)	mPrec@1 (58.264)	mPrec@5 (61.597)
[02/24 03:20:27][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 88.889 (90.278)	Prec@5 100.000 (98.359)	mPrec@1 (80.340)	mPrec@5 (90.709)
[02/24 03:21:15][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 84.722 (90.476)	Prec@5 100.000 (98.743)	mPrec@1 (80.909)	mPrec@5 (91.953)
[02/24 03:21:46][INFO] train_vision.py:  609: Overall Prec@1 90.283% Prec@5 98.684% mPrec@1 (83.046) mPrec@5 (96.383)
[02/24 03:21:46][INFO] train_vision.py:  454: Testing: 90.28340093326955/90.28340093326955
[02/24 03:21:46][INFO] train_vision.py:  455: Saving:
[02/24 03:21:59][INFO] train_vision.py:  544: Epoch: [8][0/209], lr: 1.89e-04, eta: 7:36:37	Time 5.957 (5.957)	Data 1.994 (1.994)	Mem 37.74GB	Prec@1 77.778 (77.778)	Loss 1.3279 (1.3279)
[02/24 03:22:39][INFO] train_vision.py:  544: Epoch: [8][10/209], lr: 1.88e-04, eta: 5:22:45	Time 4.043 (4.220)	Data 0.069 (0.242)	Mem 37.74GB	Prec@1 88.889 (85.859)	Loss 1.2296 (1.1838)
[02/24 03:23:20][INFO] train_vision.py:  544: Epoch: [8][20/209], lr: 1.88e-04, eta: 5:15:47	Time 4.051 (4.138)	Data 0.059 (0.156)	Mem 37.74GB	Prec@1 77.778 (83.069)	Loss 1.1646 (1.2094)
[02/24 03:24:00][INFO] train_vision.py:  544: Epoch: [8][30/209], lr: 1.88e-04, eta: 5:12:53	Time 4.044 (4.109)	Data 0.060 (0.126)	Mem 37.74GB	Prec@1 100.000 (86.022)	Loss 0.8143 (1.1602)
[02/24 03:24:41][INFO] train_vision.py:  544: Epoch: [8][40/209], lr: 1.88e-04, eta: 5:11:04	Time 4.053 (4.094)	Data 0.065 (0.110)	Mem 37.74GB	Prec@1 66.667 (84.282)	Loss 1.4675 (1.1735)
[02/24 03:25:21][INFO] train_vision.py:  544: Epoch: [8][50/209], lr: 1.87e-04, eta: 5:09:43	Time 4.052 (4.085)	Data 0.060 (0.101)	Mem 37.74GB	Prec@1 77.778 (84.096)	Loss 1.1939 (1.1884)
[02/24 03:26:02][INFO] train_vision.py:  544: Epoch: [8][60/209], lr: 1.87e-04, eta: 5:08:35	Time 4.056 (4.079)	Data 0.064 (0.094)	Mem 37.74GB	Prec@1 77.778 (83.971)	Loss 1.4474 (1.1951)
[02/24 03:26:42][INFO] train_vision.py:  544: Epoch: [8][70/209], lr: 1.87e-04, eta: 5:07:34	Time 4.045 (4.075)	Data 0.049 (0.089)	Mem 37.74GB	Prec@1 77.778 (83.255)	Loss 1.4917 (1.2203)
[02/24 03:27:23][INFO] train_vision.py:  544: Epoch: [8][80/209], lr: 1.86e-04, eta: 5:06:39	Time 4.055 (4.071)	Data 0.067 (0.086)	Mem 37.74GB	Prec@1 88.889 (82.853)	Loss 1.0948 (1.2357)
[02/24 03:28:03][INFO] train_vision.py:  544: Epoch: [8][90/209], lr: 1.86e-04, eta: 5:05:47	Time 4.050 (4.069)	Data 0.059 (0.083)	Mem 37.74GB	Prec@1 100.000 (82.906)	Loss 0.8633 (1.2418)
[02/24 03:28:44][INFO] train_vision.py:  544: Epoch: [8][100/209], lr: 1.86e-04, eta: 5:04:58	Time 4.061 (4.067)	Data 0.079 (0.082)	Mem 37.74GB	Prec@1 77.778 (83.168)	Loss 1.5177 (1.2429)
[02/24 03:29:24][INFO] train_vision.py:  544: Epoch: [8][110/209], lr: 1.86e-04, eta: 5:04:09	Time 4.047 (4.065)	Data 0.060 (0.080)	Mem 37.74GB	Prec@1 66.667 (83.283)	Loss 1.7894 (1.2382)
[02/24 03:30:05][INFO] train_vision.py:  544: Epoch: [8][120/209], lr: 1.85e-04, eta: 5:03:21	Time 4.056 (4.064)	Data 0.066 (0.078)	Mem 37.74GB	Prec@1 77.778 (83.287)	Loss 1.4285 (1.2458)
[02/24 03:30:45][INFO] train_vision.py:  544: Epoch: [8][130/209], lr: 1.85e-04, eta: 5:02:34	Time 4.042 (4.062)	Data 0.046 (0.076)	Mem 37.74GB	Prec@1 88.889 (83.545)	Loss 1.3558 (1.2463)
[02/24 03:31:26][INFO] train_vision.py:  544: Epoch: [8][140/209], lr: 1.85e-04, eta: 5:01:49	Time 4.051 (4.061)	Data 0.066 (0.075)	Mem 37.74GB	Prec@1 77.778 (83.924)	Loss 1.5834 (1.2378)
[02/24 03:32:06][INFO] train_vision.py:  544: Epoch: [8][150/209], lr: 1.84e-04, eta: 5:01:04	Time 4.049 (4.060)	Data 0.059 (0.074)	Mem 37.74GB	Prec@1 77.778 (83.664)	Loss 1.3007 (1.2433)
[02/24 03:32:47][INFO] train_vision.py:  544: Epoch: [8][160/209], lr: 1.84e-04, eta: 5:00:19	Time 4.059 (4.059)	Data 0.081 (0.073)	Mem 37.74GB	Prec@1 55.556 (83.851)	Loss 1.8724 (1.2368)
[02/24 03:33:27][INFO] train_vision.py:  544: Epoch: [8][170/209], lr: 1.84e-04, eta: 4:59:35	Time 4.050 (4.059)	Data 0.060 (0.073)	Mem 37.74GB	Prec@1 66.667 (83.626)	Loss 1.4049 (1.2416)
[02/24 03:34:08][INFO] train_vision.py:  544: Epoch: [8][180/209], lr: 1.83e-04, eta: 4:58:51	Time 4.053 (4.058)	Data 0.062 (0.072)	Mem 37.74GB	Prec@1 77.778 (83.425)	Loss 1.3982 (1.2515)
[02/24 03:34:48][INFO] train_vision.py:  544: Epoch: [8][190/209], lr: 1.83e-04, eta: 4:58:08	Time 4.048 (4.057)	Data 0.057 (0.071)	Mem 37.74GB	Prec@1 88.889 (83.013)	Loss 1.1196 (1.2563)
[02/24 03:35:28][INFO] train_vision.py:  544: Epoch: [8][200/209], lr: 1.83e-04, eta: 4:57:25	Time 4.056 (4.057)	Data 0.067 (0.070)	Mem 37.74GB	Prec@1 88.889 (82.808)	Loss 1.3623 (1.2610)
[02/24 03:36:07][INFO] train_vision.py:  544: Epoch: [9][0/209], lr: 1.82e-04, eta: 8:06:20	Time 6.647 (6.647)	Data 2.251 (2.251)	Mem 37.74GB	Prec@1 77.778 (77.778)	Loss 1.1632 (1.1632)
[02/24 03:36:48][INFO] train_vision.py:  544: Epoch: [9][10/209], lr: 1.82e-04, eta: 5:13:37	Time 4.055 (4.296)	Data 0.070 (0.254)	Mem 37.74GB	Prec@1 88.889 (80.808)	Loss 1.2256 (1.3001)
[02/24 03:37:28][INFO] train_vision.py:  544: Epoch: [9][20/209], lr: 1.82e-04, eta: 5:04:48	Time 4.068 (4.185)	Data 0.047 (0.159)	Mem 37.74GB	Prec@1 77.778 (85.714)	Loss 1.2484 (1.1990)
[02/24 03:38:09][INFO] train_vision.py:  544: Epoch: [9][30/209], lr: 1.82e-04, eta: 5:01:12	Time 4.053 (4.145)	Data 0.062 (0.123)	Mem 37.74GB	Prec@1 77.778 (86.738)	Loss 1.2690 (1.1435)
[02/24 03:38:49][INFO] train_vision.py:  544: Epoch: [9][40/209], lr: 1.81e-04, eta: 4:59:05	Time 4.065 (4.125)	Data 0.046 (0.106)	Mem 37.74GB	Prec@1 88.889 (86.450)	Loss 1.1546 (1.1578)
[02/24 03:39:30][INFO] train_vision.py:  544: Epoch: [9][50/209], lr: 1.81e-04, eta: 4:57:28	Time 4.055 (4.113)	Data 0.058 (0.096)	Mem 37.74GB	Prec@1 66.667 (86.710)	Loss 1.5753 (1.1569)
[02/24 03:40:11][INFO] train_vision.py:  544: Epoch: [9][60/209], lr: 1.81e-04, eta: 4:56:09	Time 4.069 (4.104)	Data 0.048 (0.089)	Mem 37.74GB	Prec@1 77.778 (87.067)	Loss 1.2752 (1.1536)
[02/24 03:40:51][INFO] train_vision.py:  544: Epoch: [9][70/209], lr: 1.80e-04, eta: 4:54:58	Time 4.049 (4.097)	Data 0.057 (0.084)	Mem 37.74GB	Prec@1 55.556 (85.915)	Loss 1.6258 (1.1710)
[02/24 03:41:32][INFO] train_vision.py:  544: Epoch: [9][80/209], lr: 1.80e-04, eta: 4:53:55	Time 4.063 (4.092)	Data 0.066 (0.081)	Mem 37.74GB	Prec@1 77.778 (86.420)	Loss 1.2526 (1.1590)
[02/24 03:42:12][INFO] train_vision.py:  544: Epoch: [9][90/209], lr: 1.79e-04, eta: 4:52:57	Time 4.045 (4.088)	Data 0.059 (0.078)	Mem 37.74GB	Prec@1 100.000 (86.691)	Loss 0.7940 (1.1543)
[02/24 03:42:53][INFO] train_vision.py:  544: Epoch: [9][100/209], lr: 1.79e-04, eta: 4:52:01	Time 4.056 (4.084)	Data 0.059 (0.076)	Mem 37.74GB	Prec@1 88.889 (86.909)	Loss 1.0411 (1.1597)
[02/24 03:43:33][INFO] train_vision.py:  544: Epoch: [9][110/209], lr: 1.79e-04, eta: 4:51:08	Time 4.047 (4.081)	Data 0.059 (0.074)	Mem 37.74GB	Prec@1 77.778 (86.486)	Loss 1.3178 (1.1688)
[02/24 03:44:14][INFO] train_vision.py:  544: Epoch: [9][120/209], lr: 1.78e-04, eta: 4:50:17	Time 4.057 (4.079)	Data 0.057 (0.073)	Mem 37.74GB	Prec@1 88.889 (86.134)	Loss 1.1400 (1.1784)
[02/24 03:44:54][INFO] train_vision.py:  544: Epoch: [9][130/209], lr: 1.78e-04, eta: 4:49:28	Time 4.045 (4.077)	Data 0.059 (0.072)	Mem 37.74GB	Prec@1 88.889 (86.344)	Loss 1.1235 (1.1731)
[02/24 03:45:35][INFO] train_vision.py:  544: Epoch: [9][140/209], lr: 1.78e-04, eta: 4:48:39	Time 4.046 (4.075)	Data 0.068 (0.071)	Mem 37.74GB	Prec@1 88.889 (86.210)	Loss 1.0640 (1.1747)
[02/24 03:46:15][INFO] train_vision.py:  544: Epoch: [9][150/209], lr: 1.77e-04, eta: 4:47:52	Time 4.047 (4.074)	Data 0.061 (0.070)	Mem 37.74GB	Prec@1 88.889 (85.946)	Loss 1.1364 (1.1767)
[02/24 03:46:56][INFO] train_vision.py:  544: Epoch: [9][160/209], lr: 1.77e-04, eta: 4:47:05	Time 4.054 (4.072)	Data 0.059 (0.069)	Mem 37.74GB	Prec@1 100.000 (86.473)	Loss 0.9188 (1.1636)
[02/24 03:47:36][INFO] train_vision.py:  544: Epoch: [9][170/209], lr: 1.77e-04, eta: 4:46:19	Time 4.045 (4.071)	Data 0.060 (0.069)	Mem 37.74GB	Prec@1 100.000 (86.745)	Loss 0.8472 (1.1583)
[02/24 03:48:17][INFO] train_vision.py:  544: Epoch: [9][180/209], lr: 1.76e-04, eta: 4:45:33	Time 4.050 (4.070)	Data 0.065 (0.069)	Mem 37.74GB	Prec@1 100.000 (86.618)	Loss 0.7944 (1.1619)
[02/24 03:48:57][INFO] train_vision.py:  544: Epoch: [9][190/209], lr: 1.76e-04, eta: 4:44:48	Time 4.049 (4.069)	Data 0.061 (0.068)	Mem 37.74GB	Prec@1 100.000 (86.736)	Loss 0.7981 (1.1548)
[02/24 03:49:38][INFO] train_vision.py:  544: Epoch: [9][200/209], lr: 1.75e-04, eta: 4:44:03	Time 4.050 (4.068)	Data 0.066 (0.068)	Mem 37.74GB	Prec@1 88.889 (86.567)	Loss 1.1086 (1.1594)
[02/24 03:50:17][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 91.667 (91.667)	Prec@5 97.222 (97.222)	mPrec@1 (57.222)	mPrec@5 (61.597)
[02/24 03:51:04][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 79.167 (87.500)	Prec@5 97.222 (97.854)	mPrec@1 (77.491)	mPrec@5 (89.792)
[02/24 03:51:51][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 79.167 (87.103)	Prec@5 100.000 (98.148)	mPrec@1 (77.935)	mPrec@5 (91.990)
[02/24 03:52:22][INFO] train_vision.py:  609: Overall Prec@1 86.943% Prec@5 98.128% mPrec@1 (82.722) mPrec@5 (96.212)
[02/24 03:52:22][INFO] train_vision.py:  454: Testing: 86.94331644035061/90.28340093326955
[02/24 03:52:22][INFO] train_vision.py:  455: Saving:
[02/24 03:52:32][INFO] train_vision.py:  544: Epoch: [10][0/209], lr: 1.75e-04, eta: 7:05:59	Time 6.113 (6.113)	Data 2.134 (2.134)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.1897 (1.1897)
[02/24 03:53:12][INFO] train_vision.py:  544: Epoch: [10][10/209], lr: 1.75e-04, eta: 4:54:53	Time 4.052 (4.242)	Data 0.063 (0.244)	Mem 37.74GB	Prec@1 55.556 (82.828)	Loss 1.4150 (1.1156)
[02/24 03:53:53][INFO] train_vision.py:  544: Epoch: [10][20/209], lr: 1.74e-04, eta: 4:48:03	Time 4.057 (4.154)	Data 0.072 (0.159)	Mem 37.74GB	Prec@1 100.000 (85.714)	Loss 0.7887 (1.1477)
[02/24 03:54:33][INFO] train_vision.py:  544: Epoch: [10][30/209], lr: 1.74e-04, eta: 4:45:03	Time 4.048 (4.120)	Data 0.063 (0.126)	Mem 37.74GB	Prec@1 88.889 (86.380)	Loss 1.1045 (1.1554)
[02/24 03:55:14][INFO] train_vision.py:  544: Epoch: [10][40/209], lr: 1.74e-04, eta: 4:43:09	Time 4.041 (4.103)	Data 0.047 (0.109)	Mem 37.74GB	Prec@1 77.778 (88.347)	Loss 1.2571 (1.1170)
[02/24 03:55:54][INFO] train_vision.py:  544: Epoch: [10][50/209], lr: 1.73e-04, eta: 4:41:46	Time 4.053 (4.093)	Data 0.062 (0.099)	Mem 37.74GB	Prec@1 88.889 (87.146)	Loss 1.1797 (1.1312)
[02/24 03:56:35][INFO] train_vision.py:  544: Epoch: [10][60/209], lr: 1.73e-04, eta: 4:40:35	Time 4.049 (4.085)	Data 0.047 (0.091)	Mem 37.74GB	Prec@1 100.000 (86.339)	Loss 0.7925 (1.1582)
[02/24 03:57:15][INFO] train_vision.py:  544: Epoch: [10][70/209], lr: 1.72e-04, eta: 4:39:34	Time 4.050 (4.080)	Data 0.062 (0.086)	Mem 37.74GB	Prec@1 77.778 (86.385)	Loss 1.5536 (1.1611)
[02/24 03:57:56][INFO] train_vision.py:  544: Epoch: [10][80/209], lr: 1.72e-04, eta: 4:38:38	Time 4.044 (4.077)	Data 0.048 (0.083)	Mem 37.74GB	Prec@1 88.889 (86.008)	Loss 1.1860 (1.1636)
[02/24 03:58:36][INFO] train_vision.py:  544: Epoch: [10][90/209], lr: 1.72e-04, eta: 4:37:45	Time 4.052 (4.074)	Data 0.064 (0.080)	Mem 37.74GB	Prec@1 77.778 (86.325)	Loss 1.1566 (1.1624)
[02/24 03:59:17][INFO] train_vision.py:  544: Epoch: [10][100/209], lr: 1.71e-04, eta: 4:36:54	Time 4.049 (4.071)	Data 0.048 (0.077)	Mem 37.74GB	Prec@1 88.889 (86.469)	Loss 1.1996 (1.1582)
[02/24 03:59:57][INFO] train_vision.py:  544: Epoch: [10][110/209], lr: 1.71e-04, eta: 4:36:06	Time 4.052 (4.069)	Data 0.065 (0.075)	Mem 37.74GB	Prec@1 88.889 (86.687)	Loss 1.1623 (1.1489)
[02/24 04:00:38][INFO] train_vision.py:  544: Epoch: [10][120/209], lr: 1.70e-04, eta: 4:35:19	Time 4.050 (4.068)	Data 0.048 (0.074)	Mem 37.74GB	Prec@1 100.000 (86.961)	Loss 0.8645 (1.1482)
[02/24 04:01:19][INFO] train_vision.py:  544: Epoch: [10][130/209], lr: 1.70e-04, eta: 4:34:34	Time 4.049 (4.067)	Data 0.060 (0.073)	Mem 37.74GB	Prec@1 88.889 (86.344)	Loss 0.9449 (1.1575)
[02/24 04:01:59][INFO] train_vision.py:  544: Epoch: [10][140/209], lr: 1.70e-04, eta: 4:33:48	Time 4.050 (4.065)	Data 0.048 (0.072)	Mem 37.74GB	Prec@1 77.778 (86.367)	Loss 1.4184 (1.1576)
[02/24 04:02:40][INFO] train_vision.py:  544: Epoch: [10][150/209], lr: 1.69e-04, eta: 4:33:04	Time 4.065 (4.065)	Data 0.078 (0.071)	Mem 37.74GB	Prec@1 100.000 (86.240)	Loss 1.0489 (1.1635)
[02/24 04:03:20][INFO] train_vision.py:  544: Epoch: [10][160/209], lr: 1.69e-04, eta: 4:32:19	Time 4.045 (4.064)	Data 0.048 (0.070)	Mem 37.74GB	Prec@1 88.889 (85.990)	Loss 1.1402 (1.1683)
[02/24 04:04:01][INFO] train_vision.py:  544: Epoch: [10][170/209], lr: 1.68e-04, eta: 4:31:36	Time 4.046 (4.063)	Data 0.058 (0.069)	Mem 37.74GB	Prec@1 77.778 (85.835)	Loss 1.0650 (1.1697)
[02/24 04:04:41][INFO] train_vision.py:  544: Epoch: [10][180/209], lr: 1.68e-04, eta: 4:30:52	Time 4.045 (4.062)	Data 0.048 (0.069)	Mem 37.74GB	Prec@1 77.778 (85.758)	Loss 1.3396 (1.1723)
[02/24 04:05:22][INFO] train_vision.py:  544: Epoch: [10][190/209], lr: 1.68e-04, eta: 4:30:09	Time 4.047 (4.061)	Data 0.058 (0.068)	Mem 37.74GB	Prec@1 77.778 (85.922)	Loss 1.6306 (1.1680)
[02/24 04:06:02][INFO] train_vision.py:  544: Epoch: [10][200/209], lr: 1.67e-04, eta: 4:29:26	Time 4.047 (4.061)	Data 0.049 (0.067)	Mem 37.74GB	Prec@1 100.000 (85.904)	Loss 1.1390 (1.1683)
[02/24 04:06:40][INFO] train_vision.py:  544: Epoch: [11][0/209], lr: 1.67e-04, eta: 7:08:13	Time 6.469 (6.469)	Data 2.280 (2.280)	Mem 37.74GB	Prec@1 77.778 (77.778)	Loss 1.5667 (1.5667)
[02/24 04:07:21][INFO] train_vision.py:  544: Epoch: [11][10/209], lr: 1.66e-04, eta: 4:42:21	Time 4.034 (4.276)	Data 0.046 (0.264)	Mem 37.74GB	Prec@1 77.778 (87.879)	Loss 1.3137 (1.0948)
[02/24 04:08:01][INFO] train_vision.py:  544: Epoch: [11][20/209], lr: 1.66e-04, eta: 4:34:49	Time 4.052 (4.173)	Data 0.057 (0.169)	Mem 37.74GB	Prec@1 88.889 (88.360)	Loss 1.0866 (1.1118)
[02/24 04:08:42][INFO] train_vision.py:  544: Epoch: [11][30/209], lr: 1.65e-04, eta: 4:31:32	Time 4.049 (4.133)	Data 0.056 (0.134)	Mem 37.74GB	Prec@1 77.778 (87.097)	Loss 1.4501 (1.1391)
[02/24 04:09:22][INFO] train_vision.py:  544: Epoch: [11][40/209], lr: 1.65e-04, eta: 4:29:30	Time 4.048 (4.113)	Data 0.059 (0.116)	Mem 37.74GB	Prec@1 77.778 (85.908)	Loss 1.2879 (1.1545)
[02/24 04:10:03][INFO] train_vision.py:  544: Epoch: [11][50/209], lr: 1.65e-04, eta: 4:28:01	Time 4.048 (4.100)	Data 0.058 (0.105)	Mem 37.74GB	Prec@1 88.889 (85.403)	Loss 1.0083 (1.1642)
[02/24 04:10:43][INFO] train_vision.py:  544: Epoch: [11][60/209], lr: 1.64e-04, eta: 4:26:47	Time 4.046 (4.092)	Data 0.059 (0.098)	Mem 37.74GB	Prec@1 77.778 (85.064)	Loss 1.1664 (1.1687)
[02/24 04:11:24][INFO] train_vision.py:  544: Epoch: [11][70/209], lr: 1.64e-04, eta: 4:25:44	Time 4.050 (4.086)	Data 0.064 (0.093)	Mem 37.74GB	Prec@1 100.000 (84.351)	Loss 0.7556 (1.1931)
[02/24 04:12:05][INFO] train_vision.py:  544: Epoch: [11][80/209], lr: 1.63e-04, eta: 4:24:46	Time 4.046 (4.082)	Data 0.059 (0.089)	Mem 37.74GB	Prec@1 77.778 (83.951)	Loss 1.2123 (1.1858)
[02/24 04:12:45][INFO] train_vision.py:  544: Epoch: [11][90/209], lr: 1.63e-04, eta: 4:23:53	Time 4.049 (4.079)	Data 0.057 (0.086)	Mem 37.74GB	Prec@1 100.000 (84.737)	Loss 0.7530 (1.1644)
[02/24 04:13:26][INFO] train_vision.py:  544: Epoch: [11][100/209], lr: 1.62e-04, eta: 4:23:01	Time 4.051 (4.076)	Data 0.060 (0.084)	Mem 37.74GB	Prec@1 88.889 (85.259)	Loss 1.0062 (1.1568)
[02/24 04:14:06][INFO] train_vision.py:  544: Epoch: [11][110/209], lr: 1.62e-04, eta: 4:22:12	Time 4.051 (4.074)	Data 0.066 (0.082)	Mem 37.74GB	Prec@1 100.000 (85.485)	Loss 0.7713 (1.1493)
[02/24 04:14:47][INFO] train_vision.py:  544: Epoch: [11][120/209], lr: 1.61e-04, eta: 4:21:24	Time 4.053 (4.072)	Data 0.059 (0.080)	Mem 37.74GB	Prec@1 44.444 (85.308)	Loss 2.5564 (1.1602)
[02/24 04:15:27][INFO] train_vision.py:  544: Epoch: [11][130/209], lr: 1.61e-04, eta: 4:20:38	Time 4.056 (4.070)	Data 0.066 (0.079)	Mem 37.74GB	Prec@1 88.889 (85.411)	Loss 0.9835 (1.1614)
[02/24 04:16:08][INFO] train_vision.py:  544: Epoch: [11][140/209], lr: 1.60e-04, eta: 4:19:51	Time 4.045 (4.069)	Data 0.062 (0.078)	Mem 37.74GB	Prec@1 88.889 (85.658)	Loss 1.1159 (1.1618)
[02/24 04:16:48][INFO] train_vision.py:  544: Epoch: [11][150/209], lr: 1.60e-04, eta: 4:19:06	Time 4.053 (4.068)	Data 0.065 (0.077)	Mem 37.74GB	Prec@1 66.667 (85.872)	Loss 1.4443 (1.1536)
[02/24 04:17:29][INFO] train_vision.py:  544: Epoch: [11][160/209], lr: 1.60e-04, eta: 4:18:22	Time 4.044 (4.067)	Data 0.059 (0.076)	Mem 37.74GB	Prec@1 100.000 (85.783)	Loss 1.0651 (1.1581)
[02/24 04:18:09][INFO] train_vision.py:  544: Epoch: [11][170/209], lr: 1.59e-04, eta: 4:17:38	Time 4.058 (4.066)	Data 0.067 (0.075)	Mem 37.74GB	Prec@1 88.889 (85.510)	Loss 1.1248 (1.1708)
[02/24 04:18:50][INFO] train_vision.py:  544: Epoch: [11][180/209], lr: 1.59e-04, eta: 4:16:54	Time 4.051 (4.065)	Data 0.059 (0.074)	Mem 37.74GB	Prec@1 88.889 (85.697)	Loss 1.0145 (1.1654)
[02/24 04:19:30][INFO] train_vision.py:  544: Epoch: [11][190/209], lr: 1.58e-04, eta: 4:16:11	Time 4.056 (4.064)	Data 0.068 (0.074)	Mem 37.74GB	Prec@1 77.778 (85.573)	Loss 1.4826 (1.1672)
[02/24 04:20:11][INFO] train_vision.py:  544: Epoch: [11][200/209], lr: 1.58e-04, eta: 4:15:28	Time 4.047 (4.064)	Data 0.058 (0.073)	Mem 37.74GB	Prec@1 77.778 (85.185)	Loss 1.4619 (1.1817)
[02/24 04:20:50][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 87.500 (87.500)	Prec@5 95.833 (95.833)	mPrec@1 (53.889)	mPrec@5 (61.389)
[02/24 04:21:37][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 81.944 (86.616)	Prec@5 97.222 (97.854)	mPrec@1 (72.017)	mPrec@5 (90.408)
[02/24 04:22:24][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 87.500 (86.971)	Prec@5 100.000 (98.479)	mPrec@1 (73.701)	mPrec@5 (92.183)
[02/24 04:22:55][INFO] train_vision.py:  609: Overall Prec@1 87.601% Prec@5 98.532% mPrec@1 (76.007) mPrec@5 (96.444)
[02/24 04:22:55][INFO] train_vision.py:  454: Testing: 87.60121318492813/90.28340093326955
[02/24 04:22:55][INFO] train_vision.py:  455: Saving:
[02/24 04:23:05][INFO] train_vision.py:  544: Epoch: [12][0/209], lr: 1.57e-04, eta: 6:28:23	Time 6.193 (6.193)	Data 2.211 (2.211)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.0879 (1.0879)
[02/24 04:23:46][INFO] train_vision.py:  544: Epoch: [12][10/209], lr: 1.57e-04, eta: 4:25:56	Time 4.056 (4.252)	Data 0.073 (0.265)	Mem 37.74GB	Prec@1 66.667 (85.859)	Loss 1.5371 (1.1950)
[02/24 04:24:26][INFO] train_vision.py:  544: Epoch: [12][20/209], lr: 1.56e-04, eta: 4:19:27	Time 4.079 (4.159)	Data 0.084 (0.171)	Mem 37.74GB	Prec@1 55.556 (83.598)	Loss 1.5987 (1.1975)
[02/24 04:25:07][INFO] train_vision.py:  544: Epoch: [12][30/209], lr: 1.56e-04, eta: 4:16:46	Time 4.051 (4.127)	Data 0.069 (0.137)	Mem 37.74GB	Prec@1 77.778 (83.154)	Loss 1.2172 (1.2188)
[02/24 04:25:47][INFO] train_vision.py:  544: Epoch: [12][40/209], lr: 1.55e-04, eta: 4:15:07	Time 4.066 (4.112)	Data 0.054 (0.118)	Mem 37.74GB	Prec@1 66.667 (82.656)	Loss 1.3260 (1.2333)
[02/24 04:26:28][INFO] train_vision.py:  544: Epoch: [12][50/209], lr: 1.55e-04, eta: 4:13:50	Time 4.054 (4.102)	Data 0.070 (0.106)	Mem 37.74GB	Prec@1 100.000 (83.660)	Loss 0.8422 (1.2138)
[02/24 04:27:09][INFO] train_vision.py:  544: Epoch: [12][60/209], lr: 1.54e-04, eta: 4:12:44	Time 4.060 (4.095)	Data 0.048 (0.098)	Mem 37.74GB	Prec@1 100.000 (84.335)	Loss 0.9817 (1.2019)
[02/24 04:27:49][INFO] train_vision.py:  544: Epoch: [12][70/209], lr: 1.54e-04, eta: 4:11:45	Time 4.053 (4.090)	Data 0.073 (0.093)	Mem 37.74GB	Prec@1 88.889 (84.664)	Loss 1.1315 (1.1890)
[02/24 04:28:30][INFO] train_vision.py:  544: Epoch: [12][80/209], lr: 1.53e-04, eta: 4:10:51	Time 4.056 (4.087)	Data 0.038 (0.088)	Mem 37.74GB	Prec@1 100.000 (85.460)	Loss 0.7785 (1.1640)
[02/24 04:29:11][INFO] train_vision.py:  544: Epoch: [12][90/209], lr: 1.53e-04, eta: 4:10:01	Time 4.056 (4.084)	Data 0.077 (0.086)	Mem 37.74GB	Prec@1 88.889 (86.081)	Loss 0.9400 (1.1501)
[02/24 04:29:51][INFO] train_vision.py:  544: Epoch: [12][100/209], lr: 1.52e-04, eta: 4:09:12	Time 4.063 (4.082)	Data 0.048 (0.083)	Mem 37.74GB	Prec@1 100.000 (86.139)	Loss 0.7613 (1.1537)
[02/24 04:30:32][INFO] train_vision.py:  544: Epoch: [12][110/209], lr: 1.52e-04, eta: 4:08:24	Time 4.058 (4.080)	Data 0.077 (0.081)	Mem 37.74GB	Prec@1 66.667 (85.886)	Loss 1.8248 (1.1622)
[02/24 04:31:12][INFO] train_vision.py:  544: Epoch: [12][120/209], lr: 1.52e-04, eta: 4:07:38	Time 4.060 (4.079)	Data 0.046 (0.079)	Mem 37.74GB	Prec@1 88.889 (85.675)	Loss 1.0073 (1.1689)
[02/24 04:31:53][INFO] train_vision.py:  544: Epoch: [12][130/209], lr: 1.51e-04, eta: 4:06:54	Time 4.060 (4.078)	Data 0.081 (0.078)	Mem 37.74GB	Prec@1 88.889 (86.260)	Loss 1.1601 (1.1575)
[02/24 04:32:34][INFO] train_vision.py:  544: Epoch: [12][140/209], lr: 1.51e-04, eta: 4:06:08	Time 4.067 (4.076)	Data 0.048 (0.077)	Mem 37.74GB	Prec@1 100.000 (86.682)	Loss 0.7720 (1.1466)
[02/24 04:33:14][INFO] train_vision.py:  544: Epoch: [12][150/209], lr: 1.50e-04, eta: 4:05:24	Time 4.058 (4.075)	Data 0.079 (0.076)	Mem 37.74GB	Prec@1 55.556 (86.829)	Loss 2.3128 (1.1463)
[02/24 04:33:55][INFO] train_vision.py:  544: Epoch: [12][160/209], lr: 1.50e-04, eta: 4:04:41	Time 4.064 (4.075)	Data 0.055 (0.075)	Mem 37.74GB	Prec@1 88.889 (86.818)	Loss 1.2487 (1.1421)
[02/24 04:34:36][INFO] train_vision.py:  544: Epoch: [12][170/209], lr: 1.49e-04, eta: 4:03:57	Time 4.058 (4.074)	Data 0.079 (0.074)	Mem 37.74GB	Prec@1 100.000 (86.810)	Loss 0.9045 (1.1435)
[02/24 04:35:16][INFO] train_vision.py:  544: Epoch: [12][180/209], lr: 1.49e-04, eta: 4:03:15	Time 4.070 (4.073)	Data 0.048 (0.073)	Mem 37.74GB	Prec@1 88.889 (86.495)	Loss 1.0194 (1.1450)
[02/24 04:35:57][INFO] train_vision.py:  544: Epoch: [12][190/209], lr: 1.48e-04, eta: 4:02:32	Time 4.057 (4.073)	Data 0.074 (0.073)	Mem 37.74GB	Prec@1 88.889 (86.271)	Loss 0.9241 (1.1479)
[02/24 04:36:37][INFO] train_vision.py:  544: Epoch: [12][200/209], lr: 1.48e-04, eta: 4:01:50	Time 4.064 (4.073)	Data 0.046 (0.072)	Mem 37.74GB	Prec@1 66.667 (86.070)	Loss 1.5170 (1.1526)
[02/24 04:37:16][INFO] train_vision.py:  544: Epoch: [13][0/209], lr: 1.47e-04, eta: 6:20:42	Time 6.427 (6.427)	Data 2.278 (2.278)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 0.9929 (0.9929)
[02/24 04:37:57][INFO] train_vision.py:  544: Epoch: [13][10/209], lr: 1.47e-04, eta: 4:12:26	Time 4.064 (4.274)	Data 0.062 (0.264)	Mem 37.74GB	Prec@1 77.778 (85.859)	Loss 0.9844 (1.1288)
[02/24 04:38:37][INFO] train_vision.py:  544: Epoch: [13][20/209], lr: 1.46e-04, eta: 4:05:45	Time 4.055 (4.173)	Data 0.076 (0.172)	Mem 37.74GB	Prec@1 77.778 (86.243)	Loss 1.1696 (1.1203)
[02/24 04:39:18][INFO] train_vision.py:  544: Epoch: [13][30/209], lr: 1.46e-04, eta: 4:02:54	Time 4.060 (4.136)	Data 0.050 (0.137)	Mem 37.74GB	Prec@1 88.889 (87.455)	Loss 0.9604 (1.1032)
[02/24 04:39:58][INFO] train_vision.py:  544: Epoch: [13][40/209], lr: 1.45e-04, eta: 4:01:08	Time 4.054 (4.117)	Data 0.073 (0.120)	Mem 37.74GB	Prec@1 77.778 (87.805)	Loss 1.0148 (1.0815)
[02/24 04:40:39][INFO] train_vision.py:  544: Epoch: [13][50/209], lr: 1.45e-04, eta: 3:59:41	Time 4.060 (4.104)	Data 0.040 (0.108)	Mem 37.74GB	Prec@1 88.889 (87.582)	Loss 0.9685 (1.0869)
[02/24 04:41:19][INFO] train_vision.py:  544: Epoch: [13][60/209], lr: 1.44e-04, eta: 3:58:35	Time 4.052 (4.097)	Data 0.075 (0.101)	Mem 37.74GB	Prec@1 88.889 (87.250)	Loss 0.8859 (1.0945)
[02/24 04:42:00][INFO] train_vision.py:  544: Epoch: [13][70/209], lr: 1.43e-04, eta: 3:57:37	Time 4.062 (4.092)	Data 0.043 (0.096)	Mem 37.74GB	Prec@1 100.000 (87.324)	Loss 0.9083 (1.1102)
[02/24 04:42:41][INFO] train_vision.py:  544: Epoch: [13][80/209], lr: 1.43e-04, eta: 3:56:43	Time 4.058 (4.089)	Data 0.074 (0.093)	Mem 37.74GB	Prec@1 66.667 (86.968)	Loss 1.5856 (1.1163)
[02/24 04:43:21][INFO] train_vision.py:  544: Epoch: [13][90/209], lr: 1.42e-04, eta: 3:55:52	Time 4.059 (4.086)	Data 0.038 (0.089)	Mem 37.74GB	Prec@1 100.000 (86.935)	Loss 0.9498 (1.1243)
[02/24 04:44:02][INFO] train_vision.py:  544: Epoch: [13][100/209], lr: 1.42e-04, eta: 3:55:03	Time 4.062 (4.083)	Data 0.078 (0.088)	Mem 37.74GB	Prec@1 100.000 (86.799)	Loss 0.8996 (1.1263)
[02/24 04:44:43][INFO] train_vision.py:  544: Epoch: [13][110/209], lr: 1.41e-04, eta: 3:54:16	Time 4.062 (4.081)	Data 0.044 (0.086)	Mem 37.74GB	Prec@1 100.000 (87.387)	Loss 0.7703 (1.1093)
[02/24 04:45:23][INFO] train_vision.py:  544: Epoch: [13][120/209], lr: 1.41e-04, eta: 3:53:30	Time 4.056 (4.080)	Data 0.075 (0.084)	Mem 37.74GB	Prec@1 100.000 (87.420)	Loss 0.8669 (1.1098)
[02/24 04:46:04][INFO] train_vision.py:  544: Epoch: [13][130/209], lr: 1.40e-04, eta: 3:52:44	Time 4.061 (4.078)	Data 0.036 (0.083)	Mem 37.74GB	Prec@1 88.889 (87.277)	Loss 1.2611 (1.1144)
[02/24 04:46:44][INFO] train_vision.py:  544: Epoch: [13][140/209], lr: 1.40e-04, eta: 3:51:59	Time 4.075 (4.077)	Data 0.075 (0.081)	Mem 37.74GB	Prec@1 88.889 (87.392)	Loss 1.3435 (1.1179)
[02/24 04:47:25][INFO] train_vision.py:  544: Epoch: [13][150/209], lr: 1.39e-04, eta: 3:51:15	Time 4.065 (4.076)	Data 0.046 (0.080)	Mem 37.74GB	Prec@1 77.778 (87.123)	Loss 1.0494 (1.1202)
[02/24 04:48:06][INFO] train_vision.py:  544: Epoch: [13][160/209], lr: 1.39e-04, eta: 3:50:32	Time 4.073 (4.076)	Data 0.077 (0.079)	Mem 37.74GB	Prec@1 100.000 (87.095)	Loss 0.7597 (1.1236)
[02/24 04:48:46][INFO] train_vision.py:  544: Epoch: [13][170/209], lr: 1.38e-04, eta: 3:49:49	Time 4.061 (4.075)	Data 0.054 (0.078)	Mem 37.74GB	Prec@1 88.889 (87.394)	Loss 1.1510 (1.1159)
[02/24 04:49:27][INFO] train_vision.py:  544: Epoch: [13][180/209], lr: 1.38e-04, eta: 3:49:07	Time 4.075 (4.075)	Data 0.077 (0.078)	Mem 37.74GB	Prec@1 88.889 (87.416)	Loss 1.0820 (1.1144)
[02/24 04:50:08][INFO] train_vision.py:  544: Epoch: [13][190/209], lr: 1.37e-04, eta: 3:48:25	Time 4.054 (4.074)	Data 0.047 (0.077)	Mem 37.74GB	Prec@1 88.889 (87.376)	Loss 1.0648 (1.1177)
[02/24 04:50:48][INFO] train_vision.py:  544: Epoch: [13][200/209], lr: 1.37e-04, eta: 3:47:42	Time 4.073 (4.074)	Data 0.078 (0.077)	Mem 37.74GB	Prec@1 77.778 (87.120)	Loss 1.3677 (1.1218)
[02/24 04:51:27][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 91.667 (91.667)	Prec@5 97.222 (97.222)	mPrec@1 (57.569)	mPrec@5 (61.597)
[02/24 04:52:15][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 91.667 (91.035)	Prec@5 98.611 (98.106)	mPrec@1 (79.192)	mPrec@5 (90.413)
[02/24 04:53:02][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 87.500 (91.138)	Prec@5 100.000 (98.545)	mPrec@1 (80.491)	mPrec@5 (92.617)
[02/24 04:53:33][INFO] train_vision.py:  609: Overall Prec@1 90.789% Prec@5 98.532% mPrec@1 (85.243) mPrec@5 (96.745)
[02/24 04:53:33][INFO] train_vision.py:  454: Testing: 90.78947380776347/90.78947380776347
[02/24 04:53:33][INFO] train_vision.py:  455: Saving:
[02/24 04:53:46][INFO] train_vision.py:  544: Epoch: [14][0/209], lr: 1.36e-04, eta: 5:34:17	Time 5.996 (5.996)	Data 2.018 (2.018)	Mem 37.74GB	Prec@1 100.000 (100.000)	Loss 1.0061 (1.0061)
[02/24 04:54:27][INFO] train_vision.py:  544: Epoch: [14][10/209], lr: 1.36e-04, eta: 3:55:32	Time 4.063 (4.238)	Data 0.071 (0.244)	Mem 37.74GB	Prec@1 100.000 (90.909)	Loss 0.7589 (1.0894)
[02/24 04:55:07][INFO] train_vision.py:  544: Epoch: [14][20/209], lr: 1.35e-04, eta: 3:50:23	Time 4.074 (4.157)	Data 0.067 (0.161)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.0275 (1.0991)
[02/24 04:55:48][INFO] train_vision.py:  544: Epoch: [14][30/209], lr: 1.35e-04, eta: 3:48:10	Time 4.073 (4.130)	Data 0.078 (0.132)	Mem 37.74GB	Prec@1 88.889 (90.323)	Loss 1.3688 (1.0636)
[02/24 04:56:29][INFO] train_vision.py:  544: Epoch: [14][40/209], lr: 1.34e-04, eta: 3:46:40	Time 4.059 (4.115)	Data 0.056 (0.117)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.1808 (1.0884)
[02/24 04:57:09][INFO] train_vision.py:  544: Epoch: [14][50/209], lr: 1.33e-04, eta: 3:45:31	Time 4.065 (4.107)	Data 0.074 (0.108)	Mem 37.74GB	Prec@1 100.000 (88.671)	Loss 0.7656 (1.0973)
[02/24 04:57:50][INFO] train_vision.py:  544: Epoch: [14][60/209], lr: 1.33e-04, eta: 3:44:30	Time 4.047 (4.101)	Data 0.066 (0.102)	Mem 37.74GB	Prec@1 77.778 (88.342)	Loss 1.1557 (1.1031)
[02/24 04:58:31][INFO] train_vision.py:  544: Epoch: [14][70/209], lr: 1.32e-04, eta: 3:43:32	Time 4.063 (4.095)	Data 0.069 (0.097)	Mem 37.74GB	Prec@1 88.889 (88.732)	Loss 1.1422 (1.0887)
[02/24 04:59:11][INFO] train_vision.py:  544: Epoch: [14][80/209], lr: 1.32e-04, eta: 3:42:36	Time 4.056 (4.091)	Data 0.064 (0.093)	Mem 37.74GB	Prec@1 100.000 (88.340)	Loss 0.9362 (1.0942)
[02/24 04:59:52][INFO] train_vision.py:  544: Epoch: [14][90/209], lr: 1.31e-04, eta: 3:41:44	Time 4.070 (4.088)	Data 0.070 (0.090)	Mem 37.74GB	Prec@1 100.000 (88.889)	Loss 0.7738 (1.0795)
[02/24 05:00:33][INFO] train_vision.py:  544: Epoch: [14][100/209], lr: 1.31e-04, eta: 3:40:55	Time 4.060 (4.085)	Data 0.073 (0.088)	Mem 37.74GB	Prec@1 88.889 (88.559)	Loss 1.0057 (1.0848)
[02/24 05:01:13][INFO] train_vision.py:  544: Epoch: [14][110/209], lr: 1.30e-04, eta: 3:40:06	Time 4.066 (4.082)	Data 0.064 (0.085)	Mem 37.74GB	Prec@1 100.000 (88.589)	Loss 0.8235 (1.0800)
[02/24 05:01:54][INFO] train_vision.py:  544: Epoch: [14][120/209], lr: 1.30e-04, eta: 3:39:19	Time 4.058 (4.081)	Data 0.063 (0.084)	Mem 37.74GB	Prec@1 100.000 (88.246)	Loss 0.8799 (1.0916)
[02/24 05:02:34][INFO] train_vision.py:  544: Epoch: [14][130/209], lr: 1.29e-04, eta: 3:38:34	Time 4.061 (4.079)	Data 0.068 (0.083)	Mem 37.74GB	Prec@1 77.778 (87.786)	Loss 1.2852 (1.1004)
[02/24 05:03:15][INFO] train_vision.py:  544: Epoch: [14][140/209], lr: 1.29e-04, eta: 3:37:49	Time 4.057 (4.078)	Data 0.061 (0.081)	Mem 37.74GB	Prec@1 88.889 (87.943)	Loss 0.9586 (1.0953)
[02/24 05:03:56][INFO] train_vision.py:  544: Epoch: [14][150/209], lr: 1.28e-04, eta: 3:37:05	Time 4.064 (4.077)	Data 0.066 (0.080)	Mem 37.74GB	Prec@1 88.889 (87.417)	Loss 1.0796 (1.1038)
[02/24 05:04:36][INFO] train_vision.py:  544: Epoch: [14][160/209], lr: 1.27e-04, eta: 3:36:21	Time 4.065 (4.076)	Data 0.077 (0.079)	Mem 37.74GB	Prec@1 66.667 (87.509)	Loss 1.4057 (1.1030)
[02/24 05:05:17][INFO] train_vision.py:  544: Epoch: [14][170/209], lr: 1.27e-04, eta: 3:35:37	Time 4.075 (4.075)	Data 0.066 (0.078)	Mem 37.74GB	Prec@1 88.889 (87.199)	Loss 1.1434 (1.1033)
[02/24 05:05:57][INFO] train_vision.py:  544: Epoch: [14][180/209], lr: 1.26e-04, eta: 3:34:54	Time 4.053 (4.074)	Data 0.065 (0.078)	Mem 37.74GB	Prec@1 77.778 (87.293)	Loss 1.0826 (1.1030)
[02/24 05:06:38][INFO] train_vision.py:  544: Epoch: [14][190/209], lr: 1.26e-04, eta: 3:34:11	Time 4.074 (4.074)	Data 0.066 (0.077)	Mem 37.74GB	Prec@1 88.889 (87.202)	Loss 0.9415 (1.1074)
[02/24 05:07:19][INFO] train_vision.py:  544: Epoch: [14][200/209], lr: 1.25e-04, eta: 3:33:29	Time 4.060 (4.073)	Data 0.065 (0.076)	Mem 37.74GB	Prec@1 77.778 (87.009)	Loss 1.1753 (1.1145)
[02/24 05:07:57][INFO] train_vision.py:  544: Epoch: [15][0/209], lr: 1.25e-04, eta: 5:42:19	Time 6.550 (6.550)	Data 2.223 (2.223)	Mem 37.74GB	Prec@1 77.778 (77.778)	Loss 1.5616 (1.5616)
[02/24 05:08:38][INFO] train_vision.py:  544: Epoch: [15][10/209], lr: 1.24e-04, eta: 3:43:29	Time 4.083 (4.290)	Data 0.039 (0.252)	Mem 37.74GB	Prec@1 88.889 (82.828)	Loss 1.0838 (1.2130)
[02/24 05:09:18][INFO] train_vision.py:  544: Epoch: [15][20/209], lr: 1.24e-04, eta: 3:37:06	Time 4.056 (4.180)	Data 0.061 (0.160)	Mem 37.74GB	Prec@1 66.667 (86.243)	Loss 1.7953 (1.1616)
[02/24 05:09:59][INFO] train_vision.py:  544: Epoch: [15][30/209], lr: 1.23e-04, eta: 3:34:24	Time 4.045 (4.142)	Data 0.037 (0.127)	Mem 37.74GB	Prec@1 77.778 (86.380)	Loss 1.5168 (1.1747)
[02/24 05:10:40][INFO] train_vision.py:  544: Epoch: [15][40/209], lr: 1.23e-04, eta: 3:32:40	Time 4.062 (4.122)	Data 0.062 (0.110)	Mem 37.74GB	Prec@1 77.778 (86.992)	Loss 1.2414 (1.1538)
[02/24 05:11:20][INFO] train_vision.py:  544: Epoch: [15][50/209], lr: 1.22e-04, eta: 3:31:22	Time 4.045 (4.110)	Data 0.042 (0.100)	Mem 37.74GB	Prec@1 88.889 (87.800)	Loss 0.9230 (1.1294)
[02/24 05:12:01][INFO] train_vision.py:  544: Epoch: [15][60/209], lr: 1.21e-04, eta: 3:30:16	Time 4.084 (4.101)	Data 0.064 (0.093)	Mem 37.74GB	Prec@1 77.778 (87.432)	Loss 1.3093 (1.1217)
[02/24 05:12:41][INFO] train_vision.py:  544: Epoch: [15][70/209], lr: 1.21e-04, eta: 3:29:17	Time 4.050 (4.096)	Data 0.046 (0.089)	Mem 37.74GB	Prec@1 88.889 (86.698)	Loss 1.2061 (1.1356)
[02/24 05:13:22][INFO] train_vision.py:  544: Epoch: [15][80/209], lr: 1.20e-04, eta: 3:28:23	Time 4.071 (4.091)	Data 0.076 (0.086)	Mem 37.74GB	Prec@1 88.889 (87.380)	Loss 1.2542 (1.1259)
[02/24 05:14:03][INFO] train_vision.py:  544: Epoch: [15][90/209], lr: 1.20e-04, eta: 3:27:31	Time 4.042 (4.088)	Data 0.044 (0.083)	Mem 37.74GB	Prec@1 88.889 (87.912)	Loss 1.0678 (1.1092)
[02/24 05:14:43][INFO] train_vision.py:  544: Epoch: [15][100/209], lr: 1.19e-04, eta: 3:26:42	Time 4.075 (4.085)	Data 0.070 (0.081)	Mem 37.74GB	Prec@1 100.000 (88.339)	Loss 0.7434 (1.0962)
[02/24 05:15:24][INFO] train_vision.py:  544: Epoch: [15][110/209], lr: 1.19e-04, eta: 3:25:54	Time 4.047 (4.083)	Data 0.045 (0.080)	Mem 37.74GB	Prec@1 77.778 (88.488)	Loss 1.4988 (1.0957)
[02/24 05:16:04][INFO] train_vision.py:  544: Epoch: [15][120/209], lr: 1.18e-04, eta: 3:25:07	Time 4.064 (4.081)	Data 0.067 (0.078)	Mem 37.74GB	Prec@1 77.778 (88.338)	Loss 1.2829 (1.0949)
[02/24 05:16:45][INFO] train_vision.py:  544: Epoch: [15][130/209], lr: 1.17e-04, eta: 3:24:21	Time 4.048 (4.079)	Data 0.047 (0.077)	Mem 37.74GB	Prec@1 77.778 (88.295)	Loss 1.2635 (1.0985)
[02/24 05:17:26][INFO] train_vision.py:  544: Epoch: [15][140/209], lr: 1.17e-04, eta: 3:23:37	Time 4.073 (4.078)	Data 0.075 (0.076)	Mem 37.74GB	Prec@1 88.889 (88.258)	Loss 1.0690 (1.1016)
[02/24 05:18:06][INFO] train_vision.py:  544: Epoch: [15][150/209], lr: 1.16e-04, eta: 3:22:51	Time 4.051 (4.076)	Data 0.044 (0.075)	Mem 37.74GB	Prec@1 77.778 (88.300)	Loss 1.2538 (1.1026)
[02/24 05:18:47][INFO] train_vision.py:  544: Epoch: [15][160/209], lr: 1.16e-04, eta: 3:22:08	Time 4.066 (4.075)	Data 0.077 (0.074)	Mem 37.74GB	Prec@1 66.667 (88.130)	Loss 1.2776 (1.1062)
[02/24 05:19:27][INFO] train_vision.py:  544: Epoch: [15][170/209], lr: 1.15e-04, eta: 3:21:23	Time 4.047 (4.074)	Data 0.046 (0.073)	Mem 37.74GB	Prec@1 100.000 (88.304)	Loss 0.8848 (1.1041)
[02/24 05:20:08][INFO] train_vision.py:  544: Epoch: [15][180/209], lr: 1.15e-04, eta: 3:20:39	Time 4.067 (4.073)	Data 0.076 (0.072)	Mem 37.74GB	Prec@1 77.778 (88.214)	Loss 1.4002 (1.1023)
[02/24 05:20:48][INFO] train_vision.py:  544: Epoch: [15][190/209], lr: 1.14e-04, eta: 3:19:56	Time 4.051 (4.072)	Data 0.046 (0.072)	Mem 37.74GB	Prec@1 77.778 (88.133)	Loss 1.6353 (1.1012)
[02/24 05:21:29][INFO] train_vision.py:  544: Epoch: [15][200/209], lr: 1.14e-04, eta: 3:19:13	Time 4.059 (4.071)	Data 0.074 (0.071)	Mem 37.74GB	Prec@1 88.889 (88.281)	Loss 0.9697 (1.0967)
[02/24 05:22:08][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 87.500 (87.500)	Prec@5 98.611 (98.611)	mPrec@1 (55.139)	mPrec@5 (61.806)
[02/24 05:22:55][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 88.889 (85.732)	Prec@5 98.611 (97.980)	mPrec@1 (77.121)	mPrec@5 (89.971)
[02/24 05:23:42][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 80.556 (85.847)	Prec@5 100.000 (98.479)	mPrec@1 (77.288)	mPrec@5 (92.573)
[02/24 05:24:13][INFO] train_vision.py:  609: Overall Prec@1 85.273% Prec@5 98.431% mPrec@1 (80.465) mPrec@5 (96.695)
[02/24 05:24:13][INFO] train_vision.py:  454: Testing: 85.27327660317363/90.78947380776347
[02/24 05:24:13][INFO] train_vision.py:  455: Saving:
[02/24 05:24:24][INFO] train_vision.py:  544: Epoch: [16][0/209], lr: 1.13e-04, eta: 4:59:50	Time 6.146 (6.146)	Data 2.173 (2.173)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.4171 (1.4171)
[02/24 05:25:04][INFO] train_vision.py:  544: Epoch: [16][10/209], lr: 1.12e-04, eta: 3:27:01	Time 4.093 (4.258)	Data 0.097 (0.268)	Mem 37.74GB	Prec@1 66.667 (84.848)	Loss 1.3213 (1.1507)
[02/24 05:25:45][INFO] train_vision.py:  544: Epoch: [16][20/209], lr: 1.12e-04, eta: 3:21:50	Time 4.073 (4.166)	Data 0.072 (0.174)	Mem 37.74GB	Prec@1 88.889 (87.831)	Loss 1.5070 (1.1049)
[02/24 05:26:26][INFO] train_vision.py:  544: Epoch: [16][30/209], lr: 1.11e-04, eta: 3:19:35	Time 4.056 (4.134)	Data 0.054 (0.141)	Mem 37.74GB	Prec@1 88.889 (89.964)	Loss 1.2065 (1.0493)
[02/24 05:27:06][INFO] train_vision.py:  544: Epoch: [16][40/209], lr: 1.11e-04, eta: 3:18:03	Time 4.058 (4.116)	Data 0.078 (0.123)	Mem 37.74GB	Prec@1 88.889 (89.973)	Loss 1.2608 (1.0667)
[02/24 05:27:47][INFO] train_vision.py:  544: Epoch: [16][50/209], lr: 1.10e-04, eta: 3:16:45	Time 4.043 (4.103)	Data 0.050 (0.112)	Mem 37.74GB	Prec@1 77.778 (89.978)	Loss 1.3111 (1.0523)
[02/24 05:28:27][INFO] train_vision.py:  544: Epoch: [16][60/209], lr: 1.10e-04, eta: 3:15:40	Time 4.076 (4.095)	Data 0.079 (0.104)	Mem 37.74GB	Prec@1 77.778 (90.164)	Loss 1.2809 (1.0467)
[02/24 05:29:08][INFO] train_vision.py:  544: Epoch: [16][70/209], lr: 1.09e-04, eta: 3:14:42	Time 4.030 (4.089)	Data 0.046 (0.099)	Mem 37.74GB	Prec@1 77.778 (89.045)	Loss 1.2967 (1.0708)
[02/24 05:29:48][INFO] train_vision.py:  544: Epoch: [16][80/209], lr: 1.08e-04, eta: 3:13:47	Time 4.046 (4.084)	Data 0.065 (0.095)	Mem 37.74GB	Prec@1 77.778 (88.477)	Loss 0.9912 (1.0771)
[02/24 05:30:29][INFO] train_vision.py:  544: Epoch: [16][90/209], lr: 1.08e-04, eta: 3:12:55	Time 4.034 (4.080)	Data 0.049 (0.090)	Mem 37.74GB	Prec@1 66.667 (88.400)	Loss 1.4724 (1.0887)
[02/24 05:31:09][INFO] train_vision.py:  544: Epoch: [16][100/209], lr: 1.07e-04, eta: 3:12:05	Time 4.053 (4.077)	Data 0.064 (0.087)	Mem 37.74GB	Prec@1 88.889 (88.669)	Loss 1.0700 (1.0821)
[02/24 05:31:50][INFO] train_vision.py:  544: Epoch: [16][110/209], lr: 1.07e-04, eta: 3:11:16	Time 4.033 (4.074)	Data 0.049 (0.084)	Mem 37.74GB	Prec@1 88.889 (88.288)	Loss 1.0727 (1.0945)
[02/24 05:32:30][INFO] train_vision.py:  544: Epoch: [16][120/209], lr: 1.06e-04, eta: 3:10:30	Time 4.054 (4.072)	Data 0.064 (0.082)	Mem 37.74GB	Prec@1 88.889 (88.430)	Loss 1.0409 (1.0857)
[02/24 05:33:11][INFO] train_vision.py:  544: Epoch: [16][130/209], lr: 1.06e-04, eta: 3:09:44	Time 4.037 (4.070)	Data 0.048 (0.080)	Mem 37.74GB	Prec@1 88.889 (88.550)	Loss 1.1643 (1.0854)
[02/24 05:33:51][INFO] train_vision.py:  544: Epoch: [16][140/209], lr: 1.05e-04, eta: 3:09:00	Time 4.046 (4.069)	Data 0.063 (0.079)	Mem 37.74GB	Prec@1 100.000 (88.889)	Loss 0.8045 (1.0762)
[02/24 05:34:32][INFO] train_vision.py:  544: Epoch: [16][150/209], lr: 1.04e-04, eta: 3:08:15	Time 4.032 (4.067)	Data 0.050 (0.077)	Mem 37.74GB	Prec@1 88.889 (88.668)	Loss 0.8834 (1.0777)
[02/24 05:35:12][INFO] train_vision.py:  544: Epoch: [16][160/209], lr: 1.04e-04, eta: 3:07:31	Time 4.062 (4.066)	Data 0.065 (0.076)	Mem 37.74GB	Prec@1 100.000 (88.820)	Loss 0.7634 (1.0768)
[02/24 05:35:53][INFO] train_vision.py:  544: Epoch: [16][170/209], lr: 1.03e-04, eta: 3:06:47	Time 4.034 (4.065)	Data 0.048 (0.075)	Mem 37.74GB	Prec@1 88.889 (88.629)	Loss 1.1603 (1.0822)
[02/24 05:36:33][INFO] train_vision.py:  544: Epoch: [16][180/209], lr: 1.03e-04, eta: 3:06:05	Time 4.045 (4.065)	Data 0.063 (0.074)	Mem 37.74GB	Prec@1 100.000 (88.643)	Loss 0.8750 (1.0791)
[02/24 05:37:14][INFO] train_vision.py:  544: Epoch: [16][190/209], lr: 1.02e-04, eta: 3:05:22	Time 4.040 (4.064)	Data 0.050 (0.073)	Mem 37.74GB	Prec@1 88.889 (88.831)	Loss 1.0360 (1.0728)
[02/24 05:37:54][INFO] train_vision.py:  544: Epoch: [16][200/209], lr: 1.02e-04, eta: 3:04:39	Time 4.052 (4.063)	Data 0.062 (0.072)	Mem 37.74GB	Prec@1 88.889 (88.834)	Loss 1.0864 (1.0738)
[02/24 05:38:33][INFO] train_vision.py:  544: Epoch: [17][0/209], lr: 1.01e-04, eta: 5:07:51	Time 6.796 (6.796)	Data 2.355 (2.355)	Mem 37.74GB	Prec@1 66.667 (66.667)	Loss 1.5290 (1.5290)
[02/24 05:39:13][INFO] train_vision.py:  544: Epoch: [17][10/209], lr: 1.00e-04, eta: 3:13:51	Time 4.037 (4.295)	Data 0.055 (0.264)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.2533 (1.1131)
[02/24 05:39:54][INFO] train_vision.py:  544: Epoch: [17][20/209], lr: 9.99e-05, eta: 3:07:56	Time 4.066 (4.180)	Data 0.061 (0.166)	Mem 37.74GB	Prec@1 88.889 (91.005)	Loss 1.1160 (1.0631)
[02/24 05:40:34][INFO] train_vision.py:  544: Epoch: [17][30/209], lr: 9.93e-05, eta: 3:05:20	Time 4.038 (4.137)	Data 0.047 (0.130)	Mem 37.74GB	Prec@1 100.000 (91.756)	Loss 0.7447 (1.0285)
[02/24 05:41:15][INFO] train_vision.py:  544: Epoch: [17][40/209], lr: 9.88e-05, eta: 3:03:41	Time 4.044 (4.116)	Data 0.045 (0.112)	Mem 37.74GB	Prec@1 100.000 (91.599)	Loss 0.9114 (1.0239)
[02/24 05:41:55][INFO] train_vision.py:  544: Epoch: [17][50/209], lr: 9.82e-05, eta: 3:02:24	Time 4.040 (4.102)	Data 0.050 (0.101)	Mem 37.74GB	Prec@1 88.889 (91.285)	Loss 0.8376 (1.0354)
[02/24 05:42:36][INFO] train_vision.py:  544: Epoch: [17][60/209], lr: 9.76e-05, eta: 3:01:19	Time 4.049 (4.093)	Data 0.048 (0.094)	Mem 37.74GB	Prec@1 100.000 (91.257)	Loss 0.7566 (1.0197)
[02/24 05:43:16][INFO] train_vision.py:  544: Epoch: [17][70/209], lr: 9.71e-05, eta: 3:00:21	Time 4.040 (4.087)	Data 0.045 (0.089)	Mem 37.74GB	Prec@1 77.778 (91.236)	Loss 1.0124 (1.0184)
[02/24 05:43:57][INFO] train_vision.py:  544: Epoch: [17][80/209], lr: 9.65e-05, eta: 2:59:27	Time 4.047 (4.082)	Data 0.060 (0.085)	Mem 37.74GB	Prec@1 100.000 (90.946)	Loss 0.7752 (1.0265)
[02/24 05:44:37][INFO] train_vision.py:  544: Epoch: [17][90/209], lr: 9.59e-05, eta: 2:58:35	Time 4.035 (4.078)	Data 0.049 (0.082)	Mem 37.74GB	Prec@1 88.889 (90.476)	Loss 1.0849 (1.0343)
[02/24 05:45:17][INFO] train_vision.py:  544: Epoch: [17][100/209], lr: 9.53e-05, eta: 2:57:46	Time 4.046 (4.074)	Data 0.057 (0.079)	Mem 37.74GB	Prec@1 88.889 (90.319)	Loss 1.3193 (1.0398)
[02/24 05:45:58][INFO] train_vision.py:  544: Epoch: [17][110/209], lr: 9.48e-05, eta: 2:56:58	Time 4.043 (4.072)	Data 0.047 (0.077)	Mem 37.74GB	Prec@1 77.778 (90.190)	Loss 1.3795 (1.0401)
[02/24 05:46:38][INFO] train_vision.py:  544: Epoch: [17][120/209], lr: 9.42e-05, eta: 2:56:12	Time 4.047 (4.069)	Data 0.070 (0.076)	Mem 37.74GB	Prec@1 100.000 (90.266)	Loss 0.8060 (1.0388)
[02/24 05:47:19][INFO] train_vision.py:  544: Epoch: [17][130/209], lr: 9.36e-05, eta: 2:55:26	Time 4.036 (4.067)	Data 0.048 (0.074)	Mem 37.74GB	Prec@1 100.000 (90.500)	Loss 1.0189 (1.0344)
[02/24 05:47:59][INFO] train_vision.py:  544: Epoch: [17][140/209], lr: 9.31e-05, eta: 2:54:41	Time 4.049 (4.066)	Data 0.063 (0.073)	Mem 37.74GB	Prec@1 88.889 (89.677)	Loss 0.9103 (1.0567)
[02/24 05:48:40][INFO] train_vision.py:  544: Epoch: [17][150/209], lr: 9.25e-05, eta: 2:53:57	Time 4.035 (4.064)	Data 0.048 (0.072)	Mem 37.74GB	Prec@1 100.000 (89.698)	Loss 0.9306 (1.0579)
[02/24 05:49:20][INFO] train_vision.py:  544: Epoch: [17][160/209], lr: 9.19e-05, eta: 2:53:14	Time 4.052 (4.063)	Data 0.071 (0.072)	Mem 37.74GB	Prec@1 88.889 (89.786)	Loss 1.2433 (1.0547)
[02/24 05:50:01][INFO] train_vision.py:  544: Epoch: [17][170/209], lr: 9.13e-05, eta: 2:52:31	Time 4.051 (4.063)	Data 0.071 (0.071)	Mem 37.74GB	Prec@1 77.778 (89.604)	Loss 1.3415 (1.0580)
[02/24 05:50:41][INFO] train_vision.py:  544: Epoch: [17][180/209], lr: 9.08e-05, eta: 2:51:48	Time 4.038 (4.062)	Data 0.051 (0.070)	Mem 37.74GB	Prec@1 100.000 (89.994)	Loss 0.7498 (1.0461)
[02/24 05:51:22][INFO] train_vision.py:  544: Epoch: [17][190/209], lr: 9.02e-05, eta: 2:51:05	Time 4.034 (4.061)	Data 0.046 (0.069)	Mem 37.74GB	Prec@1 100.000 (90.111)	Loss 0.7576 (1.0427)
[02/24 05:52:02][INFO] train_vision.py:  544: Epoch: [17][200/209], lr: 8.96e-05, eta: 2:50:22	Time 4.037 (4.060)	Data 0.046 (0.068)	Mem 37.74GB	Prec@1 88.889 (90.160)	Loss 0.9120 (1.0420)
[02/24 05:52:41][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 94.444 (94.444)	Prec@5 98.611 (98.611)	mPrec@1 (59.306)	mPrec@5 (61.806)
[02/24 05:53:28][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 90.278 (90.530)	Prec@5 98.611 (98.232)	mPrec@1 (81.498)	mPrec@5 (90.306)
[02/24 05:54:15][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 90.278 (91.005)	Prec@5 98.611 (98.611)	mPrec@1 (81.682)	mPrec@5 (91.743)
[02/24 05:54:46][INFO] train_vision.py:  609: Overall Prec@1 91.093% Prec@5 98.684% mPrec@1 (85.342) mPrec@5 (96.278)
[02/24 05:54:46][INFO] train_vision.py:  454: Testing: 91.0931158336068/91.0931158336068
[02/24 05:54:46][INFO] train_vision.py:  455: Saving:
[02/24 05:54:59][INFO] train_vision.py:  544: Epoch: [18][0/209], lr: 8.91e-05, eta: 4:10:27	Time 5.989 (5.989)	Data 2.018 (2.018)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.0292 (1.0292)
[02/24 05:55:40][INFO] train_vision.py:  544: Epoch: [18][10/209], lr: 8.86e-05, eta: 2:55:55	Time 4.039 (4.224)	Data 0.040 (0.237)	Mem 37.74GB	Prec@1 88.889 (90.909)	Loss 0.8888 (0.9966)
[02/24 05:56:20][INFO] train_vision.py:  544: Epoch: [18][20/209], lr: 8.80e-05, eta: 2:51:57	Time 4.060 (4.145)	Data 0.079 (0.155)	Mem 37.74GB	Prec@1 66.667 (91.005)	Loss 1.3439 (1.0072)
[02/24 05:57:01][INFO] train_vision.py:  544: Epoch: [18][30/209], lr: 8.74e-05, eta: 2:50:07	Time 4.058 (4.117)	Data 0.038 (0.126)	Mem 37.74GB	Prec@1 100.000 (89.606)	Loss 0.8266 (1.0664)
[02/24 05:57:41][INFO] train_vision.py:  544: Epoch: [18][40/209], lr: 8.69e-05, eta: 2:48:45	Time 4.043 (4.101)	Data 0.060 (0.110)	Mem 37.74GB	Prec@1 100.000 (89.702)	Loss 0.7390 (1.0418)
[02/24 05:58:22][INFO] train_vision.py:  544: Epoch: [18][50/209], lr: 8.63e-05, eta: 2:47:37	Time 4.047 (4.090)	Data 0.056 (0.100)	Mem 37.74GB	Prec@1 88.889 (90.196)	Loss 1.0285 (1.0329)
[02/24 05:59:02][INFO] train_vision.py:  544: Epoch: [18][60/209], lr: 8.57e-05, eta: 2:46:37	Time 4.059 (4.082)	Data 0.072 (0.092)	Mem 37.74GB	Prec@1 88.889 (90.710)	Loss 1.0758 (1.0214)
[02/24 05:59:43][INFO] train_vision.py:  544: Epoch: [18][70/209], lr: 8.52e-05, eta: 2:45:44	Time 4.042 (4.077)	Data 0.044 (0.087)	Mem 37.74GB	Prec@1 100.000 (90.297)	Loss 0.7463 (1.0159)
[02/24 06:00:23][INFO] train_vision.py:  544: Epoch: [18][80/209], lr: 8.46e-05, eta: 2:44:54	Time 4.051 (4.073)	Data 0.067 (0.083)	Mem 37.74GB	Prec@1 100.000 (90.809)	Loss 0.8162 (1.0080)
[02/24 06:01:03][INFO] train_vision.py:  544: Epoch: [18][90/209], lr: 8.40e-05, eta: 2:44:06	Time 4.047 (4.070)	Data 0.043 (0.080)	Mem 37.74GB	Prec@1 100.000 (90.598)	Loss 0.8033 (1.0110)
[02/24 06:01:44][INFO] train_vision.py:  544: Epoch: [18][100/209], lr: 8.35e-05, eta: 2:43:20	Time 4.053 (4.068)	Data 0.067 (0.077)	Mem 37.74GB	Prec@1 100.000 (90.209)	Loss 0.8090 (1.0133)
[02/24 06:02:24][INFO] train_vision.py:  544: Epoch: [18][110/209], lr: 8.29e-05, eta: 2:42:35	Time 4.053 (4.067)	Data 0.058 (0.076)	Mem 37.74GB	Prec@1 88.889 (90.390)	Loss 1.1123 (1.0105)
[02/24 06:03:05][INFO] train_vision.py:  544: Epoch: [18][120/209], lr: 8.23e-05, eta: 2:41:51	Time 4.052 (4.065)	Data 0.057 (0.074)	Mem 37.74GB	Prec@1 100.000 (90.450)	Loss 0.7540 (1.0095)
[02/24 06:03:45][INFO] train_vision.py:  544: Epoch: [18][130/209], lr: 8.18e-05, eta: 2:41:07	Time 4.051 (4.064)	Data 0.060 (0.073)	Mem 37.74GB	Prec@1 100.000 (90.246)	Loss 0.7480 (1.0161)
[02/24 06:04:26][INFO] train_vision.py:  544: Epoch: [18][140/209], lr: 8.12e-05, eta: 2:40:24	Time 4.044 (4.063)	Data 0.063 (0.072)	Mem 37.74GB	Prec@1 100.000 (90.307)	Loss 0.8433 (1.0140)
[02/24 06:05:06][INFO] train_vision.py:  544: Epoch: [18][150/209], lr: 8.07e-05, eta: 2:39:42	Time 4.046 (4.062)	Data 0.044 (0.071)	Mem 37.74GB	Prec@1 88.889 (90.213)	Loss 1.1947 (1.0130)
[02/24 06:05:47][INFO] train_vision.py:  544: Epoch: [18][160/209], lr: 8.01e-05, eta: 2:38:59	Time 4.050 (4.061)	Data 0.068 (0.070)	Mem 37.74GB	Prec@1 100.000 (90.476)	Loss 0.8663 (1.0085)
[02/24 06:06:27][INFO] train_vision.py:  544: Epoch: [18][170/209], lr: 7.95e-05, eta: 2:38:17	Time 4.044 (4.060)	Data 0.059 (0.070)	Mem 37.74GB	Prec@1 88.889 (90.578)	Loss 1.3634 (1.0096)
[02/24 06:07:08][INFO] train_vision.py:  544: Epoch: [18][180/209], lr: 7.90e-05, eta: 2:37:35	Time 4.041 (4.060)	Data 0.049 (0.069)	Mem 37.74GB	Prec@1 88.889 (90.546)	Loss 1.1927 (1.0100)
[02/24 06:07:48][INFO] train_vision.py:  544: Epoch: [18][190/209], lr: 7.84e-05, eta: 2:36:53	Time 4.051 (4.059)	Data 0.044 (0.068)	Mem 37.74GB	Prec@1 77.778 (90.634)	Loss 1.4267 (1.0109)
[02/24 06:08:29][INFO] train_vision.py:  544: Epoch: [18][200/209], lr: 7.79e-05, eta: 2:36:10	Time 4.037 (4.058)	Data 0.048 (0.067)	Mem 37.74GB	Prec@1 77.778 (90.326)	Loss 1.2097 (1.0200)
[02/24 06:09:07][INFO] train_vision.py:  544: Epoch: [19][0/209], lr: 7.73e-05, eta: 4:15:38	Time 6.669 (6.669)	Data 2.278 (2.278)	Mem 37.74GB	Prec@1 100.000 (100.000)	Loss 0.7655 (0.7655)
[02/24 06:09:48][INFO] train_vision.py:  544: Epoch: [19][10/209], lr: 7.68e-05, eta: 2:43:40	Time 4.057 (4.289)	Data 0.048 (0.256)	Mem 37.74GB	Prec@1 100.000 (89.899)	Loss 0.7461 (1.0281)
[02/24 06:10:28][INFO] train_vision.py:  544: Epoch: [19][20/209], lr: 7.63e-05, eta: 2:38:41	Time 4.049 (4.176)	Data 0.058 (0.162)	Mem 37.74GB	Prec@1 88.889 (92.063)	Loss 1.0768 (0.9918)
[02/24 06:11:09][INFO] train_vision.py:  544: Epoch: [19][30/209], lr: 7.57e-05, eta: 2:36:35	Time 4.054 (4.139)	Data 0.053 (0.130)	Mem 37.74GB	Prec@1 100.000 (93.190)	Loss 0.7762 (0.9526)
[02/24 06:11:49][INFO] train_vision.py:  544: Epoch: [19][40/209], lr: 7.51e-05, eta: 2:35:08	Time 4.051 (4.119)	Data 0.068 (0.114)	Mem 37.74GB	Prec@1 88.889 (91.057)	Loss 1.3760 (1.0091)
[02/24 06:12:30][INFO] train_vision.py:  544: Epoch: [19][50/209], lr: 7.46e-05, eta: 2:34:01	Time 4.070 (4.107)	Data 0.065 (0.104)	Mem 37.74GB	Prec@1 88.889 (91.285)	Loss 1.0100 (1.0070)
[02/24 06:13:11][INFO] train_vision.py:  544: Epoch: [19][60/209], lr: 7.40e-05, eta: 2:33:01	Time 4.050 (4.099)	Data 0.066 (0.098)	Mem 37.74GB	Prec@1 100.000 (91.621)	Loss 0.9146 (0.9989)
[02/24 06:13:51][INFO] train_vision.py:  544: Epoch: [19][70/209], lr: 7.35e-05, eta: 2:32:08	Time 4.062 (4.093)	Data 0.067 (0.093)	Mem 37.74GB	Prec@1 88.889 (92.019)	Loss 1.0938 (0.9877)
[02/24 06:14:32][INFO] train_vision.py:  544: Epoch: [19][80/209], lr: 7.29e-05, eta: 2:31:17	Time 4.060 (4.089)	Data 0.067 (0.090)	Mem 37.74GB	Prec@1 77.778 (91.632)	Loss 1.2697 (0.9987)
[02/24 06:15:12][INFO] train_vision.py:  544: Epoch: [19][90/209], lr: 7.24e-05, eta: 2:30:30	Time 4.066 (4.086)	Data 0.064 (0.087)	Mem 37.74GB	Prec@1 88.889 (90.965)	Loss 0.9635 (1.0163)
[02/24 06:15:53][INFO] train_vision.py:  544: Epoch: [19][100/209], lr: 7.18e-05, eta: 2:29:43	Time 4.062 (4.084)	Data 0.047 (0.085)	Mem 37.74GB	Prec@1 88.889 (90.869)	Loss 1.0453 (1.0225)
[02/24 06:16:34][INFO] train_vision.py:  544: Epoch: [19][110/209], lr: 7.13e-05, eta: 2:28:58	Time 4.064 (4.082)	Data 0.067 (0.083)	Mem 37.74GB	Prec@1 88.889 (90.591)	Loss 1.2180 (1.0315)
[02/24 06:17:14][INFO] train_vision.py:  544: Epoch: [19][120/209], lr: 7.08e-05, eta: 2:28:14	Time 4.055 (4.080)	Data 0.050 (0.081)	Mem 37.74GB	Prec@1 66.667 (90.634)	Loss 1.6021 (1.0280)
[02/24 06:17:55][INFO] train_vision.py:  544: Epoch: [19][130/209], lr: 7.02e-05, eta: 2:27:30	Time 4.074 (4.079)	Data 0.064 (0.080)	Mem 37.74GB	Prec@1 88.889 (90.755)	Loss 1.0336 (1.0214)
[02/24 06:18:35][INFO] train_vision.py:  544: Epoch: [19][140/209], lr: 6.97e-05, eta: 2:26:47	Time 4.059 (4.077)	Data 0.065 (0.079)	Mem 37.74GB	Prec@1 77.778 (90.859)	Loss 1.3114 (1.0168)
[02/24 06:19:16][INFO] train_vision.py:  544: Epoch: [19][150/209], lr: 6.91e-05, eta: 2:26:03	Time 4.067 (4.076)	Data 0.067 (0.078)	Mem 37.74GB	Prec@1 88.889 (90.876)	Loss 1.0437 (1.0194)
[02/24 06:19:57][INFO] train_vision.py:  544: Epoch: [19][160/209], lr: 6.86e-05, eta: 2:25:20	Time 4.056 (4.075)	Data 0.062 (0.077)	Mem 37.74GB	Prec@1 100.000 (90.614)	Loss 0.8259 (1.0210)
[02/24 06:20:37][INFO] train_vision.py:  544: Epoch: [19][170/209], lr: 6.80e-05, eta: 2:24:38	Time 4.068 (4.074)	Data 0.066 (0.076)	Mem 37.74GB	Prec@1 100.000 (90.643)	Loss 0.7533 (1.0168)
[02/24 06:21:18][INFO] train_vision.py:  544: Epoch: [19][180/209], lr: 6.75e-05, eta: 2:23:55	Time 4.047 (4.073)	Data 0.068 (0.076)	Mem 37.74GB	Prec@1 100.000 (90.669)	Loss 0.7367 (1.0122)
[02/24 06:21:58][INFO] train_vision.py:  544: Epoch: [19][190/209], lr: 6.70e-05, eta: 2:23:13	Time 4.066 (4.073)	Data 0.065 (0.075)	Mem 37.74GB	Prec@1 88.889 (90.692)	Loss 1.0838 (1.0085)
[02/24 06:22:39][INFO] train_vision.py:  544: Epoch: [19][200/209], lr: 6.64e-05, eta: 2:22:31	Time 4.056 (4.072)	Data 0.067 (0.075)	Mem 37.74GB	Prec@1 77.778 (90.547)	Loss 1.4313 (1.0126)
[02/24 06:23:18][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 91.667 (91.667)	Prec@5 97.222 (97.222)	mPrec@1 (56.181)	mPrec@5 (61.597)
[02/24 06:24:05][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 90.278 (91.414)	Prec@5 97.222 (98.232)	mPrec@1 (82.980)	mPrec@5 (90.399)
[02/24 06:24:52][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 90.278 (91.601)	Prec@5 100.000 (98.611)	mPrec@1 (83.347)	mPrec@5 (91.848)
[02/24 06:25:23][INFO] train_vision.py:  609: Overall Prec@1 91.549% Prec@5 98.482% mPrec@1 (85.940) mPrec@5 (96.149)
[02/24 06:25:23][INFO] train_vision.py:  454: Testing: 91.54858293417495/91.54858293417495
[02/24 06:25:23][INFO] train_vision.py:  455: Saving:
[02/24 06:25:36][INFO] train_vision.py:  544: Epoch: [20][0/209], lr: 6.59e-05, eta: 3:29:19	Time 6.006 (6.006)	Data 2.049 (2.049)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 0.9858 (0.9858)
[02/24 06:26:17][INFO] train_vision.py:  544: Epoch: [20][10/209], lr: 6.54e-05, eta: 2:26:29	Time 4.061 (4.224)	Data 0.077 (0.244)	Mem 37.74GB	Prec@1 88.889 (82.828)	Loss 0.9867 (1.1567)
[02/24 06:26:57][INFO] train_vision.py:  544: Epoch: [20][20/209], lr: 6.49e-05, eta: 2:22:49	Time 4.036 (4.138)	Data 0.052 (0.155)	Mem 37.74GB	Prec@1 88.889 (87.302)	Loss 0.9085 (1.0609)
[02/24 06:27:38][INFO] train_vision.py:  544: Epoch: [20][30/209], lr: 6.43e-05, eta: 2:21:06	Time 4.044 (4.108)	Data 0.055 (0.122)	Mem 37.74GB	Prec@1 77.778 (88.530)	Loss 1.1652 (1.0321)
[02/24 06:28:18][INFO] train_vision.py:  544: Epoch: [20][40/209], lr: 6.38e-05, eta: 2:19:54	Time 4.036 (4.093)	Data 0.044 (0.106)	Mem 37.74GB	Prec@1 100.000 (90.244)	Loss 0.7380 (0.9947)
[02/24 06:28:59][INFO] train_vision.py:  544: Epoch: [20][50/209], lr: 6.33e-05, eta: 2:18:54	Time 4.038 (4.084)	Data 0.058 (0.096)	Mem 37.74GB	Prec@1 100.000 (91.285)	Loss 0.7534 (0.9720)
[02/24 06:29:39][INFO] train_vision.py:  544: Epoch: [20][60/209], lr: 6.28e-05, eta: 2:18:02	Time 4.053 (4.078)	Data 0.036 (0.088)	Mem 37.74GB	Prec@1 55.556 (91.257)	Loss 1.4175 (0.9679)
[02/24 06:30:20][INFO] train_vision.py:  544: Epoch: [20][70/209], lr: 6.22e-05, eta: 2:17:12	Time 4.052 (4.074)	Data 0.055 (0.083)	Mem 37.74GB	Prec@1 100.000 (91.549)	Loss 0.7384 (0.9656)
[02/24 06:31:00][INFO] train_vision.py:  544: Epoch: [20][80/209], lr: 6.17e-05, eta: 2:16:25	Time 4.048 (4.070)	Data 0.052 (0.078)	Mem 37.74GB	Prec@1 88.889 (91.358)	Loss 1.0647 (0.9704)
[02/24 06:31:41][INFO] train_vision.py:  544: Epoch: [20][90/209], lr: 6.12e-05, eta: 2:15:39	Time 4.047 (4.068)	Data 0.059 (0.076)	Mem 37.74GB	Prec@1 100.000 (91.697)	Loss 0.8861 (0.9705)
[02/24 06:32:21][INFO] train_vision.py:  544: Epoch: [20][100/209], lr: 6.07e-05, eta: 2:14:54	Time 4.048 (4.066)	Data 0.047 (0.073)	Mem 37.74GB	Prec@1 88.889 (91.639)	Loss 1.1397 (0.9718)
[02/24 06:33:01][INFO] train_vision.py:  544: Epoch: [20][110/209], lr: 6.01e-05, eta: 2:14:10	Time 4.043 (4.064)	Data 0.047 (0.071)	Mem 37.74GB	Prec@1 88.889 (90.891)	Loss 0.9731 (0.9941)
[02/24 06:33:42][INFO] train_vision.py:  544: Epoch: [20][120/209], lr: 5.96e-05, eta: 2:13:26	Time 4.045 (4.062)	Data 0.046 (0.069)	Mem 37.74GB	Prec@1 100.000 (90.634)	Loss 0.7935 (1.0013)
[02/24 06:34:22][INFO] train_vision.py:  544: Epoch: [20][130/209], lr: 5.91e-05, eta: 2:12:43	Time 4.045 (4.061)	Data 0.057 (0.068)	Mem 37.74GB	Prec@1 77.778 (90.161)	Loss 1.2728 (1.0142)
[02/24 06:35:03][INFO] train_vision.py:  544: Epoch: [20][140/209], lr: 5.86e-05, eta: 2:11:59	Time 4.037 (4.059)	Data 0.048 (0.067)	Mem 37.74GB	Prec@1 88.889 (90.386)	Loss 1.1682 (1.0133)
[02/24 06:35:43][INFO] train_vision.py:  544: Epoch: [20][150/209], lr: 5.81e-05, eta: 2:11:17	Time 4.038 (4.058)	Data 0.048 (0.066)	Mem 37.74GB	Prec@1 88.889 (90.655)	Loss 0.8778 (1.0062)
[02/24 06:36:24][INFO] train_vision.py:  544: Epoch: [20][160/209], lr: 5.75e-05, eta: 2:10:35	Time 4.046 (4.058)	Data 0.048 (0.065)	Mem 37.74GB	Prec@1 100.000 (90.614)	Loss 0.7504 (1.0044)
[02/24 06:37:04][INFO] train_vision.py:  544: Epoch: [20][170/209], lr: 5.70e-05, eta: 2:09:53	Time 4.040 (4.057)	Data 0.046 (0.064)	Mem 37.74GB	Prec@1 100.000 (90.838)	Loss 0.7443 (0.9992)
[02/24 06:37:44][INFO] train_vision.py:  544: Epoch: [20][180/209], lr: 5.65e-05, eta: 2:09:10	Time 4.036 (4.056)	Data 0.051 (0.064)	Mem 37.74GB	Prec@1 77.778 (90.853)	Loss 1.2997 (0.9968)
[02/24 06:38:25][INFO] train_vision.py:  544: Epoch: [20][190/209], lr: 5.60e-05, eta: 2:08:28	Time 4.040 (4.055)	Data 0.047 (0.063)	Mem 37.74GB	Prec@1 88.889 (90.867)	Loss 0.9709 (0.9971)
[02/24 06:39:05][INFO] train_vision.py:  544: Epoch: [20][200/209], lr: 5.55e-05, eta: 2:07:46	Time 4.038 (4.054)	Data 0.047 (0.063)	Mem 37.74GB	Prec@1 88.889 (90.879)	Loss 1.1395 (0.9985)
[02/24 06:39:44][INFO] train_vision.py:  544: Epoch: [21][0/209], lr: 5.50e-05, eta: 3:30:16	Time 6.704 (6.704)	Data 2.404 (2.404)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 0.8590 (0.8590)
[02/24 06:40:24][INFO] train_vision.py:  544: Epoch: [21][10/209], lr: 5.45e-05, eta: 2:14:07	Time 4.076 (4.299)	Data 0.085 (0.275)	Mem 37.74GB	Prec@1 66.667 (87.879)	Loss 1.4115 (1.0399)
[02/24 06:41:05][INFO] train_vision.py:  544: Epoch: [21][20/209], lr: 5.40e-05, eta: 2:09:54	Time 4.069 (4.186)	Data 0.078 (0.174)	Mem 37.74GB	Prec@1 100.000 (91.005)	Loss 0.7405 (0.9928)
[02/24 06:41:46][INFO] train_vision.py:  544: Epoch: [21][30/209], lr: 5.35e-05, eta: 2:07:57	Time 4.075 (4.146)	Data 0.083 (0.140)	Mem 37.74GB	Prec@1 100.000 (91.039)	Loss 0.8622 (0.9896)
[02/24 06:42:26][INFO] train_vision.py:  544: Epoch: [21][40/209], lr: 5.30e-05, eta: 2:06:36	Time 4.073 (4.124)	Data 0.076 (0.120)	Mem 37.74GB	Prec@1 100.000 (91.328)	Loss 0.8200 (0.9899)
[02/24 06:43:07][INFO] train_vision.py:  544: Epoch: [21][50/209], lr: 5.25e-05, eta: 2:05:33	Time 4.071 (4.112)	Data 0.078 (0.110)	Mem 37.74GB	Prec@1 66.667 (91.503)	Loss 1.2207 (0.9715)
[02/24 06:43:47][INFO] train_vision.py:  544: Epoch: [21][60/209], lr: 5.20e-05, eta: 2:04:34	Time 4.069 (4.102)	Data 0.079 (0.101)	Mem 37.74GB	Prec@1 100.000 (91.439)	Loss 0.8404 (0.9717)
[02/24 06:44:28][INFO] train_vision.py:  544: Epoch: [21][70/209], lr: 5.15e-05, eta: 2:03:41	Time 4.063 (4.096)	Data 0.060 (0.095)	Mem 37.74GB	Prec@1 88.889 (90.610)	Loss 1.0311 (0.9797)
[02/24 06:45:08][INFO] train_vision.py:  544: Epoch: [21][80/209], lr: 5.10e-05, eta: 2:02:50	Time 4.043 (4.090)	Data 0.058 (0.091)	Mem 37.74GB	Prec@1 100.000 (89.575)	Loss 0.7494 (1.0008)
[02/24 06:45:49][INFO] train_vision.py:  544: Epoch: [21][90/209], lr: 5.05e-05, eta: 2:02:02	Time 4.052 (4.086)	Data 0.059 (0.087)	Mem 37.74GB	Prec@1 100.000 (89.744)	Loss 0.8838 (0.9996)
[02/24 06:46:29][INFO] train_vision.py:  544: Epoch: [21][100/209], lr: 5.01e-05, eta: 2:01:16	Time 4.049 (4.083)	Data 0.058 (0.085)	Mem 37.74GB	Prec@1 88.889 (90.099)	Loss 1.0551 (0.9932)
[02/24 06:47:10][INFO] train_vision.py:  544: Epoch: [21][110/209], lr: 4.96e-05, eta: 2:00:30	Time 4.048 (4.080)	Data 0.057 (0.082)	Mem 37.74GB	Prec@1 100.000 (90.090)	Loss 0.7778 (0.9932)
[02/24 06:47:51][INFO] train_vision.py:  544: Epoch: [21][120/209], lr: 4.91e-05, eta: 1:59:45	Time 4.046 (4.078)	Data 0.059 (0.080)	Mem 37.74GB	Prec@1 88.889 (90.450)	Loss 1.1963 (0.9883)
[02/24 06:48:31][INFO] train_vision.py:  544: Epoch: [21][130/209], lr: 4.86e-05, eta: 1:59:01	Time 4.058 (4.076)	Data 0.062 (0.079)	Mem 37.74GB	Prec@1 66.667 (90.161)	Loss 1.5937 (0.9941)
[02/24 06:49:12][INFO] train_vision.py:  544: Epoch: [21][140/209], lr: 4.81e-05, eta: 1:58:17	Time 4.047 (4.074)	Data 0.059 (0.078)	Mem 37.74GB	Prec@1 100.000 (90.229)	Loss 0.7328 (0.9915)
[02/24 06:49:52][INFO] train_vision.py:  544: Epoch: [21][150/209], lr: 4.76e-05, eta: 1:57:34	Time 4.055 (4.073)	Data 0.057 (0.076)	Mem 37.74GB	Prec@1 88.889 (90.508)	Loss 0.9300 (0.9870)
[02/24 06:50:33][INFO] train_vision.py:  544: Epoch: [21][160/209], lr: 4.71e-05, eta: 1:56:51	Time 4.045 (4.072)	Data 0.059 (0.075)	Mem 37.74GB	Prec@1 88.889 (90.476)	Loss 1.0199 (0.9892)
[02/24 06:51:13][INFO] train_vision.py:  544: Epoch: [21][170/209], lr: 4.67e-05, eta: 1:56:08	Time 4.056 (4.070)	Data 0.058 (0.074)	Mem 37.74GB	Prec@1 100.000 (90.578)	Loss 0.9101 (0.9891)
[02/24 06:51:54][INFO] train_vision.py:  544: Epoch: [21][180/209], lr: 4.62e-05, eta: 1:55:25	Time 4.045 (4.069)	Data 0.059 (0.074)	Mem 37.74GB	Prec@1 100.000 (90.485)	Loss 0.7480 (0.9882)
[02/24 06:52:34][INFO] train_vision.py:  544: Epoch: [21][190/209], lr: 4.57e-05, eta: 1:54:43	Time 4.064 (4.068)	Data 0.058 (0.073)	Mem 37.74GB	Prec@1 66.667 (90.460)	Loss 1.2970 (0.9906)
[02/24 06:53:15][INFO] train_vision.py:  544: Epoch: [21][200/209], lr: 4.52e-05, eta: 1:54:01	Time 4.050 (4.067)	Data 0.071 (0.072)	Mem 37.74GB	Prec@1 100.000 (90.603)	Loss 0.9325 (0.9886)
[02/24 06:53:54][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 93.056 (93.056)	Prec@5 97.222 (97.222)	mPrec@1 (58.958)	mPrec@5 (61.597)
[02/24 06:54:41][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 87.500 (92.172)	Prec@5 98.611 (98.359)	mPrec@1 (84.846)	mPrec@5 (90.434)
[02/24 06:55:28][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 90.278 (92.923)	Prec@5 100.000 (98.810)	mPrec@1 (85.852)	mPrec@5 (91.720)
[02/24 06:55:59][INFO] train_vision.py:  609: Overall Prec@1 92.662% Prec@5 98.836% mPrec@1 (88.345) mPrec@5 (96.248)
[02/24 06:55:59][INFO] train_vision.py:  454: Testing: 92.66194304184393/92.66194304184393
[02/24 06:55:59][INFO] train_vision.py:  455: Saving:
[02/24 06:56:14][INFO] train_vision.py:  544: Epoch: [22][0/209], lr: 4.48e-05, eta: 2:51:44	Time 6.160 (6.160)	Data 2.194 (2.194)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 0.9325 (0.9325)
[02/24 06:56:54][INFO] train_vision.py:  544: Epoch: [22][10/209], lr: 4.43e-05, eta: 1:57:44	Time 4.056 (4.248)	Data 0.063 (0.261)	Mem 37.74GB	Prec@1 100.000 (93.939)	Loss 0.7274 (0.9017)
[02/24 06:57:35][INFO] train_vision.py:  544: Epoch: [22][20/209], lr: 4.39e-05, eta: 1:54:35	Time 4.068 (4.160)	Data 0.065 (0.165)	Mem 37.74GB	Prec@1 100.000 (92.063)	Loss 0.7368 (0.9284)
[02/24 06:58:15][INFO] train_vision.py:  544: Epoch: [22][30/209], lr: 4.34e-05, eta: 1:53:02	Time 4.063 (4.128)	Data 0.060 (0.131)	Mem 37.74GB	Prec@1 88.889 (92.115)	Loss 0.8518 (0.9464)
[02/24 06:58:56][INFO] train_vision.py:  544: Epoch: [22][40/209], lr: 4.29e-05, eta: 1:51:56	Time 4.060 (4.113)	Data 0.058 (0.114)	Mem 37.74GB	Prec@1 88.889 (91.057)	Loss 0.9612 (0.9651)
[02/24 06:59:37][INFO] train_vision.py:  544: Epoch: [22][50/209], lr: 4.25e-05, eta: 1:51:02	Time 4.053 (4.105)	Data 0.060 (0.104)	Mem 37.74GB	Prec@1 100.000 (91.285)	Loss 0.7526 (0.9710)
[02/24 07:00:18][INFO] train_vision.py:  544: Epoch: [22][60/209], lr: 4.20e-05, eta: 1:50:13	Time 4.075 (4.100)	Data 0.064 (0.098)	Mem 37.74GB	Prec@1 88.889 (90.710)	Loss 1.1958 (0.9861)
[02/24 07:00:58][INFO] train_vision.py:  544: Epoch: [22][70/209], lr: 4.16e-05, eta: 1:49:24	Time 4.075 (4.095)	Data 0.065 (0.093)	Mem 37.74GB	Prec@1 100.000 (89.828)	Loss 0.8716 (1.0068)
[02/24 07:01:39][INFO] train_vision.py:  544: Epoch: [22][80/209], lr: 4.11e-05, eta: 1:48:37	Time 4.061 (4.092)	Data 0.058 (0.089)	Mem 37.74GB	Prec@1 100.000 (90.261)	Loss 0.8068 (0.9988)
[02/24 07:02:20][INFO] train_vision.py:  544: Epoch: [22][90/209], lr: 4.06e-05, eta: 1:47:52	Time 4.066 (4.089)	Data 0.066 (0.087)	Mem 37.74GB	Prec@1 100.000 (89.866)	Loss 0.7460 (1.0086)
[02/24 07:03:00][INFO] train_vision.py:  544: Epoch: [22][100/209], lr: 4.02e-05, eta: 1:47:09	Time 4.079 (4.088)	Data 0.058 (0.084)	Mem 37.74GB	Prec@1 88.889 (89.989)	Loss 0.8214 (1.0054)
[02/24 07:03:41][INFO] train_vision.py:  544: Epoch: [22][110/209], lr: 3.97e-05, eta: 1:46:26	Time 4.063 (4.086)	Data 0.066 (0.083)	Mem 37.74GB	Prec@1 100.000 (90.290)	Loss 0.7858 (0.9980)
[02/24 07:04:22][INFO] train_vision.py:  544: Epoch: [22][120/209], lr: 3.93e-05, eta: 1:45:43	Time 4.058 (4.085)	Data 0.059 (0.081)	Mem 37.74GB	Prec@1 100.000 (90.450)	Loss 0.8445 (0.9979)
[02/24 07:05:02][INFO] train_vision.py:  544: Epoch: [22][130/209], lr: 3.88e-05, eta: 1:45:00	Time 4.063 (4.083)	Data 0.065 (0.080)	Mem 37.74GB	Prec@1 88.889 (90.670)	Loss 1.2350 (0.9952)
[02/24 07:05:43][INFO] train_vision.py:  544: Epoch: [22][140/209], lr: 3.84e-05, eta: 1:44:18	Time 4.068 (4.083)	Data 0.066 (0.079)	Mem 37.74GB	Prec@1 88.889 (90.701)	Loss 0.9279 (0.9994)
[02/24 07:06:24][INFO] train_vision.py:  544: Epoch: [22][150/209], lr: 3.80e-05, eta: 1:43:36	Time 4.061 (4.082)	Data 0.064 (0.078)	Mem 37.74GB	Prec@1 88.889 (91.023)	Loss 0.9097 (0.9931)
[02/24 07:07:04][INFO] train_vision.py:  544: Epoch: [22][160/209], lr: 3.75e-05, eta: 1:42:53	Time 4.065 (4.080)	Data 0.057 (0.077)	Mem 37.74GB	Prec@1 77.778 (91.166)	Loss 1.2550 (0.9894)
[02/24 07:07:45][INFO] train_vision.py:  544: Epoch: [22][170/209], lr: 3.71e-05, eta: 1:42:12	Time 4.064 (4.080)	Data 0.063 (0.076)	Mem 37.74GB	Prec@1 100.000 (91.098)	Loss 0.7456 (0.9874)
[02/24 07:08:26][INFO] train_vision.py:  544: Epoch: [22][180/209], lr: 3.66e-05, eta: 1:41:30	Time 4.065 (4.080)	Data 0.065 (0.075)	Mem 37.74GB	Prec@1 66.667 (91.037)	Loss 1.1630 (0.9871)
[02/24 07:09:07][INFO] train_vision.py:  544: Epoch: [22][190/209], lr: 3.62e-05, eta: 1:40:48	Time 4.060 (4.079)	Data 0.065 (0.075)	Mem 37.74GB	Prec@1 100.000 (91.274)	Loss 0.7324 (0.9824)
[02/24 07:09:47][INFO] train_vision.py:  544: Epoch: [22][200/209], lr: 3.58e-05, eta: 1:40:07	Time 4.072 (4.078)	Data 0.066 (0.074)	Mem 37.74GB	Prec@1 100.000 (91.487)	Loss 0.7725 (0.9751)
[02/24 07:10:26][INFO] train_vision.py:  544: Epoch: [23][0/209], lr: 3.54e-05, eta: 2:41:27	Time 6.617 (6.617)	Data 2.530 (2.530)	Mem 37.74GB	Prec@1 77.778 (77.778)	Loss 1.7042 (1.7042)
[02/24 07:11:07][INFO] train_vision.py:  544: Epoch: [23][10/209], lr: 3.50e-05, eta: 1:44:26	Time 4.083 (4.310)	Data 0.090 (0.299)	Mem 37.74GB	Prec@1 88.889 (92.929)	Loss 0.8395 (1.0011)
[02/24 07:11:48][INFO] train_vision.py:  544: Epoch: [23][20/209], lr: 3.45e-05, eta: 1:41:02	Time 4.074 (4.198)	Data 0.075 (0.192)	Mem 37.74GB	Prec@1 100.000 (90.476)	Loss 0.7653 (1.0039)
[02/24 07:12:28][INFO] train_vision.py:  544: Epoch: [23][30/209], lr: 3.41e-05, eta: 1:39:17	Time 4.046 (4.155)	Data 0.043 (0.146)	Mem 37.74GB	Prec@1 100.000 (89.606)	Loss 0.7648 (1.0042)
[02/24 07:13:09][INFO] train_vision.py:  544: Epoch: [23][40/209], lr: 3.37e-05, eta: 1:38:03	Time 4.070 (4.132)	Data 0.044 (0.122)	Mem 37.74GB	Prec@1 66.667 (90.786)	Loss 1.1297 (0.9693)
[02/24 07:13:49][INFO] train_vision.py:  544: Epoch: [23][50/209], lr: 3.33e-05, eta: 1:37:01	Time 4.055 (4.117)	Data 0.039 (0.108)	Mem 37.74GB	Prec@1 77.778 (91.721)	Loss 1.1076 (0.9510)
[02/24 07:14:30][INFO] train_vision.py:  544: Epoch: [23][60/209], lr: 3.29e-05, eta: 1:36:05	Time 4.057 (4.107)	Data 0.061 (0.098)	Mem 37.74GB	Prec@1 100.000 (91.439)	Loss 0.9527 (0.9563)
[02/24 07:15:10][INFO] train_vision.py:  544: Epoch: [23][70/209], lr: 3.24e-05, eta: 1:35:15	Time 4.065 (4.100)	Data 0.046 (0.091)	Mem 37.74GB	Prec@1 88.889 (90.767)	Loss 1.0162 (0.9757)
[02/24 07:15:51][INFO] train_vision.py:  544: Epoch: [23][80/209], lr: 3.20e-05, eta: 1:34:27	Time 4.066 (4.095)	Data 0.058 (0.087)	Mem 37.74GB	Prec@1 77.778 (90.535)	Loss 1.1610 (0.9804)
[02/24 07:16:32][INFO] train_vision.py:  544: Epoch: [23][90/209], lr: 3.16e-05, eta: 1:33:41	Time 4.058 (4.091)	Data 0.044 (0.083)	Mem 37.74GB	Prec@1 100.000 (90.842)	Loss 0.7360 (0.9715)
[02/24 07:17:12][INFO] train_vision.py:  544: Epoch: [23][100/209], lr: 3.12e-05, eta: 1:32:56	Time 4.069 (4.088)	Data 0.063 (0.081)	Mem 37.74GB	Prec@1 100.000 (91.089)	Loss 0.8063 (0.9637)
[02/24 07:17:53][INFO] train_vision.py:  544: Epoch: [23][110/209], lr: 3.08e-05, eta: 1:32:11	Time 4.065 (4.085)	Data 0.049 (0.078)	Mem 37.74GB	Prec@1 100.000 (91.391)	Loss 0.8953 (0.9582)
[02/24 07:18:33][INFO] train_vision.py:  544: Epoch: [23][120/209], lr: 3.04e-05, eta: 1:31:28	Time 4.067 (4.083)	Data 0.059 (0.077)	Mem 37.74GB	Prec@1 88.889 (91.276)	Loss 0.8493 (0.9570)
[02/24 07:19:14][INFO] train_vision.py:  544: Epoch: [23][130/209], lr: 3.00e-05, eta: 1:30:44	Time 4.060 (4.082)	Data 0.046 (0.075)	Mem 37.74GB	Prec@1 88.889 (91.179)	Loss 0.8876 (0.9612)
[02/24 07:19:55][INFO] train_vision.py:  544: Epoch: [23][140/209], lr: 2.96e-05, eta: 1:30:01	Time 4.056 (4.080)	Data 0.049 (0.074)	Mem 37.74GB	Prec@1 100.000 (91.253)	Loss 0.8380 (0.9625)
[02/24 07:20:35][INFO] train_vision.py:  544: Epoch: [23][150/209], lr: 2.92e-05, eta: 1:29:19	Time 4.050 (4.079)	Data 0.048 (0.073)	Mem 37.74GB	Prec@1 88.889 (91.317)	Loss 0.9241 (0.9630)
[02/24 07:21:16][INFO] train_vision.py:  544: Epoch: [23][160/209], lr: 2.88e-05, eta: 1:28:37	Time 4.074 (4.078)	Data 0.058 (0.072)	Mem 37.74GB	Prec@1 100.000 (91.442)	Loss 0.7612 (0.9616)
[02/24 07:21:56][INFO] train_vision.py:  544: Epoch: [23][170/209], lr: 2.84e-05, eta: 1:27:55	Time 4.043 (4.077)	Data 0.046 (0.071)	Mem 37.74GB	Prec@1 100.000 (91.293)	Loss 0.7801 (0.9670)
[02/24 07:22:37][INFO] train_vision.py:  544: Epoch: [23][180/209], lr: 2.80e-05, eta: 1:27:13	Time 4.050 (4.076)	Data 0.057 (0.070)	Mem 37.74GB	Prec@1 77.778 (91.406)	Loss 1.2271 (0.9663)
[02/24 07:23:18][INFO] train_vision.py:  544: Epoch: [23][190/209], lr: 2.77e-05, eta: 1:26:31	Time 4.049 (4.075)	Data 0.046 (0.070)	Mem 37.74GB	Prec@1 100.000 (91.449)	Loss 0.8412 (0.9676)
[02/24 07:23:58][INFO] train_vision.py:  544: Epoch: [23][200/209], lr: 2.73e-05, eta: 1:25:49	Time 4.051 (4.074)	Data 0.054 (0.069)	Mem 37.74GB	Prec@1 88.889 (91.542)	Loss 1.2095 (0.9649)
[02/24 07:24:37][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 91.667 (91.667)	Prec@5 97.222 (97.222)	mPrec@1 (56.181)	mPrec@5 (61.597)
[02/24 07:25:24][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 91.667 (91.793)	Prec@5 97.222 (98.232)	mPrec@1 (82.537)	mPrec@5 (90.672)
[02/24 07:26:12][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 90.278 (92.659)	Prec@5 100.000 (98.677)	mPrec@1 (83.531)	mPrec@5 (91.814)
[02/24 07:26:43][INFO] train_vision.py:  609: Overall Prec@1 92.004% Prec@5 98.735% mPrec@1 (85.985) mPrec@5 (96.346)
[02/24 07:26:43][INFO] train_vision.py:  454: Testing: 92.00404839766652/92.66194304184393
[02/24 07:26:43][INFO] train_vision.py:  455: Saving:
[02/24 07:26:54][INFO] train_vision.py:  544: Epoch: [24][0/209], lr: 2.69e-05, eta: 2:10:08	Time 6.222 (6.222)	Data 2.240 (2.240)	Mem 37.74GB	Prec@1 100.000 (100.000)	Loss 0.7294 (0.7294)
[02/24 07:27:34][INFO] train_vision.py:  544: Epoch: [24][10/209], lr: 2.66e-05, eta: 1:28:15	Time 4.059 (4.254)	Data 0.063 (0.261)	Mem 37.74GB	Prec@1 100.000 (92.929)	Loss 0.7413 (0.9707)
[02/24 07:28:15][INFO] train_vision.py:  544: Epoch: [24][20/209], lr: 2.62e-05, eta: 1:25:44	Time 4.062 (4.165)	Data 0.050 (0.166)	Mem 37.74GB	Prec@1 100.000 (92.063)	Loss 0.7399 (0.9868)
[02/24 07:28:55][INFO] train_vision.py:  544: Epoch: [24][30/209], lr: 2.58e-05, eta: 1:24:22	Time 4.061 (4.133)	Data 0.065 (0.132)	Mem 37.74GB	Prec@1 77.778 (91.756)	Loss 1.2958 (0.9987)
[02/24 07:29:36][INFO] train_vision.py:  544: Epoch: [24][40/209], lr: 2.54e-05, eta: 1:23:19	Time 4.043 (4.115)	Data 0.061 (0.113)	Mem 37.74GB	Prec@1 100.000 (92.412)	Loss 0.8742 (0.9709)
[02/24 07:30:17][INFO] train_vision.py:  544: Epoch: [24][50/209], lr: 2.51e-05, eta: 1:22:25	Time 4.056 (4.104)	Data 0.063 (0.102)	Mem 37.74GB	Prec@1 100.000 (91.939)	Loss 1.0141 (0.9853)
[02/24 07:30:57][INFO] train_vision.py:  544: Epoch: [24][60/209], lr: 2.47e-05, eta: 1:21:34	Time 4.056 (4.096)	Data 0.061 (0.095)	Mem 37.74GB	Prec@1 88.889 (92.168)	Loss 1.3175 (0.9777)
[02/24 07:31:38][INFO] train_vision.py:  544: Epoch: [24][70/209], lr: 2.43e-05, eta: 1:20:47	Time 4.056 (4.091)	Data 0.063 (0.091)	Mem 37.74GB	Prec@1 100.000 (91.862)	Loss 0.7662 (0.9831)
[02/24 07:32:18][INFO] train_vision.py:  544: Epoch: [24][80/209], lr: 2.40e-05, eta: 1:20:01	Time 4.050 (4.086)	Data 0.055 (0.087)	Mem 37.74GB	Prec@1 100.000 (92.455)	Loss 0.8750 (0.9656)
[02/24 07:32:59][INFO] train_vision.py:  544: Epoch: [24][90/209], lr: 2.36e-05, eta: 1:19:16	Time 4.055 (4.083)	Data 0.062 (0.084)	Mem 37.74GB	Prec@1 100.000 (92.430)	Loss 0.7925 (0.9598)
[02/24 07:33:39][INFO] train_vision.py:  544: Epoch: [24][100/209], lr: 2.33e-05, eta: 1:18:32	Time 4.064 (4.080)	Data 0.062 (0.081)	Mem 37.74GB	Prec@1 100.000 (92.409)	Loss 0.8055 (0.9581)
[02/24 07:34:20][INFO] train_vision.py:  544: Epoch: [24][110/209], lr: 2.29e-05, eta: 1:17:49	Time 4.057 (4.078)	Data 0.064 (0.079)	Mem 37.74GB	Prec@1 88.889 (92.092)	Loss 1.0949 (0.9679)
[02/24 07:35:01][INFO] train_vision.py:  544: Epoch: [24][120/209], lr: 2.26e-05, eta: 1:17:07	Time 4.079 (4.077)	Data 0.056 (0.077)	Mem 37.74GB	Prec@1 77.778 (91.919)	Loss 1.2716 (0.9703)
[02/24 07:35:41][INFO] train_vision.py:  544: Epoch: [24][130/209], lr: 2.22e-05, eta: 1:16:24	Time 4.063 (4.075)	Data 0.064 (0.076)	Mem 37.74GB	Prec@1 100.000 (91.942)	Loss 0.7474 (0.9695)
[02/24 07:36:22][INFO] train_vision.py:  544: Epoch: [24][140/209], lr: 2.19e-05, eta: 1:15:42	Time 4.052 (4.074)	Data 0.055 (0.074)	Mem 37.74GB	Prec@1 66.667 (91.805)	Loss 1.6230 (0.9715)
[02/24 07:37:02][INFO] train_vision.py:  544: Epoch: [24][150/209], lr: 2.15e-05, eta: 1:15:00	Time 4.056 (4.073)	Data 0.062 (0.073)	Mem 37.74GB	Prec@1 100.000 (92.200)	Loss 0.7694 (0.9601)
[02/24 07:37:43][INFO] train_vision.py:  544: Epoch: [24][160/209], lr: 2.12e-05, eta: 1:14:18	Time 4.061 (4.072)	Data 0.062 (0.072)	Mem 37.74GB	Prec@1 100.000 (92.133)	Loss 0.7613 (0.9597)
[02/24 07:38:23][INFO] train_vision.py:  544: Epoch: [24][170/209], lr: 2.08e-05, eta: 1:13:37	Time 4.057 (4.071)	Data 0.062 (0.071)	Mem 37.74GB	Prec@1 88.889 (91.943)	Loss 0.9270 (0.9641)
[02/24 07:39:04][INFO] train_vision.py:  544: Epoch: [24][180/209], lr: 2.05e-05, eta: 1:12:55	Time 4.052 (4.070)	Data 0.056 (0.070)	Mem 37.74GB	Prec@1 100.000 (91.958)	Loss 0.7489 (0.9631)
[02/24 07:39:45][INFO] train_vision.py:  544: Epoch: [24][190/209], lr: 2.02e-05, eta: 1:12:14	Time 4.061 (4.070)	Data 0.065 (0.070)	Mem 37.74GB	Prec@1 88.889 (92.088)	Loss 1.0748 (0.9585)
[02/24 07:40:25][INFO] train_vision.py:  544: Epoch: [24][200/209], lr: 1.99e-05, eta: 1:11:33	Time 4.074 (4.069)	Data 0.062 (0.069)	Mem 37.74GB	Prec@1 100.000 (92.261)	Loss 0.7892 (0.9566)
[02/24 07:41:04][INFO] train_vision.py:  544: Epoch: [25][0/209], lr: 1.95e-05, eta: 1:54:06	Time 6.546 (6.546)	Data 2.458 (2.458)	Mem 37.74GB	Prec@1 88.889 (88.889)	Loss 1.0074 (1.0074)
[02/24 07:41:44][INFO] train_vision.py:  544: Epoch: [25][10/209], lr: 1.92e-05, eta: 1:14:10	Time 4.056 (4.296)	Data 0.045 (0.288)	Mem 37.74GB	Prec@1 88.889 (92.929)	Loss 1.1556 (0.9854)
[02/24 07:42:25][INFO] train_vision.py:  544: Epoch: [25][20/209], lr: 1.89e-05, eta: 1:11:37	Time 4.067 (4.188)	Data 0.076 (0.185)	Mem 37.74GB	Prec@1 88.889 (90.476)	Loss 1.1099 (0.9899)
[02/24 07:43:06][INFO] train_vision.py:  544: Epoch: [25][30/209], lr: 1.86e-05, eta: 1:10:14	Time 4.053 (4.148)	Data 0.058 (0.146)	Mem 37.74GB	Prec@1 88.889 (90.681)	Loss 1.0276 (0.9611)
[02/24 07:43:46][INFO] train_vision.py:  544: Epoch: [25][40/209], lr: 1.83e-05, eta: 1:09:13	Time 4.068 (4.129)	Data 0.081 (0.127)	Mem 37.74GB	Prec@1 100.000 (91.057)	Loss 0.7943 (0.9670)
[02/24 07:44:27][INFO] train_vision.py:  544: Epoch: [25][50/209], lr: 1.80e-05, eta: 1:08:19	Time 4.056 (4.116)	Data 0.058 (0.114)	Mem 37.74GB	Prec@1 77.778 (90.632)	Loss 1.5209 (0.9948)
[02/24 07:45:08][INFO] train_vision.py:  544: Epoch: [25][60/209], lr: 1.77e-05, eta: 1:07:28	Time 4.062 (4.106)	Data 0.066 (0.105)	Mem 37.74GB	Prec@1 88.889 (90.528)	Loss 1.1707 (0.9887)
[02/24 07:45:48][INFO] train_vision.py:  544: Epoch: [25][70/209], lr: 1.73e-05, eta: 1:06:41	Time 4.056 (4.100)	Data 0.055 (0.098)	Mem 37.74GB	Prec@1 100.000 (91.549)	Loss 0.8525 (0.9652)
[02/24 07:46:29][INFO] train_vision.py:  544: Epoch: [25][80/209], lr: 1.70e-05, eta: 1:05:55	Time 4.064 (4.094)	Data 0.062 (0.093)	Mem 37.74GB	Prec@1 100.000 (90.946)	Loss 0.7306 (0.9751)
[02/24 07:47:09][INFO] train_vision.py:  544: Epoch: [25][90/209], lr: 1.67e-05, eta: 1:05:10	Time 4.056 (4.090)	Data 0.065 (0.089)	Mem 37.74GB	Prec@1 100.000 (91.209)	Loss 0.7511 (0.9751)
[02/24 07:47:50][INFO] train_vision.py:  544: Epoch: [25][100/209], lr: 1.64e-05, eta: 1:04:25	Time 4.046 (4.086)	Data 0.055 (0.086)	Mem 37.74GB	Prec@1 100.000 (91.529)	Loss 0.7592 (0.9657)
[02/24 07:48:30][INFO] train_vision.py:  544: Epoch: [25][110/209], lr: 1.61e-05, eta: 1:03:41	Time 4.047 (4.083)	Data 0.047 (0.083)	Mem 37.74GB	Prec@1 88.889 (91.892)	Loss 0.9761 (0.9590)
[02/24 07:49:11][INFO] train_vision.py:  544: Epoch: [25][120/209], lr: 1.59e-05, eta: 1:02:58	Time 4.057 (4.080)	Data 0.038 (0.080)	Mem 37.74GB	Prec@1 66.667 (91.736)	Loss 1.5402 (0.9661)
[02/24 07:49:51][INFO] train_vision.py:  544: Epoch: [25][130/209], lr: 1.56e-05, eta: 1:02:15	Time 4.042 (4.078)	Data 0.046 (0.078)	Mem 37.74GB	Prec@1 77.778 (91.433)	Loss 1.1020 (0.9705)
[02/24 07:50:32][INFO] train_vision.py:  544: Epoch: [25][140/209], lr: 1.53e-05, eta: 1:01:32	Time 4.054 (4.076)	Data 0.053 (0.076)	Mem 37.74GB	Prec@1 100.000 (91.568)	Loss 0.8041 (0.9654)
[02/24 07:51:12][INFO] train_vision.py:  544: Epoch: [25][150/209], lr: 1.50e-05, eta: 1:00:50	Time 4.045 (4.074)	Data 0.047 (0.074)	Mem 37.74GB	Prec@1 88.889 (91.538)	Loss 1.1590 (0.9652)
[02/24 07:51:53][INFO] train_vision.py:  544: Epoch: [25][160/209], lr: 1.47e-05, eta: 1:00:08	Time 4.055 (4.073)	Data 0.056 (0.073)	Mem 37.74GB	Prec@1 88.889 (91.511)	Loss 1.0583 (0.9659)
[02/24 07:52:33][INFO] train_vision.py:  544: Epoch: [25][170/209], lr: 1.44e-05, eta: 0:59:26	Time 4.041 (4.071)	Data 0.046 (0.072)	Mem 37.74GB	Prec@1 100.000 (91.488)	Loss 0.7558 (0.9634)
[02/24 07:53:14][INFO] train_vision.py:  544: Epoch: [25][180/209], lr: 1.42e-05, eta: 0:58:44	Time 4.052 (4.070)	Data 0.056 (0.071)	Mem 37.74GB	Prec@1 88.889 (91.529)	Loss 1.0694 (0.9634)
[02/24 07:53:54][INFO] train_vision.py:  544: Epoch: [25][190/209], lr: 1.39e-05, eta: 0:58:03	Time 4.046 (4.069)	Data 0.048 (0.069)	Mem 37.74GB	Prec@1 88.889 (91.507)	Loss 1.0604 (0.9654)
[02/24 07:54:35][INFO] train_vision.py:  544: Epoch: [25][200/209], lr: 1.36e-05, eta: 0:57:21	Time 4.051 (4.068)	Data 0.057 (0.069)	Mem 37.74GB	Prec@1 88.889 (91.653)	Loss 0.8912 (0.9601)
[02/24 07:55:14][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 91.667 (91.667)	Prec@5 97.222 (97.222)	mPrec@1 (56.875)	mPrec@5 (61.597)
[02/24 07:56:01][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 90.278 (91.667)	Prec@5 98.611 (97.980)	mPrec@1 (82.261)	mPrec@5 (90.101)
[02/24 07:56:48][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 91.667 (91.601)	Prec@5 100.000 (98.413)	mPrec@1 (82.013)	mPrec@5 (91.480)
[02/24 07:57:19][INFO] train_vision.py:  609: Overall Prec@1 91.498% Prec@5 98.532% mPrec@1 (85.021) mPrec@5 (96.046)
[02/24 07:57:19][INFO] train_vision.py:  454: Testing: 91.4979749362961/92.66194304184393
[02/24 07:57:19][INFO] train_vision.py:  455: Saving:
[02/24 07:57:30][INFO] train_vision.py:  544: Epoch: [26][0/209], lr: 1.33e-05, eta: 1:27:14	Time 6.254 (6.254)	Data 2.280 (2.280)	Mem 37.74GB	Prec@1 100.000 (100.000)	Loss 0.7831 (0.7831)
[02/24 07:58:10][INFO] train_vision.py:  544: Epoch: [26][10/209], lr: 1.31e-05, eta: 0:58:38	Time 4.067 (4.254)	Data 0.051 (0.256)	Mem 37.74GB	Prec@1 100.000 (95.960)	Loss 0.7660 (0.8554)
[02/24 07:58:51][INFO] train_vision.py:  544: Epoch: [26][20/209], lr: 1.28e-05, eta: 0:56:39	Time 4.053 (4.161)	Data 0.048 (0.161)	Mem 37.74GB	Prec@1 88.889 (92.593)	Loss 1.0302 (0.9196)
[02/24 07:59:32][INFO] train_vision.py:  544: Epoch: [26][30/209], lr: 1.26e-05, eta: 0:55:31	Time 4.074 (4.128)	Data 0.061 (0.127)	Mem 37.74GB	Prec@1 100.000 (93.190)	Loss 0.7635 (0.9090)
[02/24 08:00:12][INFO] train_vision.py:  544: Epoch: [26][40/209], lr: 1.23e-05, eta: 0:54:37	Time 4.053 (4.112)	Data 0.056 (0.111)	Mem 37.74GB	Prec@1 88.889 (93.767)	Loss 1.1114 (0.9083)
[02/24 08:00:53][INFO] train_vision.py:  544: Epoch: [26][50/209], lr: 1.21e-05, eta: 0:53:48	Time 4.059 (4.102)	Data 0.063 (0.101)	Mem 37.74GB	Prec@1 100.000 (93.464)	Loss 0.7217 (0.9146)
[02/24 08:01:33][INFO] train_vision.py:  544: Epoch: [26][60/209], lr: 1.18e-05, eta: 0:53:02	Time 4.074 (4.096)	Data 0.065 (0.096)	Mem 37.74GB	Prec@1 88.889 (93.989)	Loss 1.0345 (0.9093)
[02/24 08:02:14][INFO] train_vision.py:  544: Epoch: [26][70/209], lr: 1.16e-05, eta: 0:52:18	Time 4.064 (4.092)	Data 0.065 (0.091)	Mem 37.74GB	Prec@1 88.889 (93.271)	Loss 0.9852 (0.9292)
[02/24 08:02:55][INFO] train_vision.py:  544: Epoch: [26][80/209], lr: 1.13e-05, eta: 0:51:34	Time 4.059 (4.088)	Data 0.064 (0.088)	Mem 37.74GB	Prec@1 100.000 (93.141)	Loss 0.7369 (0.9238)
[02/24 08:03:35][INFO] train_vision.py:  544: Epoch: [26][90/209], lr: 1.11e-05, eta: 0:50:51	Time 4.061 (4.085)	Data 0.063 (0.084)	Mem 37.74GB	Prec@1 88.889 (92.674)	Loss 0.9335 (0.9293)
[02/24 08:04:16][INFO] train_vision.py:  544: Epoch: [26][100/209], lr: 1.09e-05, eta: 0:50:09	Time 4.077 (4.084)	Data 0.074 (0.082)	Mem 37.74GB	Prec@1 88.889 (92.739)	Loss 1.0970 (0.9343)
[02/24 08:04:57][INFO] train_vision.py:  544: Epoch: [26][110/209], lr: 1.06e-05, eta: 0:49:27	Time 4.060 (4.082)	Data 0.063 (0.080)	Mem 37.74GB	Prec@1 88.889 (92.593)	Loss 1.0092 (0.9389)
[02/24 08:05:37][INFO] train_vision.py:  544: Epoch: [26][120/209], lr: 1.04e-05, eta: 0:48:45	Time 4.061 (4.080)	Data 0.063 (0.079)	Mem 37.74GB	Prec@1 100.000 (92.929)	Loss 0.7910 (0.9332)
[02/24 08:06:18][INFO] train_vision.py:  544: Epoch: [26][130/209], lr: 1.02e-05, eta: 0:48:03	Time 4.069 (4.079)	Data 0.062 (0.077)	Mem 37.74GB	Prec@1 66.667 (92.875)	Loss 1.4859 (0.9365)
[02/24 08:06:59][INFO] train_vision.py:  544: Epoch: [26][140/209], lr: 9.93e-06, eta: 0:47:21	Time 4.054 (4.077)	Data 0.063 (0.076)	Mem 37.74GB	Prec@1 88.889 (92.671)	Loss 1.0562 (0.9420)
[02/24 08:07:39][INFO] train_vision.py:  544: Epoch: [26][150/209], lr: 9.71e-06, eta: 0:46:40	Time 4.056 (4.076)	Data 0.062 (0.075)	Mem 37.74GB	Prec@1 88.889 (92.568)	Loss 1.0112 (0.9425)
[02/24 08:08:20][INFO] train_vision.py:  544: Epoch: [26][160/209], lr: 9.49e-06, eta: 0:45:58	Time 4.055 (4.075)	Data 0.063 (0.074)	Mem 37.74GB	Prec@1 88.889 (92.685)	Loss 0.9372 (0.9397)
[02/24 08:09:00][INFO] train_vision.py:  544: Epoch: [26][170/209], lr: 9.27e-06, eta: 0:45:17	Time 4.058 (4.074)	Data 0.066 (0.073)	Mem 37.74GB	Prec@1 100.000 (92.593)	Loss 0.7368 (0.9373)
[02/24 08:09:41][INFO] train_vision.py:  544: Epoch: [26][180/209], lr: 9.06e-06, eta: 0:44:36	Time 4.060 (4.073)	Data 0.065 (0.073)	Mem 37.74GB	Prec@1 100.000 (92.756)	Loss 0.7940 (0.9318)
[02/24 08:10:22][INFO] train_vision.py:  544: Epoch: [26][190/209], lr: 8.85e-06, eta: 0:43:55	Time 4.070 (4.073)	Data 0.064 (0.072)	Mem 37.74GB	Prec@1 88.889 (92.903)	Loss 0.8967 (0.9309)
[02/24 08:11:02][INFO] train_vision.py:  544: Epoch: [26][200/209], lr: 8.64e-06, eta: 0:43:14	Time 4.063 (4.073)	Data 0.062 (0.071)	Mem 37.74GB	Prec@1 100.000 (92.924)	Loss 0.7383 (0.9294)
[02/24 08:11:41][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 93.056 (93.056)	Prec@5 97.222 (97.222)	mPrec@1 (58.264)	mPrec@5 (61.597)
[02/24 08:12:29][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 90.278 (91.919)	Prec@5 97.222 (97.727)	mPrec@1 (82.724)	mPrec@5 (90.072)
[02/24 08:13:16][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 90.278 (92.130)	Prec@5 100.000 (98.347)	mPrec@1 (82.857)	mPrec@5 (91.515)
[02/24 08:13:47][INFO] train_vision.py:  609: Overall Prec@1 91.953% Prec@5 98.482% mPrec@1 (85.663) mPrec@5 (96.093)
[02/24 08:13:47][INFO] train_vision.py:  454: Testing: 91.95344135732303/92.66194304184393
[02/24 08:13:47][INFO] train_vision.py:  455: Saving:
[02/24 08:13:58][INFO] train_vision.py:  544: Epoch: [27][0/209], lr: 8.43e-06, eta: 1:05:35	Time 6.267 (6.267)	Data 2.288 (2.288)	Mem 37.74GB	Prec@1 100.000 (100.000)	Loss 0.7692 (0.7692)
[02/24 08:14:39][INFO] train_vision.py:  544: Epoch: [27][10/209], lr: 8.25e-06, eta: 0:43:55	Time 4.072 (4.264)	Data 0.081 (0.269)	Mem 37.74GB	Prec@1 88.889 (94.949)	Loss 0.9073 (0.8707)
[02/24 08:15:19][INFO] train_vision.py:  544: Epoch: [27][20/209], lr: 8.05e-06, eta: 0:42:16	Time 4.085 (4.171)	Data 0.079 (0.173)	Mem 37.74GB	Prec@1 100.000 (93.122)	Loss 0.8663 (0.9196)
[02/24 08:16:00][INFO] train_vision.py:  544: Epoch: [27][30/209], lr: 7.86e-06, eta: 0:41:15	Time 4.073 (4.139)	Data 0.061 (0.138)	Mem 37.74GB	Prec@1 100.000 (91.756)	Loss 0.8361 (0.9564)
[02/24 08:16:41][INFO] train_vision.py:  544: Epoch: [27][40/209], lr: 7.67e-06, eta: 0:40:23	Time 4.080 (4.122)	Data 0.066 (0.119)	Mem 37.74GB	Prec@1 77.778 (90.515)	Loss 1.4313 (0.9785)
[02/24 08:17:21][INFO] train_vision.py:  544: Epoch: [27][50/209], lr: 7.48e-06, eta: 0:39:36	Time 4.065 (4.111)	Data 0.065 (0.107)	Mem 37.74GB	Prec@1 100.000 (91.285)	Loss 0.7246 (0.9574)
[02/24 08:18:02][INFO] train_vision.py:  544: Epoch: [27][60/209], lr: 7.29e-06, eta: 0:38:50	Time 4.062 (4.103)	Data 0.060 (0.099)	Mem 37.74GB	Prec@1 88.889 (91.621)	Loss 0.9236 (0.9407)
[02/24 08:18:43][INFO] train_vision.py:  544: Epoch: [27][70/209], lr: 7.11e-06, eta: 0:38:06	Time 4.063 (4.097)	Data 0.061 (0.094)	Mem 37.74GB	Prec@1 100.000 (92.175)	Loss 0.7519 (0.9279)
[02/24 08:19:23][INFO] train_vision.py:  544: Epoch: [27][80/209], lr: 6.93e-06, eta: 0:37:22	Time 4.058 (4.093)	Data 0.057 (0.090)	Mem 37.74GB	Prec@1 77.778 (92.044)	Loss 1.1444 (0.9372)
[02/24 08:20:04][INFO] train_vision.py:  544: Epoch: [27][90/209], lr: 6.75e-06, eta: 0:36:40	Time 4.060 (4.089)	Data 0.062 (0.087)	Mem 37.74GB	Prec@1 77.778 (92.063)	Loss 1.5082 (0.9428)
[02/24 08:20:44][INFO] train_vision.py:  544: Epoch: [27][100/209], lr: 6.58e-06, eta: 0:35:57	Time 4.060 (4.086)	Data 0.064 (0.084)	Mem 37.74GB	Prec@1 88.889 (91.639)	Loss 0.9492 (0.9513)
[02/24 08:21:25][INFO] train_vision.py:  544: Epoch: [27][110/209], lr: 6.41e-06, eta: 0:35:15	Time 4.065 (4.084)	Data 0.067 (0.082)	Mem 37.74GB	Prec@1 88.889 (91.992)	Loss 1.1824 (0.9437)
[02/24 08:22:06][INFO] train_vision.py:  544: Epoch: [27][120/209], lr: 6.24e-06, eta: 0:34:33	Time 4.077 (4.083)	Data 0.062 (0.081)	Mem 37.74GB	Prec@1 77.778 (91.736)	Loss 1.1311 (0.9475)
[02/24 08:22:46][INFO] train_vision.py:  544: Epoch: [27][130/209], lr: 6.07e-06, eta: 0:33:52	Time 4.059 (4.081)	Data 0.062 (0.079)	Mem 37.74GB	Prec@1 77.778 (91.518)	Loss 1.3086 (0.9513)
[02/24 08:23:27][INFO] train_vision.py:  544: Epoch: [27][140/209], lr: 5.91e-06, eta: 0:33:10	Time 4.079 (4.080)	Data 0.068 (0.078)	Mem 37.74GB	Prec@1 77.778 (91.411)	Loss 1.1788 (0.9505)
[02/24 08:24:08][INFO] train_vision.py:  544: Epoch: [27][150/209], lr: 5.76e-06, eta: 0:32:29	Time 4.058 (4.079)	Data 0.063 (0.077)	Mem 37.74GB	Prec@1 100.000 (91.464)	Loss 0.7373 (0.9524)
[02/24 08:24:48][INFO] train_vision.py:  544: Epoch: [27][160/209], lr: 5.60e-06, eta: 0:31:48	Time 4.063 (4.077)	Data 0.063 (0.076)	Mem 37.74GB	Prec@1 88.889 (91.304)	Loss 0.9551 (0.9576)
[02/24 08:25:29][INFO] train_vision.py:  544: Epoch: [27][170/209], lr: 5.45e-06, eta: 0:31:07	Time 4.059 (4.077)	Data 0.061 (0.075)	Mem 37.74GB	Prec@1 88.889 (91.358)	Loss 1.0156 (0.9552)
[02/24 08:26:09][INFO] train_vision.py:  544: Epoch: [27][180/209], lr: 5.30e-06, eta: 0:30:25	Time 4.056 (4.076)	Data 0.049 (0.074)	Mem 37.74GB	Prec@1 100.000 (91.651)	Loss 0.7427 (0.9501)
[02/24 08:26:50][INFO] train_vision.py:  544: Epoch: [27][190/209], lr: 5.16e-06, eta: 0:29:44	Time 4.072 (4.075)	Data 0.064 (0.073)	Mem 37.74GB	Prec@1 88.889 (91.739)	Loss 1.0068 (0.9484)
[02/24 08:27:31][INFO] train_vision.py:  544: Epoch: [27][200/209], lr: 5.02e-06, eta: 0:29:03	Time 4.059 (4.074)	Data 0.057 (0.073)	Mem 37.74GB	Prec@1 100.000 (91.819)	Loss 0.7678 (0.9482)
[02/24 08:28:10][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 91.667 (91.667)	Prec@5 97.222 (97.222)	mPrec@1 (56.181)	mPrec@5 (61.597)
[02/24 08:28:57][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 90.278 (92.045)	Prec@5 97.222 (97.727)	mPrec@1 (82.787)	mPrec@5 (90.090)
[02/24 08:29:44][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 91.667 (92.196)	Prec@5 100.000 (98.413)	mPrec@1 (82.458)	mPrec@5 (92.567)
[02/24 08:30:15][INFO] train_vision.py:  609: Overall Prec@1 91.700% Prec@5 98.482% mPrec@1 (85.042) mPrec@5 (96.773)
[02/24 08:30:15][INFO] train_vision.py:  454: Testing: 91.70040430231133/92.66194304184393
[02/24 08:30:15][INFO] train_vision.py:  455: Saving:
[02/24 08:30:26][INFO] train_vision.py:  544: Epoch: [28][0/209], lr: 4.88e-06, eta: 0:43:35	Time 6.242 (6.242)	Data 2.255 (2.255)	Mem 37.74GB	Prec@1 100.000 (100.000)	Loss 0.7266 (0.7266)
[02/24 08:31:07][INFO] train_vision.py:  544: Epoch: [28][10/209], lr: 4.75e-06, eta: 0:29:02	Time 4.069 (4.259)	Data 0.074 (0.264)	Mem 37.74GB	Prec@1 100.000 (94.949)	Loss 0.8677 (0.8950)
[02/24 08:31:48][INFO] train_vision.py:  544: Epoch: [28][20/209], lr: 4.62e-06, eta: 0:27:42	Time 4.079 (4.167)	Data 0.088 (0.171)	Mem 37.74GB	Prec@1 100.000 (94.709)	Loss 0.7533 (0.8993)
[02/24 08:32:28][INFO] train_vision.py:  544: Epoch: [28][30/209], lr: 4.49e-06, eta: 0:26:47	Time 4.062 (4.132)	Data 0.057 (0.135)	Mem 37.74GB	Prec@1 88.889 (92.473)	Loss 1.2354 (0.9485)
[02/24 08:33:09][INFO] train_vision.py:  544: Epoch: [28][40/209], lr: 4.37e-06, eta: 0:25:58	Time 4.046 (4.113)	Data 0.045 (0.115)	Mem 37.74GB	Prec@1 100.000 (92.954)	Loss 0.7270 (0.9417)
[02/24 08:33:49][INFO] train_vision.py:  544: Epoch: [28][50/209], lr: 4.24e-06, eta: 0:25:13	Time 4.054 (4.101)	Data 0.048 (0.103)	Mem 37.74GB	Prec@1 100.000 (93.028)	Loss 0.7251 (0.9295)
[02/24 08:34:30][INFO] train_vision.py:  544: Epoch: [28][60/209], lr: 4.12e-06, eta: 0:24:29	Time 4.038 (4.093)	Data 0.045 (0.095)	Mem 37.74GB	Prec@1 88.889 (92.532)	Loss 0.9700 (0.9388)
[02/24 08:35:10][INFO] train_vision.py:  544: Epoch: [28][70/209], lr: 4.01e-06, eta: 0:23:46	Time 4.059 (4.088)	Data 0.058 (0.089)	Mem 37.74GB	Prec@1 100.000 (92.801)	Loss 0.8353 (0.9394)
[02/24 08:35:51][INFO] train_vision.py:  544: Epoch: [28][80/209], lr: 3.90e-06, eta: 0:23:04	Time 4.051 (4.083)	Data 0.045 (0.085)	Mem 37.74GB	Prec@1 100.000 (93.141)	Loss 0.7852 (0.9307)
[02/24 08:36:31][INFO] train_vision.py:  544: Epoch: [28][90/209], lr: 3.79e-06, eta: 0:22:22	Time 4.049 (4.080)	Data 0.048 (0.082)	Mem 37.74GB	Prec@1 88.889 (93.529)	Loss 1.0893 (0.9220)
[02/24 08:37:12][INFO] train_vision.py:  544: Epoch: [28][100/209], lr: 3.68e-06, eta: 0:21:40	Time 4.042 (4.078)	Data 0.044 (0.079)	Mem 37.74GB	Prec@1 88.889 (93.509)	Loss 1.0108 (0.9238)
[02/24 08:37:52][INFO] train_vision.py:  544: Epoch: [28][110/209], lr: 3.58e-06, eta: 0:20:59	Time 4.053 (4.076)	Data 0.053 (0.077)	Mem 37.74GB	Prec@1 100.000 (93.594)	Loss 0.7271 (0.9204)
[02/24 08:38:33][INFO] train_vision.py:  544: Epoch: [28][120/209], lr: 3.48e-06, eta: 0:20:17	Time 4.040 (4.074)	Data 0.044 (0.075)	Mem 37.74GB	Prec@1 100.000 (93.664)	Loss 0.7463 (0.9194)
[02/24 08:39:14][INFO] train_vision.py:  544: Epoch: [28][130/209], lr: 3.38e-06, eta: 0:19:36	Time 4.055 (4.072)	Data 0.056 (0.073)	Mem 37.74GB	Prec@1 88.889 (93.639)	Loss 1.0258 (0.9178)
[02/24 08:39:54][INFO] train_vision.py:  544: Epoch: [28][140/209], lr: 3.29e-06, eta: 0:18:55	Time 4.041 (4.071)	Data 0.045 (0.072)	Mem 37.74GB	Prec@1 100.000 (93.459)	Loss 0.8416 (0.9217)
[02/24 08:40:35][INFO] train_vision.py:  544: Epoch: [28][150/209], lr: 3.19e-06, eta: 0:18:14	Time 4.055 (4.070)	Data 0.057 (0.071)	Mem 37.74GB	Prec@1 100.000 (93.377)	Loss 0.7282 (0.9238)
[02/24 08:41:15][INFO] train_vision.py:  544: Epoch: [28][160/209], lr: 3.11e-06, eta: 0:17:33	Time 4.037 (4.069)	Data 0.044 (0.070)	Mem 37.74GB	Prec@1 88.889 (93.582)	Loss 0.8510 (0.9186)
[02/24 08:41:56][INFO] train_vision.py:  544: Epoch: [28][170/209], lr: 3.02e-06, eta: 0:16:52	Time 4.052 (4.068)	Data 0.047 (0.069)	Mem 37.74GB	Prec@1 100.000 (93.437)	Loss 0.8845 (0.9193)
[02/24 08:42:36][INFO] train_vision.py:  544: Epoch: [28][180/209], lr: 2.94e-06, eta: 0:16:12	Time 4.042 (4.067)	Data 0.047 (0.068)	Mem 37.74GB	Prec@1 88.889 (93.493)	Loss 1.0524 (0.9204)
[02/24 08:43:17][INFO] train_vision.py:  544: Epoch: [28][190/209], lr: 2.87e-06, eta: 0:15:31	Time 4.055 (4.067)	Data 0.047 (0.068)	Mem 37.74GB	Prec@1 100.000 (93.659)	Loss 1.0373 (0.9215)
[02/24 08:43:57][INFO] train_vision.py:  544: Epoch: [28][200/209], lr: 2.79e-06, eta: 0:14:50	Time 4.048 (4.066)	Data 0.047 (0.067)	Mem 37.74GB	Prec@1 100.000 (93.643)	Loss 0.9645 (0.9195)
[02/24 08:44:36][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 93.056 (93.056)	Prec@5 97.222 (97.222)	mPrec@1 (58.958)	mPrec@5 (61.597)
[02/24 08:45:24][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 90.278 (91.667)	Prec@5 98.611 (97.727)	mPrec@1 (82.691)	mPrec@5 (89.749)
[02/24 08:46:11][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 93.056 (92.328)	Prec@5 100.000 (98.347)	mPrec@1 (83.134)	mPrec@5 (92.375)
[02/24 08:46:42][INFO] train_vision.py:  609: Overall Prec@1 92.156% Prec@5 98.431% mPrec@1 (85.886) mPrec@5 (96.596)
[02/24 08:46:42][INFO] train_vision.py:  454: Testing: 92.1558700129088/92.66194304184393
[02/24 08:46:42][INFO] train_vision.py:  455: Saving:
[02/24 08:46:53][INFO] train_vision.py:  544: Epoch: [29][0/209], lr: 2.72e-06, eta: 0:21:51	Time 6.244 (6.244)	Data 2.261 (2.261)	Mem 37.74GB	Prec@1 100.000 (100.000)	Loss 0.8413 (0.8413)
[02/24 08:47:33][INFO] train_vision.py:  544: Epoch: [29][10/209], lr: 2.66e-06, eta: 0:14:12	Time 4.080 (4.265)	Data 0.078 (0.273)	Mem 37.74GB	Prec@1 100.000 (91.919)	Loss 0.7603 (0.9186)
[02/24 08:48:14][INFO] train_vision.py:  544: Epoch: [29][20/209], lr: 2.60e-06, eta: 0:13:12	Time 4.065 (4.168)	Data 0.061 (0.173)	Mem 37.74GB	Prec@1 100.000 (94.180)	Loss 0.7254 (0.8769)
[02/24 08:48:55][INFO] train_vision.py:  544: Epoch: [29][30/209], lr: 2.54e-06, eta: 0:12:24	Time 4.062 (4.134)	Data 0.077 (0.140)	Mem 37.74GB	Prec@1 66.667 (93.190)	Loss 1.1973 (0.8943)
[02/24 08:49:35][INFO] train_vision.py:  544: Epoch: [29][40/209], lr: 2.48e-06, eta: 0:11:39	Time 4.064 (4.117)	Data 0.064 (0.120)	Mem 37.74GB	Prec@1 100.000 (92.683)	Loss 0.7446 (0.8982)
[02/24 08:50:16][INFO] train_vision.py:  544: Epoch: [29][50/209], lr: 2.42e-06, eta: 0:10:56	Time 4.048 (4.105)	Data 0.053 (0.108)	Mem 37.74GB	Prec@1 100.000 (92.157)	Loss 0.8737 (0.9120)
[02/24 08:50:56][INFO] train_vision.py:  544: Epoch: [29][60/209], lr: 2.37e-06, eta: 0:10:14	Time 4.057 (4.097)	Data 0.062 (0.100)	Mem 37.74GB	Prec@1 77.778 (91.621)	Loss 1.5664 (0.9319)
[02/24 08:51:37][INFO] train_vision.py:  544: Epoch: [29][70/209], lr: 2.32e-06, eta: 0:09:32	Time 4.057 (4.091)	Data 0.057 (0.094)	Mem 37.74GB	Prec@1 100.000 (91.862)	Loss 0.7495 (0.9294)
[02/24 08:52:17][INFO] train_vision.py:  544: Epoch: [29][80/209], lr: 2.28e-06, eta: 0:08:51	Time 4.053 (4.086)	Data 0.063 (0.089)	Mem 37.74GB	Prec@1 88.889 (91.495)	Loss 1.0191 (0.9337)
[02/24 08:52:58][INFO] train_vision.py:  544: Epoch: [29][90/209], lr: 2.24e-06, eta: 0:08:09	Time 4.049 (4.083)	Data 0.042 (0.086)	Mem 37.74GB	Prec@1 100.000 (91.331)	Loss 0.7263 (0.9381)
[02/24 08:53:39][INFO] train_vision.py:  544: Epoch: [29][100/209], lr: 2.20e-06, eta: 0:07:28	Time 4.064 (4.080)	Data 0.062 (0.083)	Mem 37.74GB	Prec@1 100.000 (91.309)	Loss 0.7368 (0.9383)
[02/24 08:54:19][INFO] train_vision.py:  544: Epoch: [29][110/209], lr: 2.17e-06, eta: 0:06:47	Time 4.043 (4.078)	Data 0.046 (0.081)	Mem 37.74GB	Prec@1 88.889 (91.592)	Loss 1.0598 (0.9377)
[02/24 08:55:00][INFO] train_vision.py:  544: Epoch: [29][120/209], lr: 2.13e-06, eta: 0:06:06	Time 4.056 (4.076)	Data 0.062 (0.079)	Mem 37.74GB	Prec@1 88.889 (91.644)	Loss 0.9162 (0.9400)
[02/24 08:55:40][INFO] train_vision.py:  544: Epoch: [29][130/209], lr: 2.11e-06, eta: 0:05:25	Time 4.051 (4.075)	Data 0.058 (0.078)	Mem 37.74GB	Prec@1 88.889 (91.773)	Loss 1.0662 (0.9373)
[02/24 08:56:21][INFO] train_vision.py:  544: Epoch: [29][140/209], lr: 2.08e-06, eta: 0:04:45	Time 4.061 (4.073)	Data 0.064 (0.076)	Mem 37.74GB	Prec@1 100.000 (91.805)	Loss 0.7882 (0.9392)
[02/24 08:57:01][INFO] train_vision.py:  544: Epoch: [29][150/209], lr: 2.06e-06, eta: 0:04:04	Time 4.058 (4.072)	Data 0.057 (0.075)	Mem 37.74GB	Prec@1 88.889 (91.685)	Loss 0.8547 (0.9423)
[02/24 08:57:42][INFO] train_vision.py:  544: Epoch: [29][160/209], lr: 2.04e-06, eta: 0:03:23	Time 4.066 (4.072)	Data 0.077 (0.074)	Mem 37.74GB	Prec@1 88.889 (91.511)	Loss 0.9203 (0.9496)
[02/24 08:58:23][INFO] train_vision.py:  544: Epoch: [29][170/209], lr: 2.03e-06, eta: 0:02:42	Time 4.057 (4.071)	Data 0.058 (0.074)	Mem 37.74GB	Prec@1 88.889 (91.358)	Loss 0.9576 (0.9540)
[02/24 08:59:03][INFO] train_vision.py:  544: Epoch: [29][180/209], lr: 2.01e-06, eta: 0:02:02	Time 4.064 (4.070)	Data 0.063 (0.073)	Mem 37.74GB	Prec@1 88.889 (91.283)	Loss 1.2811 (0.9593)
[02/24 08:59:44][INFO] train_vision.py:  544: Epoch: [29][190/209], lr: 2.01e-06, eta: 0:01:21	Time 4.056 (4.070)	Data 0.058 (0.072)	Mem 37.74GB	Prec@1 88.889 (91.099)	Loss 0.8931 (0.9653)
[02/24 09:00:24][INFO] train_vision.py:  544: Epoch: [29][200/209], lr: 2.00e-06, eta: 0:00:40	Time 4.057 (4.069)	Data 0.063 (0.072)	Mem 37.74GB	Prec@1 88.889 (91.266)	Loss 1.0184 (0.9627)
[02/24 09:01:03][INFO] train_vision.py:  603: Test: [0/28]	Prec@1 93.056 (93.056)	Prec@5 97.222 (97.222)	mPrec@1 (58.958)	mPrec@5 (61.597)
[02/24 09:01:51][INFO] train_vision.py:  603: Test: [10/28]	Prec@1 91.667 (92.172)	Prec@5 98.611 (97.980)	mPrec@1 (82.912)	mPrec@5 (89.881)
[02/24 09:02:38][INFO] train_vision.py:  603: Test: [20/28]	Prec@1 91.667 (92.196)	Prec@5 100.000 (98.479)	mPrec@1 (82.589)	mPrec@5 (92.443)
[02/24 09:03:09][INFO] train_vision.py:  609: Overall Prec@1 92.004% Prec@5 98.532% mPrec@1 (85.351) mPrec@5 (96.648)
[02/24 09:03:09][INFO] train_vision.py:  454: Testing: 92.00404796523121/92.66194304184393
[02/24 09:03:09][INFO] train_vision.py:  455: Saving:
[02/24 09:03:37][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/24 09:03:37][DEBUG] cmd.py: 1253: Popen(['git', 'rev-parse', '--show-toplevel'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=None, shell=False, universal_newlines=False)
[02/24 09:03:38][DEBUG] cmd.py: 1253: Popen(['git', 'cat-file', '--batch-check'], cwd=/home/anonymous/research/CorrelationSideTuning, stdin=<valid stream>, shell=False, universal_newlines=False)
[02/24 09:03:44][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/24 09:03:45][INFO] model.py:  444: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/24 09:03:46][INFO] model.py:  404: dropout used:[0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
[02/24 09:03:48][INFO] model.py:  921: loading clip pretrained model!
[02/24 09:03:48][INFO] utils.py:  456: Model:
VideoCLIP(
  (visual): VisualTransformer(
    (conv1): Conv2d(3, 1024, kernel_size=(14, 14), stride=(14, 14), bias=False)
    (dropout): Dropout(p=0.0, inplace=False)
    (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
    (transformer): Transformer(
      (resblocks): ModuleList(
        (0): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (1): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (2): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (3): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (4): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (5): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (6): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (7): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (8): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (9): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (10): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (11): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (12): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (13): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (14): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (15): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (16): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (17): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (18): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (19): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (20): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (21): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (22): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
        (23): ResidualAttentionBlock(
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=1024, out_features=1024, bias=True)
          )
          (ln_1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (drop_path): Identity()
          (mlp): Sequential(
            (c_fc): Linear(in_features=1024, out_features=4096, bias=True)
            (gelu): QuickGELU()
            (c_proj): Linear(in_features=4096, out_features=1024, bias=True)
          )
          (ln_2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          (control_point1): AfterReconstruction()
          (control_point2): AfterReconstruction()
          (control_atm): AfterReconstruction()
        )
      )
    )
    (side_network): SideNetwork(
      (resblocks): ModuleList(
        (0): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (1): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (2): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (3): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (4): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (5): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (6): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (7): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (8): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (9): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (10): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (11): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (12): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (13): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (14): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (15): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (16): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (17): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (18): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (19): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (20): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (21): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (22): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
        (23): AttnCBlock(
          (bn_1): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (conv): Sequential(
            (0): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
            (1): Conv3d(448, 448, kernel_size=(3, 1, 1), stride=(1, 1, 1), padding=(1, 0, 0), groups=448)
            (2): Conv3d(448, 448, kernel_size=(1, 1, 1), stride=(1, 1, 1))
          )
          (drop_path): Identity()
          (bn_2): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (mlp): CMlp(
            (fc1): Linear(in_features=448, out_features=1792, bias=True)
            (act): GELU(approximate=none)
            (fc2): Linear(in_features=1792, out_features=448, bias=True)
            (drop): Dropout(p=0.0, inplace=False)
          )
          (attn): MultiheadAttention(
            (out_proj): NonDynamicallyQuantizableLinear(in_features=448, out_features=448, bias=True)
          )
          (ln_1): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
        )
      )
      (adaptation): ModuleList(
        (0): Linear(in_features=1024, out_features=448, bias=True)
        (1): Linear(in_features=1024, out_features=448, bias=True)
        (2): Linear(in_features=1024, out_features=448, bias=True)
        (3): Linear(in_features=1024, out_features=448, bias=True)
        (4): Linear(in_features=1024, out_features=448, bias=True)
        (5): Linear(in_features=1024, out_features=448, bias=True)
        (6): Linear(in_features=1024, out_features=448, bias=True)
        (7): Linear(in_features=1024, out_features=448, bias=True)
        (8): Linear(in_features=1024, out_features=448, bias=True)
        (9): Linear(in_features=1024, out_features=448, bias=True)
        (10): Linear(in_features=1024, out_features=448, bias=True)
        (11): Linear(in_features=1024, out_features=448, bias=True)
        (12): Linear(in_features=1024, out_features=448, bias=True)
        (13): Linear(in_features=1024, out_features=448, bias=True)
        (14): Linear(in_features=1024, out_features=448, bias=True)
        (15): Linear(in_features=1024, out_features=448, bias=True)
        (16): Linear(in_features=1024, out_features=448, bias=True)
        (17): Linear(in_features=1024, out_features=448, bias=True)
        (18): Linear(in_features=1024, out_features=448, bias=True)
        (19): Linear(in_features=1024, out_features=448, bias=True)
        (20): Linear(in_features=1024, out_features=448, bias=True)
        (21): Linear(in_features=1024, out_features=448, bias=True)
        (22): Linear(in_features=1024, out_features=448, bias=True)
        (23): Linear(in_features=1024, out_features=448, bias=True)
      )
      (lns_pre): ModuleList(
        (0): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (4): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (5): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (6): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (7): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (8): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (9): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (10): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (11): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (12): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (13): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (14): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (15): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (16): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (17): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (18): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (19): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (20): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (21): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (22): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (23): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
      )
      (moss_layers): ModuleList(
        (0): MOSSBlock(
          (stss_encoders): ModuleList(
            (0): STSSEncoder(
              (ln_pre): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=1024, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
            (1): STSSEncoder(
              (ln_pre): LayerNorm((448,), eps=1e-05, elementwise_affine=True)
              (in_proj): Linear(in_features=448, out_features=256, bias=True)
              (stss_transformation): STSSTransformation()
              (stss_extraction): STSSExtraction(
                (conv0): Sequential(
                  (0): Conv3d(81, 96, kernel_size=(1, 1, 1), stride=(1, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
              )
              (stss_integration): STSSIntegration(
                (conv0): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv1): Sequential(
                  (0): Conv3d(96, 96, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (1): BatchNorm3d(96, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): GELU(approximate=none)
                )
                (conv2_fuse): Sequential(
                  (0): Rearrange('(b l) c t h w -> b (l c) t h w', l=5)
                  (1): Conv3d(480, 192, kernel_size=(1, 3, 3), stride=(1, 1, 1), padding=(0, 1, 1), bias=False)
                  (2): BatchNorm3d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (3): GELU(approximate=none)
                )
              )
              (out_proj): Linear(in_features=192, out_features=448, bias=True)
            )
          )
        )
      )
    )
    (side_post_bn): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
    (side_conv1): Conv3d(3, 448, kernel_size=(3, 14, 14), stride=(1, 14, 14), padding=(1, 0, 0))
    (side_pre_bn3d): BatchNorm3d(448, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
  )
  (fusion_model): video_header()
  (drop_out): Dropout(p=0, inplace=False)
  (fc): Linear(in_features=448, out_features=48, bias=True)
)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::div encountered 205 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::add_ encountered 58 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul encountered 463 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::mul_ encountered 90 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::add encountered 171 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::softmax encountered 24 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::sigmoid encountered 24 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator prim::PythonOp.CheckpointFunction encountered 72 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::pad encountered 38 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::unfold encountered 2 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::norm encountered 4 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::clamp_min encountered 4 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::expand_as encountered 4 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::diagonal encountered 36 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::gelu encountered 8 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::sum encountered 1 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  501: Unsupported operator aten::mean encountered 1 time(s)
[02/24 09:05:57][WARNING] jit_analysis.py:  513: The following submodules of the model were never called during the trace of the graph. They may be unused, or they were accessed by direct calls to .forward() or via other python methods. In the latter case they will have zeros for statistics, though their statistics will still contribute to their parent calling module.
fusion_model, visual.dropout, visual.side_network.resblocks.0.attn, visual.side_network.resblocks.0.attn.out_proj, visual.side_network.resblocks.0.conv, visual.side_network.resblocks.0.conv.0, visual.side_network.resblocks.0.conv.1, visual.side_network.resblocks.0.conv.2, visual.side_network.resblocks.0.mlp, visual.side_network.resblocks.0.mlp.act, visual.side_network.resblocks.0.mlp.drop, visual.side_network.resblocks.0.mlp.fc1, visual.side_network.resblocks.0.mlp.fc2, visual.side_network.resblocks.1.attn, visual.side_network.resblocks.1.attn.out_proj, visual.side_network.resblocks.1.conv, visual.side_network.resblocks.1.conv.0, visual.side_network.resblocks.1.conv.1, visual.side_network.resblocks.1.conv.2, visual.side_network.resblocks.1.mlp, visual.side_network.resblocks.1.mlp.act, visual.side_network.resblocks.1.mlp.drop, visual.side_network.resblocks.1.mlp.fc1, visual.side_network.resblocks.1.mlp.fc2, visual.side_network.resblocks.10.attn, visual.side_network.resblocks.10.attn.out_proj, visual.side_network.resblocks.10.conv, visual.side_network.resblocks.10.conv.0, visual.side_network.resblocks.10.conv.1, visual.side_network.resblocks.10.conv.2, visual.side_network.resblocks.10.mlp, visual.side_network.resblocks.10.mlp.act, visual.side_network.resblocks.10.mlp.drop, visual.side_network.resblocks.10.mlp.fc1, visual.side_network.resblocks.10.mlp.fc2, visual.side_network.resblocks.11.attn, visual.side_network.resblocks.11.attn.out_proj, visual.side_network.resblocks.11.conv, visual.side_network.resblocks.11.conv.0, visual.side_network.resblocks.11.conv.1, visual.side_network.resblocks.11.conv.2, visual.side_network.resblocks.11.mlp, visual.side_network.resblocks.11.mlp.act, visual.side_network.resblocks.11.mlp.drop, visual.side_network.resblocks.11.mlp.fc1, visual.side_network.resblocks.11.mlp.fc2, visual.side_network.resblocks.12.attn, visual.side_network.resblocks.12.attn.out_proj, visual.side_network.resblocks.12.conv, visual.side_network.resblocks.12.conv.0, visual.side_network.resblocks.12.conv.1, visual.side_network.resblocks.12.conv.2, visual.side_network.resblocks.12.mlp, visual.side_network.resblocks.12.mlp.act, visual.side_network.resblocks.12.mlp.drop, visual.side_network.resblocks.12.mlp.fc1, visual.side_network.resblocks.12.mlp.fc2, visual.side_network.resblocks.13.attn, visual.side_network.resblocks.13.attn.out_proj, visual.side_network.resblocks.13.conv, visual.side_network.resblocks.13.conv.0, visual.side_network.resblocks.13.conv.1, visual.side_network.resblocks.13.conv.2, visual.side_network.resblocks.13.mlp, visual.side_network.resblocks.13.mlp.act, visual.side_network.resblocks.13.mlp.drop, visual.side_network.resblocks.13.mlp.fc1, visual.side_network.resblocks.13.mlp.fc2, visual.side_network.resblocks.14.attn, visual.side_network.resblocks.14.attn.out_proj, visual.side_network.resblocks.14.conv, visual.side_network.resblocks.14.conv.0, visual.side_network.resblocks.14.conv.1, visual.side_network.resblocks.14.conv.2, visual.side_network.resblocks.14.mlp, visual.side_network.resblocks.14.mlp.act, visual.side_network.resblocks.14.mlp.drop, visual.side_network.resblocks.14.mlp.fc1, visual.side_network.resblocks.14.mlp.fc2, visual.side_network.resblocks.15.attn, visual.side_network.resblocks.15.attn.out_proj, visual.side_network.resblocks.15.conv, visual.side_network.resblocks.15.conv.0, visual.side_network.resblocks.15.conv.1, visual.side_network.resblocks.15.conv.2, visual.side_network.resblocks.15.mlp, visual.side_network.resblocks.15.mlp.act, visual.side_network.resblocks.15.mlp.drop, visual.side_network.resblocks.15.mlp.fc1, visual.side_network.resblocks.15.mlp.fc2, visual.side_network.resblocks.16.attn, visual.side_network.resblocks.16.attn.out_proj, visual.side_network.resblocks.16.conv, visual.side_network.resblocks.16.conv.0, visual.side_network.resblocks.16.conv.1, visual.side_network.resblocks.16.conv.2, visual.side_network.resblocks.16.mlp, visual.side_network.resblocks.16.mlp.act, visual.side_network.resblocks.16.mlp.drop, visual.side_network.resblocks.16.mlp.fc1, visual.side_network.resblocks.16.mlp.fc2, visual.side_network.resblocks.17.attn, visual.side_network.resblocks.17.attn.out_proj, visual.side_network.resblocks.17.conv, visual.side_network.resblocks.17.conv.0, visual.side_network.resblocks.17.conv.1, visual.side_network.resblocks.17.conv.2, visual.side_network.resblocks.17.mlp, visual.side_network.resblocks.17.mlp.act, visual.side_network.resblocks.17.mlp.drop, visual.side_network.resblocks.17.mlp.fc1, visual.side_network.resblocks.17.mlp.fc2, visual.side_network.resblocks.18.attn, visual.side_network.resblocks.18.attn.out_proj, visual.side_network.resblocks.18.conv, visual.side_network.resblocks.18.conv.0, visual.side_network.resblocks.18.conv.1, visual.side_network.resblocks.18.conv.2, visual.side_network.resblocks.18.mlp, visual.side_network.resblocks.18.mlp.act, visual.side_network.resblocks.18.mlp.drop, visual.side_network.resblocks.18.mlp.fc1, visual.side_network.resblocks.18.mlp.fc2, visual.side_network.resblocks.19.attn, visual.side_network.resblocks.19.attn.out_proj, visual.side_network.resblocks.19.conv, visual.side_network.resblocks.19.conv.0, visual.side_network.resblocks.19.conv.1, visual.side_network.resblocks.19.conv.2, visual.side_network.resblocks.19.mlp, visual.side_network.resblocks.19.mlp.act, visual.side_network.resblocks.19.mlp.drop, visual.side_network.resblocks.19.mlp.fc1, visual.side_network.resblocks.19.mlp.fc2, visual.side_network.resblocks.2.attn, visual.side_network.resblocks.2.attn.out_proj, visual.side_network.resblocks.2.conv, visual.side_network.resblocks.2.conv.0, visual.side_network.resblocks.2.conv.1, visual.side_network.resblocks.2.conv.2, visual.side_network.resblocks.2.mlp, visual.side_network.resblocks.2.mlp.act, visual.side_network.resblocks.2.mlp.drop, visual.side_network.resblocks.2.mlp.fc1, visual.side_network.resblocks.2.mlp.fc2, visual.side_network.resblocks.20.attn, visual.side_network.resblocks.20.attn.out_proj, visual.side_network.resblocks.20.conv, visual.side_network.resblocks.20.conv.0, visual.side_network.resblocks.20.conv.1, visual.side_network.resblocks.20.conv.2, visual.side_network.resblocks.20.mlp, visual.side_network.resblocks.20.mlp.act, visual.side_network.resblocks.20.mlp.drop, visual.side_network.resblocks.20.mlp.fc1, visual.side_network.resblocks.20.mlp.fc2, visual.side_network.resblocks.21.attn, visual.side_network.resblocks.21.attn.out_proj, visual.side_network.resblocks.21.conv, visual.side_network.resblocks.21.conv.0, visual.side_network.resblocks.21.conv.1, visual.side_network.resblocks.21.conv.2, visual.side_network.resblocks.21.mlp, visual.side_network.resblocks.21.mlp.act, visual.side_network.resblocks.21.mlp.drop, visual.side_network.resblocks.21.mlp.fc1, visual.side_network.resblocks.21.mlp.fc2, visual.side_network.resblocks.22.attn, visual.side_network.resblocks.22.attn.out_proj, visual.side_network.resblocks.22.conv, visual.side_network.resblocks.22.conv.0, visual.side_network.resblocks.22.conv.1, visual.side_network.resblocks.22.conv.2, visual.side_network.resblocks.22.mlp, visual.side_network.resblocks.22.mlp.act, visual.side_network.resblocks.22.mlp.drop, visual.side_network.resblocks.22.mlp.fc1, visual.side_network.resblocks.22.mlp.fc2, visual.side_network.resblocks.23.attn, visual.side_network.resblocks.23.attn.out_proj, visual.side_network.resblocks.23.conv, visual.side_network.resblocks.23.conv.0, visual.side_network.resblocks.23.conv.1, visual.side_network.resblocks.23.conv.2, visual.side_network.resblocks.23.mlp, visual.side_network.resblocks.23.mlp.act, visual.side_network.resblocks.23.mlp.drop, visual.side_network.resblocks.23.mlp.fc1, visual.side_network.resblocks.23.mlp.fc2, visual.side_network.resblocks.3.attn, visual.side_network.resblocks.3.attn.out_proj, visual.side_network.resblocks.3.conv, visual.side_network.resblocks.3.conv.0, visual.side_network.resblocks.3.conv.1, visual.side_network.resblocks.3.conv.2, visual.side_network.resblocks.3.mlp, visual.side_network.resblocks.3.mlp.act, visual.side_network.resblocks.3.mlp.drop, visual.side_network.resblocks.3.mlp.fc1, visual.side_network.resblocks.3.mlp.fc2, visual.side_network.resblocks.4.attn, visual.side_network.resblocks.4.attn.out_proj, visual.side_network.resblocks.4.conv, visual.side_network.resblocks.4.conv.0, visual.side_network.resblocks.4.conv.1, visual.side_network.resblocks.4.conv.2, visual.side_network.resblocks.4.mlp, visual.side_network.resblocks.4.mlp.act, visual.side_network.resblocks.4.mlp.drop, visual.side_network.resblocks.4.mlp.fc1, visual.side_network.resblocks.4.mlp.fc2, visual.side_network.resblocks.5.attn, visual.side_network.resblocks.5.attn.out_proj, visual.side_network.resblocks.5.conv, visual.side_network.resblocks.5.conv.0, visual.side_network.resblocks.5.conv.1, visual.side_network.resblocks.5.conv.2, visual.side_network.resblocks.5.mlp, visual.side_network.resblocks.5.mlp.act, visual.side_network.resblocks.5.mlp.drop, visual.side_network.resblocks.5.mlp.fc1, visual.side_network.resblocks.5.mlp.fc2, visual.side_network.resblocks.6.attn, visual.side_network.resblocks.6.attn.out_proj, visual.side_network.resblocks.6.conv, visual.side_network.resblocks.6.conv.0, visual.side_network.resblocks.6.conv.1, visual.side_network.resblocks.6.conv.2, visual.side_network.resblocks.6.mlp, visual.side_network.resblocks.6.mlp.act, visual.side_network.resblocks.6.mlp.drop, visual.side_network.resblocks.6.mlp.fc1, visual.side_network.resblocks.6.mlp.fc2, visual.side_network.resblocks.7.attn, visual.side_network.resblocks.7.attn.out_proj, visual.side_network.resblocks.7.conv, visual.side_network.resblocks.7.conv.0, visual.side_network.resblocks.7.conv.1, visual.side_network.resblocks.7.conv.2, visual.side_network.resblocks.7.mlp, visual.side_network.resblocks.7.mlp.act, visual.side_network.resblocks.7.mlp.drop, visual.side_network.resblocks.7.mlp.fc1, visual.side_network.resblocks.7.mlp.fc2, visual.side_network.resblocks.8.attn, visual.side_network.resblocks.8.attn.out_proj, visual.side_network.resblocks.8.conv, visual.side_network.resblocks.8.conv.0, visual.side_network.resblocks.8.conv.1, visual.side_network.resblocks.8.conv.2, visual.side_network.resblocks.8.mlp, visual.side_network.resblocks.8.mlp.act, visual.side_network.resblocks.8.mlp.drop, visual.side_network.resblocks.8.mlp.fc1, visual.side_network.resblocks.8.mlp.fc2, visual.side_network.resblocks.9.attn, visual.side_network.resblocks.9.attn.out_proj, visual.side_network.resblocks.9.conv, visual.side_network.resblocks.9.conv.0, visual.side_network.resblocks.9.conv.1, visual.side_network.resblocks.9.conv.2, visual.side_network.resblocks.9.mlp, visual.side_network.resblocks.9.mlp.act, visual.side_network.resblocks.9.mlp.drop, visual.side_network.resblocks.9.mlp.fc1, visual.side_network.resblocks.9.mlp.fc2, visual.transformer.resblocks.0.attn.out_proj, visual.transformer.resblocks.1.attn.out_proj, visual.transformer.resblocks.10.attn.out_proj, visual.transformer.resblocks.11.attn.out_proj, visual.transformer.resblocks.12.attn.out_proj, visual.transformer.resblocks.13.attn.out_proj, visual.transformer.resblocks.14.attn.out_proj, visual.transformer.resblocks.15.attn.out_proj, visual.transformer.resblocks.16.attn.out_proj, visual.transformer.resblocks.17.attn.out_proj, visual.transformer.resblocks.18.attn.out_proj, visual.transformer.resblocks.19.attn.out_proj, visual.transformer.resblocks.2.attn.out_proj, visual.transformer.resblocks.20.attn.out_proj, visual.transformer.resblocks.21.attn.out_proj, visual.transformer.resblocks.22.attn.out_proj, visual.transformer.resblocks.23.attn.out_proj, visual.transformer.resblocks.3.attn.out_proj, visual.transformer.resblocks.4.attn.out_proj, visual.transformer.resblocks.5.attn.out_proj, visual.transformer.resblocks.6.attn.out_proj, visual.transformer.resblocks.7.attn.out_proj, visual.transformer.resblocks.8.attn.out_proj, visual.transformer.resblocks.9.attn.out_proj
[02/24 09:05:57][INFO] utils.py:  458: Flops: 2.732T
[02/24 09:05:57][INFO] utils.py:  460: Params: 385.400M, tunable Params: 385.400M
[02/24 09:05:58][INFO] test_vision.py:  278: load model: epoch 21
[02/24 09:06:18][INFO] test_vision.py:  385: Test: [0/83], average 0.7226 sec/video 	Prec@1 91.667 (91.667)	Prec@5 95.833 (95.833)	mPrec@1 34.722	mPrec@5 36.806
[02/24 09:07:50][INFO] test_vision.py:  385: Test: [10/83], average 0.4157 sec/video 	Prec@1 95.833 (92.803)	Prec@5 100.000 (98.864)	mPrec@1 75.542	mPrec@5 80.663
[02/24 09:09:23][INFO] test_vision.py:  385: Test: [20/83], average 0.4017 sec/video 	Prec@1 83.333 (92.460)	Prec@5 95.833 (98.810)	mPrec@1 80.199	mPrec@5 86.897
[02/24 09:10:56][INFO] test_vision.py:  385: Test: [30/83], average 0.3969 sec/video 	Prec@1 83.333 (91.935)	Prec@5 95.833 (98.656)	mPrec@1 84.393	mPrec@5 90.923
[02/24 09:12:28][INFO] test_vision.py:  385: Test: [40/83], average 0.3945 sec/video 	Prec@1 87.500 (92.480)	Prec@5 95.833 (98.577)	mPrec@1 85.115	mPrec@5 90.617
[02/24 09:14:01][INFO] test_vision.py:  385: Test: [50/83], average 0.3930 sec/video 	Prec@1 95.833 (92.810)	Prec@5 100.000 (98.775)	mPrec@1 84.142	mPrec@5 90.708
[02/24 09:15:34][INFO] test_vision.py:  385: Test: [60/83], average 0.3920 sec/video 	Prec@1 95.833 (93.033)	Prec@5 100.000 (98.975)	mPrec@1 86.137	mPrec@5 92.932
[02/24 09:17:07][INFO] test_vision.py:  385: Test: [70/83], average 0.3913 sec/video 	Prec@1 95.833 (92.958)	Prec@5 100.000 (99.002)	mPrec@1 88.145	mPrec@5 95.015
[02/24 09:18:40][INFO] test_vision.py:  385: Test: [80/83], average 0.3908 sec/video 	Prec@1 95.833 (92.644)	Prec@5 95.833 (99.023)	mPrec@1 88.546	mPrec@5 97.120
[02/24 09:18:54][INFO] test_vision.py:  398: -----Evaluation is finished------
[02/24 09:18:54][INFO] test_vision.py:  404: Overall Prec@1 92.713% Prec@5 99.038%	mPrec@1 (88.545)	mPrec@5 (97.125)
[02/24 09:18:54][INFO] test_vision.py:  298: Per-class accuracies saved to ./exp/s4v_selfy_vitl14_32x224_diving48_run4/per_class_accuracies.txt
[02/24 09:18:54][INFO] test_vision.py:  308: Per-sample results saved to ./exp/s4v_selfy_vitl14_32x224_diving48_run4/per_sample_results.txt
