2023-12-21 14:26:15   INFO  **********************Start logging**********************
2023-12-21 14:26:15   INFO  CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
2023-12-21 14:26:15   INFO  total_batch_size: 8
2023-12-21 14:26:15   INFO  cfg_file         ./cfgs/picture_models/picture_nuscenes_occupancy.yaml
2023-12-21 14:26:15   INFO  batch_size       1
2023-12-21 14:26:15   INFO  epochs           24
2023-12-21 14:26:15   INFO  workers          4
2023-12-21 14:26:15   INFO  extra_tag        default
2023-12-21 14:26:15   INFO  ckpt             None
2023-12-21 14:26:15   INFO  pretrained_model nuscenes_pretrain_model.pth
2023-12-21 14:26:15   INFO  launcher         pytorch
2023-12-21 14:26:15   INFO  tcp_port         18888
2023-12-21 14:26:15   INFO  sync_bn          True
2023-12-21 14:26:15   INFO  fix_random_seed  False
2023-12-21 14:26:15   INFO  ckpt_save_interval 20
2023-12-21 14:26:15   INFO  local_rank       0
2023-12-21 14:26:15   INFO  max_ckpt_save_num 30
2023-12-21 14:26:15   INFO  merge_all_iters_to_one_epoch False
2023-12-21 14:26:15   INFO  set_cfgs         None
2023-12-21 14:26:15   INFO  max_waiting_mins 0
2023-12-21 14:26:15   INFO  start_epoch      0
2023-12-21 14:26:15   INFO  num_epochs_to_eval 0
2023-12-21 14:26:15   INFO  save_to_file     False
2023-12-21 14:26:15   INFO  use_tqdm_to_record False
2023-12-21 14:26:15   INFO  logger_iter_interval 50
2023-12-21 14:26:15   INFO  ckpt_save_time_interval 300
2023-12-21 14:26:15   INFO  wo_gpu_stat      False
2023-12-21 14:26:15   INFO  fp16             False
2023-12-21 14:26:15   INFO  cfg.ROOT_DIR: xxxxxxxxxxxxx
2023-12-21 14:26:15   INFO  cfg.LOCAL_RANK: 0
2023-12-21 14:26:15   INFO  cfg.CLASS_NAMES: ['car', 'truck', 'construction_vehicle', 'bus', 'trailer', 'barrier', 'motorcycle', 'bicycle', 'pedestrian', 'traffic_cone']
2023-12-21 14:26:15   INFO  
cfg.DATA_CONFIG = edict()
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.DATASET: NuScenesOccDataset
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.DATA_PATH: ../data/nuscenes
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.OCC_PATH: '../data/nuscenes/nuScenes-Occupancy'
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.VERSION: v1.0-trainval
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.PRED_VELOCITY: True
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.SET_NAN_VELOCITY_TO_ZEROS: True
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.FILTER_MIN_POINTS_IN_GT: 1
2023-12-21 14:26:15   INFO  
cfg.DATA_CONFIG.DATA_SPLIT = edict()
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.DATA_SPLIT.train: train
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.DATA_SPLIT.test: val
2023-12-21 14:26:15   INFO  
cfg.DATA_CONFIG.INFO_PATH = edict()
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.INFO_PATH.train: ['nuscenes_occ_infos_train.pkl']
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.INFO_PATH.test: ['nuscenes_occ_infos_val.pkl']
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.POINT_CLOUD_RANGE: [-51.2, -51.2, -5.0, 51.2, 51.2, 3.0]
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.OCC_SIZE: [ 512, 512, 40 ]
2023-12-21 14:26:15   INFO  
cfg.DATA_CONFIG.DATA_AUGMENTOR = edict()
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.DISABLE_AUG_LIST: ['placeholder']
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.AUG_CONFIG_LIST: [{'NAME': 'random_world_flip', 'ALONG_AXIS_LIST': ['x', 'y']}, {'NAME': 'random_world_rotation', 'WORLD_ROT_ANGLE': [-0.78539816, 0.78539816]}, {'NAME': 'random_world_scaling', 'WORLD_SCALE_RANGE': [0.9, 1.1]}, {'NAME': 'random_world_translation', 'NOISE_TRANSLATE_STD': [0.5, 0.5, 0.5]}]
2023-12-21 14:26:15   INFO  
cfg.DATA_CONFIG.POINT_FEATURE_ENCODING = edict()
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.encoding_type: absolute_coordinates_encoding
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.used_feature_list: ['x', 'y', 'z', 'intensity', 'timestamp']
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.src_feature_list: ['x', 'y', 'z', 'intensity', 'timestamp']
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG.DATA_PROCESSOR: [{'NAME': 'mask_points_and_boxes_outside_rangeV2', 'REMOVE_OUTSIDE_BOXES': True}, {'NAME': 'shuffle_points', 'SHUFFLE_ENABLED': {'train': True, 'test': True}}, {'NAME': 'transform_points_to_voxels_placeholder', 'VOXEL_SIZE': [0.3, 0.3, 8.0]}]
2023-12-21 14:26:15   INFO  cfg.DATA_CONFIG._BASE_CONFIG_: cfgs/dataset_configs/nuscenes_occ_dataset.yaml
2023-12-21 14:26:15   INFO  
cfg.MODEL = edict()
2023-12-21 14:26:15   INFO  cfg.MODEL.NAME: OccNet
2023-12-21 14:26:15   INFO  
cfg.MODEL.VFE = edict()
2023-12-21 14:26:15   INFO  cfg.MODEL.VFE.NAME: HardSimpleVFE
2023-12-21 14:26:15   INFO  cfg.MODEL.VFE.NUM_FEATURES: 5
2023-12-21 14:26:15   INFO  
cfg.MODEL.BACKBONE_3D = edict()
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.NAME: DSVT
2023-12-21 14:26:15   INFO  
cfg.MODEL.BACKBONE_3D.INPUT_LAYER = edict()
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.sparse_shape: [ 512, 512, 40 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.downsample_stride: [ [ 1, 1, 4 ], [ 1, 1, 4 ], [ 1, 1, 2 ] ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.d_model: [ 256, 256, 256, 256 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.set_info: [[90, 4]]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.window_shape: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.hybrid_factor: [ [ 12, 12, 32 ], [ 12, 12, 8 ], [ 12, 12, 2 ], [ 12, 12, 1 ] ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.shifts_list: [ [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ] ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.normalize_pos: False
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.block_name: [ 'DSVTBlock','DSVTBlock','DSVTBlock','DSVTBlock' ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.d_model: [ 256, 256, 256, 256 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.nhead: [ 8, 8, 8, 8 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [ 384, 384, 384, 384 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.activation: 'attention'
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [ 512, 512 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 256
2023-12-21 14:26:15   INFO  
cfg.MODEL.BACKBONE_2D = edict()
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_2D.NAME: BaseBEVResBackbone
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_2D.LAYER_NUMS: [ 3, 5, 5, 5 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_2D.LAYER_STRIDES: [ 1, 2, 2, 4 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_2D.NUM_FILTERS: [ 80, 160, 320, 640 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_2D.UPSAMPLE_STRIDES: [ 1, 2, 4, 4 ]
2023-12-21 14:26:15   INFO  cfg.MODEL.BACKBONE_2D.NUM_UPSAMPLE_FILTERS: [ 256, 256, 256, 256 ]
2023-12-21 14:26:15   INFO  
cfg.MODEL.DENSE_HEAD = edict()
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.CLASS_AGNOSTIC: False
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.NAME: OccHead
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.SOFT_WEIGHTS: True
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.SAMPLE_FROM_VOXEL: False
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.SAMPLE_FROM_IMG: False
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.FINAL_OCC_SIZE: [512, 512, 40]
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.FINE_TOPK: 15000
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.EMPTY_IDX: 0
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.OUT_CHANNEL: 17
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.NUM_LEVEL: 4
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.HIDDEN_CHANNEL: 256
2023-12-21 14:26:15   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS: {'voxel_ce_weight': 1.0, 'voxel_sem_scal_weight': 1.0, 'voxel_geo_scal_weight': 1.0, 'voxel_lovasz_weight': 1.0,}
2023-12-21 14:26:15   INFO  
cfg.OPTIMIZATION = edict()
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.BATCH_SIZE_PER_GPU: 1
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.NUM_EPOCHS: 24
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.OPTIMIZER: adamw
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.LR: 0.0003
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.WEIGHT_DECAY: 0.01
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.MOMENTUM: 0.9
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.MOMS: [0.95, 0.85]
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.PCT_START: 0.4
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.DIV_FACTOR: 10
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.DECAY_STEP_LIST: [35, 45]
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.LR_DECAY: 0.1
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.LR_CLIP: 1e-07
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.LR_WARMUP: False
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.WARMUP_EPOCH: 1
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.GRAD_NORM_CLIP: 35
2023-12-21 14:26:15   INFO  cfg.OPTIMIZATION.LOSS_SCALE_FP16: 4.0
2023-12-21 14:26:15   INFO  
cfg.HOOK = edict()
2023-12-21 14:26:15   INFO  
cfg.HOOK.DisableAugmentationHook = edict()
2023-12-21 14:26:15   INFO  cfg.TAG: picture_nuscenes_occupancy
2023-12-21 14:26:15   INFO  cfg.EXP_GROUP_PATH: cfgs/picture_models
2023-12-21 14:26:15   INFO  Loading GT database to shared memory
2023-12-21 14:26:24   INFO  GT database has been saved to shared memory
2023-12-21 14:26:24   INFO  Loading NuScenes dataset
2023-12-21 14:26:32   INFO  Total samples for NuScenes dataset: 28130
2023-12-21 14:26:32   INFO  DistributedDataParallel(
  (module): TransFusion(
    (vfe): DynamicPillarVFE(
      (pfn_layers): ModuleList(
        (0): PFNLayerV2(
          (linear): Linear(in_features=11, out_features=64, bias=False)
          (norm): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
        (1): PFNLayerV2(
          (linear): Linear(in_features=256, out_features=256, bias=False)
          (norm): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
      )
    )
    (backbone_3d): DSVT(
      (input_layer): DSVTInputLayer(
        (posembed_layers): ModuleList(
          (0): ModuleList(
            (0): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
            )
            (1): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
            )
            (2): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
            )
            (3): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
            )
          )
        )
      )
      (stage_0): ModuleList(
        (0): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (1): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (2): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (3): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
      )
      (residual_norm_stage_0): ModuleList(
        (0): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
      )
    )
    (map_to_bev_module): None
    (pfe): None
    (backbone_2d): BaseBEVResBackbone(
      (blocks): ModuleList(
        (0): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(256, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
              (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
        (1): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(256, 256, kernel_size=(1, 1), stride=(2, 2), bias=False)
              (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
          (2): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
        (2): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(256, 256, kernel_size=(1, 1), stride=(2, 2), bias=False)
              (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
          (2): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
      )
      (deblocks): ModuleList(
        (0): Sequential(
          (0): Conv2d(256, 256, kernel_size=(2, 2), stride=(2, 2), bias=False)
          (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (1): Sequential(
          (0): ConvTranspose2d(256, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (2): Sequential(
          (0): ConvTranspose2d(256, 256, kernel_size=(2, 2), stride=(2, 2), bias=False)
          (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
      )
    )
    (dense_head): OccHead(
      (img_mlp_0): Sequential(
        (0): Conv2d(512, 256, kernel_size=(2, 2), stride=(1, 1), bias=False)
        (1): GroupNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (2): ReLU()
        )
      (img_mlp) = Sequential(
        (0): Conv2d(256, 64, kernel_size=(2, 2), stride=(1, 1), bias=False)
        (1): BatchNorm2d(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (2): ReLU()
        )
      
      (fine_mlp) = Sequential(
        (0): Conv2d(128, 64, kernel_size=(2, 2), stride=(1, 1), bias=False)
        (1): BatchNorm2d(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (2): ReLU()
        (3): Linear(64, 20)
        )
      (occ_convs): ModuleList(
        (0): Sequential(
          (0): Conv2d(512, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (1): BatchNorm2d(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (1): Sequential(
          (0): Conv2d(256, 128, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (1): BatchNorm2d(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (2): Sequential(
          (0): Conv2d(128, 64, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (1): BatchNorm2d(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (3): Sequential(
          (0): Conv2d(64, 64, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (1): BatchNorm2d(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
      )
      (occ_pred_conv): Sequential(
        (0): Conv2d(512, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
        (1): BatchNorm2d(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (2): ReLU()
        (3): Conv2d(256, 64, kernel_size=(1, 1), stride=(1, 1), bias=False)
        )
      (voxel_soft_weights): Sequential(
        (0): Conv2d(512, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
        (1): BatchNorm2d(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (2): ReLU()
        (3): Conv2d(256, 64, kernel_size=(1, 1), stride=(1, 1), bias=False)
        )
      )
    )
    (point_head): None
    (roi_head): None
  )
2023-12-21 14:27:28   INFO  Total number of parameters: 11576884
2023-12-21 14:27:28   INFO  **********************Start training cfgs/picture_models/picture_nuscenes_occupancy(default)**********************
2023-12-21 14:29:10   INFO  epoch: 0/24, acc_iter=50, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:02:56, time_cost(all): 0:00:55/1 day, 1:48:37, loss=3.151585027718938, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=4.920996559985247, lr=0.000379968723343759
2023-12-21 14:30:06   INFO  epoch: 0/24, acc_iter=100, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:05:23, time_cost(all): 0:01:51/1 day, 2:41:41, loss=3.017291078891625, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=0.5550811526290303, lr=0.000459937446687518
2023-12-21 14:31:02   INFO  epoch: 0/24, acc_iter=150, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:03:39, time_cost(all): 0:02:47/1 day, 2:59:03, loss=2.882997130064313, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=0.7606677235302559, lr=0.000539906170031277
2023-12-21 14:31:58   INFO  epoch: 0/24, acc_iter=200, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:03:23, time_cost(all): 0:03:43/1 day, 2:02:20, loss=2.748703181237, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=1.9017041071433491, lr=0.000619874893375035
2023-12-21 14:32:53   INFO  epoch: 0/24, acc_iter=250, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:01:57, time_cost(all): 0:04:38/1 day, 1:34:54, loss=2.614409232409688, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=3.0621988482709064, lr=0.000699843616718794
2023-12-21 14:33:49   INFO  epoch: 0/24, acc_iter=300, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:59:27, time_cost(all): 0:05:34/1 day, 0:57:03, loss=2.480115283582375, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=4.3806406800696545, lr=0.000779812340062553
2023-12-21 14:34:45   INFO  epoch: 0/24, acc_iter=350, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/1:01:27, time_cost(all): 0:06:30/1 day, 2:05:01, loss=2.345821334755063, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=0.5110732218197576, lr=0.000859781063406312
2023-12-21 14:35:41   INFO  epoch: 0/24, acc_iter=400, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:59:18, time_cost(all): 0:07:26/1 day, 2:48:54, loss=2.21152738592775, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=1.834941605716509, lr=0.000939749786750071
2023-12-21 14:36:36   INFO  epoch: 0/24, acc_iter=450, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:55:28, time_cost(all): 0:08:21/1 day, 1:44:13, loss=2.077233437100438, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=3.36421018267916, lr=0.00101971851009383
2023-12-21 14:37:32   INFO  epoch: 0/24, acc_iter=500, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:58:41, time_cost(all): 0:09:17/1 day, 2:19:42, loss=1.942939488273125, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=3.547587720253264, lr=0.001099687233437589
2023-12-21 14:38:28   INFO  epoch: 0/24, acc_iter=550, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:57:23, time_cost(all): 0:10:13/1 day, 0:45:54, loss=1.808645539445813, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=2.381137474751167, lr=0.001179655956781347
2023-12-21 14:39:24   INFO  epoch: 0/24, acc_iter=600, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:56:09, time_cost(all): 0:11:09/1 day, 1:43:25, loss=1.6743515906185, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=3.725713110848736, lr=0.001259624680125106
2023-12-21 14:40:19   INFO  epoch: 0/24, acc_iter=650, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:55:39, time_cost(all): 0:12:04/1 day, 1:08:03, loss=1.540057641791188, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=4.566954589522392, lr=0.001339593403468865
2023-12-21 14:41:15   INFO  epoch: 0/24, acc_iter=700, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:53:04, time_cost(all): 0:13:00/1 day, 2:39:33, loss=1.405763692963875, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=2.44796724350645, lr=0.001419562126812624
2023-12-21 14:42:11   INFO  epoch: 0/24, acc_iter=750, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:50:28, time_cost(all): 0:13:56/1 day, 2:19:12, loss=1.271469744136563, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=2.608476311008557, lr=0.001499530850156383
2023-12-21 14:43:07   INFO  epoch: 0/24, acc_iter=800, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:49:05, time_cost(all): 0:14:52/1 day, 1:54:56, loss=1.13717579530925, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=0.7348020197240495, lr=0.001579499573500142
2023-12-21 14:44:03   INFO  epoch: 0/24, acc_iter=850, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:50:58, time_cost(all): 0:15:48/1 day, 2:21:10, loss=1.002881846481938, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=3.6875476376031218, lr=0.001659468296843901
2023-12-21 14:44:58   INFO  epoch: 0/24, acc_iter=900, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:45, time_cost(all): 0:16:43/1 day, 1:42:11, loss=0.868587897654625, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=2.305082370896255, lr=0.00173943702018766
2023-12-21 14:45:54   INFO  epoch: 0/24, acc_iter=950, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:49:59, time_cost(all): 0:17:39/1 day, 2:04:53, loss=0.734293948827313, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=0.6155408663943308, lr=0.001819405743531418
2023-12-21 14:46:50   INFO  epoch: 0/24, acc_iter=1000, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:45:22, time_cost(all): 0:18:35/1 day, 2:25:11, loss=0.622335926394831, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=3.2157212114886753, lr=0.001899374466875178
2023-12-21 14:47:46   INFO  epoch: 0/24, acc_iter=1050, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:47:49, time_cost(all): 0:19:31/1 day, 0:34:32, loss=0.599803708680657, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=2.5879552833833186, lr=0.001979343190218936
2023-12-21 14:48:41   INFO  epoch: 0/24, acc_iter=1100, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:48, time_cost(all): 0:20:26/1 day, 1:03:15, loss=0.599607417361313, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=4.168241343064837, lr=0.002059311913562695
2023-12-21 14:49:37   INFO  epoch: 0/24, acc_iter=1150, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:43:21, time_cost(all): 0:21:22/1 day, 1:39:35, loss=0.59941112604197, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=3.4416723988573708, lr=0.002139280636906454
2023-12-21 14:50:33   INFO  epoch: 0/24, acc_iter=1200, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:42:10, time_cost(all): 0:22:18/1 day, 2:27:35, loss=0.599214834722626, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=4.8063806062826115, lr=0.002219249360250213
2023-12-21 14:51:29   INFO  epoch: 0/24, acc_iter=1250, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:40:34, time_cost(all): 0:23:14/1 day, 2:46:30, loss=0.599018543403283, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=2.70721198648687, lr=0.002299218083593972
2023-12-21 14:52:24   INFO  epoch: 0/24, acc_iter=1300, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:40:40, time_cost(all): 0:24:09/1 day, 2:57:05, loss=0.598822252083939, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=0.6234903354726813, lr=0.00237918680693773
2023-12-21 14:53:20   INFO  epoch: 0/24, acc_iter=1350, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:42, time_cost(all): 0:25:05/1 day, 1:51:16, loss=0.598625960764596, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=1.1575697079866998, lr=0.002459155530281489
2023-12-21 14:54:16   INFO  epoch: 0/24, acc_iter=1400, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:20, time_cost(all): 0:26:01/1 day, 1:10:15, loss=0.598429669445253, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=4.626751939053618, lr=0.002539124253625248
2023-12-21 14:55:12   INFO  epoch: 0/24, acc_iter=1450, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:39:21, time_cost(all): 0:26:57/1 day, 2:07:21, loss=0.598233378125909, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=4.424092571056923, lr=0.002619092976969007
2023-12-21 14:56:08   INFO  epoch: 0/24, acc_iter=1500, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:36:38, time_cost(all): 0:27:53/1 day, 1:55:28, loss=0.598037086806566, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=4.783354178967227, lr=0.002699061700312766
2023-12-21 14:57:03   INFO  epoch: 0/24, acc_iter=1550, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:37:53, time_cost(all): 0:28:48/1 day, 1:07:02, loss=0.597840795487222, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=2.446614845156388, lr=0.002779030423656525
2023-12-21 14:57:59   INFO  epoch: 0/24, acc_iter=1600, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:35:28, time_cost(all): 0:29:44/1 day, 0:46:48, loss=0.597644504167879, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=0.7475515468500301, lr=0.002858999147000284
2023-12-21 14:58:55   INFO  epoch: 0/24, acc_iter=1650, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:36:25, time_cost(all): 0:30:40/1 day, 2:48:45, loss=0.597448212848535, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=2.3367585033123275, lr=0.002938967870344043
2023-12-21 14:59:51   INFO  epoch: 0/24, acc_iter=1700, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:33:16, time_cost(all): 0:31:36/1 day, 1:07:04, loss=0.597251921529192, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=4.96759065913986, lr=0.003047341484219505
2023-12-21 15:00:46   INFO  epoch: 0/24, acc_iter=1750, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:46, time_cost(all): 0:32:31/1 day, 0:32:40, loss=0.597055630209849, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=3.7284690849610023, lr=0.003247263292578902
2023-12-21 15:01:42   INFO  epoch: 0/24, acc_iter=1800, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:32:36, time_cost(all): 0:33:27/1 day, 0:21:49, loss=0.596859338890505, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=4.499663418738612, lr=0.003447185100938299
2023-12-21 15:02:38   INFO  epoch: 0/24, acc_iter=1850, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:42, time_cost(all): 0:34:23/1 day, 2:45:07, loss=0.596663047571162, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=2.060873873922855, lr=0.003647106909297696
2023-12-21 15:03:34   INFO  epoch: 0/24, acc_iter=1900, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:15, time_cost(all): 0:35:19/1 day, 1:23:19, loss=0.596466756251818, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=4.521290592119133, lr=0.003847028717657093
2023-12-21 15:04:29   INFO  epoch: 0/24, acc_iter=1950, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:27:57, time_cost(all): 0:36:14/1 day, 0:28:00, loss=0.596270464932475, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=2.0464433672945166, lr=0.00404695052601649
2023-12-21 15:05:25   INFO  epoch: 0/24, acc_iter=2000, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:28:20, time_cost(all): 0:37:10/1 day, 1:13:40, loss=0.596074173613131, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=1.3443989261988318, lr=0.004246872334375888
2023-12-21 15:06:21   INFO  epoch: 0/24, acc_iter=2050, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:47, time_cost(all): 0:38:06/1 day, 0:29:18, loss=0.595877882293788, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=0.5393254890804415, lr=0.004446794142735285
2023-12-21 15:07:17   INFO  epoch: 0/24, acc_iter=2100, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:49, time_cost(all): 0:39:02/1 day, 2:45:23, loss=0.595681590974445, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=1.5854679527017987, lr=0.004646715951094682
2023-12-21 15:08:13   INFO  epoch: 0/24, acc_iter=2150, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:44, time_cost(all): 0:39:58/1 day, 1:47:13, loss=0.595485299655101, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=2.249087945946666, lr=0.004846637759454079
2023-12-21 15:09:08   INFO  epoch: 0/24, acc_iter=2200, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:10, time_cost(all): 0:40:53/1 day, 2:35:35, loss=0.595289008335758, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=2.013274332891161, lr=0.005046559567813476
2023-12-21 15:10:04   INFO  epoch: 0/24, acc_iter=2250, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:23:32, time_cost(all): 0:41:49/1 day, 1:22:00, loss=0.595092717016414, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=4.612400423766042, lr=0.005246481376172873
2023-12-21 15:11:00   INFO  epoch: 0/24, acc_iter=2300, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:23:17, time_cost(all): 0:42:45/1 day, 1:26:44, loss=0.594896425697071, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=4.01046606331812, lr=0.00544640318453227
2023-12-21 15:11:56   INFO  epoch: 0/24, acc_iter=2350, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:33, time_cost(all): 0:43:41/1 day, 1:57:54, loss=0.594700134377727, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=2.4564428847729487, lr=0.005646324992891668
2023-12-21 15:12:51   INFO  epoch: 0/24, acc_iter=2400, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:21:30, time_cost(all): 0:44:36/1 day, 2:31:44, loss=0.594503843058384, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=3.8662380907420775, lr=0.005846246801251065
2023-12-21 15:13:47   INFO  epoch: 0/24, acc_iter=2450, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:04, time_cost(all): 0:45:32/1 day, 1:03:16, loss=0.59430755173904, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=4.390965061995251, lr=0.006046168609610462
2023-12-21 15:14:43   INFO  epoch: 0/24, acc_iter=2500, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:30, time_cost(all): 0:46:28/1 day, 1:10:05, loss=0.594111260419697, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=0.8047143717957788, lr=0.006246090417969859
2023-12-21 15:15:39   INFO  epoch: 0/24, acc_iter=2550, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:28, time_cost(all): 0:47:24/1 day, 1:08:09, loss=0.593914969100354, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=3.536847308520062, lr=0.006446012226329257
2023-12-21 15:16:34   INFO  epoch: 0/24, acc_iter=2600, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:27, time_cost(all): 0:48:19/1 day, 1:42:50, loss=0.59371867778101, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=0.6250817348357761, lr=0.006645934034688654
2023-12-21 15:17:30   INFO  epoch: 0/24, acc_iter=2650, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:34, time_cost(all): 0:49:15/1 day, 1:15:46, loss=0.593522386461667, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=3.555924994949324, lr=0.006845855843048051
2023-12-21 15:18:26   INFO  epoch: 0/24, acc_iter=2700, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:03, time_cost(all): 0:50:11/1 day, 0:22:13, loss=0.593326095142323, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=3.7618403053200273, lr=0.007045777651407448
2023-12-21 15:19:22   INFO  epoch: 0/24, acc_iter=2750, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:27, time_cost(all): 0:51:07/1 day, 0:03:13, loss=0.59312980382298, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=3.5167661074726837, lr=0.007245699459766846
2023-12-21 15:20:18   INFO  epoch: 0/24, acc_iter=2800, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:12:44, time_cost(all): 0:52:03/1 day, 2:23:17, loss=0.592933512503636, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=1.6158844391667804, lr=0.007445621268126243
2023-12-21 15:21:13   INFO  epoch: 0/24, acc_iter=2850, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:11:59, time_cost(all): 0:52:58/1 day, 2:00:28, loss=0.592737221184293, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=1.394739776202864, lr=0.007645543076485638
2023-12-21 15:22:09   INFO  epoch: 0/24, acc_iter=2900, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:04, time_cost(all): 0:53:54/1 day, 0:25:36, loss=0.59254092986495, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=4.47380976449245, lr=0.007845464884845037
2023-12-21 15:23:05   INFO  epoch: 0/24, acc_iter=2950, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:37, time_cost(all): 0:54:50/1 day, 0:55:41, loss=0.592344638545606, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=0.9471151374801932, lr=0.008045386693204435
2023-12-21 15:24:01   INFO  epoch: 0/24, acc_iter=3000, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:18, time_cost(all): 0:55:46/1 day, 1:45:47, loss=0.592148347226263, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=2.53252292866584, lr=0.00824530850156383
2023-12-21 15:24:56   INFO  epoch: 0/24, acc_iter=3050, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:58, time_cost(all): 0:56:41/1 day, 1:07:12, loss=0.591952055906919, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=4.453511366769442, lr=0.008445230309923227
2023-12-21 15:25:52   INFO  epoch: 0/24, acc_iter=3100, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:25, time_cost(all): 0:57:37/1 day, 1:12:03, loss=0.591755764587576, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=0.6343231224661273, lr=0.008645152118282626
2023-12-21 15:26:48   INFO  epoch: 0/24, acc_iter=3150, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:29, time_cost(all): 0:58:33/1 day, 1:29:26, loss=0.591559473268232, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=1.5802047821101086, lr=0.008845073926642022
2023-12-21 15:27:44   INFO  epoch: 0/24, acc_iter=3200, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:49, time_cost(all): 0:59:29/1 day, 0:57:18, loss=0.591363181948889, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=1.3807985703246664, lr=0.00904499573500142
2023-12-21 15:28:39   INFO  epoch: 0/24, acc_iter=3250, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:05:08, time_cost(all): 1:00:24/1 day, 0:37:17, loss=0.591166890629546, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=1.7748372857020505, lr=0.009244917543360816
2023-12-21 15:29:35   INFO  epoch: 0/24, acc_iter=3300, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:53, time_cost(all): 1:01:20/1 day, 0:00:01, loss=0.590970599310202, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=2.1559070521991504, lr=0.009444839351720214
2023-12-21 15:30:31   INFO  epoch: 0/24, acc_iter=3350, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:12, time_cost(all): 1:02:16/1 day, 1:51:53, loss=0.590774307990859, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=4.297879381298222, lr=0.009644761160079611
2023-12-21 15:31:27   INFO  epoch: 0/24, acc_iter=3400, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:15, time_cost(all): 1:03:12/1 day, 0:48:36, loss=0.590578016671515, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=1.4516000104085531, lr=0.009844682968439008
2023-12-21 15:32:23   INFO  epoch: 0/24, acc_iter=3450, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:17, time_cost(all): 1:04:08/1 day, 1:46:24, loss=0.590381725352172, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=3.8450101930499314, lr=0.010044604776798405
2023-12-21 15:33:18   INFO  epoch: 0/24, acc_iter=3500, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 1:05:03/1 day, 1:47:10, loss=0.590185434032828, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=2.8232642659816576, lr=0.010244526585157803
2023-12-21 15:34:14   INFO  epoch: 1/24, acc_iter=3567, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:05:00, time_cost(all): 1:05:59/1 day, 0:33:34, loss=0.589922403664908, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=2.5621200217867086, lr=0.010512421808359394
2023-12-21 15:35:10   INFO  epoch: 1/24, acc_iter=3617, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:01:09, time_cost(all): 1:06:55/1 day, 0:19:05, loss=0.589726112345565, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=1.3719434148597398, lr=0.010712343616718792
2023-12-21 15:36:06   INFO  epoch: 1/24, acc_iter=3667, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:03:17, time_cost(all): 1:07:51/1 day, 2:11:49, loss=0.589529821026221, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=1.889650592234137, lr=0.010912265425078189
2023-12-21 15:37:01   INFO  epoch: 1/24, acc_iter=3717, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:00:00, time_cost(all): 1:08:46/1 day, 2:08:39, loss=0.589333529706878, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=4.751105230511112, lr=0.011112187233437586
2023-12-21 15:37:57   INFO  epoch: 1/24, acc_iter=3767, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/0:57:42, time_cost(all): 1:09:42/1 day, 0:20:33, loss=0.589137238387534, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=2.5457740128935034, lr=0.011312109041796983
2023-12-21 15:38:53   INFO  epoch: 1/24, acc_iter=3817, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:00:48, time_cost(all): 1:10:38/1 day, 1:13:01, loss=0.588940947068191, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=4.209554782364416, lr=0.011512030850156382
2023-12-21 15:39:49   INFO  epoch: 1/24, acc_iter=3867, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:59:25, time_cost(all): 1:11:34/1 day, 1:18:57, loss=0.588744655748847, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=2.1836772573348404, lr=0.01171195265851578
2023-12-21 15:40:44   INFO  epoch: 1/24, acc_iter=3917, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/1:00:29, time_cost(all): 1:12:29/23:58:15, loss=0.588548364429504, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=0.7303344164915526, lr=0.011911874466875177
2023-12-21 15:41:40   INFO  epoch: 1/24, acc_iter=3967, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:56:01, time_cost(all): 1:13:25/1 day, 0:01:02, loss=0.588352073110161, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=4.423952560116243, lr=0.012111796275234572
2023-12-21 15:42:36   INFO  epoch: 1/24, acc_iter=4017, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:56:13, time_cost(all): 1:14:21/1 day, 0:49:58, loss=0.588155781790817, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=2.206733926280635, lr=0.01231171808359397
2023-12-21 15:43:32   INFO  epoch: 1/24, acc_iter=4067, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:52:49, time_cost(all): 1:15:17/1 day, 1:35:50, loss=0.587959490471474, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=2.140441913418955, lr=0.012511639891953367
2023-12-21 15:44:28   INFO  epoch: 1/24, acc_iter=4117, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:52:36, time_cost(all): 1:16:13/1 day, 0:34:13, loss=0.58776319915213, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=0.8642942203066226, lr=0.012711561700312764
2023-12-21 15:45:23   INFO  epoch: 1/24, acc_iter=4167, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:52:44, time_cost(all): 1:17:08/1 day, 1:38:30, loss=0.587566907832787, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=4.46758261991843, lr=0.012911483508672161
2023-12-21 15:46:19   INFO  epoch: 1/24, acc_iter=4217, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:54:29, time_cost(all): 1:18:04/1 day, 0:18:41, loss=0.587370616513443, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=0.5595239830737131, lr=0.01311140531703156
2023-12-21 15:47:15   INFO  epoch: 1/24, acc_iter=4267, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:52:10, time_cost(all): 1:19:00/1 day, 2:03:57, loss=0.5871743251941, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=1.7404569419943479, lr=0.013311327125390956
2023-12-21 15:48:11   INFO  epoch: 1/24, acc_iter=4317, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:49:50, time_cost(all): 1:19:56/1 day, 0:45:38, loss=0.586978033874757, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=2.821748670676109, lr=0.013511248933750353
2023-12-21 15:49:06   INFO  epoch: 1/24, acc_iter=4367, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:50:17, time_cost(all): 1:20:51/23:39:18, loss=0.586781742555413, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=2.329659638420361, lr=0.01371117074210975
2023-12-21 15:50:02   INFO  epoch: 1/24, acc_iter=4417, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:50:31, time_cost(all): 1:21:47/1 day, 1:25:13, loss=0.58658545123607, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=1.793521857139145, lr=0.013911092550469148
2023-12-21 15:50:58   INFO  epoch: 1/24, acc_iter=4467, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:49:35, time_cost(all): 1:22:43/1 day, 0:00:37, loss=0.586389159916726, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=4.326249822079328, lr=0.014111014358828545
2023-12-21 15:51:54   INFO  epoch: 1/24, acc_iter=4517, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:44:39, time_cost(all): 1:23:39/1 day, 1:33:56, loss=0.586192868597383, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=3.948313059871314, lr=0.014310936167187942
2023-12-21 15:52:49   INFO  epoch: 1/24, acc_iter=4567, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:47:36, time_cost(all): 1:24:34/1 day, 0:39:23, loss=0.585996577278039, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=3.3670103427594347, lr=0.014510857975547338
2023-12-21 15:53:45   INFO  epoch: 1/24, acc_iter=4617, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:45:56, time_cost(all): 1:25:30/1 day, 0:27:36, loss=0.585800285958696, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=2.9159918381502337, lr=0.014710779783906737
2023-12-21 15:54:41   INFO  epoch: 1/24, acc_iter=4667, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:45:22, time_cost(all): 1:26:26/1 day, 1:14:10, loss=0.585603994639353, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=4.887857740310338, lr=0.014910701592266134
2023-12-21 15:55:37   INFO  epoch: 1/24, acc_iter=4717, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:45:11, time_cost(all): 1:27:22/1 day, 1:07:59, loss=0.585407703320009, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=1.7233659658353981, lr=0.015110623400625531
2023-12-21 15:56:32   INFO  epoch: 1/24, acc_iter=4767, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:04, time_cost(all): 1:28:17/1 day, 0:39:39, loss=0.585211412000666, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=1.9299003312483594, lr=0.015310545208984928
2023-12-21 15:57:28   INFO  epoch: 1/24, acc_iter=4817, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:40:11, time_cost(all): 1:29:13/1 day, 1:25:29, loss=0.585015120681322, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=2.9292757979414015, lr=0.015510467017344326
2023-12-21 15:58:24   INFO  epoch: 1/24, acc_iter=4867, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:24, time_cost(all): 1:30:09/1 day, 1:14:12, loss=0.584818829361979, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=2.172725481835551, lr=0.015710388825703723
2023-12-21 15:59:20   INFO  epoch: 1/24, acc_iter=4917, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:37:56, time_cost(all): 1:31:05/1 day, 1:25:23, loss=0.584622538042635, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=3.1407977950162316, lr=0.01591031063406312
2023-12-21 16:00:16   INFO  epoch: 1/24, acc_iter=4967, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:37:15, time_cost(all): 1:32:01/1 day, 0:40:30, loss=0.584426246723292, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=4.984717146058213, lr=0.016110232442422517
2023-12-21 16:01:11   INFO  epoch: 1/24, acc_iter=5017, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:36:07, time_cost(all): 1:32:56/23:30:46, loss=0.584229955403949, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=0.9204784817818157, lr=0.016310154250781916
2023-12-21 16:02:07   INFO  epoch: 1/24, acc_iter=5067, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:35:16, time_cost(all): 1:33:52/1 day, 0:51:28, loss=0.584033664084605, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=1.9313818314025228, lr=0.016510076059141312
2023-12-21 16:03:03   INFO  epoch: 1/24, acc_iter=5117, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:36:55, time_cost(all): 1:34:48/1 day, 0:24:27, loss=0.583837372765262, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=2.769401013855236, lr=0.016709997867500707
2023-12-21 16:03:59   INFO  epoch: 1/24, acc_iter=5167, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:36:08, time_cost(all): 1:35:44/1 day, 0:49:02, loss=0.583641081445918, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=2.471150679315608, lr=0.016909919675860103
2023-12-21 16:04:54   INFO  epoch: 1/24, acc_iter=5217, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:32:27, time_cost(all): 1:36:39/1 day, 0:09:44, loss=0.583444790126575, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=4.8897588226417215, lr=0.017109841484219505
2023-12-21 16:05:50   INFO  epoch: 1/24, acc_iter=5267, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:32:32, time_cost(all): 1:37:35/1 day, 1:06:20, loss=0.583248498807231, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=2.327852608243273, lr=0.0173097632925789
2023-12-21 16:06:46   INFO  epoch: 1/24, acc_iter=5317, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:31:41, time_cost(all): 1:38:31/23:21:52, loss=0.583052207487888, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=2.1801972449702784, lr=0.0175096851009383
2023-12-21 16:07:42   INFO  epoch: 1/24, acc_iter=5367, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:29:55, time_cost(all): 1:39:27/1 day, 0:49:35, loss=0.582855916168545, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=3.427561634736573, lr=0.017709606909297695
2023-12-21 16:08:37   INFO  epoch: 1/24, acc_iter=5417, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:19, time_cost(all): 1:40:22/1 day, 1:34:14, loss=0.582659624849201, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=4.778886863102306, lr=0.01790952871765709
2023-12-21 16:09:33   INFO  epoch: 1/24, acc_iter=5467, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:27:46, time_cost(all): 1:41:18/1 day, 0:06:57, loss=0.582463333529858, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=3.131877457682044, lr=0.01810945052601649
2023-12-21 16:10:29   INFO  epoch: 1/24, acc_iter=5517, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:28:58, time_cost(all): 1:42:14/1 day, 1:06:19, loss=0.582267042210514, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=4.116743689676195, lr=0.018309372334375885
2023-12-21 16:11:25   INFO  epoch: 1/24, acc_iter=5567, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:28:30, time_cost(all): 1:43:10/1 day, 1:10:00, loss=0.582070750891171, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.758702774360944, lr=0.018509294142735284
2023-12-21 16:12:21   INFO  epoch: 1/24, acc_iter=5617, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:25:28, time_cost(all): 1:44:06/1 day, 0:04:14, loss=0.581874459571827, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=1.5275908057586096, lr=0.01870921595109468
2023-12-21 16:13:16   INFO  epoch: 1/24, acc_iter=5667, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:26:04, time_cost(all): 1:45:01/1 day, 0:55:52, loss=0.581678168252484, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=3.744819857896469, lr=0.018909137759454075
2023-12-21 16:14:12   INFO  epoch: 1/24, acc_iter=5717, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:34, time_cost(all): 1:45:57/1 day, 0:51:47, loss=0.58148187693314, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=1.3686380729028977, lr=0.019109059567813474
2023-12-21 16:15:08   INFO  epoch: 1/24, acc_iter=5767, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:22:58, time_cost(all): 1:46:53/1 day, 0:21:54, loss=0.581285585613797, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=4.618960586025302, lr=0.01930898137617287
2023-12-21 16:16:04   INFO  epoch: 1/24, acc_iter=5817, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:52, time_cost(all): 1:47:49/1 day, 1:22:27, loss=0.581089294294454, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=1.755051135257228, lr=0.01950890318453227
2023-12-21 16:16:59   INFO  epoch: 1/24, acc_iter=5867, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:24, time_cost(all): 1:48:44/23:33:42, loss=0.58089300297511, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=4.503418795448733, lr=0.019708824992891665
2023-12-21 16:17:55   INFO  epoch: 1/24, acc_iter=5917, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:09, time_cost(all): 1:49:40/1 day, 0:57:24, loss=0.580696711655767, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=4.394086841535298, lr=0.01990874680125106
2023-12-21 16:18:51   INFO  epoch: 1/24, acc_iter=5967, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:31, time_cost(all): 1:50:36/1 day, 0:37:38, loss=0.580500420336423, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=2.198403670370917, lr=0.02010866860961046
2023-12-21 16:19:47   INFO  epoch: 1/24, acc_iter=6017, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:50, time_cost(all): 1:51:32/1 day, 0:14:11, loss=0.58030412901708, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=0.5896940502543715, lr=0.020308590417969858
2023-12-21 16:20:42   INFO  epoch: 1/24, acc_iter=6067, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:39, time_cost(all): 1:52:27/23:10:57, loss=0.580107837697736, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=4.9643835683313595, lr=0.020508512226329257
2023-12-21 16:21:38   INFO  epoch: 1/24, acc_iter=6117, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:28, time_cost(all): 1:53:23/1 day, 0:11:04, loss=0.579911546378393, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=2.315151932934932, lr=0.020708434034688653
2023-12-21 16:22:34   INFO  epoch: 1/24, acc_iter=6167, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:19, time_cost(all): 1:54:19/1 day, 0:34:48, loss=0.57971525505905, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.810186825070943, lr=0.020908355843048048
2023-12-21 16:23:30   INFO  epoch: 1/24, acc_iter=6217, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:14:45, time_cost(all): 1:55:15/1 day, 1:08:25, loss=0.579518963739706, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=1.7897107071831904, lr=0.021108277651407447
2023-12-21 16:24:26   INFO  epoch: 1/24, acc_iter=6267, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:48, time_cost(all): 1:56:11/23:34:13, loss=0.579322672420363, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=2.1077659154702166, lr=0.021308199459766843
2023-12-21 16:25:21   INFO  epoch: 1/24, acc_iter=6317, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:07, time_cost(all): 1:57:06/1 day, 0:13:04, loss=0.579126381101019, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=4.515922451722849, lr=0.02150812126812624
2023-12-21 16:26:17   INFO  epoch: 1/24, acc_iter=6367, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:43, time_cost(all): 1:58:02/23:06:22, loss=0.578930089781676, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=4.437182563237501, lr=0.021708043076485637
2023-12-21 16:27:13   INFO  epoch: 1/24, acc_iter=6417, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:12, time_cost(all): 1:58:58/23:07:05, loss=0.578733798462332, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.17324226455531, lr=0.021907964884845036
2023-12-21 16:28:09   INFO  epoch: 1/24, acc_iter=6467, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:47, time_cost(all): 1:59:54/23:46:29, loss=0.578537507142989, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=3.7227683898333934, lr=0.02210788669320443
2023-12-21 16:29:04   INFO  epoch: 1/24, acc_iter=6517, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:55, time_cost(all): 2:00:49/23:27:41, loss=0.578341215823646, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=4.802981785950422, lr=0.022307808501563827
2023-12-21 16:30:00   INFO  epoch: 1/24, acc_iter=6567, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:42, time_cost(all): 2:01:45/1 day, 0:02:47, loss=0.578144924504302, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=3.4486145382850184, lr=0.022507730309923226
2023-12-21 16:30:56   INFO  epoch: 1/24, acc_iter=6617, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:42, time_cost(all): 2:02:41/23:51:52, loss=0.577948633184959, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=4.343743111059078, lr=0.02270765211828262
2023-12-21 16:31:52   INFO  epoch: 1/24, acc_iter=6667, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:53, time_cost(all): 2:03:37/23:05:15, loss=0.577752341865615, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=3.1883745596663733, lr=0.02290757392664202
2023-12-21 16:32:47   INFO  epoch: 1/24, acc_iter=6717, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:57, time_cost(all): 2:04:32/23:16:04, loss=0.577556050546272, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=4.661820735488989, lr=0.023107495735001416
2023-12-21 16:33:43   INFO  epoch: 1/24, acc_iter=6767, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:53, time_cost(all): 2:05:28/23:24:17, loss=0.577359759226928, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=3.6889713854364987, lr=0.023307417543360815
2023-12-21 16:34:39   INFO  epoch: 1/24, acc_iter=6817, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:51, time_cost(all): 2:06:24/23:52:15, loss=0.577163467907585, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=2.1410229574956516, lr=0.023507339351720214
2023-12-21 16:35:35   INFO  epoch: 1/24, acc_iter=6867, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:01, time_cost(all): 2:07:20/23:28:14, loss=0.576967176588242, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=3.3710534670930237, lr=0.02370726116007961
2023-12-21 16:36:31   INFO  epoch: 1/24, acc_iter=6917, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:14, time_cost(all): 2:08:16/23:43:44, loss=0.576770885268898, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=2.7769951808495605, lr=0.02390718296843901
2023-12-21 16:37:26   INFO  epoch: 1/24, acc_iter=6967, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:16, time_cost(all): 2:09:11/23:44:57, loss=0.576574593949555, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=1.1164589567263887, lr=0.024107104776798404
2023-12-21 16:38:22   INFO  epoch: 1/24, acc_iter=7017, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 2:10:07/23:28:25, loss=0.576378302630211, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=4.828085201720654, lr=0.0243070265851578
2023-12-21 16:39:18   INFO  epoch: 2/24, acc_iter=7084, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:07:27, time_cost(all): 2:11:03/23:33:31, loss=0.576115272262291, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=4.227882215296919, lr=0.024574921808359393
2023-12-21 16:40:14   INFO  epoch: 2/24, acc_iter=7134, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:05:49, time_cost(all): 2:11:59/1 day, 0:33:53, loss=0.575918980942948, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=2.464197701968822, lr=0.02477484361671879
2023-12-21 16:41:09   INFO  epoch: 2/24, acc_iter=7184, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:03:52, time_cost(all): 2:12:54/1 day, 0:23:00, loss=0.575722689623604, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=3.0512839715644358, lr=0.024974765425078187
2023-12-21 16:42:05   INFO  epoch: 2/24, acc_iter=7234, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/0:59:56, time_cost(all): 2:13:50/1 day, 0:35:22, loss=0.575526398304261, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=0.6253331498401324, lr=0.025174687233437583
2023-12-21 16:43:01   INFO  epoch: 2/24, acc_iter=7284, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:00:01, time_cost(all): 2:14:46/22:44:22, loss=0.575330106984917, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=4.090860138139547, lr=0.025374609041796982
2023-12-21 16:43:57   INFO  epoch: 2/24, acc_iter=7334, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:57:23, time_cost(all): 2:15:42/1 day, 0:49:16, loss=0.575133815665574, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=0.7401416599729735, lr=0.02557453085015638
2023-12-21 16:44:52   INFO  epoch: 2/24, acc_iter=7384, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:57:37, time_cost(all): 2:16:37/23:11:22, loss=0.57493752434623, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.8292645445032834, lr=0.025774452658515776
2023-12-21 16:45:48   INFO  epoch: 2/24, acc_iter=7434, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:58:19, time_cost(all): 2:17:33/1 day, 0:28:55, loss=0.574741233026887, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=1.861085802660341, lr=0.025974374466875175
2023-12-21 16:46:44   INFO  epoch: 2/24, acc_iter=7484, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:55:03, time_cost(all): 2:18:29/23:13:31, loss=0.574544941707544, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=2.006576504664976, lr=0.02617429627523457
2023-12-21 16:47:40   INFO  epoch: 2/24, acc_iter=7534, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:56:28, time_cost(all): 2:19:25/1 day, 0:56:56, loss=0.5743486503882, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=1.9783238294515344, lr=0.026374218083593966
2023-12-21 16:48:36   INFO  epoch: 2/24, acc_iter=7584, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:52:35, time_cost(all): 2:20:21/23:36:39, loss=0.574152359068857, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=3.0234879591514483, lr=0.026574139891953365
2023-12-21 16:49:31   INFO  epoch: 2/24, acc_iter=7634, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:32, time_cost(all): 2:21:16/1 day, 0:38:21, loss=0.573956067749513, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.23(1.03), norm=4.880560584502582, lr=0.02677406170031276
2023-12-21 16:50:27   INFO  epoch: 2/24, acc_iter=7684, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:55:01, time_cost(all): 2:22:12/23:13:54, loss=0.57375977643017, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=4.9453868362984155, lr=0.02697398350867216
2023-12-21 16:51:23   INFO  epoch: 2/24, acc_iter=7734, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:53:51, time_cost(all): 2:23:08/1 day, 0:56:46, loss=0.573563485110826, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=2.700812680573869, lr=0.027173905317031555
2023-12-21 16:52:19   INFO  epoch: 2/24, acc_iter=7784, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:48:59, time_cost(all): 2:24:04/23:39:17, loss=0.573367193791483, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=2.2402062041718454, lr=0.027373827125390954
2023-12-21 16:53:14   INFO  epoch: 2/24, acc_iter=7834, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:10, time_cost(all): 2:24:59/23:38:00, loss=0.57317090247214, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=1.2053776507890614, lr=0.02757374893375035
2023-12-21 16:54:10   INFO  epoch: 2/24, acc_iter=7884, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:51:59, time_cost(all): 2:25:55/1 day, 0:17:15, loss=0.572974611152796, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=4.383052127365657, lr=0.027773670742109745
2023-12-21 16:55:06   INFO  epoch: 2/24, acc_iter=7934, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:44, time_cost(all): 2:26:51/22:58:28, loss=0.572778319833453, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=3.193428650067249, lr=0.027973592550469144
2023-12-21 16:56:02   INFO  epoch: 2/24, acc_iter=7984, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:45:39, time_cost(all): 2:27:47/23:56:55, loss=0.572582028514109, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=2.0869028055281538, lr=0.02817351435882854
2023-12-21 16:56:57   INFO  epoch: 2/24, acc_iter=8034, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:48:34, time_cost(all): 2:28:42/1 day, 0:02:55, loss=0.572385737194766, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.8158833866323825, lr=0.02837343616718794
2023-12-21 16:57:53   INFO  epoch: 2/24, acc_iter=8084, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:43:44, time_cost(all): 2:29:38/23:15:11, loss=0.572189445875422, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=4.408974433965726, lr=0.028573357975547338
2023-12-21 16:58:49   INFO  epoch: 2/24, acc_iter=8134, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:45:43, time_cost(all): 2:30:34/1 day, 0:46:21, loss=0.571993154556079, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=2.123706486624968, lr=0.028773279783906733
2023-12-21 16:59:45   INFO  epoch: 2/24, acc_iter=8184, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:44:30, time_cost(all): 2:31:30/23:37:12, loss=0.571796863236736, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=4.79057593646441, lr=0.028973201592266132
2023-12-21 17:00:41   INFO  epoch: 2/24, acc_iter=8234, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:44:18, time_cost(all): 2:32:26/1 day, 0:01:29, loss=0.571600571917392, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=1.7165847504350678, lr=0.029173123400625528
2023-12-21 17:01:36   INFO  epoch: 2/24, acc_iter=8284, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:42:45, time_cost(all): 2:33:21/22:35:22, loss=0.571404280598049, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=1.406532110130568, lr=0.029373045208984927
2023-12-21 17:02:32   INFO  epoch: 2/24, acc_iter=8334, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:42:16, time_cost(all): 2:34:17/23:46:31, loss=0.571207989278705, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=1.7677255442728486, lr=0.029572967017344323
2023-12-21 17:03:28   INFO  epoch: 2/24, acc_iter=8384, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:41:13, time_cost(all): 2:35:13/1 day, 0:24:54, loss=0.571011697959362, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=2.0824129723962104, lr=0.029772888825703718
2023-12-21 17:04:24   INFO  epoch: 2/24, acc_iter=8434, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:41, time_cost(all): 2:36:09/1 day, 0:23:44, loss=0.570815406640018, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=3.758666110812391, lr=0.029972810634063117
2023-12-21 17:05:19   INFO  epoch: 2/24, acc_iter=8484, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:38:34, time_cost(all): 2:37:04/23:30:16, loss=0.570619115320675, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=4.822704765705135, lr=0.029980537189586196
2023-12-21 17:06:15   INFO  epoch: 2/24, acc_iter=8534, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:31, time_cost(all): 2:38:00/22:32:02, loss=0.570422824001331, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=1.488604411181111, lr=0.02995801078864429
2023-12-21 17:07:11   INFO  epoch: 2/24, acc_iter=8584, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:37:56, time_cost(all): 2:38:56/23:42:27, loss=0.570226532681988, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=4.443083500823343, lr=0.029935484387702386
2023-12-21 17:08:07   INFO  epoch: 2/24, acc_iter=8634, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:50, time_cost(all): 2:39:52/22:19:00, loss=0.570030241362645, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=2.5968070376798185, lr=0.029912957986760483
2023-12-21 17:09:02   INFO  epoch: 2/24, acc_iter=8684, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:36:21, time_cost(all): 2:40:47/23:19:03, loss=0.569833950043301, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=4.548078247608049, lr=0.02989043158581858
2023-12-21 17:09:58   INFO  epoch: 2/24, acc_iter=8734, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:38, time_cost(all): 2:41:43/23:23:59, loss=0.569637658723958, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=3.4617353922118133, lr=0.029867905184876677
2023-12-21 17:10:54   INFO  epoch: 2/24, acc_iter=8784, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:32:22, time_cost(all): 2:42:39/23:02:20, loss=0.569441367404614, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=0.9799940223881326, lr=0.02984537878393477
2023-12-21 17:11:50   INFO  epoch: 2/24, acc_iter=8834, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:31:40, time_cost(all): 2:43:35/22:46:58, loss=0.569245076085271, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=1.3151750071989592, lr=0.029822852382992867
2023-12-21 17:12:46   INFO  epoch: 2/24, acc_iter=8884, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:28, time_cost(all): 2:44:31/1 day, 0:02:52, loss=0.569048784765927, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=4.649928842030499, lr=0.029800325982050964
2023-12-21 17:13:41   INFO  epoch: 2/24, acc_iter=8934, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:31:32, time_cost(all): 2:45:26/1 day, 0:29:51, loss=0.568852493446584, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=1.1868023692192509, lr=0.02977779958110906
2023-12-21 17:14:37   INFO  epoch: 2/24, acc_iter=8984, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:27:55, time_cost(all): 2:46:22/23:10:17, loss=0.568656202127241, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=4.020551484477217, lr=0.029755273180167154
2023-12-21 17:15:33   INFO  epoch: 2/24, acc_iter=9034, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:29:06, time_cost(all): 2:47:18/23:30:32, loss=0.568459910807897, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=4.7509785046608854, lr=0.02973274677922525
2023-12-21 17:16:29   INFO  epoch: 2/24, acc_iter=9084, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:38, time_cost(all): 2:48:14/22:13:10, loss=0.568263619488554, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=1.0312564947640253, lr=0.029710220378283348
2023-12-21 17:17:24   INFO  epoch: 2/24, acc_iter=9134, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:25:03, time_cost(all): 2:49:09/23:42:46, loss=0.56806732816921, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=2.4569067116153995, lr=0.029687693977341445
2023-12-21 17:18:20   INFO  epoch: 2/24, acc_iter=9184, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:59, time_cost(all): 2:50:05/23:50:26, loss=0.567871036849867, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=1.5562884877904408, lr=0.02966516757639954
2023-12-21 17:19:16   INFO  epoch: 2/24, acc_iter=9234, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:40, time_cost(all): 2:51:01/23:32:10, loss=0.567674745530523, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=0.7922189062430893, lr=0.029642641175457635
2023-12-21 17:20:12   INFO  epoch: 2/24, acc_iter=9284, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:19, time_cost(all): 2:51:57/22:32:24, loss=0.56747845421118, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=1.17038783657042, lr=0.029620114774515732
2023-12-21 17:21:07   INFO  epoch: 2/24, acc_iter=9334, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:23:00, time_cost(all): 2:52:52/23:40:16, loss=0.567282162891837, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.120308420678471, lr=0.02959758837357383
2023-12-21 17:22:03   INFO  epoch: 2/24, acc_iter=9384, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:51, time_cost(all): 2:53:48/22:22:28, loss=0.567085871572493, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=2.862096933011347, lr=0.029575061972631923
2023-12-21 17:22:59   INFO  epoch: 2/24, acc_iter=9434, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:21:10, time_cost(all): 2:54:44/22:45:44, loss=0.56688958025315, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=3.141556790897215, lr=0.02955253557169002
2023-12-21 17:23:55   INFO  epoch: 2/24, acc_iter=9484, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:20, time_cost(all): 2:55:40/1 day, 0:16:04, loss=0.566693288933806, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=1.7380986539619665, lr=0.029530009170748116
2023-12-21 17:24:50   INFO  epoch: 2/24, acc_iter=9534, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:50, time_cost(all): 2:56:35/23:36:18, loss=0.566496997614463, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=1.4999628162098615, lr=0.029507482769806213
2023-12-21 17:25:46   INFO  epoch: 2/24, acc_iter=9584, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:51, time_cost(all): 2:57:31/23:08:12, loss=0.566300706295119, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=4.314862974912782, lr=0.029484956368864307
2023-12-21 17:26:42   INFO  epoch: 2/24, acc_iter=9634, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:23, time_cost(all): 2:58:27/23:58:33, loss=0.566104414975776, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=0.70911218585881, lr=0.029462429967922404
2023-12-21 17:27:38   INFO  epoch: 2/24, acc_iter=9684, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:03, time_cost(all): 2:59:23/22:59:14, loss=0.565908123656433, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=4.949326261762343, lr=0.0294399035669805
2023-12-21 17:28:34   INFO  epoch: 2/24, acc_iter=9734, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:14:33, time_cost(all): 3:00:19/1 day, 0:07:35, loss=0.565711832337089, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=0.9529010302564785, lr=0.029417377166038598
2023-12-21 17:29:29   INFO  epoch: 2/24, acc_iter=9784, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:16, time_cost(all): 3:01:14/23:08:01, loss=0.565515541017746, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=2.8673175009503034, lr=0.029394850765096694
2023-12-21 17:30:25   INFO  epoch: 2/24, acc_iter=9834, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:29, time_cost(all): 3:02:10/22:50:55, loss=0.565319249698402, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.23(1.03), norm=2.922068286083215, lr=0.029372324364154788
2023-12-21 17:31:21   INFO  epoch: 2/24, acc_iter=9884, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:00, time_cost(all): 3:03:06/22:59:43, loss=0.565122958379059, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=3.1574697815490995, lr=0.029349797963212885
2023-12-21 17:32:17   INFO  epoch: 2/24, acc_iter=9934, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:03, time_cost(all): 3:04:02/23:02:25, loss=0.564926667059715, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=4.289390715609239, lr=0.029327271562270982
2023-12-21 17:33:12   INFO  epoch: 2/24, acc_iter=9984, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:50, time_cost(all): 3:04:57/22:38:20, loss=0.564730375740372, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=2.9179372742756975, lr=0.02930474516132908
2023-12-21 17:34:08   INFO  epoch: 2/24, acc_iter=10034, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:38, time_cost(all): 3:05:53/22:46:24, loss=0.564534084421028, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=1.697621478451485, lr=0.029282218760387172
2023-12-21 17:35:04   INFO  epoch: 2/24, acc_iter=10084, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:27, time_cost(all): 3:06:49/23:44:29, loss=0.564337793101685, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.83(1.03), norm=3.290703324356081, lr=0.02925969235944527
2023-12-21 17:36:00   INFO  epoch: 2/24, acc_iter=10134, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:08:04, time_cost(all): 3:07:45/21:57:11, loss=0.564141501782342, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=3.3181776757032666, lr=0.029237165958503366
2023-12-21 17:36:55   INFO  epoch: 2/24, acc_iter=10184, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:48, time_cost(all): 3:08:40/23:54:22, loss=0.563945210462998, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=3.227008611045484, lr=0.029214639557561463
2023-12-21 17:37:51   INFO  epoch: 2/24, acc_iter=10234, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:06:02, time_cost(all): 3:09:36/22:35:20, loss=0.563748919143655, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=4.307509158634419, lr=0.029192113156619556
2023-12-21 17:38:47   INFO  epoch: 2/24, acc_iter=10284, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:52, time_cost(all): 3:10:32/21:56:08, loss=0.563552627824311, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=3.0159197302085197, lr=0.029169586755677653
2023-12-21 17:39:43   INFO  epoch: 2/24, acc_iter=10334, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:56, time_cost(all): 3:11:28/22:44:50, loss=0.563356336504968, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=2.2290195630107426, lr=0.02914706035473575
2023-12-21 17:40:39   INFO  epoch: 2/24, acc_iter=10384, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:07, time_cost(all): 3:12:24/23:26:16, loss=0.563160045185624, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=3.5154681655903257, lr=0.029124533953793847
2023-12-21 17:41:34   INFO  epoch: 2/24, acc_iter=10434, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:09, time_cost(all): 3:13:19/22:47:44, loss=0.562963753866281, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=0.9512449830062791, lr=0.02910200755285194
2023-12-21 17:42:30   INFO  epoch: 2/24, acc_iter=10484, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:11, time_cost(all): 3:14:15/23:21:06, loss=0.562767462546938, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.569262770270464, lr=0.029079481151910037
2023-12-21 17:43:26   INFO  epoch: 2/24, acc_iter=10534, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 3:15:11/1 day, 0:00:54, loss=0.562571171227594, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=0.6280802713062742, lr=0.029056954750968134
2023-12-21 17:44:22   INFO  epoch: 3/24, acc_iter=10601, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:05:02, time_cost(all): 3:16:07/22:38:24, loss=0.562308140859674, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=0.8628341434926574, lr=0.029026769373705984
2023-12-21 17:45:17   INFO  epoch: 3/24, acc_iter=10651, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:01:04, time_cost(all): 3:17:02/22:12:14, loss=0.562111849540331, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=3.406090961528557, lr=0.02900424297276408
2023-12-21 17:46:13   INFO  epoch: 3/24, acc_iter=10701, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:01:39, time_cost(all): 3:17:58/23:14:05, loss=0.561915558220987, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=3.6942910332127146, lr=0.028981716571822174
2023-12-21 17:47:09   INFO  epoch: 3/24, acc_iter=10751, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:02:17, time_cost(all): 3:18:54/21:57:03, loss=0.561719266901644, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=1.9329671173199001, lr=0.02895919017088027
2023-12-21 17:48:05   INFO  epoch: 3/24, acc_iter=10801, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/0:57:46, time_cost(all): 3:19:50/22:39:42, loss=0.5615229755823, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=3.578745035492209, lr=0.028936663769938368
2023-12-21 17:49:00   INFO  epoch: 3/24, acc_iter=10851, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:57:51, time_cost(all): 3:20:45/22:11:59, loss=0.561326684262957, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=3.817262178387172, lr=0.028914137368996465
2023-12-21 17:49:56   INFO  epoch: 3/24, acc_iter=10901, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:57:26, time_cost(all): 3:21:41/22:16:36, loss=0.561130392943613, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=3.3762677466825535, lr=0.028891610968054558
2023-12-21 17:50:52   INFO  epoch: 3/24, acc_iter=10951, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/1:00:20, time_cost(all): 3:22:37/23:10:22, loss=0.56093410162427, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=1.9861224104089161, lr=0.028869084567112655
2023-12-21 17:51:48   INFO  epoch: 3/24, acc_iter=11001, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:57:03, time_cost(all): 3:23:33/22:56:24, loss=0.560737810304927, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=1.1752063612150252, lr=0.028846558166170752
2023-12-21 17:52:44   INFO  epoch: 3/24, acc_iter=11051, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:54:20, time_cost(all): 3:24:29/23:05:44, loss=0.560541518985583, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=0.8894575261662332, lr=0.02882403176522885
2023-12-21 17:53:39   INFO  epoch: 3/24, acc_iter=11101, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:54:37, time_cost(all): 3:25:24/23:07:20, loss=0.56034522766624, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=4.119814299584979, lr=0.028801505364286942
2023-12-21 17:54:35   INFO  epoch: 3/24, acc_iter=11151, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:56:05, time_cost(all): 3:26:20/23:49:48, loss=0.560148936346896, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=3.378396756808325, lr=0.02877897896334504
2023-12-21 17:55:31   INFO  epoch: 3/24, acc_iter=11201, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:02, time_cost(all): 3:27:16/23:13:55, loss=0.559952645027553, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=1.761791287103858, lr=0.028756452562403136
2023-12-21 17:56:27   INFO  epoch: 3/24, acc_iter=11251, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:52:29, time_cost(all): 3:28:12/22:13:22, loss=0.559756353708209, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=2.75701208563792, lr=0.028733926161461233
2023-12-21 17:57:22   INFO  epoch: 3/24, acc_iter=11301, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:51:38, time_cost(all): 3:29:07/21:42:55, loss=0.559560062388866, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=3.9962759675833226, lr=0.028711399760519327
2023-12-21 17:58:18   INFO  epoch: 3/24, acc_iter=11351, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:23, time_cost(all): 3:30:03/22:04:36, loss=0.559363771069523, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=3.1273765950337, lr=0.028688873359577424
2023-12-21 17:59:14   INFO  epoch: 3/24, acc_iter=11401, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:50:07, time_cost(all): 3:30:59/23:12:48, loss=0.559167479750179, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=2.637853942276175, lr=0.02866634695863552
2023-12-21 18:00:10   INFO  epoch: 3/24, acc_iter=11451, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:50:19, time_cost(all): 3:31:55/22:32:04, loss=0.558971188430836, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=3.9579235981285543, lr=0.028643820557693617
2023-12-21 18:01:05   INFO  epoch: 3/24, acc_iter=11501, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:47:46, time_cost(all): 3:32:50/21:40:40, loss=0.558774897111492, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=4.899133397593272, lr=0.028621294156751714
2023-12-21 18:02:01   INFO  epoch: 3/24, acc_iter=11551, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:46:40, time_cost(all): 3:33:46/22:58:50, loss=0.558578605792149, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=2.391522157745218, lr=0.028598767755809808
2023-12-21 18:02:57   INFO  epoch: 3/24, acc_iter=11601, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:44:30, time_cost(all): 3:34:42/21:52:07, loss=0.558382314472805, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=3.0781908687002995, lr=0.028576241354867905
2023-12-21 18:03:53   INFO  epoch: 3/24, acc_iter=11651, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:44, time_cost(all): 3:35:38/22:35:40, loss=0.558186023153462, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=1.6002605291818037, lr=0.028553714953926
2023-12-21 18:04:49   INFO  epoch: 3/24, acc_iter=11701, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:46:03, time_cost(all): 3:36:34/22:14:34, loss=0.557989731834118, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.484650567155269, lr=0.0285311885529841
2023-12-21 18:05:44   INFO  epoch: 3/24, acc_iter=11751, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:54, time_cost(all): 3:37:29/22:49:22, loss=0.557793440514775, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=4.273339839875131, lr=0.028508662152042192
2023-12-21 18:06:40   INFO  epoch: 3/24, acc_iter=11801, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:41:19, time_cost(all): 3:38:25/22:03:44, loss=0.557597149195432, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=3.7312747349293813, lr=0.02848613575110029
2023-12-21 18:07:36   INFO  epoch: 3/24, acc_iter=11851, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:42:19, time_cost(all): 3:39:21/23:29:14, loss=0.557400857876088, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=2.304558713556335, lr=0.028463609350158386
2023-12-21 18:08:32   INFO  epoch: 3/24, acc_iter=11901, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:27, time_cost(all): 3:40:17/22:41:17, loss=0.557204566556745, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=1.821815624617407, lr=0.028441082949216483
2023-12-21 18:09:27   INFO  epoch: 3/24, acc_iter=11951, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:16, time_cost(all): 3:41:12/22:08:56, loss=0.557008275237401, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=2.131151729524043, lr=0.028418556548274576
2023-12-21 18:10:23   INFO  epoch: 3/24, acc_iter=12001, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:38:42, time_cost(all): 3:42:08/21:46:13, loss=0.556811983918058, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=1.1738949870919042, lr=0.028396030147332673
2023-12-21 18:11:19   INFO  epoch: 3/24, acc_iter=12051, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:38:21, time_cost(all): 3:43:04/22:00:53, loss=0.556615692598714, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=1.4703528805299066, lr=0.02837350374639077
2023-12-21 18:12:15   INFO  epoch: 3/24, acc_iter=12101, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:34:51, time_cost(all): 3:44:00/21:29:40, loss=0.556419401279371, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=4.352399498946156, lr=0.028350977345448867
2023-12-21 18:13:10   INFO  epoch: 3/24, acc_iter=12151, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:52, time_cost(all): 3:44:55/22:18:13, loss=0.556223109960028, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=1.7198940831398533, lr=0.02832845094450696
2023-12-21 18:14:06   INFO  epoch: 3/24, acc_iter=12201, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:33:47, time_cost(all): 3:45:51/23:10:45, loss=0.556026818640684, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=2.387545871586066, lr=0.028305924543565057
2023-12-21 18:15:02   INFO  epoch: 3/24, acc_iter=12251, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:35:04, time_cost(all): 3:46:47/23:27:35, loss=0.555830527321341, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=2.0148299032884864, lr=0.028283398142623154
2023-12-21 18:15:58   INFO  epoch: 3/24, acc_iter=12301, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:31:41, time_cost(all): 3:47:43/22:03:16, loss=0.555634236001997, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=0.8436609593955777, lr=0.02826087174168125
2023-12-21 18:16:54   INFO  epoch: 3/24, acc_iter=12351, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:31:13, time_cost(all): 3:48:39/21:55:34, loss=0.555437944682654, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=2.606914134347923, lr=0.028238345340739344
2023-12-21 18:17:49   INFO  epoch: 3/24, acc_iter=12401, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:08, time_cost(all): 3:49:34/22:35:14, loss=0.55524165336331, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=2.8989498059839405, lr=0.02821581893979744
2023-12-21 18:18:45   INFO  epoch: 3/24, acc_iter=12451, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:30, time_cost(all): 3:50:30/23:10:43, loss=0.555045362043967, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=2.0295227882566587, lr=0.02819329253885554
2023-12-21 18:19:41   INFO  epoch: 3/24, acc_iter=12501, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:36, time_cost(all): 3:51:26/21:25:17, loss=0.554849070724624, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=4.7576275904925875, lr=0.028170766137913635
2023-12-21 18:20:37   INFO  epoch: 3/24, acc_iter=12551, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:28:57, time_cost(all): 3:52:22/22:38:58, loss=0.55465277940528, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=2.4428925554654337, lr=0.028148239736971732
2023-12-21 18:21:32   INFO  epoch: 3/24, acc_iter=12601, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:28:14, time_cost(all): 3:53:17/22:28:44, loss=0.554456488085937, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=1.0983755962544015, lr=0.028125713336029826
2023-12-21 18:22:28   INFO  epoch: 3/24, acc_iter=12651, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:31, time_cost(all): 3:54:13/22:21:14, loss=0.554260196766593, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=0.6643040915753773, lr=0.028103186935087923
2023-12-21 18:23:24   INFO  epoch: 3/24, acc_iter=12701, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:25:29, time_cost(all): 3:55:09/21:17:26, loss=0.55406390544725, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=4.138977182792617, lr=0.02808066053414602
2023-12-21 18:24:20   INFO  epoch: 3/24, acc_iter=12751, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:43, time_cost(all): 3:56:05/21:14:06, loss=0.553867614127906, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=4.107597247794415, lr=0.028058134133204116
2023-12-21 18:25:15   INFO  epoch: 3/24, acc_iter=12801, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:23, time_cost(all): 3:57:00/22:07:42, loss=0.553671322808563, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=1.639653890599078, lr=0.02803560773226221
2023-12-21 18:26:11   INFO  epoch: 3/24, acc_iter=12851, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:10, time_cost(all): 3:57:56/21:13:22, loss=0.55347503148922, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=4.30243359911144, lr=0.028013081331320307
2023-12-21 18:27:07   INFO  epoch: 3/24, acc_iter=12901, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:31, time_cost(all): 3:58:52/21:11:18, loss=0.553278740169876, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=0.9961716563382357, lr=0.027990554930378404
2023-12-21 18:28:03   INFO  epoch: 3/24, acc_iter=12951, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:51, time_cost(all): 3:59:48/21:14:54, loss=0.553082448850533, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=4.418132218146747, lr=0.0279680285294365
2023-12-21 18:28:59   INFO  epoch: 3/24, acc_iter=13001, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:34, time_cost(all): 4:00:44/23:09:49, loss=0.552886157531189, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=4.431522275677989, lr=0.027945502128494594
2023-12-21 18:29:54   INFO  epoch: 3/24, acc_iter=13051, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:40, time_cost(all): 4:01:39/22:46:23, loss=0.552689866211846, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=1.930623350713432, lr=0.02792297572755269
2023-12-21 18:30:50   INFO  epoch: 3/24, acc_iter=13101, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:08, time_cost(all): 4:02:35/22:47:14, loss=0.552493574892502, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=1.625284365967134, lr=0.027900449326610788
2023-12-21 18:31:46   INFO  epoch: 3/24, acc_iter=13151, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:11, time_cost(all): 4:03:31/22:11:10, loss=0.552297283573159, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=4.9682058120760235, lr=0.027877922925668885
2023-12-21 18:32:42   INFO  epoch: 3/24, acc_iter=13201, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:04, time_cost(all): 4:04:27/21:27:11, loss=0.552100992253815, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=2.4379018767458183, lr=0.02785539652472698
2023-12-21 18:33:37   INFO  epoch: 3/24, acc_iter=13251, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:50, time_cost(all): 4:05:22/22:29:38, loss=0.551904700934472, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.850702333595196, lr=0.027832870123785075
2023-12-21 18:34:33   INFO  epoch: 3/24, acc_iter=13301, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:09, time_cost(all): 4:06:18/21:21:01, loss=0.551708409615129, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=0.6768098899993611, lr=0.027810343722843172
2023-12-21 18:35:29   INFO  epoch: 3/24, acc_iter=13351, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:12:41, time_cost(all): 4:07:14/22:01:30, loss=0.551512118295785, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=3.5374508173748564, lr=0.02778781732190127
2023-12-21 18:36:25   INFO  epoch: 3/24, acc_iter=13401, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:26, time_cost(all): 4:08:10/21:43:34, loss=0.551315826976442, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=4.148151817550238, lr=0.027765290920959362
2023-12-21 18:37:20   INFO  epoch: 3/24, acc_iter=13451, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:22, time_cost(all): 4:09:05/22:52:50, loss=0.551119535657098, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=3.685934610280211, lr=0.02774276452001746
2023-12-21 18:38:16   INFO  epoch: 3/24, acc_iter=13501, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:34, time_cost(all): 4:10:01/22:40:32, loss=0.550923244337755, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=3.205915491860527, lr=0.027720238119075556
2023-12-21 18:39:12   INFO  epoch: 3/24, acc_iter=13551, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:39, time_cost(all): 4:10:57/22:15:03, loss=0.550726953018411, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=3.2325280741367406, lr=0.027697711718133653
2023-12-21 18:40:08   INFO  epoch: 3/24, acc_iter=13601, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:51, time_cost(all): 4:11:53/21:25:42, loss=0.550530661699068, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=0.7229080440563789, lr=0.02767518531719175
2023-12-21 18:41:04   INFO  epoch: 3/24, acc_iter=13651, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:08:00, time_cost(all): 4:12:49/21:14:45, loss=0.550334370379725, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=4.405207180001181, lr=0.027652658916249843
2023-12-21 18:41:59   INFO  epoch: 3/24, acc_iter=13701, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:47, time_cost(all): 4:13:44/21:43:10, loss=0.550138079060381, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=4.7870120347759055, lr=0.02763013251530794
2023-12-21 18:42:55   INFO  epoch: 3/24, acc_iter=13751, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:06:08, time_cost(all): 4:14:40/21:21:48, loss=0.549941787741038, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=2.057003730882584, lr=0.027607606114366037
2023-12-21 18:43:51   INFO  epoch: 3/24, acc_iter=13801, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:05:00, time_cost(all): 4:15:36/21:05:58, loss=0.549745496421694, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=4.50076508063392, lr=0.027585079713424134
2023-12-21 18:44:47   INFO  epoch: 3/24, acc_iter=13851, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:59, time_cost(all): 4:16:32/20:59:35, loss=0.549549205102351, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=3.8962680445321656, lr=0.027562553312482228
2023-12-21 18:45:42   INFO  epoch: 3/24, acc_iter=13901, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:10, time_cost(all): 4:17:27/22:16:40, loss=0.549352913783007, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=4.982807028847791, lr=0.027540026911540325
2023-12-21 18:46:38   INFO  epoch: 3/24, acc_iter=13951, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:05, time_cost(all): 4:18:23/22:55:20, loss=0.549156622463664, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=2.5534696085558557, lr=0.02751750051059842
2023-12-21 18:47:34   INFO  epoch: 3/24, acc_iter=14001, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:12, time_cost(all): 4:19:19/22:22:47, loss=0.548960331144321, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=2.0960047848240895, lr=0.02749497410965652
2023-12-21 18:48:30   INFO  epoch: 3/24, acc_iter=14051, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 4:20:15/21:56:38, loss=0.548764039824977, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=1.324062331621878, lr=0.027472447708714612
2023-12-21 18:49:25   INFO  epoch: 4/24, acc_iter=14118, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:06:03, time_cost(all): 4:21:10/21:51:30, loss=0.548501009457057, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=3.888656408235986, lr=0.02744226233145246
2023-12-21 18:50:21   INFO  epoch: 4/24, acc_iter=14168, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:02:47, time_cost(all): 4:22:06/22:50:42, loss=0.548304718137714, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=1.868638679391203, lr=0.027419735930510558
2023-12-21 18:51:17   INFO  epoch: 4/24, acc_iter=14218, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:02:58, time_cost(all): 4:23:02/21:10:16, loss=0.54810842681837, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=4.0000117355994425, lr=0.027397209529568655
2023-12-21 18:52:13   INFO  epoch: 4/24, acc_iter=14268, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:00:06, time_cost(all): 4:23:58/21:55:44, loss=0.547912135499027, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=4.564550893558476, lr=0.027374683128626752
2023-12-21 18:53:08   INFO  epoch: 4/24, acc_iter=14318, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/0:59:46, time_cost(all): 4:24:53/22:34:25, loss=0.547715844179683, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=2.1684482212564262, lr=0.027352156727684845
2023-12-21 18:54:04   INFO  epoch: 4/24, acc_iter=14368, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:58:09, time_cost(all): 4:25:49/21:56:30, loss=0.54751955286034, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=3.119037235349544, lr=0.027329630326742942
2023-12-21 18:55:00   INFO  epoch: 4/24, acc_iter=14418, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:59:35, time_cost(all): 4:26:45/22:22:11, loss=0.547323261540996, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=2.211130005900033, lr=0.02730710392580104
2023-12-21 18:55:56   INFO  epoch: 4/24, acc_iter=14468, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/1:00:45, time_cost(all): 4:27:41/22:24:46, loss=0.547126970221653, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=1.4010448365268457, lr=0.027284577524859136
2023-12-21 18:56:52   INFO  epoch: 4/24, acc_iter=14518, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:58:32, time_cost(all): 4:28:37/21:32:18, loss=0.54693067890231, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=2.494530618340006, lr=0.02726205112391723
2023-12-21 18:57:47   INFO  epoch: 4/24, acc_iter=14568, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:58:12, time_cost(all): 4:29:32/20:59:07, loss=0.546734387582966, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=1.2627052850491989, lr=0.027239524722975327
2023-12-21 18:58:43   INFO  epoch: 4/24, acc_iter=14618, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:54:01, time_cost(all): 4:30:28/21:27:47, loss=0.546538096263623, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=3.4242179876078995, lr=0.027216998322033423
2023-12-21 18:59:39   INFO  epoch: 4/24, acc_iter=14668, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:41, time_cost(all): 4:31:24/20:37:30, loss=0.546341804944279, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=3.1786538551593457, lr=0.02719447192109152
2023-12-21 19:00:35   INFO  epoch: 4/24, acc_iter=14718, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:13, time_cost(all): 4:32:20/22:09:37, loss=0.546145513624936, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.855200803804169, lr=0.027171945520149614
2023-12-21 19:01:30   INFO  epoch: 4/24, acc_iter=14768, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:53:11, time_cost(all): 4:33:15/22:34:33, loss=0.545949222305592, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=4.466295297693355, lr=0.02714941911920771
2023-12-21 19:02:26   INFO  epoch: 4/24, acc_iter=14818, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:53:20, time_cost(all): 4:34:11/21:09:45, loss=0.545752930986249, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=0.9466449207001382, lr=0.027126892718265808
2023-12-21 19:03:22   INFO  epoch: 4/24, acc_iter=14868, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:08, time_cost(all): 4:35:07/21:51:54, loss=0.545556639666905, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=1.722078812829559, lr=0.027104366317323905
2023-12-21 19:04:18   INFO  epoch: 4/24, acc_iter=14918, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:47:15, time_cost(all): 4:36:03/20:41:13, loss=0.545360348347562, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=0.6824303537393475, lr=0.027081839916381998
2023-12-21 19:05:13   INFO  epoch: 4/24, acc_iter=14968, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:47:57, time_cost(all): 4:36:58/21:57:16, loss=0.545164057028219, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=2.9289136728228558, lr=0.027059313515440095
2023-12-21 19:06:09   INFO  epoch: 4/24, acc_iter=15018, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:49:08, time_cost(all): 4:37:54/21:21:10, loss=0.544967765708875, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=0.595971037062638, lr=0.027036787114498192
2023-12-21 19:07:05   INFO  epoch: 4/24, acc_iter=15068, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:44:34, time_cost(all): 4:38:50/21:54:42, loss=0.544771474389532, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=4.357152698349379, lr=0.02701426071355629
2023-12-21 19:08:01   INFO  epoch: 4/24, acc_iter=15118, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:46:59, time_cost(all): 4:39:46/22:19:37, loss=0.544575183070188, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=1.9973330930616624, lr=0.026991734312614382
2023-12-21 19:08:57   INFO  epoch: 4/24, acc_iter=15168, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:21, time_cost(all): 4:40:42/22:15:11, loss=0.544378891750845, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=3.7145170151873788, lr=0.02696920791167248
2023-12-21 19:09:52   INFO  epoch: 4/24, acc_iter=15218, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:44:32, time_cost(all): 4:41:37/21:11:46, loss=0.544182600431501, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=2.4263600033653616, lr=0.026946681510730576
2023-12-21 19:10:48   INFO  epoch: 4/24, acc_iter=15268, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:38, time_cost(all): 4:42:33/22:27:47, loss=0.543986309112158, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=3.1792231621949187, lr=0.026924155109788673
2023-12-21 19:11:44   INFO  epoch: 4/24, acc_iter=15318, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:13, time_cost(all): 4:43:29/21:53:43, loss=0.543790017792815, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=2.553497777986247, lr=0.02690162870884677
2023-12-21 19:12:40   INFO  epoch: 4/24, acc_iter=15368, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:43:02, time_cost(all): 4:44:25/20:55:36, loss=0.543593726473471, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=3.303614799506753, lr=0.026879102307904863
2023-12-21 19:13:35   INFO  epoch: 4/24, acc_iter=15418, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:39:57, time_cost(all): 4:45:20/21:50:50, loss=0.543397435154128, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=1.8937167750756263, lr=0.02685657590696296
2023-12-21 19:14:31   INFO  epoch: 4/24, acc_iter=15468, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:37:28, time_cost(all): 4:46:16/21:22:04, loss=0.543201143834784, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=0.6883293952839246, lr=0.026834049506021057
2023-12-21 19:15:27   INFO  epoch: 4/24, acc_iter=15518, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:38:31, time_cost(all): 4:47:12/22:16:56, loss=0.543004852515441, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=3.683543449905138, lr=0.026811523105079154
2023-12-21 19:16:23   INFO  epoch: 4/24, acc_iter=15568, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:51, time_cost(all): 4:48:08/20:48:45, loss=0.542808561196097, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=3.17982426060011, lr=0.026788996704137247
2023-12-21 19:17:18   INFO  epoch: 4/24, acc_iter=15618, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:36:59, time_cost(all): 4:49:03/21:16:56, loss=0.542612269876754, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=2.665273248329399, lr=0.026766470303195344
2023-12-21 19:18:14   INFO  epoch: 4/24, acc_iter=15668, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:36:34, time_cost(all): 4:49:59/22:09:30, loss=0.542415978557411, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=0.7697142608349402, lr=0.02674394390225344
2023-12-21 19:19:10   INFO  epoch: 4/24, acc_iter=15718, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:34:46, time_cost(all): 4:50:55/20:15:16, loss=0.542219687238067, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=4.469543326348683, lr=0.026721417501311538
2023-12-21 19:20:06   INFO  epoch: 4/24, acc_iter=15768, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:32:54, time_cost(all): 4:51:51/21:28:17, loss=0.542023395918724, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=0.7576960945392338, lr=0.02669889110036963
2023-12-21 19:21:02   INFO  epoch: 4/24, acc_iter=15818, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:32:50, time_cost(all): 4:52:47/20:37:37, loss=0.54182710459938, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=1.4757184207256675, lr=0.02667636469942773
2023-12-21 19:21:57   INFO  epoch: 4/24, acc_iter=15868, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:30:34, time_cost(all): 4:53:42/20:41:50, loss=0.541630813280037, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=4.998153728310692, lr=0.026653838298485825
2023-12-21 19:22:53   INFO  epoch: 4/24, acc_iter=15918, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:30:36, time_cost(all): 4:54:38/20:18:49, loss=0.541434521960693, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=1.1543057623948412, lr=0.026631311897543922
2023-12-21 19:23:49   INFO  epoch: 4/24, acc_iter=15968, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:13, time_cost(all): 4:55:34/21:01:33, loss=0.54123823064135, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=2.796067686882176, lr=0.026608785496602016
2023-12-21 19:24:45   INFO  epoch: 4/24, acc_iter=16018, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:30:14, time_cost(all): 4:56:30/21:30:54, loss=0.541041939322007, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=2.3006258944673923, lr=0.026586259095660113
2023-12-21 19:25:40   INFO  epoch: 4/24, acc_iter=16068, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:02, time_cost(all): 4:57:25/20:39:16, loss=0.540845648002663, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=2.5521162146528185, lr=0.02656373269471821
2023-12-21 19:26:36   INFO  epoch: 4/24, acc_iter=16118, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:07, time_cost(all): 4:58:21/21:36:53, loss=0.54064935668332, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=2.5907862107501134, lr=0.026541206293776307
2023-12-21 19:27:32   INFO  epoch: 4/24, acc_iter=16168, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:25:42, time_cost(all): 4:59:17/21:14:37, loss=0.540453065363976, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=3.6192622938879717, lr=0.0265186798928344
2023-12-21 19:28:28   INFO  epoch: 4/24, acc_iter=16218, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:25:07, time_cost(all): 5:00:13/20:48:29, loss=0.540256774044633, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=4.491319473059177, lr=0.026496153491892497
2023-12-21 19:29:23   INFO  epoch: 4/24, acc_iter=16268, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:02, time_cost(all): 5:01:08/20:52:11, loss=0.540060482725289, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=0.7495610950235385, lr=0.026473627090950594
2023-12-21 19:30:19   INFO  epoch: 4/24, acc_iter=16318, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:22:42, time_cost(all): 5:02:04/21:23:53, loss=0.539864191405946, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=2.0430651290539585, lr=0.02645110069000869
2023-12-21 19:31:15   INFO  epoch: 4/24, acc_iter=16368, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:22, time_cost(all): 5:03:00/21:35:27, loss=0.539667900086602, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=3.110888838572303, lr=0.026428574289066788
2023-12-21 19:32:11   INFO  epoch: 4/24, acc_iter=16418, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:20:55, time_cost(all): 5:03:56/20:43:15, loss=0.539471608767259, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=3.9251962345649787, lr=0.02640604788812488
2023-12-21 19:33:07   INFO  epoch: 4/24, acc_iter=16468, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:19:47, time_cost(all): 5:04:52/21:17:14, loss=0.539275317447916, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=2.896828543488465, lr=0.026383521487182978
2023-12-21 19:34:02   INFO  epoch: 4/24, acc_iter=16518, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:02, time_cost(all): 5:05:47/21:41:34, loss=0.539079026128572, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.16(1.03), norm=3.707371041961522, lr=0.026360995086241075
2023-12-21 19:34:58   INFO  epoch: 4/24, acc_iter=16568, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:40, time_cost(all): 5:06:43/21:38:24, loss=0.538882734809229, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=4.51083916383709, lr=0.026338468685299172
2023-12-21 19:35:54   INFO  epoch: 4/24, acc_iter=16618, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:03, time_cost(all): 5:07:39/20:00:02, loss=0.538686443489885, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=4.927959217334225, lr=0.02631594228435727
2023-12-21 19:36:50   INFO  epoch: 4/24, acc_iter=16668, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:06, time_cost(all): 5:08:35/20:01:41, loss=0.538490152170542, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=1.4857396771119937, lr=0.026293415883415362
2023-12-21 19:37:45   INFO  epoch: 4/24, acc_iter=16718, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:45, time_cost(all): 5:09:30/20:40:30, loss=0.538293860851198, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=4.842290076203138, lr=0.02627088948247346
2023-12-21 19:38:41   INFO  epoch: 4/24, acc_iter=16768, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:04, time_cost(all): 5:10:26/19:57:43, loss=0.538097569531855, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=1.8783369989434002, lr=0.026248363081531556
2023-12-21 19:39:37   INFO  epoch: 4/24, acc_iter=16818, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:52, time_cost(all): 5:11:22/20:39:18, loss=0.537901278212512, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=1.5971273989531174, lr=0.02622583668058965
2023-12-21 19:40:33   INFO  epoch: 4/24, acc_iter=16868, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:41, time_cost(all): 5:12:18/21:10:48, loss=0.537704986893168, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=3.1535065112591307, lr=0.026203310279647746
2023-12-21 19:41:28   INFO  epoch: 4/24, acc_iter=16918, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:37, time_cost(all): 5:13:13/21:42:37, loss=0.537508695573825, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=1.7601286166668788, lr=0.026180783878705843
2023-12-21 19:42:24   INFO  epoch: 4/24, acc_iter=16968, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:52, time_cost(all): 5:14:09/21:55:06, loss=0.537312404254481, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=4.260498172428294, lr=0.02615825747776394
2023-12-21 19:43:20   INFO  epoch: 4/24, acc_iter=17018, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:49, time_cost(all): 5:15:05/20:01:39, loss=0.537116112935138, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=2.0167642015085274, lr=0.026135731076822037
2023-12-21 19:44:16   INFO  epoch: 4/24, acc_iter=17068, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:52, time_cost(all): 5:16:01/21:47:57, loss=0.536919821615794, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=1.0223471675937108, lr=0.02611320467588013
2023-12-21 19:45:12   INFO  epoch: 4/24, acc_iter=17118, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:09:06, time_cost(all): 5:16:57/21:22:52, loss=0.536723530296451, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=4.685105175528012, lr=0.026090678274938228
2023-12-21 19:46:07   INFO  epoch: 4/24, acc_iter=17168, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:30, time_cost(all): 5:17:52/19:52:42, loss=0.536527238977108, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=1.672324510847819, lr=0.026068151873996324
2023-12-21 19:47:03   INFO  epoch: 4/24, acc_iter=17218, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:07:04, time_cost(all): 5:18:48/19:59:25, loss=0.536330947657764, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=2.71876078261593, lr=0.026045625473054418
2023-12-21 19:47:59   INFO  epoch: 4/24, acc_iter=17268, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:06:09, time_cost(all): 5:19:44/20:16:08, loss=0.536134656338421, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.86(1.03), norm=4.171649732793341, lr=0.026023099072112515
2023-12-21 19:48:55   INFO  epoch: 4/24, acc_iter=17318, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:53, time_cost(all): 5:20:40/20:19:00, loss=0.535938365019077, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=2.0543107389038844, lr=0.02600057267117061
2023-12-21 19:49:50   INFO  epoch: 4/24, acc_iter=17368, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:13, time_cost(all): 5:21:35/20:26:58, loss=0.535742073699734, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=4.5829192475495475, lr=0.02597804627022871
2023-12-21 19:50:46   INFO  epoch: 4/24, acc_iter=17418, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:12, time_cost(all): 5:22:31/20:16:51, loss=0.53554578238039, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=0.8769780902356725, lr=0.025955519869286806
2023-12-21 19:51:42   INFO  epoch: 4/24, acc_iter=17468, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:16, time_cost(all): 5:23:27/20:52:48, loss=0.535349491061047, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=3.2194400340963583, lr=0.0259329934683449
2023-12-21 19:52:38   INFO  epoch: 4/24, acc_iter=17518, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:16, time_cost(all): 5:24:23/20:39:18, loss=0.535153199741704, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=1.3671559525870263, lr=0.025910467067402996
2023-12-21 19:53:33   INFO  epoch: 4/24, acc_iter=17568, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 5:25:18/21:12:32, loss=0.53495690842236, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=4.796527515822722, lr=0.025887940666461093
2023-12-21 19:54:29   INFO  epoch: 5/24, acc_iter=17635, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:02:30, time_cost(all): 5:26:14/20:03:06, loss=0.53469387805444, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=3.376169129679783, lr=0.025857755289198942
2023-12-21 19:55:25   INFO  epoch: 5/24, acc_iter=17685, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:03:41, time_cost(all): 5:27:10/19:50:37, loss=0.534497586735096, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=1.655335452001199, lr=0.02583522888825704
2023-12-21 19:56:21   INFO  epoch: 5/24, acc_iter=17735, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:00:37, time_cost(all): 5:28:06/21:35:23, loss=0.534301295415753, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=2.6641037073422, lr=0.025812702487315133
2023-12-21 19:57:17   INFO  epoch: 5/24, acc_iter=17785, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:00:58, time_cost(all): 5:29:02/21:01:24, loss=0.53410500409641, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=4.122595356780516, lr=0.02579017608637323
2023-12-21 19:58:12   INFO  epoch: 5/24, acc_iter=17835, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:02:40, time_cost(all): 5:29:57/20:13:36, loss=0.533908712777066, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=0.8847279242576555, lr=0.025767649685431326
2023-12-21 19:59:08   INFO  epoch: 5/24, acc_iter=17885, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:56:51, time_cost(all): 5:30:53/21:38:33, loss=0.533712421457723, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=1.484958073996952, lr=0.02574512328448942
2023-12-21 20:00:04   INFO  epoch: 5/24, acc_iter=17935, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:56:46, time_cost(all): 5:31:49/21:36:11, loss=0.533516130138379, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=2.4523561982945132, lr=0.025722596883547517
2023-12-21 20:01:00   INFO  epoch: 5/24, acc_iter=17985, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:57:26, time_cost(all): 5:32:45/21:02:17, loss=0.533319838819036, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=3.499323547864128, lr=0.025700070482605614
2023-12-21 20:01:55   INFO  epoch: 5/24, acc_iter=18035, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:58:04, time_cost(all): 5:33:40/19:54:24, loss=0.533123547499692, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=0.5901969980619872, lr=0.02567754408166371
2023-12-21 20:02:51   INFO  epoch: 5/24, acc_iter=18085, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:54:28, time_cost(all): 5:34:36/20:01:46, loss=0.532927256180349, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=4.595114483637979, lr=0.025655017680721807
2023-12-21 20:03:47   INFO  epoch: 5/24, acc_iter=18135, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:55:40, time_cost(all): 5:35:32/20:19:50, loss=0.532730964861005, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=3.5077772784069805, lr=0.0256324912797799
2023-12-21 20:04:43   INFO  epoch: 5/24, acc_iter=18185, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:39, time_cost(all): 5:36:28/20:51:02, loss=0.532534673541662, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=2.715148701441664, lr=0.025609964878837998
2023-12-21 20:05:38   INFO  epoch: 5/24, acc_iter=18235, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:54:02, time_cost(all): 5:37:23/20:15:35, loss=0.532338382222319, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=2.779639667004524, lr=0.025587438477896095
2023-12-21 20:06:34   INFO  epoch: 5/24, acc_iter=18285, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:52:22, time_cost(all): 5:38:19/21:04:38, loss=0.532142090902975, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.02(1.03), norm=3.0097874472844723, lr=0.02556491207695419
2023-12-21 20:07:30   INFO  epoch: 5/24, acc_iter=18335, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:51:46, time_cost(all): 5:39:15/19:52:33, loss=0.531945799583632, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=2.140972918201584, lr=0.02554238567601229
2023-12-21 20:08:26   INFO  epoch: 5/24, acc_iter=18385, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:27, time_cost(all): 5:40:11/21:02:26, loss=0.531749508264288, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=0.8383720491281057, lr=0.025519859275070382
2023-12-21 20:09:22   INFO  epoch: 5/24, acc_iter=18435, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:49:00, time_cost(all): 5:41:07/21:28:32, loss=0.531553216944945, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=3.5018472119137813, lr=0.02549733287412848
2023-12-21 20:10:17   INFO  epoch: 5/24, acc_iter=18485, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:25, time_cost(all): 5:42:02/19:43:28, loss=0.531356925625601, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=3.5739334790740074, lr=0.025474806473186576
2023-12-21 20:11:13   INFO  epoch: 5/24, acc_iter=18535, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:47:20, time_cost(all): 5:42:58/20:49:33, loss=0.531160634306258, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=1.3590777801446106, lr=0.02545228007224467
2023-12-21 20:12:09   INFO  epoch: 5/24, acc_iter=18585, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:47:20, time_cost(all): 5:43:54/19:46:40, loss=0.530964342986915, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=3.1005104443548617, lr=0.025429753671302766
2023-12-21 20:13:05   INFO  epoch: 5/24, acc_iter=18635, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:46:08, time_cost(all): 5:44:50/20:48:29, loss=0.530768051667571, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=1.821352603520493, lr=0.025407227270360863
2023-12-21 20:14:00   INFO  epoch: 5/24, acc_iter=18685, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:46:09, time_cost(all): 5:45:45/20:28:01, loss=0.530571760348228, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=0.8571573649331574, lr=0.02538470086941896
2023-12-21 20:14:56   INFO  epoch: 5/24, acc_iter=18735, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:41:50, time_cost(all): 5:46:41/20:59:21, loss=0.530375469028884, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=2.8378817313593196, lr=0.025362174468477057
2023-12-21 20:15:52   INFO  epoch: 5/24, acc_iter=18785, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:01, time_cost(all): 5:47:37/19:23:32, loss=0.530179177709541, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=0.8258793960359523, lr=0.02533964806753515
2023-12-21 20:16:48   INFO  epoch: 5/24, acc_iter=18835, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:42:15, time_cost(all): 5:48:33/20:07:07, loss=0.529982886390197, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=3.6300169124014325, lr=0.025317121666593247
2023-12-21 20:17:43   INFO  epoch: 5/24, acc_iter=18885, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:41:24, time_cost(all): 5:49:28/20:52:00, loss=0.529786595070854, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=0.7125548808406332, lr=0.025294595265651344
2023-12-21 20:18:39   INFO  epoch: 5/24, acc_iter=18935, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:39:54, time_cost(all): 5:50:24/20:00:52, loss=0.529590303751511, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=4.735056284338731, lr=0.025272068864709438
2023-12-21 20:19:35   INFO  epoch: 5/24, acc_iter=18985, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:31, time_cost(all): 5:51:20/21:13:00, loss=0.529394012432167, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=4.315114234022876, lr=0.025249542463767535
2023-12-21 20:20:31   INFO  epoch: 5/24, acc_iter=19035, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:37:00, time_cost(all): 5:52:16/20:23:35, loss=0.529197721112824, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=3.805924308341683, lr=0.02522701606282563
2023-12-21 20:21:26   INFO  epoch: 5/24, acc_iter=19085, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:21, time_cost(all): 5:53:11/20:50:22, loss=0.52900142979348, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=1.7407174753571535, lr=0.02520448966188373
2023-12-21 20:22:22   INFO  epoch: 5/24, acc_iter=19135, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:35:43, time_cost(all): 5:54:07/19:58:30, loss=0.528805138474137, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=3.5501875193934063, lr=0.025181963260941825
2023-12-21 20:23:18   INFO  epoch: 5/24, acc_iter=19185, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:27, time_cost(all): 5:55:03/20:43:35, loss=0.528608847154793, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=3.5057081599176514, lr=0.02515943685999992
2023-12-21 20:24:14   INFO  epoch: 5/24, acc_iter=19235, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:36:15, time_cost(all): 5:55:59/20:45:38, loss=0.52841255583545, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=1.0877296273059958, lr=0.025136910459058016
2023-12-21 20:25:10   INFO  epoch: 5/24, acc_iter=19285, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:32:31, time_cost(all): 5:56:55/19:38:50, loss=0.528216264516107, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=1.6395109858965666, lr=0.025114384058116113
2023-12-21 20:26:05   INFO  epoch: 5/24, acc_iter=19335, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:31:41, time_cost(all): 5:57:50/19:35:10, loss=0.528019973196763, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=4.626330594499123, lr=0.025091857657174206
2023-12-21 20:27:01   INFO  epoch: 5/24, acc_iter=19385, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:30:40, time_cost(all): 5:58:46/20:39:30, loss=0.52782368187742, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=1.1773964839079853, lr=0.025069331256232303
2023-12-21 20:27:57   INFO  epoch: 5/24, acc_iter=19435, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:30:11, time_cost(all): 5:59:42/20:18:53, loss=0.527627390558076, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=2.735108559152374, lr=0.0250468048552904
2023-12-21 20:28:53   INFO  epoch: 5/24, acc_iter=19485, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:28:50, time_cost(all): 6:00:38/20:35:21, loss=0.527431099238733, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=4.422700582455335, lr=0.025024278454348497
2023-12-21 20:29:48   INFO  epoch: 5/24, acc_iter=19535, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:27:50, time_cost(all): 6:01:33/19:09:54, loss=0.527234807919389, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=1.7428375103296823, lr=0.025001752053406594
2023-12-21 20:30:44   INFO  epoch: 5/24, acc_iter=19585, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:27, time_cost(all): 6:02:29/20:57:33, loss=0.527038516600046, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=2.5785836065400316, lr=0.024979225652464687
2023-12-21 20:31:40   INFO  epoch: 5/24, acc_iter=19635, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:24, time_cost(all): 6:03:25/20:06:10, loss=0.526842225280703, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=3.3416825482118475, lr=0.024956699251522784
2023-12-21 20:32:36   INFO  epoch: 5/24, acc_iter=19685, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:14, time_cost(all): 6:04:21/20:55:27, loss=0.526645933961359, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=1.4183342512157235, lr=0.02493417285058088
2023-12-21 20:33:31   INFO  epoch: 5/24, acc_iter=19735, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:38, time_cost(all): 6:05:16/19:21:38, loss=0.526449642642016, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=4.202445866403391, lr=0.024911646449638978
2023-12-21 20:34:27   INFO  epoch: 5/24, acc_iter=19785, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:08, time_cost(all): 6:06:12/19:47:04, loss=0.526253351322672, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=3.234386278901442, lr=0.024889120048697075
2023-12-21 20:35:23   INFO  epoch: 5/24, acc_iter=19835, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:22:36, time_cost(all): 6:07:08/19:03:39, loss=0.526057060003329, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=4.5068207791731245, lr=0.02486659364775517
2023-12-21 20:36:19   INFO  epoch: 5/24, acc_iter=19885, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:23:35, time_cost(all): 6:08:04/20:10:22, loss=0.525860768683985, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=3.975387435526235, lr=0.024844067246813265
2023-12-21 20:37:15   INFO  epoch: 5/24, acc_iter=19935, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:22:23, time_cost(all): 6:09:00/20:35:07, loss=0.525664477364642, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.9(1.03), norm=0.5613198234713813, lr=0.024821540845871362
2023-12-21 20:38:10   INFO  epoch: 5/24, acc_iter=19985, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:16, time_cost(all): 6:09:55/20:32:15, loss=0.525468186045299, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=1.2357501302369727, lr=0.024799014444929456
2023-12-21 20:39:06   INFO  epoch: 5/24, acc_iter=20035, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:49, time_cost(all): 6:10:51/20:00:59, loss=0.525271894725955, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=0.9031048744409762, lr=0.024776488043987552
2023-12-21 20:40:02   INFO  epoch: 5/24, acc_iter=20085, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:09, time_cost(all): 6:11:47/19:12:01, loss=0.525075603406612, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=4.880701256315141, lr=0.02475396164304565
2023-12-21 20:40:58   INFO  epoch: 5/24, acc_iter=20135, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:38, time_cost(all): 6:12:43/19:08:15, loss=0.524879312087268, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=4.117927590814203, lr=0.024731435242103746
2023-12-21 20:41:53   INFO  epoch: 5/24, acc_iter=20185, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:27, time_cost(all): 6:13:38/19:31:20, loss=0.524683020767925, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=3.325837194512567, lr=0.024708908841161843
2023-12-21 20:42:49   INFO  epoch: 5/24, acc_iter=20235, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:49, time_cost(all): 6:14:34/19:15:45, loss=0.524486729448581, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=2.65875588214789, lr=0.024686382440219937
2023-12-21 20:43:45   INFO  epoch: 5/24, acc_iter=20285, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:29, time_cost(all): 6:15:30/19:03:10, loss=0.524290438129238, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=4.054842809145878, lr=0.024663856039278034
2023-12-21 20:44:41   INFO  epoch: 5/24, acc_iter=20335, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:38, time_cost(all): 6:16:26/19:57:28, loss=0.524094146809895, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.2325265122954683, lr=0.02464132963833613
2023-12-21 20:45:36   INFO  epoch: 5/24, acc_iter=20385, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:04, time_cost(all): 6:17:21/19:00:22, loss=0.523897855490551, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=3.0819522679851894, lr=0.024618803237394224
2023-12-21 20:46:32   INFO  epoch: 5/24, acc_iter=20435, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:11:47, time_cost(all): 6:18:17/20:10:17, loss=0.523701564171208, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=4.642378419375833, lr=0.02459627683645232
2023-12-21 20:47:28   INFO  epoch: 5/24, acc_iter=20485, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:38, time_cost(all): 6:19:13/19:20:28, loss=0.523505272851864, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=0.9124612492625437, lr=0.024573750435510418
2023-12-21 20:48:24   INFO  epoch: 5/24, acc_iter=20535, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:32, time_cost(all): 6:20:09/20:48:05, loss=0.523308981532521, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.2896817646084169, lr=0.024551224034568515
2023-12-21 20:49:20   INFO  epoch: 5/24, acc_iter=20585, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:40, time_cost(all): 6:21:05/20:44:24, loss=0.523112690213177, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=1.9692499458309518, lr=0.02452869763362661
2023-12-21 20:50:15   INFO  epoch: 5/24, acc_iter=20635, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:33, time_cost(all): 6:22:00/20:28:04, loss=0.522916398893834, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=4.50065209098566, lr=0.024506171232684705
2023-12-21 20:51:11   INFO  epoch: 5/24, acc_iter=20685, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:54, time_cost(all): 6:22:56/19:06:04, loss=0.52272010757449, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=3.068020205948225, lr=0.024483644831742802
2023-12-21 20:52:07   INFO  epoch: 5/24, acc_iter=20735, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:52, time_cost(all): 6:23:52/19:09:48, loss=0.522523816255147, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=2.4943289972926097, lr=0.0244611184308009
2023-12-21 20:53:03   INFO  epoch: 5/24, acc_iter=20785, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:55, time_cost(all): 6:24:48/19:50:08, loss=0.522327524935804, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=2.2736820820583263, lr=0.024438592029858996
2023-12-21 20:53:58   INFO  epoch: 5/24, acc_iter=20835, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:47, time_cost(all): 6:25:43/20:02:44, loss=0.52213123361646, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=3.2527677399572323, lr=0.024416065628917093
2023-12-21 20:54:54   INFO  epoch: 5/24, acc_iter=20885, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:58, time_cost(all): 6:26:39/20:23:49, loss=0.521934942297117, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=0.6910314956857447, lr=0.024393539227975186
2023-12-21 20:55:50   INFO  epoch: 5/24, acc_iter=20935, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:02:58, time_cost(all): 6:27:35/19:25:25, loss=0.521738650977773, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=4.3481904344530955, lr=0.024371012827033283
2023-12-21 20:56:46   INFO  epoch: 5/24, acc_iter=20985, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:15, time_cost(all): 6:28:31/19:27:21, loss=0.52154235965843, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=2.3356111254150953, lr=0.02434848642609138
2023-12-21 20:57:41   INFO  epoch: 5/24, acc_iter=21035, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:16, time_cost(all): 6:29:26/18:54:27, loss=0.521346068339086, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=4.844949794091988, lr=0.024325960025149477
2023-12-21 20:58:37   INFO  epoch: 5/24, acc_iter=21085, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 6:30:22/19:13:58, loss=0.521149777019743, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=3.313616400805194, lr=0.024303433624207574
2023-12-21 20:59:33   INFO  epoch: 6/24, acc_iter=21152, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:01:48, time_cost(all): 6:31:18/20:06:38, loss=0.520886746651823, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=0.6930222878478747, lr=0.02427324824694542
2023-12-21 21:00:29   INFO  epoch: 6/24, acc_iter=21202, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:02:32, time_cost(all): 6:32:14/19:15:55, loss=0.520690455332479, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=3.863768149536988, lr=0.024250721846003517
2023-12-21 21:01:25   INFO  epoch: 6/24, acc_iter=21252, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/0:59:59, time_cost(all): 6:33:10/19:54:52, loss=0.520494164013136, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=2.059731080005977, lr=0.024228195445061614
2023-12-21 21:02:20   INFO  epoch: 6/24, acc_iter=21302, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:01:28, time_cost(all): 6:34:05/19:15:19, loss=0.520297872693792, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=4.547779255172512, lr=0.024205669044119707
2023-12-21 21:03:16   INFO  epoch: 6/24, acc_iter=21352, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:00:29, time_cost(all): 6:35:01/19:00:43, loss=0.520101581374449, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=2.9202178479347394, lr=0.024183142643177804
2023-12-21 21:04:12   INFO  epoch: 6/24, acc_iter=21402, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:02:07, time_cost(all): 6:35:57/19:39:52, loss=0.519905290055106, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=2.0514003927649362, lr=0.0241606162422359
2023-12-21 21:05:08   INFO  epoch: 6/24, acc_iter=21452, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/1:00:13, time_cost(all): 6:36:53/20:20:13, loss=0.519708998735762, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=4.530384922897143, lr=0.024138089841293998
2023-12-21 21:06:03   INFO  epoch: 6/24, acc_iter=21502, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:58:37, time_cost(all): 6:37:48/19:17:32, loss=0.519512707416419, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.9671368872343047, lr=0.024115563440352095
2023-12-21 21:06:59   INFO  epoch: 6/24, acc_iter=21552, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:57:49, time_cost(all): 6:38:44/19:28:06, loss=0.519316416097075, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=1.5476060957623694, lr=0.024093037039410188
2023-12-21 21:07:55   INFO  epoch: 6/24, acc_iter=21602, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:55:47, time_cost(all): 6:39:40/19:14:07, loss=0.519120124777732, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=4.555462779591861, lr=0.024070510638468285
2023-12-21 21:08:51   INFO  epoch: 6/24, acc_iter=21652, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:54:01, time_cost(all): 6:40:36/19:02:51, loss=0.518923833458388, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=4.324647983746316, lr=0.024047984237526382
2023-12-21 21:09:46   INFO  epoch: 6/24, acc_iter=21702, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:55:10, time_cost(all): 6:41:31/18:31:59, loss=0.518727542139045, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=0.7028915923358844, lr=0.024025457836584475
2023-12-21 21:10:42   INFO  epoch: 6/24, acc_iter=21752, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:50:46, time_cost(all): 6:42:27/20:20:36, loss=0.518531250819702, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.890250273965164, lr=0.024002931435642572
2023-12-21 21:11:38   INFO  epoch: 6/24, acc_iter=21802, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:54:06, time_cost(all): 6:43:23/18:31:59, loss=0.518334959500358, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=2.291184688426693, lr=0.02398040503470067
2023-12-21 21:12:34   INFO  epoch: 6/24, acc_iter=21852, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:49:02, time_cost(all): 6:44:19/18:44:43, loss=0.518138668181015, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=1.9282493370600884, lr=0.023957878633758766
2023-12-21 21:13:30   INFO  epoch: 6/24, acc_iter=21902, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:51:10, time_cost(all): 6:45:15/18:41:28, loss=0.517942376861671, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=3.7059279455765077, lr=0.023935352232816863
2023-12-21 21:14:25   INFO  epoch: 6/24, acc_iter=21952, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:48:16, time_cost(all): 6:46:10/19:57:24, loss=0.517746085542328, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=0.6365482132429885, lr=0.023912825831874956
2023-12-21 21:15:21   INFO  epoch: 6/24, acc_iter=22002, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:52, time_cost(all): 6:47:06/20:15:22, loss=0.517549794222984, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=3.8798583445725803, lr=0.023890299430933053
2023-12-21 21:16:17   INFO  epoch: 6/24, acc_iter=22052, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:45:25, time_cost(all): 6:48:02/19:30:41, loss=0.517353502903641, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=2.1482805934641367, lr=0.02386777302999115
2023-12-21 21:17:13   INFO  epoch: 6/24, acc_iter=22102, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:48:01, time_cost(all): 6:48:58/19:51:05, loss=0.517157211584298, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=3.8616467583287277, lr=0.023845246629049247
2023-12-21 21:18:08   INFO  epoch: 6/24, acc_iter=22152, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:47:35, time_cost(all): 6:49:53/19:53:18, loss=0.516960920264954, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=3.8229111088721854, lr=0.023822720228107344
2023-12-21 21:19:04   INFO  epoch: 6/24, acc_iter=22202, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:42:58, time_cost(all): 6:50:49/18:30:12, loss=0.516764628945611, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=0.7839665951631134, lr=0.023800193827165438
2023-12-21 21:20:00   INFO  epoch: 6/24, acc_iter=22252, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:45:02, time_cost(all): 6:51:45/19:04:22, loss=0.516568337626267, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=0.9488261665635904, lr=0.023777667426223534
2023-12-21 21:20:56   INFO  epoch: 6/24, acc_iter=22302, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:38, time_cost(all): 6:52:41/19:27:39, loss=0.516372046306924, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=1.6943399987298857, lr=0.02375514102528163
2023-12-21 21:21:51   INFO  epoch: 6/24, acc_iter=22352, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:40:45, time_cost(all): 6:53:36/20:04:44, loss=0.51617575498758, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=4.987104846175495, lr=0.023732614624339725
2023-12-21 21:22:47   INFO  epoch: 6/24, acc_iter=22402, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:39:20, time_cost(all): 6:54:32/19:12:38, loss=0.515979463668237, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=1.5895710674756711, lr=0.023710088223397822
2023-12-21 21:23:43   INFO  epoch: 6/24, acc_iter=22452, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:42:14, time_cost(all): 6:55:28/19:51:30, loss=0.515783172348893, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=3.7746515418706466, lr=0.02368756182245592
2023-12-21 21:24:39   INFO  epoch: 6/24, acc_iter=22502, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:19, time_cost(all): 6:56:24/18:33:44, loss=0.51558688102955, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=0.5465020345665851, lr=0.023665035421514016
2023-12-21 21:25:35   INFO  epoch: 6/24, acc_iter=22552, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:37:03, time_cost(all): 6:57:20/19:25:27, loss=0.515390589710207, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=0.692545656218386, lr=0.023642509020572113
2023-12-21 21:26:30   INFO  epoch: 6/24, acc_iter=22602, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:39:08, time_cost(all): 6:58:15/19:49:25, loss=0.515194298390863, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=0.814622065554732, lr=0.023619982619630206
2023-12-21 21:27:26   INFO  epoch: 6/24, acc_iter=22652, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:37:38, time_cost(all): 6:59:11/19:41:50, loss=0.51499800707152, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=4.031800984984722, lr=0.023597456218688303
2023-12-21 21:28:22   INFO  epoch: 6/24, acc_iter=22702, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:37:12, time_cost(all): 7:00:07/19:57:30, loss=0.514801715752176, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=4.981438841111299, lr=0.0235749298177464
2023-12-21 21:29:18   INFO  epoch: 6/24, acc_iter=22752, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:34:13, time_cost(all): 7:01:03/18:22:59, loss=0.514605424432833, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=4.152674999310635, lr=0.023552403416804493
2023-12-21 21:30:13   INFO  epoch: 6/24, acc_iter=22802, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:11, time_cost(all): 7:01:58/19:19:50, loss=0.514409133113489, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=3.1630016530630427, lr=0.02352987701586259
2023-12-21 21:31:09   INFO  epoch: 6/24, acc_iter=22852, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:48, time_cost(all): 7:02:54/19:32:46, loss=0.514212841794146, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=4.958371644904714, lr=0.023507350614920687
2023-12-21 21:32:05   INFO  epoch: 6/24, acc_iter=22902, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:31:58, time_cost(all): 7:03:50/18:24:39, loss=0.514016550474803, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=3.311871208012371, lr=0.023484824213978784
2023-12-21 21:33:01   INFO  epoch: 6/24, acc_iter=22952, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:30:52, time_cost(all): 7:04:46/18:23:07, loss=0.513820259155459, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=3.087732944522789, lr=0.02346229781303688
2023-12-21 21:33:56   INFO  epoch: 6/24, acc_iter=23002, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:45, time_cost(all): 7:05:41/18:07:24, loss=0.513623967836116, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=3.160779606002814, lr=0.023439771412094974
2023-12-21 21:34:52   INFO  epoch: 6/24, acc_iter=23052, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:57, time_cost(all): 7:06:37/19:37:18, loss=0.513427676516772, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=4.965034880576765, lr=0.02341724501115307
2023-12-21 21:35:48   INFO  epoch: 6/24, acc_iter=23102, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:29:31, time_cost(all): 7:07:33/19:23:28, loss=0.513231385197429, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=4.148126384402738, lr=0.023394718610211168
2023-12-21 21:36:44   INFO  epoch: 6/24, acc_iter=23152, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:07, time_cost(all): 7:08:29/18:22:24, loss=0.513035093878085, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=0.7467512685902646, lr=0.023372192209269265
2023-12-21 21:37:40   INFO  epoch: 6/24, acc_iter=23202, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:01, time_cost(all): 7:09:25/19:02:40, loss=0.512838802558742, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=4.637846606555406, lr=0.023349665808327362
2023-12-21 21:38:35   INFO  epoch: 6/24, acc_iter=23252, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:46, time_cost(all): 7:10:20/18:41:06, loss=0.512642511239399, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=2.7629978868243454, lr=0.023327139407385455
2023-12-21 21:39:31   INFO  epoch: 6/24, acc_iter=23302, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:24, time_cost(all): 7:11:16/18:09:02, loss=0.512446219920055, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=4.227743963154141, lr=0.023304613006443552
2023-12-21 21:40:27   INFO  epoch: 6/24, acc_iter=23352, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:22:38, time_cost(all): 7:12:12/19:44:56, loss=0.512249928600712, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=4.759374188881723, lr=0.02328208660550165
2023-12-21 21:41:23   INFO  epoch: 6/24, acc_iter=23402, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:06, time_cost(all): 7:13:08/19:38:11, loss=0.512053637281368, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=1.4192862532946289, lr=0.023259560204559743
2023-12-21 21:42:18   INFO  epoch: 6/24, acc_iter=23452, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:20:56, time_cost(all): 7:14:03/18:49:01, loss=0.511857345962025, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.531793731657577, lr=0.023237033803617843
2023-12-21 21:43:14   INFO  epoch: 6/24, acc_iter=23502, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:34, time_cost(all): 7:14:59/18:06:25, loss=0.511661054642681, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=4.501673921573133, lr=0.023214507402675937
2023-12-21 21:44:10   INFO  epoch: 6/24, acc_iter=23552, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:05, time_cost(all): 7:15:55/18:51:23, loss=0.511464763323338, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=2.959971797806623, lr=0.023191981001734033
2023-12-21 21:45:06   INFO  epoch: 6/24, acc_iter=23602, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:14, time_cost(all): 7:16:51/18:25:13, loss=0.511268472003995, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=4.355837134676884, lr=0.02316945460079213
2023-12-21 21:46:01   INFO  epoch: 6/24, acc_iter=23652, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:09, time_cost(all): 7:17:46/17:59:16, loss=0.511072180684651, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=4.056500119905663, lr=0.023146928199850224
2023-12-21 21:46:57   INFO  epoch: 6/24, acc_iter=23702, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:25, time_cost(all): 7:18:42/18:12:22, loss=0.510875889365308, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=0.6409131838854476, lr=0.02312440179890832
2023-12-21 21:47:53   INFO  epoch: 6/24, acc_iter=23752, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:20, time_cost(all): 7:19:38/18:28:15, loss=0.510679598045964, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=4.601220371994091, lr=0.023101875397966418
2023-12-21 21:48:49   INFO  epoch: 6/24, acc_iter=23802, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:14, time_cost(all): 7:20:34/19:12:30, loss=0.510483306726621, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=2.7089130361808507, lr=0.02307934899702451
2023-12-21 21:49:44   INFO  epoch: 6/24, acc_iter=23852, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:31, time_cost(all): 7:21:29/18:38:06, loss=0.510287015407277, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=3.4348236779900283, lr=0.02305682259608261
2023-12-21 21:50:40   INFO  epoch: 6/24, acc_iter=23902, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:42, time_cost(all): 7:22:25/19:39:25, loss=0.510090724087934, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=1.6576260751621317, lr=0.023034296195140705
2023-12-21 21:51:36   INFO  epoch: 6/24, acc_iter=23952, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:24, time_cost(all): 7:23:21/18:20:03, loss=0.509894432768591, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=1.2606658194323928, lr=0.023011769794198802
2023-12-21 21:52:32   INFO  epoch: 6/24, acc_iter=24002, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:39, time_cost(all): 7:24:17/18:35:13, loss=0.509698141449247, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=3.268995637527826, lr=0.0229892433932569
2023-12-21 21:53:28   INFO  epoch: 6/24, acc_iter=24052, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:38, time_cost(all): 7:25:13/19:17:54, loss=0.509501850129904, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.83(1.03), norm=0.897370912578046, lr=0.022966716992314992
2023-12-21 21:54:23   INFO  epoch: 6/24, acc_iter=24102, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:14, time_cost(all): 7:26:08/18:47:28, loss=0.50930555881056, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=1.9869163124088165, lr=0.02294419059137309
2023-12-21 21:55:19   INFO  epoch: 6/24, acc_iter=24152, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:53, time_cost(all): 7:27:04/19:15:05, loss=0.509109267491217, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=4.039803168205573, lr=0.022921664190431186
2023-12-21 21:56:15   INFO  epoch: 6/24, acc_iter=24202, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:57, time_cost(all): 7:28:00/18:53:52, loss=0.508912976171873, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=3.009711713404701, lr=0.02289913778948928
2023-12-21 21:57:11   INFO  epoch: 6/24, acc_iter=24252, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:41, time_cost(all): 7:28:56/18:09:37, loss=0.50871668485253, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.83(1.03), norm=3.729003849364753, lr=0.02287661138854738
2023-12-21 21:58:06   INFO  epoch: 6/24, acc_iter=24302, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:42, time_cost(all): 7:29:51/18:15:28, loss=0.508520393533186, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=3.0536322338671016, lr=0.022854084987605473
2023-12-21 21:59:02   INFO  epoch: 6/24, acc_iter=24352, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:56, time_cost(all): 7:30:47/19:34:03, loss=0.508324102213843, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=0.5926362426740215, lr=0.02283155858666357
2023-12-21 21:59:58   INFO  epoch: 6/24, acc_iter=24402, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:56, time_cost(all): 7:31:43/17:54:17, loss=0.5081278108945, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=2.4267137652652644, lr=0.022809032185721667
2023-12-21 22:00:54   INFO  epoch: 6/24, acc_iter=24452, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:15, time_cost(all): 7:32:39/19:23:03, loss=0.507931519575156, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=1.5324294609265197, lr=0.022786505784779764
2023-12-21 22:01:49   INFO  epoch: 6/24, acc_iter=24502, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:13, time_cost(all): 7:33:34/18:43:18, loss=0.507735228255813, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=3.2463848864581033, lr=0.022763979383837857
2023-12-21 22:02:45   INFO  epoch: 6/24, acc_iter=24552, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:12, time_cost(all): 7:34:30/17:42:17, loss=0.507538936936469, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=3.3636556939302737, lr=0.022741452982895954
2023-12-21 22:03:41   INFO  epoch: 6/24, acc_iter=24602, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 7:35:26/18:47:26, loss=0.507342645617126, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=1.8006123386134232, lr=0.02271892658195405
2023-12-21 22:04:37   INFO  epoch: 7/24, acc_iter=24669, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:01:26, time_cost(all): 7:36:22/18:33:07, loss=0.507079615249206, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.83(1.03), norm=1.4476778139149848, lr=0.0226887412046919
2023-12-21 22:05:33   INFO  epoch: 7/24, acc_iter=24719, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:02:20, time_cost(all): 7:37:18/18:39:20, loss=0.506883323929862, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=2.8671093070075075, lr=0.022666214803749994
2023-12-21 22:06:28   INFO  epoch: 7/24, acc_iter=24769, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:03:37, time_cost(all): 7:38:13/19:23:30, loss=0.506687032610519, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=2.697587659137965, lr=0.02264368840280809
2023-12-21 22:07:24   INFO  epoch: 7/24, acc_iter=24819, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:03:07, time_cost(all): 7:39:09/17:58:23, loss=0.506490741291175, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=4.424739772203441, lr=0.022621162001866188
2023-12-21 22:08:20   INFO  epoch: 7/24, acc_iter=24869, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:00:38, time_cost(all): 7:40:05/17:49:12, loss=0.506294449971832, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.05(1.03), norm=4.329276390084296, lr=0.022598635600924285
2023-12-21 22:09:16   INFO  epoch: 7/24, acc_iter=24919, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:00:20, time_cost(all): 7:41:01/18:59:11, loss=0.506098158652489, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=2.9816772346553173, lr=0.022576109199982382
2023-12-21 22:10:11   INFO  epoch: 7/24, acc_iter=24969, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:59:29, time_cost(all): 7:41:56/18:54:56, loss=0.505901867333145, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=2.4557597891800973, lr=0.022553582799040475
2023-12-21 22:11:07   INFO  epoch: 7/24, acc_iter=25019, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:56:43, time_cost(all): 7:42:52/18:04:15, loss=0.505705576013802, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=2.9538250162206934, lr=0.022531056398098572
2023-12-21 22:12:03   INFO  epoch: 7/24, acc_iter=25069, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:56:33, time_cost(all): 7:43:48/18:41:05, loss=0.505509284694458, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=4.253545650497967, lr=0.02250852999715667
2023-12-21 22:12:59   INFO  epoch: 7/24, acc_iter=25119, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:58:18, time_cost(all): 7:44:44/17:40:36, loss=0.505312993375115, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=4.363305916002902, lr=0.022486003596214766
2023-12-21 22:13:54   INFO  epoch: 7/24, acc_iter=25169, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:55:01, time_cost(all): 7:45:39/18:19:18, loss=0.505116702055771, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.836401551784457, lr=0.02246347719527286
2023-12-21 22:14:50   INFO  epoch: 7/24, acc_iter=25219, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:54:38, time_cost(all): 7:46:35/17:34:37, loss=0.504920410736428, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=3.6506351359736726, lr=0.022440950794330956
2023-12-21 22:15:46   INFO  epoch: 7/24, acc_iter=25269, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:09, time_cost(all): 7:47:31/18:23:20, loss=0.504724119417085, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=1.4821978799585238, lr=0.022418424393389053
2023-12-21 22:16:42   INFO  epoch: 7/24, acc_iter=25319, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:52:29, time_cost(all): 7:48:27/18:42:16, loss=0.504527828097741, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=3.001839695913457, lr=0.02239589799244715
2023-12-21 22:17:38   INFO  epoch: 7/24, acc_iter=25369, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:49:15, time_cost(all): 7:49:23/17:52:03, loss=0.504331536778398, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=2.1670787184632045, lr=0.022373371591505244
2023-12-21 22:18:33   INFO  epoch: 7/24, acc_iter=25419, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:50:46, time_cost(all): 7:50:18/18:29:37, loss=0.504135245459054, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=3.9276786518327063, lr=0.02235084519056334
2023-12-21 22:19:29   INFO  epoch: 7/24, acc_iter=25469, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:49:06, time_cost(all): 7:51:14/17:41:54, loss=0.503938954139711, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=2.511226950601934, lr=0.022328318789621437
2023-12-21 22:20:25   INFO  epoch: 7/24, acc_iter=25519, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:49, time_cost(all): 7:52:10/17:48:09, loss=0.503742662820367, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=1.9385421700523646, lr=0.022305792388679534
2023-12-21 22:21:21   INFO  epoch: 7/24, acc_iter=25569, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:49:28, time_cost(all): 7:53:06/18:52:24, loss=0.503546371501024, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=3.9311366767717386, lr=0.02228326598773763
2023-12-21 22:22:16   INFO  epoch: 7/24, acc_iter=25619, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:44:52, time_cost(all): 7:54:01/18:25:19, loss=0.50335008018168, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=0.8678125554527714, lr=0.022260739586795725
2023-12-21 22:23:12   INFO  epoch: 7/24, acc_iter=25669, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:47:00, time_cost(all): 7:54:57/17:59:59, loss=0.503153788862337, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=3.0711954724247943, lr=0.02223821318585382
2023-12-21 22:24:08   INFO  epoch: 7/24, acc_iter=25719, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:45:00, time_cost(all): 7:55:53/18:14:31, loss=0.502957497542994, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=1.1215352551481832, lr=0.02221568678491192
2023-12-21 22:25:04   INFO  epoch: 7/24, acc_iter=25769, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:46:00, time_cost(all): 7:56:49/18:29:54, loss=0.50276120622365, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=1.1701187577571566, lr=0.022193160383970012
2023-12-21 22:25:59   INFO  epoch: 7/24, acc_iter=25819, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:41:26, time_cost(all): 7:57:44/17:33:41, loss=0.502564914904307, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=1.06398855083409, lr=0.022170633983028112
2023-12-21 22:26:55   INFO  epoch: 7/24, acc_iter=25869, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:41:30, time_cost(all): 7:58:40/19:00:23, loss=0.502368623584963, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=1.4785430349857465, lr=0.022148107582086206
2023-12-21 22:27:51   INFO  epoch: 7/24, acc_iter=25919, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:41:22, time_cost(all): 7:59:36/18:07:13, loss=0.50217233226562, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=3.2357452425155433, lr=0.022125581181144303
2023-12-21 22:28:47   INFO  epoch: 7/24, acc_iter=25969, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:42:10, time_cost(all): 8:00:32/17:22:53, loss=0.501976040946276, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=3.7068213646988846, lr=0.0221030547802024
2023-12-21 22:29:43   INFO  epoch: 7/24, acc_iter=26019, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:59, time_cost(all): 8:01:28/17:53:09, loss=0.501779749626933, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=4.774443641112383, lr=0.022080528379260493
2023-12-21 22:30:38   INFO  epoch: 7/24, acc_iter=26069, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:37:46, time_cost(all): 8:02:23/17:17:00, loss=0.50158345830759, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=4.50375146336731, lr=0.02205800197831859
2023-12-21 22:31:34   INFO  epoch: 7/24, acc_iter=26119, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:35:57, time_cost(all): 8:03:19/18:22:52, loss=0.501387166988246, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=1.083507443678921, lr=0.022035475577376687
2023-12-21 22:32:30   INFO  epoch: 7/24, acc_iter=26169, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:35:01, time_cost(all): 8:04:15/18:55:42, loss=0.501190875668903, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=1.1776790412056173, lr=0.02201294917643478
2023-12-21 22:33:26   INFO  epoch: 7/24, acc_iter=26219, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:26, time_cost(all): 8:05:11/17:15:53, loss=0.500994584349559, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=1.9217515534455398, lr=0.02199042277549288
2023-12-21 22:34:21   INFO  epoch: 7/24, acc_iter=26269, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:33:56, time_cost(all): 8:06:06/17:43:08, loss=0.500798293030216, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=2.992840932405876, lr=0.021967896374550974
2023-12-21 22:35:17   INFO  epoch: 7/24, acc_iter=26319, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:54, time_cost(all): 8:07:02/18:17:50, loss=0.500602001710872, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=2.9653600829325644, lr=0.02194536997360907
2023-12-21 22:36:13   INFO  epoch: 7/24, acc_iter=26369, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:32:26, time_cost(all): 8:07:58/18:07:41, loss=0.500405710391529, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=3.606698496217038, lr=0.021922843572667168
2023-12-21 22:37:09   INFO  epoch: 7/24, acc_iter=26419, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:33:03, time_cost(all): 8:08:54/18:35:43, loss=0.500209419072186, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=1.7390100157303146, lr=0.02190031717172526
2023-12-21 22:38:04   INFO  epoch: 7/24, acc_iter=26469, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:29:55, time_cost(all): 8:09:49/17:37:37, loss=0.500013127752842, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=0.9527981698294237, lr=0.02187779077078336
2023-12-21 22:39:00   INFO  epoch: 7/24, acc_iter=26519, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:30:25, time_cost(all): 8:10:45/17:31:14, loss=0.499816836433499, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=3.673872006498109, lr=0.021855264369841455
2023-12-21 22:39:56   INFO  epoch: 7/24, acc_iter=26569, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:27:47, time_cost(all): 8:11:41/17:48:21, loss=0.499620545114155, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=2.3178673189823717, lr=0.02183273796889955
2023-12-21 22:40:52   INFO  epoch: 7/24, acc_iter=26619, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:28:03, time_cost(all): 8:12:37/17:38:40, loss=0.499424253794812, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=0.7494019661462854, lr=0.02181021156795765
2023-12-21 22:41:48   INFO  epoch: 7/24, acc_iter=26669, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:28:16, time_cost(all): 8:13:33/17:49:02, loss=0.499227962475468, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=4.7606685406372105, lr=0.021787685167015743
2023-12-21 22:42:43   INFO  epoch: 7/24, acc_iter=26719, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:25:48, time_cost(all): 8:14:28/18:46:16, loss=0.499031671156125, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=4.023792331099832, lr=0.02176515876607384
2023-12-21 22:43:39   INFO  epoch: 7/24, acc_iter=26769, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:16, time_cost(all): 8:15:24/18:37:52, loss=0.498835379836782, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=0.889850579086811, lr=0.021742632365131936
2023-12-21 22:44:35   INFO  epoch: 7/24, acc_iter=26819, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:23:33, time_cost(all): 8:16:20/18:09:39, loss=0.498639088517438, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=3.6281323061859174, lr=0.02172010596419003
2023-12-21 22:45:31   INFO  epoch: 7/24, acc_iter=26869, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:22:49, time_cost(all): 8:17:16/17:25:44, loss=0.498442797198095, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=0.9777242050214172, lr=0.021697579563248127
2023-12-21 22:46:26   INFO  epoch: 7/24, acc_iter=26919, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:25, time_cost(all): 8:18:11/18:10:52, loss=0.498246505878751, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=0.7240900920017175, lr=0.021675053162306224
2023-12-21 22:47:22   INFO  epoch: 7/24, acc_iter=26969, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:22:12, time_cost(all): 8:19:07/17:51:28, loss=0.498050214559408, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=1.1594966664352462, lr=0.021652526761364317
2023-12-21 22:48:18   INFO  epoch: 7/24, acc_iter=27019, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:18, time_cost(all): 8:20:03/17:26:54, loss=0.497853923240064, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=1.5619201042148656, lr=0.021630000360422418
2023-12-21 22:49:14   INFO  epoch: 7/24, acc_iter=27069, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:31, time_cost(all): 8:20:59/18:12:55, loss=0.497657631920721, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=3.8903346384557715, lr=0.02160747395948051
2023-12-21 22:50:09   INFO  epoch: 7/24, acc_iter=27119, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:00, time_cost(all): 8:21:54/16:56:03, loss=0.497461340601377, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=4.038614800556239, lr=0.021584947558538608
2023-12-21 22:51:05   INFO  epoch: 7/24, acc_iter=27169, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:14, time_cost(all): 8:22:50/18:12:48, loss=0.497265049282034, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=2.9893286912589856, lr=0.021562421157596705
2023-12-21 22:52:01   INFO  epoch: 7/24, acc_iter=27219, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:44, time_cost(all): 8:23:46/17:06:18, loss=0.497068757962691, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=2.018560422754743, lr=0.021539894756654798
2023-12-21 22:52:57   INFO  epoch: 7/24, acc_iter=27269, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:26, time_cost(all): 8:24:42/17:46:59, loss=0.496872466643347, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=2.6009849495361865, lr=0.0215173683557129
2023-12-21 22:53:53   INFO  epoch: 7/24, acc_iter=27319, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:22, time_cost(all): 8:25:38/17:42:40, loss=0.496676175324004, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=2.1241790016751683, lr=0.021494841954770992
2023-12-21 22:54:48   INFO  epoch: 7/24, acc_iter=27369, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:13:45, time_cost(all): 8:26:33/17:56:34, loss=0.49647988400466, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=0.5152421405292542, lr=0.02147231555382909
2023-12-21 22:55:44   INFO  epoch: 7/24, acc_iter=27419, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:58, time_cost(all): 8:27:29/16:53:25, loss=0.496283592685317, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=0.6323084699523234, lr=0.021449789152887186
2023-12-21 22:56:40   INFO  epoch: 7/24, acc_iter=27469, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:53, time_cost(all): 8:28:25/18:14:45, loss=0.496087301365973, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=2.4572313394280325, lr=0.02142726275194528
2023-12-21 22:57:36   INFO  epoch: 7/24, acc_iter=27519, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:25, time_cost(all): 8:29:21/17:14:32, loss=0.49589101004663, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=4.444847021124074, lr=0.02140473635100338
2023-12-21 22:58:31   INFO  epoch: 7/24, acc_iter=27569, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:28, time_cost(all): 8:30:16/16:53:14, loss=0.495694718727287, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=4.149780375954742, lr=0.021382209950061473
2023-12-21 22:59:27   INFO  epoch: 7/24, acc_iter=27619, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:17, time_cost(all): 8:31:12/18:27:58, loss=0.495498427407943, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=1.937984091528508, lr=0.02135968354911957
2023-12-21 23:00:23   INFO  epoch: 7/24, acc_iter=27669, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:56, time_cost(all): 8:32:08/16:51:28, loss=0.4953021360886, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.9(1.03), norm=1.762949481891956, lr=0.021337157148177667
2023-12-21 23:01:19   INFO  epoch: 7/24, acc_iter=27719, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:50, time_cost(all): 8:33:04/17:30:18, loss=0.495105844769256, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=4.508089354360254, lr=0.02131463074723576
2023-12-21 23:02:14   INFO  epoch: 7/24, acc_iter=27769, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:07:06, time_cost(all): 8:33:59/17:33:08, loss=0.494909553449913, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=2.3332375570180037, lr=0.02129210434629386
2023-12-21 23:03:10   INFO  epoch: 7/24, acc_iter=27819, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:57, time_cost(all): 8:34:55/17:55:51, loss=0.494713262130569, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.2800957299380755, lr=0.021269577945351954
2023-12-21 23:04:06   INFO  epoch: 7/24, acc_iter=27869, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:05:09, time_cost(all): 8:35:51/16:44:57, loss=0.494516970811226, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=3.182209250270217, lr=0.02124705154441005
2023-12-21 23:05:02   INFO  epoch: 7/24, acc_iter=27919, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:11, time_cost(all): 8:36:47/17:02:24, loss=0.494320679491883, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=4.708883559802865, lr=0.021224525143468148
2023-12-21 23:05:58   INFO  epoch: 7/24, acc_iter=27969, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:14, time_cost(all): 8:37:43/18:20:31, loss=0.494124388172539, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=4.8371262591795725, lr=0.02120199874252624
2023-12-21 23:06:53   INFO  epoch: 7/24, acc_iter=28019, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:10, time_cost(all): 8:38:38/17:28:46, loss=0.493928096853196, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=2.0827574675261555, lr=0.02117947234158434
2023-12-21 23:07:49   INFO  epoch: 7/24, acc_iter=28069, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:12, time_cost(all): 8:39:34/17:18:44, loss=0.493731805533852, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=0.7302464762476741, lr=0.021156945940642435
2023-12-21 23:08:45   INFO  epoch: 7/24, acc_iter=28119, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 8:40:30/17:43:19, loss=0.493535514214509, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=2.003973844885934, lr=0.02113441953970053
2023-12-21 23:09:41   INFO  epoch: 8/24, acc_iter=28186, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:04:49, time_cost(all): 8:41:26/17:59:55, loss=0.493272483846589, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=1.1376112512370884, lr=0.02110423416243838
2023-12-21 23:10:36   INFO  epoch: 8/24, acc_iter=28236, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:02:09, time_cost(all): 8:42:21/16:44:22, loss=0.493076192527245, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=3.4193811257011135, lr=0.021081707761496475
2023-12-21 23:11:32   INFO  epoch: 8/24, acc_iter=28286, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:04:49, time_cost(all): 8:43:17/16:39:51, loss=0.492879901207902, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.7389597859179502, lr=0.021059181360554572
2023-12-21 23:12:28   INFO  epoch: 8/24, acc_iter=28336, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:04:40, time_cost(all): 8:44:13/17:37:54, loss=0.492683609888558, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.36436199379887, lr=0.02103665495961267
2023-12-21 23:13:24   INFO  epoch: 8/24, acc_iter=28386, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:03:17, time_cost(all): 8:45:09/16:46:14, loss=0.492487318569215, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=3.4016190362182175, lr=0.021014128558670762
2023-12-21 23:14:19   INFO  epoch: 8/24, acc_iter=28436, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:57:11, time_cost(all): 8:46:04/17:47:39, loss=0.492291027249871, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.23(1.03), norm=3.9567918635311283, lr=0.02099160215772886
2023-12-21 23:15:15   INFO  epoch: 8/24, acc_iter=28486, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/1:00:12, time_cost(all): 8:47:00/17:05:42, loss=0.492094735930528, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=0.913462454913333, lr=0.020969075756786956
2023-12-21 23:16:11   INFO  epoch: 8/24, acc_iter=28536, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/1:00:37, time_cost(all): 8:47:56/16:45:53, loss=0.491898444611185, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=1.835207653027834, lr=0.02094654935584505
2023-12-21 23:17:07   INFO  epoch: 8/24, acc_iter=28586, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:58:25, time_cost(all): 8:48:52/17:31:06, loss=0.491702153291841, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=3.3202559411862427, lr=0.02092402295490315
2023-12-21 23:18:02   INFO  epoch: 8/24, acc_iter=28636, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:58:11, time_cost(all): 8:49:47/17:09:51, loss=0.491505861972498, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=3.3358081050937596, lr=0.020901496553961244
2023-12-21 23:18:58   INFO  epoch: 8/24, acc_iter=28686, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:55:43, time_cost(all): 8:50:43/17:25:27, loss=0.491309570653154, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=0.7121740408501526, lr=0.02087897015301934
2023-12-21 23:19:54   INFO  epoch: 8/24, acc_iter=28736, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:55:54, time_cost(all): 8:51:39/17:10:14, loss=0.491113279333811, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=4.618167556432536, lr=0.020856443752077437
2023-12-21 23:20:50   INFO  epoch: 8/24, acc_iter=28786, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:55:41, time_cost(all): 8:52:35/16:53:35, loss=0.490916988014467, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.473931478720836, lr=0.02083391735113553
2023-12-21 23:21:46   INFO  epoch: 8/24, acc_iter=28836, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:54:39, time_cost(all): 8:53:31/16:35:23, loss=0.490720696695124, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=3.4616884700908264, lr=0.020811390950193628
2023-12-21 23:22:41   INFO  epoch: 8/24, acc_iter=28886, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:52:32, time_cost(all): 8:54:26/17:55:12, loss=0.490524405375781, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=2.0414562205724347, lr=0.020788864549251725
2023-12-21 23:23:37   INFO  epoch: 8/24, acc_iter=28936, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:50:15, time_cost(all): 8:55:22/17:27:13, loss=0.490328114056437, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=1.3124273755593714, lr=0.020766338148309818
2023-12-21 23:24:33   INFO  epoch: 8/24, acc_iter=28986, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:47:17, time_cost(all): 8:56:18/16:35:48, loss=0.490131822737094, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=4.0700911740458565, lr=0.02074381174736792
2023-12-21 23:25:29   INFO  epoch: 8/24, acc_iter=29036, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:47:37, time_cost(all): 8:57:14/16:38:41, loss=0.48993553141775, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=3.3104962459306804, lr=0.020721285346426012
2023-12-21 23:26:24   INFO  epoch: 8/24, acc_iter=29086, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:47:32, time_cost(all): 8:58:09/16:47:32, loss=0.489739240098407, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=2.289090926514209, lr=0.02069875894548411
2023-12-21 23:27:20   INFO  epoch: 8/24, acc_iter=29136, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:45:45, time_cost(all): 8:59:05/17:20:49, loss=0.489542948779063, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=3.2260378282276294, lr=0.020676232544542206
2023-12-21 23:28:16   INFO  epoch: 8/24, acc_iter=29186, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:44:09, time_cost(all): 9:00:01/16:45:14, loss=0.48934665745972, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=1.0204545299784287, lr=0.0206537061436003
2023-12-21 23:29:12   INFO  epoch: 8/24, acc_iter=29236, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:54, time_cost(all): 9:00:57/16:16:53, loss=0.489150366140377, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=2.2174618808992745, lr=0.020631179742658396
2023-12-21 23:30:07   INFO  epoch: 8/24, acc_iter=29286, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:44:40, time_cost(all): 9:01:52/17:03:23, loss=0.488954074821033, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=3.333677555261232, lr=0.020608653341716493
2023-12-21 23:31:03   INFO  epoch: 8/24, acc_iter=29336, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:42:47, time_cost(all): 9:02:48/16:19:10, loss=0.48875778350169, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.239250530172036, lr=0.020586126940774586
2023-12-21 23:31:59   INFO  epoch: 8/24, acc_iter=29386, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:30, time_cost(all): 9:03:44/16:14:10, loss=0.488561492182346, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=1.1211028643357817, lr=0.020563600539832687
2023-12-21 23:32:55   INFO  epoch: 8/24, acc_iter=29436, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:39:54, time_cost(all): 9:04:40/16:44:07, loss=0.488365200863003, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=0.781798868579153, lr=0.02054107413889078
2023-12-21 23:33:51   INFO  epoch: 8/24, acc_iter=29486, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:28, time_cost(all): 9:05:36/17:42:12, loss=0.488168909543659, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=2.78128949434444, lr=0.020518547737948877
2023-12-21 23:34:46   INFO  epoch: 8/24, acc_iter=29536, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:21, time_cost(all): 9:06:31/17:52:18, loss=0.487972618224316, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=4.980658223306047, lr=0.020496021337006974
2023-12-21 23:35:42   INFO  epoch: 8/24, acc_iter=29586, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:36:58, time_cost(all): 9:07:27/17:40:53, loss=0.487776326904973, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=1.1179427816802017, lr=0.02047349493606507
2023-12-21 23:36:38   INFO  epoch: 8/24, acc_iter=29636, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:38:56, time_cost(all): 9:08:23/16:53:53, loss=0.487580035585629, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=3.540610077560026, lr=0.020450968535123168
2023-12-21 23:37:34   INFO  epoch: 8/24, acc_iter=29686, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:38:05, time_cost(all): 9:09:19/16:57:46, loss=0.487383744266286, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=0.8173837602667738, lr=0.02042844213418126
2023-12-21 23:38:29   INFO  epoch: 8/24, acc_iter=29736, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:37:03, time_cost(all): 9:10:14/16:54:41, loss=0.487187452946942, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=2.8829221950494985, lr=0.02040591573323936
2023-12-21 23:39:25   INFO  epoch: 8/24, acc_iter=29786, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:33:27, time_cost(all): 9:11:10/16:52:30, loss=0.486991161627599, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=1.64322508002006, lr=0.020383389332297455
2023-12-21 23:40:21   INFO  epoch: 8/24, acc_iter=29836, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:33:03, time_cost(all): 9:12:06/16:20:23, loss=0.486794870308255, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=1.3614068400932275, lr=0.02036086293135555
2023-12-21 23:41:17   INFO  epoch: 8/24, acc_iter=29886, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:31:21, time_cost(all): 9:13:02/17:04:13, loss=0.486598578988912, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=1.9121525988014783, lr=0.02033833653041365
2023-12-21 23:42:12   INFO  epoch: 8/24, acc_iter=29936, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:32:00, time_cost(all): 9:13:57/17:06:24, loss=0.486402287669569, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=3.1323214694221937, lr=0.020315810129471742
2023-12-21 23:43:08   INFO  epoch: 8/24, acc_iter=29986, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:22, time_cost(all): 9:14:53/17:14:48, loss=0.486205996350225, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=4.791438172066279, lr=0.02029328372852984
2023-12-21 23:44:04   INFO  epoch: 8/24, acc_iter=30036, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:19, time_cost(all): 9:15:49/17:31:23, loss=0.486009705030882, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=4.870214074359466, lr=0.020270757327587936
2023-12-21 23:45:00   INFO  epoch: 8/24, acc_iter=30086, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:03, time_cost(all): 9:16:45/17:07:58, loss=0.485813413711538, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=0.9790502519806612, lr=0.02024823092664603
2023-12-21 23:45:56   INFO  epoch: 8/24, acc_iter=30136, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:40, time_cost(all): 9:17:41/17:35:00, loss=0.485617122392195, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=3.8125952005751738, lr=0.020225704525704127
2023-12-21 23:46:51   INFO  epoch: 8/24, acc_iter=30186, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:54, time_cost(all): 9:18:36/16:41:11, loss=0.485420831072851, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.99(1.03), norm=3.500575729885416, lr=0.020203178124762224
2023-12-21 23:47:47   INFO  epoch: 8/24, acc_iter=30236, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:27:12, time_cost(all): 9:19:32/16:59:52, loss=0.485224539753508, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=3.3681871072677554, lr=0.020180651723820317
2023-12-21 23:48:43   INFO  epoch: 8/24, acc_iter=30286, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:11, time_cost(all): 9:20:28/17:02:14, loss=0.485028248434164, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=0.7350241866430132, lr=0.020158125322878417
2023-12-21 23:49:39   INFO  epoch: 8/24, acc_iter=30336, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:23:38, time_cost(all): 9:21:24/17:25:45, loss=0.484831957114821, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=2.4822988404722537, lr=0.02013559892193651
2023-12-21 23:50:34   INFO  epoch: 8/24, acc_iter=30386, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:23:49, time_cost(all): 9:22:19/16:35:29, loss=0.484635665795478, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.0356746072997827, lr=0.020113072520994608
2023-12-21 23:51:30   INFO  epoch: 8/24, acc_iter=30436, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:51, time_cost(all): 9:23:15/17:29:27, loss=0.484439374476134, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=1.0574511580743537, lr=0.020090546120052705
2023-12-21 23:52:26   INFO  epoch: 8/24, acc_iter=30486, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:46, time_cost(all): 9:24:11/16:11:57, loss=0.484243083156791, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=4.474350562199347, lr=0.020068019719110798
2023-12-21 23:53:22   INFO  epoch: 8/24, acc_iter=30536, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:21:13, time_cost(all): 9:25:07/16:48:36, loss=0.484046791837447, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=2.238328390379446, lr=0.020045493318168895
2023-12-21 23:54:17   INFO  epoch: 8/24, acc_iter=30586, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:37, time_cost(all): 9:26:02/16:22:28, loss=0.483850500518104, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=2.9211703971417258, lr=0.020022966917226992
2023-12-21 23:55:13   INFO  epoch: 8/24, acc_iter=30636, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:15, time_cost(all): 9:26:58/16:22:53, loss=0.48365420919876, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=1.777967765271789, lr=0.020000440516285085
2023-12-21 23:56:09   INFO  epoch: 8/24, acc_iter=30686, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:35, time_cost(all): 9:27:54/16:10:07, loss=0.483457917879417, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=4.349297945716671, lr=0.019977914115343186
2023-12-21 23:57:05   INFO  epoch: 8/24, acc_iter=30736, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:52, time_cost(all): 9:28:50/16:21:02, loss=0.483261626560074, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=2.306685962917295, lr=0.01995538771440128
2023-12-21 23:58:01   INFO  epoch: 8/24, acc_iter=30786, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:42, time_cost(all): 9:29:46/16:49:40, loss=0.48306533524073, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=0.9141467670287107, lr=0.019932861313459376
2023-12-21 23:58:56   INFO  epoch: 8/24, acc_iter=30836, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:38, time_cost(all): 9:30:41/17:19:53, loss=0.482869043921387, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.3922113153061066, lr=0.019910334912517473
2023-12-21 23:59:52   INFO  epoch: 8/24, acc_iter=30886, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:56, time_cost(all): 9:31:37/17:15:04, loss=0.482672752602043, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=4.933687857104031, lr=0.019887808511575567
2023-12-22 00:00:48   INFO  epoch: 8/24, acc_iter=30936, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:10, time_cost(all): 9:32:33/16:22:54, loss=0.4824764612827, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=1.419839629767108, lr=0.019865282110633663
2023-12-22 00:01:44   INFO  epoch: 8/24, acc_iter=30986, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:12, time_cost(all): 9:33:29/17:17:03, loss=0.482280169963356, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=2.6189916453819406, lr=0.01984275570969176
2023-12-22 00:02:39   INFO  epoch: 8/24, acc_iter=31036, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:14, time_cost(all): 9:34:24/16:05:20, loss=0.482083878644013, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=1.684465175855556, lr=0.019820229308749854
2023-12-22 00:03:35   INFO  epoch: 8/24, acc_iter=31086, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:39, time_cost(all): 9:35:20/15:51:00, loss=0.48188758732467, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=3.3847525033679715, lr=0.019797702907807954
2023-12-22 00:04:31   INFO  epoch: 8/24, acc_iter=31136, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:56, time_cost(all): 9:36:16/16:22:49, loss=0.481691296005326, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=2.6817756939354407, lr=0.019775176506866048
2023-12-22 00:05:27   INFO  epoch: 8/24, acc_iter=31186, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:26, time_cost(all): 9:37:12/16:48:06, loss=0.481495004685983, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=3.0937380073919574, lr=0.019752650105924145
2023-12-22 00:06:22   INFO  epoch: 8/24, acc_iter=31236, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:36, time_cost(all): 9:38:07/16:13:31, loss=0.481298713366639, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=3.4569278510526336, lr=0.01973012370498224
2023-12-22 00:07:18   INFO  epoch: 8/24, acc_iter=31286, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:38, time_cost(all): 9:39:03/17:08:04, loss=0.481102422047296, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=1.1207369513978336, lr=0.019707597304040335
2023-12-22 00:08:14   INFO  epoch: 8/24, acc_iter=31336, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:57, time_cost(all): 9:39:59/16:45:20, loss=0.480906130727952, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=0.6593464385904879, lr=0.019685070903098432
2023-12-22 00:09:10   INFO  epoch: 8/24, acc_iter=31386, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:48, time_cost(all): 9:40:55/17:16:42, loss=0.480709839408609, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=2.438958970346828, lr=0.01966254450215653
2023-12-22 00:10:06   INFO  epoch: 8/24, acc_iter=31436, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:07, time_cost(all): 9:41:51/15:59:03, loss=0.480513548089265, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=4.476413474683824, lr=0.019640018101214622
2023-12-22 00:11:01   INFO  epoch: 8/24, acc_iter=31486, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:02:58, time_cost(all): 9:42:46/15:37:57, loss=0.480317256769922, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=2.1328344158821975, lr=0.019617491700272723
2023-12-22 00:11:57   INFO  epoch: 8/24, acc_iter=31536, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:10, time_cost(all): 9:43:42/15:38:40, loss=0.480120965450579, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=1.717262205642532, lr=0.019594965299330816
2023-12-22 00:12:53   INFO  epoch: 8/24, acc_iter=31586, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:14, time_cost(all): 9:44:38/16:25:53, loss=0.479924674131235, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=1.7716990790104228, lr=0.019572438898388913
2023-12-22 00:13:49   INFO  epoch: 8/24, acc_iter=31636, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 9:45:34/16:47:04, loss=0.479728382811892, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=2.5143359465941075, lr=0.01954991249744701
2023-12-22 00:14:44   INFO  epoch: 9/24, acc_iter=31703, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:03:58, time_cost(all): 9:46:29/16:57:01, loss=0.479465352443972, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=1.7608137693227859, lr=0.01951972712018486
2023-12-22 00:15:40   INFO  epoch: 9/24, acc_iter=31753, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:01:14, time_cost(all): 9:47:25/16:39:20, loss=0.479269061124628, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.23(1.03), norm=1.0635938884575147, lr=0.019497200719242956
2023-12-22 00:16:36   INFO  epoch: 9/24, acc_iter=31803, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:02:31, time_cost(all): 9:48:21/16:36:37, loss=0.479072769805285, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=4.298435968182039, lr=0.01947467431830105
2023-12-22 00:17:32   INFO  epoch: 9/24, acc_iter=31853, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/0:59:49, time_cost(all): 9:49:17/15:55:45, loss=0.478876478485941, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=2.1446882663369657, lr=0.019452147917359146
2023-12-22 00:18:27   INFO  epoch: 9/24, acc_iter=31903, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:00:52, time_cost(all): 9:50:12/15:44:08, loss=0.478680187166598, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=1.5673785257667052, lr=0.019429621516417243
2023-12-22 00:19:23   INFO  epoch: 9/24, acc_iter=31953, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:01:11, time_cost(all): 9:51:08/16:26:26, loss=0.478483895847254, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=2.9795397827113743, lr=0.019407095115475337
2023-12-22 00:20:19   INFO  epoch: 9/24, acc_iter=32003, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/1:01:03, time_cost(all): 9:52:04/16:29:00, loss=0.478287604527911, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=3.544331946021133, lr=0.019384568714533437
2023-12-22 00:21:15   INFO  epoch: 9/24, acc_iter=32053, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:59:30, time_cost(all): 9:53:00/16:16:04, loss=0.478091313208568, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=1.658824574838793, lr=0.01936204231359153
2023-12-22 00:22:11   INFO  epoch: 9/24, acc_iter=32103, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:58:42, time_cost(all): 9:53:56/16:35:58, loss=0.477895021889224, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.2233925774081802, lr=0.019339515912649628
2023-12-22 00:23:06   INFO  epoch: 9/24, acc_iter=32153, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:58:17, time_cost(all): 9:54:51/16:09:35, loss=0.477698730569881, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=3.34416989974277, lr=0.019316989511707724
2023-12-22 00:24:02   INFO  epoch: 9/24, acc_iter=32203, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:57:01, time_cost(all): 9:55:47/16:56:45, loss=0.477502439250537, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=3.498591235794914, lr=0.019294463110765818
2023-12-22 00:24:58   INFO  epoch: 9/24, acc_iter=32253, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:54:31, time_cost(all): 9:56:43/15:35:42, loss=0.477306147931194, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=4.621968677174321, lr=0.01927193670982392
2023-12-22 00:25:54   INFO  epoch: 9/24, acc_iter=32303, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:51:11, time_cost(all): 9:57:39/16:26:51, loss=0.47710985661185, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=2.2308667880170088, lr=0.019249410308882012
2023-12-22 00:26:49   INFO  epoch: 9/24, acc_iter=32353, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:52:28, time_cost(all): 9:58:34/15:26:28, loss=0.476913565292507, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=2.6098068917628443, lr=0.01922688390794011
2023-12-22 00:27:45   INFO  epoch: 9/24, acc_iter=32403, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:50:07, time_cost(all): 9:59:30/15:46:14, loss=0.476717273973163, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=3.0211498366705714, lr=0.019204357506998206
2023-12-22 00:28:41   INFO  epoch: 9/24, acc_iter=32453, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:51:33, time_cost(all): 10:00:26/16:42:55, loss=0.47652098265382, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=2.228892054450755, lr=0.0191818311060563
2023-12-22 00:29:37   INFO  epoch: 9/24, acc_iter=32503, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:50:37, time_cost(all): 10:01:22/16:30:57, loss=0.476324691334477, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=2.72039118479178, lr=0.019159304705114396
2023-12-22 00:30:32   INFO  epoch: 9/24, acc_iter=32553, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:46:45, time_cost(all): 10:02:17/15:18:42, loss=0.476128400015133, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=1.6987121269930565, lr=0.019136778304172493
2023-12-22 00:31:28   INFO  epoch: 9/24, acc_iter=32603, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:46:44, time_cost(all): 10:03:13/15:21:22, loss=0.47593210869579, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.472836485879782, lr=0.019114251903230586
2023-12-22 00:32:24   INFO  epoch: 9/24, acc_iter=32653, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:45:59, time_cost(all): 10:04:09/15:20:07, loss=0.475735817376446, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=1.4429964232765229, lr=0.019091725502288687
2023-12-22 00:33:20   INFO  epoch: 9/24, acc_iter=32703, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:47:12, time_cost(all): 10:05:05/15:29:07, loss=0.475539526057103, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=1.0008395721515462, lr=0.01906919910134678
2023-12-22 00:34:16   INFO  epoch: 9/24, acc_iter=32753, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:42:50, time_cost(all): 10:06:01/16:15:54, loss=0.475343234737759, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=2.2448868444565795, lr=0.019046672700404877
2023-12-22 00:35:11   INFO  epoch: 9/24, acc_iter=32803, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:45:52, time_cost(all): 10:06:56/16:31:53, loss=0.475146943418416, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=0.6161628419285636, lr=0.019024146299462974
2023-12-22 00:36:07   INFO  epoch: 9/24, acc_iter=32853, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:21, time_cost(all): 10:07:52/16:23:48, loss=0.474950652099073, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=1.7993604317577319, lr=0.019001619898521067
2023-12-22 00:37:03   INFO  epoch: 9/24, acc_iter=32903, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:41:20, time_cost(all): 10:08:48/16:13:10, loss=0.474754360779729, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.5262981711734875, lr=0.018979093497579164
2023-12-22 00:37:59   INFO  epoch: 9/24, acc_iter=32953, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:39:39, time_cost(all): 10:09:44/15:34:36, loss=0.474558069460386, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=1.051843695355596, lr=0.01895656709663726
2023-12-22 00:38:54   INFO  epoch: 9/24, acc_iter=33003, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:39:27, time_cost(all): 10:10:39/16:18:32, loss=0.474361778141042, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=3.9750633358293364, lr=0.018934040695695355
2023-12-22 00:39:50   INFO  epoch: 9/24, acc_iter=33053, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:07, time_cost(all): 10:11:35/16:00:21, loss=0.474165486821699, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=1.7793380650422854, lr=0.018911514294753455
2023-12-22 00:40:46   INFO  epoch: 9/24, acc_iter=33103, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:37:58, time_cost(all): 10:12:31/15:46:52, loss=0.473969195502355, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=0.9801442178120441, lr=0.01888898789381155
2023-12-22 00:41:42   INFO  epoch: 9/24, acc_iter=33153, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:35, time_cost(all): 10:13:27/16:31:17, loss=0.473772904183012, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=1.944863490705276, lr=0.018866461492869645
2023-12-22 00:42:37   INFO  epoch: 9/24, acc_iter=33203, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:37:30, time_cost(all): 10:14:22/16:36:36, loss=0.473576612863669, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=4.9180094613857035, lr=0.018843935091927742
2023-12-22 00:43:33   INFO  epoch: 9/24, acc_iter=33253, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:36:51, time_cost(all): 10:15:18/16:30:20, loss=0.473380321544325, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=1.4230699241492508, lr=0.018821408690985836
2023-12-22 00:44:29   INFO  epoch: 9/24, acc_iter=33303, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:35:10, time_cost(all): 10:16:14/15:09:32, loss=0.473184030224982, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=2.211353680096472, lr=0.018798882290043933
2023-12-22 00:45:25   INFO  epoch: 9/24, acc_iter=33353, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:32:30, time_cost(all): 10:17:10/15:25:02, loss=0.472987738905638, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=4.741526098662112, lr=0.01877635588910203
2023-12-22 00:46:20   INFO  epoch: 9/24, acc_iter=33403, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:48, time_cost(all): 10:18:05/15:31:48, loss=0.472791447586295, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=0.5759942874963122, lr=0.018753829488160123
2023-12-22 00:47:16   INFO  epoch: 9/24, acc_iter=33453, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:30:20, time_cost(all): 10:19:01/16:30:16, loss=0.472595156266951, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=2.0535365079342043, lr=0.018731303087218223
2023-12-22 00:48:12   INFO  epoch: 9/24, acc_iter=33503, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:28, time_cost(all): 10:19:57/15:13:32, loss=0.472398864947608, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=1.0285596071824186, lr=0.018708776686276317
2023-12-22 00:49:08   INFO  epoch: 9/24, acc_iter=33553, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:53, time_cost(all): 10:20:53/16:33:13, loss=0.472202573628265, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=2.871391545842651, lr=0.018686250285334414
2023-12-22 00:50:04   INFO  epoch: 9/24, acc_iter=33603, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:38, time_cost(all): 10:21:49/15:11:44, loss=0.472006282308921, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=2.3057205386833166, lr=0.01866372388439251
2023-12-22 00:50:59   INFO  epoch: 9/24, acc_iter=33653, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:28:41, time_cost(all): 10:22:44/16:05:43, loss=0.471809990989578, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=3.734827607056698, lr=0.018641197483450604
2023-12-22 00:51:55   INFO  epoch: 9/24, acc_iter=33703, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:49, time_cost(all): 10:23:40/15:41:28, loss=0.471613699670234, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=3.290257297608803, lr=0.0186186710825087
2023-12-22 00:52:51   INFO  epoch: 9/24, acc_iter=33753, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:27:38, time_cost(all): 10:24:36/16:09:49, loss=0.471417408350891, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=3.391641545909772, lr=0.018596144681566798
2023-12-22 00:53:47   INFO  epoch: 9/24, acc_iter=33803, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:25:42, time_cost(all): 10:25:32/15:26:12, loss=0.471221117031547, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=1.4686505919713064, lr=0.01857361828062489
2023-12-22 00:54:42   INFO  epoch: 9/24, acc_iter=33853, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:28, time_cost(all): 10:26:27/16:00:21, loss=0.471024825712204, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=0.9742213142592646, lr=0.018551091879682992
2023-12-22 00:55:38   INFO  epoch: 9/24, acc_iter=33903, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:23:53, time_cost(all): 10:27:23/16:21:11, loss=0.470828534392861, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=4.892142937084378, lr=0.018528565478741085
2023-12-22 00:56:34   INFO  epoch: 9/24, acc_iter=33953, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:23:01, time_cost(all): 10:28:19/16:17:51, loss=0.470632243073517, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=2.2482740288741425, lr=0.018506039077799182
2023-12-22 00:57:30   INFO  epoch: 9/24, acc_iter=34003, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:51, time_cost(all): 10:29:15/16:08:08, loss=0.470435951754174, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=2.2770222914137035, lr=0.01848351267685728
2023-12-22 00:58:25   INFO  epoch: 9/24, acc_iter=34053, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:19:53, time_cost(all): 10:30:10/15:47:46, loss=0.47023966043483, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=4.193148424287445, lr=0.018460986275915373
2023-12-22 00:59:21   INFO  epoch: 9/24, acc_iter=34103, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:40, time_cost(all): 10:31:06/15:24:50, loss=0.470043369115487, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=1.3851298518663595, lr=0.01843845987497347
2023-12-22 01:00:17   INFO  epoch: 9/24, acc_iter=34153, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:34, time_cost(all): 10:32:02/14:55:43, loss=0.469847077796143, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=4.739431283996242, lr=0.018415933474031566
2023-12-22 01:01:13   INFO  epoch: 9/24, acc_iter=34203, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:22, time_cost(all): 10:32:58/16:04:36, loss=0.4696507864768, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=3.1762437028573047, lr=0.018393407073089663
2023-12-22 01:02:09   INFO  epoch: 9/24, acc_iter=34253, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:40, time_cost(all): 10:33:54/16:04:39, loss=0.469454495157457, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=0.724817440421871, lr=0.01837088067214776
2023-12-22 01:03:04   INFO  epoch: 9/24, acc_iter=34303, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:27, time_cost(all): 10:34:49/16:17:36, loss=0.469258203838113, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=2.4029614942627564, lr=0.018348354271205854
2023-12-22 01:04:00   INFO  epoch: 9/24, acc_iter=34353, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:14, time_cost(all): 10:35:45/15:27:06, loss=0.46906191251877, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=1.2137248447824682, lr=0.018325827870263954
2023-12-22 01:04:56   INFO  epoch: 9/24, acc_iter=34403, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:45, time_cost(all): 10:36:41/15:54:04, loss=0.468865621199426, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=3.210012302578296, lr=0.018303301469322047
2023-12-22 01:05:52   INFO  epoch: 9/24, acc_iter=34453, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:17, time_cost(all): 10:37:37/15:09:18, loss=0.468669329880083, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=3.5223885501422294, lr=0.018280775068380144
2023-12-22 01:06:47   INFO  epoch: 9/24, acc_iter=34503, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:36, time_cost(all): 10:38:32/15:20:46, loss=0.468473038560739, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=4.431245760734617, lr=0.01825824866743824
2023-12-22 01:07:43   INFO  epoch: 9/24, acc_iter=34553, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:47, time_cost(all): 10:39:28/15:33:58, loss=0.468276747241396, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=4.95207611283947, lr=0.018235722266496335
2023-12-22 01:08:39   INFO  epoch: 9/24, acc_iter=34603, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:48, time_cost(all): 10:40:24/14:59:27, loss=0.468080455922052, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=2.520129553624068, lr=0.01821319586555443
2023-12-22 01:09:35   INFO  epoch: 9/24, acc_iter=34653, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:10:03, time_cost(all): 10:41:20/15:31:07, loss=0.467884164602709, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=3.9027205861705005, lr=0.01819066946461253
2023-12-22 01:10:30   INFO  epoch: 9/24, acc_iter=34703, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:51, time_cost(all): 10:42:15/15:18:16, loss=0.467687873283366, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=1.1831748795235548, lr=0.018168143063670622
2023-12-22 01:11:26   INFO  epoch: 9/24, acc_iter=34753, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:54, time_cost(all): 10:43:11/15:06:17, loss=0.467491581964022, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=2.025511000544811, lr=0.018145616662728722
2023-12-22 01:12:22   INFO  epoch: 9/24, acc_iter=34803, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:40, time_cost(all): 10:44:07/14:47:34, loss=0.467295290644679, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=0.5805432102865826, lr=0.018123090261786816
2023-12-22 01:13:18   INFO  epoch: 9/24, acc_iter=34853, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:40, time_cost(all): 10:45:03/15:14:18, loss=0.467098999325335, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=4.6936039098545965, lr=0.018100563860844913
2023-12-22 01:14:14   INFO  epoch: 9/24, acc_iter=34903, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:51, time_cost(all): 10:45:59/14:58:26, loss=0.466902708005992, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=1.3305524997239198, lr=0.01807803745990301
2023-12-22 01:15:09   INFO  epoch: 9/24, acc_iter=34953, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:52, time_cost(all): 10:46:54/15:23:50, loss=0.466706416686648, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.6859614504994103, lr=0.018055511058961103
2023-12-22 01:16:05   INFO  epoch: 9/24, acc_iter=35003, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:06, time_cost(all): 10:47:50/15:14:42, loss=0.466510125367305, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=4.068668722198708, lr=0.0180329846580192
2023-12-22 01:17:01   INFO  epoch: 9/24, acc_iter=35053, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:10, time_cost(all): 10:48:46/14:37:49, loss=0.466313834047962, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=2.6963630891472428, lr=0.018010458257077297
2023-12-22 01:17:57   INFO  epoch: 9/24, acc_iter=35103, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:16, time_cost(all): 10:49:42/15:52:13, loss=0.466117542728618, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=3.570504456849526, lr=0.01798793185613539
2023-12-22 01:18:52   INFO  epoch: 9/24, acc_iter=35153, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 10:50:37/15:34:16, loss=0.465921251409275, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=1.944348200727387, lr=0.01796540545519349
2023-12-22 01:19:48   INFO  epoch: 10/24, acc_iter=35220, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:04:14, time_cost(all): 10:51:33/15:35:18, loss=0.465658221041354, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=4.293171741955675, lr=0.017935220077931337
2023-12-22 01:20:44   INFO  epoch: 10/24, acc_iter=35270, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:05:36, time_cost(all): 10:52:29/15:42:43, loss=0.465461929722011, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=2.839040617252502, lr=0.017912693676989434
2023-12-22 01:21:40   INFO  epoch: 10/24, acc_iter=35320, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:00:45, time_cost(all): 10:53:25/15:48:34, loss=0.465265638402668, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=1.2911398463272106, lr=0.01789016727604753
2023-12-22 01:22:35   INFO  epoch: 10/24, acc_iter=35370, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:03:56, time_cost(all): 10:54:20/15:54:05, loss=0.465069347083324, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=3.612453764154847, lr=0.017867640875105624
2023-12-22 01:23:31   INFO  epoch: 10/24, acc_iter=35420, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:02:01, time_cost(all): 10:55:16/15:06:57, loss=0.464873055763981, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=3.1673995874736485, lr=0.01784511447416372
2023-12-22 01:24:27   INFO  epoch: 10/24, acc_iter=35470, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:58:19, time_cost(all): 10:56:12/15:12:13, loss=0.464676764444637, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=4.082295842668231, lr=0.017822588073221818
2023-12-22 01:25:23   INFO  epoch: 10/24, acc_iter=35520, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/1:01:11, time_cost(all): 10:57:08/14:52:48, loss=0.464480473125294, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=4.131149452018765, lr=0.01780006167227991
2023-12-22 01:26:19   INFO  epoch: 10/24, acc_iter=35570, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:59:02, time_cost(all): 10:58:04/15:00:53, loss=0.46428418180595, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=0.6389362757602088, lr=0.01777753527133801
2023-12-22 01:27:14   INFO  epoch: 10/24, acc_iter=35620, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:56:34, time_cost(all): 10:58:59/15:26:40, loss=0.464087890486607, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=3.980990585276181, lr=0.017755008870396105
2023-12-22 01:28:10   INFO  epoch: 10/24, acc_iter=35670, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:53:52, time_cost(all): 10:59:55/14:41:57, loss=0.463891599167264, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.98(1.03), norm=4.085368711842741, lr=0.017732482469454202
2023-12-22 01:29:06   INFO  epoch: 10/24, acc_iter=35720, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:55:42, time_cost(all): 11:00:51/14:52:41, loss=0.46369530784792, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=4.5478519234205015, lr=0.0177099560685123
2023-12-22 01:30:02   INFO  epoch: 10/24, acc_iter=35770, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:37, time_cost(all): 11:01:47/14:41:05, loss=0.463499016528577, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=3.663557941008409, lr=0.017687429667570392
2023-12-22 01:30:57   INFO  epoch: 10/24, acc_iter=35820, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:58, time_cost(all): 11:02:42/14:42:06, loss=0.463302725209233, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=3.111445014101597, lr=0.01766490326662849
2023-12-22 01:31:53   INFO  epoch: 10/24, acc_iter=35870, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:50:40, time_cost(all): 11:03:38/14:55:16, loss=0.46310643388989, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=2.4871256539560767, lr=0.017642376865686586
2023-12-22 01:32:49   INFO  epoch: 10/24, acc_iter=35920, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:50:10, time_cost(all): 11:04:34/14:51:15, loss=0.462910142570546, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.83(1.03), norm=4.958382270809921, lr=0.01761985046474468
2023-12-22 01:33:45   INFO  epoch: 10/24, acc_iter=35970, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:51:42, time_cost(all): 11:05:30/15:16:53, loss=0.462713851251203, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.83(1.03), norm=0.8340702984389874, lr=0.01759732406380278
2023-12-22 01:34:40   INFO  epoch: 10/24, acc_iter=36020, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:51:16, time_cost(all): 11:06:25/14:27:57, loss=0.46251755993186, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=4.4631901215975915, lr=0.017574797662860873
2023-12-22 01:35:36   INFO  epoch: 10/24, acc_iter=36070, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:49:51, time_cost(all): 11:07:21/14:43:35, loss=0.462321268612516, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=3.0801400108425936, lr=0.01755227126191897
2023-12-22 01:36:32   INFO  epoch: 10/24, acc_iter=36120, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:45:54, time_cost(all): 11:08:17/15:23:55, loss=0.462124977293173, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=3.0429325480472698, lr=0.017529744860977067
2023-12-22 01:37:28   INFO  epoch: 10/24, acc_iter=36170, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:48:21, time_cost(all): 11:09:13/15:22:11, loss=0.461928685973829, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=2.7436096815918676, lr=0.01750721846003516
2023-12-22 01:38:24   INFO  epoch: 10/24, acc_iter=36220, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:44:57, time_cost(all): 11:10:09/14:49:55, loss=0.461732394654486, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=1.0334972132386238, lr=0.017484692059093258
2023-12-22 01:39:19   INFO  epoch: 10/24, acc_iter=36270, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:45:21, time_cost(all): 11:11:04/15:04:13, loss=0.461536103335142, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=3.2890710389983635, lr=0.017462165658151355
2023-12-22 01:40:15   INFO  epoch: 10/24, acc_iter=36320, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:45:07, time_cost(all): 11:12:00/15:30:08, loss=0.461339812015799, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=0.8370131194491566, lr=0.01743963925720945
2023-12-22 01:41:11   INFO  epoch: 10/24, acc_iter=36370, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:42:06, time_cost(all): 11:12:56/14:33:43, loss=0.461143520696456, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.317434798534038, lr=0.01741711285626755
2023-12-22 01:42:07   INFO  epoch: 10/24, acc_iter=36420, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:42:43, time_cost(all): 11:13:52/15:15:28, loss=0.460947229377112, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.23(1.03), norm=4.876463343839713, lr=0.017394586455325642
2023-12-22 01:43:02   INFO  epoch: 10/24, acc_iter=36470, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:39:26, time_cost(all): 11:14:47/14:26:44, loss=0.460750938057769, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=4.84248169357109, lr=0.017372060054383742
2023-12-22 01:43:58   INFO  epoch: 10/24, acc_iter=36520, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:42:11, time_cost(all): 11:15:43/15:00:05, loss=0.460554646738425, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=2.4298115189443594, lr=0.017349533653441836
2023-12-22 01:44:54   INFO  epoch: 10/24, acc_iter=36570, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:40:15, time_cost(all): 11:16:39/14:10:09, loss=0.460358355419082, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=1.1441267696384605, lr=0.017327007252499933
2023-12-22 01:45:50   INFO  epoch: 10/24, acc_iter=36620, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:36:52, time_cost(all): 11:17:35/14:29:10, loss=0.460162064099738, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=4.968701500306903, lr=0.01730448085155803
2023-12-22 01:46:45   INFO  epoch: 10/24, acc_iter=36670, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:36:23, time_cost(all): 11:18:30/14:38:15, loss=0.459965772780395, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.15(1.03), norm=1.1188687749077284, lr=0.017281954450616123
2023-12-22 01:47:41   INFO  epoch: 10/24, acc_iter=36720, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:36:33, time_cost(all): 11:19:26/14:17:54, loss=0.459769481461051, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=2.6395950844606157, lr=0.017259428049674223
2023-12-22 01:48:37   INFO  epoch: 10/24, acc_iter=36770, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:37, time_cost(all): 11:20:22/14:56:09, loss=0.459573190141708, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=0.7163899035207612, lr=0.017236901648732317
2023-12-22 01:49:33   INFO  epoch: 10/24, acc_iter=36820, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:34:09, time_cost(all): 11:21:18/14:15:37, loss=0.459376898822365, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=2.920042861178781, lr=0.017214375247790414
2023-12-22 01:50:29   INFO  epoch: 10/24, acc_iter=36870, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:33:33, time_cost(all): 11:22:14/15:21:10, loss=0.459180607503021, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=1.7021133841733787, lr=0.01719184884684851
2023-12-22 01:51:24   INFO  epoch: 10/24, acc_iter=36920, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:34:18, time_cost(all): 11:23:09/14:11:51, loss=0.458984316183678, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=0.800379808885058, lr=0.017169322445906604
2023-12-22 01:52:20   INFO  epoch: 10/24, acc_iter=36970, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:30:44, time_cost(all): 11:24:05/14:30:24, loss=0.458788024864334, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=3.6060204487992213, lr=0.0171467960449647
2023-12-22 01:53:16   INFO  epoch: 10/24, acc_iter=37020, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:29:31, time_cost(all): 11:25:01/14:54:55, loss=0.458591733544991, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=2.011288709806849, lr=0.017124269644022798
2023-12-22 01:54:12   INFO  epoch: 10/24, acc_iter=37070, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:30:57, time_cost(all): 11:25:57/14:32:29, loss=0.458395442225647, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=1.1215597476899608, lr=0.01710174324308089
2023-12-22 01:55:07   INFO  epoch: 10/24, acc_iter=37120, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:28:20, time_cost(all): 11:26:52/15:04:41, loss=0.458199150906304, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=3.9650868045959538, lr=0.01707921684213899
2023-12-22 01:56:03   INFO  epoch: 10/24, acc_iter=37170, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:26:54, time_cost(all): 11:27:48/14:44:51, loss=0.458002859586961, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=2.2073038618555016, lr=0.017056690441197085
2023-12-22 01:56:59   INFO  epoch: 10/24, acc_iter=37220, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:28:25, time_cost(all): 11:28:44/15:10:44, loss=0.457806568267617, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=3.264259769235509, lr=0.017034164040255182
2023-12-22 01:57:55   INFO  epoch: 10/24, acc_iter=37270, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:30, time_cost(all): 11:29:40/15:18:47, loss=0.457610276948274, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=0.6661827580008153, lr=0.01701163763931328
2023-12-22 01:58:50   INFO  epoch: 10/24, acc_iter=37320, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:26:33, time_cost(all): 11:30:35/14:28:30, loss=0.45741398562893, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=3.6456599354870827, lr=0.016989111238371372
2023-12-22 01:59:46   INFO  epoch: 10/24, acc_iter=37370, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:49, time_cost(all): 11:31:31/14:59:19, loss=0.457217694309587, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=4.104659397244307, lr=0.01696658483742947
2023-12-22 02:00:42   INFO  epoch: 10/24, acc_iter=37420, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:23:08, time_cost(all): 11:32:27/14:28:20, loss=0.457021402990243, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=0.5962008353135075, lr=0.016944058436487566
2023-12-22 02:01:38   INFO  epoch: 10/24, acc_iter=37470, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:21:45, time_cost(all): 11:33:23/14:24:57, loss=0.4568251116709, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.23(1.03), norm=2.535512474578952, lr=0.01692153203554566
2023-12-22 02:02:34   INFO  epoch: 10/24, acc_iter=37520, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:22:44, time_cost(all): 11:34:19/14:58:03, loss=0.456628820351557, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=3.2037818985531774, lr=0.01689900563460376
2023-12-22 02:03:29   INFO  epoch: 10/24, acc_iter=37570, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:59, time_cost(all): 11:35:14/14:19:42, loss=0.456432529032213, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=1.6629985450385938, lr=0.016876479233661854
2023-12-22 02:04:25   INFO  epoch: 10/24, acc_iter=37620, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:09, time_cost(all): 11:36:10/14:06:51, loss=0.45623623771287, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=0.8934301595868868, lr=0.01685395283271995
2023-12-22 02:05:21   INFO  epoch: 10/24, acc_iter=37670, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:10, time_cost(all): 11:37:06/14:14:20, loss=0.456039946393526, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=4.215016504377157, lr=0.016831426431778047
2023-12-22 02:06:17   INFO  epoch: 10/24, acc_iter=37720, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:25, time_cost(all): 11:38:02/14:03:47, loss=0.455843655074183, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=1.387086415169865, lr=0.01680890003083614
2023-12-22 02:07:12   INFO  epoch: 10/24, acc_iter=37770, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:14, time_cost(all): 11:38:57/15:09:52, loss=0.455647363754839, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=1.768472300261625, lr=0.016786373629894238
2023-12-22 02:08:08   INFO  epoch: 10/24, acc_iter=37820, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:00, time_cost(all): 11:39:53/13:52:18, loss=0.455451072435496, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=0.8399550136465552, lr=0.016763847228952335
2023-12-22 02:09:04   INFO  epoch: 10/24, acc_iter=37870, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:15, time_cost(all): 11:40:49/14:39:48, loss=0.455254781116153, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=4.747581827819932, lr=0.016741320828010428
2023-12-22 02:10:00   INFO  epoch: 10/24, acc_iter=37920, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:39, time_cost(all): 11:41:45/13:57:14, loss=0.455058489796809, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=0.7629908493600706, lr=0.01671879442706853
2023-12-22 02:10:55   INFO  epoch: 10/24, acc_iter=37970, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:12:42, time_cost(all): 11:42:40/14:43:31, loss=0.454862198477466, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=2.5803326222795806, lr=0.016696268026126622
2023-12-22 02:11:51   INFO  epoch: 10/24, acc_iter=38020, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:10, time_cost(all): 11:43:36/13:56:54, loss=0.454665907158122, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=3.751011032971382, lr=0.01667374162518472
2023-12-22 02:12:47   INFO  epoch: 10/24, acc_iter=38070, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:47, time_cost(all): 11:44:32/13:54:17, loss=0.454469615838779, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=3.960581569133773, lr=0.016651215224242816
2023-12-22 02:13:43   INFO  epoch: 10/24, acc_iter=38120, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:06, time_cost(all): 11:45:28/14:17:17, loss=0.454273324519435, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=4.395396261816217, lr=0.01662868882330091
2023-12-22 02:14:38   INFO  epoch: 10/24, acc_iter=38170, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:49, time_cost(all): 11:46:23/14:31:06, loss=0.454077033200092, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=0.8033202451165048, lr=0.016606162422359006
2023-12-22 02:15:34   INFO  epoch: 10/24, acc_iter=38220, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:20, time_cost(all): 11:47:19/14:29:48, loss=0.453880741880749, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=0.8253654945442978, lr=0.016583636021417103
2023-12-22 02:16:30   INFO  epoch: 10/24, acc_iter=38270, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:30, time_cost(all): 11:48:15/14:47:50, loss=0.453684450561405, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=2.666027848729412, lr=0.016561109620475196
2023-12-22 02:17:26   INFO  epoch: 10/24, acc_iter=38320, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:07:04, time_cost(all): 11:49:11/13:55:35, loss=0.453488159242062, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=4.301809068738107, lr=0.016538583219533297
2023-12-22 02:18:22   INFO  epoch: 10/24, acc_iter=38370, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:06:00, time_cost(all): 11:50:07/14:09:30, loss=0.453291867922718, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=4.576406614166967, lr=0.01651605681859139
2023-12-22 02:19:17   INFO  epoch: 10/24, acc_iter=38420, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:50, time_cost(all): 11:51:02/13:48:49, loss=0.453095576603375, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=2.51998342290468, lr=0.016493530417649487
2023-12-22 02:20:13   INFO  epoch: 10/24, acc_iter=38470, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:13, time_cost(all): 11:51:58/14:38:18, loss=0.452899285284031, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=0.9747467981649021, lr=0.016471004016707584
2023-12-22 02:21:09   INFO  epoch: 10/24, acc_iter=38520, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:09, time_cost(all): 11:52:54/14:44:42, loss=0.452702993964688, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=4.91354295008481, lr=0.016448477615765678
2023-12-22 02:22:05   INFO  epoch: 10/24, acc_iter=38570, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:09, time_cost(all): 11:53:50/14:12:37, loss=0.452506702645344, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=2.4348528144655623, lr=0.016425951214823774
2023-12-22 02:23:00   INFO  epoch: 10/24, acc_iter=38620, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:11, time_cost(all): 11:54:45/14:49:52, loss=0.452310411326001, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=1.5171360183043108, lr=0.01640342481388187
2023-12-22 02:23:56   INFO  epoch: 10/24, acc_iter=38670, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 11:55:41/14:06:50, loss=0.452114120006658, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=3.7488051243901874, lr=0.01638089841293997
2023-12-22 02:24:52   INFO  epoch: 11/24, acc_iter=38737, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:01:57, time_cost(all): 11:56:37/13:44:59, loss=0.451851089638737, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=1.2896272243575804, lr=0.016350713035677818
2023-12-22 02:25:48   INFO  epoch: 11/24, acc_iter=38787, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:03:08, time_cost(all): 11:57:33/14:41:32, loss=0.451654798319394, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=4.737075282071131, lr=0.01632818663473591
2023-12-22 02:26:43   INFO  epoch: 11/24, acc_iter=38837, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:04:32, time_cost(all): 11:58:28/13:48:01, loss=0.451458507000051, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.16(1.03), norm=2.373343166246018, lr=0.01630566023379401
2023-12-22 02:27:39   INFO  epoch: 11/24, acc_iter=38887, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:03:33, time_cost(all): 11:59:24/13:59:49, loss=0.451262215680707, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=3.607233871187013, lr=0.016283133832852105
2023-12-22 02:28:35   INFO  epoch: 11/24, acc_iter=38937, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:03:44, time_cost(all): 12:00:20/14:19:57, loss=0.451065924361364, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=4.856608776215535, lr=0.0162606074319102
2023-12-22 02:29:31   INFO  epoch: 11/24, acc_iter=38987, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:58:34, time_cost(all): 12:01:16/14:00:36, loss=0.45086963304202, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=3.0898488500685093, lr=0.0162380810309683
2023-12-22 02:30:27   INFO  epoch: 11/24, acc_iter=39037, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/1:00:05, time_cost(all): 12:02:12/14:29:39, loss=0.450673341722677, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=2.6700486151907694, lr=0.016215554630026392
2023-12-22 02:31:22   INFO  epoch: 11/24, acc_iter=39087, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:55:11, time_cost(all): 12:03:07/14:35:31, loss=0.450477050403333, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=1.2463602949215111, lr=0.016193028229084493
2023-12-22 02:32:18   INFO  epoch: 11/24, acc_iter=39137, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:59:44, time_cost(all): 12:04:03/14:33:59, loss=0.45028075908399, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=3.0377663656436296, lr=0.016170501828142586
2023-12-22 02:33:14   INFO  epoch: 11/24, acc_iter=39187, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:55:27, time_cost(all): 12:04:59/14:46:07, loss=0.450084467764647, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=4.511658888434492, lr=0.01614797542720068
2023-12-22 02:34:10   INFO  epoch: 11/24, acc_iter=39237, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:57:37, time_cost(all): 12:05:55/13:44:21, loss=0.449888176445303, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=4.564084362503796, lr=0.01612544902625878
2023-12-22 02:35:05   INFO  epoch: 11/24, acc_iter=39287, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:23, time_cost(all): 12:06:50/14:27:48, loss=0.44969188512596, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=4.029128862586546, lr=0.016102922625316873
2023-12-22 02:36:01   INFO  epoch: 11/24, acc_iter=39337, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:54:26, time_cost(all): 12:07:46/13:37:41, loss=0.449495593806616, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=3.947598834587685, lr=0.016080396224374967
2023-12-22 02:36:57   INFO  epoch: 11/24, acc_iter=39387, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:51:44, time_cost(all): 12:08:42/13:51:57, loss=0.449299302487273, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=0.8074785916673515, lr=0.016057869823433067
2023-12-22 02:37:53   INFO  epoch: 11/24, acc_iter=39437, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:53:06, time_cost(all): 12:09:38/14:38:44, loss=0.449103011167929, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=4.942811238309799, lr=0.01603534342249116
2023-12-22 02:38:48   INFO  epoch: 11/24, acc_iter=39487, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:51:46, time_cost(all): 12:10:33/14:16:58, loss=0.448906719848586, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=1.3407153698505279, lr=0.01601281702154926
2023-12-22 02:39:44   INFO  epoch: 11/24, acc_iter=39537, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:47:42, time_cost(all): 12:11:29/13:19:27, loss=0.448710428529242, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=2.184365226392718, lr=0.015990290620607354
2023-12-22 02:40:40   INFO  epoch: 11/24, acc_iter=39587, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:33, time_cost(all): 12:12:25/13:59:44, loss=0.448514137209899, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=1.0481435786821405, lr=0.015967764219665448
2023-12-22 02:41:36   INFO  epoch: 11/24, acc_iter=39637, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:50:01, time_cost(all): 12:13:21/13:17:32, loss=0.448317845890556, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=2.813222380905258, lr=0.01594523781872355
2023-12-22 02:42:32   INFO  epoch: 11/24, acc_iter=39687, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:48:18, time_cost(all): 12:14:17/14:05:34, loss=0.448121554571212, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=3.169186470711415, lr=0.01592271141778164
2023-12-22 02:43:27   INFO  epoch: 11/24, acc_iter=39737, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:43:41, time_cost(all): 12:15:12/13:39:18, loss=0.447925263251869, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=1.3164839314196422, lr=0.015900185016839735
2023-12-22 02:44:23   INFO  epoch: 11/24, acc_iter=39787, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:47:08, time_cost(all): 12:16:08/13:42:18, loss=0.447728971932525, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=1.368145792586401, lr=0.015877658615897836
2023-12-22 02:45:19   INFO  epoch: 11/24, acc_iter=39837, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:43:57, time_cost(all): 12:17:04/13:53:51, loss=0.447532680613182, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=1.4806759978274824, lr=0.01585513221495593
2023-12-22 02:46:15   INFO  epoch: 11/24, acc_iter=39887, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:42:00, time_cost(all): 12:18:00/13:45:00, loss=0.447336389293838, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=1.2843330483463467, lr=0.01583260581401403
2023-12-22 02:47:10   INFO  epoch: 11/24, acc_iter=39937, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:07, time_cost(all): 12:18:55/13:11:26, loss=0.447140097974495, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=1.453824157432534, lr=0.015810079413072123
2023-12-22 02:48:06   INFO  epoch: 11/24, acc_iter=39987, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:39:40, time_cost(all): 12:19:51/13:35:56, loss=0.446943806655152, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=3.497567218157407, lr=0.015787553012130216
2023-12-22 02:49:02   INFO  epoch: 11/24, acc_iter=40037, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:39:40, time_cost(all): 12:20:47/14:19:40, loss=0.446747515335808, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=0.5378434706844035, lr=0.015765026611188317
2023-12-22 02:49:58   INFO  epoch: 11/24, acc_iter=40087, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:21, time_cost(all): 12:21:43/14:24:41, loss=0.446551224016465, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=2.795620567866056, lr=0.01574250021024641
2023-12-22 02:50:53   INFO  epoch: 11/24, acc_iter=40137, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:38:29, time_cost(all): 12:22:38/14:12:47, loss=0.446354932697121, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=3.9256813459175963, lr=0.015719973809304504
2023-12-22 02:51:49   INFO  epoch: 11/24, acc_iter=40187, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:36:16, time_cost(all): 12:23:34/13:54:57, loss=0.446158641377778, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=3.880698883103866, lr=0.015697447408362604
2023-12-22 02:52:45   INFO  epoch: 11/24, acc_iter=40237, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:37:23, time_cost(all): 12:24:30/13:56:03, loss=0.445962350058434, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=0.7897999937169919, lr=0.015674921007420697
2023-12-22 02:53:41   INFO  epoch: 11/24, acc_iter=40287, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:01, time_cost(all): 12:25:26/14:21:01, loss=0.445766058739091, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=1.0886392847263466, lr=0.015652394606478798
2023-12-22 02:54:37   INFO  epoch: 11/24, acc_iter=40337, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:35:28, time_cost(all): 12:26:22/13:38:39, loss=0.445569767419748, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=4.247455229792024, lr=0.01562986820553689
2023-12-22 02:55:32   INFO  epoch: 11/24, acc_iter=40387, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:33:46, time_cost(all): 12:27:17/14:22:17, loss=0.445373476100404, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=1.7112156712858542, lr=0.015607341804594986
2023-12-22 02:56:28   INFO  epoch: 11/24, acc_iter=40437, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:15, time_cost(all): 12:28:13/14:16:02, loss=0.445177184781061, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=2.5578743602859815, lr=0.015584815403653083
2023-12-22 02:57:24   INFO  epoch: 11/24, acc_iter=40487, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:31:33, time_cost(all): 12:29:09/14:00:25, loss=0.444980893461717, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=3.861767929110409, lr=0.01556228900271118
2023-12-22 02:58:20   INFO  epoch: 11/24, acc_iter=40537, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:32:10, time_cost(all): 12:30:05/13:41:37, loss=0.444784602142374, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=4.947892940416484, lr=0.015539762601769274
2023-12-22 02:59:15   INFO  epoch: 11/24, acc_iter=40587, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:30, time_cost(all): 12:31:00/13:08:56, loss=0.44458831082303, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=0.610591505296415, lr=0.01551723620082737
2023-12-22 03:00:11   INFO  epoch: 11/24, acc_iter=40637, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:11, time_cost(all): 12:31:56/13:05:48, loss=0.444392019503687, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=4.545491720859674, lr=0.015494709799885467
2023-12-22 03:01:07   INFO  epoch: 11/24, acc_iter=40687, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:45, time_cost(all): 12:32:52/13:45:41, loss=0.444195728184343, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=0.6911773617321808, lr=0.015472183398943564
2023-12-22 03:02:03   INFO  epoch: 11/24, acc_iter=40737, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:28, time_cost(all): 12:33:48/12:56:28, loss=0.443999436865, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.7553629084973004, lr=0.015449656998001661
2023-12-22 03:02:58   INFO  epoch: 11/24, acc_iter=40787, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:25:06, time_cost(all): 12:34:43/13:27:10, loss=0.443803145545657, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=3.8121349068599963, lr=0.015427130597059755
2023-12-22 03:03:54   INFO  epoch: 11/24, acc_iter=40837, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:26:21, time_cost(all): 12:35:39/13:28:20, loss=0.443606854226313, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=1.7897783379913013, lr=0.015404604196117852
2023-12-22 03:04:50   INFO  epoch: 11/24, acc_iter=40887, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:25:11, time_cost(all): 12:36:35/13:46:21, loss=0.44341056290697, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=1.2696392417364246, lr=0.015382077795175949
2023-12-22 03:05:46   INFO  epoch: 11/24, acc_iter=40937, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:40, time_cost(all): 12:37:31/13:35:54, loss=0.443214271587626, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=3.6581829158393115, lr=0.015359551394234046
2023-12-22 03:06:42   INFO  epoch: 11/24, acc_iter=40987, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:42, time_cost(all): 12:38:27/13:34:29, loss=0.443017980268283, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=4.100286986735128, lr=0.015337024993292139
2023-12-22 03:07:37   INFO  epoch: 11/24, acc_iter=41037, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:14, time_cost(all): 12:39:22/13:12:33, loss=0.442821688948939, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=2.8098262408248704, lr=0.015314498592350236
2023-12-22 03:08:33   INFO  epoch: 11/24, acc_iter=41087, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:32, time_cost(all): 12:40:18/12:49:59, loss=0.442625397629596, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=3.9612877565990936, lr=0.015291972191408333
2023-12-22 03:09:29   INFO  epoch: 11/24, acc_iter=41137, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:03, time_cost(all): 12:41:14/13:21:08, loss=0.442429106310253, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=4.986919638180638, lr=0.01526944579046643
2023-12-22 03:10:25   INFO  epoch: 11/24, acc_iter=41187, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:15, time_cost(all): 12:42:10/13:38:10, loss=0.442232814990909, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=4.53339677931179, lr=0.015246919389524525
2023-12-22 03:11:20   INFO  epoch: 11/24, acc_iter=41237, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:25, time_cost(all): 12:43:05/13:32:32, loss=0.442036523671566, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=2.023891565710096, lr=0.015224392988582622
2023-12-22 03:12:16   INFO  epoch: 11/24, acc_iter=41287, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:12, time_cost(all): 12:44:01/13:49:29, loss=0.441840232352222, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=4.749965379039395, lr=0.015201866587640717
2023-12-22 03:13:12   INFO  epoch: 11/24, acc_iter=41337, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:01, time_cost(all): 12:44:57/14:03:56, loss=0.441643941032879, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=2.583625821937179, lr=0.015179340186698814
2023-12-22 03:14:08   INFO  epoch: 11/24, acc_iter=41387, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:39, time_cost(all): 12:45:53/12:55:16, loss=0.441447649713535, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=3.6456343548500594, lr=0.015156813785756909
2023-12-22 03:15:03   INFO  epoch: 11/24, acc_iter=41437, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:44, time_cost(all): 12:46:48/13:22:41, loss=0.441251358394192, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=4.169886947399908, lr=0.015134287384815006
2023-12-22 03:15:59   INFO  epoch: 11/24, acc_iter=41487, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:00, time_cost(all): 12:47:44/13:52:25, loss=0.441055067074849, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=1.9686745227840685, lr=0.015111760983873103
2023-12-22 03:16:55   INFO  epoch: 11/24, acc_iter=41537, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:11:55, time_cost(all): 12:48:40/12:58:34, loss=0.440858775755505, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=3.914879332839768, lr=0.0150892345829312
2023-12-22 03:17:51   INFO  epoch: 11/24, acc_iter=41587, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:15, time_cost(all): 12:49:36/12:45:21, loss=0.440662484436162, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=4.5805159627913, lr=0.015066708181989293
2023-12-22 03:18:47   INFO  epoch: 11/24, acc_iter=41637, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:11, time_cost(all): 12:50:32/13:05:09, loss=0.440466193116818, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=1.9585491349095296, lr=0.01504418178104739
2023-12-22 03:19:42   INFO  epoch: 11/24, acc_iter=41687, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:10:00, time_cost(all): 12:51:27/13:45:29, loss=0.440269901797475, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=4.8673414428821875, lr=0.015021655380105487
2023-12-22 03:20:38   INFO  epoch: 11/24, acc_iter=41737, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:57, time_cost(all): 12:52:23/13:16:03, loss=0.440073610478131, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=1.6351336310040658, lr=0.014999128979163584
2023-12-22 03:21:34   INFO  epoch: 11/24, acc_iter=41787, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:45, time_cost(all): 12:53:19/13:50:08, loss=0.439877319158788, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=0.9437150593869965, lr=0.014976602578221677
2023-12-22 03:22:30   INFO  epoch: 11/24, acc_iter=41837, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:40, time_cost(all): 12:54:15/13:15:48, loss=0.439681027839445, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=3.462928975848827, lr=0.014954076177279774
2023-12-22 03:23:25   INFO  epoch: 11/24, acc_iter=41887, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:06:09, time_cost(all): 12:55:10/13:04:06, loss=0.439484736520101, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=4.042476585240479, lr=0.014931549776337871
2023-12-22 03:24:21   INFO  epoch: 11/24, acc_iter=41937, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:53, time_cost(all): 12:56:06/13:27:39, loss=0.439288445200758, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=3.271416435885062, lr=0.014909023375395968
2023-12-22 03:25:17   INFO  epoch: 11/24, acc_iter=41987, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:54, time_cost(all): 12:57:02/12:46:42, loss=0.439092153881414, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=0.5524831884728143, lr=0.014886496974454062
2023-12-22 03:26:13   INFO  epoch: 11/24, acc_iter=42037, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:02:59, time_cost(all): 12:57:58/13:46:51, loss=0.438895862562071, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=0.9618733685503533, lr=0.014863970573512159
2023-12-22 03:27:08   INFO  epoch: 11/24, acc_iter=42087, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:08, time_cost(all): 12:58:53/12:40:39, loss=0.438699571242727, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=2.0093579711488996, lr=0.014841444172570255
2023-12-22 03:28:04   INFO  epoch: 11/24, acc_iter=42137, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:16, time_cost(all): 12:59:49/13:01:14, loss=0.438503279923384, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=3.1298900081891565, lr=0.014818917771628352
2023-12-22 03:29:00   INFO  epoch: 11/24, acc_iter=42187, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 13:00:45/13:31:32, loss=0.438306988604041, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=1.0530641253574613, lr=0.014796391370686446
2023-12-22 03:29:56   INFO  epoch: 12/24, acc_iter=42254, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:02:27, time_cost(all): 13:01:41/12:56:41, loss=0.43804395823612, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=2.1809630321304274, lr=0.014766205993424295
2023-12-22 03:30:52   INFO  epoch: 12/24, acc_iter=42304, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:04:02, time_cost(all): 13:02:37/13:01:33, loss=0.437847666916777, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=4.564245698869386, lr=0.014743679592482392
2023-12-22 03:31:47   INFO  epoch: 12/24, acc_iter=42354, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:05:02, time_cost(all): 13:03:32/12:27:52, loss=0.437651375597434, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=4.98203650716772, lr=0.014721153191540489
2023-12-22 03:32:43   INFO  epoch: 12/24, acc_iter=42404, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:01:57, time_cost(all): 13:04:28/13:21:28, loss=0.43745508427809, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=2.2772398760280463, lr=0.014698626790598582
2023-12-22 03:33:39   INFO  epoch: 12/24, acc_iter=42454, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:00:11, time_cost(all): 13:05:24/12:45:09, loss=0.437258792958747, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=1.6344716006277025, lr=0.01467610038965668
2023-12-22 03:34:35   INFO  epoch: 12/24, acc_iter=42504, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:00:19, time_cost(all): 13:06:20/13:26:09, loss=0.437062501639403, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=3.352728786793916, lr=0.014653573988714776
2023-12-22 03:35:30   INFO  epoch: 12/24, acc_iter=42554, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:56:02, time_cost(all): 13:07:15/12:30:55, loss=0.43686621032006, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=4.947658522922061, lr=0.014631047587772873
2023-12-22 03:36:26   INFO  epoch: 12/24, acc_iter=42604, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:58:40, time_cost(all): 13:08:11/13:25:33, loss=0.436669919000716, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=0.5199947680063544, lr=0.014608521186830967
2023-12-22 03:37:22   INFO  epoch: 12/24, acc_iter=42654, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:57:39, time_cost(all): 13:09:07/12:22:20, loss=0.436473627681373, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=4.7665020403019955, lr=0.014585994785889064
2023-12-22 03:38:18   INFO  epoch: 12/24, acc_iter=42704, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:56:09, time_cost(all): 13:10:03/13:37:18, loss=0.436277336362029, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=4.713585040649265, lr=0.01456346838494716
2023-12-22 03:39:13   INFO  epoch: 12/24, acc_iter=42754, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:56:06, time_cost(all): 13:10:58/12:40:37, loss=0.436081045042686, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=2.8867111301216473, lr=0.014540941984005257
2023-12-22 03:40:09   INFO  epoch: 12/24, acc_iter=42804, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:54:53, time_cost(all): 13:11:54/13:29:25, loss=0.435884753723343, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=2.9987015157344556, lr=0.01451841558306335
2023-12-22 03:41:05   INFO  epoch: 12/24, acc_iter=42854, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:22, time_cost(all): 13:12:50/13:15:19, loss=0.435688462403999, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=1.6608730710729231, lr=0.014495889182121448
2023-12-22 03:42:01   INFO  epoch: 12/24, acc_iter=42904, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:53:03, time_cost(all): 13:13:46/12:57:21, loss=0.435492171084656, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=2.40596501010371, lr=0.014473362781179545
2023-12-22 03:42:56   INFO  epoch: 12/24, acc_iter=42954, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:50:37, time_cost(all): 13:14:41/13:02:57, loss=0.435295879765312, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.5322123620996395, lr=0.014450836380237642
2023-12-22 03:43:52   INFO  epoch: 12/24, acc_iter=43004, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:51:56, time_cost(all): 13:15:37/12:40:56, loss=0.435099588445969, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=4.670935176470212, lr=0.014428309979295735
2023-12-22 03:44:48   INFO  epoch: 12/24, acc_iter=43054, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:47:53, time_cost(all): 13:16:33/12:17:56, loss=0.434903297126625, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=4.087355414248219, lr=0.014405783578353832
2023-12-22 03:45:44   INFO  epoch: 12/24, acc_iter=43104, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:35, time_cost(all): 13:17:29/12:23:08, loss=0.434707005807282, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=4.390870194342092, lr=0.014383257177411929
2023-12-22 03:46:40   INFO  epoch: 12/24, acc_iter=43154, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:48:18, time_cost(all): 13:18:25/13:27:32, loss=0.434510714487939, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=1.072421515607614, lr=0.014360730776470026
2023-12-22 03:47:35   INFO  epoch: 12/24, acc_iter=43204, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:48:50, time_cost(all): 13:19:20/13:07:12, loss=0.434314423168595, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=2.528498345615077, lr=0.01433820437552812
2023-12-22 03:48:31   INFO  epoch: 12/24, acc_iter=43254, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:46:40, time_cost(all): 13:20:16/12:22:07, loss=0.434118131849252, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=3.7858143845555863, lr=0.014315677974586216
2023-12-22 03:49:27   INFO  epoch: 12/24, acc_iter=43304, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:45:33, time_cost(all): 13:21:12/12:19:24, loss=0.433921840529908, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=1.0188936651957214, lr=0.014293151573644313
2023-12-22 03:50:23   INFO  epoch: 12/24, acc_iter=43354, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:43:42, time_cost(all): 13:22:08/13:02:09, loss=0.433725549210565, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=4.381316008413502, lr=0.01427062517270241
2023-12-22 03:51:18   INFO  epoch: 12/24, acc_iter=43404, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:41:34, time_cost(all): 13:23:03/12:57:58, loss=0.433529257891221, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.8190532437271303, lr=0.014248098771760503
2023-12-22 03:52:14   INFO  epoch: 12/24, acc_iter=43454, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:42:59, time_cost(all): 13:23:59/13:21:49, loss=0.433332966571878, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=4.3013942422979, lr=0.0142255723708186
2023-12-22 03:53:10   INFO  epoch: 12/24, acc_iter=43504, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:42:13, time_cost(all): 13:24:55/12:11:52, loss=0.433136675252535, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=1.5140355354132657, lr=0.014203045969876697
2023-12-22 03:54:06   INFO  epoch: 12/24, acc_iter=43554, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:41:09, time_cost(all): 13:25:51/12:47:17, loss=0.432940383933191, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=1.3649536008212826, lr=0.014180519568934794
2023-12-22 03:55:01   INFO  epoch: 12/24, acc_iter=43604, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:07, time_cost(all): 13:26:46/12:04:25, loss=0.432744092613848, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=1.8692480912551812, lr=0.014157993167992888
2023-12-22 03:55:57   INFO  epoch: 12/24, acc_iter=43654, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:39:14, time_cost(all): 13:27:42/12:49:17, loss=0.432547801294504, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=2.3642498199862496, lr=0.014135466767050985
2023-12-22 03:56:53   INFO  epoch: 12/24, acc_iter=43704, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:43, time_cost(all): 13:28:38/12:45:20, loss=0.432351509975161, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=4.966108334706167, lr=0.014112940366109081
2023-12-22 03:57:49   INFO  epoch: 12/24, acc_iter=43754, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:35:01, time_cost(all): 13:29:34/12:28:51, loss=0.432155218655817, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=1.9869963046657748, lr=0.014090413965167178
2023-12-22 03:58:45   INFO  epoch: 12/24, acc_iter=43804, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:35:29, time_cost(all): 13:30:30/13:01:30, loss=0.431958927336474, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=3.151616895520015, lr=0.014067887564225272
2023-12-22 03:59:40   INFO  epoch: 12/24, acc_iter=43854, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:33:16, time_cost(all): 13:31:25/12:37:21, loss=0.43176263601713, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=4.251819451175643, lr=0.014045361163283369
2023-12-22 04:00:36   INFO  epoch: 12/24, acc_iter=43904, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:14, time_cost(all): 13:32:21/12:05:51, loss=0.431566344697787, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=2.400277520364378, lr=0.014022834762341466
2023-12-22 04:01:32   INFO  epoch: 12/24, acc_iter=43954, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:39, time_cost(all): 13:33:17/13:07:22, loss=0.431370053378444, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=0.5642487223824579, lr=0.014000308361399563
2023-12-22 04:02:28   INFO  epoch: 12/24, acc_iter=44004, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:30:25, time_cost(all): 13:34:13/12:58:20, loss=0.4311737620591, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=2.434076103333301, lr=0.013977781960457656
2023-12-22 04:03:23   INFO  epoch: 12/24, acc_iter=44054, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:08, time_cost(all): 13:35:08/12:36:03, loss=0.430977470739757, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=1.1244874825810922, lr=0.013955255559515753
2023-12-22 04:04:19   INFO  epoch: 12/24, acc_iter=44104, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:48, time_cost(all): 13:36:04/13:07:47, loss=0.430781179420413, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=0.8779387163587871, lr=0.01393272915857385
2023-12-22 04:05:15   INFO  epoch: 12/24, acc_iter=44154, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:36, time_cost(all): 13:37:00/12:46:31, loss=0.43058488810107, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=1.0387181919621302, lr=0.013910202757631947
2023-12-22 04:06:11   INFO  epoch: 12/24, acc_iter=44204, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:29:31, time_cost(all): 13:37:56/13:08:42, loss=0.430388596781726, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=1.9470265231285566, lr=0.01388767635669004
2023-12-22 04:07:06   INFO  epoch: 12/24, acc_iter=44254, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:18, time_cost(all): 13:38:51/12:39:18, loss=0.430192305462383, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=4.55827274860989, lr=0.013865149955748137
2023-12-22 04:08:02   INFO  epoch: 12/24, acc_iter=44304, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:25:56, time_cost(all): 13:39:47/13:00:13, loss=0.42999601414304, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=4.458312590884762, lr=0.013842623554806234
2023-12-22 04:08:58   INFO  epoch: 12/24, acc_iter=44354, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:25:16, time_cost(all): 13:40:43/12:31:27, loss=0.429799722823696, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=4.635976508287248, lr=0.013820097153864331
2023-12-22 04:09:54   INFO  epoch: 12/24, acc_iter=44404, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:25:29, time_cost(all): 13:41:39/12:10:51, loss=0.429603431504353, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=2.047099715831079, lr=0.013797570752922428
2023-12-22 04:10:50   INFO  epoch: 12/24, acc_iter=44454, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:22:35, time_cost(all): 13:42:35/12:04:15, loss=0.429407140185009, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=0.6516234386243418, lr=0.013775044351980521
2023-12-22 04:11:45   INFO  epoch: 12/24, acc_iter=44504, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:55, time_cost(all): 13:43:30/12:24:14, loss=0.429210848865666, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=4.04568055551429, lr=0.013752517951038618
2023-12-22 04:12:41   INFO  epoch: 12/24, acc_iter=44554, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:22:38, time_cost(all): 13:44:26/12:26:48, loss=0.429014557546322, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.201149239798059, lr=0.013729991550096715
2023-12-22 04:13:37   INFO  epoch: 12/24, acc_iter=44604, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:23, time_cost(all): 13:45:22/12:49:35, loss=0.428818266226979, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=2.8005533167572008, lr=0.013707465149154812
2023-12-22 04:14:33   INFO  epoch: 12/24, acc_iter=44654, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:31, time_cost(all): 13:46:18/11:45:54, loss=0.428621974907636, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=2.303449990572492, lr=0.013684938748212909
2023-12-22 04:15:28   INFO  epoch: 12/24, acc_iter=44704, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:33, time_cost(all): 13:47:13/12:53:07, loss=0.428425683588292, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=4.689244845914644, lr=0.013662412347271006
2023-12-22 04:16:24   INFO  epoch: 12/24, acc_iter=44754, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:20, time_cost(all): 13:48:09/12:46:30, loss=0.428229392268949, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=4.689211343619129, lr=0.0136398859463291
2023-12-22 04:17:20   INFO  epoch: 12/24, acc_iter=44804, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:31, time_cost(all): 13:49:05/12:25:47, loss=0.428033100949605, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=2.346801492927103, lr=0.013617359545387196
2023-12-22 04:18:16   INFO  epoch: 12/24, acc_iter=44854, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:41, time_cost(all): 13:50:01/12:16:25, loss=0.427836809630262, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=1.2824100644439125, lr=0.013594833144445293
2023-12-22 04:19:11   INFO  epoch: 12/24, acc_iter=44904, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:14:54, time_cost(all): 13:50:56/12:30:46, loss=0.427640518310918, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=2.6713244623770436, lr=0.01357230674350339
2023-12-22 04:20:07   INFO  epoch: 12/24, acc_iter=44954, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:13:52, time_cost(all): 13:51:52/11:44:26, loss=0.427444226991575, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=0.8937352263387068, lr=0.013549780342561487
2023-12-22 04:21:03   INFO  epoch: 12/24, acc_iter=45004, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:43, time_cost(all): 13:52:48/12:42:17, loss=0.427247935672232, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=3.074785622308556, lr=0.01352725394161958
2023-12-22 04:21:59   INFO  epoch: 12/24, acc_iter=45054, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:13:00, time_cost(all): 13:53:44/12:36:51, loss=0.427051644352888, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=4.748400054632043, lr=0.013504727540677677
2023-12-22 04:22:55   INFO  epoch: 12/24, acc_iter=45104, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:38, time_cost(all): 13:54:40/12:43:20, loss=0.426855353033545, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=2.1827049548377127, lr=0.013482201139735774
2023-12-22 04:23:50   INFO  epoch: 12/24, acc_iter=45154, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:08, time_cost(all): 13:55:35/12:45:37, loss=0.426659061714201, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=1.2454735792054956, lr=0.013459674738793871
2023-12-22 04:24:46   INFO  epoch: 12/24, acc_iter=45204, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:28, time_cost(all): 13:56:31/12:15:44, loss=0.426462770394858, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=2.417032968425069, lr=0.013437148337851968
2023-12-22 04:25:42   INFO  epoch: 12/24, acc_iter=45254, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:52, time_cost(all): 13:57:27/12:39:05, loss=0.426266479075514, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=3.5345587851736884, lr=0.013414621936910062
2023-12-22 04:26:38   INFO  epoch: 12/24, acc_iter=45304, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:51, time_cost(all): 13:58:23/11:54:04, loss=0.426070187756171, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=0.6476968151931886, lr=0.013392095535968158
2023-12-22 04:27:33   INFO  epoch: 12/24, acc_iter=45354, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:47, time_cost(all): 13:59:18/12:10:43, loss=0.425873896436827, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=1.3281596170443182, lr=0.013369569135026255
2023-12-22 04:28:29   INFO  epoch: 12/24, acc_iter=45404, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:49, time_cost(all): 14:00:14/12:22:56, loss=0.425677605117484, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=1.613524643840819, lr=0.013347042734084352
2023-12-22 04:29:25   INFO  epoch: 12/24, acc_iter=45454, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:05:06, time_cost(all): 14:01:10/11:47:36, loss=0.425481313798141, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=4.564630891635436, lr=0.013324516333142446
2023-12-22 04:30:21   INFO  epoch: 12/24, acc_iter=45504, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:06, time_cost(all): 14:02:06/11:33:30, loss=0.425285022478797, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=3.070770349714192, lr=0.013301989932200543
2023-12-22 04:31:16   INFO  epoch: 12/24, acc_iter=45554, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:02:59, time_cost(all): 14:03:01/11:42:12, loss=0.425088731159454, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=2.5782345765940695, lr=0.01327946353125864
2023-12-22 04:32:12   INFO  epoch: 12/24, acc_iter=45604, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:14, time_cost(all): 14:03:57/12:32:57, loss=0.42489243984011, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=4.311849376954859, lr=0.013256937130316736
2023-12-22 04:33:08   INFO  epoch: 12/24, acc_iter=45654, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:13, time_cost(all): 14:04:53/11:32:49, loss=0.424696148520767, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=1.4297537103570919, lr=0.01323441072937483
2023-12-22 04:34:04   INFO  epoch: 12/24, acc_iter=45704, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 14:05:49/11:53:26, loss=0.424499857201423, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=4.970254676288139, lr=0.013211884328432927
2023-12-22 04:35:00   INFO  epoch: 13/24, acc_iter=45771, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:03:30, time_cost(all): 14:06:45/12:16:27, loss=0.424236826833503, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=3.0908183795630446, lr=0.013181698951170776
2023-12-22 04:35:55   INFO  epoch: 13/24, acc_iter=45821, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:03:53, time_cost(all): 14:07:40/11:53:17, loss=0.42404053551416, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=2.951091685593001, lr=0.013159172550228873
2023-12-22 04:36:51   INFO  epoch: 13/24, acc_iter=45871, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:00:17, time_cost(all): 14:08:36/11:32:27, loss=0.423844244194816, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=0.8271597578263483, lr=0.013136646149286967
2023-12-22 04:37:47   INFO  epoch: 13/24, acc_iter=45921, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/0:59:29, time_cost(all): 14:09:32/12:25:58, loss=0.423647952875473, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=1.8825328228706601, lr=0.013114119748345063
2023-12-22 04:38:43   INFO  epoch: 13/24, acc_iter=45971, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:02:50, time_cost(all): 14:10:28/11:54:22, loss=0.42345166155613, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=2.694278420155875, lr=0.01309159334740316
2023-12-22 04:39:38   INFO  epoch: 13/24, acc_iter=46021, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:02:40, time_cost(all): 14:11:23/12:23:28, loss=0.423255370236786, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=3.7178303346946415, lr=0.013069066946461257
2023-12-22 04:40:34   INFO  epoch: 13/24, acc_iter=46071, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/1:01:03, time_cost(all): 14:12:19/12:30:29, loss=0.423059078917443, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=1.5277373698947028, lr=0.01304654054551935
2023-12-22 04:41:30   INFO  epoch: 13/24, acc_iter=46121, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:58:17, time_cost(all): 14:13:15/11:23:03, loss=0.422862787598099, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=1.1313511967308114, lr=0.013024014144577448
2023-12-22 04:42:26   INFO  epoch: 13/24, acc_iter=46171, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:56:59, time_cost(all): 14:14:11/12:17:12, loss=0.422666496278756, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=3.4140337056817196, lr=0.013001487743635545
2023-12-22 04:43:21   INFO  epoch: 13/24, acc_iter=46221, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:55:58, time_cost(all): 14:15:06/11:29:48, loss=0.422470204959412, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=4.578715458893669, lr=0.012978961342693641
2023-12-22 04:44:17   INFO  epoch: 13/24, acc_iter=46271, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:54:55, time_cost(all): 14:16:02/11:56:02, loss=0.422273913640069, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=3.668291906403924, lr=0.012956434941751735
2023-12-22 04:45:13   INFO  epoch: 13/24, acc_iter=46321, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:54:22, time_cost(all): 14:16:58/11:38:20, loss=0.422077622320726, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=3.5655551264745187, lr=0.012933908540809832
2023-12-22 04:46:09   INFO  epoch: 13/24, acc_iter=46371, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:55:21, time_cost(all): 14:17:54/12:02:29, loss=0.421881331001382, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=1.5753113970420678, lr=0.012911382139867929
2023-12-22 04:47:05   INFO  epoch: 13/24, acc_iter=46421, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:50:12, time_cost(all): 14:18:50/11:25:30, loss=0.421685039682039, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=3.744689569997496, lr=0.012888855738926026
2023-12-22 04:48:00   INFO  epoch: 13/24, acc_iter=46471, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:49:01, time_cost(all): 14:19:45/11:55:41, loss=0.421488748362695, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=0.6351354298305167, lr=0.012866329337984119
2023-12-22 04:48:56   INFO  epoch: 13/24, acc_iter=46521, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:07, time_cost(all): 14:20:41/11:43:09, loss=0.421292457043352, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=1.383224639238485, lr=0.012843802937042216
2023-12-22 04:49:52   INFO  epoch: 13/24, acc_iter=46571, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:51:20, time_cost(all): 14:21:37/12:20:43, loss=0.421096165724008, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=2.967379715976074, lr=0.012821276536100313
2023-12-22 04:50:48   INFO  epoch: 13/24, acc_iter=46621, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:47:26, time_cost(all): 14:22:33/12:06:00, loss=0.420899874404665, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=4.975207056689729, lr=0.01279875013515841
2023-12-22 04:51:43   INFO  epoch: 13/24, acc_iter=46671, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:46:16, time_cost(all): 14:23:28/11:58:39, loss=0.420703583085322, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=2.665873916544779, lr=0.012776223734216503
2023-12-22 04:52:39   INFO  epoch: 13/24, acc_iter=46721, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:45:03, time_cost(all): 14:24:24/12:16:44, loss=0.420507291765978, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=1.758792793718123, lr=0.0127536973332746
2023-12-22 04:53:35   INFO  epoch: 13/24, acc_iter=46771, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:44:42, time_cost(all): 14:25:20/11:11:59, loss=0.420311000446635, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=1.8607926993880062, lr=0.012731170932332697
2023-12-22 04:54:31   INFO  epoch: 13/24, acc_iter=46821, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:46:43, time_cost(all): 14:26:16/11:58:07, loss=0.420114709127291, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=2.2247308761653963, lr=0.012708644531390794
2023-12-22 04:55:26   INFO  epoch: 13/24, acc_iter=46871, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:44:34, time_cost(all): 14:27:11/11:29:44, loss=0.419918417807948, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=4.16114582403972, lr=0.012686118130448888
2023-12-22 04:56:22   INFO  epoch: 13/24, acc_iter=46921, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:42:13, time_cost(all): 14:28:07/11:24:21, loss=0.419722126488604, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=1.6369286227819646, lr=0.012663591729506984
2023-12-22 04:57:18   INFO  epoch: 13/24, acc_iter=46971, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:41:14, time_cost(all): 14:29:03/11:15:19, loss=0.419525835169261, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=2.746972015677293, lr=0.012641065328565081
2023-12-22 04:58:14   INFO  epoch: 13/24, acc_iter=47021, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:40:17, time_cost(all): 14:29:59/11:54:11, loss=0.419329543849917, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=3.1188937443943145, lr=0.012618538927623178
2023-12-22 04:59:10   INFO  epoch: 13/24, acc_iter=47071, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:14, time_cost(all): 14:30:55/11:17:44, loss=0.419133252530574, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=3.9573194504898095, lr=0.012596012526681272
2023-12-22 05:00:05   INFO  epoch: 13/24, acc_iter=47121, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:19, time_cost(all): 14:31:50/11:30:52, loss=0.418936961211231, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=1.1953256244211252, lr=0.012573486125739369
2023-12-22 05:01:01   INFO  epoch: 13/24, acc_iter=47171, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:37:19, time_cost(all): 14:32:46/11:50:19, loss=0.418740669891887, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=2.7479520140872413, lr=0.012550959724797466
2023-12-22 05:01:57   INFO  epoch: 13/24, acc_iter=47221, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:38:31, time_cost(all): 14:33:42/11:44:20, loss=0.418544378572544, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=3.1039234320518343, lr=0.012528433323855562
2023-12-22 05:02:53   INFO  epoch: 13/24, acc_iter=47271, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:36:32, time_cost(all): 14:34:38/12:00:50, loss=0.4183480872532, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=4.905830768647497, lr=0.012505906922913656
2023-12-22 05:03:48   INFO  epoch: 13/24, acc_iter=47321, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:35:41, time_cost(all): 14:35:33/11:47:49, loss=0.418151795933857, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=2.9797899693320966, lr=0.012483380521971753
2023-12-22 05:04:44   INFO  epoch: 13/24, acc_iter=47371, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:33:33, time_cost(all): 14:36:29/11:32:20, loss=0.417955504614513, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=0.8468110716130861, lr=0.01246085412102985
2023-12-22 05:05:40   INFO  epoch: 13/24, acc_iter=47421, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:33:06, time_cost(all): 14:37:25/11:44:54, loss=0.41775921329517, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=4.557554693074339, lr=0.012438327720087947
2023-12-22 05:06:36   INFO  epoch: 13/24, acc_iter=47471, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:18, time_cost(all): 14:38:21/11:21:01, loss=0.417562921975827, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.05(1.03), norm=2.61299396409256, lr=0.01241580131914604
2023-12-22 05:07:31   INFO  epoch: 13/24, acc_iter=47521, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:32:21, time_cost(all): 14:39:16/11:14:50, loss=0.417366630656483, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=1.4509216061453438, lr=0.012393274918204137
2023-12-22 05:08:27   INFO  epoch: 13/24, acc_iter=47571, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:28, time_cost(all): 14:40:12/11:14:35, loss=0.41717033933714, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=4.8207940907372455, lr=0.012370748517262234
2023-12-22 05:09:23   INFO  epoch: 13/24, acc_iter=47621, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:28:52, time_cost(all): 14:41:08/11:37:18, loss=0.416974048017796, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=2.8703814957175564, lr=0.01234822211632033
2023-12-22 05:10:19   INFO  epoch: 13/24, acc_iter=47671, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:27:43, time_cost(all): 14:42:04/11:57:26, loss=0.416777756698453, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.081610713436371, lr=0.012325695715378424
2023-12-22 05:11:14   INFO  epoch: 13/24, acc_iter=47721, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:15, time_cost(all): 14:42:59/11:20:10, loss=0.416581465379109, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=4.191045229741901, lr=0.012303169314436521
2023-12-22 05:12:10   INFO  epoch: 13/24, acc_iter=47771, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:19, time_cost(all): 14:43:55/11:08:10, loss=0.416385174059766, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=3.1145557652902243, lr=0.012280642913494618
2023-12-22 05:13:06   INFO  epoch: 13/24, acc_iter=47821, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:27:20, time_cost(all): 14:44:51/11:09:36, loss=0.416188882740423, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=1.7359132803951152, lr=0.012258116512552715
2023-12-22 05:14:02   INFO  epoch: 13/24, acc_iter=47871, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:26:10, time_cost(all): 14:45:47/11:14:17, loss=0.415992591421079, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=0.5864299900067929, lr=0.012235590111610808
2023-12-22 05:14:58   INFO  epoch: 13/24, acc_iter=47921, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:21, time_cost(all): 14:46:43/11:21:44, loss=0.415796300101736, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=4.4510363894579505, lr=0.012213063710668905
2023-12-22 05:15:53   INFO  epoch: 13/24, acc_iter=47971, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:36, time_cost(all): 14:47:38/10:53:00, loss=0.415600008782392, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=4.441398682565117, lr=0.012190537309727002
2023-12-22 05:16:49   INFO  epoch: 13/24, acc_iter=48021, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:30, time_cost(all): 14:48:34/11:34:54, loss=0.415403717463049, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=2.5128785653156003, lr=0.0121680109087851
2023-12-22 05:17:45   INFO  epoch: 13/24, acc_iter=48071, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:24, time_cost(all): 14:49:30/11:07:11, loss=0.415207426143705, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=4.172823994873743, lr=0.012145484507843193
2023-12-22 05:18:41   INFO  epoch: 13/24, acc_iter=48121, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:21:22, time_cost(all): 14:50:26/11:31:54, loss=0.415011134824362, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=4.727679893357553, lr=0.01212295810690129
2023-12-22 05:19:36   INFO  epoch: 13/24, acc_iter=48171, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:15, time_cost(all): 14:51:21/10:52:16, loss=0.414814843505019, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=4.840237116297129, lr=0.012100431705959386
2023-12-22 05:20:32   INFO  epoch: 13/24, acc_iter=48221, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:02, time_cost(all): 14:52:17/11:30:05, loss=0.414618552185675, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=2.7891127507735263, lr=0.012077905305017483
2023-12-22 05:21:28   INFO  epoch: 13/24, acc_iter=48271, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:39, time_cost(all): 14:53:13/11:33:22, loss=0.414422260866332, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=3.622170108400123, lr=0.012055378904075577
2023-12-22 05:22:24   INFO  epoch: 13/24, acc_iter=48321, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:43, time_cost(all): 14:54:09/10:42:36, loss=0.414225969546988, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=4.553004083877518, lr=0.012032852503133674
2023-12-22 05:23:19   INFO  epoch: 13/24, acc_iter=48371, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:12, time_cost(all): 14:55:04/11:36:23, loss=0.414029678227645, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=3.4256569430474526, lr=0.01201032610219177
2023-12-22 05:24:15   INFO  epoch: 13/24, acc_iter=48421, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:14:36, time_cost(all): 14:56:00/11:15:59, loss=0.413833386908301, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=4.398886761665901, lr=0.011987799701249868
2023-12-22 05:25:11   INFO  epoch: 13/24, acc_iter=48471, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:05, time_cost(all): 14:56:56/11:21:45, loss=0.413637095588958, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=2.7713723435954085, lr=0.011965273300307961
2023-12-22 05:26:07   INFO  epoch: 13/24, acc_iter=48521, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:46, time_cost(all): 14:57:52/11:15:32, loss=0.413440804269614, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=4.20008085496176, lr=0.011942746899366058
2023-12-22 05:27:03   INFO  epoch: 13/24, acc_iter=48571, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:43, time_cost(all): 14:58:48/10:52:22, loss=0.413244512950271, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=1.9501965871521478, lr=0.011920220498424155
2023-12-22 05:27:58   INFO  epoch: 13/24, acc_iter=48621, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:57, time_cost(all): 14:59:43/11:24:46, loss=0.413048221630928, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=1.2833435768510413, lr=0.011897694097482252
2023-12-22 05:28:54   INFO  epoch: 13/24, acc_iter=48671, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:04, time_cost(all): 15:00:39/11:18:43, loss=0.412851930311584, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=1.9388891572460105, lr=0.011875167696540345
2023-12-22 05:29:50   INFO  epoch: 13/24, acc_iter=48721, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:40, time_cost(all): 15:01:35/11:33:11, loss=0.412655638992241, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=2.557064991727446, lr=0.011852641295598442
2023-12-22 05:30:46   INFO  epoch: 13/24, acc_iter=48771, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:30, time_cost(all): 15:02:31/10:52:24, loss=0.412459347672897, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=4.513998094807134, lr=0.011830114894656539
2023-12-22 05:31:41   INFO  epoch: 13/24, acc_iter=48821, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:39, time_cost(all): 15:03:26/10:34:00, loss=0.412263056353554, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=4.6126473780234845, lr=0.011807588493714636
2023-12-22 05:32:37   INFO  epoch: 13/24, acc_iter=48871, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:39, time_cost(all): 15:04:22/10:54:45, loss=0.41206676503421, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=3.51721460850929, lr=0.01178506209277273
2023-12-22 05:33:33   INFO  epoch: 13/24, acc_iter=48921, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:58, time_cost(all): 15:05:18/10:35:54, loss=0.411870473714867, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=2.3737290342132233, lr=0.011762535691830826
2023-12-22 05:34:29   INFO  epoch: 13/24, acc_iter=48971, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:56, time_cost(all): 15:06:14/11:31:15, loss=0.411674182395524, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.03(1.03), norm=3.311128617055525, lr=0.011740009290888923
2023-12-22 05:35:24   INFO  epoch: 13/24, acc_iter=49021, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:14, time_cost(all): 15:07:09/11:09:04, loss=0.41147789107618, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=0.6206648890879206, lr=0.01171748288994702
2023-12-22 05:36:20   INFO  epoch: 13/24, acc_iter=49071, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:00, time_cost(all): 15:08:05/10:34:17, loss=0.411281599756837, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=3.3151909153004415, lr=0.011694956489005114
2023-12-22 05:37:16   INFO  epoch: 13/24, acc_iter=49121, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:12, time_cost(all): 15:09:01/10:55:30, loss=0.411085308437493, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=0.8111190558743742, lr=0.01167243008806321
2023-12-22 05:38:12   INFO  epoch: 13/24, acc_iter=49171, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:14, time_cost(all): 15:09:57/10:56:10, loss=0.41088901711815, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=2.950114088196331, lr=0.011649903687121307
2023-12-22 05:39:08   INFO  epoch: 13/24, acc_iter=49221, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 15:10:53/11:09:26, loss=0.410692725798806, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=0.5102392489354346, lr=0.011627377286179404
2023-12-22 05:40:03   INFO  epoch: 14/24, acc_iter=49288, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:05:34, time_cost(all): 15:11:48/11:11:19, loss=0.410429695430886, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=4.8111847571777355, lr=0.011597191908917254
2023-12-22 05:40:59   INFO  epoch: 14/24, acc_iter=49338, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:02:12, time_cost(all): 15:12:44/11:17:36, loss=0.410233404111543, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=4.0668889546822875, lr=0.01157466550797535
2023-12-22 05:41:55   INFO  epoch: 14/24, acc_iter=49388, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:03:50, time_cost(all): 15:13:40/11:08:45, loss=0.410037112792199, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.9415328148259512, lr=0.011552139107033448
2023-12-22 05:42:51   INFO  epoch: 14/24, acc_iter=49438, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/0:58:37, time_cost(all): 15:14:36/11:11:19, loss=0.409840821472856, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=4.761543552906307, lr=0.011529612706091544
2023-12-22 05:43:46   INFO  epoch: 14/24, acc_iter=49488, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:01:32, time_cost(all): 15:15:31/11:10:06, loss=0.409644530153512, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=4.15673422838448, lr=0.011507086305149638
2023-12-22 05:44:42   INFO  epoch: 14/24, acc_iter=49538, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:00:50, time_cost(all): 15:16:27/10:52:59, loss=0.409448238834169, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=0.7237183730295569, lr=0.011484559904207735
2023-12-22 05:45:38   INFO  epoch: 14/24, acc_iter=49588, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:58:00, time_cost(all): 15:17:23/11:22:56, loss=0.409251947514826, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=3.392539836930853, lr=0.011462033503265832
2023-12-22 05:46:34   INFO  epoch: 14/24, acc_iter=49638, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:55:09, time_cost(all): 15:18:19/11:13:09, loss=0.409055656195482, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=2.779414793868945, lr=0.011439507102323929
2023-12-22 05:47:29   INFO  epoch: 14/24, acc_iter=49688, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:59:44, time_cost(all): 15:19:14/11:18:15, loss=0.408859364876139, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=4.0484797631859015, lr=0.011416980701382022
2023-12-22 05:48:25   INFO  epoch: 14/24, acc_iter=49738, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:55:45, time_cost(all): 15:20:10/11:13:08, loss=0.408663073556795, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=0.5024509175959583, lr=0.011394454300440119
2023-12-22 05:49:21   INFO  epoch: 14/24, acc_iter=49788, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:52:50, time_cost(all): 15:21:06/10:57:29, loss=0.408466782237452, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=1.3468923410026488, lr=0.011371927899498216
2023-12-22 05:50:17   INFO  epoch: 14/24, acc_iter=49838, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:54:29, time_cost(all): 15:22:02/11:03:34, loss=0.408270490918108, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=3.435985930821804, lr=0.011349401498556313
2023-12-22 05:51:13   INFO  epoch: 14/24, acc_iter=49888, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:01, time_cost(all): 15:22:58/10:50:18, loss=0.408074199598765, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=1.0624626485709263, lr=0.01132687509761441
2023-12-22 05:52:08   INFO  epoch: 14/24, acc_iter=49938, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:52:20, time_cost(all): 15:23:53/10:30:59, loss=0.407877908279422, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=2.006520524359085, lr=0.011304348696672503
2023-12-22 05:53:04   INFO  epoch: 14/24, acc_iter=49988, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:50:51, time_cost(all): 15:24:49/10:58:12, loss=0.407681616960078, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=2.9329465791460425, lr=0.0112818222957306
2023-12-22 05:54:00   INFO  epoch: 14/24, acc_iter=50038, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:50:33, time_cost(all): 15:25:45/11:11:54, loss=0.407485325640735, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=3.9959270285538677, lr=0.011259295894788697
2023-12-22 05:54:56   INFO  epoch: 14/24, acc_iter=50088, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:47:52, time_cost(all): 15:26:41/10:13:51, loss=0.407289034321391, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=1.0201282932918498, lr=0.011236769493846794
2023-12-22 05:55:51   INFO  epoch: 14/24, acc_iter=50138, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:25, time_cost(all): 15:27:36/10:51:16, loss=0.407092743002048, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=2.3963226519694727, lr=0.011214243092904887
2023-12-22 05:56:47   INFO  epoch: 14/24, acc_iter=50188, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:46:28, time_cost(all): 15:28:32/10:47:31, loss=0.406896451682704, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=3.2876689982982565, lr=0.011191716691962984
2023-12-22 05:57:43   INFO  epoch: 14/24, acc_iter=50238, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:46:44, time_cost(all): 15:29:28/10:31:45, loss=0.406700160363361, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=3.5523476929842177, lr=0.011169190291021081
2023-12-22 05:58:39   INFO  epoch: 14/24, acc_iter=50288, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:44:15, time_cost(all): 15:30:24/10:53:56, loss=0.406503869044018, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=3.821017889756635, lr=0.011146663890079178
2023-12-22 05:59:34   INFO  epoch: 14/24, acc_iter=50338, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:43:06, time_cost(all): 15:31:19/10:29:08, loss=0.406307577724674, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=3.0833121360965494, lr=0.011124137489137272
2023-12-22 06:00:30   INFO  epoch: 14/24, acc_iter=50388, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:43:17, time_cost(all): 15:32:15/10:17:22, loss=0.406111286405331, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=1.0637413941300804, lr=0.011101611088195368
2023-12-22 06:01:26   INFO  epoch: 14/24, acc_iter=50438, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:42:29, time_cost(all): 15:33:11/10:48:42, loss=0.405914995085987, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=1.184696813245809, lr=0.011079084687253465
2023-12-22 06:02:22   INFO  epoch: 14/24, acc_iter=50488, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:17, time_cost(all): 15:34:07/10:45:42, loss=0.405718703766644, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=2.1496665541185354, lr=0.011056558286311562
2023-12-22 06:03:18   INFO  epoch: 14/24, acc_iter=50538, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:39:40, time_cost(all): 15:35:03/10:53:01, loss=0.4055224124473, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=2.4305841483115005, lr=0.011034031885369656
2023-12-22 06:04:13   INFO  epoch: 14/24, acc_iter=50588, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:42:02, time_cost(all): 15:35:58/10:39:40, loss=0.405326121127957, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=1.8064151702265818, lr=0.011011505484427753
2023-12-22 06:05:09   INFO  epoch: 14/24, acc_iter=50638, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:41, time_cost(all): 15:36:54/10:07:43, loss=0.405129829808614, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=0.6498732682953725, lr=0.01098897908348585
2023-12-22 06:06:05   INFO  epoch: 14/24, acc_iter=50688, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:38:56, time_cost(all): 15:37:50/10:55:07, loss=0.40493353848927, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=3.1983794897304647, lr=0.010966452682543947
2023-12-22 06:07:01   INFO  epoch: 14/24, acc_iter=50738, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:15, time_cost(all): 15:38:46/10:12:02, loss=0.404737247169927, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=4.546451242161551, lr=0.01094392628160204
2023-12-22 06:07:56   INFO  epoch: 14/24, acc_iter=50788, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:37:29, time_cost(all): 15:39:41/10:51:05, loss=0.404540955850583, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=4.33190740519724, lr=0.010921399880660137
2023-12-22 06:08:52   INFO  epoch: 14/24, acc_iter=50838, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:35:32, time_cost(all): 15:40:37/10:26:53, loss=0.40434466453124, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=3.665651383107385, lr=0.010898873479718234
2023-12-22 06:09:48   INFO  epoch: 14/24, acc_iter=50888, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:34:01, time_cost(all): 15:41:33/10:32:37, loss=0.404148373211896, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=1.2179817448765886, lr=0.01087634707877633
2023-12-22 06:10:44   INFO  epoch: 14/24, acc_iter=50938, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:49, time_cost(all): 15:42:29/9:59:39, loss=0.403952081892553, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=3.0922918984622787, lr=0.010853820677834424
2023-12-22 06:11:39   INFO  epoch: 14/24, acc_iter=50988, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:34:05, time_cost(all): 15:43:24/10:31:13, loss=0.403755790573209, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=0.5894837932912294, lr=0.010831294276892521
2023-12-22 06:12:35   INFO  epoch: 14/24, acc_iter=51038, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:33:15, time_cost(all): 15:44:20/10:26:26, loss=0.403559499253866, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=2.6480297969149347, lr=0.010808767875950618
2023-12-22 06:13:31   INFO  epoch: 14/24, acc_iter=51088, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:30:09, time_cost(all): 15:45:16/10:37:14, loss=0.403363207934523, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=1.6374490165179274, lr=0.010786241475008715
2023-12-22 06:14:27   INFO  epoch: 14/24, acc_iter=51138, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:30:46, time_cost(all): 15:46:12/10:04:38, loss=0.403166916615179, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=3.0460913183777643, lr=0.010763715074066808
2023-12-22 06:15:23   INFO  epoch: 14/24, acc_iter=51188, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:28:05, time_cost(all): 15:47:08/10:07:05, loss=0.402970625295836, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=0.5709601650713743, lr=0.010741188673124905
2023-12-22 06:16:18   INFO  epoch: 14/24, acc_iter=51238, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:24, time_cost(all): 15:48:03/9:56:34, loss=0.402774333976492, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=2.6915772002684015, lr=0.010718662272183002
2023-12-22 06:17:14   INFO  epoch: 14/24, acc_iter=51288, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:25:58, time_cost(all): 15:48:59/10:36:32, loss=0.402578042657149, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=4.143403042776835, lr=0.010696135871241099
2023-12-22 06:18:10   INFO  epoch: 14/24, acc_iter=51338, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:25:51, time_cost(all): 15:49:55/10:09:56, loss=0.402381751337805, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=1.7554441151496873, lr=0.010673609470299193
2023-12-22 06:19:06   INFO  epoch: 14/24, acc_iter=51388, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:26:02, time_cost(all): 15:50:51/10:18:03, loss=0.402185460018462, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=1.2076445764227772, lr=0.01065108306935729
2023-12-22 06:20:01   INFO  epoch: 14/24, acc_iter=51438, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:25:17, time_cost(all): 15:51:46/10:09:21, loss=0.401989168699119, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=4.647172844349139, lr=0.010628556668415386
2023-12-22 06:20:57   INFO  epoch: 14/24, acc_iter=51488, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:23:14, time_cost(all): 15:52:42/9:53:06, loss=0.401792877379775, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=4.98636320041316, lr=0.010606030267473483
2023-12-22 06:21:53   INFO  epoch: 14/24, acc_iter=51538, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:06, time_cost(all): 15:53:38/10:02:54, loss=0.401596586060432, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=4.360776340993766, lr=0.010583503866531577
2023-12-22 06:22:49   INFO  epoch: 14/24, acc_iter=51588, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:22:37, time_cost(all): 15:54:34/9:44:07, loss=0.401400294741088, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=4.8999907877259785, lr=0.010560977465589674
2023-12-22 06:23:44   INFO  epoch: 14/24, acc_iter=51638, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:23, time_cost(all): 15:55:29/10:11:46, loss=0.401204003421745, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=2.7122409245062307, lr=0.01053845106464777
2023-12-22 06:24:40   INFO  epoch: 14/24, acc_iter=51688, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:37, time_cost(all): 15:56:25/9:53:51, loss=0.401007712102401, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=3.9264568334109393, lr=0.010515924663705867
2023-12-22 06:25:36   INFO  epoch: 14/24, acc_iter=51738, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:06, time_cost(all): 15:57:21/10:08:23, loss=0.400811420783058, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=1.9653466212872224, lr=0.010493398262763961
2023-12-22 06:26:32   INFO  epoch: 14/24, acc_iter=51788, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:30, time_cost(all): 15:58:17/10:33:32, loss=0.400615129463715, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=1.9577060023969808, lr=0.010470871861822058
2023-12-22 06:27:28   INFO  epoch: 14/24, acc_iter=51838, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:46, time_cost(all): 15:59:13/10:02:19, loss=0.400418838144371, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=4.3308596047545365, lr=0.010448345460880155
2023-12-22 06:28:23   INFO  epoch: 14/24, acc_iter=51888, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:51, time_cost(all): 16:00:08/10:31:16, loss=0.400222546825028, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=1.7524010011285225, lr=0.010425819059938252
2023-12-22 06:29:19   INFO  epoch: 14/24, acc_iter=51938, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:14:35, time_cost(all): 16:01:04/10:22:34, loss=0.400026255505684, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=2.086039163440821, lr=0.010403292658996345
2023-12-22 06:30:15   INFO  epoch: 14/24, acc_iter=51988, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:13:53, time_cost(all): 16:02:00/9:38:03, loss=0.399829964186341, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=0.977729395468935, lr=0.010380766258054442
2023-12-22 06:31:11   INFO  epoch: 14/24, acc_iter=52038, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:29, time_cost(all): 16:02:56/10:18:42, loss=0.399633672866997, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=1.0235227341318918, lr=0.010358239857112539
2023-12-22 06:32:06   INFO  epoch: 14/24, acc_iter=52088, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:23, time_cost(all): 16:03:51/9:39:19, loss=0.399437381547654, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=2.3598061664790326, lr=0.010335713456170636
2023-12-22 06:33:02   INFO  epoch: 14/24, acc_iter=52138, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:12:01, time_cost(all): 16:04:47/9:38:57, loss=0.399241090228311, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=4.3993802178630235, lr=0.01031318705522873
2023-12-22 06:33:58   INFO  epoch: 14/24, acc_iter=52188, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:33, time_cost(all): 16:05:43/9:52:47, loss=0.399044798908967, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.737824844062874, lr=0.010290660654286826
2023-12-22 06:34:54   INFO  epoch: 14/24, acc_iter=52238, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:39, time_cost(all): 16:06:39/10:18:37, loss=0.398848507589624, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=2.6816828814614584, lr=0.010268134253344923
2023-12-22 06:35:49   INFO  epoch: 14/24, acc_iter=52288, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:20, time_cost(all): 16:07:34/10:11:21, loss=0.39865221627028, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.468012380254772, lr=0.01024560785240302
2023-12-22 06:36:45   INFO  epoch: 14/24, acc_iter=52338, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:48, time_cost(all): 16:08:30/10:19:31, loss=0.398455924950937, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.6711531419662755, lr=0.010223081451461113
2023-12-22 06:37:41   INFO  epoch: 14/24, acc_iter=52388, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:41, time_cost(all): 16:09:26/9:44:19, loss=0.398259633631593, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=0.6432187378206704, lr=0.01020055505051921
2023-12-22 06:38:37   INFO  epoch: 14/24, acc_iter=52438, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:52, time_cost(all): 16:10:22/10:19:35, loss=0.39806334231225, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=2.4376361726256928, lr=0.010178028649577307
2023-12-22 06:39:32   INFO  epoch: 14/24, acc_iter=52488, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:44, time_cost(all): 16:11:17/10:20:22, loss=0.397867050992907, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=4.863549922100761, lr=0.010155502248635404
2023-12-22 06:40:28   INFO  epoch: 14/24, acc_iter=52538, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:11, time_cost(all): 16:12:13/9:43:23, loss=0.397670759673563, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=1.9378099439301173, lr=0.010132975847693498
2023-12-22 06:41:24   INFO  epoch: 14/24, acc_iter=52588, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:02, time_cost(all): 16:13:09/9:52:58, loss=0.39747446835422, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=1.4657998089565216, lr=0.010110449446751595
2023-12-22 06:42:20   INFO  epoch: 14/24, acc_iter=52638, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:12, time_cost(all): 16:14:05/9:39:11, loss=0.397278177034876, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=2.153224178505768, lr=0.010087923045809691
2023-12-22 06:43:16   INFO  epoch: 14/24, acc_iter=52688, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:12, time_cost(all): 16:15:01/9:46:08, loss=0.397081885715533, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=1.2588174406546386, lr=0.010065396644867788
2023-12-22 06:44:11   INFO  epoch: 14/24, acc_iter=52738, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 16:15:56/10:13:30, loss=0.396885594396189, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=3.155909522908197, lr=0.010042870243925882
2023-12-22 06:45:07   INFO  epoch: 15/24, acc_iter=52805, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:01:39, time_cost(all): 16:16:52/10:18:53, loss=0.396622564028269, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=1.3325002784636135, lr=0.010012684866663731
2023-12-22 06:46:03   INFO  epoch: 15/24, acc_iter=52855, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:00:59, time_cost(all): 16:17:48/9:48:25, loss=0.396426272708926, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.14358795517756, lr=0.009990158465721828
2023-12-22 06:46:59   INFO  epoch: 15/24, acc_iter=52905, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:00:53, time_cost(all): 16:18:44/10:01:48, loss=0.396229981389582, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=4.660620711142597, lr=0.009967632064779925
2023-12-22 06:47:54   INFO  epoch: 15/24, acc_iter=52955, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:00:51, time_cost(all): 16:19:39/9:48:56, loss=0.396033690070239, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=3.810510669389652, lr=0.009945105663838019
2023-12-22 06:48:50   INFO  epoch: 15/24, acc_iter=53005, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:00:44, time_cost(all): 16:20:35/10:00:41, loss=0.395837398750895, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=1.2674436641936087, lr=0.009922579262896115
2023-12-22 06:49:46   INFO  epoch: 15/24, acc_iter=53055, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:01:49, time_cost(all): 16:21:31/10:11:12, loss=0.395641107431552, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=2.641344402709528, lr=0.009900052861954212
2023-12-22 06:50:42   INFO  epoch: 15/24, acc_iter=53105, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:56:40, time_cost(all): 16:22:27/9:24:10, loss=0.395444816112209, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=1.5633482765615214, lr=0.00987752646101231
2023-12-22 06:51:37   INFO  epoch: 15/24, acc_iter=53155, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:58:20, time_cost(all): 16:23:22/9:28:52, loss=0.395248524792865, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=3.6147162431300868, lr=0.009855000060070403
2023-12-22 06:52:33   INFO  epoch: 15/24, acc_iter=53205, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:55:24, time_cost(all): 16:24:18/9:17:45, loss=0.395052233473522, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=1.1613695799187849, lr=0.0098324736591285
2023-12-22 06:53:29   INFO  epoch: 15/24, acc_iter=53255, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:53:42, time_cost(all): 16:25:14/9:21:25, loss=0.394855942154178, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=3.608527964999257, lr=0.009809947258186597
2023-12-22 06:54:25   INFO  epoch: 15/24, acc_iter=53305, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:57:12, time_cost(all): 16:26:10/9:24:37, loss=0.394659650834835, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=1.3888234943337083, lr=0.009787420857244693
2023-12-22 06:55:21   INFO  epoch: 15/24, acc_iter=53355, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:52:40, time_cost(all): 16:27:06/9:59:22, loss=0.394463359515491, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=4.81074768798163, lr=0.009764894456302787
2023-12-22 06:56:16   INFO  epoch: 15/24, acc_iter=53405, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:55:33, time_cost(all): 16:28:01/9:41:57, loss=0.394267068196148, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=1.6251703896405414, lr=0.009742368055360884
2023-12-22 06:57:12   INFO  epoch: 15/24, acc_iter=53455, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:50:56, time_cost(all): 16:28:57/9:42:40, loss=0.394070776876804, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=3.127642478851461, lr=0.00971984165441898
2023-12-22 06:58:08   INFO  epoch: 15/24, acc_iter=53505, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:52:05, time_cost(all): 16:29:53/9:12:52, loss=0.393874485557461, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=4.216611660288974, lr=0.009697315253477078
2023-12-22 06:59:04   INFO  epoch: 15/24, acc_iter=53555, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:51:09, time_cost(all): 16:30:49/9:18:15, loss=0.393678194238118, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=2.3004071739894654, lr=0.009674788852535175
2023-12-22 06:59:59   INFO  epoch: 15/24, acc_iter=53605, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:50:22, time_cost(all): 16:31:44/9:22:12, loss=0.393481902918774, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=4.193973514452971, lr=0.009652262451593268
2023-12-22 07:00:55   INFO  epoch: 15/24, acc_iter=53655, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:49:33, time_cost(all): 16:32:40/9:20:45, loss=0.393285611599431, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=1.9142222335794912, lr=0.009629736050651365
2023-12-22 07:01:51   INFO  epoch: 15/24, acc_iter=53705, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:47:04, time_cost(all): 16:33:36/9:44:51, loss=0.393089320280087, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=1.1368374849946563, lr=0.009607209649709462
2023-12-22 07:02:47   INFO  epoch: 15/24, acc_iter=53755, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:46:11, time_cost(all): 16:34:32/9:58:27, loss=0.392893028960744, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=0.8804583972550792, lr=0.009584683248767559
2023-12-22 07:03:42   INFO  epoch: 15/24, acc_iter=53805, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:46:49, time_cost(all): 16:35:27/9:21:03, loss=0.3926967376414, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=3.1992731509219023, lr=0.009562156847825656
2023-12-22 07:04:38   INFO  epoch: 15/24, acc_iter=53855, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:00, time_cost(all): 16:36:23/9:14:27, loss=0.392500446322057, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=3.6599775775191596, lr=0.009539630446883753
2023-12-22 07:05:34   INFO  epoch: 15/24, acc_iter=53905, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:43:58, time_cost(all): 16:37:19/9:42:46, loss=0.392304155002714, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=4.422549055466453, lr=0.009517104045941846
2023-12-22 07:06:30   INFO  epoch: 15/24, acc_iter=53955, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:41:23, time_cost(all): 16:38:15/9:26:03, loss=0.39210786368337, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=3.4731120025964928, lr=0.009494577644999943
2023-12-22 07:07:26   INFO  epoch: 15/24, acc_iter=54005, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:18, time_cost(all): 16:39:11/9:57:36, loss=0.391911572364027, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=1.2575456716355564, lr=0.00947205124405804
2023-12-22 07:08:21   INFO  epoch: 15/24, acc_iter=54055, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:42:57, time_cost(all): 16:40:06/9:46:36, loss=0.391715281044683, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=2.9232949456288715, lr=0.009449524843116137
2023-12-22 07:09:17   INFO  epoch: 15/24, acc_iter=54105, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:41:44, time_cost(all): 16:41:02/9:38:53, loss=0.39151898972534, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.23(1.03), norm=3.714364297633361, lr=0.009426998442174234
2023-12-22 07:10:13   INFO  epoch: 15/24, acc_iter=54155, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:42, time_cost(all): 16:41:58/9:37:51, loss=0.391322698405996, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=3.923426355928857, lr=0.009404472041232327
2023-12-22 07:11:09   INFO  epoch: 15/24, acc_iter=54205, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:38:19, time_cost(all): 16:42:54/9:08:27, loss=0.391126407086653, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=4.199341486135188, lr=0.009381945640290424
2023-12-22 07:12:04   INFO  epoch: 15/24, acc_iter=54255, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:38:44, time_cost(all): 16:43:49/9:20:42, loss=0.39093011576731, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=1.4575254243393476, lr=0.009359419239348521
2023-12-22 07:13:00   INFO  epoch: 15/24, acc_iter=54305, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:38:23, time_cost(all): 16:44:45/9:48:26, loss=0.390733824447966, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=3.2222158848001285, lr=0.009336892838406618
2023-12-22 07:13:56   INFO  epoch: 15/24, acc_iter=54355, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:53, time_cost(all): 16:45:41/9:50:11, loss=0.390537533128623, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=2.2476620283848305, lr=0.009314366437464715
2023-12-22 07:14:52   INFO  epoch: 15/24, acc_iter=54405, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:33:46, time_cost(all): 16:46:37/9:00:01, loss=0.390341241809279, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=4.584862518819441, lr=0.009291840036522808
2023-12-22 07:15:47   INFO  epoch: 15/24, acc_iter=54455, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:33:52, time_cost(all): 16:47:32/9:20:50, loss=0.390144950489936, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=3.43182021075529, lr=0.009269313635580905
2023-12-22 07:16:43   INFO  epoch: 15/24, acc_iter=54505, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:47, time_cost(all): 16:48:28/9:32:10, loss=0.389948659170592, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=2.0696765403477686, lr=0.009246787234639002
2023-12-22 07:17:39   INFO  epoch: 15/24, acc_iter=54555, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:32:53, time_cost(all): 16:49:24/9:07:39, loss=0.389752367851249, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=3.328831036683877, lr=0.009224260833697099
2023-12-22 07:18:35   INFO  epoch: 15/24, acc_iter=54605, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:50, time_cost(all): 16:50:20/8:52:33, loss=0.389556076531906, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=3.393872365897398, lr=0.009201734432755192
2023-12-22 07:19:31   INFO  epoch: 15/24, acc_iter=54655, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:30:15, time_cost(all): 16:51:16/9:36:57, loss=0.389359785212562, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=3.1991040233043186, lr=0.00917920803181329
2023-12-22 07:20:26   INFO  epoch: 15/24, acc_iter=54705, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:27:46, time_cost(all): 16:52:11/8:58:55, loss=0.389163493893219, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=0.8010877402181494, lr=0.009156681630871386
2023-12-22 07:21:22   INFO  epoch: 15/24, acc_iter=54755, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:08, time_cost(all): 16:53:07/9:13:50, loss=0.388967202573875, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=3.427725710359376, lr=0.009134155229929483
2023-12-22 07:22:18   INFO  epoch: 15/24, acc_iter=54805, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:25:56, time_cost(all): 16:54:03/9:01:03, loss=0.388770911254532, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=0.8129905361777783, lr=0.009111628828987577
2023-12-22 07:23:14   INFO  epoch: 15/24, acc_iter=54855, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:27:23, time_cost(all): 16:54:59/9:32:13, loss=0.388574619935188, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=3.3725480380436283, lr=0.009089102428045674
2023-12-22 07:24:09   INFO  epoch: 15/24, acc_iter=54905, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:26:04, time_cost(all): 16:55:54/8:51:34, loss=0.388378328615845, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=3.740514899612295, lr=0.00906657602710377
2023-12-22 07:25:05   INFO  epoch: 15/24, acc_iter=54955, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:31, time_cost(all): 16:56:50/9:38:21, loss=0.388182037296502, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=1.8446472925413029, lr=0.009044049626161867
2023-12-22 07:26:01   INFO  epoch: 15/24, acc_iter=55005, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:23:45, time_cost(all): 16:57:46/9:22:13, loss=0.387985745977158, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=3.6206058353530532, lr=0.00902152322521996
2023-12-22 07:26:57   INFO  epoch: 15/24, acc_iter=55055, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:31, time_cost(all): 16:58:42/9:24:38, loss=0.387789454657815, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=3.688610718368809, lr=0.008998996824278058
2023-12-22 07:27:52   INFO  epoch: 15/24, acc_iter=55105, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:22:36, time_cost(all): 16:59:37/9:27:51, loss=0.387593163338471, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.23(1.03), norm=1.2157153300861554, lr=0.008976470423336155
2023-12-22 07:28:48   INFO  epoch: 15/24, acc_iter=55155, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:21:44, time_cost(all): 17:00:33/9:06:59, loss=0.387396872019128, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=0.8246895234573755, lr=0.008953944022394252
2023-12-22 07:29:44   INFO  epoch: 15/24, acc_iter=55205, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:25, time_cost(all): 17:01:29/9:09:21, loss=0.387200580699784, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=4.744077452755583, lr=0.008931417621452345
2023-12-22 07:30:40   INFO  epoch: 15/24, acc_iter=55255, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:17:59, time_cost(all): 17:02:25/9:28:39, loss=0.387004289380441, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=4.755635736906242, lr=0.008908891220510442
2023-12-22 07:31:36   INFO  epoch: 15/24, acc_iter=55305, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:13, time_cost(all): 17:03:21/9:14:15, loss=0.386807998061098, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=2.3645470512012254, lr=0.008886364819568539
2023-12-22 07:32:31   INFO  epoch: 15/24, acc_iter=55355, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:17:28, time_cost(all): 17:04:16/9:08:46, loss=0.386611706741754, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=3.7764181756113646, lr=0.008863838418626636
2023-12-22 07:33:27   INFO  epoch: 15/24, acc_iter=55405, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:15, time_cost(all): 17:05:12/9:15:26, loss=0.386415415422411, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=2.498864294737574, lr=0.00884131201768473
2023-12-22 07:34:23   INFO  epoch: 15/24, acc_iter=55455, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:47, time_cost(all): 17:06:08/8:50:45, loss=0.386219124103067, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=4.530908609591479, lr=0.008818785616742826
2023-12-22 07:35:19   INFO  epoch: 15/24, acc_iter=55505, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:07, time_cost(all): 17:07:04/9:23:02, loss=0.386022832783724, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=2.384861757049739, lr=0.008796259215800923
2023-12-22 07:36:14   INFO  epoch: 15/24, acc_iter=55555, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:00, time_cost(all): 17:07:59/8:56:57, loss=0.38582654146438, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=3.047172919332368, lr=0.00877373281485902
2023-12-22 07:37:10   INFO  epoch: 15/24, acc_iter=55605, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:31, time_cost(all): 17:08:55/9:11:15, loss=0.385630250145037, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=2.3934540855599487, lr=0.008751206413917113
2023-12-22 07:38:06   INFO  epoch: 15/24, acc_iter=55655, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:19, time_cost(all): 17:09:51/8:34:14, loss=0.385433958825693, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=3.3953508942333346, lr=0.00872868001297521
2023-12-22 07:39:02   INFO  epoch: 15/24, acc_iter=55705, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:01, time_cost(all): 17:10:47/8:47:13, loss=0.38523766750635, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.83(1.03), norm=1.7876989869206688, lr=0.008706153612033307
2023-12-22 07:39:57   INFO  epoch: 15/24, acc_iter=55755, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:42, time_cost(all): 17:11:42/8:31:14, loss=0.385041376187007, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=0.9048642588746993, lr=0.008683627211091404
2023-12-22 07:40:53   INFO  epoch: 15/24, acc_iter=55805, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:26, time_cost(all): 17:12:38/8:43:32, loss=0.384845084867663, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=0.9616313217835777, lr=0.008661100810149498
2023-12-22 07:41:49   INFO  epoch: 15/24, acc_iter=55855, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:44, time_cost(all): 17:13:34/9:17:00, loss=0.38464879354832, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=1.2105955819844192, lr=0.008638574409207594
2023-12-22 07:42:45   INFO  epoch: 15/24, acc_iter=55905, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:54, time_cost(all): 17:14:30/9:11:25, loss=0.384452502228976, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=1.45817295350587, lr=0.008616048008265691
2023-12-22 07:43:41   INFO  epoch: 15/24, acc_iter=55955, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:06:00, time_cost(all): 17:15:26/9:18:31, loss=0.384256210909633, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=3.0604741355247227, lr=0.008593521607323788
2023-12-22 07:44:36   INFO  epoch: 15/24, acc_iter=56005, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:54, time_cost(all): 17:16:21/8:50:38, loss=0.384059919590289, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=2.5705519769350027, lr=0.008570995206381882
2023-12-22 07:45:32   INFO  epoch: 15/24, acc_iter=56055, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:03:55, time_cost(all): 17:17:17/8:56:11, loss=0.383863628270946, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=0.7762080482030753, lr=0.008548468805439979
2023-12-22 07:46:28   INFO  epoch: 15/24, acc_iter=56105, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:12, time_cost(all): 17:18:13/8:40:33, loss=0.383667336951603, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=0.9768718894359063, lr=0.008525942404498076
2023-12-22 07:47:24   INFO  epoch: 15/24, acc_iter=56155, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:04, time_cost(all): 17:19:09/8:54:56, loss=0.383471045632259, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=3.1312607023601013, lr=0.008503416003556172
2023-12-22 07:48:19   INFO  epoch: 15/24, acc_iter=56205, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:11, time_cost(all): 17:20:04/8:31:27, loss=0.383274754312916, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.05(1.03), norm=3.853302671224459, lr=0.008480889602614266
2023-12-22 07:49:15   INFO  epoch: 15/24, acc_iter=56255, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 17:21:00/8:54:22, loss=0.383078462993572, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=2.9809591721544004, lr=0.008458363201672363
2023-12-22 07:50:11   INFO  epoch: 16/24, acc_iter=56322, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:06:01, time_cost(all): 17:21:56/8:44:29, loss=0.382815432625652, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=3.7765314187832506, lr=0.008428177824410212
2023-12-22 07:51:07   INFO  epoch: 16/24, acc_iter=56372, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:01:12, time_cost(all): 17:22:52/8:35:25, loss=0.382619141306309, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=1.803479076633915, lr=0.008405651423468309
2023-12-22 07:52:02   INFO  epoch: 16/24, acc_iter=56422, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:03:07, time_cost(all): 17:23:47/8:35:18, loss=0.382422849986965, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=1.904740696466778, lr=0.008383125022526403
2023-12-22 07:52:58   INFO  epoch: 16/24, acc_iter=56472, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:03:50, time_cost(all): 17:24:43/8:24:52, loss=0.382226558667622, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=3.246422412326758, lr=0.0083605986215845
2023-12-22 07:53:54   INFO  epoch: 16/24, acc_iter=56522, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/0:58:21, time_cost(all): 17:25:39/8:38:05, loss=0.382030267348278, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=1.2153869747355963, lr=0.008338072220642596
2023-12-22 07:54:50   INFO  epoch: 16/24, acc_iter=56572, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:00:10, time_cost(all): 17:26:35/8:40:54, loss=0.381833976028935, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=0.5038384084097931, lr=0.008315545819700693
2023-12-22 07:55:45   INFO  epoch: 16/24, acc_iter=56622, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:59:12, time_cost(all): 17:27:30/9:01:12, loss=0.381637684709591, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=4.627712957543655, lr=0.008293019418758787
2023-12-22 07:56:41   INFO  epoch: 16/24, acc_iter=56672, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:59:32, time_cost(all): 17:28:26/9:05:54, loss=0.381441393390248, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=1.3297778277983263, lr=0.008270493017816884
2023-12-22 07:57:37   INFO  epoch: 16/24, acc_iter=56722, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:55:14, time_cost(all): 17:29:22/8:21:47, loss=0.381245102070905, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=4.058987152906269, lr=0.00824796661687498
2023-12-22 07:58:33   INFO  epoch: 16/24, acc_iter=56772, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:54:43, time_cost(all): 17:30:18/8:41:28, loss=0.381048810751561, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=4.383652885720037, lr=0.008225440215933078
2023-12-22 07:59:29   INFO  epoch: 16/24, acc_iter=56822, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:56:15, time_cost(all): 17:31:14/8:57:12, loss=0.380852519432218, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=3.734251377946658, lr=0.008202913814991171
2023-12-22 08:00:24   INFO  epoch: 16/24, acc_iter=56872, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:46, time_cost(all): 17:32:09/8:36:32, loss=0.380656228112874, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=4.28210784993615, lr=0.008180387414049268
2023-12-22 08:01:20   INFO  epoch: 16/24, acc_iter=56922, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:55:20, time_cost(all): 17:33:05/8:45:58, loss=0.380459936793531, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=4.767231999414241, lr=0.008157861013107365
2023-12-22 08:02:16   INFO  epoch: 16/24, acc_iter=56972, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:52:14, time_cost(all): 17:34:01/8:09:40, loss=0.380263645474187, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=3.8805623228221693, lr=0.008135334612165462
2023-12-22 08:03:12   INFO  epoch: 16/24, acc_iter=57022, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:52:17, time_cost(all): 17:34:57/8:31:14, loss=0.380067354154844, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=3.2415236043313267, lr=0.008112808211223555
2023-12-22 08:04:07   INFO  epoch: 16/24, acc_iter=57072, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:50:30, time_cost(all): 17:35:52/8:23:27, loss=0.379871062835501, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.9(1.03), norm=2.028683062096428, lr=0.008090281810281652
2023-12-22 08:05:03   INFO  epoch: 16/24, acc_iter=57122, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:51:14, time_cost(all): 17:36:48/8:13:19, loss=0.379674771516157, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=0.9730108127746073, lr=0.008067755409339749
2023-12-22 08:05:59   INFO  epoch: 16/24, acc_iter=57172, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:49, time_cost(all): 17:37:44/8:41:47, loss=0.379478480196814, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.4799097402097265, lr=0.008045229008397846
2023-12-22 08:06:55   INFO  epoch: 16/24, acc_iter=57222, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:47:38, time_cost(all): 17:38:40/8:18:56, loss=0.37928218887747, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=0.8714706827236316, lr=0.00802270260745594
2023-12-22 08:07:50   INFO  epoch: 16/24, acc_iter=57272, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:45:40, time_cost(all): 17:39:35/8:43:09, loss=0.379085897558127, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=1.142873984132391, lr=0.008000176206514036
2023-12-22 08:08:46   INFO  epoch: 16/24, acc_iter=57322, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:45:35, time_cost(all): 17:40:31/8:22:32, loss=0.378889606238783, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=4.977468083010889, lr=0.007977649805572133
2023-12-22 08:09:42   INFO  epoch: 16/24, acc_iter=57372, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:55, time_cost(all): 17:41:27/8:46:13, loss=0.37869331491944, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=3.5519635826132734, lr=0.00795512340463023
2023-12-22 08:10:38   INFO  epoch: 16/24, acc_iter=57422, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:44:32, time_cost(all): 17:42:23/8:03:02, loss=0.378497023600097, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=2.5746162069275638, lr=0.007932597003688324
2023-12-22 08:11:34   INFO  epoch: 16/24, acc_iter=57472, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:44:55, time_cost(all): 17:43:19/8:48:06, loss=0.378300732280753, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=1.1844878403424055, lr=0.00791007060274642
2023-12-22 08:12:29   INFO  epoch: 16/24, acc_iter=57522, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:50, time_cost(all): 17:44:14/8:20:29, loss=0.37810444096141, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=3.699480057780113, lr=0.007887544201804517
2023-12-22 08:13:25   INFO  epoch: 16/24, acc_iter=57572, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:41:19, time_cost(all): 17:45:10/8:06:29, loss=0.377908149642066, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=4.119737690754603, lr=0.007865017800862614
2023-12-22 08:14:21   INFO  epoch: 16/24, acc_iter=57622, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:44, time_cost(all): 17:46:06/8:11:40, loss=0.377711858322723, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=1.0097389396502012, lr=0.007842491399920708
2023-12-22 08:15:17   INFO  epoch: 16/24, acc_iter=57672, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:55, time_cost(all): 17:47:02/8:08:52, loss=0.377515567003379, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=1.282004674056206, lr=0.007819964998978805
2023-12-22 08:16:12   INFO  epoch: 16/24, acc_iter=57722, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:39:59, time_cost(all): 17:47:57/8:34:45, loss=0.377319275684036, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=3.1430848780198244, lr=0.007797438598036902
2023-12-22 08:17:08   INFO  epoch: 16/24, acc_iter=57772, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:39:21, time_cost(all): 17:48:53/8:30:36, loss=0.377122984364692, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=0.611962352770355, lr=0.007774912197094998
2023-12-22 08:18:04   INFO  epoch: 16/24, acc_iter=57822, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:37:28, time_cost(all): 17:49:49/8:24:25, loss=0.376926693045349, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=0.9019541365442989, lr=0.007752385796153092
2023-12-22 08:19:00   INFO  epoch: 16/24, acc_iter=57872, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:36:00, time_cost(all): 17:50:45/8:08:17, loss=0.376730401726006, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=4.92369558761384, lr=0.007729859395211189
2023-12-22 08:19:55   INFO  epoch: 16/24, acc_iter=57922, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:34:52, time_cost(all): 17:51:40/8:36:31, loss=0.376534110406662, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=3.909907266347676, lr=0.007707332994269286
2023-12-22 08:20:51   INFO  epoch: 16/24, acc_iter=57972, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:31, time_cost(all): 17:52:36/8:17:54, loss=0.376337819087319, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=3.704656143059345, lr=0.007684806593327383
2023-12-22 08:21:47   INFO  epoch: 16/24, acc_iter=58022, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:53, time_cost(all): 17:53:32/8:12:45, loss=0.376141527767975, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=2.280743933947573, lr=0.007662280192385476
2023-12-22 08:22:43   INFO  epoch: 16/24, acc_iter=58072, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:30:56, time_cost(all): 17:54:28/8:37:01, loss=0.375945236448632, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=1.2151982022545986, lr=0.007639753791443573
2023-12-22 08:23:39   INFO  epoch: 16/24, acc_iter=58122, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:29:56, time_cost(all): 17:55:24/8:10:47, loss=0.375748945129288, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=1.865116340976182, lr=0.00761722739050167
2023-12-22 08:24:34   INFO  epoch: 16/24, acc_iter=58172, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:31:05, time_cost(all): 17:56:19/7:55:15, loss=0.375552653809945, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=4.434037975251819, lr=0.007594700989559767
2023-12-22 08:25:30   INFO  epoch: 16/24, acc_iter=58222, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:35, time_cost(all): 17:57:15/8:12:16, loss=0.375356362490602, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=3.2234492082598902, lr=0.00757217458861786
2023-12-22 08:26:26   INFO  epoch: 16/24, acc_iter=58272, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:28:19, time_cost(all): 17:58:11/7:54:32, loss=0.375160071171258, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=1.802838159447056, lr=0.007549648187675957
2023-12-22 08:27:22   INFO  epoch: 16/24, acc_iter=58322, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:43, time_cost(all): 17:59:07/8:21:57, loss=0.374963779851915, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=3.009830627818321, lr=0.007527121786734054
2023-12-22 08:28:17   INFO  epoch: 16/24, acc_iter=58372, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:35, time_cost(all): 18:00:02/8:13:03, loss=0.374767488532571, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=4.081766888700371, lr=0.007504595385792151
2023-12-22 08:29:13   INFO  epoch: 16/24, acc_iter=58422, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:25:43, time_cost(all): 18:00:58/7:53:31, loss=0.374571197213228, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=1.5681998268931916, lr=0.007482068984850248
2023-12-22 08:30:09   INFO  epoch: 16/24, acc_iter=58472, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:23:23, time_cost(all): 18:01:54/7:52:36, loss=0.374374905893884, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=2.3948632034646966, lr=0.007459542583908345
2023-12-22 08:31:05   INFO  epoch: 16/24, acc_iter=58522, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:13, time_cost(all): 18:02:50/7:44:19, loss=0.374178614574541, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=1.6802323814491529, lr=0.007437016182966438
2023-12-22 08:32:00   INFO  epoch: 16/24, acc_iter=58572, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:23:04, time_cost(all): 18:03:45/8:11:36, loss=0.373982323255198, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=2.112397317032523, lr=0.007414489782024535
2023-12-22 08:32:56   INFO  epoch: 16/24, acc_iter=58622, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:27, time_cost(all): 18:04:41/7:58:19, loss=0.373786031935854, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=4.6666797827948585, lr=0.007391963381082632
2023-12-22 08:33:52   INFO  epoch: 16/24, acc_iter=58672, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:55, time_cost(all): 18:05:37/8:20:20, loss=0.373589740616511, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=2.349749037143114, lr=0.007369436980140729
2023-12-22 08:34:48   INFO  epoch: 16/24, acc_iter=58722, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:28, time_cost(all): 18:06:33/8:23:51, loss=0.373393449297167, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=3.550021817444899, lr=0.007346910579198826
2023-12-22 08:35:44   INFO  epoch: 16/24, acc_iter=58772, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:18, time_cost(all): 18:07:29/7:57:10, loss=0.373197157977824, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=4.296939772270228, lr=0.007324384178256923
2023-12-22 08:36:39   INFO  epoch: 16/24, acc_iter=58822, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:06, time_cost(all): 18:08:24/8:14:47, loss=0.37300086665848, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=2.7388086898758304, lr=0.007301857777315016
2023-12-22 08:37:35   INFO  epoch: 16/24, acc_iter=58872, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:59, time_cost(all): 18:09:20/7:35:58, loss=0.372804575339137, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=0.8149346307615055, lr=0.007279331376373113
2023-12-22 08:38:31   INFO  epoch: 16/24, acc_iter=58922, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:35, time_cost(all): 18:10:16/7:45:52, loss=0.372608284019794, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.072374927744707, lr=0.00725680497543121
2023-12-22 08:39:27   INFO  epoch: 16/24, acc_iter=58972, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:14:30, time_cost(all): 18:11:12/8:17:44, loss=0.37241199270045, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.8129836746330428, lr=0.007234278574489307
2023-12-22 08:40:22   INFO  epoch: 16/24, acc_iter=59022, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:31, time_cost(all): 18:12:07/7:43:12, loss=0.372215701381107, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=4.546226811396674, lr=0.007211752173547404
2023-12-22 08:41:18   INFO  epoch: 16/24, acc_iter=59072, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:16, time_cost(all): 18:13:03/8:18:42, loss=0.372019410061763, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=2.0506713625002395, lr=0.007189225772605497
2023-12-22 08:42:14   INFO  epoch: 16/24, acc_iter=59122, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:32, time_cost(all): 18:13:59/7:55:49, loss=0.37182311874242, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=4.336112533762179, lr=0.007166699371663594
2023-12-22 08:43:10   INFO  epoch: 16/24, acc_iter=59172, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:11, time_cost(all): 18:14:55/7:41:13, loss=0.371626827423076, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=2.1250490660738537, lr=0.007144172970721691
2023-12-22 08:44:05   INFO  epoch: 16/24, acc_iter=59222, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:54, time_cost(all): 18:15:50/8:16:17, loss=0.371430536103733, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=3.874028440151284, lr=0.007121646569779788
2023-12-22 08:45:01   INFO  epoch: 16/24, acc_iter=59272, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:16, time_cost(all): 18:16:46/7:57:29, loss=0.371234244784389, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=4.571042382830393, lr=0.007099120168837882
2023-12-22 08:45:57   INFO  epoch: 16/24, acc_iter=59322, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:43, time_cost(all): 18:17:42/7:30:52, loss=0.371037953465046, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=1.736254050422866, lr=0.007076593767895979
2023-12-22 08:46:53   INFO  epoch: 16/24, acc_iter=59372, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:22, time_cost(all): 18:18:38/7:29:53, loss=0.370841662145703, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=4.285834881960297, lr=0.007054067366954075
2023-12-22 08:47:49   INFO  epoch: 16/24, acc_iter=59422, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:29, time_cost(all): 18:19:34/7:37:41, loss=0.370645370826359, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=2.9415993155597544, lr=0.007031540966012172
2023-12-22 08:48:44   INFO  epoch: 16/24, acc_iter=59472, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:55, time_cost(all): 18:20:29/7:35:07, loss=0.370449079507016, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=3.8973174127074226, lr=0.007009014565070266
2023-12-22 08:49:40   INFO  epoch: 16/24, acc_iter=59522, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:54, time_cost(all): 18:21:25/7:47:05, loss=0.370252788187672, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=1.1340246750678613, lr=0.006986488164128363
2023-12-22 08:50:36   INFO  epoch: 16/24, acc_iter=59572, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:13, time_cost(all): 18:22:21/7:32:49, loss=0.370056496868329, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=3.6067099637910123, lr=0.00696396176318646
2023-12-22 08:51:32   INFO  epoch: 16/24, acc_iter=59622, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:04, time_cost(all): 18:23:17/7:31:57, loss=0.369860205548985, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=3.2867107593604175, lr=0.006941435362244557
2023-12-22 08:52:27   INFO  epoch: 16/24, acc_iter=59672, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:14, time_cost(all): 18:24:12/8:04:40, loss=0.369663914229642, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=3.935379795432404, lr=0.00691890896130265
2023-12-22 08:53:23   INFO  epoch: 16/24, acc_iter=59722, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:13, time_cost(all): 18:25:08/7:56:14, loss=0.369467622910299, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=4.639463495244144, lr=0.006896382560360747
2023-12-22 08:54:19   INFO  epoch: 16/24, acc_iter=59772, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 18:26:04/7:39:10, loss=0.369271331590955, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=4.585519303177364, lr=0.006873856159418844
2023-12-22 08:55:15   INFO  epoch: 17/24, acc_iter=59839, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:05:08, time_cost(all): 18:27:00/7:56:01, loss=0.369008301223035, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=2.766726033474857, lr=0.006843670782156693
2023-12-22 08:56:10   INFO  epoch: 17/24, acc_iter=59889, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:06:13, time_cost(all): 18:27:55/7:39:32, loss=0.368812009903692, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=0.7377311367194967, lr=0.006821144381214787
2023-12-22 08:57:06   INFO  epoch: 17/24, acc_iter=59939, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:03:50, time_cost(all): 18:28:51/7:54:51, loss=0.368615718584348, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=1.4845123167800127, lr=0.006798617980272884
2023-12-22 08:58:02   INFO  epoch: 17/24, acc_iter=59989, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:03:09, time_cost(all): 18:29:47/7:20:02, loss=0.368419427265005, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=3.7117369749766103, lr=0.00677609157933098
2023-12-22 08:58:58   INFO  epoch: 17/24, acc_iter=60039, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:01:21, time_cost(all): 18:30:43/7:40:18, loss=0.368223135945661, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=0.8748887191125574, lr=0.006753565178389077
2023-12-22 08:59:54   INFO  epoch: 17/24, acc_iter=60089, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:58:27, time_cost(all): 18:31:39/7:31:27, loss=0.368026844626318, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=0.8820489675952202, lr=0.006731038777447171
2023-12-22 09:00:49   INFO  epoch: 17/24, acc_iter=60139, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:56:14, time_cost(all): 18:32:34/7:47:18, loss=0.367830553306974, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=3.6272964567185118, lr=0.006708512376505268
2023-12-22 09:01:45   INFO  epoch: 17/24, acc_iter=60189, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:59:29, time_cost(all): 18:33:30/7:41:44, loss=0.367634261987631, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=0.5858202309315581, lr=0.006685985975563365
2023-12-22 09:02:41   INFO  epoch: 17/24, acc_iter=60239, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:58:47, time_cost(all): 18:34:26/7:28:52, loss=0.367437970668288, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=2.6560638565333394, lr=0.006663459574621462
2023-12-22 09:03:37   INFO  epoch: 17/24, acc_iter=60289, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:56:20, time_cost(all): 18:35:22/7:44:17, loss=0.367241679348944, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=1.3557057658451392, lr=0.006640933173679555
2023-12-22 09:04:32   INFO  epoch: 17/24, acc_iter=60339, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:56:04, time_cost(all): 18:36:17/7:34:31, loss=0.367045388029601, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=2.271052774278295, lr=0.006618406772737652
2023-12-22 09:05:28   INFO  epoch: 17/24, acc_iter=60389, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:25, time_cost(all): 18:37:13/7:32:04, loss=0.366849096710257, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.02(1.03), norm=2.973907336692074, lr=0.006595880371795749
2023-12-22 09:06:24   INFO  epoch: 17/24, acc_iter=60439, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:51:37, time_cost(all): 18:38:09/7:29:53, loss=0.366652805390914, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=0.7499788458677142, lr=0.006573353970853846
2023-12-22 09:07:20   INFO  epoch: 17/24, acc_iter=60489, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:50:09, time_cost(all): 18:39:05/7:14:24, loss=0.36645651407157, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=3.4484099308974496, lr=0.006550827569911939
2023-12-22 09:08:15   INFO  epoch: 17/24, acc_iter=60539, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:51:16, time_cost(all): 18:40:00/7:22:01, loss=0.366260222752227, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=3.1776795815445906, lr=0.006528301168970036
2023-12-22 09:09:11   INFO  epoch: 17/24, acc_iter=60589, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:22, time_cost(all): 18:40:56/7:12:50, loss=0.366063931432884, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=0.6095670327382801, lr=0.006505774768028133
2023-12-22 09:10:07   INFO  epoch: 17/24, acc_iter=60639, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:51:58, time_cost(all): 18:41:52/7:40:48, loss=0.36586764011354, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=2.2283302813238803, lr=0.00648324836708623
2023-12-22 09:11:03   INFO  epoch: 17/24, acc_iter=60689, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:49:39, time_cost(all): 18:42:48/7:40:05, loss=0.365671348794197, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.0040116484895263, lr=0.006460721966144323
2023-12-22 09:11:59   INFO  epoch: 17/24, acc_iter=60739, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:46:31, time_cost(all): 18:43:44/7:27:35, loss=0.365475057474853, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=2.568384406130805, lr=0.00643819556520242
2023-12-22 09:12:54   INFO  epoch: 17/24, acc_iter=60789, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:46:33, time_cost(all): 18:44:39/7:04:02, loss=0.36527876615551, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=3.8821302726811973, lr=0.006415669164260517
2023-12-22 09:13:50   INFO  epoch: 17/24, acc_iter=60839, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:44:08, time_cost(all): 18:45:35/7:11:00, loss=0.365082474836166, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=4.344374866864394, lr=0.006393142763318614
2023-12-22 09:14:46   INFO  epoch: 17/24, acc_iter=60889, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:50, time_cost(all): 18:46:31/7:36:20, loss=0.364886183516823, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=3.112739331770964, lr=0.006370616362376708
2023-12-22 09:15:42   INFO  epoch: 17/24, acc_iter=60939, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:44:33, time_cost(all): 18:47:27/7:22:53, loss=0.364689892197479, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=2.7172193254746837, lr=0.006348089961434805
2023-12-22 09:16:37   INFO  epoch: 17/24, acc_iter=60989, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:20, time_cost(all): 18:48:22/7:30:45, loss=0.364493600878136, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=3.1441140098224687, lr=0.006325563560492901
2023-12-22 09:17:33   INFO  epoch: 17/24, acc_iter=61039, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:41:27, time_cost(all): 18:49:18/7:17:01, loss=0.364297309558793, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=0.7617935101109898, lr=0.006303037159550998
2023-12-22 09:18:29   INFO  epoch: 17/24, acc_iter=61089, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:42:36, time_cost(all): 18:50:14/7:29:45, loss=0.364101018239449, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=4.787569335575554, lr=0.006280510758609092
2023-12-22 09:19:25   INFO  epoch: 17/24, acc_iter=61139, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:41:24, time_cost(all): 18:51:10/7:37:38, loss=0.363904726920106, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=3.630073941814172, lr=0.006257984357667189
2023-12-22 09:20:20   INFO  epoch: 17/24, acc_iter=61189, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:39:09, time_cost(all): 18:52:05/7:28:59, loss=0.363708435600762, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=3.1477295468420357, lr=0.006235457956725286
2023-12-22 09:21:16   INFO  epoch: 17/24, acc_iter=61239, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:39:56, time_cost(all): 18:53:01/6:56:02, loss=0.363512144281419, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=2.0840122653310726, lr=0.006212931555783383
2023-12-22 09:22:12   INFO  epoch: 17/24, acc_iter=61289, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:46, time_cost(all): 18:53:57/7:02:28, loss=0.363315852962075, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=1.983376111623564, lr=0.006190405154841476
2023-12-22 09:23:08   INFO  epoch: 17/24, acc_iter=61339, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:38:17, time_cost(all): 18:54:53/7:10:42, loss=0.363119561642732, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=1.703515302429711, lr=0.006167878753899573
2023-12-22 09:24:03   INFO  epoch: 17/24, acc_iter=61389, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:33:59, time_cost(all): 18:55:48/7:20:13, loss=0.362923270323389, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=4.1040148257645335, lr=0.00614535235295767
2023-12-22 09:24:59   INFO  epoch: 17/24, acc_iter=61439, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:36:06, time_cost(all): 18:56:44/7:27:16, loss=0.362726979004045, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=2.87015330674979, lr=0.006122825952015767
2023-12-22 09:25:55   INFO  epoch: 17/24, acc_iter=61489, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:25, time_cost(all): 18:57:40/6:51:48, loss=0.362530687684702, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=4.931357932695504, lr=0.00610029955107386
2023-12-22 09:26:51   INFO  epoch: 17/24, acc_iter=61539, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:31:55, time_cost(all): 18:58:36/7:08:22, loss=0.362334396365358, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=3.4336922432350745, lr=0.006077773150131957
2023-12-22 09:27:47   INFO  epoch: 17/24, acc_iter=61589, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:31:59, time_cost(all): 18:59:32/7:26:25, loss=0.362138105046015, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=4.433307439242529, lr=0.006055246749190054
2023-12-22 09:28:42   INFO  epoch: 17/24, acc_iter=61639, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:31, time_cost(all): 19:00:27/7:19:53, loss=0.361941813726671, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=3.923889268707106, lr=0.006032720348248151
2023-12-22 09:29:38   INFO  epoch: 17/24, acc_iter=61689, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:28:59, time_cost(all): 19:01:23/7:01:22, loss=0.361745522407328, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=4.693264237614515, lr=0.006010193947306244
2023-12-22 09:30:34   INFO  epoch: 17/24, acc_iter=61739, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:30:16, time_cost(all): 19:02:19/7:24:53, loss=0.361549231087985, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=1.0195911273572227, lr=0.005987667546364341
2023-12-22 09:31:30   INFO  epoch: 17/24, acc_iter=61789, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:29:36, time_cost(all): 19:03:15/7:11:27, loss=0.361352939768641, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=1.903213378542381, lr=0.005965141145422438
2023-12-22 09:32:25   INFO  epoch: 17/24, acc_iter=61839, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:10, time_cost(all): 19:04:10/6:51:14, loss=0.361156648449298, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=1.8753217723303348, lr=0.005942614744480535
2023-12-22 09:33:21   INFO  epoch: 17/24, acc_iter=61889, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:27:14, time_cost(all): 19:05:06/7:06:49, loss=0.360960357129954, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=2.84922530735661, lr=0.005920088343538629
2023-12-22 09:34:17   INFO  epoch: 17/24, acc_iter=61939, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:26:05, time_cost(all): 19:06:02/7:07:29, loss=0.360764065810611, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=1.9008916817789485, lr=0.005897561942596725
2023-12-22 09:35:13   INFO  epoch: 17/24, acc_iter=61989, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:25:05, time_cost(all): 19:06:58/7:05:19, loss=0.360567774491267, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=3.313563056325153, lr=0.005875035541654822
2023-12-22 09:36:08   INFO  epoch: 17/24, acc_iter=62039, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:22:25, time_cost(all): 19:07:53/6:53:35, loss=0.360371483171924, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=0.5198756259217803, lr=0.005852509140712919
2023-12-22 09:37:04   INFO  epoch: 17/24, acc_iter=62089, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:21:32, time_cost(all): 19:08:49/7:02:36, loss=0.360175191852581, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=0.8226304505951489, lr=0.005829982739771013
2023-12-22 09:38:00   INFO  epoch: 17/24, acc_iter=62139, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:05, time_cost(all): 19:09:45/6:40:57, loss=0.359978900533237, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=1.3911559873324582, lr=0.00580745633882911
2023-12-22 09:38:56   INFO  epoch: 17/24, acc_iter=62189, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:37, time_cost(all): 19:10:41/6:48:06, loss=0.359782609213894, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=2.074865673583015, lr=0.005784929937887207
2023-12-22 09:39:52   INFO  epoch: 17/24, acc_iter=62239, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:12, time_cost(all): 19:11:37/6:50:32, loss=0.35958631789455, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.1881894880544501, lr=0.005762403536945303
2023-12-22 09:40:47   INFO  epoch: 17/24, acc_iter=62289, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:13, time_cost(all): 19:12:32/6:38:08, loss=0.359390026575207, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=2.8743796233591254, lr=0.005739877136003397
2023-12-22 09:41:43   INFO  epoch: 17/24, acc_iter=62339, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:00, time_cost(all): 19:13:28/6:37:42, loss=0.359193735255863, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=1.8678368285899705, lr=0.005717350735061494
2023-12-22 09:42:39   INFO  epoch: 17/24, acc_iter=62389, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:33, time_cost(all): 19:14:24/6:45:59, loss=0.35899744393652, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=0.9451099915341173, lr=0.005694824334119591
2023-12-22 09:43:35   INFO  epoch: 17/24, acc_iter=62439, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:01, time_cost(all): 19:15:20/6:40:25, loss=0.358801152617176, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=0.7998293491275243, lr=0.005672297933177688
2023-12-22 09:44:30   INFO  epoch: 17/24, acc_iter=62489, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:27, time_cost(all): 19:16:15/6:43:36, loss=0.358604861297833, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.8675923856431784, lr=0.005649771532235781
2023-12-22 09:45:26   INFO  epoch: 17/24, acc_iter=62539, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:13:53, time_cost(all): 19:17:11/7:03:13, loss=0.35840856997849, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=4.320607742545236, lr=0.005627245131293878
2023-12-22 09:46:22   INFO  epoch: 17/24, acc_iter=62589, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:06, time_cost(all): 19:18:07/6:43:56, loss=0.358212278659146, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=2.456339522691807, lr=0.005604718730351975
2023-12-22 09:47:18   INFO  epoch: 17/24, acc_iter=62639, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:09, time_cost(all): 19:19:03/7:08:55, loss=0.358015987339803, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=4.991151197877665, lr=0.005582192329410072
2023-12-22 09:48:13   INFO  epoch: 17/24, acc_iter=62689, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:28, time_cost(all): 19:19:58/6:39:34, loss=0.357819696020459, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=3.6646136530272133, lr=0.005559665928468165
2023-12-22 09:49:09   INFO  epoch: 17/24, acc_iter=62739, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:48, time_cost(all): 19:20:54/6:38:14, loss=0.357623404701116, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=1.199727252696095, lr=0.005537139527526262
2023-12-22 09:50:05   INFO  epoch: 17/24, acc_iter=62789, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:57, time_cost(all): 19:21:50/6:56:43, loss=0.357427113381772, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=1.6541780622674944, lr=0.005514613126584359
2023-12-22 09:51:01   INFO  epoch: 17/24, acc_iter=62839, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:39, time_cost(all): 19:22:46/6:57:35, loss=0.357230822062429, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=1.2700795933140432, lr=0.005492086725642456
2023-12-22 09:51:57   INFO  epoch: 17/24, acc_iter=62889, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:31, time_cost(all): 19:23:42/6:58:32, loss=0.357034530743086, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=0.9272726623144403, lr=0.005469560324700549
2023-12-22 09:52:52   INFO  epoch: 17/24, acc_iter=62939, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:31, time_cost(all): 19:24:37/6:49:40, loss=0.356838239423742, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=3.073353865647451, lr=0.005447033923758646
2023-12-22 09:53:48   INFO  epoch: 17/24, acc_iter=62989, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:37, time_cost(all): 19:25:33/6:59:56, loss=0.356641948104399, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=3.360797145865519, lr=0.005424507522816743
2023-12-22 09:54:44   INFO  epoch: 17/24, acc_iter=63039, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:54, time_cost(all): 19:26:29/6:54:34, loss=0.356445656785055, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=2.597857107131475, lr=0.00540198112187484
2023-12-22 09:55:40   INFO  epoch: 17/24, acc_iter=63089, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:05, time_cost(all): 19:27:25/6:58:13, loss=0.356249365465712, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=4.7434823051615895, lr=0.005379454720932937
2023-12-22 09:56:35   INFO  epoch: 17/24, acc_iter=63139, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:09, time_cost(all): 19:28:20/6:36:54, loss=0.356053074146368, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=0.6146249285643491, lr=0.005356928319991031
2023-12-22 09:57:31   INFO  epoch: 17/24, acc_iter=63189, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:14, time_cost(all): 19:29:16/6:52:29, loss=0.355856782827025, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=3.9633539739261927, lr=0.005334401919049127
2023-12-22 09:58:27   INFO  epoch: 17/24, acc_iter=63239, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:12, time_cost(all): 19:30:12/6:21:18, loss=0.355660491507682, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=4.037093871919953, lr=0.005311875518107224
2023-12-22 09:59:23   INFO  epoch: 17/24, acc_iter=63289, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 19:31:08/6:27:16, loss=0.355464200188338, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=3.5396017079042976, lr=0.005289349117165321
2023-12-22 10:00:18   INFO  epoch: 18/24, acc_iter=63356, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:05:38, time_cost(all): 19:32:03/6:20:21, loss=0.355201169820418, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=1.2964821242090283, lr=0.005259163739903171
2023-12-22 10:01:14   INFO  epoch: 18/24, acc_iter=63406, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:05:54, time_cost(all): 19:32:59/6:36:44, loss=0.355004878501074, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=1.9103785102181692, lr=0.005236637338961268
2023-12-22 10:02:10   INFO  epoch: 18/24, acc_iter=63456, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:01:05, time_cost(all): 19:33:55/6:49:45, loss=0.354808587181731, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=2.9319338665594943, lr=0.005214110938019365
2023-12-22 10:03:06   INFO  epoch: 18/24, acc_iter=63506, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/0:59:51, time_cost(all): 19:34:51/6:19:28, loss=0.354612295862388, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=1.563065666443151, lr=0.005191584537077461
2023-12-22 10:04:02   INFO  epoch: 18/24, acc_iter=63556, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/0:58:07, time_cost(all): 19:35:47/6:51:51, loss=0.354416004543044, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=4.677581428357275, lr=0.005169058136135555
2023-12-22 10:04:57   INFO  epoch: 18/24, acc_iter=63606, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:01:48, time_cost(all): 19:36:42/6:26:27, loss=0.354219713223701, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=1.372152360729343, lr=0.005146531735193652
2023-12-22 10:05:53   INFO  epoch: 18/24, acc_iter=63656, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/1:00:01, time_cost(all): 19:37:38/6:16:26, loss=0.354023421904357, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=3.0975754016541175, lr=0.005124005334251749
2023-12-22 10:06:49   INFO  epoch: 18/24, acc_iter=63706, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:58:07, time_cost(all): 19:38:34/6:11:15, loss=0.353827130585014, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=1.892607136273077, lr=0.005101478933309846
2023-12-22 10:07:45   INFO  epoch: 18/24, acc_iter=63756, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:55:40, time_cost(all): 19:39:30/6:14:23, loss=0.35363083926567, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=0.7698618062755814, lr=0.005078952532367939
2023-12-22 10:08:40   INFO  epoch: 18/24, acc_iter=63806, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:55:42, time_cost(all): 19:40:25/6:22:55, loss=0.353434547946327, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=3.4143492453453446, lr=0.005056426131426036
2023-12-22 10:09:36   INFO  epoch: 18/24, acc_iter=63856, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:55:45, time_cost(all): 19:41:21/6:32:07, loss=0.353238256626984, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=4.263176541058316, lr=0.005033899730484133
2023-12-22 10:10:32   INFO  epoch: 18/24, acc_iter=63906, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:42, time_cost(all): 19:42:17/6:26:41, loss=0.35304196530764, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=1.185969413875405, lr=0.00501137332954223
2023-12-22 10:11:28   INFO  epoch: 18/24, acc_iter=63956, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:54:36, time_cost(all): 19:43:13/6:43:10, loss=0.352845673988297, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=2.0122902696455274, lr=0.004988846928600323
2023-12-22 10:12:23   INFO  epoch: 18/24, acc_iter=64006, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:51:20, time_cost(all): 19:44:08/6:18:47, loss=0.352649382668953, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=4.498072573297556, lr=0.00496632052765842
2023-12-22 10:13:19   INFO  epoch: 18/24, acc_iter=64056, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:53:30, time_cost(all): 19:45:04/6:09:47, loss=0.35245309134961, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=3.6283824118192407, lr=0.004943794126716517
2023-12-22 10:14:15   INFO  epoch: 18/24, acc_iter=64106, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:25, time_cost(all): 19:46:00/6:04:02, loss=0.352256800030266, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=3.8416249581002915, lr=0.004921267725774614
2023-12-22 10:15:11   INFO  epoch: 18/24, acc_iter=64156, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:49:20, time_cost(all): 19:46:56/6:34:26, loss=0.352060508710923, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=1.685288561899507, lr=0.004898741324832707
2023-12-22 10:16:07   INFO  epoch: 18/24, acc_iter=64206, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:49:52, time_cost(all): 19:47:52/6:17:21, loss=0.35186421739158, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=4.858917091995543, lr=0.004876214923890804
2023-12-22 10:17:02   INFO  epoch: 18/24, acc_iter=64256, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:45:58, time_cost(all): 19:48:47/6:27:31, loss=0.351667926072236, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=3.387412193332691, lr=0.004853688522948901
2023-12-22 10:17:58   INFO  epoch: 18/24, acc_iter=64306, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:48:33, time_cost(all): 19:49:43/6:03:18, loss=0.351471634752893, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=2.8838640545724994, lr=0.004831162122006998
2023-12-22 10:18:54   INFO  epoch: 18/24, acc_iter=64356, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:44:36, time_cost(all): 19:50:39/6:22:53, loss=0.351275343433549, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=4.351566539588056, lr=0.004808635721065092
2023-12-22 10:19:50   INFO  epoch: 18/24, acc_iter=64406, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:12, time_cost(all): 19:51:35/6:30:16, loss=0.351079052114206, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=3.858921404598583, lr=0.004786109320123189
2023-12-22 10:20:45   INFO  epoch: 18/24, acc_iter=64456, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:42:13, time_cost(all): 19:52:30/6:27:58, loss=0.350882760794862, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=0.8238019543918078, lr=0.004763582919181285
2023-12-22 10:21:41   INFO  epoch: 18/24, acc_iter=64506, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:50, time_cost(all): 19:53:26/6:20:37, loss=0.350686469475519, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=2.168375588258935, lr=0.004741056518239382
2023-12-22 10:22:37   INFO  epoch: 18/24, acc_iter=64556, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:40:15, time_cost(all): 19:54:22/6:09:54, loss=0.350490178156176, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=2.538605557822165, lr=0.004718530117297476
2023-12-22 10:23:33   INFO  epoch: 18/24, acc_iter=64606, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:40:12, time_cost(all): 19:55:18/6:21:36, loss=0.350293886836832, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=3.960133641222022, lr=0.004696003716355573
2023-12-22 10:24:28   INFO  epoch: 18/24, acc_iter=64656, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:39:52, time_cost(all): 19:56:13/6:13:40, loss=0.350097595517489, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=4.200801290013477, lr=0.00467347731541367
2023-12-22 10:25:24   INFO  epoch: 18/24, acc_iter=64706, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:40:25, time_cost(all): 19:57:09/6:14:02, loss=0.349901304198145, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=2.874907232217279, lr=0.004650950914471767
2023-12-22 10:26:20   INFO  epoch: 18/24, acc_iter=64756, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:36:53, time_cost(all): 19:58:05/5:58:58, loss=0.349705012878802, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=3.7282144288466066, lr=0.00462842451352986
2023-12-22 10:27:16   INFO  epoch: 18/24, acc_iter=64806, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:30, time_cost(all): 19:59:01/6:15:21, loss=0.349508721559458, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.39871063023013, lr=0.004605898112587957
2023-12-22 10:28:12   INFO  epoch: 18/24, acc_iter=64856, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:35:22, time_cost(all): 19:59:57/6:07:30, loss=0.349312430240115, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=3.6677413249996422, lr=0.004583371711646054
2023-12-22 10:29:07   INFO  epoch: 18/24, acc_iter=64906, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:35:35, time_cost(all): 20:00:52/5:55:29, loss=0.349116138920771, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=2.535322438619034, lr=0.004560845310704151
2023-12-22 10:30:03   INFO  epoch: 18/24, acc_iter=64956, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:33:47, time_cost(all): 20:01:48/6:03:48, loss=0.348919847601428, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=2.21931138482136, lr=0.004538318909762244
2023-12-22 10:30:59   INFO  epoch: 18/24, acc_iter=65006, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:32:07, time_cost(all): 20:02:44/5:48:26, loss=0.348723556282085, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=0.526980791994387, lr=0.004515792508820341
2023-12-22 10:31:55   INFO  epoch: 18/24, acc_iter=65056, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:49, time_cost(all): 20:03:40/6:03:26, loss=0.348527264962741, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=0.9381891267719755, lr=0.004493266107878438
2023-12-22 10:32:50   INFO  epoch: 18/24, acc_iter=65106, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:30:25, time_cost(all): 20:04:35/5:58:23, loss=0.348330973643398, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=3.238150776790888, lr=0.004470739706936535
2023-12-22 10:33:46   INFO  epoch: 18/24, acc_iter=65156, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:32:20, time_cost(all): 20:05:31/5:45:29, loss=0.348134682324054, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=3.1786787546063104, lr=0.004448213305994628
2023-12-22 10:34:42   INFO  epoch: 18/24, acc_iter=65206, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:28, time_cost(all): 20:06:27/5:55:00, loss=0.347938391004711, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.7578815063003206, lr=0.004425686905052725
2023-12-22 10:35:38   INFO  epoch: 18/24, acc_iter=65256, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:34, time_cost(all): 20:07:23/5:58:17, loss=0.347742099685367, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=2.9345400784040825, lr=0.004403160504110822
2023-12-22 10:36:33   INFO  epoch: 18/24, acc_iter=65306, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:29:27, time_cost(all): 20:08:18/6:00:57, loss=0.347545808366024, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.210091453726974, lr=0.004380634103168919
2023-12-22 10:37:29   INFO  epoch: 18/24, acc_iter=65356, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:28, time_cost(all): 20:09:14/5:50:51, loss=0.347349517046681, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=3.848727000133728, lr=0.004358107702227013
2023-12-22 10:38:25   INFO  epoch: 18/24, acc_iter=65406, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:25:59, time_cost(all): 20:10:10/5:56:39, loss=0.347153225727337, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.3544922380028455, lr=0.00433558130128511
2023-12-22 10:39:21   INFO  epoch: 18/24, acc_iter=65456, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:35, time_cost(all): 20:11:06/6:04:28, loss=0.346956934407994, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=3.639006209422585, lr=0.004313054900343206
2023-12-22 10:40:17   INFO  epoch: 18/24, acc_iter=65506, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:25:05, time_cost(all): 20:12:02/5:54:47, loss=0.34676064308865, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=2.3091734072450105, lr=0.004290528499401303
2023-12-22 10:41:12   INFO  epoch: 18/24, acc_iter=65556, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:22, time_cost(all): 20:12:57/5:39:53, loss=0.346564351769307, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=1.7298994203651976, lr=0.004268002098459397
2023-12-22 10:42:08   INFO  epoch: 18/24, acc_iter=65606, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:04, time_cost(all): 20:13:53/5:42:01, loss=0.346368060449963, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=3.125522054285997, lr=0.004245475697517494
2023-12-22 10:43:04   INFO  epoch: 18/24, acc_iter=65656, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:22:24, time_cost(all): 20:14:49/5:51:07, loss=0.34617176913062, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=3.2275456143012478, lr=0.004222949296575591
2023-12-22 10:44:00   INFO  epoch: 18/24, acc_iter=65706, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:27, time_cost(all): 20:15:45/5:38:49, loss=0.345975477811277, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=2.0610089507182456, lr=0.004200422895633688
2023-12-22 10:44:55   INFO  epoch: 18/24, acc_iter=65756, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:06, time_cost(all): 20:16:40/5:50:43, loss=0.345779186491933, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=2.067027015819737, lr=0.004177896494691781
2023-12-22 10:45:51   INFO  epoch: 18/24, acc_iter=65806, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:27, time_cost(all): 20:17:36/6:05:57, loss=0.34558289517259, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=4.7474291764614085, lr=0.004155370093749878
2023-12-22 10:46:47   INFO  epoch: 18/24, acc_iter=65856, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:19, time_cost(all): 20:18:32/5:42:04, loss=0.345386603853246, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.0569559944643547, lr=0.004132843692807975
2023-12-22 10:47:43   INFO  epoch: 18/24, acc_iter=65906, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:12, time_cost(all): 20:19:28/5:37:41, loss=0.345190312533903, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=3.4479486917609976, lr=0.004110317291866072
2023-12-22 10:48:38   INFO  epoch: 18/24, acc_iter=65956, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:01, time_cost(all): 20:20:23/5:33:42, loss=0.344994021214559, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=4.5436808456018944, lr=0.004087790890924165
2023-12-22 10:49:34   INFO  epoch: 18/24, acc_iter=66006, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:52, time_cost(all): 20:21:19/5:57:07, loss=0.344797729895216, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=1.6905408678527745, lr=0.004065264489982262
2023-12-22 10:50:30   INFO  epoch: 18/24, acc_iter=66056, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:10, time_cost(all): 20:22:15/5:57:46, loss=0.344601438575873, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=3.0345317057357155, lr=0.004042738089040359
2023-12-22 10:51:26   INFO  epoch: 18/24, acc_iter=66106, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:55, time_cost(all): 20:23:11/5:29:33, loss=0.344405147256529, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=3.2188859950945075, lr=0.004020211688098456
2023-12-22 10:52:21   INFO  epoch: 18/24, acc_iter=66156, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:10, time_cost(all): 20:24:06/6:01:06, loss=0.344208855937186, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=1.592673302764866, lr=0.003997685287156549
2023-12-22 10:53:17   INFO  epoch: 18/24, acc_iter=66206, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:41, time_cost(all): 20:25:02/5:28:56, loss=0.344012564617842, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=3.2215513032909757, lr=0.003975158886214646
2023-12-22 10:54:13   INFO  epoch: 18/24, acc_iter=66256, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:45, time_cost(all): 20:25:58/5:36:43, loss=0.343816273298499, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=0.8864071085718082, lr=0.003952632485272743
2023-12-22 10:55:09   INFO  epoch: 18/24, acc_iter=66306, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:52, time_cost(all): 20:26:54/5:35:03, loss=0.343619981979155, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=0.7081121492314293, lr=0.00393010608433084
2023-12-22 10:56:05   INFO  epoch: 18/24, acc_iter=66356, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:53, time_cost(all): 20:27:50/5:26:29, loss=0.343423690659812, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=1.9901969429352906, lr=0.003907579683388934
2023-12-22 10:57:00   INFO  epoch: 18/24, acc_iter=66406, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:08:01, time_cost(all): 20:28:45/5:26:45, loss=0.343227399340469, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=4.708833446622717, lr=0.00388505328244703
2023-12-22 10:57:56   INFO  epoch: 18/24, acc_iter=66456, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:39, time_cost(all): 20:29:41/5:25:57, loss=0.343031108021125, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=4.950841575302881, lr=0.003862526881505127
2023-12-22 10:58:52   INFO  epoch: 18/24, acc_iter=66506, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:47, time_cost(all): 20:30:37/5:23:35, loss=0.342834816701782, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=4.688807547529298, lr=0.003840000480563224
2023-12-22 10:59:48   INFO  epoch: 18/24, acc_iter=66556, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:58, time_cost(all): 20:31:33/5:26:31, loss=0.342638525382438, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=4.180119695633758, lr=0.003817474079621318
2023-12-22 11:00:43   INFO  epoch: 18/24, acc_iter=66606, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:08, time_cost(all): 20:32:28/5:28:53, loss=0.342442234063095, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=2.376509672064328, lr=0.003794947678679415
2023-12-22 11:01:39   INFO  epoch: 18/24, acc_iter=66656, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:05, time_cost(all): 20:33:24/5:25:28, loss=0.342245942743751, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=2.872826438805109, lr=0.003772421277737512
2023-12-22 11:02:35   INFO  epoch: 18/24, acc_iter=66706, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:07, time_cost(all): 20:34:20/5:36:31, loss=0.342049651424408, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=4.094331672792021, lr=0.003749894876795608
2023-12-22 11:03:31   INFO  epoch: 18/24, acc_iter=66756, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:17, time_cost(all): 20:35:16/5:42:37, loss=0.341853360105064, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=4.941430634519282, lr=0.003727368475853702
2023-12-22 11:04:26   INFO  epoch: 18/24, acc_iter=66806, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 20:36:11/5:45:43, loss=0.341657068785721, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=3.2200488720007012, lr=0.003704842074911799
2023-12-22 11:05:22   INFO  epoch: 19/24, acc_iter=66873, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:01:51, time_cost(all): 20:37:07/5:39:03, loss=0.341394038417801, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=2.968499942420858, lr=0.003674656697649648
2023-12-22 11:06:18   INFO  epoch: 19/24, acc_iter=66923, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:05:34, time_cost(all): 20:38:03/5:16:39, loss=0.341197747098457, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=4.306049445267573, lr=0.003652130296707745
2023-12-22 11:07:14   INFO  epoch: 19/24, acc_iter=66973, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:05:12, time_cost(all): 20:38:59/5:37:40, loss=0.341001455779114, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=2.2642058794224953, lr=0.003629603895765839
2023-12-22 11:08:10   INFO  epoch: 19/24, acc_iter=67023, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:03:23, time_cost(all): 20:39:55/5:31:26, loss=0.340805164459771, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=1.6100382385446104, lr=0.003607077494823935
2023-12-22 11:09:05   INFO  epoch: 19/24, acc_iter=67073, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:02:31, time_cost(all): 20:40:50/5:23:06, loss=0.340608873140427, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=3.9975805735589396, lr=0.003584551093882032
2023-12-22 11:10:01   INFO  epoch: 19/24, acc_iter=67123, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:00:08, time_cost(all): 20:41:46/5:11:43, loss=0.340412581821084, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=4.240449582970255, lr=0.003562024692940129
2023-12-22 11:10:57   INFO  epoch: 19/24, acc_iter=67173, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:58:28, time_cost(all): 20:42:42/5:36:45, loss=0.34021629050174, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=1.4999991948824238, lr=0.003539498291998223
2023-12-22 11:11:53   INFO  epoch: 19/24, acc_iter=67223, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:57:50, time_cost(all): 20:43:38/5:29:42, loss=0.340019999182397, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=2.979106931390099, lr=0.00351697189105632
2023-12-22 11:12:48   INFO  epoch: 19/24, acc_iter=67273, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:54:35, time_cost(all): 20:44:33/5:37:29, loss=0.339823707863053, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=1.8715268545007184, lr=0.003494445490114417
2023-12-22 11:13:44   INFO  epoch: 19/24, acc_iter=67323, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:57:19, time_cost(all): 20:45:29/5:35:46, loss=0.33962741654371, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=3.3411101996328028, lr=0.003471919089172514
2023-12-22 11:14:40   INFO  epoch: 19/24, acc_iter=67373, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:53:03, time_cost(all): 20:46:25/5:32:19, loss=0.339431125224367, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=2.3380573743291615, lr=0.00344939268823061
2023-12-22 11:15:36   INFO  epoch: 19/24, acc_iter=67423, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:56:37, time_cost(all): 20:47:21/5:08:50, loss=0.339234833905023, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=2.8334634013687046, lr=0.003426866287288704
2023-12-22 11:16:31   INFO  epoch: 19/24, acc_iter=67473, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:04, time_cost(all): 20:48:16/5:27:33, loss=0.33903854258568, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=2.2817553145560256, lr=0.003404339886346801
2023-12-22 11:17:27   INFO  epoch: 19/24, acc_iter=67523, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:52:09, time_cost(all): 20:49:12/5:09:34, loss=0.338842251266336, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=0.8000635042006565, lr=0.003381813485404898
2023-12-22 11:18:23   INFO  epoch: 19/24, acc_iter=67573, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:51:12, time_cost(all): 20:50:08/5:28:58, loss=0.338645959946993, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=2.2794193288547513, lr=0.003359287084462995
2023-12-22 11:19:19   INFO  epoch: 19/24, acc_iter=67623, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:13, time_cost(all): 20:51:04/5:07:45, loss=0.338449668627649, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=4.942450786893271, lr=0.003336760683521092
2023-12-22 11:20:15   INFO  epoch: 19/24, acc_iter=67673, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:49:39, time_cost(all): 20:52:00/5:29:38, loss=0.338253377308306, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=0.6649537174887519, lr=0.003314234282579185
2023-12-22 11:21:10   INFO  epoch: 19/24, acc_iter=67723, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:50:54, time_cost(all): 20:52:55/5:10:42, loss=0.338057085988963, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=1.9514123469474465, lr=0.003291707881637282
2023-12-22 11:22:06   INFO  epoch: 19/24, acc_iter=67773, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:45:55, time_cost(all): 20:53:51/5:23:20, loss=0.337860794669619, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.0324928650415433, lr=0.003269181480695379
2023-12-22 11:23:02   INFO  epoch: 19/24, acc_iter=67823, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:48:01, time_cost(all): 20:54:47/5:06:24, loss=0.337664503350276, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=0.95394787373152, lr=0.003246655079753476
2023-12-22 11:23:58   INFO  epoch: 19/24, acc_iter=67873, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:46:30, time_cost(all): 20:55:43/5:05:58, loss=0.337468212030932, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=4.730882033126881, lr=0.003224128678811573
2023-12-22 11:24:53   INFO  epoch: 19/24, acc_iter=67923, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:44:59, time_cost(all): 20:56:38/5:05:17, loss=0.337271920711589, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=4.031165400778457, lr=0.00320160227786967
2023-12-22 11:25:49   INFO  epoch: 19/24, acc_iter=67973, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:44:30, time_cost(all): 20:57:34/5:12:16, loss=0.337075629392245, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=0.8720003198496042, lr=0.003179075876927763
2023-12-22 11:26:45   INFO  epoch: 19/24, acc_iter=68023, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:41:58, time_cost(all): 20:58:30/5:09:53, loss=0.336879338072902, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=0.8770389186247372, lr=0.00315654947598586
2023-12-22 11:27:41   INFO  epoch: 19/24, acc_iter=68073, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:42:53, time_cost(all): 20:59:26/5:16:10, loss=0.336683046753558, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=4.176341902186075, lr=0.003134023075043957
2023-12-22 11:28:36   INFO  epoch: 19/24, acc_iter=68123, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:40:29, time_cost(all): 21:00:21/5:05:41, loss=0.336486755434215, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=4.011175543467724, lr=0.003111496674102054
2023-12-22 11:29:32   INFO  epoch: 19/24, acc_iter=68173, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:09, time_cost(all): 21:01:17/4:58:09, loss=0.336290464114872, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=4.950713565894055, lr=0.003088970273160151
2023-12-22 11:30:28   INFO  epoch: 19/24, acc_iter=68223, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:58, time_cost(all): 21:02:13/4:56:10, loss=0.336094172795528, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=1.6460421508801035, lr=0.003066443872218244
2023-12-22 11:31:24   INFO  epoch: 19/24, acc_iter=68273, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:38:00, time_cost(all): 21:03:09/5:12:56, loss=0.335897881476185, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=3.3048307407417976, lr=0.003043917471276341
2023-12-22 11:32:20   INFO  epoch: 19/24, acc_iter=68323, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:37:44, time_cost(all): 21:04:05/4:50:25, loss=0.335701590156841, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=1.1616292448554266, lr=0.003021391070334438
2023-12-22 11:33:15   INFO  epoch: 19/24, acc_iter=68373, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:34:59, time_cost(all): 21:05:00/4:50:57, loss=0.335505298837498, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=2.3502527973474243, lr=0.002999407290638748
2023-12-22 11:34:11   INFO  epoch: 19/24, acc_iter=68423, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:35:10, time_cost(all): 21:05:56/5:10:05, loss=0.335309007518154, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=4.505035267519329, lr=0.002987647184264666
2023-12-22 11:35:07   INFO  epoch: 19/24, acc_iter=68473, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:35:50, time_cost(all): 21:06:52/5:11:48, loss=0.335112716198811, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=4.4029429516994165, lr=0.002975887077890584
2023-12-22 11:36:03   INFO  epoch: 19/24, acc_iter=68523, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:33:07, time_cost(all): 21:07:48/5:05:27, loss=0.334916424879468, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=1.8552733499543133, lr=0.002964126971516502
2023-12-22 11:36:58   INFO  epoch: 19/24, acc_iter=68573, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:32:19, time_cost(all): 21:08:43/4:58:18, loss=0.334720133560124, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=0.8156447657368227, lr=0.00295236686514242
2023-12-22 11:37:54   INFO  epoch: 19/24, acc_iter=68623, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:32:24, time_cost(all): 21:09:39/5:08:54, loss=0.334523842240781, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=4.478536240153413, lr=0.002940606758768337
2023-12-22 11:38:50   INFO  epoch: 19/24, acc_iter=68673, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:44, time_cost(all): 21:10:35/5:03:50, loss=0.334327550921437, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=1.9496486249702834, lr=0.002928846652394255
2023-12-22 11:39:46   INFO  epoch: 19/24, acc_iter=68723, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:44, time_cost(all): 21:11:31/5:00:50, loss=0.334131259602094, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=1.0271025374122507, lr=0.002917086546020173
2023-12-22 11:40:41   INFO  epoch: 19/24, acc_iter=68773, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:27:56, time_cost(all): 21:12:26/5:01:47, loss=0.33393496828275, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=2.7166115517133993, lr=0.002905326439646091
2023-12-22 11:41:37   INFO  epoch: 19/24, acc_iter=68823, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:00, time_cost(all): 21:13:22/4:57:20, loss=0.333738676963407, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=2.7708278678092038, lr=0.002893566333272009
2023-12-22 11:42:33   INFO  epoch: 19/24, acc_iter=68873, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:50, time_cost(all): 21:14:18/5:05:12, loss=0.333542385644064, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.012101144093611, lr=0.002881806226897926
2023-12-22 11:43:29   INFO  epoch: 19/24, acc_iter=68923, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:27:06, time_cost(all): 21:15:14/4:49:30, loss=0.33334609432472, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=1.5531121038627183, lr=0.002870046120523844
2023-12-22 11:44:25   INFO  epoch: 19/24, acc_iter=68973, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:26:27, time_cost(all): 21:16:10/5:03:50, loss=0.333149803005377, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=4.6902140617347525, lr=0.002858286014149762
2023-12-22 11:45:20   INFO  epoch: 19/24, acc_iter=69023, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:46, time_cost(all): 21:17:05/4:42:10, loss=0.332953511686033, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=0.5304786802488275, lr=0.00284652590777568
2023-12-22 11:46:16   INFO  epoch: 19/24, acc_iter=69073, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:01, time_cost(all): 21:18:01/4:47:59, loss=0.33275722036669, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=3.3698053317212673, lr=0.002834765801401597
2023-12-22 11:47:12   INFO  epoch: 19/24, acc_iter=69123, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:13, time_cost(all): 21:18:57/4:51:42, loss=0.332560929047346, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=3.609338236073894, lr=0.002823005695027515
2023-12-22 11:48:08   INFO  epoch: 19/24, acc_iter=69173, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:20:51, time_cost(all): 21:19:53/4:51:33, loss=0.332364637728003, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=1.2463572604679651, lr=0.002811245588653433
2023-12-22 11:49:03   INFO  epoch: 19/24, acc_iter=69223, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:00, time_cost(all): 21:20:48/5:01:19, loss=0.332168346408659, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=1.5690950488008812, lr=0.002799485482279351
2023-12-22 11:49:59   INFO  epoch: 19/24, acc_iter=69273, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:01, time_cost(all): 21:21:44/4:33:35, loss=0.331972055089316, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=3.154392717817599, lr=0.002787725375905269
2023-12-22 11:50:55   INFO  epoch: 19/24, acc_iter=69323, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:36, time_cost(all): 21:22:40/4:33:07, loss=0.331775763769973, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=1.4732811203511407, lr=0.002775965269531186
2023-12-22 11:51:51   INFO  epoch: 19/24, acc_iter=69373, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:39, time_cost(all): 21:23:36/4:48:04, loss=0.331579472450629, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=2.4395279528382625, lr=0.002764205163157104
2023-12-22 11:52:46   INFO  epoch: 19/24, acc_iter=69423, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:53, time_cost(all): 21:24:31/4:47:24, loss=0.331383181131286, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=4.316640514914854, lr=0.002752445056783022
2023-12-22 11:53:42   INFO  epoch: 19/24, acc_iter=69473, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:54, time_cost(all): 21:25:27/4:41:49, loss=0.331186889811942, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=1.511119579676162, lr=0.00274068495040894
2023-12-22 11:54:38   INFO  epoch: 19/24, acc_iter=69523, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:34, time_cost(all): 21:26:23/4:40:46, loss=0.330990598492599, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=0.6431880203465367, lr=0.002728924844034857
2023-12-22 11:55:34   INFO  epoch: 19/24, acc_iter=69573, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:40, time_cost(all): 21:27:19/4:45:52, loss=0.330794307173255, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=2.6197607987843536, lr=0.002717164737660776
2023-12-22 11:56:30   INFO  epoch: 19/24, acc_iter=69623, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:12:49, time_cost(all): 21:28:15/4:32:10, loss=0.330598015853912, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=4.426341360913987, lr=0.002705404631286693
2023-12-22 11:57:25   INFO  epoch: 19/24, acc_iter=69673, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:46, time_cost(all): 21:29:10/4:47:41, loss=0.330401724534569, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=3.3018323276679222, lr=0.002693644524912611
2023-12-22 11:58:21   INFO  epoch: 19/24, acc_iter=69723, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:30, time_cost(all): 21:30:06/4:32:07, loss=0.330205433215225, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=2.9832114049976424, lr=0.002681884418538529
2023-12-22 11:59:17   INFO  epoch: 19/24, acc_iter=69773, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:47, time_cost(all): 21:31:02/4:36:41, loss=0.330009141895882, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=1.688985337499874, lr=0.002670124312164446
2023-12-22 12:00:13   INFO  epoch: 19/24, acc_iter=69823, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:09, time_cost(all): 21:31:58/4:41:08, loss=0.329812850576538, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=3.272832253336469, lr=0.002658364205790364
2023-12-22 12:01:08   INFO  epoch: 19/24, acc_iter=69873, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:38, time_cost(all): 21:32:53/4:46:21, loss=0.329616559257195, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=1.3074274855057255, lr=0.002646604099416282
2023-12-22 12:02:04   INFO  epoch: 19/24, acc_iter=69923, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:48, time_cost(all): 21:33:49/4:45:35, loss=0.329420267937851, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=3.423136916503667, lr=0.0026348439930422
2023-12-22 12:03:00   INFO  epoch: 19/24, acc_iter=69973, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:43, time_cost(all): 21:34:45/4:21:32, loss=0.329223976618508, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=2.1012000901724495, lr=0.002623083886668118
2023-12-22 12:03:56   INFO  epoch: 19/24, acc_iter=70023, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:37, time_cost(all): 21:35:41/4:21:53, loss=0.329027685299165, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.236579758752111, lr=0.002611323780294036
2023-12-22 12:04:51   INFO  epoch: 19/24, acc_iter=70073, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:05:09, time_cost(all): 21:36:36/4:19:20, loss=0.328831393979821, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=4.524735311596325, lr=0.002599563673919953
2023-12-22 12:05:47   INFO  epoch: 19/24, acc_iter=70123, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:12, time_cost(all): 21:37:32/4:34:03, loss=0.328635102660478, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=4.3908282526658935, lr=0.002587803567545871
2023-12-22 12:06:43   INFO  epoch: 19/24, acc_iter=70173, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:02, time_cost(all): 21:38:28/4:17:43, loss=0.328438811341134, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=4.663975765409041, lr=0.002576043461171789
2023-12-22 12:07:39   INFO  epoch: 19/24, acc_iter=70223, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:10, time_cost(all): 21:39:24/4:19:52, loss=0.328242520021791, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=3.040277834615111, lr=0.002564283354797707
2023-12-22 12:08:35   INFO  epoch: 19/24, acc_iter=70273, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:18, time_cost(all): 21:40:20/4:37:23, loss=0.328046228702447, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=1.9420273902452951, lr=0.002552523248423625
2023-12-22 12:09:30   INFO  epoch: 19/24, acc_iter=70323, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 21:41:15/4:27:38, loss=0.327849937383104, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=2.5362420456276418, lr=0.002540763142049542
2023-12-22 12:10:26   INFO  epoch: 20/24, acc_iter=70390, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:06:39, time_cost(all): 21:42:11/4:17:05, loss=0.327586907015184, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=2.0876870638414524, lr=0.002525004599508272
2023-12-22 12:11:22   INFO  epoch: 20/24, acc_iter=70440, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:02:52, time_cost(all): 21:43:07/4:32:01, loss=0.32739061569584, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=4.319885895698967, lr=0.00251324449313419
2023-12-22 12:12:18   INFO  epoch: 20/24, acc_iter=70490, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:00:55, time_cost(all): 21:44:03/4:14:17, loss=0.327194324376497, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=4.683959976388813, lr=0.002501484386760108
2023-12-22 12:13:13   INFO  epoch: 20/24, acc_iter=70540, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:01:23, time_cost(all): 21:44:58/4:31:31, loss=0.326998033057153, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.4604253091833037, lr=0.002489724280386026
2023-12-22 12:14:09   INFO  epoch: 20/24, acc_iter=70590, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:01:20, time_cost(all): 21:45:54/4:10:07, loss=0.32680174173781, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=0.6785891755716538, lr=0.002477964174011943
2023-12-22 12:15:05   INFO  epoch: 20/24, acc_iter=70640, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:00:16, time_cost(all): 21:46:50/4:17:20, loss=0.326605450418467, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=4.395144081168906, lr=0.002466204067637861
2023-12-22 12:16:01   INFO  epoch: 20/24, acc_iter=70690, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:57:28, time_cost(all): 21:47:46/4:15:13, loss=0.326409159099123, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=1.9965389751705307, lr=0.002454443961263779
2023-12-22 12:16:56   INFO  epoch: 20/24, acc_iter=70740, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/1:00:09, time_cost(all): 21:48:41/4:22:53, loss=0.32621286777978, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=3.336955021009711, lr=0.002442683854889697
2023-12-22 12:17:52   INFO  epoch: 20/24, acc_iter=70790, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:55:36, time_cost(all): 21:49:37/4:09:19, loss=0.326016576460436, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=1.7317509843964052, lr=0.002430923748515615
2023-12-22 12:18:48   INFO  epoch: 20/24, acc_iter=70840, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:57:57, time_cost(all): 21:50:33/4:29:04, loss=0.325820285141093, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=4.892771861510334, lr=0.002419163642141532
2023-12-22 12:19:44   INFO  epoch: 20/24, acc_iter=70890, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:53:05, time_cost(all): 21:51:29/4:16:14, loss=0.325623993821749, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=3.0445230841700077, lr=0.00240740353576745
2023-12-22 12:20:39   INFO  epoch: 20/24, acc_iter=70940, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:51:46, time_cost(all): 21:52:24/4:12:11, loss=0.325427702502406, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.07(1.03), norm=2.2191236170848465, lr=0.002395643429393368
2023-12-22 12:21:35   INFO  epoch: 20/24, acc_iter=70990, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:51:07, time_cost(all): 21:53:20/4:18:51, loss=0.325231411183063, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=3.3399247567627643, lr=0.002383883323019286
2023-12-22 12:22:31   INFO  epoch: 20/24, acc_iter=71040, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:53:53, time_cost(all): 21:54:16/4:10:21, loss=0.325035119863719, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=3.3691966354277443, lr=0.002372123216645203
2023-12-22 12:23:27   INFO  epoch: 20/24, acc_iter=71090, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:50:47, time_cost(all): 21:55:12/4:14:35, loss=0.324838828544376, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=4.306740468537441, lr=0.002360363110271121
2023-12-22 12:24:23   INFO  epoch: 20/24, acc_iter=71140, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:49:51, time_cost(all): 21:56:08/4:07:43, loss=0.324642537225032, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=3.856877724430375, lr=0.002348603003897039
2023-12-22 12:25:18   INFO  epoch: 20/24, acc_iter=71190, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:48:54, time_cost(all): 21:57:03/4:19:41, loss=0.324446245905689, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.07(1.03), norm=1.2562364994150779, lr=0.002336842897522957
2023-12-22 12:26:14   INFO  epoch: 20/24, acc_iter=71240, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:49:21, time_cost(all): 21:57:59/4:23:30, loss=0.324249954586345, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=3.5766067674939404, lr=0.002325082791148875
2023-12-22 12:27:10   INFO  epoch: 20/24, acc_iter=71290, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:45:38, time_cost(all): 21:58:55/3:59:30, loss=0.324053663267002, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=1.440139884206827, lr=0.002313322684774792
2023-12-22 12:28:06   INFO  epoch: 20/24, acc_iter=71340, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:47:40, time_cost(all): 21:59:51/4:20:14, loss=0.323857371947659, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=4.245553827966535, lr=0.00230156257840071
2023-12-22 12:29:01   INFO  epoch: 20/24, acc_iter=71390, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:47:06, time_cost(all): 22:00:46/4:05:11, loss=0.323661080628315, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=1.025525282052386, lr=0.002289802472026628
2023-12-22 12:29:57   INFO  epoch: 20/24, acc_iter=71440, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:43:46, time_cost(all): 22:01:42/4:07:17, loss=0.323464789308972, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=3.4024732219422837, lr=0.002278042365652546
2023-12-22 12:30:53   INFO  epoch: 20/24, acc_iter=71490, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:43:07, time_cost(all): 22:02:38/3:57:43, loss=0.323268497989628, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=3.6610018323812707, lr=0.002266282259278464
2023-12-22 12:31:49   INFO  epoch: 20/24, acc_iter=71540, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:20, time_cost(all): 22:03:34/3:53:59, loss=0.323072206670285, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=0.8984711219444149, lr=0.002254522152904382
2023-12-22 12:32:44   INFO  epoch: 20/24, acc_iter=71590, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:45, time_cost(all): 22:04:29/3:56:09, loss=0.322875915350941, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=4.102065368729542, lr=0.002242762046530299
2023-12-22 12:33:40   INFO  epoch: 20/24, acc_iter=71640, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:39:27, time_cost(all): 22:05:25/4:07:39, loss=0.322679624031598, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=2.050243500026567, lr=0.002231001940156217
2023-12-22 12:34:36   INFO  epoch: 20/24, acc_iter=71690, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:39:35, time_cost(all): 22:06:21/4:11:46, loss=0.322483332712255, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=1.6563140294146408, lr=0.002219241833782135
2023-12-22 12:35:32   INFO  epoch: 20/24, acc_iter=71740, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:10, time_cost(all): 22:07:17/3:59:06, loss=0.322287041392911, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=2.417461327233412, lr=0.002207481727408052
2023-12-22 12:36:28   INFO  epoch: 20/24, acc_iter=71790, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:38:25, time_cost(all): 22:08:13/4:08:18, loss=0.322090750073568, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=1.104137698216169, lr=0.002195721621033971
2023-12-22 12:37:23   INFO  epoch: 20/24, acc_iter=71840, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:39:16, time_cost(all): 22:09:08/3:50:19, loss=0.321894458754224, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=2.046400838230162, lr=0.002183961514659888
2023-12-22 12:38:19   INFO  epoch: 20/24, acc_iter=71890, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:36:03, time_cost(all): 22:10:04/4:07:17, loss=0.321698167434881, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=3.870652230073823, lr=0.002172201408285806
2023-12-22 12:39:15   INFO  epoch: 20/24, acc_iter=71940, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:17, time_cost(all): 22:11:00/3:51:12, loss=0.321501876115537, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=3.067667060804209, lr=0.002160441301911724
2023-12-22 12:40:11   INFO  epoch: 20/24, acc_iter=71990, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:33:57, time_cost(all): 22:11:56/3:55:33, loss=0.321305584796194, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=4.488122803600708, lr=0.002148681195537642
2023-12-22 12:41:06   INFO  epoch: 20/24, acc_iter=72040, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:32:20, time_cost(all): 22:12:51/4:04:45, loss=0.32110929347685, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.4688507580251513, lr=0.002136921089163559
2023-12-22 12:42:02   INFO  epoch: 20/24, acc_iter=72090, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:36, time_cost(all): 22:13:47/3:44:35, loss=0.320913002157507, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=3.000981875856223, lr=0.002125160982789477
2023-12-22 12:42:58   INFO  epoch: 20/24, acc_iter=72140, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:30:35, time_cost(all): 22:14:43/3:56:44, loss=0.320716710838164, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=2.827399741261175, lr=0.002113400876415395
2023-12-22 12:43:54   INFO  epoch: 20/24, acc_iter=72190, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:29:36, time_cost(all): 22:15:39/3:53:34, loss=0.32052041951882, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=4.688892575087059, lr=0.002101640770041313
2023-12-22 12:44:49   INFO  epoch: 20/24, acc_iter=72240, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:30:35, time_cost(all): 22:16:34/3:55:34, loss=0.320324128199477, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.693756970344938, lr=0.002089880663667231
2023-12-22 12:45:45   INFO  epoch: 20/24, acc_iter=72290, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:28:21, time_cost(all): 22:17:30/3:43:30, loss=0.320127836880133, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.442196772062335, lr=0.002078120557293148
2023-12-22 12:46:41   INFO  epoch: 20/24, acc_iter=72340, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:28:49, time_cost(all): 22:18:26/3:41:45, loss=0.31993154556079, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=3.1319125697258166, lr=0.002066360450919066
2023-12-22 12:47:37   INFO  epoch: 20/24, acc_iter=72390, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:27:37, time_cost(all): 22:19:22/3:48:55, loss=0.319735254241446, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=1.5179704003009487, lr=0.002054600344544984
2023-12-22 12:48:33   INFO  epoch: 20/24, acc_iter=72440, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:17, time_cost(all): 22:20:18/3:59:42, loss=0.319538962922103, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=2.3056388267256684, lr=0.002042840238170902
2023-12-22 12:49:28   INFO  epoch: 20/24, acc_iter=72490, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:25:03, time_cost(all): 22:21:13/3:45:05, loss=0.31934267160276, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=0.983921372016806, lr=0.002031080131796819
2023-12-22 12:50:24   INFO  epoch: 20/24, acc_iter=72540, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:36, time_cost(all): 22:22:09/3:54:49, loss=0.319146380283416, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=0.5443506266265145, lr=0.002019320025422737
2023-12-22 12:51:20   INFO  epoch: 20/24, acc_iter=72590, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:13, time_cost(all): 22:23:05/3:50:30, loss=0.318950088964073, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=0.8496994572971865, lr=0.002007559919048655
2023-12-22 12:52:16   INFO  epoch: 20/24, acc_iter=72640, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:55, time_cost(all): 22:24:01/3:53:51, loss=0.318753797644729, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=2.8207999416551113, lr=0.001995799812674573
2023-12-22 12:53:11   INFO  epoch: 20/24, acc_iter=72690, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:22:16, time_cost(all): 22:24:56/3:37:21, loss=0.318557506325386, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=0.7351652704948619, lr=0.001984039706300491
2023-12-22 12:54:07   INFO  epoch: 20/24, acc_iter=72740, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:21:46, time_cost(all): 22:25:52/3:41:25, loss=0.318361215006042, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=1.2910820589100975, lr=0.001972279599926408
2023-12-22 12:55:03   INFO  epoch: 20/24, acc_iter=72790, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:02, time_cost(all): 22:26:48/3:49:41, loss=0.318164923686699, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=3.701101867551687, lr=0.001960519493552327
2023-12-22 12:55:59   INFO  epoch: 20/24, acc_iter=72840, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:33, time_cost(all): 22:27:44/3:34:03, loss=0.317968632367356, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.815794653026611, lr=0.001948759387178244
2023-12-22 12:56:54   INFO  epoch: 20/24, acc_iter=72890, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:47, time_cost(all): 22:28:39/3:44:37, loss=0.317772341048012, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=3.653626231830565, lr=0.001936999280804162
2023-12-22 12:57:50   INFO  epoch: 20/24, acc_iter=72940, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:50, time_cost(all): 22:29:35/3:41:13, loss=0.317576049728669, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=3.7417526326601798, lr=0.00192523917443008
2023-12-22 12:58:46   INFO  epoch: 20/24, acc_iter=72990, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:14, time_cost(all): 22:30:31/3:37:07, loss=0.317379758409325, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=2.6044651969917747, lr=0.001913479068055998
2023-12-22 12:59:42   INFO  epoch: 20/24, acc_iter=73040, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:55, time_cost(all): 22:31:27/3:43:56, loss=0.317183467089982, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=2.9489492980590333, lr=0.001901718961681915
2023-12-22 13:00:38   INFO  epoch: 20/24, acc_iter=73090, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:13:35, time_cost(all): 22:32:23/3:33:57, loss=0.316987175770638, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.7253053681301855, lr=0.001889958855307833
2023-12-22 13:01:33   INFO  epoch: 20/24, acc_iter=73140, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:33, time_cost(all): 22:33:18/3:41:25, loss=0.316790884451295, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=3.6220520265822285, lr=0.001878198748933751
2023-12-22 13:02:29   INFO  epoch: 20/24, acc_iter=73190, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:17, time_cost(all): 22:34:14/3:33:18, loss=0.316594593131952, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=4.682331970909442, lr=0.001866438642559669
2023-12-22 13:03:25   INFO  epoch: 20/24, acc_iter=73240, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:35, time_cost(all): 22:35:10/3:43:01, loss=0.316398301812608, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=1.9596920406840896, lr=0.001854678536185586
2023-12-22 13:04:21   INFO  epoch: 20/24, acc_iter=73290, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:26, time_cost(all): 22:36:06/3:30:01, loss=0.316202010493265, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=1.3221006824610053, lr=0.001842918429811504
2023-12-22 13:05:16   INFO  epoch: 20/24, acc_iter=73340, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:12, time_cost(all): 22:37:01/3:36:31, loss=0.316005719173921, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=1.6303931639281752, lr=0.001831158323437422
2023-12-22 13:06:12   INFO  epoch: 20/24, acc_iter=73390, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:17, time_cost(all): 22:37:57/3:20:40, loss=0.315809427854578, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=4.700655170290348, lr=0.00181939821706334
2023-12-22 13:07:08   INFO  epoch: 20/24, acc_iter=73440, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:51, time_cost(all): 22:38:53/3:27:52, loss=0.315613136535234, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=2.8475995202511872, lr=0.001807638110689258
2023-12-22 13:08:04   INFO  epoch: 20/24, acc_iter=73490, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:43, time_cost(all): 22:39:49/3:23:51, loss=0.315416845215891, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=2.9641430395317943, lr=0.001795878004315175
2023-12-22 13:08:59   INFO  epoch: 20/24, acc_iter=73540, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:39, time_cost(all): 22:40:44/3:20:38, loss=0.315220553896548, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=4.405065628414172, lr=0.001784117897941093
2023-12-22 13:09:55   INFO  epoch: 20/24, acc_iter=73590, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:05:09, time_cost(all): 22:41:40/3:21:25, loss=0.315024262577204, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=2.3571293617634006, lr=0.001772357791567011
2023-12-22 13:10:51   INFO  epoch: 20/24, acc_iter=73640, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:01, time_cost(all): 22:42:36/3:22:27, loss=0.314827971257861, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=0.8716394296104006, lr=0.001760597685192929
2023-12-22 13:11:47   INFO  epoch: 20/24, acc_iter=73690, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:05, time_cost(all): 22:43:32/3:32:08, loss=0.314631679938517, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=4.843229911501874, lr=0.001748837578818847
2023-12-22 13:12:43   INFO  epoch: 20/24, acc_iter=73740, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:10, time_cost(all): 22:44:28/3:32:33, loss=0.314435388619174, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=1.7370039654283511, lr=0.001737077472444764
2023-12-22 13:13:38   INFO  epoch: 20/24, acc_iter=73790, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:17, time_cost(all): 22:45:23/3:24:53, loss=0.31423909729983, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=4.667762311454577, lr=0.001725317366070682
2023-12-22 13:14:34   INFO  epoch: 20/24, acc_iter=73840, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 22:46:19/3:17:53, loss=0.314042805980487, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=4.999830107594407, lr=0.0017135572596966
2023-12-22 13:15:30   INFO  epoch: 21/24, acc_iter=73907, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:04:23, time_cost(all): 22:47:15/3:16:27, loss=0.313779775612567, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=3.5874858239490792, lr=0.00169779871715533
2023-12-22 13:16:26   INFO  epoch: 21/24, acc_iter=73957, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:02:39, time_cost(all): 22:48:11/3:16:41, loss=0.313583484293223, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=4.484242389500206, lr=0.001686038610781248
2023-12-22 13:17:21   INFO  epoch: 21/24, acc_iter=74007, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:01:24, time_cost(all): 22:49:06/3:15:47, loss=0.31338719297388, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=0.927683741334794, lr=0.001674278504407166
2023-12-22 13:18:17   INFO  epoch: 21/24, acc_iter=74057, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:01:11, time_cost(all): 22:50:02/3:19:09, loss=0.313190901654536, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=4.639409314961571, lr=0.001662518398033083
2023-12-22 13:19:13   INFO  epoch: 21/24, acc_iter=74107, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:01:45, time_cost(all): 22:50:58/3:17:47, loss=0.312994610335193, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=0.766442051584016, lr=0.001650758291659001
2023-12-22 13:20:09   INFO  epoch: 21/24, acc_iter=74157, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/0:59:23, time_cost(all): 22:51:54/3:22:53, loss=0.31279831901585, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=1.2222036198574964, lr=0.001638998185284919
2023-12-22 13:21:04   INFO  epoch: 21/24, acc_iter=74207, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:58:25, time_cost(all): 22:52:49/3:15:54, loss=0.312602027696506, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=0.8847691603028853, lr=0.001627238078910836
2023-12-22 13:22:00   INFO  epoch: 21/24, acc_iter=74257, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:58:19, time_cost(all): 22:53:45/3:08:09, loss=0.312405736377163, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=3.393854001780393, lr=0.001615477972536755
2023-12-22 13:22:56   INFO  epoch: 21/24, acc_iter=74307, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:58:49, time_cost(all): 22:54:41/3:10:12, loss=0.312209445057819, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=0.7550851049352889, lr=0.001603717866162672
2023-12-22 13:23:52   INFO  epoch: 21/24, acc_iter=74357, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:58:22, time_cost(all): 22:55:37/3:19:26, loss=0.312013153738476, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=1.2464262384262703, lr=0.00159195775978859
2023-12-22 13:24:48   INFO  epoch: 21/24, acc_iter=74407, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:54:33, time_cost(all): 22:56:33/3:14:00, loss=0.311816862419132, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=2.89558855702424, lr=0.001580197653414508
2023-12-22 13:25:43   INFO  epoch: 21/24, acc_iter=74457, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:51:30, time_cost(all): 22:57:28/3:13:37, loss=0.311620571099789, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=1.219298328592239, lr=0.001568437547040426
2023-12-22 13:26:39   INFO  epoch: 21/24, acc_iter=74507, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:51:16, time_cost(all): 22:58:24/3:15:58, loss=0.311424279780446, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=1.3647338982889652, lr=0.001556677440666343
2023-12-22 13:27:35   INFO  epoch: 21/24, acc_iter=74557, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:52:30, time_cost(all): 22:59:20/3:07:06, loss=0.311227988461102, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=3.3341461210275316, lr=0.001544917334292261
2023-12-22 13:28:31   INFO  epoch: 21/24, acc_iter=74607, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:52:01, time_cost(all): 23:00:16/3:09:37, loss=0.311031697141759, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=1.2562031629250343, lr=0.001533157227918179
2023-12-22 13:29:26   INFO  epoch: 21/24, acc_iter=74657, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:51:38, time_cost(all): 23:01:11/3:09:05, loss=0.310835405822415, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=1.8717726042051794, lr=0.001521397121544097
2023-12-22 13:30:22   INFO  epoch: 21/24, acc_iter=74707, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:48:22, time_cost(all): 23:02:07/2:58:07, loss=0.310639114503072, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=2.351960631900861, lr=0.001509637015170015
2023-12-22 13:31:18   INFO  epoch: 21/24, acc_iter=74757, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:51, time_cost(all): 23:03:03/3:01:49, loss=0.310442823183728, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=3.952937029746019, lr=0.001497876908795932
2023-12-22 13:32:14   INFO  epoch: 21/24, acc_iter=74807, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:46:16, time_cost(all): 23:03:59/3:04:29, loss=0.310246531864385, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=1.7733864144360607, lr=0.00148611680242185
2023-12-22 13:33:09   INFO  epoch: 21/24, acc_iter=74857, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:45:31, time_cost(all): 23:04:54/2:58:02, loss=0.310050240545041, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=3.1761447233604003, lr=0.001474356696047768
2023-12-22 13:34:05   INFO  epoch: 21/24, acc_iter=74907, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:43:47, time_cost(all): 23:05:50/2:55:04, loss=0.309853949225698, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=1.0115632403476122, lr=0.001462596589673686
2023-12-22 13:35:01   INFO  epoch: 21/24, acc_iter=74957, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:45:26, time_cost(all): 23:06:46/3:09:36, loss=0.309657657906355, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=1.665020786855079, lr=0.001450836483299604
2023-12-22 13:35:57   INFO  epoch: 21/24, acc_iter=75007, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:44:12, time_cost(all): 23:07:42/2:58:04, loss=0.309461366587011, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=2.705122131040658, lr=0.001439076376925521
2023-12-22 13:36:53   INFO  epoch: 21/24, acc_iter=75057, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:41:14, time_cost(all): 23:08:38/3:08:21, loss=0.309265075267668, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=4.330132903704217, lr=0.001427316270551439
2023-12-22 13:37:48   INFO  epoch: 21/24, acc_iter=75107, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:40:05, time_cost(all): 23:09:33/2:55:37, loss=0.309068783948324, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=0.9835339933051698, lr=0.001415556164177357
2023-12-22 13:38:44   INFO  epoch: 21/24, acc_iter=75157, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:42:56, time_cost(all): 23:10:29/3:00:07, loss=0.308872492628981, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=2.2296674695654866, lr=0.001403796057803275
2023-12-22 13:39:40   INFO  epoch: 21/24, acc_iter=75207, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:47, time_cost(all): 23:11:25/2:51:42, loss=0.308676201309637, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=3.1652212170419176, lr=0.001392035951429192
2023-12-22 13:40:36   INFO  epoch: 21/24, acc_iter=75257, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:37:45, time_cost(all): 23:12:21/3:00:07, loss=0.308479909990294, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=1.0064254592634796, lr=0.00138027584505511
2023-12-22 13:41:31   INFO  epoch: 21/24, acc_iter=75307, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:37:20, time_cost(all): 23:13:16/2:57:20, loss=0.308283618670951, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=3.2968733933728327, lr=0.001368515738681028
2023-12-22 13:42:27   INFO  epoch: 21/24, acc_iter=75357, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:36:32, time_cost(all): 23:14:12/2:55:14, loss=0.308087327351607, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=3.867541997109116, lr=0.001356755632306946
2023-12-22 13:43:23   INFO  epoch: 21/24, acc_iter=75407, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:36:43, time_cost(all): 23:15:08/2:52:45, loss=0.307891036032264, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=1.4805303097329576, lr=0.001344995525932864
2023-12-22 13:44:19   INFO  epoch: 21/24, acc_iter=75457, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:34:15, time_cost(all): 23:16:04/2:58:25, loss=0.30769474471292, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=4.151659864262951, lr=0.001333235419558781
2023-12-22 13:45:14   INFO  epoch: 21/24, acc_iter=75507, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:34:20, time_cost(all): 23:16:59/2:46:14, loss=0.307498453393577, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=3.393596286171727, lr=0.001321475313184699
2023-12-22 13:46:10   INFO  epoch: 21/24, acc_iter=75557, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:26, time_cost(all): 23:17:55/2:51:00, loss=0.307302162074233, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=3.6332338766441286, lr=0.001309715206810617
2023-12-22 13:47:06   INFO  epoch: 21/24, acc_iter=75607, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:32:24, time_cost(all): 23:18:51/2:51:40, loss=0.30710587075489, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=1.2297194998800318, lr=0.001297955100436535
2023-12-22 13:48:02   INFO  epoch: 21/24, acc_iter=75657, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:32:51, time_cost(all): 23:19:47/2:53:30, loss=0.306909579435547, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=1.9702731635955297, lr=0.001286194994062453
2023-12-22 13:48:57   INFO  epoch: 21/24, acc_iter=75707, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:24, time_cost(all): 23:20:42/2:47:52, loss=0.306713288116203, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=2.1369601111817857, lr=0.00127443488768837
2023-12-22 13:49:53   INFO  epoch: 21/24, acc_iter=75757, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:02, time_cost(all): 23:21:38/2:55:33, loss=0.30651699679686, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=0.6322230090467922, lr=0.001262674781314288
2023-12-22 13:50:49   INFO  epoch: 21/24, acc_iter=75807, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:28:42, time_cost(all): 23:22:34/2:42:49, loss=0.306320705477516, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.493916247206673, lr=0.001250914674940206
2023-12-22 13:51:45   INFO  epoch: 21/24, acc_iter=75857, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:27:22, time_cost(all): 23:23:30/2:43:17, loss=0.306124414158173, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=1.2134338848387558, lr=0.001239154568566124
2023-12-22 13:52:41   INFO  epoch: 21/24, acc_iter=75907, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:09, time_cost(all): 23:24:26/2:43:10, loss=0.305928122838829, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=2.7872666923415905, lr=0.001227394462192042
2023-12-22 13:53:36   INFO  epoch: 21/24, acc_iter=75957, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:27:14, time_cost(all): 23:25:21/2:47:45, loss=0.305731831519486, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=0.6370954815360919, lr=0.001215634355817959
2023-12-22 13:54:32   INFO  epoch: 21/24, acc_iter=76007, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:32, time_cost(all): 23:26:17/2:39:49, loss=0.305535540200143, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=1.7580149542569043, lr=0.001203874249443877
2023-12-22 13:55:28   INFO  epoch: 21/24, acc_iter=76057, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:12, time_cost(all): 23:27:13/2:37:04, loss=0.305339248880799, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=0.736121166200205, lr=0.001192114143069795
2023-12-22 13:56:24   INFO  epoch: 21/24, acc_iter=76107, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:23:49, time_cost(all): 23:28:09/2:34:30, loss=0.305142957561456, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=2.9068807461644166, lr=0.001180354036695713
2023-12-22 13:57:19   INFO  epoch: 21/24, acc_iter=76157, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:21:49, time_cost(all): 23:29:04/2:43:55, loss=0.304946666242112, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=0.7231320810555084, lr=0.001168593930321631
2023-12-22 13:58:15   INFO  epoch: 21/24, acc_iter=76207, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:57, time_cost(all): 23:30:00/2:35:17, loss=0.304750374922769, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.056514956335208, lr=0.001156833823947549
2023-12-22 13:59:11   INFO  epoch: 21/24, acc_iter=76257, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:21:04, time_cost(all): 23:30:56/2:38:31, loss=0.304554083603425, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=3.252711452552743, lr=0.001145073717573466
2023-12-22 14:00:07   INFO  epoch: 21/24, acc_iter=76307, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:26, time_cost(all): 23:31:52/2:34:36, loss=0.304357792284082, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=1.7097242853051926, lr=0.001133313611199384
2023-12-22 14:01:02   INFO  epoch: 21/24, acc_iter=76357, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:51, time_cost(all): 23:32:47/2:29:59, loss=0.304161500964738, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=3.271562738139456, lr=0.001121553504825302
2023-12-22 14:01:58   INFO  epoch: 21/24, acc_iter=76407, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:04, time_cost(all): 23:33:43/2:36:00, loss=0.303965209645395, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=1.312992076562613, lr=0.001109793398451219
2023-12-22 14:02:54   INFO  epoch: 21/24, acc_iter=76457, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:56, time_cost(all): 23:34:39/2:31:14, loss=0.303768918326052, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=1.1218038894295732, lr=0.001098033292077137
2023-12-22 14:03:50   INFO  epoch: 21/24, acc_iter=76507, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:35, time_cost(all): 23:35:35/2:35:02, loss=0.303572627006708, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=4.014631711913111, lr=0.001086273185703055
2023-12-22 14:04:46   INFO  epoch: 21/24, acc_iter=76557, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:12, time_cost(all): 23:36:31/2:38:29, loss=0.303376335687365, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=4.468513694368422, lr=0.001074513079328973
2023-12-22 14:05:41   INFO  epoch: 21/24, acc_iter=76607, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:33, time_cost(all): 23:37:26/2:37:41, loss=0.303180044368021, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=3.1179150364062416, lr=0.001062752972954891
2023-12-22 14:06:37   INFO  epoch: 21/24, acc_iter=76657, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:32, time_cost(all): 23:38:22/2:23:37, loss=0.302983753048678, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=2.430022395880746, lr=0.001050992866580809
2023-12-22 14:07:33   INFO  epoch: 21/24, acc_iter=76707, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:23, time_cost(all): 23:39:18/2:25:40, loss=0.302787461729334, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=2.1904152518802755, lr=0.001039232760206726
2023-12-22 14:08:29   INFO  epoch: 21/24, acc_iter=76757, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:25, time_cost(all): 23:40:14/2:30:07, loss=0.302591170409991, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=3.2441195773266154, lr=0.001027472653832644
2023-12-22 14:09:24   INFO  epoch: 21/24, acc_iter=76807, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:59, time_cost(all): 23:41:09/2:29:20, loss=0.302394879090648, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=1.5973853090691112, lr=0.001015712547458562
2023-12-22 14:10:20   INFO  epoch: 21/24, acc_iter=76857, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:10:00, time_cost(all): 23:42:05/2:20:19, loss=0.302198587771304, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=1.0936603997785714, lr=0.00100395244108448
2023-12-22 14:11:16   INFO  epoch: 21/24, acc_iter=76907, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:39, time_cost(all): 23:43:01/2:25:10, loss=0.302002296451961, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=2.62230920146198, lr=0.000992192334710398
2023-12-22 14:12:12   INFO  epoch: 21/24, acc_iter=76957, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:25, time_cost(all): 23:43:57/2:22:11, loss=0.301806005132617, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=4.023145282204858, lr=0.000980432228336315
2023-12-22 14:13:07   INFO  epoch: 21/24, acc_iter=77007, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:42, time_cost(all): 23:44:52/2:28:05, loss=0.301609713813274, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=3.2379222908356726, lr=0.000968672121962233
2023-12-22 14:14:03   INFO  epoch: 21/24, acc_iter=77057, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:06:10, time_cost(all): 23:45:48/2:21:31, loss=0.30141342249393, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=2.1695443016621723, lr=0.000956912015588151
2023-12-22 14:14:59   INFO  epoch: 21/24, acc_iter=77107, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:05:08, time_cost(all): 23:46:44/2:25:06, loss=0.301217131174587, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=0.9797098267900998, lr=0.000945151909214069
2023-12-22 14:15:55   INFO  epoch: 21/24, acc_iter=77157, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:10, time_cost(all): 23:47:40/2:16:40, loss=0.301020839855244, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=2.951089050194466, lr=0.000933391802839986
2023-12-22 14:16:51   INFO  epoch: 21/24, acc_iter=77207, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:10, time_cost(all): 23:48:36/2:15:48, loss=0.3008245485359, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=0.8546919170364007, lr=0.000921631696465904
2023-12-22 14:17:46   INFO  epoch: 21/24, acc_iter=77257, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:05, time_cost(all): 23:49:31/2:18:39, loss=0.300628257216557, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=0.6387758975104674, lr=0.000909871590091822
2023-12-22 14:18:42   INFO  epoch: 21/24, acc_iter=77307, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:11, time_cost(all): 23:50:27/2:20:59, loss=0.300431965897213, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=4.867544685244317, lr=0.00089811148371774
2023-12-22 14:19:38   INFO  epoch: 21/24, acc_iter=77357, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 23:51:23/2:15:31, loss=0.30023567457787, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=4.772003241553167, lr=0.000886351377343658
2023-12-22 14:20:34   INFO  epoch: 22/24, acc_iter=77424, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:04:22, time_cost(all): 23:52:19/2:10:18, loss=0.29997264420995, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=0.7804693202305117, lr=0.000870592834802388
2023-12-22 14:21:29   INFO  epoch: 22/24, acc_iter=77474, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:02:40, time_cost(all): 23:53:14/2:10:24, loss=0.299776352890606, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=2.255838413587168, lr=0.000858832728428305
2023-12-22 14:22:25   INFO  epoch: 22/24, acc_iter=77524, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:05:18, time_cost(all): 23:54:10/2:08:20, loss=0.299580061571263, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=3.718240495725576, lr=0.000847072622054223
2023-12-22 14:23:21   INFO  epoch: 22/24, acc_iter=77574, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:00:12, time_cost(all): 23:55:06/2:19:01, loss=0.299383770251919, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=3.7262324579605473, lr=0.000835312515680141
2023-12-22 14:24:17   INFO  epoch: 22/24, acc_iter=77624, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:03:34, time_cost(all): 23:56:02/2:18:16, loss=0.299187478932576, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=3.5044100917357346, lr=0.000823552409306059
2023-12-22 14:25:12   INFO  epoch: 22/24, acc_iter=77674, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:00:36, time_cost(all): 23:56:57/2:14:22, loss=0.298991187613232, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.7345630475200657, lr=0.000811792302931976
2023-12-22 14:26:08   INFO  epoch: 22/24, acc_iter=77724, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:57:41, time_cost(all): 23:57:53/2:13:30, loss=0.298794896293889, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=4.328008768571068, lr=0.000800032196557895
2023-12-22 14:27:04   INFO  epoch: 22/24, acc_iter=77774, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:55:28, time_cost(all): 23:58:49/2:14:19, loss=0.298598604974546, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=1.5517310944451805, lr=0.000788272090183812
2023-12-22 14:28:00   INFO  epoch: 22/24, acc_iter=77824, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:56:52, time_cost(all): 23:59:45/2:08:48, loss=0.298402313655202, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=0.7056702434385967, lr=0.00077651198380973
2023-12-22 14:28:56   INFO  epoch: 22/24, acc_iter=77874, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:58:09, time_cost(all): 1 day, 0:00:41/2:05:54, loss=0.298206022335859, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=1.1996901065061458, lr=0.000764751877435648
2023-12-22 14:29:51   INFO  epoch: 22/24, acc_iter=77924, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:57:06, time_cost(all): 1 day, 0:01:36/2:07:12, loss=0.298009731016515, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=0.590812291680775, lr=0.000752991771061565
2023-12-22 14:30:47   INFO  epoch: 22/24, acc_iter=77974, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:56:52, time_cost(all): 1 day, 0:02:32/2:08:45, loss=0.297813439697172, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=3.208897192841162, lr=0.000741231664687483
2023-12-22 14:31:43   INFO  epoch: 22/24, acc_iter=78024, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:09, time_cost(all): 1 day, 0:03:28/2:11:51, loss=0.297617148377828, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=4.301089184988301, lr=0.000729471558313401
2023-12-22 14:32:39   INFO  epoch: 22/24, acc_iter=78074, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:50:05, time_cost(all): 1 day, 0:04:24/2:07:00, loss=0.297420857058485, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=2.3494295582750064, lr=0.000717711451939319
2023-12-22 14:33:34   INFO  epoch: 22/24, acc_iter=78124, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:50:39, time_cost(all): 1 day, 0:05:19/2:07:33, loss=0.297224565739142, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=2.6161317591675397, lr=0.000705951345565237
2023-12-22 14:34:30   INFO  epoch: 22/24, acc_iter=78174, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:50:07, time_cost(all): 1 day, 0:06:15/2:00:28, loss=0.297028274419798, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=4.126208556874276, lr=0.000694191239191155
2023-12-22 14:35:26   INFO  epoch: 22/24, acc_iter=78224, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:48:05, time_cost(all): 1 day, 0:07:11/2:06:09, loss=0.296831983100455, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.269921886146184, lr=0.000682431132817072
2023-12-22 14:36:22   INFO  epoch: 22/24, acc_iter=78274, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:46:47, time_cost(all): 1 day, 0:08:07/2:05:04, loss=0.296635691781111, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=1.9765149503229642, lr=0.00067067102644299
2023-12-22 14:37:17   INFO  epoch: 22/24, acc_iter=78324, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:46:31, time_cost(all): 1 day, 0:09:02/2:05:19, loss=0.296439400461768, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=1.653179123306397, lr=0.000658910920068908
2023-12-22 14:38:13   INFO  epoch: 22/24, acc_iter=78374, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:47:33, time_cost(all): 1 day, 0:09:58/1:54:46, loss=0.296243109142424, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=4.479612860613366, lr=0.000647150813694825
2023-12-22 14:39:09   INFO  epoch: 22/24, acc_iter=78424, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:45:24, time_cost(all): 1 day, 0:10:54/2:01:48, loss=0.296046817823081, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=4.398867232536906, lr=0.000635390707320743
2023-12-22 14:40:05   INFO  epoch: 22/24, acc_iter=78474, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:47:01, time_cost(all): 1 day, 0:11:50/1:53:58, loss=0.295850526503738, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=2.1028248309955746, lr=0.000623630600946661
2023-12-22 14:41:01   INFO  epoch: 22/24, acc_iter=78524, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:46:01, time_cost(all): 1 day, 0:12:46/1:52:23, loss=0.295654235184394, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=2.915725136004303, lr=0.000611870494572579
2023-12-22 14:41:56   INFO  epoch: 22/24, acc_iter=78574, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:43:35, time_cost(all): 1 day, 0:13:41/1:56:45, loss=0.295457943865051, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=4.44909096928788, lr=0.000600110388198497
2023-12-22 14:42:52   INFO  epoch: 22/24, acc_iter=78624, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:43:55, time_cost(all): 1 day, 0:14:37/1:48:50, loss=0.295261652545707, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=0.5746489159584879, lr=0.000588350281824415
2023-12-22 14:43:48   INFO  epoch: 22/24, acc_iter=78674, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:41:21, time_cost(all): 1 day, 0:15:33/1:55:06, loss=0.295065361226364, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=2.0017887252028244, lr=0.000576590175450332
2023-12-22 14:44:44   INFO  epoch: 22/24, acc_iter=78724, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:41:55, time_cost(all): 1 day, 0:16:29/1:48:27, loss=0.29486906990702, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=3.373520163431354, lr=0.00056483006907625
2023-12-22 14:45:39   INFO  epoch: 22/24, acc_iter=78774, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:38:28, time_cost(all): 1 day, 0:17:24/1:55:47, loss=0.294672778587677, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=3.8734071829908996, lr=0.000553069962702168
2023-12-22 14:46:35   INFO  epoch: 22/24, acc_iter=78824, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:40:02, time_cost(all): 1 day, 0:18:20/1:47:31, loss=0.294476487268334, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=2.994110745258427, lr=0.000541309856328086
2023-12-22 14:47:31   INFO  epoch: 22/24, acc_iter=78874, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:36:59, time_cost(all): 1 day, 0:19:16/1:52:05, loss=0.29428019594899, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=3.8304339458073566, lr=0.000529549749954004
2023-12-22 14:48:27   INFO  epoch: 22/24, acc_iter=78924, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:34:53, time_cost(all): 1 day, 0:20:12/1:52:08, loss=0.294083904629647, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=3.62636196584255, lr=0.000517789643579921
2023-12-22 14:49:22   INFO  epoch: 22/24, acc_iter=78974, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:36:41, time_cost(all): 1 day, 0:21:07/1:51:51, loss=0.293887613310303, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=3.1581851657279936, lr=0.000506029537205839
2023-12-22 14:50:18   INFO  epoch: 22/24, acc_iter=79024, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:34:35, time_cost(all): 1 day, 0:22:03/1:47:44, loss=0.29369132199096, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=2.3492352412150934, lr=0.000494269430831757
2023-12-22 14:51:14   INFO  epoch: 22/24, acc_iter=79074, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:33:16, time_cost(all): 1 day, 0:22:59/1:50:13, loss=0.293495030671616, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=3.4557593066211423, lr=0.000482509324457675
2023-12-22 14:52:10   INFO  epoch: 22/24, acc_iter=79124, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:32:48, time_cost(all): 1 day, 0:23:55/1:42:03, loss=0.293298739352273, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=3.8689931625991854, lr=0.000470749218083592
2023-12-22 14:53:06   INFO  epoch: 22/24, acc_iter=79174, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:32:01, time_cost(all): 1 day, 0:24:51/1:39:30, loss=0.29310244803293, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=3.4427434432683524, lr=0.00045898911170951
2023-12-22 14:54:01   INFO  epoch: 22/24, acc_iter=79224, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:30:39, time_cost(all): 1 day, 0:25:46/1:46:35, loss=0.292906156713586, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=1.7617753445542959, lr=0.000447229005335428
2023-12-22 14:54:57   INFO  epoch: 22/24, acc_iter=79274, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:30:13, time_cost(all): 1 day, 0:26:42/1:39:05, loss=0.292709865394243, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=0.6699229153882938, lr=0.000435468898961346
2023-12-22 14:55:53   INFO  epoch: 22/24, acc_iter=79324, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:28:50, time_cost(all): 1 day, 0:27:38/1:39:07, loss=0.292513574074899, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=1.424282775575599, lr=0.000423708792587264
2023-12-22 14:56:49   INFO  epoch: 22/24, acc_iter=79374, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:28:58, time_cost(all): 1 day, 0:28:34/1:44:46, loss=0.292317282755556, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=0.6043083373629209, lr=0.000411948686213181
2023-12-22 14:57:44   INFO  epoch: 22/24, acc_iter=79424, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:31, time_cost(all): 1 day, 0:29:29/1:35:54, loss=0.292120991436212, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=2.254989306700084, lr=0.000400188579839099
2023-12-22 14:58:40   INFO  epoch: 22/24, acc_iter=79474, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:39, time_cost(all): 1 day, 0:30:25/1:38:12, loss=0.291924700116869, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=1.5735351675449325, lr=0.000388428473465017
2023-12-22 14:59:36   INFO  epoch: 22/24, acc_iter=79524, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:24:08, time_cost(all): 1 day, 0:31:21/1:34:28, loss=0.291728408797525, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=2.1823552072616117, lr=0.000376668367090935
2023-12-22 15:00:32   INFO  epoch: 22/24, acc_iter=79574, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:24:20, time_cost(all): 1 day, 0:32:17/1:32:07, loss=0.291532117478182, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=1.2937208654216992, lr=0.000364908260716853
2023-12-22 15:01:27   INFO  epoch: 22/24, acc_iter=79624, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:22:28, time_cost(all): 1 day, 0:33:12/1:39:42, loss=0.291335826158839, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.9204662703169997, lr=0.000353148154342771
2023-12-22 15:02:23   INFO  epoch: 22/24, acc_iter=79674, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:59, time_cost(all): 1 day, 0:34:08/1:38:16, loss=0.291139534839495, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=0.7003256410505291, lr=0.000341388047968688
2023-12-22 15:03:19   INFO  epoch: 22/24, acc_iter=79724, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:52, time_cost(all): 1 day, 0:35:04/1:31:48, loss=0.290943243520152, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=3.01140134026033, lr=0.000329627941594606
2023-12-22 15:04:15   INFO  epoch: 22/24, acc_iter=79774, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:19:53, time_cost(all): 1 day, 0:36:00/1:28:53, loss=0.290746952200808, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.2053496217931725, lr=0.000317867835220524
2023-12-22 15:05:11   INFO  epoch: 22/24, acc_iter=79824, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:20:11, time_cost(all): 1 day, 0:36:56/1:28:14, loss=0.290550660881465, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=2.280249629937363, lr=0.000306107728846442
2023-12-22 15:06:06   INFO  epoch: 22/24, acc_iter=79874, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:18:20, time_cost(all): 1 day, 0:37:51/1:35:07, loss=0.290354369562121, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=1.8304826696771062, lr=0.000298418422898303
2023-12-22 15:07:02   INFO  epoch: 22/24, acc_iter=79924, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:18:17, time_cost(all): 1 day, 0:38:47/1:25:57, loss=0.290158078242778, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=4.36352094087446, lr=0.000295127858022936
2023-12-22 15:07:58   INFO  epoch: 22/24, acc_iter=79974, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:39, time_cost(all): 1 day, 0:39:43/1:32:39, loss=0.289961786923435, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=4.25695913097932, lr=0.000291837293147568
2023-12-22 15:08:54   INFO  epoch: 22/24, acc_iter=80024, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:16:13, time_cost(all): 1 day, 0:40:39/1:32:34, loss=0.289765495604091, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=1.067333641445825, lr=0.000288546728272201
2023-12-22 15:09:49   INFO  epoch: 22/24, acc_iter=80074, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:38, time_cost(all): 1 day, 0:41:34/1:26:42, loss=0.289569204284748, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=4.213048696396387, lr=0.000285256163396834
2023-12-22 15:10:45   INFO  epoch: 22/24, acc_iter=80124, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:15, time_cost(all): 1 day, 0:42:30/1:28:21, loss=0.289372912965404, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=3.3897099417691763, lr=0.000281965598521467
2023-12-22 15:11:41   INFO  epoch: 22/24, acc_iter=80174, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:39, time_cost(all): 1 day, 0:43:26/1:24:12, loss=0.289176621646061, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=1.490559527548691, lr=0.000278675033646099
2023-12-22 15:12:37   INFO  epoch: 22/24, acc_iter=80224, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:40, time_cost(all): 1 day, 0:44:22/1:24:44, loss=0.288980330326717, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=3.834310344084053, lr=0.000275384468770732
2023-12-22 15:13:32   INFO  epoch: 22/24, acc_iter=80274, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:11:52, time_cost(all): 1 day, 0:45:17/1:26:19, loss=0.288784039007374, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=0.5835519829038209, lr=0.000272093903895365
2023-12-22 15:14:28   INFO  epoch: 22/24, acc_iter=80324, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:16, time_cost(all): 1 day, 0:46:13/1:22:38, loss=0.288587747688031, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=2.760279276489155, lr=0.000268803339019998
2023-12-22 15:15:24   INFO  epoch: 22/24, acc_iter=80374, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:32, time_cost(all): 1 day, 0:47:09/1:25:02, loss=0.288391456368687, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=1.2445928902370476, lr=0.00026551277414463
2023-12-22 15:16:20   INFO  epoch: 22/24, acc_iter=80424, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:58, time_cost(all): 1 day, 0:48:05/1:22:07, loss=0.288195165049344, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=3.0852831428239393, lr=0.000262222209269263
2023-12-22 15:17:15   INFO  epoch: 22/24, acc_iter=80474, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:07:43, time_cost(all): 1 day, 0:49:00/1:24:03, loss=0.28799887373, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=4.574252472903388, lr=0.000258931644393896
2023-12-22 15:18:11   INFO  epoch: 22/24, acc_iter=80524, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:59, time_cost(all): 1 day, 0:49:56/1:22:37, loss=0.287802582410657, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=2.323096664785457, lr=0.000255641079518529
2023-12-22 15:19:07   INFO  epoch: 22/24, acc_iter=80574, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:50, time_cost(all): 1 day, 0:50:52/1:14:30, loss=0.287606291091313, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=4.864910165012791, lr=0.000252350514643161
2023-12-22 15:20:03   INFO  epoch: 22/24, acc_iter=80624, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:59, time_cost(all): 1 day, 0:51:48/1:14:12, loss=0.28740999977197, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=0.8221929080562407, lr=0.000249059949767794
2023-12-22 15:20:59   INFO  epoch: 22/24, acc_iter=80674, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:02, time_cost(all): 1 day, 0:52:44/1:13:48, loss=0.287213708452627, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=2.1218366284542145, lr=0.000245769384892427
2023-12-22 15:21:54   INFO  epoch: 22/24, acc_iter=80724, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:03:05, time_cost(all): 1 day, 0:53:39/1:14:29, loss=0.287017417133283, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=2.9894326521345214, lr=0.00024247882001706
2023-12-22 15:22:50   INFO  epoch: 22/24, acc_iter=80774, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:08, time_cost(all): 1 day, 0:54:35/1:12:29, loss=0.28682112581394, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=2.452040502227325, lr=0.000239188255141692
2023-12-22 15:23:46   INFO  epoch: 22/24, acc_iter=80824, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:13, time_cost(all): 1 day, 0:55:31/1:12:28, loss=0.286624834494596, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=2.6540493305146113, lr=0.000235897690266325
2023-12-22 15:24:42   INFO  epoch: 22/24, acc_iter=80874, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:19, time_cost(all): 1 day, 0:56:27/1:13:55, loss=0.286428543175253, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=3.442718841287416, lr=0.000232607125390958
2023-12-22 15:25:37   INFO  epoch: 23/24, acc_iter=80941, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:00:55/1:06:38, time_cost(all): 1 day, 0:57:22/1:09:51, loss=0.286165512807333, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=1.0747703801087414, lr=0.000228197768457966
2023-12-22 15:26:33   INFO  epoch: 23/24, acc_iter=80991, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:01:51/1:04:03, time_cost(all): 1 day, 0:58:18/1:14:09, loss=0.285969221487989, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=4.171097918521255, lr=0.000224907203582598
2023-12-22 15:27:29   INFO  epoch: 23/24, acc_iter=81041, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:02:47/1:04:42, time_cost(all): 1 day, 0:59:14/1:08:15, loss=0.285772930168646, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=0.8005974441649003, lr=0.000221616638707231
2023-12-22 15:28:25   INFO  epoch: 23/24, acc_iter=81091, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:03:43/1:04:43, time_cost(all): 1 day, 1:00:10/1:06:32, loss=0.285576638849302, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=2.3058044237358213, lr=0.000218326073831864
2023-12-22 15:29:20   INFO  epoch: 23/24, acc_iter=81141, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:04:38/1:00:56, time_cost(all): 1 day, 1:01:05/1:05:15, loss=0.285380347529959, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=2.7280522809591785, lr=0.000215035508956497
2023-12-22 15:30:16   INFO  epoch: 23/24, acc_iter=81191, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:05:34/1:02:16, time_cost(all): 1 day, 1:02:01/1:10:07, loss=0.285184056210615, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=0.7601855091465818, lr=0.000211744944081129
2023-12-22 15:31:12   INFO  epoch: 23/24, acc_iter=81241, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:06:30/0:59:09, time_cost(all): 1 day, 1:02:57/1:07:51, loss=0.284987764891272, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=3.118433855155809, lr=0.000208454379205762
2023-12-22 15:32:08   INFO  epoch: 23/24, acc_iter=81291, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:07:26/0:59:11, time_cost(all): 1 day, 1:03:53/1:07:32, loss=0.284791473571929, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=4.8299588365857, lr=0.000205163814330395
2023-12-22 15:33:04   INFO  epoch: 23/24, acc_iter=81341, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:08:21/0:58:28, time_cost(all): 1 day, 1:04:49/1:01:16, loss=0.284595182252585, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=4.677839608489031, lr=0.000201873249455028
2023-12-22 15:33:59   INFO  epoch: 23/24, acc_iter=81391, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:09:17/0:55:37, time_cost(all): 1 day, 1:05:44/1:05:41, loss=0.284398890933242, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=1.7637880141601399, lr=0.00019858268457966
2023-12-22 15:34:55   INFO  epoch: 23/24, acc_iter=81441, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:10:13/0:53:33, time_cost(all): 1 day, 1:06:40/1:02:06, loss=0.284202599613898, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=4.4970175026412935, lr=0.000195292119704293
2023-12-22 15:35:51   INFO  epoch: 23/24, acc_iter=81491, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:11:09/0:53:42, time_cost(all): 1 day, 1:07:36/1:00:55, loss=0.284006308294555, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=0.9445148934046049, lr=0.000192001554828926
2023-12-22 15:36:47   INFO  epoch: 23/24, acc_iter=81541, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:12:04/0:53:33, time_cost(all): 1 day, 1:08:32/1:02:24, loss=0.283810016975211, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=2.46772915386398, lr=0.000188710989953559
2023-12-22 15:37:42   INFO  epoch: 23/24, acc_iter=81591, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:13:00/0:51:08, time_cost(all): 1 day, 1:09:27/0:58:37, loss=0.283613725655868, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=1.419332841260255, lr=0.000185420425078191
2023-12-22 15:38:38   INFO  epoch: 23/24, acc_iter=81641, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:13:56/0:49:42, time_cost(all): 1 day, 1:10:23/1:00:11, loss=0.283417434336524, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=2.9414722159328837, lr=0.000182129860202824
2023-12-22 15:39:34   INFO  epoch: 23/24, acc_iter=81691, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:14:52/0:48:26, time_cost(all): 1 day, 1:11:19/0:56:12, loss=0.283221143017181, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=2.558357641818554, lr=0.000178839295327457
2023-12-22 15:40:30   INFO  epoch: 23/24, acc_iter=81741, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:15:48/0:51:39, time_cost(all): 1 day, 1:12:15/0:54:19, loss=0.283024851697838, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=4.2751466409656365, lr=0.00017554873045209
2023-12-22 15:41:25   INFO  epoch: 23/24, acc_iter=81791, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:16:43/0:48:31, time_cost(all): 1 day, 1:13:10/0:53:27, loss=0.282828560378494, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=3.5758972597718466, lr=0.000172258165576722
2023-12-22 15:42:21   INFO  epoch: 23/24, acc_iter=81841, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:17:39/0:47:01, time_cost(all): 1 day, 1:14:06/0:53:45, loss=0.282632269059151, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.826544658548995, lr=0.000168967600701355
2023-12-22 15:43:17   INFO  epoch: 23/24, acc_iter=81891, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:18:35/0:49:06, time_cost(all): 1 day, 1:15:02/0:53:30, loss=0.282435977739807, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=0.9485402312041267, lr=0.000165677035825988
2023-12-22 15:44:13   INFO  epoch: 23/24, acc_iter=81941, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:19:31/0:44:35, time_cost(all): 1 day, 1:15:58/0:54:06, loss=0.282239686420464, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=1.7448766778201357, lr=0.000162386470950621
2023-12-22 15:45:09   INFO  epoch: 23/24, acc_iter=81991, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:20:26/0:45:50, time_cost(all): 1 day, 1:16:54/0:50:03, loss=0.28204339510112, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=3.851774185319805, lr=0.000159095906075253
2023-12-22 15:46:04   INFO  epoch: 23/24, acc_iter=82041, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:21:22/0:45:32, time_cost(all): 1 day, 1:17:49/0:50:58, loss=0.281847103781777, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=0.5861631640221702, lr=0.000155805341199886
2023-12-22 15:47:00   INFO  epoch: 23/24, acc_iter=82091, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:22:18/0:41:00, time_cost(all): 1 day, 1:18:45/0:52:20, loss=0.281650812462434, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=1.9399846591868835, lr=0.000152514776324519
2023-12-22 15:47:56   INFO  epoch: 23/24, acc_iter=82141, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:23:14/0:42:10, time_cost(all): 1 day, 1:19:41/0:48:05, loss=0.28145452114309, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=4.798589173993467, lr=0.000149224211449152
2023-12-22 15:48:52   INFO  epoch: 23/24, acc_iter=82191, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:24:09/0:39:24, time_cost(all): 1 day, 1:20:37/0:49:22, loss=0.281258229823747, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=2.559523102962267, lr=0.000145933646573784
2023-12-22 15:49:47   INFO  epoch: 23/24, acc_iter=82241, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:25:05/0:40:50, time_cost(all): 1 day, 1:21:32/0:49:32, loss=0.281061938504403, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=1.4945073216738851, lr=0.000142643081698417
2023-12-22 15:50:43   INFO  epoch: 23/24, acc_iter=82291, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:26:01/0:40:14, time_cost(all): 1 day, 1:22:28/0:48:51, loss=0.28086564718506, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=1.1481229767137604, lr=0.00013935251682305
2023-12-22 15:51:39   INFO  epoch: 23/24, acc_iter=82341, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:26:57/0:37:53, time_cost(all): 1 day, 1:23:24/0:47:55, loss=0.280669355865716, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=1.7091482377849354, lr=0.000136061951947682
2023-12-22 15:52:35   INFO  epoch: 23/24, acc_iter=82391, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:27:53/0:35:45, time_cost(all): 1 day, 1:24:20/0:43:17, loss=0.280473064546373, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=2.596605455498743, lr=0.000132771387072315
2023-12-22 15:53:30   INFO  epoch: 23/24, acc_iter=82441, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:28:48/0:36:01, time_cost(all): 1 day, 1:25:15/0:41:53, loss=0.28027677322703, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=3.658741647352334, lr=0.000129480822196948
2023-12-22 15:54:26   INFO  epoch: 23/24, acc_iter=82491, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:29:44/0:36:07, time_cost(all): 1 day, 1:26:11/0:43:06, loss=0.280080481907686, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=4.9922480025838025, lr=0.000126190257321581
2023-12-22 15:55:22   INFO  epoch: 23/24, acc_iter=82541, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:30:40/0:34:39, time_cost(all): 1 day, 1:27:07/0:40:20, loss=0.279884190588343, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=4.718951639788986, lr=0.000122899692446213
2023-12-22 15:56:18   INFO  epoch: 23/24, acc_iter=82591, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:31:36/0:34:15, time_cost(all): 1 day, 1:28:03/0:42:10, loss=0.279687899268999, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=4.297206330629029, lr=0.000119609127570846
2023-12-22 15:57:14   INFO  epoch: 23/24, acc_iter=82641, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:32:31/0:33:49, time_cost(all): 1 day, 1:28:59/0:38:54, loss=0.279491607949656, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=2.417384748616934, lr=0.000116318562695479
2023-12-22 15:58:09   INFO  epoch: 23/24, acc_iter=82691, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:33:27/0:31:26, time_cost(all): 1 day, 1:29:54/0:38:32, loss=0.279295316630312, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=2.7128892259746737, lr=0.000113027997820112
2023-12-22 15:59:05   INFO  epoch: 23/24, acc_iter=82741, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:34:23/0:31:45, time_cost(all): 1 day, 1:30:50/0:39:27, loss=0.279099025310969, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=1.5209931963350751, lr=0.000109737432944744
2023-12-22 16:00:01   INFO  epoch: 23/24, acc_iter=82791, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:35:19/0:29:28, time_cost(all): 1 day, 1:31:46/0:36:02, loss=0.278902733991626, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=2.438061819872124, lr=0.000106446868069377
2023-12-22 16:00:57   INFO  epoch: 23/24, acc_iter=82841, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:36:14/0:29:56, time_cost(all): 1 day, 1:32:42/0:35:06, loss=0.278706442672282, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=3.5430103875637577, lr=0.00010315630319401
2023-12-22 16:01:52   INFO  epoch: 23/24, acc_iter=82891, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:37:10/0:29:18, time_cost(all): 1 day, 1:33:37/0:37:04, loss=0.278510151352939, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=1.8921972460064207, lr=9.9865738318643e-05
2023-12-22 16:02:48   INFO  epoch: 23/24, acc_iter=82941, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:38:06/0:26:17, time_cost(all): 1 day, 1:34:33/0:33:31, loss=0.278313860033595, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=4.315627426347328, lr=9.6575173443275e-05
2023-12-22 16:03:44   INFO  epoch: 23/24, acc_iter=82991, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:39:02/0:26:49, time_cost(all): 1 day, 1:35:29/0:33:15, loss=0.278117568714252, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=2.0981850953778225, lr=9.3284608567908e-05
2023-12-22 16:04:40   INFO  epoch: 23/24, acc_iter=83041, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:39:58/0:25:17, time_cost(all): 1 day, 1:36:25/0:31:42, loss=0.277921277394908, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=3.150138708140768, lr=8.9994043692541e-05
2023-12-22 16:05:35   INFO  epoch: 23/24, acc_iter=83091, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:40:53/0:25:05, time_cost(all): 1 day, 1:37:20/0:30:57, loss=0.277724986075565, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=2.0621916804215736, lr=8.6703478817174e-05
2023-12-22 16:06:31   INFO  epoch: 23/24, acc_iter=83141, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:41:49/0:24:06, time_cost(all): 1 day, 1:38:16/0:30:48, loss=0.277528694756221, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=1.5135141532287884, lr=8.3412913941806e-05
2023-12-22 16:07:27   INFO  epoch: 23/24, acc_iter=83191, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:42:45/0:22:27, time_cost(all): 1 day, 1:39:12/0:28:33, loss=0.277332403436878, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=2.1556155177210776, lr=8.0122349066439e-05
2023-12-22 16:08:23   INFO  epoch: 23/24, acc_iter=83241, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:43:41/0:21:23, time_cost(all): 1 day, 1:40:08/0:29:10, loss=0.277136112117535, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=4.742757923290005, lr=7.6831784191072e-05
2023-12-22 16:09:19   INFO  epoch: 23/24, acc_iter=83291, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:44:36/0:20:51, time_cost(all): 1 day, 1:41:04/0:29:03, loss=0.276939820798191, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=0.8650856512494065, lr=7.3541219315705e-05
2023-12-22 16:10:14   INFO  epoch: 23/24, acc_iter=83341, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:45:32/0:19:47, time_cost(all): 1 day, 1:41:59/0:27:39, loss=0.276743529478848, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=4.9897980871048855, lr=7.0250654440337e-05
2023-12-22 16:11:10   INFO  epoch: 23/24, acc_iter=83391, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:46:28/0:19:08, time_cost(all): 1 day, 1:42:55/0:25:23, loss=0.276547238159504, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=3.50568628359488, lr=6.696008956497e-05
2023-12-22 16:12:06   INFO  epoch: 23/24, acc_iter=83441, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:47:24/0:17:46, time_cost(all): 1 day, 1:43:51/0:24:41, loss=0.276350946840161, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=2.3828633724652217, lr=6.3669524689603e-05
2023-12-22 16:13:02   INFO  epoch: 23/24, acc_iter=83491, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:48:19/0:16:37, time_cost(all): 1 day, 1:44:47/0:25:09, loss=0.276154655520817, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=2.9578210356695074, lr=6.0378959814236e-05
2023-12-22 16:13:57   INFO  epoch: 23/24, acc_iter=83541, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:49:15/0:15:19, time_cost(all): 1 day, 1:45:42/0:23:38, loss=0.275958364201474, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=0.6463165795606466, lr=5.7088394938868e-05
2023-12-22 16:14:53   INFO  epoch: 23/24, acc_iter=83591, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:50:11/0:15:15, time_cost(all): 1 day, 1:46:38/0:21:20, loss=0.275762072882131, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=3.3829950432763, lr=5.3797830063501e-05
2023-12-22 16:15:49   INFO  epoch: 23/24, acc_iter=83641, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 0:51:07/0:14:08, time_cost(all): 1 day, 1:47:34/0:20:30, loss=0.275565781562787, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=3.363218947898087, lr=5.0507265188134e-05
2023-12-22 16:16:45   INFO  epoch: 23/24, acc_iter=83691, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 0:52:03/0:13:35, time_cost(all): 1 day, 1:48:30/0:19:50, loss=0.275369490243444, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=3.267885869423663, lr=4.7216700312766e-05
2023-12-22 16:17:40   INFO  epoch: 23/24, acc_iter=83741, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 0:52:58/0:12:11, time_cost(all): 1 day, 1:49:25/0:20:12, loss=0.2751731989241, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=3.3509143408003066, lr=4.3926135437399e-05
2023-12-22 16:18:36   INFO  epoch: 23/24, acc_iter=83791, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 0:53:54/0:10:57, time_cost(all): 1 day, 1:50:21/0:19:03, loss=0.274976907604757, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=3.810075502615167, lr=4.0635570562032e-05
2023-12-22 16:19:32   INFO  epoch: 23/24, acc_iter=83841, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 0:54:50/0:10:02, time_cost(all): 1 day, 1:51:17/0:17:14, loss=0.274780616285413, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.13(1.03), norm=4.006152018214, lr=3.7345005686665e-05
2023-12-22 16:20:28   INFO  epoch: 23/24, acc_iter=83891, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 0:55:46/0:09:51, time_cost(all): 1 day, 1:52:13/0:16:54, loss=0.27458432496607, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=1.8250433741502707, lr=3.4054440811297e-05
2023-12-22 16:21:24   INFO  epoch: 23/24, acc_iter=83941, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 0:56:41/0:08:33, time_cost(all): 1 day, 1:53:09/0:15:50, loss=0.274388033646727, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=3.1396623256019587, lr=3.076387593593e-05
2023-12-22 16:22:19   INFO  epoch: 23/24, acc_iter=83991, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 0:57:37/0:08:01, time_cost(all): 1 day, 1:54:04/0:15:10, loss=0.274191742327383, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=2.2834312588811128, lr=2.7473311060563e-05
2023-12-22 16:23:15   INFO  epoch: 23/24, acc_iter=84041, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 0:58:33/0:06:45, time_cost(all): 1 day, 1:55:00/0:14:18, loss=0.27399545100804, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.13(1.03), norm=0.8010383609911611, lr=2.4182746185196e-05
2023-12-22 16:24:11   INFO  epoch: 23/24, acc_iter=84091, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 0:59:29/0:05:55, time_cost(all): 1 day, 1:55:56/0:13:32, loss=0.273799159688696, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=1.4751747047224593, lr=2.0892181309828e-05
2023-12-22 16:25:07   INFO  epoch: 23/24, acc_iter=84141, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:00:24/0:04:47, time_cost(all): 1 day, 1:56:52/0:12:08, loss=0.273602868369353, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=4.237796478342211, lr=1.7601616434461e-05
2023-12-22 16:26:02   INFO  epoch: 23/24, acc_iter=84191, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:01:20/0:04:02, time_cost(all): 1 day, 1:57:47/0:11:47, loss=0.273406577050009, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=1.4398136166338606, lr=1.4311051559094e-05
2023-12-22 16:26:58   INFO  epoch: 23/24, acc_iter=84241, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:02:16/0:02:56, time_cost(all): 1 day, 1:58:43/0:10:45, loss=0.273210285730666, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=3.171054387644086, lr=1.1020486683727e-05
2023-12-22 16:27:54   INFO  epoch: 23/24, acc_iter=84291, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:03:12/0:02:15, time_cost(all): 1 day, 1:59:39/0:09:16, loss=0.273013994411323, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=2.8391526507760307, lr=7.729921808359e-06
2023-12-22 16:28:50   INFO  epoch: 23/24, acc_iter=84341, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:04:08/0:01:17, time_cost(all): 1 day, 2:00:35/0:08:34, loss=0.272817703091979, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=4.766781306945577, lr=4.439356932992e-06
2023-12-22 16:29:45   INFO  epoch: 23/24, acc_iter=84391, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:05:03/0:00:18, time_cost(all): 1 day, 2:01:30/0:07:45, loss=0.272621411772636, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=4.029300596692071, lr=1.148792057625e-06
2023-12-22 16:29:45   INFO  **********************End training cfgs/picture_models/picture_nuscenes_occupancy(default)**********************



2023-12-22 16:35:14   INFO  **********************Start evaluation cfgs/picture_models/picture_nuscenes_occupancy(default)**********************
+---------+-------+---------+---------+--------+--------+----------------------+------------+------------+--------------+---------+--------+-------------------+------------+----------+---------+---------+------------+--------+
| classes | noise | barrier | bicycle | bus    | car    | construction_vehicle | motorcycle | pedestrian | traffic_cone | trailer | truck  | driveable_surface | other_flat | sidewalk | terrain | manmade | vegetation | miou   |
+---------+-------+---------+---------+--------+--------+----------------------+------------+------------+--------------+---------+--------+-------------------+------------+----------+---------+---------+------------+--------+
| results | nan   | 0.2172  | 0.0812  | 0.1562 | 0.2068 | 0.0694               | 0.0739     | 0.1232     | 0.0582       | 0.1570  | 0.1581 | 0.4221            | 0.1634     | 0.1638   | 0.1819  | 0.3558  | 0.3597     | 0.1842 |
+---------+-------+---------+---------+--------+--------+----------------------+------------+------------+--------------+---------+--------+-------------------+------------+----------+---------+---------+------------+--------+
2023-12-22 16:35:14   INFO  noise: nan  barrier: 0.2172  bicycle: 0.0812  bus: 0.1562  car: 0.2068  construction_vehicle: 0.0694  motorcycle: 0.0739  pedestrian: 0.1232  traffic_cone: 0.0582  trailer: 0.1570  truck: 0.1581  driveable_surface: 0.4221  other_flat: 0.1634  sidewalk: 0.1638  terrain: 0.1819  manmade: 0.3558  vegetation: 0.3597  miou: 0.1842

2023-12-22 16:35:14   INFO  Result is save to xxxxxxxxxxxxxxx
2023-12-22 16:35:14   INFO  ****************Evaluation done.*****************
2023-12-22 16:35:14   INFO  Epoch 24 has been evaluated
2023-12-22 16:35:14   INFO  **********************End evaluation cfgs/picture_models/picture_nuscenes_occupancy(default)**********************
