2023-12-16 09:22:37   INFO  **********************Start logging**********************
2023-12-16 09:22:37   INFO  CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
2023-12-16 09:22:37   INFO  total_batch_size: 8
2023-12-16 09:22:37   INFO  cfg_file         ./cfgs/picture_models/picture_nuscenes_segmentation.yaml
2023-12-16 09:22:37   INFO  batch_size       1
2023-12-16 09:22:37   INFO  epochs           24
2023-12-16 09:22:37   INFO  workers          4
2023-12-16 09:22:37   INFO  extra_tag        default
2023-12-16 09:22:37   INFO  ckpt             None
2023-12-16 09:22:37   INFO  pretrained_model nuscenes_pretrain_model.pth
2023-12-16 09:22:37   INFO  launcher         pytorch
2023-12-16 09:22:37   INFO  tcp_port         18888
2023-12-16 09:22:37   INFO  sync_bn          True
2023-12-16 09:22:37   INFO  fix_random_seed  False
2023-12-16 09:22:37   INFO  ckpt_save_interval 20
2023-12-16 09:22:37   INFO  local_rank       0
2023-12-16 09:22:37   INFO  max_ckpt_save_num 30
2023-12-16 09:22:37   INFO  merge_all_iters_to_one_epoch False
2023-12-16 09:22:37   INFO  set_cfgs         None
2023-12-16 09:22:37   INFO  max_waiting_mins 0
2023-12-16 09:22:37   INFO  start_epoch      0
2023-12-16 09:22:37   INFO  num_epochs_to_eval 0
2023-12-16 09:22:37   INFO  save_to_file     False
2023-12-16 09:22:37   INFO  use_tqdm_to_record False
2023-12-16 09:22:37   INFO  logger_iter_interval 50
2023-12-16 09:22:37   INFO  ckpt_save_time_interval 300
2023-12-16 09:22:37   INFO  wo_gpu_stat      False
2023-12-16 09:22:37   INFO  fp16             False
2023-12-16 09:22:37   INFO  cfg.ROOT_DIR: xxxxxxxxxxxxx
2023-12-16 09:22:37   INFO  cfg.LOCAL_RANK: 0
2023-12-16 09:22:37   INFO  cfg.CLASS_NAMES: ['car', 'truck', 'construction_vehicle', 'bus', 'trailer', 'barrier', 'motorcycle', 'bicycle', 'pedestrian', 'traffic_cone']
2023-12-16 09:22:37   INFO  
cfg.DATA_CONFIG = edict()
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.DATASET: NuScenesSegDataset
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.DATA_PATH: ../data/nuscenes
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.DATA_PREFIX: 'lidarseg/v1.0-trainval'
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.VERSION: v1.0-trainval
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.PRED_VELOCITY: True
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.SET_NAN_VELOCITY_TO_ZEROS: True
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.FILTER_MIN_POINTS_IN_GT: 1
2023-12-16 09:22:37   INFO  
cfg.DATA_CONFIG.DATA_SPLIT = edict()
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.DATA_SPLIT.train: train
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.DATA_SPLIT.test: val
2023-12-16 09:22:37   INFO  
cfg.DATA_CONFIG.INFO_PATH = edict()
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.INFO_PATH.train: ['nuscenes_infos_10sweeps_train.pkl']
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.INFO_PATH.test: ['nuscenes_infos_10sweeps_val.pkl']
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.POINT_CLOUD_RANGE: [-51.2, -51.2, -5.0, 51.2, 51.2, 3.0]
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.CYLINDER_POINT_CLOUD_RANGE: [0, -3.14159265359, -4, 50, 3.14159265359, 2]
2023-12-16 09:22:37   INFO  
cfg.DATA_CONFIG.DATA_AUGMENTOR = edict()
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.DISABLE_AUG_LIST: ['placeholder']
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.AUG_CONFIG_LIST: [{'NAME': 'random_world_flip', 'ALONG_AXIS_LIST': ['x', 'y']}, {'NAME': 'random_world_rotation', 'WORLD_ROT_ANGLE': [-0.78539816, 0.78539816]}, {'NAME': 'random_world_scaling', 'WORLD_SCALE_RANGE': [0.9, 1.1]}, {'NAME': 'random_world_translation', 'NOISE_TRANSLATE_STD': [0.5, 0.5, 0.5]}]
2023-12-16 09:22:37   INFO  
cfg.DATA_CONFIG.POINT_FEATURE_ENCODING = edict()
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.encoding_type: absolute_coordinates_encoding
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.used_feature_list: ['x', 'y', 'z', 'intensity', 'timestamp']
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.src_feature_list: ['x', 'y', 'z', 'intensity', 'timestamp']
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG.DATA_PROCESSOR: [{'NAME': 'mask_points_and_boxes_outside_rangeV2', 'REMOVE_OUTSIDE_BOXES': True}, {'NAME': 'shuffle_points', 'SHUFFLE_ENABLED': {'train': True, 'test': True}}, {'NAME': 'transform_points_to_voxels_placeholder', 'VOXEL_SIZE': [0.3, 0.3, 8.0]}]
2023-12-16 09:22:37   INFO  cfg.DATA_CONFIG._BASE_CONFIG_: cfgs/dataset_configs/nuscenes_seg_dataset.yaml
2023-12-16 09:22:37   INFO  
cfg.MODEL = edict()
2023-12-16 09:22:37   INFO  cfg.MODEL.NAME: Cylinder3D
2023-12-16 09:22:37   INFO  
cfg.MODEL.VFE = edict()
2023-12-16 09:22:37   INFO  cfg.MODEL.VFE.NAME: SegVFE
2023-12-16 09:22:37   INFO  cfg.MODEL.VFE.FEAT_CHANNELS: [64, 128, 256, 256]
2023-12-16 09:22:37   INFO  cfg.MODEL.VFE.IN_CHANNELS: 6
2023-12-16 09:22:37   INFO  cfg.MODEL.VFE.WITH_VOXEL_CENTER: True
2023-12-16 09:22:37   INFO  cfg.MODEL.VFE.FEAT_COMPRESSION: 16
2023-12-16 09:22:37   INFO  cfg.MODEL.VFE.RETURN_POINT_FEATS: False
2023-12-16 09:22:37   INFO  
cfg.MODEL.BACKBONE_3D = edict()
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.NAME: DSVT
2023-12-16 09:22:37   INFO  
cfg.MODEL.BACKBONE_3D.INPUT_LAYER = edict()
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.sparse_shape: [ 480, 360, 32 ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.downsample_stride: [ [ 1, 1, 4 ], [ 1, 1, 4 ], [ 1, 1, 2 ] ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.d_model: [ 128, 128, 128, 128 ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.window_shape: [ [ 12, 12, 32 ], [ 12, 12, 8 ], [ 12, 12, 2 ], [ 12, 12, 1 ] ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.hybrid_factor: [ 2, 2, 1 ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.shifts_list: [ [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ] ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.normalize_pos: False
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.block_name: [ 'DSVTBlock','DSVTBlock','DSVTBlock','DSVTBlock' ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.d_model: [ 128, 128, 128, 128 ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.nhead: [ 8, 8, 8, 8 ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [ 384, 384, 384, 384 ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.activation: 'attention'
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [ 468, 360 ]
2023-12-16 09:22:37   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 128
2023-12-16 09:22:37   INFO  
cfg.MODEL.DENSE_HEAD = edict()
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.CLASS_AGNOSTIC: False
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.NAME: Cylinder3DHead
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.NUM_CLASSES: 20
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.HIDDEN_CHANNEL: 128
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS: {'cls_weight': 1.0, 'Lovasz_weight': 1.0}
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_LOVASZ.reduction: None
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_CLS.use_sigmoid: False
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_CLS.gamma: 2.0
2023-12-16 09:22:37   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_CLS.alpha: 0.25
2023-12-16 09:22:37   INFO  
cfg.OPTIMIZATION = edict()
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.BATCH_SIZE_PER_GPU: 1
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.NUM_EPOCHS: 24
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.OPTIMIZER: adamw
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.LR: 0.005
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.WEIGHT_DECAY: 0.05
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.MOMENTUM: 0.9
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.MOMS: [0.95, 0.85]
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.PCT_START: 0.4
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.DIV_FACTOR: 10
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.DECAY_STEP_LIST: [35, 45]
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.LR_DECAY: 0.1
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.LR_CLIP: 1e-07
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.LR_WARMUP: False
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.WARMUP_EPOCH: 1
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.GRAD_NORM_CLIP: 35
2023-12-16 09:22:37   INFO  cfg.OPTIMIZATION.LOSS_SCALE_FP16: 4.0
2023-12-16 09:22:37   INFO  
cfg.HOOK = edict()
2023-12-16 09:22:37   INFO  
cfg.HOOK.DisableAugmentationHook = edict()
2023-12-16 09:22:37   INFO  cfg.TAG: picture_nuscenes_segmentation
2023-12-16 09:22:37   INFO  cfg.EXP_GROUP_PATH: cfgs/picture_models
2023-12-16 09:22:37   INFO  Loading GT database to shared memory
2023-12-16 09:22:42   INFO  GT database has been saved to shared memory
2023-12-16 09:22:42   INFO  Loading NuScenes dataset
2023-12-16 09:22:46   INFO  Total samples for NuScenes dataset: 28130
2023-12-16 09:22:46   INFO  DistributedDataParallel(
  (module): Cylinder3D(
    (vfe): SegVFE(
      (pfn_layers): ModuleList(
        (0): PFNLayer(
          (linear): Linear(in_features=11, out_features=64, bias=False)
          (norm): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
        (1): PFNLayer(
          (linear): Linear(in_features=128, out_features=128, bias=False)
          (norm): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
      )
    )
    (backbone_3d): DSVT(
      (input_layer): DSVTInputLayer(
        (posembed_layers): ModuleList(
          (0): ModuleList(
            (0): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=128, bias=True)
                  (1): SyncBatchNorm(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=128, out_features=128, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=128, bias=True)
                  (1): SyncBatchNorm(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=128, out_features=128, bias=True)
                )
              )
            )
            (1): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=128, bias=True)
                  (1): SyncBatchNorm(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=128, out_features=128, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=128, bias=True)
                  (1): SyncBatchNorm(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=128, out_features=128, bias=True)
                )
              )
            )
            (2): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=128, bias=True)
                  (1): SyncBatchNorm(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=128, out_features=128, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=128, bias=True)
                  (1): SyncBatchNorm(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=128, out_features=128, bias=True)
                )
              )
            )
            (3): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=128, bias=True)
                  (1): SyncBatchNorm(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=128, out_features=128, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=128, bias=True)
                  (1): SyncBatchNorm(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=128, out_features=128, bias=True)
                )
              )
            )
          )
        )
      )
      (stage_0): ModuleList(
        (0): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=128, out_features=128, bias=True)
                )
                (linear1): Linear(in_features=128, out_features=128, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=128, out_features=128, bias=True)
                (norm1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=128, out_features=128, bias=True)
                )
                (linear1): Linear(in_features=128, out_features=128, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=128, out_features=128, bias=True)
                (norm1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (1): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=128, out_features=128, bias=True)
                )
                (linear1): Linear(in_features=128, out_features=128, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=128, out_features=128, bias=True)
                (norm1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=128, out_features=128, bias=True)
                )
                (linear1): Linear(in_features=128, out_features=128, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=128, out_features=128, bias=True)
                (norm1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (2): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=128, out_features=128, bias=True)
                )
                (linear1): Linear(in_features=128, out_features=128, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=128, out_features=128, bias=True)
                (norm1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=128, out_features=128, bias=True)
                )
                (linear1): Linear(in_features=128, out_features=128, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=128, out_features=128, bias=True)
                (norm1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (3): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=128, out_features=128, bias=True)
                )
                (linear1): Linear(in_features=128, out_features=128, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=128, out_features=128, bias=True)
                (norm1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=128, out_features=128, bias=True)
                )
                (linear1): Linear(in_features=128, out_features=128, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=128, out_features=128, bias=True)
                (norm1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
      )
      (residual_norm_stage_0): ModuleList(
        (0): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((128,), eps=1e-05, elementwise_affine=True)
      )
    )
    (map_to_bev_module): None
    (pfe): None
    (backbone_2d): None
    (dense_head): Cylinder3DHead(
      (loss_lovasz): LovaszLoss()
      (loss_ce): GaussianFocalLoss()
      (conv_seg): Conv1d(128, 20, kernel_size=(1,), stride=(1,))
    )
    (point_head): None
    (roi_head): None
  )
)
2023-12-16 09:24:15   INFO  Total number of parameters: 10223517
2023-12-16 09:24:15   INFO  **********************Start training cfgs/picture_models/picture_nuscenes_segmentation(default)**********************
2023-12-16 09:25:34   INFO  epoch: 0/24, acc_iter=50, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:19:36, time_cost(all): 0:01:06/1 day, 8:01:40, loss=3.088680140194569, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=3.0841368994987755, lr=0.006332812055729315
2023-12-16 09:26:41   INFO  epoch: 0/24, acc_iter=100, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:14:42, time_cost(all): 0:02:13/1 day, 5:59:37, loss=2.95769697492117, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=0.9425586447025563, lr=0.00766562411145863
2023-12-16 09:27:47   INFO  epoch: 0/24, acc_iter=150, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:12:17, time_cost(all): 0:03:19/1 day, 8:07:38, loss=2.826713809647772, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=3.7002833226165044, lr=0.008998436167187946
2023-12-16 09:28:54   INFO  epoch: 0/24, acc_iter=200, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:14:13, time_cost(all): 0:04:26/1 day, 8:07:16, loss=2.695730644374374, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=1.7180437166504645, lr=0.010331248222917259
2023-12-16 09:30:01   INFO  epoch: 0/24, acc_iter=250, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:11:34, time_cost(all): 0:05:33/1 day, 6:34:21, loss=2.564747479100975, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=1.325430050289568, lr=0.011664060278646575
2023-12-16 09:31:07   INFO  epoch: 0/24, acc_iter=300, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:08:07, time_cost(all): 0:06:39/1 day, 8:35:18, loss=2.433764313827577, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=2.5071821198232875, lr=0.012996872334375891
2023-12-16 09:32:14   INFO  epoch: 0/24, acc_iter=350, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:09:16, time_cost(all): 0:07:46/1 day, 6:16:57, loss=2.302781148554179, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=4.456259181187217, lr=0.014329684390105204
2023-12-16 09:33:21   INFO  epoch: 0/24, acc_iter=400, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:06:32, time_cost(all): 0:08:53/1 day, 7:13:59, loss=2.17179798328078, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=1.9128316518562358, lr=0.01566249644583452
2023-12-16 09:34:27   INFO  epoch: 0/24, acc_iter=450, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:11:14, time_cost(all): 0:09:59/1 day, 8:22:41, loss=2.040814818007382, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=4.231660776999583, lr=0.016995308501563833
2023-12-16 09:35:34   INFO  epoch: 0/24, acc_iter=500, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:06:29, time_cost(all): 0:11:06/1 day, 6:32:40, loss=1.909831652733984, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=2.7107646502286915, lr=0.01832812055729315
2023-12-16 09:36:41   INFO  epoch: 0/24, acc_iter=550, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:04:57, time_cost(all): 0:12:13/1 day, 6:15:57, loss=1.778848487460585, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=1.3410918559049878, lr=0.01966093261302246
2023-12-16 09:37:47   INFO  epoch: 0/24, acc_iter=600, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:02:26, time_cost(all): 0:13:19/1 day, 8:27:41, loss=1.647865322187187, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=2.9535075397854733, lr=0.02099374466875178
2023-12-16 09:38:54   INFO  epoch: 0/24, acc_iter=650, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:00:39, time_cost(all): 0:14:26/1 day, 8:15:59, loss=1.516882156913788, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=2.0279698008840956, lr=0.022326556724481094
2023-12-16 09:40:01   INFO  epoch: 0/24, acc_iter=700, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:05:41, time_cost(all): 0:15:33/1 day, 5:53:38, loss=1.38589899164039, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=4.307354375521891, lr=0.02365936878021041
2023-12-16 09:41:07   INFO  epoch: 0/24, acc_iter=750, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:01:05, time_cost(all): 0:16:39/1 day, 5:32:17, loss=1.254915826366992, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=4.313807697810338, lr=0.024992180835939726
2023-12-16 09:42:14   INFO  epoch: 0/24, acc_iter=800, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:57:46, time_cost(all): 0:17:46/1 day, 7:22:24, loss=1.123932661093594, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=1.028921376706679, lr=0.02632499289166904
2023-12-16 09:43:21   INFO  epoch: 0/24, acc_iter=850, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:56:40, time_cost(all): 0:18:53/1 day, 5:41:24, loss=0.992949495820195, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=4.363378060090904, lr=0.027657804947398355
2023-12-16 09:44:27   INFO  epoch: 0/24, acc_iter=900, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/1:00:35, time_cost(all): 0:19:59/1 day, 5:41:48, loss=0.861966330546797, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=2.5171435889557645, lr=0.028990617003127668
2023-12-16 09:45:34   INFO  epoch: 0/24, acc_iter=950, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:56:52, time_cost(all): 0:21:06/1 day, 8:20:42, loss=0.730983165273399, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=2.260322447236881, lr=0.03032342905885698
2023-12-16 09:46:41   INFO  epoch: 0/24, acc_iter=1000, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:53:10, time_cost(all): 0:22:13/1 day, 5:36:12, loss=0.682177057574442, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=4.328788297297898, lr=0.0316562411145863
2023-12-16 09:47:47   INFO  epoch: 0/24, acc_iter=1050, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:52:45, time_cost(all): 0:23:19/1 day, 6:46:07, loss=0.599801143213951, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=1.6448243791521717, lr=0.03298905317031561
2023-12-16 09:48:54   INFO  epoch: 0/24, acc_iter=1100, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:50, time_cost(all): 0:24:26/1 day, 7:13:35, loss=0.599602286427902, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=3.7515455393631134, lr=0.034321865226044926
2023-12-16 09:50:01   INFO  epoch: 0/24, acc_iter=1150, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:53:14, time_cost(all): 0:25:33/1 day, 6:34:02, loss=0.599403429641853, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.5110090888173193, lr=0.035654677281774246
2023-12-16 09:51:07   INFO  epoch: 0/24, acc_iter=1200, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:01, time_cost(all): 0:26:39/1 day, 7:02:58, loss=0.599204572855804, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=3.854408244379587, lr=0.03698748933750356
2023-12-16 09:52:14   INFO  epoch: 0/24, acc_iter=1250, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:50:20, time_cost(all): 0:27:46/1 day, 7:14:29, loss=0.599005716069755, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.5293959718642203, lr=0.03832030139323287
2023-12-16 09:53:21   INFO  epoch: 0/24, acc_iter=1300, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:49:10, time_cost(all): 0:28:53/1 day, 5:37:32, loss=0.598806859283706, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=1.7480739290669451, lr=0.039653113448962184
2023-12-16 09:54:27   INFO  epoch: 0/24, acc_iter=1350, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:48:21, time_cost(all): 0:29:59/1 day, 5:19:07, loss=0.598608002497657, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=2.123936472016686, lr=0.0409859255046915
2023-12-16 09:55:34   INFO  epoch: 0/24, acc_iter=1400, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:46:48, time_cost(all): 0:31:06/1 day, 5:41:40, loss=0.598409145711608, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=1.569468007147552, lr=0.042318737560420816
2023-12-16 09:56:40   INFO  epoch: 0/24, acc_iter=1450, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:46:48, time_cost(all): 0:32:12/1 day, 7:11:31, loss=0.598210288925559, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=2.1128416825693623, lr=0.04365154961615013
2023-12-16 09:57:47   INFO  epoch: 0/24, acc_iter=1500, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:44:32, time_cost(all): 0:33:19/1 day, 8:11:42, loss=0.59801143213951, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=4.489962868140999, lr=0.04498436167187945
2023-12-16 09:58:54   INFO  epoch: 0/24, acc_iter=1550, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:13, time_cost(all): 0:34:26/1 day, 8:01:38, loss=0.597812575353461, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=2.3282590608452187, lr=0.04631717372760876
2023-12-16 10:00:00   INFO  epoch: 0/24, acc_iter=1600, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:42:45, time_cost(all): 0:35:32/1 day, 7:42:34, loss=0.597613718567412, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=0.8868792154056366, lr=0.047649985783338074
2023-12-16 10:01:07   INFO  epoch: 0/24, acc_iter=1650, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:14, time_cost(all): 0:36:39/1 day, 6:33:28, loss=0.597414861781363, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=0.7920648494207334, lr=0.04898279783906739
2023-12-16 10:02:14   INFO  epoch: 0/24, acc_iter=1700, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:38:45, time_cost(all): 0:37:46/1 day, 5:49:09, loss=0.597216004995314, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=4.276130546182958, lr=0.050789024736991754
2023-12-16 10:03:20   INFO  epoch: 0/24, acc_iter=1750, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:38:06, time_cost(all): 0:38:52/1 day, 7:23:08, loss=0.597017148209265, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=4.272478623552168, lr=0.054121054876315036
2023-12-16 10:04:27   INFO  epoch: 0/24, acc_iter=1800, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:38:40, time_cost(all): 0:39:59/1 day, 6:08:31, loss=0.596818291423216, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=3.087499612809145, lr=0.057453085015638325
2023-12-16 10:05:34   INFO  epoch: 0/24, acc_iter=1850, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:37:44, time_cost(all): 0:41:06/1 day, 5:17:46, loss=0.596619434637167, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=3.2116438069401254, lr=0.060785115154961614
2023-12-16 10:06:40   INFO  epoch: 0/24, acc_iter=1900, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:37:35, time_cost(all): 0:42:12/1 day, 7:33:09, loss=0.596420577851118, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=3.283010435904073, lr=0.0641171452942849
2023-12-16 10:07:47   INFO  epoch: 0/24, acc_iter=1950, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:35:21, time_cost(all): 0:43:19/1 day, 6:02:02, loss=0.596221721065069, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=3.3468053970953573, lr=0.06744917543360818
2023-12-16 10:08:54   INFO  epoch: 0/24, acc_iter=2000, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:32:42, time_cost(all): 0:44:26/1 day, 7:18:17, loss=0.59602286427902, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=1.1774952979778404, lr=0.07078120557293147
2023-12-16 10:10:00   INFO  epoch: 0/24, acc_iter=2050, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:10, time_cost(all): 0:45:32/1 day, 7:30:11, loss=0.595824007492971, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=1.694485799028453, lr=0.07411323571225475
2023-12-16 10:11:07   INFO  epoch: 0/24, acc_iter=2100, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:30:19, time_cost(all): 0:46:39/1 day, 5:36:28, loss=0.595625150706922, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=1.060991066936896, lr=0.07744526585157804
2023-12-16 10:12:14   INFO  epoch: 0/24, acc_iter=2150, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:30:32, time_cost(all): 0:47:46/1 day, 7:30:47, loss=0.595426293920873, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=2.5834059185784106, lr=0.08077729599090133
2023-12-16 10:13:20   INFO  epoch: 0/24, acc_iter=2200, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:28:41, time_cost(all): 0:48:52/1 day, 7:44:01, loss=0.595227437134824, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=2.549084643395111, lr=0.08410932613022462
2023-12-16 10:14:27   INFO  epoch: 0/24, acc_iter=2250, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:29:01, time_cost(all): 0:49:59/1 day, 6:10:19, loss=0.595028580348775, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=1.0840368652404737, lr=0.08744135626954791
2023-12-16 10:15:34   INFO  epoch: 0/24, acc_iter=2300, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:55, time_cost(all): 0:51:06/1 day, 7:21:56, loss=0.594829723562726, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=2.0829861329459622, lr=0.09077338640887118
2023-12-16 10:16:40   INFO  epoch: 0/24, acc_iter=2350, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:53, time_cost(all): 0:52:12/1 day, 7:47:59, loss=0.594630866776677, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.0952807776648688, lr=0.09410541654819447
2023-12-16 10:17:47   INFO  epoch: 0/24, acc_iter=2400, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:24:36, time_cost(all): 0:53:19/1 day, 5:14:58, loss=0.594432009990628, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.1194636618837652, lr=0.09743744668751776
2023-12-16 10:18:54   INFO  epoch: 0/24, acc_iter=2450, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:08, time_cost(all): 0:54:26/1 day, 5:38:42, loss=0.594233153204579, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=2.6495151713370557, lr=0.10076947682684105
2023-12-16 10:20:00   INFO  epoch: 0/24, acc_iter=2500, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:55, time_cost(all): 0:55:32/1 day, 6:00:29, loss=0.59403429641853, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.676969526504573, lr=0.10410150696616433
2023-12-16 10:21:07   INFO  epoch: 0/24, acc_iter=2550, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:22:00, time_cost(all): 0:56:39/1 day, 6:42:10, loss=0.593835439632481, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=4.651746548867419, lr=0.10743353710548761
2023-12-16 10:22:14   INFO  epoch: 0/24, acc_iter=2600, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:00, time_cost(all): 0:57:46/1 day, 5:08:10, loss=0.593636582846432, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=1.8304238765563532, lr=0.1107655672448109
2023-12-16 10:23:20   INFO  epoch: 0/24, acc_iter=2650, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:15, time_cost(all): 0:58:52/1 day, 5:53:53, loss=0.593437726060383, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=2.7709099006562687, lr=0.11409759738413419
2023-12-16 10:24:27   INFO  epoch: 0/24, acc_iter=2700, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:17, time_cost(all): 0:59:59/1 day, 4:48:28, loss=0.593238869274334, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=1.9620973476346077, lr=0.11742962752345748
2023-12-16 10:25:33   INFO  epoch: 0/24, acc_iter=2750, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:22, time_cost(all): 1:01:05/1 day, 6:16:56, loss=0.593040012488285, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=2.3565866522132684, lr=0.12076165766278077
2023-12-16 10:26:40   INFO  epoch: 0/24, acc_iter=2800, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:29, time_cost(all): 1:02:12/1 day, 5:54:58, loss=0.592841155702236, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=3.052871520505922, lr=0.12409368780210404
2023-12-16 10:27:47   INFO  epoch: 0/24, acc_iter=2850, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:48, time_cost(all): 1:03:19/1 day, 7:39:49, loss=0.592642298916187, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=3.2336390542888727, lr=0.12742571794142732
2023-12-16 10:28:53   INFO  epoch: 0/24, acc_iter=2900, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:29, time_cost(all): 1:04:25/1 day, 5:49:52, loss=0.592443442130138, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=4.895871880554978, lr=0.13075774808075064
2023-12-16 10:30:00   INFO  epoch: 0/24, acc_iter=2950, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:13:06, time_cost(all): 1:05:32/1 day, 5:20:19, loss=0.592244585344089, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=4.083666465229907, lr=0.1340897782200739
2023-12-16 10:31:07   INFO  epoch: 0/24, acc_iter=3000, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:35, time_cost(all): 1:06:39/1 day, 4:50:17, loss=0.59204572855804, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=4.55797044066912, lr=0.1374218083593972
2023-12-16 10:32:13   INFO  epoch: 0/24, acc_iter=3050, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:02, time_cost(all): 1:07:45/1 day, 5:35:10, loss=0.591846871771991, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=2.4683052688601728, lr=0.14075383849872047
2023-12-16 10:33:20   INFO  epoch: 0/24, acc_iter=3100, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:40, time_cost(all): 1:08:52/1 day, 4:51:37, loss=0.591648014985942, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=1.1435005881993279, lr=0.1440858686380438
2023-12-16 10:34:27   INFO  epoch: 0/24, acc_iter=3150, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:20, time_cost(all): 1:09:59/1 day, 5:10:51, loss=0.591449158199893, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=4.557160410804327, lr=0.14741789877736705
2023-12-16 10:35:33   INFO  epoch: 0/24, acc_iter=3200, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:18, time_cost(all): 1:11:05/1 day, 5:05:29, loss=0.591250301413844, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=2.827457632239956, lr=0.15074992891669034
2023-12-16 10:36:40   INFO  epoch: 0/24, acc_iter=3250, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:44, time_cost(all): 1:12:12/1 day, 5:12:34, loss=0.591051444627795, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=1.5059222263098302, lr=0.15408195905601363
2023-12-16 10:37:47   INFO  epoch: 0/24, acc_iter=3300, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:05:01, time_cost(all): 1:13:19/1 day, 5:24:01, loss=0.590852587841746, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=0.9796663358400199, lr=0.15741398919533692
2023-12-16 10:38:53   INFO  epoch: 0/24, acc_iter=3350, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:52, time_cost(all): 1:14:25/1 day, 6:52:45, loss=0.590653731055697, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=2.056463196807956, lr=0.1607460193346602
2023-12-16 10:40:00   INFO  epoch: 0/24, acc_iter=3400, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:43, time_cost(all): 1:15:32/1 day, 6:28:13, loss=0.590454874269648, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=2.005558918882272, lr=0.16407804947398347
2023-12-16 10:41:07   INFO  epoch: 0/24, acc_iter=3450, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:31, time_cost(all): 1:16:39/1 day, 5:38:30, loss=0.590256017483599, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=1.5980409502381905, lr=0.16741007961330678
2023-12-16 10:42:13   INFO  epoch: 0/24, acc_iter=3500, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:21, time_cost(all): 1:17:45/1 day, 6:04:47, loss=0.59005716069755, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=0.5023584122043795, lr=0.17074210975263004
2023-12-16 10:43:20   INFO  epoch: 1/24, acc_iter=3567, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:16:21, time_cost(all): 1:18:52/1 day, 5:07:07, loss=0.589790692604244, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=1.4766867655269094, lr=0.17520703013932326
2023-12-16 10:44:27   INFO  epoch: 1/24, acc_iter=3617, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:16:57, time_cost(all): 1:19:59/1 day, 5:26:07, loss=0.589591835818195, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=1.6691273838743357, lr=0.17853906027864652
2023-12-16 10:45:33   INFO  epoch: 1/24, acc_iter=3667, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:16:21, time_cost(all): 1:21:05/1 day, 7:16:10, loss=0.589392979032146, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=4.646089276775809, lr=0.18187109041796984
2023-12-16 10:46:40   INFO  epoch: 1/24, acc_iter=3717, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:12:31, time_cost(all): 1:22:12/1 day, 6:51:13, loss=0.589194122246097, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=4.456520583775368, lr=0.18520312055729315
2023-12-16 10:47:47   INFO  epoch: 1/24, acc_iter=3767, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:15:38, time_cost(all): 1:23:19/1 day, 6:02:42, loss=0.588995265460048, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=2.4008987763926366, lr=0.18853515069661642
2023-12-16 10:48:53   INFO  epoch: 1/24, acc_iter=3817, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:09:26, time_cost(all): 1:24:25/1 day, 6:01:57, loss=0.588796408673999, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=4.859179123234592, lr=0.19186718083593973
2023-12-16 10:50:00   INFO  epoch: 1/24, acc_iter=3867, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:48, time_cost(all): 1:25:32/1 day, 4:53:53, loss=0.58859755188795, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=4.921791730919718, lr=0.195199210975263
2023-12-16 10:51:07   INFO  epoch: 1/24, acc_iter=3917, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:07:23, time_cost(all): 1:26:39/1 day, 5:53:18, loss=0.588398695101901, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=0.7316149466391278, lr=0.1985312411145863
2023-12-16 10:52:13   INFO  epoch: 1/24, acc_iter=3967, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:09:02, time_cost(all): 1:27:45/1 day, 5:55:21, loss=0.588199838315852, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=1.5764317112063437, lr=0.20186327125390957
2023-12-16 10:53:20   INFO  epoch: 1/24, acc_iter=4017, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:04:11, time_cost(all): 1:28:52/1 day, 6:34:30, loss=0.588000981529803, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=3.3967774596565845, lr=0.20519530139323283
2023-12-16 10:54:26   INFO  epoch: 1/24, acc_iter=4067, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:04:23, time_cost(all): 1:29:58/1 day, 6:55:14, loss=0.587802124743754, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=3.163204748969187, lr=0.20852733153255615
2023-12-16 10:55:33   INFO  epoch: 1/24, acc_iter=4117, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:04:40, time_cost(all): 1:31:05/1 day, 5:24:12, loss=0.587603267957705, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=3.075435171409465, lr=0.2118593616718794
2023-12-16 10:56:40   INFO  epoch: 1/24, acc_iter=4167, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:02:43, time_cost(all): 1:32:12/1 day, 4:55:38, loss=0.587404411171656, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=3.7417919521369543, lr=0.21519139181120273
2023-12-16 10:57:46   INFO  epoch: 1/24, acc_iter=4217, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:00:10, time_cost(all): 1:33:18/1 day, 5:58:32, loss=0.587205554385607, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=3.104200478564967, lr=0.218523421950526
2023-12-16 10:58:53   INFO  epoch: 1/24, acc_iter=4267, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:59:47, time_cost(all): 1:34:25/1 day, 5:45:04, loss=0.587006697599558, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=0.8841324016209262, lr=0.2218554520898493
2023-12-16 11:00:00   INFO  epoch: 1/24, acc_iter=4317, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:00:37, time_cost(all): 1:35:32/1 day, 4:57:42, loss=0.586807840813509, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=1.1258639174180258, lr=0.22518748222917256
2023-12-16 11:01:06   INFO  epoch: 1/24, acc_iter=4367, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:59:07, time_cost(all): 1:36:38/1 day, 6:58:42, loss=0.58660898402746, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=2.5605579607020714, lr=0.22851951236849588
2023-12-16 11:02:13   INFO  epoch: 1/24, acc_iter=4417, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:58:09, time_cost(all): 1:37:45/1 day, 6:54:18, loss=0.586410127241411, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=1.6413201207138635, lr=0.23185154250781914
2023-12-16 11:03:20   INFO  epoch: 1/24, acc_iter=4467, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:55:30, time_cost(all): 1:38:52/1 day, 6:16:01, loss=0.586211270455362, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=4.972172840329096, lr=0.2351835726471424
2023-12-16 11:04:26   INFO  epoch: 1/24, acc_iter=4517, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:56:13, time_cost(all): 1:39:58/1 day, 6:32:43, loss=0.586012413669313, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=1.2049124240113853, lr=0.23851560278646572
2023-12-16 11:05:33   INFO  epoch: 1/24, acc_iter=4567, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:56:32, time_cost(all): 1:41:05/1 day, 4:40:09, loss=0.585813556883264, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=4.6626645278897465, lr=0.24184763292578898
2023-12-16 11:06:40   INFO  epoch: 1/24, acc_iter=4617, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:56:08, time_cost(all): 1:42:12/1 day, 7:01:39, loss=0.585614700097215, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=2.75130129600571, lr=0.2451796630651123
2023-12-16 11:07:46   INFO  epoch: 1/24, acc_iter=4667, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:51:38, time_cost(all): 1:43:18/1 day, 6:22:05, loss=0.585415843311166, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=4.397244053799586, lr=0.2485116932044356
2023-12-16 11:08:53   INFO  epoch: 1/24, acc_iter=4717, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:52:33, time_cost(all): 1:44:25/1 day, 4:04:41, loss=0.585216986525117, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=0.9086908644825011, lr=0.2518437233437589
2023-12-16 11:10:00   INFO  epoch: 1/24, acc_iter=4767, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:52:31, time_cost(all): 1:45:32/1 day, 5:21:26, loss=0.585018129739068, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=2.774722993414653, lr=0.25517575348308214
2023-12-16 11:11:06   INFO  epoch: 1/24, acc_iter=4817, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:51:41, time_cost(all): 1:46:38/1 day, 4:49:01, loss=0.584819272953019, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=4.232313603845647, lr=0.25850778362240545
2023-12-16 11:12:13   INFO  epoch: 1/24, acc_iter=4867, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:29, time_cost(all): 1:47:45/1 day, 4:58:31, loss=0.58462041616697, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=2.3945824246237843, lr=0.2618398137617287
2023-12-16 11:13:20   INFO  epoch: 1/24, acc_iter=4917, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:47:45, time_cost(all): 1:48:52/1 day, 5:40:35, loss=0.584421559380921, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=1.0004407594077933, lr=0.26517184390105203
2023-12-16 11:14:26   INFO  epoch: 1/24, acc_iter=4967, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:44:55, time_cost(all): 1:49:58/1 day, 4:33:27, loss=0.584222702594872, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=4.704378906821695, lr=0.2685038740403753
2023-12-16 11:15:33   INFO  epoch: 1/24, acc_iter=5017, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:46:56, time_cost(all): 1:51:05/1 day, 4:36:12, loss=0.584023845808823, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=1.7772488758052425, lr=0.2718359041796986
2023-12-16 11:16:40   INFO  epoch: 1/24, acc_iter=5067, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:43:43, time_cost(all): 1:52:12/1 day, 5:09:05, loss=0.583824989022774, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.313534911608009, lr=0.27516793431902187
2023-12-16 11:17:46   INFO  epoch: 1/24, acc_iter=5117, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:44:40, time_cost(all): 1:53:18/1 day, 6:12:17, loss=0.583626132236725, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=4.925254265077892, lr=0.27849996445834513
2023-12-16 11:18:53   INFO  epoch: 1/24, acc_iter=5167, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:42:40, time_cost(all): 1:54:25/1 day, 5:22:24, loss=0.583427275450676, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=3.2753339225700246, lr=0.28183199459766844
2023-12-16 11:20:00   INFO  epoch: 1/24, acc_iter=5217, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:41:05, time_cost(all): 1:55:32/1 day, 6:37:45, loss=0.583228418664627, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=4.203500786297537, lr=0.28516402473699176
2023-12-16 11:21:06   INFO  epoch: 1/24, acc_iter=5267, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:41:12, time_cost(all): 1:56:38/1 day, 5:13:31, loss=0.583029561878578, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=2.2576208184730007, lr=0.288496054876315
2023-12-16 11:22:13   INFO  epoch: 1/24, acc_iter=5317, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:37:05, time_cost(all): 1:57:45/1 day, 5:53:47, loss=0.582830705092529, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=2.773669381445369, lr=0.29182808501563834
2023-12-16 11:23:20   INFO  epoch: 1/24, acc_iter=5367, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:36:14, time_cost(all): 1:58:52/1 day, 4:14:55, loss=0.58263184830648, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=2.5388251436998495, lr=0.2951601151549616
2023-12-16 11:24:26   INFO  epoch: 1/24, acc_iter=5417, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:34:10, time_cost(all): 1:59:58/1 day, 6:01:08, loss=0.582432991520431, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=3.786694115975289, lr=0.29849214529428486
2023-12-16 11:25:33   INFO  epoch: 1/24, acc_iter=5467, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:36:23, time_cost(all): 2:01:05/1 day, 3:54:11, loss=0.582234134734382, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=1.9058270631328593, lr=0.3018241754336082
2023-12-16 11:26:39   INFO  epoch: 1/24, acc_iter=5517, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:32:53, time_cost(all): 2:02:11/1 day, 6:03:20, loss=0.582035277948333, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=1.3697136068174285, lr=0.30515620557293144
2023-12-16 11:27:46   INFO  epoch: 1/24, acc_iter=5567, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:58, time_cost(all): 2:03:18/1 day, 6:26:12, loss=0.581836421162284, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=2.019755344812441, lr=0.3084882357122547
2023-12-16 11:28:53   INFO  epoch: 1/24, acc_iter=5617, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:18, time_cost(all): 2:04:25/1 day, 4:36:11, loss=0.581637564376235, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=3.657338713632596, lr=0.311820265851578
2023-12-16 11:29:59   INFO  epoch: 1/24, acc_iter=5667, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:45, time_cost(all): 2:05:31/1 day, 5:09:33, loss=0.581438707590186, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=1.30514629116721, lr=0.3151522959909013
2023-12-16 11:31:06   INFO  epoch: 1/24, acc_iter=5717, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:30:40, time_cost(all): 2:06:38/1 day, 4:17:07, loss=0.581239850804137, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=2.8993825592064466, lr=0.3184843261302246
2023-12-16 11:32:13   INFO  epoch: 1/24, acc_iter=5767, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:23, time_cost(all): 2:07:45/1 day, 4:58:59, loss=0.581040994018088, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=1.7771758529333366, lr=0.32181635626954785
2023-12-16 11:33:19   INFO  epoch: 1/24, acc_iter=5817, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:43, time_cost(all): 2:08:51/1 day, 5:41:37, loss=0.580842137232039, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=1.1235552486274545, lr=0.3251483864088711
2023-12-16 11:34:26   INFO  epoch: 1/24, acc_iter=5867, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:48, time_cost(all): 2:09:58/1 day, 6:14:14, loss=0.58064328044599, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=2.76477412205334, lr=0.32848041654819443
2023-12-16 11:35:33   INFO  epoch: 1/24, acc_iter=5917, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:24:14, time_cost(all): 2:11:05/1 day, 5:58:11, loss=0.580444423659941, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=2.6130192070852925, lr=0.3318124466875177
2023-12-16 11:36:39   INFO  epoch: 1/24, acc_iter=5967, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:49, time_cost(all): 2:12:11/1 day, 4:05:25, loss=0.580245566873892, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=4.861091980429379, lr=0.335144476826841
2023-12-16 11:37:46   INFO  epoch: 1/24, acc_iter=6017, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:53, time_cost(all): 2:13:18/1 day, 6:25:08, loss=0.580046710087843, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=1.4096799542090472, lr=0.3384765069661643
2023-12-16 11:38:53   INFO  epoch: 1/24, acc_iter=6067, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:22:19, time_cost(all): 2:14:25/1 day, 5:11:08, loss=0.579847853301794, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.149865135936663, lr=0.3418085371054876
2023-12-16 11:39:59   INFO  epoch: 1/24, acc_iter=6117, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:21:06, time_cost(all): 2:15:31/1 day, 3:35:51, loss=0.579648996515745, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=1.6771539891410614, lr=0.3451405672448109
2023-12-16 11:41:06   INFO  epoch: 1/24, acc_iter=6167, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:36, time_cost(all): 2:16:38/1 day, 4:26:54, loss=0.579450139729696, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=4.597464987954365, lr=0.34847259738413416
2023-12-16 11:42:13   INFO  epoch: 1/24, acc_iter=6217, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:15, time_cost(all): 2:17:45/1 day, 3:34:30, loss=0.579251282943647, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.585143146828827, lr=0.3518046275234575
2023-12-16 11:43:19   INFO  epoch: 1/24, acc_iter=6267, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:21, time_cost(all): 2:18:51/1 day, 6:22:44, loss=0.579052426157598, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=2.626199910919392, lr=0.35513665766278074
2023-12-16 11:44:26   INFO  epoch: 1/24, acc_iter=6317, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:36, time_cost(all): 2:19:58/1 day, 3:36:15, loss=0.578853569371549, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=2.928818563178162, lr=0.358468687802104
2023-12-16 11:45:33   INFO  epoch: 1/24, acc_iter=6367, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:34, time_cost(all): 2:21:05/1 day, 5:11:36, loss=0.5786547125855, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=3.514225226302755, lr=0.3618007179414273
2023-12-16 11:46:39   INFO  epoch: 1/24, acc_iter=6417, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:33, time_cost(all): 2:22:11/1 day, 5:07:57, loss=0.578455855799451, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=3.251225769455187, lr=0.3651327480807506
2023-12-16 11:47:46   INFO  epoch: 1/24, acc_iter=6467, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:30, time_cost(all): 2:23:18/1 day, 5:31:01, loss=0.578256999013402, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=3.7566992303697195, lr=0.3684647782200739
2023-12-16 11:48:53   INFO  epoch: 1/24, acc_iter=6517, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:29, time_cost(all): 2:24:25/1 day, 5:53:03, loss=0.578058142227353, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=4.3322351182159515, lr=0.37179680835939716
2023-12-16 11:49:59   INFO  epoch: 1/24, acc_iter=6567, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:29, time_cost(all): 2:25:31/1 day, 3:32:32, loss=0.577859285441304, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=3.473978693330051, lr=0.3751288384987204
2023-12-16 11:51:06   INFO  epoch: 1/24, acc_iter=6617, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:35, time_cost(all): 2:26:38/1 day, 3:26:10, loss=0.577660428655255, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=1.8969817558262856, lr=0.37846086863804373
2023-12-16 11:52:13   INFO  epoch: 1/24, acc_iter=6667, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:07:45, time_cost(all): 2:27:45/1 day, 4:40:44, loss=0.577461571869206, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=2.673605257816428, lr=0.381792898777367
2023-12-16 11:53:19   INFO  epoch: 1/24, acc_iter=6717, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:52, time_cost(all): 2:28:51/1 day, 3:22:24, loss=0.577262715083157, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=0.5652285512723059, lr=0.3851249289166903
2023-12-16 11:54:26   INFO  epoch: 1/24, acc_iter=6767, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:03, time_cost(all): 2:29:58/1 day, 6:09:14, loss=0.577063858297108, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=2.8256800206461405, lr=0.3884569590560136
2023-12-16 11:55:32   INFO  epoch: 1/24, acc_iter=6817, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:38, time_cost(all): 2:31:04/1 day, 3:49:11, loss=0.576865001511059, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=2.332335215848569, lr=0.3917889891953369
2023-12-16 11:56:39   INFO  epoch: 1/24, acc_iter=6867, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:47, time_cost(all): 2:32:11/1 day, 4:46:11, loss=0.57666614472501, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=3.2957465792229277, lr=0.3951210193346602
2023-12-16 11:57:46   INFO  epoch: 1/24, acc_iter=6917, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:42, time_cost(all): 2:33:18/1 day, 3:52:13, loss=0.576467287938961, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.19(1.03), norm=4.365046731418189, lr=0.39845304947398347
2023-12-16 11:58:52   INFO  epoch: 1/24, acc_iter=6967, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:27, time_cost(all): 2:34:24/1 day, 5:36:15, loss=0.576268431152912, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=4.993941284041771, lr=0.40178507961330673
2023-12-16 11:59:59   INFO  epoch: 1/24, acc_iter=7017, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 2:35:31/1 day, 4:32:23, loss=0.576069574366863, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=2.6034897301277606, lr=0.40511710975263004
2023-12-16 12:01:06   INFO  epoch: 2/24, acc_iter=7084, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:14:22, time_cost(all): 2:36:38/1 day, 4:23:33, loss=0.575803106273557, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=0.5141108651704653, lr=0.4095820301393232
2023-12-16 12:02:12   INFO  epoch: 2/24, acc_iter=7134, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:18:44, time_cost(all): 2:37:44/1 day, 5:54:49, loss=0.575604249487509, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=0.825070928271951, lr=0.4129140602786465
2023-12-16 12:03:19   INFO  epoch: 2/24, acc_iter=7184, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:13:55, time_cost(all): 2:38:51/1 day, 5:16:56, loss=0.57540539270146, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=4.994207790208278, lr=0.4162460904179698
2023-12-16 12:04:26   INFO  epoch: 2/24, acc_iter=7234, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:11:35, time_cost(all): 2:39:58/1 day, 4:26:56, loss=0.57520653591541, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=3.3230489109425796, lr=0.41957812055729304
2023-12-16 12:05:32   INFO  epoch: 2/24, acc_iter=7284, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:11:36, time_cost(all): 2:41:04/1 day, 4:27:48, loss=0.575007679129361, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=0.7005414776090999, lr=0.4229101506966164
2023-12-16 12:06:39   INFO  epoch: 2/24, acc_iter=7334, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:10:15, time_cost(all): 2:42:11/1 day, 5:52:56, loss=0.574808822343313, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=4.3643301205315, lr=0.4262421808359397
2023-12-16 12:07:46   INFO  epoch: 2/24, acc_iter=7384, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:13:21, time_cost(all): 2:43:18/1 day, 4:50:00, loss=0.574609965557264, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=3.613526903820736, lr=0.42957421097526294
2023-12-16 12:08:52   INFO  epoch: 2/24, acc_iter=7434, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:10:58, time_cost(all): 2:44:24/1 day, 3:48:35, loss=0.574411108771214, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=1.7932680169642852, lr=0.43290624111458625
2023-12-16 12:09:59   INFO  epoch: 2/24, acc_iter=7484, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:10:35, time_cost(all): 2:45:31/1 day, 5:47:22, loss=0.574212251985165, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=1.8964938867115102, lr=0.4362382712539095
2023-12-16 12:11:06   INFO  epoch: 2/24, acc_iter=7534, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:04:47, time_cost(all): 2:46:38/1 day, 3:18:36, loss=0.574013395199116, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=3.921761787399544, lr=0.43957030139323283
2023-12-16 12:12:12   INFO  epoch: 2/24, acc_iter=7584, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:08:42, time_cost(all): 2:47:44/1 day, 3:09:09, loss=0.573814538413068, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=0.8521175182852685, lr=0.4429023315325561
2023-12-16 12:13:19   INFO  epoch: 2/24, acc_iter=7634, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:01:54, time_cost(all): 2:48:51/1 day, 3:29:11, loss=0.573615681627018, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=3.417137431730158, lr=0.44623436167187935
2023-12-16 12:14:26   INFO  epoch: 2/24, acc_iter=7684, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:00:41, time_cost(all): 2:49:58/1 day, 3:46:34, loss=0.573416824840969, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=2.8585189391317423, lr=0.44956639181120267
2023-12-16 12:15:32   INFO  epoch: 2/24, acc_iter=7734, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:02:22, time_cost(all): 2:51:04/1 day, 3:27:38, loss=0.573217968054921, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=3.836567144114181, lr=0.45289842195052593
2023-12-16 12:16:39   INFO  epoch: 2/24, acc_iter=7784, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:59:22, time_cost(all): 2:52:11/1 day, 3:40:26, loss=0.573019111268872, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=2.421277112666632, lr=0.45623045208984925
2023-12-16 12:17:46   INFO  epoch: 2/24, acc_iter=7834, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:58:45, time_cost(all): 2:53:18/1 day, 5:06:12, loss=0.572820254482822, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=2.987182753586543, lr=0.4595624822291725
2023-12-16 12:18:52   INFO  epoch: 2/24, acc_iter=7884, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:57:22, time_cost(all): 2:54:24/1 day, 5:36:12, loss=0.572621397696773, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=4.032808550290138, lr=0.46289451236849577
2023-12-16 12:19:59   INFO  epoch: 2/24, acc_iter=7934, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/1:00:01, time_cost(all): 2:55:31/1 day, 4:17:45, loss=0.572422540910724, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=0.9190604198499848, lr=0.4662265425078191
2023-12-16 12:21:06   INFO  epoch: 2/24, acc_iter=7984, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:55:18, time_cost(all): 2:56:38/1 day, 5:15:10, loss=0.572223684124676, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=1.6546377392523828, lr=0.46955857264714235
2023-12-16 12:22:12   INFO  epoch: 2/24, acc_iter=8034, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:57:23, time_cost(all): 2:57:44/1 day, 3:08:16, loss=0.572024827338626, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=0.5700169518582736, lr=0.47289060278646566
2023-12-16 12:23:19   INFO  epoch: 2/24, acc_iter=8084, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:53:30, time_cost(all): 2:58:51/1 day, 4:08:37, loss=0.571825970552577, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=1.0027678616064097, lr=0.476222632925789
2023-12-16 12:24:25   INFO  epoch: 2/24, acc_iter=8134, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:08, time_cost(all): 2:59:57/1 day, 2:59:26, loss=0.571627113766528, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=3.88091048300034, lr=0.47955466306511224
2023-12-16 12:25:32   INFO  epoch: 2/24, acc_iter=8184, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:54:25, time_cost(all): 3:01:04/1 day, 4:45:25, loss=0.571428256980479, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=1.8666634635757662, lr=0.48288669320443556
2023-12-16 12:26:39   INFO  epoch: 2/24, acc_iter=8234, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:25, time_cost(all): 3:02:11/1 day, 4:35:23, loss=0.57122940019443, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=1.8619568886259488, lr=0.4862187233437588
2023-12-16 12:27:45   INFO  epoch: 2/24, acc_iter=8284, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:51:11, time_cost(all): 3:03:17/1 day, 5:05:13, loss=0.571030543408381, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.962930622895435, lr=0.48955075348308214
2023-12-16 12:28:52   INFO  epoch: 2/24, acc_iter=8334, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:51:21, time_cost(all): 3:04:24/1 day, 4:05:19, loss=0.570831686622332, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=1.9269325107276098, lr=0.4928827836224054
2023-12-16 12:29:59   INFO  epoch: 2/24, acc_iter=8384, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:48:44, time_cost(all): 3:05:31/1 day, 4:38:08, loss=0.570632829836283, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.11(1.03), norm=1.9721635935847954, lr=0.49621481376172866
2023-12-16 12:31:05   INFO  epoch: 2/24, acc_iter=8434, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:47:09, time_cost(all): 3:06:37/1 day, 4:41:46, loss=0.570433973050234, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=4.8278434820980065, lr=0.499546843901052
2023-12-16 12:32:12   INFO  epoch: 2/24, acc_iter=8484, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:45:42, time_cost(all): 3:07:44/1 day, 5:27:44, loss=0.570235116264185, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=4.43625176091822, lr=0.4996756198264366
2023-12-16 12:33:19   INFO  epoch: 2/24, acc_iter=8534, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:44:36, time_cost(all): 3:08:51/1 day, 3:32:02, loss=0.570036259478136, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=4.764846410392253, lr=0.4993001798107382
2023-12-16 12:34:25   INFO  epoch: 2/24, acc_iter=8584, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:43:51, time_cost(all): 3:09:57/1 day, 3:58:28, loss=0.569837402692087, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=3.0659725661821913, lr=0.4989247397950398
2023-12-16 12:35:32   INFO  epoch: 2/24, acc_iter=8634, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:40:29, time_cost(all): 3:11:04/1 day, 4:47:48, loss=0.569638545906038, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.83(1.03), norm=4.488754719732048, lr=0.49854929977934137
2023-12-16 12:36:39   INFO  epoch: 2/24, acc_iter=8684, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:48, time_cost(all): 3:12:11/1 day, 5:23:31, loss=0.569439689119989, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=1.1840405856178335, lr=0.498173859763643
2023-12-16 12:37:45   INFO  epoch: 2/24, acc_iter=8734, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:41:37, time_cost(all): 3:13:17/1 day, 4:03:19, loss=0.56924083233394, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=1.1489817224639824, lr=0.4977984197479446
2023-12-16 12:38:52   INFO  epoch: 2/24, acc_iter=8784, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:39:05, time_cost(all): 3:14:24/1 day, 2:52:19, loss=0.569041975547891, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=2.586464615770396, lr=0.4974229797322462
2023-12-16 12:39:59   INFO  epoch: 2/24, acc_iter=8834, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:36:57, time_cost(all): 3:15:31/1 day, 3:22:40, loss=0.568843118761842, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=1.627229882703204, lr=0.4970475397165478
2023-12-16 12:41:05   INFO  epoch: 2/24, acc_iter=8884, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:37:13, time_cost(all): 3:16:37/1 day, 5:07:20, loss=0.568644261975793, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=2.245167965275539, lr=0.4966720997008494
2023-12-16 12:42:12   INFO  epoch: 2/24, acc_iter=8934, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:36:08, time_cost(all): 3:17:44/1 day, 3:02:09, loss=0.568445405189744, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=3.6752002610020873, lr=0.496296659685151
2023-12-16 12:43:19   INFO  epoch: 2/24, acc_iter=8984, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:33:21, time_cost(all): 3:18:51/1 day, 4:35:16, loss=0.568246548403695, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=4.52505807365975, lr=0.49592121966945263
2023-12-16 12:44:25   INFO  epoch: 2/24, acc_iter=9034, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:34, time_cost(all): 3:19:57/1 day, 3:21:06, loss=0.568047691617646, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=2.8948542502292782, lr=0.49554577965375424
2023-12-16 12:45:32   INFO  epoch: 2/24, acc_iter=9084, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:06, time_cost(all): 3:21:04/1 day, 2:46:26, loss=0.567848834831597, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=1.4489862151612836, lr=0.4951703396380558
2023-12-16 12:46:39   INFO  epoch: 2/24, acc_iter=9134, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:32:53, time_cost(all): 3:22:11/1 day, 3:34:28, loss=0.567649978045548, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=2.611167142893478, lr=0.4947948996223574
2023-12-16 12:47:45   INFO  epoch: 2/24, acc_iter=9184, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:53, time_cost(all): 3:23:17/1 day, 5:14:06, loss=0.567451121259499, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=3.403065900256192, lr=0.494419459606659
2023-12-16 12:48:52   INFO  epoch: 2/24, acc_iter=9234, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:30:37, time_cost(all): 3:24:24/1 day, 4:28:08, loss=0.56725226447345, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=3.2892961405531778, lr=0.4940440195909606
2023-12-16 12:49:59   INFO  epoch: 2/24, acc_iter=9284, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:07, time_cost(all): 3:25:31/1 day, 4:40:46, loss=0.567053407687401, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=3.3795679580779097, lr=0.4936685795752622
2023-12-16 12:51:05   INFO  epoch: 2/24, acc_iter=9334, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:27:59, time_cost(all): 3:26:37/1 day, 3:33:01, loss=0.566854550901352, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=1.6326614641819654, lr=0.49329313955956383
2023-12-16 12:52:12   INFO  epoch: 2/24, acc_iter=9384, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:36, time_cost(all): 3:27:44/1 day, 3:50:51, loss=0.566655694115303, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=4.279909855942394, lr=0.49291769954386544
2023-12-16 12:53:18   INFO  epoch: 2/24, acc_iter=9434, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:27, time_cost(all): 3:28:50/1 day, 4:49:10, loss=0.566456837329254, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=4.312538793056701, lr=0.49254225952816705
2023-12-16 12:54:25   INFO  epoch: 2/24, acc_iter=9484, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:03, time_cost(all): 3:29:57/1 day, 3:05:18, loss=0.566257980543205, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=1.934620182712693, lr=0.4921668195124686
2023-12-16 12:55:32   INFO  epoch: 2/24, acc_iter=9534, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:04, time_cost(all): 3:31:04/1 day, 5:03:50, loss=0.566059123757156, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=3.8766154461960545, lr=0.4917913794967702
2023-12-16 12:56:38   INFO  epoch: 2/24, acc_iter=9584, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:21:02, time_cost(all): 3:32:10/1 day, 3:36:58, loss=0.565860266971107, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=2.8020188625156064, lr=0.4914159394810718
2023-12-16 12:57:45   INFO  epoch: 2/24, acc_iter=9634, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:58, time_cost(all): 3:33:17/1 day, 2:53:35, loss=0.565661410185058, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=3.6873664844217124, lr=0.4910404994653734
2023-12-16 12:58:52   INFO  epoch: 2/24, acc_iter=9684, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:41, time_cost(all): 3:34:24/1 day, 3:36:24, loss=0.565462553399009, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=0.5681945077368153, lr=0.49066505944967503
2023-12-16 12:59:58   INFO  epoch: 2/24, acc_iter=9734, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:53, time_cost(all): 3:35:30/1 day, 4:00:26, loss=0.56526369661296, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=2.989997803532598, lr=0.49028961943397664
2023-12-16 13:01:05   INFO  epoch: 2/24, acc_iter=9784, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:46, time_cost(all): 3:36:37/1 day, 2:51:08, loss=0.565064839826911, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.83(1.03), norm=1.7922352924068357, lr=0.48991417941827825
2023-12-16 13:02:12   INFO  epoch: 2/24, acc_iter=9834, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:14, time_cost(all): 3:37:44/1 day, 2:48:11, loss=0.564865983040862, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=0.826787429669611, lr=0.48953873940257986
2023-12-16 13:03:18   INFO  epoch: 2/24, acc_iter=9884, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:30, time_cost(all): 3:38:50/1 day, 2:36:42, loss=0.564667126254813, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=3.480472168159497, lr=0.4891632993868814
2023-12-16 13:04:25   INFO  epoch: 2/24, acc_iter=9934, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:04, time_cost(all): 3:39:57/1 day, 3:57:40, loss=0.564468269468764, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=3.6054899883212093, lr=0.488787859371183
2023-12-16 13:05:32   INFO  epoch: 2/24, acc_iter=9984, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:46, time_cost(all): 3:41:04/1 day, 2:47:31, loss=0.564269412682715, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=4.609303746668827, lr=0.48841241935548463
2023-12-16 13:06:38   INFO  epoch: 2/24, acc_iter=10034, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:10, time_cost(all): 3:42:10/1 day, 3:23:36, loss=0.564070555896666, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=2.651866520929662, lr=0.48803697933978624
2023-12-16 13:07:45   INFO  epoch: 2/24, acc_iter=10084, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:11, time_cost(all): 3:43:17/1 day, 2:44:15, loss=0.563871699110617, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=1.5986419027519911, lr=0.48766153932408784
2023-12-16 13:08:52   INFO  epoch: 2/24, acc_iter=10134, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:42, time_cost(all): 3:44:24/1 day, 2:56:50, loss=0.563672842324568, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=3.5867204082627784, lr=0.48728609930838945
2023-12-16 13:09:58   INFO  epoch: 2/24, acc_iter=10184, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:31, time_cost(all): 3:45:30/1 day, 3:55:44, loss=0.563473985538519, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=4.214924499013949, lr=0.48691065929269106
2023-12-16 13:11:05   INFO  epoch: 2/24, acc_iter=10234, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:04, time_cost(all): 3:46:37/1 day, 3:48:45, loss=0.56327512875247, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=4.198005394433126, lr=0.48653521927699267
2023-12-16 13:12:12   INFO  epoch: 2/24, acc_iter=10284, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:44, time_cost(all): 3:47:44/1 day, 4:16:22, loss=0.563076271966421, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=4.823028343492581, lr=0.4861597792612942
2023-12-16 13:13:18   INFO  epoch: 2/24, acc_iter=10334, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:52, time_cost(all): 3:48:50/1 day, 2:51:20, loss=0.562877415180372, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=2.618867598599994, lr=0.48578433924559583
2023-12-16 13:14:25   INFO  epoch: 2/24, acc_iter=10384, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:37, time_cost(all): 3:49:57/1 day, 4:11:54, loss=0.562678558394323, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=2.124802290397389, lr=0.48540889922989744
2023-12-16 13:15:32   INFO  epoch: 2/24, acc_iter=10434, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:42, time_cost(all): 3:51:04/1 day, 3:19:42, loss=0.562479701608274, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=4.39962421889711, lr=0.48503345921419905
2023-12-16 13:16:38   INFO  epoch: 2/24, acc_iter=10484, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:27, time_cost(all): 3:52:10/1 day, 3:56:15, loss=0.562280844822225, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=3.7218743053589898, lr=0.48465801919850066
2023-12-16 13:17:45   INFO  epoch: 2/24, acc_iter=10534, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 3:53:17/1 day, 4:22:52, loss=0.562081988036176, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=2.377280259539506, lr=0.48428257918280226
2023-12-16 13:18:52   INFO  epoch: 3/24, acc_iter=10601, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:16:17, time_cost(all): 3:54:24/1 day, 4:31:02, loss=0.561815519942871, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=4.953526197381794, lr=0.4837794895617664
2023-12-16 13:19:58   INFO  epoch: 3/24, acc_iter=10651, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:12:14, time_cost(all): 3:55:30/1 day, 2:10:07, loss=0.561616663156822, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=4.6591982946911426, lr=0.483404049546068
2023-12-16 13:21:05   INFO  epoch: 3/24, acc_iter=10701, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:13:58, time_cost(all): 3:56:37/1 day, 3:14:36, loss=0.561417806370773, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=4.984012211226186, lr=0.4830286095303696
2023-12-16 13:22:12   INFO  epoch: 3/24, acc_iter=10751, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:14:10, time_cost(all): 3:57:44/1 day, 2:26:08, loss=0.561218949584724, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=1.1868106402779297, lr=0.4826531695146712
2023-12-16 13:23:18   INFO  epoch: 3/24, acc_iter=10801, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:15:21, time_cost(all): 3:58:50/1 day, 2:16:15, loss=0.561020092798675, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=3.281347116306471, lr=0.4822777294989728
2023-12-16 13:24:25   INFO  epoch: 3/24, acc_iter=10851, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:13:37, time_cost(all): 3:59:57/1 day, 2:27:00, loss=0.560821236012626, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.91(1.03), norm=2.7889281425152346, lr=0.4819022894832744
2023-12-16 13:25:31   INFO  epoch: 3/24, acc_iter=10901, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:11:30, time_cost(all): 4:01:03/1 day, 2:26:46, loss=0.560622379226577, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=4.933423624398755, lr=0.481526849467576
2023-12-16 13:26:38   INFO  epoch: 3/24, acc_iter=10951, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:09:28, time_cost(all): 4:02:10/1 day, 3:34:35, loss=0.560423522440528, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=0.9236423771988609, lr=0.4811514094518776
2023-12-16 13:27:45   INFO  epoch: 3/24, acc_iter=11001, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:06:22, time_cost(all): 4:03:17/1 day, 2:42:01, loss=0.560224665654479, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=1.7092053560613825, lr=0.48077596943617923
2023-12-16 13:28:51   INFO  epoch: 3/24, acc_iter=11051, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:08:31, time_cost(all): 4:04:23/1 day, 4:15:16, loss=0.56002580886843, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=4.969509539484591, lr=0.48040052942048084
2023-12-16 13:29:58   INFO  epoch: 3/24, acc_iter=11101, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:03:20, time_cost(all): 4:05:30/1 day, 3:23:31, loss=0.559826952082381, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=0.7455794505054758, lr=0.4800250894047824
2023-12-16 13:31:05   INFO  epoch: 3/24, acc_iter=11151, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:01:35, time_cost(all): 4:06:37/1 day, 3:56:16, loss=0.559628095296332, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=4.222472802543863, lr=0.479649649389084
2023-12-16 13:32:11   INFO  epoch: 3/24, acc_iter=11201, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:06:10, time_cost(all): 4:07:43/1 day, 4:02:23, loss=0.559429238510283, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=3.202576207811305, lr=0.4792742093733856
2023-12-16 13:33:18   INFO  epoch: 3/24, acc_iter=11251, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:01:21, time_cost(all): 4:08:50/1 day, 2:22:13, loss=0.559230381724234, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=2.779990936860194, lr=0.4788987693576872
2023-12-16 13:34:25   INFO  epoch: 3/24, acc_iter=11301, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:02:29, time_cost(all): 4:09:57/1 day, 3:40:11, loss=0.559031524938185, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=3.291646371188859, lr=0.4785233293419888
2023-12-16 13:35:31   INFO  epoch: 3/24, acc_iter=11351, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:00:23, time_cost(all): 4:11:03/1 day, 1:46:06, loss=0.558832668152136, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=2.147957505175278, lr=0.47814788932629043
2023-12-16 13:36:38   INFO  epoch: 3/24, acc_iter=11401, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:58:13, time_cost(all): 4:12:10/1 day, 2:33:41, loss=0.558633811366087, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=1.9846495814638943, lr=0.47777244931059204
2023-12-16 13:37:45   INFO  epoch: 3/24, acc_iter=11451, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:56:01, time_cost(all): 4:13:17/1 day, 3:27:01, loss=0.558434954580038, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=1.5140779543330674, lr=0.47739700929489365
2023-12-16 13:38:51   INFO  epoch: 3/24, acc_iter=11501, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:57:31, time_cost(all): 4:14:23/1 day, 3:53:48, loss=0.558236097793989, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=2.1968439437672638, lr=0.4770215692791952
2023-12-16 13:39:58   INFO  epoch: 3/24, acc_iter=11551, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:55:22, time_cost(all): 4:15:30/1 day, 2:02:01, loss=0.55803724100794, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=2.7253177426068422, lr=0.4766461292634968
2023-12-16 13:41:05   INFO  epoch: 3/24, acc_iter=11601, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:54:42, time_cost(all): 4:16:37/1 day, 3:10:58, loss=0.557838384221891, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=3.464393646508307, lr=0.4762706892477984
2023-12-16 13:42:11   INFO  epoch: 3/24, acc_iter=11651, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:54:53, time_cost(all): 4:17:43/1 day, 3:03:54, loss=0.557639527435842, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=4.885801325700836, lr=0.4758952492321
2023-12-16 13:43:18   INFO  epoch: 3/24, acc_iter=11701, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:52:23, time_cost(all): 4:18:50/1 day, 3:09:14, loss=0.557440670649793, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=2.7647102861105277, lr=0.47551980921640163
2023-12-16 13:44:25   INFO  epoch: 3/24, acc_iter=11751, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:52:57, time_cost(all): 4:19:57/1 day, 3:27:37, loss=0.557241813863744, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=2.0433331068369633, lr=0.47514436920070324
2023-12-16 13:45:31   INFO  epoch: 3/24, acc_iter=11801, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:47:54, time_cost(all): 4:21:03/1 day, 2:28:55, loss=0.557042957077695, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=0.836912927126118, lr=0.47476892918500485
2023-12-16 13:46:38   INFO  epoch: 3/24, acc_iter=11851, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:49:29, time_cost(all): 4:22:10/1 day, 4:10:43, loss=0.556844100291646, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=0.8700722345195493, lr=0.47439348916930646
2023-12-16 13:47:45   INFO  epoch: 3/24, acc_iter=11901, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:46:21, time_cost(all): 4:23:17/1 day, 4:05:28, loss=0.556645243505597, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=4.308401110308009, lr=0.474018049153608
2023-12-16 13:48:51   INFO  epoch: 3/24, acc_iter=11951, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:48:43, time_cost(all): 4:24:23/1 day, 3:29:41, loss=0.556446386719548, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=3.665924103018042, lr=0.4736426091379096
2023-12-16 13:49:58   INFO  epoch: 3/24, acc_iter=12001, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:47:19, time_cost(all): 4:25:30/1 day, 2:02:41, loss=0.556247529933499, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=2.037107189722094, lr=0.47326716912221123
2023-12-16 13:51:05   INFO  epoch: 3/24, acc_iter=12051, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:42:48, time_cost(all): 4:26:37/1 day, 2:47:08, loss=0.55604867314745, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=4.914009284538752, lr=0.47289172910651284
2023-12-16 13:52:11   INFO  epoch: 3/24, acc_iter=12101, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:59, time_cost(all): 4:27:43/1 day, 1:43:57, loss=0.555849816361401, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=4.374966404408501, lr=0.47251628909081445
2023-12-16 13:53:18   INFO  epoch: 3/24, acc_iter=12151, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:43:06, time_cost(all): 4:28:50/1 day, 2:44:56, loss=0.555650959575352, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.0711920700849893, lr=0.47214084907511605
2023-12-16 13:54:24   INFO  epoch: 3/24, acc_iter=12201, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:41:20, time_cost(all): 4:29:56/1 day, 2:31:43, loss=0.555452102789303, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=2.9713836362757684, lr=0.47176540905941766
2023-12-16 13:55:31   INFO  epoch: 3/24, acc_iter=12251, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:42:16, time_cost(all): 4:31:03/1 day, 2:25:21, loss=0.555253246003254, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=0.7690517352572015, lr=0.47138996904371927
2023-12-16 13:56:38   INFO  epoch: 3/24, acc_iter=12301, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:40:39, time_cost(all): 4:32:10/1 day, 1:46:46, loss=0.555054389217205, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.0163289577067056, lr=0.4710145290280209
2023-12-16 13:57:44   INFO  epoch: 3/24, acc_iter=12351, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:39:57, time_cost(all): 4:33:16/1 day, 2:23:25, loss=0.554855532431156, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=0.5668335030264681, lr=0.47063908901232243
2023-12-16 13:58:51   INFO  epoch: 3/24, acc_iter=12401, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:37:21, time_cost(all): 4:34:23/1 day, 1:30:43, loss=0.554656675645107, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=2.1175524748117702, lr=0.47026364899662404
2023-12-16 13:59:58   INFO  epoch: 3/24, acc_iter=12451, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:35:17, time_cost(all): 4:35:30/1 day, 3:28:30, loss=0.554457818859058, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=3.1814662554661055, lr=0.46988820898092565
2023-12-16 14:01:04   INFO  epoch: 3/24, acc_iter=12501, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:35:26, time_cost(all): 4:36:36/1 day, 2:57:37, loss=0.554258962073009, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.8589212364474546, lr=0.46951276896522726
2023-12-16 14:02:11   INFO  epoch: 3/24, acc_iter=12551, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:23, time_cost(all): 4:37:43/1 day, 3:07:01, loss=0.55406010528696, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=3.843813405789447, lr=0.46913732894952886
2023-12-16 14:03:18   INFO  epoch: 3/24, acc_iter=12601, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:41, time_cost(all): 4:38:50/1 day, 3:15:50, loss=0.553861248500911, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=4.766608289750313, lr=0.4687618889338305
2023-12-16 14:04:24   INFO  epoch: 3/24, acc_iter=12651, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:43, time_cost(all): 4:39:56/1 day, 2:44:08, loss=0.553662391714862, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=3.659678667288821, lr=0.4683864489181321
2023-12-16 14:05:31   INFO  epoch: 3/24, acc_iter=12701, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:31:10, time_cost(all): 4:41:03/1 day, 1:29:19, loss=0.553463534928813, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.515041706412608, lr=0.46801100890243363
2023-12-16 14:06:38   INFO  epoch: 3/24, acc_iter=12751, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:30:41, time_cost(all): 4:42:10/1 day, 1:26:57, loss=0.553264678142764, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=4.548216071683377, lr=0.46763556888673524
2023-12-16 14:07:44   INFO  epoch: 3/24, acc_iter=12801, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:33, time_cost(all): 4:43:16/1 day, 3:26:53, loss=0.553065821356715, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=0.6891272121214844, lr=0.46726012887103685
2023-12-16 14:08:51   INFO  epoch: 3/24, acc_iter=12851, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:27:24, time_cost(all): 4:44:23/1 day, 3:04:18, loss=0.552866964570666, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=4.524786052258927, lr=0.46688468885533846
2023-12-16 14:09:58   INFO  epoch: 3/24, acc_iter=12901, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:57, time_cost(all): 4:45:30/1 day, 2:35:09, loss=0.552668107784617, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=1.1550610252637217, lr=0.46650924883964007
2023-12-16 14:11:04   INFO  epoch: 3/24, acc_iter=12951, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:24:09, time_cost(all): 4:46:36/1 day, 1:10:14, loss=0.552469250998568, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=3.9926141603234373, lr=0.4661338088239417
2023-12-16 14:12:11   INFO  epoch: 3/24, acc_iter=13001, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:24, time_cost(all): 4:47:43/1 day, 2:21:00, loss=0.552270394212519, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=1.1768229468578812, lr=0.4657583688082433
2023-12-16 14:13:18   INFO  epoch: 3/24, acc_iter=13051, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:39, time_cost(all): 4:48:50/1 day, 3:07:50, loss=0.55207153742647, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=1.3362355117500115, lr=0.4653829287925449
2023-12-16 14:14:24   INFO  epoch: 3/24, acc_iter=13101, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:21:52, time_cost(all): 4:49:56/1 day, 2:36:00, loss=0.551872680640421, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=1.4693814228212005, lr=0.4650074887768465
2023-12-16 14:15:31   INFO  epoch: 3/24, acc_iter=13151, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:54, time_cost(all): 4:51:03/1 day, 1:23:00, loss=0.551673823854372, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.8070541390864165, lr=0.46463204876114805
2023-12-16 14:16:38   INFO  epoch: 3/24, acc_iter=13201, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:59, time_cost(all): 4:52:10/1 day, 1:07:22, loss=0.551474967068323, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=2.571452879452327, lr=0.46425660874544966
2023-12-16 14:17:44   INFO  epoch: 3/24, acc_iter=13251, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:46, time_cost(all): 4:53:16/1 day, 2:19:00, loss=0.551276110282274, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=2.728318898737813, lr=0.46388116872975127
2023-12-16 14:18:51   INFO  epoch: 3/24, acc_iter=13301, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:52, time_cost(all): 4:54:23/1 day, 1:03:11, loss=0.551077253496225, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=2.7591769307994296, lr=0.4635057287140529
2023-12-16 14:19:58   INFO  epoch: 3/24, acc_iter=13351, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:25, time_cost(all): 4:55:30/1 day, 2:47:43, loss=0.550878396710176, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=3.3828077595192334, lr=0.4631302886983545
2023-12-16 14:21:04   INFO  epoch: 3/24, acc_iter=13401, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:16, time_cost(all): 4:56:36/1 day, 2:03:57, loss=0.550679539924127, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=4.599268191279893, lr=0.4627548486826561
2023-12-16 14:22:11   INFO  epoch: 3/24, acc_iter=13451, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:23, time_cost(all): 4:57:43/1 day, 2:02:56, loss=0.550480683138078, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=1.9881201163983087, lr=0.4623794086669577
2023-12-16 14:23:17   INFO  epoch: 3/24, acc_iter=13501, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:08, time_cost(all): 4:58:49/1 day, 1:02:24, loss=0.550281826352029, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=1.0460730979810116, lr=0.4620039686512593
2023-12-16 14:24:24   INFO  epoch: 3/24, acc_iter=13551, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:36, time_cost(all): 4:59:56/1 day, 2:38:11, loss=0.55008296956598, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=4.898180802301871, lr=0.4616285286355609
2023-12-16 14:25:31   INFO  epoch: 3/24, acc_iter=13601, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:31, time_cost(all): 5:01:03/1 day, 3:13:49, loss=0.549884112779931, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=2.034950022057399, lr=0.4612530886198625
2023-12-16 14:26:37   INFO  epoch: 3/24, acc_iter=13651, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:20, time_cost(all): 5:02:09/1 day, 1:03:27, loss=0.549685255993882, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=2.923601126471754, lr=0.4608776486041641
2023-12-16 14:27:44   INFO  epoch: 3/24, acc_iter=13701, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:01, time_cost(all): 5:03:16/1 day, 2:21:01, loss=0.549486399207833, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=0.6678186217961143, lr=0.4605022085884657
2023-12-16 14:28:51   INFO  epoch: 3/24, acc_iter=13751, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:58, time_cost(all): 5:04:23/1 day, 1:57:34, loss=0.549287542421784, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=0.6578621470632557, lr=0.4601267685727673
2023-12-16 14:29:57   INFO  epoch: 3/24, acc_iter=13801, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:38, time_cost(all): 5:05:29/1 day, 1:54:41, loss=0.549088685635735, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=2.725296167975781, lr=0.4597513285570689
2023-12-16 14:31:04   INFO  epoch: 3/24, acc_iter=13851, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:45, time_cost(all): 5:06:36/1 day, 2:34:10, loss=0.548889828849686, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=1.5128464489945634, lr=0.4593758885413705
2023-12-16 14:32:11   INFO  epoch: 3/24, acc_iter=13901, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:44, time_cost(all): 5:07:43/1 day, 1:40:25, loss=0.548690972063637, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=2.306781193947743, lr=0.4590004485256721
2023-12-16 14:33:17   INFO  epoch: 3/24, acc_iter=13951, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:29, time_cost(all): 5:08:49/1 day, 1:00:03, loss=0.548492115277588, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=4.0628968386362905, lr=0.4586250085099737
2023-12-16 14:34:24   INFO  epoch: 3/24, acc_iter=14001, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:28, time_cost(all): 5:09:56/1 day, 1:23:22, loss=0.548293258491539, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=4.722004621034627, lr=0.4582495684942753
2023-12-16 14:35:31   INFO  epoch: 3/24, acc_iter=14051, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:21, time_cost(all): 5:11:03/1 day, 2:49:03, loss=0.54809440170549, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.307724217878142, lr=0.4578741284785769
2023-12-16 14:36:37   INFO  epoch: 4/24, acc_iter=14118, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:20:22, time_cost(all): 5:12:09/1 day, 2:45:05, loss=0.547827933612184, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=2.807543707271656, lr=0.45737103885754105
2023-12-16 14:37:44   INFO  epoch: 4/24, acc_iter=14168, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:14:46, time_cost(all): 5:13:16/1 day, 2:07:00, loss=0.547629076826135, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=0.5623371818879705, lr=0.45699559884184265
2023-12-16 14:38:51   INFO  epoch: 4/24, acc_iter=14218, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:14:18, time_cost(all): 5:14:23/1 day, 2:16:19, loss=0.547430220040086, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.83(1.03), norm=1.7250635492513768, lr=0.45662015882614426
2023-12-16 14:39:57   INFO  epoch: 4/24, acc_iter=14268, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:16:20, time_cost(all): 5:15:29/1 day, 2:33:38, loss=0.547231363254037, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=1.2259108148720212, lr=0.45624471881044587
2023-12-16 14:41:04   INFO  epoch: 4/24, acc_iter=14318, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:10:53, time_cost(all): 5:16:36/1 day, 2:01:07, loss=0.547032506467988, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=1.895458353570503, lr=0.4558692787947475
2023-12-16 14:42:11   INFO  epoch: 4/24, acc_iter=14368, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:13:48, time_cost(all): 5:17:43/1 day, 2:57:03, loss=0.546833649681939, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.940855332546943, lr=0.45549383877904903
2023-12-16 14:43:17   INFO  epoch: 4/24, acc_iter=14418, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:13:26, time_cost(all): 5:18:49/1 day, 2:17:52, loss=0.54663479289589, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=3.0668741769661576, lr=0.45511839876335064
2023-12-16 14:44:24   INFO  epoch: 4/24, acc_iter=14468, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:06:59, time_cost(all): 5:19:56/1 day, 2:35:38, loss=0.546435936109841, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=1.6679467312253182, lr=0.45474295874765225
2023-12-16 14:45:31   INFO  epoch: 4/24, acc_iter=14518, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:06:30, time_cost(all): 5:21:03/1 day, 3:05:28, loss=0.546237079323792, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=4.916345653122558, lr=0.45436751873195386
2023-12-16 14:46:37   INFO  epoch: 4/24, acc_iter=14568, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:06:47, time_cost(all): 5:22:09/1 day, 3:10:31, loss=0.546038222537743, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=1.4346569934102864, lr=0.45399207871625546
2023-12-16 14:47:44   INFO  epoch: 4/24, acc_iter=14618, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:06:30, time_cost(all): 5:23:16/1 day, 1:00:56, loss=0.545839365751694, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=2.5931910692787548, lr=0.4536166387005571
2023-12-16 14:48:51   INFO  epoch: 4/24, acc_iter=14668, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:06:20, time_cost(all): 5:24:23/1 day, 0:34:44, loss=0.545640508965645, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=1.1687450127138541, lr=0.4532411986848587
2023-12-16 14:49:57   INFO  epoch: 4/24, acc_iter=14718, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:05:58, time_cost(all): 5:25:29/1 day, 1:09:51, loss=0.545441652179596, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=1.683698349465728, lr=0.4528657586691603
2023-12-16 14:51:04   INFO  epoch: 4/24, acc_iter=14768, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:03:12, time_cost(all): 5:26:36/1 day, 2:16:49, loss=0.545242795393547, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=4.465626321585102, lr=0.4524903186534619
2023-12-16 14:52:10   INFO  epoch: 4/24, acc_iter=14818, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:03:24, time_cost(all): 5:27:42/1 day, 2:16:53, loss=0.545043938607498, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=4.491768851660742, lr=0.45211487863776345
2023-12-16 14:53:17   INFO  epoch: 4/24, acc_iter=14868, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:03:08, time_cost(all): 5:28:49/1 day, 1:09:32, loss=0.544845081821449, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=2.967393732558084, lr=0.45173943862206506
2023-12-16 14:54:24   INFO  epoch: 4/24, acc_iter=14918, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/1:00:10, time_cost(all): 5:29:56/1 day, 2:03:08, loss=0.5446462250354, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=1.404866389770365, lr=0.45136399860636667
2023-12-16 14:55:30   INFO  epoch: 4/24, acc_iter=14968, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:57:18, time_cost(all): 5:31:02/1 day, 1:19:10, loss=0.544447368249351, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=0.6744735013067295, lr=0.4509885585906683
2023-12-16 14:56:37   INFO  epoch: 4/24, acc_iter=15018, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:57:02, time_cost(all): 5:32:09/1 day, 2:33:22, loss=0.544248511463302, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=1.11642305204429, lr=0.4506131185749699
2023-12-16 14:57:44   INFO  epoch: 4/24, acc_iter=15068, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:54:09, time_cost(all): 5:33:16/1 day, 0:57:52, loss=0.544049654677253, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=3.7182617449129984, lr=0.4502376785592715
2023-12-16 14:58:50   INFO  epoch: 4/24, acc_iter=15118, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:53:54, time_cost(all): 5:34:22/1 day, 1:19:38, loss=0.543850797891204, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=0.8982682526583374, lr=0.4498622385435731
2023-12-16 14:59:57   INFO  epoch: 4/24, acc_iter=15168, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:55:38, time_cost(all): 5:35:29/1 day, 1:51:15, loss=0.543651941105155, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=2.591942417726824, lr=0.44948679852787465
2023-12-16 15:01:04   INFO  epoch: 4/24, acc_iter=15218, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:53:34, time_cost(all): 5:36:36/1 day, 1:59:18, loss=0.543453084319106, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.86(1.03), norm=4.123789072259554, lr=0.44911135851217626
2023-12-16 15:02:10   INFO  epoch: 4/24, acc_iter=15268, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:49:06, time_cost(all): 5:37:42/1 day, 0:22:23, loss=0.543254227533057, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=3.4813658730080608, lr=0.44873591849647787
2023-12-16 15:03:17   INFO  epoch: 4/24, acc_iter=15318, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:52:15, time_cost(all): 5:38:49/1 day, 2:42:21, loss=0.543055370747008, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=3.767401788919086, lr=0.4483604784807795
2023-12-16 15:04:24   INFO  epoch: 4/24, acc_iter=15368, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:48:22, time_cost(all): 5:39:56/1 day, 1:44:26, loss=0.542856513960959, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=3.815308058159275, lr=0.4479850384650811
2023-12-16 15:05:30   INFO  epoch: 4/24, acc_iter=15418, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:08, time_cost(all): 5:41:02/1 day, 2:11:05, loss=0.54265765717491, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=0.6986103444691458, lr=0.4476095984493827
2023-12-16 15:06:37   INFO  epoch: 4/24, acc_iter=15468, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:48:50, time_cost(all): 5:42:09/1 day, 1:59:03, loss=0.542458800388861, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=3.4195142972293406, lr=0.4472341584336843
2023-12-16 15:07:44   INFO  epoch: 4/24, acc_iter=15518, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:47:51, time_cost(all): 5:43:16/1 day, 1:34:04, loss=0.542259943602812, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=3.349139760180416, lr=0.4468587184179859
2023-12-16 15:08:50   INFO  epoch: 4/24, acc_iter=15568, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:43:38, time_cost(all): 5:44:22/1 day, 2:18:06, loss=0.542061086816763, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=2.361745960889502, lr=0.4464832784022875
2023-12-16 15:09:57   INFO  epoch: 4/24, acc_iter=15618, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:43:49, time_cost(all): 5:45:29/1 day, 1:34:00, loss=0.541862230030714, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=2.538331995424477, lr=0.4461078383865891
2023-12-16 15:11:04   INFO  epoch: 4/24, acc_iter=15668, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:40:43, time_cost(all): 5:46:36/1 day, 1:31:16, loss=0.541663373244665, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=3.3129921892489533, lr=0.4457323983708907
2023-12-16 15:12:10   INFO  epoch: 4/24, acc_iter=15718, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:48, time_cost(all): 5:47:42/1 day, 1:32:11, loss=0.541464516458616, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=2.553083956743222, lr=0.4453569583551923
2023-12-16 15:13:17   INFO  epoch: 4/24, acc_iter=15768, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:33, time_cost(all): 5:48:49/1 day, 1:45:20, loss=0.541265659672567, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=4.852758005763979, lr=0.4449815183394939
2023-12-16 15:14:24   INFO  epoch: 4/24, acc_iter=15818, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:37:35, time_cost(all): 5:49:56/1 day, 1:53:40, loss=0.541066802886518, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=2.763526178580235, lr=0.4446060783237955
2023-12-16 15:15:30   INFO  epoch: 4/24, acc_iter=15868, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:38:24, time_cost(all): 5:51:02/1 day, 0:49:17, loss=0.540867946100469, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=4.262534127504978, lr=0.4442306383080971
2023-12-16 15:16:37   INFO  epoch: 4/24, acc_iter=15918, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:35:31, time_cost(all): 5:52:09/1 day, 1:54:00, loss=0.54066908931442, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=3.3678570002017825, lr=0.4438551982923987
2023-12-16 15:17:44   INFO  epoch: 4/24, acc_iter=15968, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:35:03, time_cost(all): 5:53:16/1 day, 1:55:10, loss=0.540470232528371, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=3.594361943714495, lr=0.4434797582767003
2023-12-16 15:18:50   INFO  epoch: 4/24, acc_iter=16018, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:34:46, time_cost(all): 5:54:22/1 day, 0:13:33, loss=0.540271375742322, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=4.121926535722693, lr=0.4431043182610019
2023-12-16 15:19:57   INFO  epoch: 4/24, acc_iter=16068, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:35:17, time_cost(all): 5:55:29/1 day, 0:09:31, loss=0.540072518956273, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=4.804184445032215, lr=0.4427288782453035
2023-12-16 15:21:04   INFO  epoch: 4/24, acc_iter=16118, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:31:05, time_cost(all): 5:56:36/1 day, 0:24:37, loss=0.539873662170224, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=2.459596790228084, lr=0.4423534382296051
2023-12-16 15:22:10   INFO  epoch: 4/24, acc_iter=16168, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:30:25, time_cost(all): 5:57:42/1 day, 0:11:35, loss=0.539674805384175, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=1.5332270683446487, lr=0.4419779982139067
2023-12-16 15:23:17   INFO  epoch: 4/24, acc_iter=16218, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:31:42, time_cost(all): 5:58:49/1 day, 1:13:45, loss=0.539475948598126, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=4.695771986536897, lr=0.4416025581982083
2023-12-16 15:24:23   INFO  epoch: 4/24, acc_iter=16268, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:41, time_cost(all): 5:59:55/1 day, 2:01:24, loss=0.539277091812077, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=2.182232206088907, lr=0.4412271181825099
2023-12-16 15:25:30   INFO  epoch: 4/24, acc_iter=16318, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:29:02, time_cost(all): 6:01:02/1 day, 0:18:41, loss=0.539078235026028, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=1.0746534850782836, lr=0.44085167816681153
2023-12-16 15:26:37   INFO  epoch: 4/24, acc_iter=16368, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:09, time_cost(all): 6:02:09/1 day, 0:00:00, loss=0.538879378239979, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=2.1357715840267213, lr=0.44047623815111314
2023-12-16 15:27:43   INFO  epoch: 4/24, acc_iter=16418, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:46, time_cost(all): 6:03:15/1 day, 0:25:19, loss=0.53868052145393, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=2.6475274313143693, lr=0.4401007981354147
2023-12-16 15:28:50   INFO  epoch: 4/24, acc_iter=16468, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:09, time_cost(all): 6:04:22/1 day, 0:58:19, loss=0.538481664667881, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=4.927688449602324, lr=0.43972535811971636
2023-12-16 15:29:57   INFO  epoch: 4/24, acc_iter=16518, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:09, time_cost(all): 6:05:29/1 day, 1:23:15, loss=0.538282807881832, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=2.007228526243057, lr=0.4393499181040179
2023-12-16 15:31:03   INFO  epoch: 4/24, acc_iter=16568, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:23:11, time_cost(all): 6:06:35/1 day, 1:54:56, loss=0.538083951095783, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=2.4068584038250194, lr=0.4389744780883195
2023-12-16 15:32:10   INFO  epoch: 4/24, acc_iter=16618, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:25, time_cost(all): 6:07:42/1 day, 0:33:28, loss=0.537885094309734, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=2.582784051636806, lr=0.43859903807262113
2023-12-16 15:33:17   INFO  epoch: 4/24, acc_iter=16668, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:17, time_cost(all): 6:08:49/1 day, 1:55:23, loss=0.537686237523685, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=2.7079056646376576, lr=0.43822359805692274
2023-12-16 15:34:23   INFO  epoch: 4/24, acc_iter=16718, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:20:10, time_cost(all): 6:09:55/1 day, 0:48:31, loss=0.537487380737636, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=4.571126599745596, lr=0.43784815804122434
2023-12-16 15:35:30   INFO  epoch: 4/24, acc_iter=16768, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:57, time_cost(all): 6:11:02/23:57:22, loss=0.537288523951587, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=1.2360558232334644, lr=0.43747271802552595
2023-12-16 15:36:37   INFO  epoch: 4/24, acc_iter=16818, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:30, time_cost(all): 6:12:09/1 day, 0:23:54, loss=0.537089667165538, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=1.5399895801477586, lr=0.43709727800982756
2023-12-16 15:37:43   INFO  epoch: 4/24, acc_iter=16868, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:35, time_cost(all): 6:13:15/1 day, 0:17:36, loss=0.536890810379489, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=3.0335039800434664, lr=0.4367218379941291
2023-12-16 15:38:50   INFO  epoch: 4/24, acc_iter=16918, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:21, time_cost(all): 6:14:22/1 day, 1:03:25, loss=0.53669195359344, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=1.3654095335706489, lr=0.4363463979784307
2023-12-16 15:39:57   INFO  epoch: 4/24, acc_iter=16968, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:59, time_cost(all): 6:15:29/1 day, 1:00:47, loss=0.536493096807391, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=3.295388310069376, lr=0.43597095796273233
2023-12-16 15:41:03   INFO  epoch: 4/24, acc_iter=17018, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:18, time_cost(all): 6:16:35/1 day, 1:26:13, loss=0.536294240021342, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=4.2148796138028, lr=0.43559551794703394
2023-12-16 15:42:10   INFO  epoch: 4/24, acc_iter=17068, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:00, time_cost(all): 6:17:42/1 day, 0:50:46, loss=0.536095383235293, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=4.522919865690679, lr=0.43522007793133555
2023-12-16 15:43:17   INFO  epoch: 4/24, acc_iter=17118, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:34, time_cost(all): 6:18:49/1 day, 1:09:56, loss=0.535896526449244, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=3.7332142396245325, lr=0.43484463791563716
2023-12-16 15:44:23   INFO  epoch: 4/24, acc_iter=17168, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:14, time_cost(all): 6:19:55/23:42:33, loss=0.535697669663195, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=4.765349586407572, lr=0.43446919789993876
2023-12-16 15:45:30   INFO  epoch: 4/24, acc_iter=17218, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:02, time_cost(all): 6:21:02/1 day, 1:18:28, loss=0.535498812877146, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=4.450801570003815, lr=0.4340937578842403
2023-12-16 15:46:37   INFO  epoch: 4/24, acc_iter=17268, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:50, time_cost(all): 6:22:09/1 day, 1:48:13, loss=0.535299956091097, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=3.5792291297785384, lr=0.433718317868542
2023-12-16 15:47:43   INFO  epoch: 4/24, acc_iter=17318, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:59, time_cost(all): 6:23:15/1 day, 0:38:51, loss=0.535101099305048, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=2.576351838101296, lr=0.43334287785284353
2023-12-16 15:48:50   INFO  epoch: 4/24, acc_iter=17368, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:52, time_cost(all): 6:24:22/23:58:22, loss=0.534902242518999, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=4.8947901955269115, lr=0.43296743783714514
2023-12-16 15:49:57   INFO  epoch: 4/24, acc_iter=17418, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:41, time_cost(all): 6:25:29/1 day, 1:46:51, loss=0.53470338573295, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=0.5777856229683125, lr=0.43259199782144675
2023-12-16 15:51:03   INFO  epoch: 4/24, acc_iter=17468, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:30, time_cost(all): 6:26:35/1 day, 0:20:28, loss=0.534504528946901, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=4.474488766471601, lr=0.43221655780574836
2023-12-16 15:52:10   INFO  epoch: 4/24, acc_iter=17518, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:27, time_cost(all): 6:27:42/1 day, 1:49:48, loss=0.534305672160852, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.6780978890505933, lr=0.43184111779004997
2023-12-16 15:53:16   INFO  epoch: 4/24, acc_iter=17568, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:21, time_cost(all): 6:28:48/1 day, 1:26:47, loss=0.534106815374803, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=4.124112857899615, lr=0.4314656777743516
2023-12-16 15:54:23   INFO  epoch: 5/24, acc_iter=17635, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:16:47, time_cost(all): 6:29:55/1 day, 0:16:11, loss=0.533840347281497, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=0.8720293894575186, lr=0.4309625881533157
2023-12-16 15:55:30   INFO  epoch: 5/24, acc_iter=17685, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:12:46, time_cost(all): 6:31:02/1 day, 0:39:32, loss=0.533641490495448, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=2.496436894964922, lr=0.43058714813761734
2023-12-16 15:56:36   INFO  epoch: 5/24, acc_iter=17735, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:17:09, time_cost(all): 6:32:08/1 day, 1:54:41, loss=0.533442633709399, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=2.284295263387481, lr=0.4302117081219189
2023-12-16 15:57:43   INFO  epoch: 5/24, acc_iter=17785, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:16:51, time_cost(all): 6:33:15/1 day, 0:24:22, loss=0.53324377692335, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=3.029531618154543, lr=0.4298362681062205
2023-12-16 15:58:50   INFO  epoch: 5/24, acc_iter=17835, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:10:28, time_cost(all): 6:34:22/23:47:11, loss=0.533044920137301, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=0.5644316012426119, lr=0.4294608280905221
2023-12-16 15:59:56   INFO  epoch: 5/24, acc_iter=17885, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:11:10, time_cost(all): 6:35:28/1 day, 0:43:08, loss=0.532846063351252, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=1.8054078429669302, lr=0.4290853880748237
2023-12-16 16:01:03   INFO  epoch: 5/24, acc_iter=17935, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:43, time_cost(all): 6:36:35/1 day, 0:41:32, loss=0.532647206565203, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=3.661863326087635, lr=0.4287099480591253
2023-12-16 16:02:10   INFO  epoch: 5/24, acc_iter=17985, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:11:35, time_cost(all): 6:37:42/1 day, 0:41:26, loss=0.532448349779154, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=3.90379741930042, lr=0.42833450804342693
2023-12-16 16:03:16   INFO  epoch: 5/24, acc_iter=18035, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:11:25, time_cost(all): 6:38:48/1 day, 0:32:51, loss=0.532249492993105, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=4.239414938383986, lr=0.42795906802772854
2023-12-16 16:04:23   INFO  epoch: 5/24, acc_iter=18085, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:07:08, time_cost(all): 6:39:55/1 day, 0:33:21, loss=0.532050636207056, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=2.580415590716868, lr=0.4275836280120301
2023-12-16 16:05:30   INFO  epoch: 5/24, acc_iter=18135, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:06:17, time_cost(all): 6:41:02/1 day, 1:39:37, loss=0.531851779421007, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=3.9438509671824495, lr=0.4272081879963317
2023-12-16 16:06:36   INFO  epoch: 5/24, acc_iter=18185, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:02:08, time_cost(all): 6:42:08/23:23:10, loss=0.531652922634958, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=4.353521385827462, lr=0.4268327479806333
2023-12-16 16:07:43   INFO  epoch: 5/24, acc_iter=18235, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:04:05, time_cost(all): 6:43:15/1 day, 1:07:27, loss=0.531454065848909, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=4.48747976843608, lr=0.4264573079649349
2023-12-16 16:08:50   INFO  epoch: 5/24, acc_iter=18285, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:03:45, time_cost(all): 6:44:22/1 day, 0:40:32, loss=0.53125520906286, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=2.0697930264744318, lr=0.4260818679492365
2023-12-16 16:09:56   INFO  epoch: 5/24, acc_iter=18335, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:58:50, time_cost(all): 6:45:28/1 day, 1:39:09, loss=0.531056352276811, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=2.481489278125653, lr=0.42570642793353813
2023-12-16 16:11:03   INFO  epoch: 5/24, acc_iter=18385, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:03:10, time_cost(all): 6:46:35/1 day, 0:22:08, loss=0.530857495490762, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=3.3124696118925847, lr=0.42533098791783974
2023-12-16 16:12:10   INFO  epoch: 5/24, acc_iter=18435, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:56:38, time_cost(all): 6:47:42/23:25:41, loss=0.530658638704713, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=4.018203219971454, lr=0.4249555479021413
2023-12-16 16:13:16   INFO  epoch: 5/24, acc_iter=18485, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:59:08, time_cost(all): 6:48:48/1 day, 1:06:13, loss=0.530459781918664, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=0.8079098685847831, lr=0.42458010788644296
2023-12-16 16:14:23   INFO  epoch: 5/24, acc_iter=18535, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:57:13, time_cost(all): 6:49:55/1 day, 0:46:14, loss=0.530260925132615, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=2.424915922506569, lr=0.4242046678707445
2023-12-16 16:15:30   INFO  epoch: 5/24, acc_iter=18585, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:57:14, time_cost(all): 6:51:02/1 day, 1:07:29, loss=0.530062068346566, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=2.4629771544557952, lr=0.4238292278550461
2023-12-16 16:16:36   INFO  epoch: 5/24, acc_iter=18635, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:54:22, time_cost(all): 6:52:08/1 day, 0:07:55, loss=0.529863211560517, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=1.699141148259081, lr=0.42345378783934773
2023-12-16 16:17:43   INFO  epoch: 5/24, acc_iter=18685, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:54:20, time_cost(all): 6:53:15/1 day, 1:23:06, loss=0.529664354774468, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=4.922831917322477, lr=0.42307834782364934
2023-12-16 16:18:50   INFO  epoch: 5/24, acc_iter=18735, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:54:14, time_cost(all): 6:54:22/23:33:50, loss=0.529465497988419, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=4.307869404804937, lr=0.42270290780795094
2023-12-16 16:19:56   INFO  epoch: 5/24, acc_iter=18785, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:52:20, time_cost(all): 6:55:28/23:40:34, loss=0.52926664120237, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=3.43982226201329, lr=0.42232746779225255
2023-12-16 16:21:03   INFO  epoch: 5/24, acc_iter=18835, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:51:01, time_cost(all): 6:56:35/1 day, 0:13:18, loss=0.529067784416321, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=2.5740378262720798, lr=0.42195202777655416
2023-12-16 16:22:09   INFO  epoch: 5/24, acc_iter=18885, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:47:03, time_cost(all): 6:57:41/1 day, 1:23:23, loss=0.528868927630272, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.99(1.03), norm=3.7408082410816306, lr=0.4215765877608557
2023-12-16 16:23:16   INFO  epoch: 5/24, acc_iter=18935, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:49:11, time_cost(all): 6:58:48/1 day, 0:41:34, loss=0.528670070844223, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=2.223585491309241, lr=0.4212011477451573
2023-12-16 16:24:23   INFO  epoch: 5/24, acc_iter=18985, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:47:20, time_cost(all): 6:59:55/1 day, 0:37:14, loss=0.528471214058174, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=1.074877715299155, lr=0.42082570772945893
2023-12-16 16:25:29   INFO  epoch: 5/24, acc_iter=19035, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:46:37, time_cost(all): 7:01:01/1 day, 0:42:24, loss=0.528272357272125, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=1.0255931981494018, lr=0.42045026771376054
2023-12-16 16:26:36   INFO  epoch: 5/24, acc_iter=19085, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:45:59, time_cost(all): 7:02:08/1 day, 1:16:36, loss=0.528073500486076, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=1.1578169395004851, lr=0.42007482769806215
2023-12-16 16:27:43   INFO  epoch: 5/24, acc_iter=19135, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:41:42, time_cost(all): 7:03:15/1 day, 1:12:26, loss=0.527874643700027, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=4.525821071671146, lr=0.41969938768236376
2023-12-16 16:28:49   INFO  epoch: 5/24, acc_iter=19185, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:42:47, time_cost(all): 7:04:21/1 day, 0:14:47, loss=0.527675786913978, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=1.3856624219407196, lr=0.41932394766666536
2023-12-16 16:29:56   INFO  epoch: 5/24, acc_iter=19235, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:37, time_cost(all): 7:05:28/1 day, 0:53:16, loss=0.527476930127929, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=4.705152284430077, lr=0.4189485076509669
2023-12-16 16:31:03   INFO  epoch: 5/24, acc_iter=19285, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:41:46, time_cost(all): 7:06:35/23:58:30, loss=0.52727807334188, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=0.791999432729337, lr=0.4185730676352686
2023-12-16 16:32:09   INFO  epoch: 5/24, acc_iter=19335, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:38:08, time_cost(all): 7:07:41/23:58:41, loss=0.527079216555831, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=2.544759305751726, lr=0.41819762761957013
2023-12-16 16:33:16   INFO  epoch: 5/24, acc_iter=19385, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:39:41, time_cost(all): 7:08:48/23:18:24, loss=0.526880359769782, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=4.559666303291317, lr=0.41782218760387174
2023-12-16 16:34:23   INFO  epoch: 5/24, acc_iter=19435, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:38:41, time_cost(all): 7:09:55/1 day, 1:10:07, loss=0.526681502983733, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=1.6651569188024737, lr=0.41744674758817335
2023-12-16 16:35:29   INFO  epoch: 5/24, acc_iter=19485, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:34:41, time_cost(all): 7:11:01/1 day, 0:38:14, loss=0.526482646197684, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=1.5857202176130964, lr=0.41707130757247496
2023-12-16 16:36:36   INFO  epoch: 5/24, acc_iter=19535, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:36:22, time_cost(all): 7:12:08/1 day, 0:14:06, loss=0.526283789411635, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=2.1542143159788862, lr=0.41669586755677657
2023-12-16 16:37:43   INFO  epoch: 5/24, acc_iter=19585, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:32:11, time_cost(all): 7:13:15/1 day, 0:44:32, loss=0.526084932625586, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.2(1.03), norm=0.5899487626252946, lr=0.4163204275410782
2023-12-16 16:38:49   INFO  epoch: 5/24, acc_iter=19635, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:42, time_cost(all): 7:14:21/1 day, 0:38:38, loss=0.525886075839537, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=3.968603728785289, lr=0.4159449875253798
2023-12-16 16:39:56   INFO  epoch: 5/24, acc_iter=19685, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:32:25, time_cost(all): 7:15:28/23:07:06, loss=0.525687219053488, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=4.782563226721895, lr=0.41556954750968134
2023-12-16 16:41:03   INFO  epoch: 5/24, acc_iter=19735, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:28:58, time_cost(all): 7:16:35/23:01:53, loss=0.525488362267439, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=0.8282624899050544, lr=0.41519410749398294
2023-12-16 16:42:09   INFO  epoch: 5/24, acc_iter=19785, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:49, time_cost(all): 7:17:41/23:52:30, loss=0.52528950548139, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=0.5398401558227969, lr=0.41481866747828455
2023-12-16 16:43:16   INFO  epoch: 5/24, acc_iter=19835, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:27:13, time_cost(all): 7:18:48/23:55:37, loss=0.525090648695341, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=1.5098079551675685, lr=0.41444322746258616
2023-12-16 16:44:23   INFO  epoch: 5/24, acc_iter=19885, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:48, time_cost(all): 7:19:55/23:36:26, loss=0.524891791909292, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=1.3467061663916222, lr=0.41406778744688777
2023-12-16 16:45:29   INFO  epoch: 5/24, acc_iter=19935, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:14, time_cost(all): 7:21:01/1 day, 0:38:53, loss=0.524692935123243, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=2.677167981417247, lr=0.4136923474311894
2023-12-16 16:46:36   INFO  epoch: 5/24, acc_iter=19985, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:40, time_cost(all): 7:22:08/22:53:01, loss=0.524494078337194, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=4.260845874521915, lr=0.413316907415491
2023-12-16 16:47:43   INFO  epoch: 5/24, acc_iter=20035, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:22:41, time_cost(all): 7:23:15/23:26:20, loss=0.524295221551145, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=3.396449431544422, lr=0.41294146739979254
2023-12-16 16:48:49   INFO  epoch: 5/24, acc_iter=20085, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:21:51, time_cost(all): 7:24:21/1 day, 0:17:54, loss=0.524096364765096, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=1.984739874679507, lr=0.4125660273840942
2023-12-16 16:49:56   INFO  epoch: 5/24, acc_iter=20135, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:35, time_cost(all): 7:25:28/1 day, 0:24:57, loss=0.523897507979047, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=0.5434932691150653, lr=0.41219058736839576
2023-12-16 16:51:02   INFO  epoch: 5/24, acc_iter=20185, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:48, time_cost(all): 7:26:34/23:38:45, loss=0.523698651192998, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=4.369701688313633, lr=0.41181514735269736
2023-12-16 16:52:09   INFO  epoch: 5/24, acc_iter=20235, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:20, time_cost(all): 7:27:41/1 day, 0:31:41, loss=0.523499794406949, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=1.4381518620667006, lr=0.41143970733699897
2023-12-16 16:53:16   INFO  epoch: 5/24, acc_iter=20285, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:03, time_cost(all): 7:28:48/23:23:31, loss=0.5233009376209, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=3.7829633146689448, lr=0.4110642673213006
2023-12-16 16:54:22   INFO  epoch: 5/24, acc_iter=20335, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:44, time_cost(all): 7:29:54/23:36:28, loss=0.523102080834851, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=1.4265653808037935, lr=0.4106888273056022
2023-12-16 16:55:29   INFO  epoch: 5/24, acc_iter=20385, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:25, time_cost(all): 7:31:01/22:51:36, loss=0.522903224048802, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=2.3315817546822557, lr=0.4103133872899038
2023-12-16 16:56:36   INFO  epoch: 5/24, acc_iter=20435, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:14, time_cost(all): 7:32:08/1 day, 0:50:03, loss=0.522704367262753, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=1.8306977181554753, lr=0.4099379472742054
2023-12-16 16:57:42   INFO  epoch: 5/24, acc_iter=20485, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:24, time_cost(all): 7:33:14/23:52:45, loss=0.522505510476704, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=1.1424749935325313, lr=0.40956250725850696
2023-12-16 16:58:49   INFO  epoch: 5/24, acc_iter=20535, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:36, time_cost(all): 7:34:21/1 day, 0:16:45, loss=0.522306653690655, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=4.209794804981582, lr=0.40918706724280857
2023-12-16 16:59:56   INFO  epoch: 5/24, acc_iter=20585, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:10:58, time_cost(all): 7:35:28/23:14:17, loss=0.522107796904606, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=2.5559010772390893, lr=0.4088116272271102
2023-12-16 17:01:02   INFO  epoch: 5/24, acc_iter=20635, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:42, time_cost(all): 7:36:34/23:27:59, loss=0.521908940118557, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.09(1.03), norm=2.5514795206678085, lr=0.4084361872114118
2023-12-16 17:02:09   INFO  epoch: 5/24, acc_iter=20685, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:29, time_cost(all): 7:37:41/23:15:00, loss=0.521710083332508, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=0.6963868432962037, lr=0.4080607471957134
2023-12-16 17:03:16   INFO  epoch: 5/24, acc_iter=20735, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:10, time_cost(all): 7:38:48/1 day, 0:21:29, loss=0.521511226546459, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=0.8204322609321288, lr=0.407685307180015
2023-12-16 17:04:22   INFO  epoch: 5/24, acc_iter=20785, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:53, time_cost(all): 7:39:54/23:58:12, loss=0.52131236976041, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.15(1.03), norm=1.2367231431210173, lr=0.4073098671643166
2023-12-16 17:05:29   INFO  epoch: 5/24, acc_iter=20835, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:08, time_cost(all): 7:41:01/23:55:21, loss=0.521113512974361, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=3.420368338596454, lr=0.4069344271486182
2023-12-16 17:06:36   INFO  epoch: 5/24, acc_iter=20885, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:05:00, time_cost(all): 7:42:08/23:03:33, loss=0.520914656188312, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=1.844881834633107, lr=0.4065589871329198
2023-12-16 17:07:42   INFO  epoch: 5/24, acc_iter=20935, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:41, time_cost(all): 7:43:14/22:25:10, loss=0.520715799402263, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=4.984134014153511, lr=0.4061835471172214
2023-12-16 17:08:49   INFO  epoch: 5/24, acc_iter=20985, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:40, time_cost(all): 7:44:21/23:30:16, loss=0.520516942616214, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=4.57604193323927, lr=0.40580810710152304
2023-12-16 17:09:56   INFO  epoch: 5/24, acc_iter=21035, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:25, time_cost(all): 7:45:28/1 day, 0:03:22, loss=0.520318085830165, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=2.8212487761198566, lr=0.4054326670858246
2023-12-16 17:11:02   INFO  epoch: 5/24, acc_iter=21085, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 7:46:34/1 day, 0:24:02, loss=0.520119229044116, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=3.8181210332316264, lr=0.4050572270701262
2023-12-16 17:12:09   INFO  epoch: 6/24, acc_iter=21152, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:13:41, time_cost(all): 7:47:41/23:53:05, loss=0.51985276095081, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=2.8918224519598796, lr=0.40455413744909036
2023-12-16 17:13:16   INFO  epoch: 6/24, acc_iter=21202, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:14:56, time_cost(all): 7:48:48/22:32:02, loss=0.519653904164761, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=3.344750068450586, lr=0.40417869743339196
2023-12-16 17:14:22   INFO  epoch: 6/24, acc_iter=21252, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:11:33, time_cost(all): 7:49:54/23:39:00, loss=0.519455047378713, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.861926986218978, lr=0.4038032574176936
2023-12-16 17:15:29   INFO  epoch: 6/24, acc_iter=21302, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:10:46, time_cost(all): 7:51:01/22:39:54, loss=0.519256190592663, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=3.5384237534652847, lr=0.4034278174019952
2023-12-16 17:16:36   INFO  epoch: 6/24, acc_iter=21352, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:14:58, time_cost(all): 7:52:08/23:03:24, loss=0.519057333806614, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=3.246412763446952, lr=0.40305237738629673
2023-12-16 17:17:42   INFO  epoch: 6/24, acc_iter=21402, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:08:38, time_cost(all): 7:53:14/23:38:04, loss=0.518858477020565, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=1.9685529428768715, lr=0.4026769373705984
2023-12-16 17:18:49   INFO  epoch: 6/24, acc_iter=21452, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:18, time_cost(all): 7:54:21/23:26:01, loss=0.518659620234517, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=3.1141366861323494, lr=0.40230149735489995
2023-12-16 17:19:56   INFO  epoch: 6/24, acc_iter=21502, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:07:43, time_cost(all): 7:55:28/1 day, 0:05:35, loss=0.518460763448467, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=2.2852301448730574, lr=0.40192605733920156
2023-12-16 17:21:02   INFO  epoch: 6/24, acc_iter=21552, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:10:17, time_cost(all): 7:56:34/22:52:22, loss=0.518261906662418, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=4.801537663405057, lr=0.40155061732350317
2023-12-16 17:22:09   INFO  epoch: 6/24, acc_iter=21602, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:07:06, time_cost(all): 7:57:41/23:07:25, loss=0.518063049876369, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=1.2813825392471097, lr=0.4011751773078048
2023-12-16 17:23:15   INFO  epoch: 6/24, acc_iter=21652, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:08:07, time_cost(all): 7:58:47/1 day, 0:04:50, loss=0.517864193090321, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=0.7273971266709972, lr=0.4007997372921064
2023-12-16 17:24:22   INFO  epoch: 6/24, acc_iter=21702, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:07:05, time_cost(all): 7:59:54/23:04:33, loss=0.517665336304271, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=4.821531323791634, lr=0.400424297276408
2023-12-16 17:25:29   INFO  epoch: 6/24, acc_iter=21752, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:02:35, time_cost(all): 8:01:01/23:13:13, loss=0.517466479518222, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=1.808973376403662, lr=0.4000488572607096
2023-12-16 17:26:35   INFO  epoch: 6/24, acc_iter=21802, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:02:27, time_cost(all): 8:02:07/23:45:46, loss=0.517267622732173, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.7655653493595418, lr=0.39967341724501115
2023-12-16 17:27:42   INFO  epoch: 6/24, acc_iter=21852, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:03:25, time_cost(all): 8:03:14/23:40:58, loss=0.517068765946124, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=3.577535082611998, lr=0.39929797722931276
2023-12-16 17:28:49   INFO  epoch: 6/24, acc_iter=21902, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:01:57, time_cost(all): 8:04:21/22:22:15, loss=0.516869909160075, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=4.175447439910673, lr=0.39892253721361437
2023-12-16 17:29:55   INFO  epoch: 6/24, acc_iter=21952, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:59:38, time_cost(all): 8:05:27/23:53:20, loss=0.516671052374026, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=3.7539749601347427, lr=0.398547097197916
2023-12-16 17:31:02   INFO  epoch: 6/24, acc_iter=22002, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:56:47, time_cost(all): 8:06:34/22:45:48, loss=0.516472195587977, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=4.919099005065365, lr=0.3981716571822176
2023-12-16 17:32:09   INFO  epoch: 6/24, acc_iter=22052, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:55:40, time_cost(all): 8:07:41/22:49:11, loss=0.516273338801928, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=3.12205098518925, lr=0.3977962171665192
2023-12-16 17:33:15   INFO  epoch: 6/24, acc_iter=22102, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:55:28, time_cost(all): 8:08:47/22:11:40, loss=0.516074482015879, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=2.132384363052883, lr=0.3974207771508208
2023-12-16 17:34:22   INFO  epoch: 6/24, acc_iter=22152, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:52:46, time_cost(all): 8:09:54/23:49:57, loss=0.51587562522983, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=4.1250354812219845, lr=0.39704533713512236
2023-12-16 17:35:29   INFO  epoch: 6/24, acc_iter=22202, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:51:20, time_cost(all): 8:11:01/23:05:41, loss=0.515676768443781, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.4190245066900626, lr=0.396669897119424
2023-12-16 17:36:35   INFO  epoch: 6/24, acc_iter=22252, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:52:20, time_cost(all): 8:12:07/23:28:20, loss=0.515477911657732, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=3.9561388704703773, lr=0.3962944571037256
2023-12-16 17:37:42   INFO  epoch: 6/24, acc_iter=22302, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:10, time_cost(all): 8:13:14/22:55:11, loss=0.515279054871683, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=2.0824247803421123, lr=0.3959190170880272
2023-12-16 17:38:49   INFO  epoch: 6/24, acc_iter=22352, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:50:26, time_cost(all): 8:14:21/22:56:07, loss=0.515080198085634, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=1.2880714082753053, lr=0.3955435770723288
2023-12-16 17:39:55   INFO  epoch: 6/24, acc_iter=22402, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:48:45, time_cost(all): 8:15:27/21:56:24, loss=0.514881341299585, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=1.7716971970917959, lr=0.3951681370566304
2023-12-16 17:41:02   INFO  epoch: 6/24, acc_iter=22452, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:42, time_cost(all): 8:16:34/1 day, 0:00:39, loss=0.514682484513536, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.424747137587457, lr=0.394792697040932
2023-12-16 17:42:09   INFO  epoch: 6/24, acc_iter=22502, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:44:47, time_cost(all): 8:17:41/21:58:34, loss=0.514483627727487, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=4.8719441792347205, lr=0.3944172570252336
2023-12-16 17:43:15   INFO  epoch: 6/24, acc_iter=22552, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:45:02, time_cost(all): 8:18:47/1 day, 0:03:27, loss=0.514284770941438, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=4.550853235605588, lr=0.3940418170095352
2023-12-16 17:44:22   INFO  epoch: 6/24, acc_iter=22602, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:44:09, time_cost(all): 8:19:54/22:34:28, loss=0.514085914155389, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=3.9101550213332192, lr=0.3936663769938368
2023-12-16 17:45:29   INFO  epoch: 6/24, acc_iter=22652, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:45:41, time_cost(all): 8:21:01/22:56:46, loss=0.51388705736934, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=4.343624568940709, lr=0.3932909369781384
2023-12-16 17:46:35   INFO  epoch: 6/24, acc_iter=22702, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:41:29, time_cost(all): 8:22:07/23:29:33, loss=0.513688200583291, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=3.664497585028731, lr=0.39291549696244
2023-12-16 17:47:42   INFO  epoch: 6/24, acc_iter=22752, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:41:43, time_cost(all): 8:23:14/23:54:26, loss=0.513489343797242, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.18(1.03), norm=4.520740934223087, lr=0.3925400569467416
2023-12-16 17:48:49   INFO  epoch: 6/24, acc_iter=22802, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:42:02, time_cost(all): 8:24:21/22:08:34, loss=0.513290487011193, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=2.19653881772361, lr=0.3921646169310432
2023-12-16 17:49:55   INFO  epoch: 6/24, acc_iter=22852, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:39:37, time_cost(all): 8:25:27/22:39:28, loss=0.513091630225144, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.826738257737625, lr=0.3917891769153448
2023-12-16 17:51:02   INFO  epoch: 6/24, acc_iter=22902, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:39:46, time_cost(all): 8:26:34/22:47:20, loss=0.512892773439095, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.83(1.03), norm=3.5083986799251754, lr=0.3914137368996464
2023-12-16 17:52:08   INFO  epoch: 6/24, acc_iter=22952, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:35:58, time_cost(all): 8:27:40/22:25:54, loss=0.512693916653046, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=4.643328682063376, lr=0.391038296883948
2023-12-16 17:53:15   INFO  epoch: 6/24, acc_iter=23002, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:37:13, time_cost(all): 8:28:47/21:39:58, loss=0.512495059866997, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=1.246437210405694, lr=0.39066285686824964
2023-12-16 17:54:22   INFO  epoch: 6/24, acc_iter=23052, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:36:15, time_cost(all): 8:29:54/22:55:18, loss=0.512296203080948, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=2.580091067010706, lr=0.3902874168525512
2023-12-16 17:55:28   INFO  epoch: 6/24, acc_iter=23102, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:33:26, time_cost(all): 8:31:00/23:05:38, loss=0.512097346294899, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=1.6327152658393027, lr=0.3899119768368528
2023-12-16 17:56:35   INFO  epoch: 6/24, acc_iter=23152, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:31:55, time_cost(all): 8:32:07/22:53:58, loss=0.51189848950885, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=1.187175288268917, lr=0.3895365368211544
2023-12-16 17:57:42   INFO  epoch: 6/24, acc_iter=23202, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:32:41, time_cost(all): 8:33:14/21:58:27, loss=0.511699632722801, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=3.547992319353157, lr=0.389161096805456
2023-12-16 17:58:48   INFO  epoch: 6/24, acc_iter=23252, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:52, time_cost(all): 8:34:20/23:20:26, loss=0.511500775936752, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=1.795289927129168, lr=0.3887856567897576
2023-12-16 17:59:55   INFO  epoch: 6/24, acc_iter=23302, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:52, time_cost(all): 8:35:27/23:21:59, loss=0.511301919150703, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=2.6417282444700776, lr=0.38841021677405924
2023-12-16 18:01:02   INFO  epoch: 6/24, acc_iter=23352, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:59, time_cost(all): 8:36:34/21:46:34, loss=0.511103062364654, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=1.4473878305314642, lr=0.38803477675836084
2023-12-16 18:02:08   INFO  epoch: 6/24, acc_iter=23402, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:27:53, time_cost(all): 8:37:40/22:40:12, loss=0.510904205578605, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=3.2128146995686864, lr=0.3876593367426624
2023-12-16 18:03:15   INFO  epoch: 6/24, acc_iter=23452, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:11, time_cost(all): 8:38:47/21:57:39, loss=0.510705348792556, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=2.617241247050672, lr=0.38728389672696406
2023-12-16 18:04:22   INFO  epoch: 6/24, acc_iter=23502, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:09, time_cost(all): 8:39:54/21:38:45, loss=0.510506492006507, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=4.198370222001788, lr=0.3869084567112656
2023-12-16 18:05:28   INFO  epoch: 6/24, acc_iter=23552, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:15, time_cost(all): 8:41:00/23:36:13, loss=0.510307635220458, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.85(1.03), norm=3.2484774122302396, lr=0.3865330166955672
2023-12-16 18:06:35   INFO  epoch: 6/24, acc_iter=23602, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:23:40, time_cost(all): 8:42:07/23:09:59, loss=0.510108778434409, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=1.140986327052909, lr=0.38615757667986883
2023-12-16 18:07:42   INFO  epoch: 6/24, acc_iter=23652, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:39, time_cost(all): 8:43:14/21:27:40, loss=0.50990992164836, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=1.0993738003735385, lr=0.38578213666417044
2023-12-16 18:08:48   INFO  epoch: 6/24, acc_iter=23702, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:24, time_cost(all): 8:44:20/22:02:19, loss=0.509711064862311, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=3.6561020038923546, lr=0.38540669664847205
2023-12-16 18:09:55   INFO  epoch: 6/24, acc_iter=23752, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:20:11, time_cost(all): 8:45:27/23:32:09, loss=0.509512208076262, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=1.8575499284183221, lr=0.38503125663277366
2023-12-16 18:11:02   INFO  epoch: 6/24, acc_iter=23802, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:33, time_cost(all): 8:46:34/22:28:30, loss=0.509313351290213, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=4.571989671546724, lr=0.38465581661707526
2023-12-16 18:12:08   INFO  epoch: 6/24, acc_iter=23852, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:17, time_cost(all): 8:47:40/23:22:58, loss=0.509114494504164, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=2.45662109754366, lr=0.3842803766013768
2023-12-16 18:13:15   INFO  epoch: 6/24, acc_iter=23902, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:40, time_cost(all): 8:48:47/21:44:03, loss=0.508915637718115, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=0.6887912487255208, lr=0.3839049365856784
2023-12-16 18:14:22   INFO  epoch: 6/24, acc_iter=23952, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:59, time_cost(all): 8:49:54/22:11:20, loss=0.508716780932066, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=4.442535689021653, lr=0.38352949656998003
2023-12-16 18:15:28   INFO  epoch: 6/24, acc_iter=24002, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:30, time_cost(all): 8:51:00/23:20:43, loss=0.508517924146017, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=2.1822070291028606, lr=0.38315405655428164
2023-12-16 18:16:35   INFO  epoch: 6/24, acc_iter=24052, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:14, time_cost(all): 8:52:07/21:54:53, loss=0.508319067359968, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=2.2694232888513373, lr=0.38277861653858325
2023-12-16 18:17:42   INFO  epoch: 6/24, acc_iter=24102, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:05, time_cost(all): 8:53:14/23:21:55, loss=0.508120210573919, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.83(1.03), norm=4.264150767198744, lr=0.38240317652288486
2023-12-16 18:18:48   INFO  epoch: 6/24, acc_iter=24152, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:09:56, time_cost(all): 8:54:20/21:19:09, loss=0.50792135378787, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=2.1057311152598794, lr=0.38202773650718647
2023-12-16 18:19:55   INFO  epoch: 6/24, acc_iter=24202, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:08:59, time_cost(all): 8:55:27/22:53:15, loss=0.507722497001821, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=1.5972255303531118, lr=0.381652296491488
2023-12-16 18:21:01   INFO  epoch: 6/24, acc_iter=24252, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:21, time_cost(all): 8:56:33/21:20:51, loss=0.507523640215772, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.31668695464281, lr=0.3812768564757897
2023-12-16 18:22:08   INFO  epoch: 6/24, acc_iter=24302, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:45, time_cost(all): 8:57:40/22:08:29, loss=0.507324783429723, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=3.9705517933818717, lr=0.38090141646009124
2023-12-16 18:23:15   INFO  epoch: 6/24, acc_iter=24352, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:04, time_cost(all): 8:58:47/22:13:07, loss=0.507125926643674, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=0.6615893072284144, lr=0.38052597644439284
2023-12-16 18:24:21   INFO  epoch: 6/24, acc_iter=24402, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:35, time_cost(all): 8:59:53/23:17:33, loss=0.506927069857625, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.1771613268177799, lr=0.38015053642869445
2023-12-16 18:25:28   INFO  epoch: 6/24, acc_iter=24452, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:38, time_cost(all): 9:01:00/22:18:32, loss=0.506728213071576, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=3.7451711348357066, lr=0.37977509641299606
2023-12-16 18:26:35   INFO  epoch: 6/24, acc_iter=24502, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:29, time_cost(all): 9:02:07/22:12:47, loss=0.506529356285527, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=1.367320044329023, lr=0.37939965639729767
2023-12-16 18:27:41   INFO  epoch: 6/24, acc_iter=24552, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:32, time_cost(all): 9:03:13/23:14:28, loss=0.506330499499478, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=0.538089870673242, lr=0.3790242163815993
2023-12-16 18:28:48   INFO  epoch: 6/24, acc_iter=24602, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 9:04:20/22:34:53, loss=0.506131642713429, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=2.584625079151334, lr=0.3786487763659009
2023-12-16 18:29:55   INFO  epoch: 7/24, acc_iter=24669, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:17:02, time_cost(all): 9:05:27/21:26:25, loss=0.505865174620124, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=2.8704229415449745, lr=0.37814568674486504
2023-12-16 18:31:01   INFO  epoch: 7/24, acc_iter=24719, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:18:26, time_cost(all): 9:06:33/21:46:36, loss=0.505666317834075, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=2.718201934772319, lr=0.3777702467291666
2023-12-16 18:32:08   INFO  epoch: 7/24, acc_iter=24769, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:15:35, time_cost(all): 9:07:40/21:29:07, loss=0.505467461048026, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=4.968698658182068, lr=0.3773948067134682
2023-12-16 18:33:15   INFO  epoch: 7/24, acc_iter=24819, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:14:56, time_cost(all): 9:08:47/21:33:44, loss=0.505268604261977, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=4.115619555195246, lr=0.3770193666977698
2023-12-16 18:34:21   INFO  epoch: 7/24, acc_iter=24869, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:09:21, time_cost(all): 9:09:53/21:55:33, loss=0.505069747475928, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=3.629363557598428, lr=0.3766439266820714
2023-12-16 18:35:28   INFO  epoch: 7/24, acc_iter=24919, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:10:33, time_cost(all): 9:11:00/22:57:46, loss=0.504870890689879, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=4.262028711868173, lr=0.376268486666373
2023-12-16 18:36:35   INFO  epoch: 7/24, acc_iter=24969, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:10:05, time_cost(all): 9:12:07/21:10:35, loss=0.50467203390383, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=2.3619073603836522, lr=0.37589304665067463
2023-12-16 18:37:41   INFO  epoch: 7/24, acc_iter=25019, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:10:37, time_cost(all): 9:13:13/22:55:06, loss=0.504473177117781, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=1.6054926854738218, lr=0.37551760663497624
2023-12-16 18:38:48   INFO  epoch: 7/24, acc_iter=25069, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:07:32, time_cost(all): 9:14:20/22:05:40, loss=0.504274320331732, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=4.654751685500932, lr=0.3751421666192778
2023-12-16 18:39:55   INFO  epoch: 7/24, acc_iter=25119, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:05:16, time_cost(all): 9:15:27/21:08:03, loss=0.504075463545683, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=2.323033658699458, lr=0.37476672660357946
2023-12-16 18:41:01   INFO  epoch: 7/24, acc_iter=25169, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:03:15, time_cost(all): 9:16:33/22:37:48, loss=0.503876606759634, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=1.7765693664374769, lr=0.374391286587881
2023-12-16 18:42:08   INFO  epoch: 7/24, acc_iter=25219, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:03:44, time_cost(all): 9:17:40/22:11:08, loss=0.503677749973585, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=0.9483820427297016, lr=0.3740158465721827
2023-12-16 18:43:15   INFO  epoch: 7/24, acc_iter=25269, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:05:37, time_cost(all): 9:18:47/22:36:22, loss=0.503478893187536, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=4.959296129536517, lr=0.37364040655648423
2023-12-16 18:44:21   INFO  epoch: 7/24, acc_iter=25319, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:00:02, time_cost(all): 9:19:53/22:07:12, loss=0.503280036401487, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=3.0909503476101086, lr=0.37326496654078584
2023-12-16 18:45:28   INFO  epoch: 7/24, acc_iter=25369, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:02:36, time_cost(all): 9:21:00/22:12:08, loss=0.503081179615438, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=4.3918316537242585, lr=0.37288952652508744
2023-12-16 18:46:35   INFO  epoch: 7/24, acc_iter=25419, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:58:40, time_cost(all): 9:22:07/21:07:51, loss=0.502882322829389, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=0.9257754618512593, lr=0.37251408650938905
2023-12-16 18:47:41   INFO  epoch: 7/24, acc_iter=25469, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:58:14, time_cost(all): 9:23:13/22:43:24, loss=0.50268346604334, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=2.8272150732398487, lr=0.37213864649369066
2023-12-16 18:48:48   INFO  epoch: 7/24, acc_iter=25519, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:57:47, time_cost(all): 9:24:20/20:48:00, loss=0.502484609257291, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=2.3551601647338334, lr=0.3717632064779922
2023-12-16 18:49:55   INFO  epoch: 7/24, acc_iter=25569, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:58:06, time_cost(all): 9:25:27/21:39:29, loss=0.502285752471242, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=1.9971722984801767, lr=0.3713877664622938
2023-12-16 18:51:01   INFO  epoch: 7/24, acc_iter=25619, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:58:42, time_cost(all): 9:26:33/21:19:29, loss=0.502086895685193, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=4.118315961973968, lr=0.37101232644659543
2023-12-16 18:52:08   INFO  epoch: 7/24, acc_iter=25669, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:52:56, time_cost(all): 9:27:40/21:36:04, loss=0.501888038899144, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=4.251351321255526, lr=0.37063688643089704
2023-12-16 18:53:14   INFO  epoch: 7/24, acc_iter=25719, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:51:53, time_cost(all): 9:28:46/21:30:31, loss=0.501689182113095, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=2.1855337149688134, lr=0.37026144641519865
2023-12-16 18:54:21   INFO  epoch: 7/24, acc_iter=25769, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:51:55, time_cost(all): 9:29:53/20:47:27, loss=0.501490325327046, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=0.8768868517486474, lr=0.3698860063995002
2023-12-16 18:55:28   INFO  epoch: 7/24, acc_iter=25819, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:53:10, time_cost(all): 9:31:00/21:32:33, loss=0.501291468540997, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=4.401267888196306, lr=0.36951056638380186
2023-12-16 18:56:34   INFO  epoch: 7/24, acc_iter=25869, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:47:52, time_cost(all): 9:32:06/22:46:37, loss=0.501092611754948, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=1.1207548143599597, lr=0.3691351263681034
2023-12-16 18:57:41   INFO  epoch: 7/24, acc_iter=25919, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:47:57, time_cost(all): 9:33:13/21:58:34, loss=0.500893754968899, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=0.5182079164740472, lr=0.3687596863524051
2023-12-16 18:58:48   INFO  epoch: 7/24, acc_iter=25969, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:48:24, time_cost(all): 9:34:20/20:45:48, loss=0.50069489818285, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=3.4489233993705515, lr=0.36838424633670663
2023-12-16 18:59:54   INFO  epoch: 7/24, acc_iter=26019, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:48:37, time_cost(all): 9:35:26/22:24:41, loss=0.500496041396801, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=3.874882205729829, lr=0.3680088063210083
2023-12-16 19:01:01   INFO  epoch: 7/24, acc_iter=26069, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:45:37, time_cost(all): 9:36:33/21:41:08, loss=0.500297184610752, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=1.8901483273228281, lr=0.36763336630530985
2023-12-16 19:02:08   INFO  epoch: 7/24, acc_iter=26119, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:42:38, time_cost(all): 9:37:40/22:32:44, loss=0.500098327824703, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=1.7540183179257245, lr=0.36725792628961146
2023-12-16 19:03:14   INFO  epoch: 7/24, acc_iter=26169, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:36, time_cost(all): 9:38:46/22:01:20, loss=0.499899471038654, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.417283289741281, lr=0.36688248627391307
2023-12-16 19:04:21   INFO  epoch: 7/24, acc_iter=26219, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:41:47, time_cost(all): 9:39:53/20:37:54, loss=0.499700614252605, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=1.8193528180816307, lr=0.3665070462582147
2023-12-16 19:05:28   INFO  epoch: 7/24, acc_iter=26269, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:39:46, time_cost(all): 9:41:00/20:53:46, loss=0.499501757466556, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=2.9951820555804503, lr=0.3661316062425163
2023-12-16 19:06:34   INFO  epoch: 7/24, acc_iter=26319, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:17, time_cost(all): 9:42:06/22:22:00, loss=0.499302900680507, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=3.616417514607333, lr=0.36575616622681784
2023-12-16 19:07:41   INFO  epoch: 7/24, acc_iter=26369, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:37:50, time_cost(all): 9:43:13/21:24:54, loss=0.499104043894458, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=3.60064563474066, lr=0.36538072621111944
2023-12-16 19:08:48   INFO  epoch: 7/24, acc_iter=26419, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:38:04, time_cost(all): 9:44:20/20:32:49, loss=0.498905187108409, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=3.7831836711908085, lr=0.36500528619542105
2023-12-16 19:09:54   INFO  epoch: 7/24, acc_iter=26469, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:36:37, time_cost(all): 9:45:26/22:12:50, loss=0.49870633032236, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=0.7843615366617158, lr=0.36462984617972266
2023-12-16 19:11:01   INFO  epoch: 7/24, acc_iter=26519, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:35:38, time_cost(all): 9:46:33/22:21:51, loss=0.498507473536311, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=1.6261733924471968, lr=0.36425440616402427
2023-12-16 19:12:08   INFO  epoch: 7/24, acc_iter=26569, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:35:24, time_cost(all): 9:47:40/21:15:23, loss=0.498308616750262, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=3.0828990217883434, lr=0.3638789661483258
2023-12-16 19:13:14   INFO  epoch: 7/24, acc_iter=26619, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:33:26, time_cost(all): 9:48:46/22:14:11, loss=0.498109759964213, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=4.679336118554033, lr=0.3635035261326275
2023-12-16 19:14:21   INFO  epoch: 7/24, acc_iter=26669, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:25, time_cost(all): 9:49:53/20:57:26, loss=0.497910903178164, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=4.743695163230018, lr=0.36312808611692904
2023-12-16 19:15:28   INFO  epoch: 7/24, acc_iter=26719, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:02, time_cost(all): 9:51:00/22:23:09, loss=0.497712046392115, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=3.4449624828552157, lr=0.3627526461012307
2023-12-16 19:16:34   INFO  epoch: 7/24, acc_iter=26769, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:57, time_cost(all): 9:52:06/20:58:38, loss=0.497513189606066, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=1.6866179349576664, lr=0.36237720608553226
2023-12-16 19:17:41   INFO  epoch: 7/24, acc_iter=26819, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:27:48, time_cost(all): 9:53:13/21:16:52, loss=0.497314332820017, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=3.8168911471823366, lr=0.3620017660698339
2023-12-16 19:18:48   INFO  epoch: 7/24, acc_iter=26869, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:20, time_cost(all): 9:54:20/21:16:05, loss=0.497115476033968, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=1.131499692295341, lr=0.36162632605413547
2023-12-16 19:19:54   INFO  epoch: 7/24, acc_iter=26919, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:03, time_cost(all): 9:55:26/20:38:37, loss=0.496916619247919, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=4.4940726822765775, lr=0.3612508860384371
2023-12-16 19:21:01   INFO  epoch: 7/24, acc_iter=26969, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:26:02, time_cost(all): 9:56:33/21:10:45, loss=0.49671776246187, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=4.810064265107428, lr=0.3608754460227387
2023-12-16 19:22:07   INFO  epoch: 7/24, acc_iter=27019, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:23:49, time_cost(all): 9:57:39/20:46:07, loss=0.496518905675821, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=1.3980844390199394, lr=0.3605000060070403
2023-12-16 19:23:14   INFO  epoch: 7/24, acc_iter=27069, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:16, time_cost(all): 9:58:46/20:17:22, loss=0.496320048889772, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=1.7548143762865154, lr=0.3601245659913419
2023-12-16 19:24:21   INFO  epoch: 7/24, acc_iter=27119, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:01, time_cost(all): 9:59:53/21:49:42, loss=0.496121192103723, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=1.0165204841729418, lr=0.35974912597564346
2023-12-16 19:25:27   INFO  epoch: 7/24, acc_iter=27169, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:53, time_cost(all): 10:00:59/21:17:19, loss=0.495922335317674, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=2.8383071321402285, lr=0.35937368595994507
2023-12-16 19:26:34   INFO  epoch: 7/24, acc_iter=27219, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:54, time_cost(all): 10:02:06/21:31:41, loss=0.495723478531625, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=3.945368481797422, lr=0.3589982459442467
2023-12-16 19:27:41   INFO  epoch: 7/24, acc_iter=27269, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:18, time_cost(all): 10:03:13/20:53:51, loss=0.495524621745576, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=3.89568536752046, lr=0.35862280592854834
2023-12-16 19:28:47   INFO  epoch: 7/24, acc_iter=27319, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:46, time_cost(all): 10:04:19/21:46:06, loss=0.495325764959527, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=3.607882654257258, lr=0.3582473659128499
2023-12-16 19:29:54   INFO  epoch: 7/24, acc_iter=27369, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:50, time_cost(all): 10:05:26/20:39:12, loss=0.495126908173478, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=1.9143724861781974, lr=0.3578719258971515
2023-12-16 19:31:01   INFO  epoch: 7/24, acc_iter=27419, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:14, time_cost(all): 10:06:33/21:05:51, loss=0.494928051387429, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=3.3407013623029886, lr=0.3574964858814531
2023-12-16 19:32:07   INFO  epoch: 7/24, acc_iter=27469, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:20, time_cost(all): 10:07:39/21:07:24, loss=0.49472919460138, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=0.5640441070723585, lr=0.3571210458657547
2023-12-16 19:33:14   INFO  epoch: 7/24, acc_iter=27519, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:49, time_cost(all): 10:08:46/21:14:17, loss=0.494530337815331, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=3.1935156294957316, lr=0.3567456058500563
2023-12-16 19:34:21   INFO  epoch: 7/24, acc_iter=27569, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:14, time_cost(all): 10:09:53/21:13:16, loss=0.494331481029282, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=3.7502309909021143, lr=0.3563701658343579
2023-12-16 19:35:27   INFO  epoch: 7/24, acc_iter=27619, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:25, time_cost(all): 10:10:59/20:53:47, loss=0.494132624243233, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=2.874708335424227, lr=0.35599472581865954
2023-12-16 19:36:34   INFO  epoch: 7/24, acc_iter=27669, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:25, time_cost(all): 10:12:06/21:41:15, loss=0.493933767457184, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=3.966396220435663, lr=0.3556192858029611
2023-12-16 19:37:41   INFO  epoch: 7/24, acc_iter=27719, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:34, time_cost(all): 10:13:13/20:28:06, loss=0.493734910671135, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=4.168044868857909, lr=0.3552438457872627
2023-12-16 19:38:47   INFO  epoch: 7/24, acc_iter=27769, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:17, time_cost(all): 10:14:19/20:26:22, loss=0.493536053885086, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=2.61198149926856, lr=0.3548684057715643
2023-12-16 19:39:54   INFO  epoch: 7/24, acc_iter=27819, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:22, time_cost(all): 10:15:26/20:53:35, loss=0.493337197099037, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=3.2957719354261017, lr=0.3544929657558659
2023-12-16 19:41:01   INFO  epoch: 7/24, acc_iter=27869, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:49, time_cost(all): 10:16:33/21:14:14, loss=0.493138340312988, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=0.5387668448993159, lr=0.3541175257401675
2023-12-16 19:42:07   INFO  epoch: 7/24, acc_iter=27919, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:55, time_cost(all): 10:17:39/21:56:38, loss=0.492939483526939, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=4.118021684037063, lr=0.3537420857244691
2023-12-16 19:43:14   INFO  epoch: 7/24, acc_iter=27969, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:34, time_cost(all): 10:18:46/21:22:10, loss=0.49274062674089, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=4.349097134926482, lr=0.35336664570877074
2023-12-16 19:44:21   INFO  epoch: 7/24, acc_iter=28019, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:30, time_cost(all): 10:19:53/19:54:33, loss=0.492541769954841, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=4.929667575309536, lr=0.3529912056930723
2023-12-16 19:45:27   INFO  epoch: 7/24, acc_iter=28069, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:25, time_cost(all): 10:20:59/20:47:35, loss=0.492342913168792, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=4.405300372093748, lr=0.35261576567737396
2023-12-16 19:46:34   INFO  epoch: 7/24, acc_iter=28119, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 10:22:06/21:02:10, loss=0.492144056382743, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=3.430251975453596, lr=0.3522403256616755
2023-12-16 19:47:41   INFO  epoch: 8/24, acc_iter=28186, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:13:25, time_cost(all): 10:23:13/21:47:10, loss=0.491877588289437, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=4.062173532716416, lr=0.35173723604063967
2023-12-16 19:48:47   INFO  epoch: 8/24, acc_iter=28236, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:18:04, time_cost(all): 10:24:19/21:00:39, loss=0.491678731503388, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=0.7860542495013403, lr=0.3513617960249413
2023-12-16 19:49:54   INFO  epoch: 8/24, acc_iter=28286, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:17:22, time_cost(all): 10:25:26/21:19:25, loss=0.491479874717339, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=2.089285949162952, lr=0.3509863560092429
2023-12-16 19:51:00   INFO  epoch: 8/24, acc_iter=28336, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:13:51, time_cost(all): 10:26:32/20:44:39, loss=0.49128101793129, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=2.090602302078584, lr=0.35061091599354444
2023-12-16 19:52:07   INFO  epoch: 8/24, acc_iter=28386, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:10:44, time_cost(all): 10:27:39/20:48:34, loss=0.491082161145241, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=3.5615566167228976, lr=0.3502354759778461
2023-12-16 19:53:14   INFO  epoch: 8/24, acc_iter=28436, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:08:12, time_cost(all): 10:28:46/20:35:13, loss=0.490883304359192, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=4.295166889233641, lr=0.34986003596214765
2023-12-16 19:54:20   INFO  epoch: 8/24, acc_iter=28486, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:09:25, time_cost(all): 10:29:52/21:26:25, loss=0.490684447573143, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=1.2175203758857749, lr=0.3494845959464493
2023-12-16 19:55:27   INFO  epoch: 8/24, acc_iter=28536, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:11:34, time_cost(all): 10:30:59/20:14:05, loss=0.490485590787094, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=2.2338393189235406, lr=0.34910915593075087
2023-12-16 19:56:34   INFO  epoch: 8/24, acc_iter=28586, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:08:51, time_cost(all): 10:32:06/20:57:12, loss=0.490286734001045, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=3.252520891151017, lr=0.3487337159150525
2023-12-16 19:57:40   INFO  epoch: 8/24, acc_iter=28636, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:09:05, time_cost(all): 10:33:12/21:12:59, loss=0.490087877214996, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=1.59512319661986, lr=0.3483582758993541
2023-12-16 19:58:47   INFO  epoch: 8/24, acc_iter=28686, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:04:08, time_cost(all): 10:34:19/20:05:56, loss=0.489889020428947, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=0.5340778000785265, lr=0.3479828358836557
2023-12-16 19:59:54   INFO  epoch: 8/24, acc_iter=28736, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:04:44, time_cost(all): 10:35:26/21:27:44, loss=0.489690163642898, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=1.893415248680076, lr=0.3476073958679573
2023-12-16 20:01:00   INFO  epoch: 8/24, acc_iter=28786, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:04:15, time_cost(all): 10:36:32/20:37:32, loss=0.489491306856849, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=1.9867671228580772, lr=0.34723195585225886
2023-12-16 20:02:07   INFO  epoch: 8/24, acc_iter=28836, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:03:06, time_cost(all): 10:37:39/19:54:39, loss=0.4892924500708, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=2.723489533211515, lr=0.3468565158365605
2023-12-16 20:03:14   INFO  epoch: 8/24, acc_iter=28886, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:02:20, time_cost(all): 10:38:46/19:43:37, loss=0.489093593284751, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=0.7605241657182229, lr=0.34648107582086207
2023-12-16 20:04:20   INFO  epoch: 8/24, acc_iter=28936, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:02:37, time_cost(all): 10:39:52/20:20:09, loss=0.488894736498702, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.401760708780858, lr=0.3461056358051637
2023-12-16 20:05:27   INFO  epoch: 8/24, acc_iter=28986, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:58:33, time_cost(all): 10:40:59/21:34:41, loss=0.488695879712653, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=2.9829591835837257, lr=0.3457301957894653
2023-12-16 20:06:34   INFO  epoch: 8/24, acc_iter=29036, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:57:03, time_cost(all): 10:42:06/20:25:51, loss=0.488497022926604, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=4.55784260087656, lr=0.3453547557737669
2023-12-16 20:07:40   INFO  epoch: 8/24, acc_iter=29086, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:56:23, time_cost(all): 10:43:12/21:11:59, loss=0.488298166140555, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=4.438179012014052, lr=0.3449793157580685
2023-12-16 20:08:47   INFO  epoch: 8/24, acc_iter=29136, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:58:33, time_cost(all): 10:44:19/20:03:11, loss=0.488099309354506, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=3.931986915330172, lr=0.34460387574237006
2023-12-16 20:09:54   INFO  epoch: 8/24, acc_iter=29186, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:54:53, time_cost(all): 10:45:26/21:23:43, loss=0.487900452568457, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=2.378375912265704, lr=0.3442284357266717
2023-12-16 20:11:00   INFO  epoch: 8/24, acc_iter=29236, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:40, time_cost(all): 10:46:32/20:02:32, loss=0.487701595782408, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=0.6120245460506764, lr=0.3438529957109733
2023-12-16 20:12:07   INFO  epoch: 8/24, acc_iter=29286, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:53:35, time_cost(all): 10:47:39/20:54:27, loss=0.487502738996359, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=3.7641573229815934, lr=0.34347755569527494
2023-12-16 20:13:14   INFO  epoch: 8/24, acc_iter=29336, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:52:07, time_cost(all): 10:48:46/20:10:10, loss=0.48730388221031, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=3.3934655766136625, lr=0.3431021156795765
2023-12-16 20:14:20   INFO  epoch: 8/24, acc_iter=29386, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:49:21, time_cost(all): 10:49:52/20:11:24, loss=0.487105025424261, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=4.029140391015914, lr=0.3427266756638781
2023-12-16 20:15:27   INFO  epoch: 8/24, acc_iter=29436, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:47:54, time_cost(all): 10:50:59/21:24:19, loss=0.486906168638212, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=2.7957948111844253, lr=0.3423512356481797
2023-12-16 20:16:34   INFO  epoch: 8/24, acc_iter=29486, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:49:36, time_cost(all): 10:52:06/20:42:16, loss=0.486707311852163, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=4.64665611226281, lr=0.3419757956324813
2023-12-16 20:17:40   INFO  epoch: 8/24, acc_iter=29536, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:47:23, time_cost(all): 10:53:12/19:58:44, loss=0.486508455066114, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.23(1.03), norm=2.3710395358203598, lr=0.3416003556167829
2023-12-16 20:18:47   INFO  epoch: 8/24, acc_iter=29586, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:44:46, time_cost(all): 10:54:19/19:56:39, loss=0.486309598280065, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=4.9208694998368445, lr=0.3412249156010845
2023-12-16 20:19:53   INFO  epoch: 8/24, acc_iter=29636, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:43:02, time_cost(all): 10:55:25/19:24:07, loss=0.486110741494016, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.83(1.03), norm=3.6239422798979475, lr=0.34084947558538614
2023-12-16 20:21:00   INFO  epoch: 8/24, acc_iter=29686, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:43, time_cost(all): 10:56:32/20:16:20, loss=0.485911884707967, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=3.055486108332622, lr=0.3404740355696877
2023-12-16 20:22:07   INFO  epoch: 8/24, acc_iter=29736, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:44:41, time_cost(all): 10:57:39/19:42:29, loss=0.485713027921918, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=2.036111602759913, lr=0.3400985955539893
2023-12-16 20:23:13   INFO  epoch: 8/24, acc_iter=29786, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:41:51, time_cost(all): 10:58:45/19:28:15, loss=0.485514171135869, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=4.5114365120172515, lr=0.3397231555382909
2023-12-16 20:24:20   INFO  epoch: 8/24, acc_iter=29836, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:49, time_cost(all): 10:59:52/19:36:45, loss=0.48531531434982, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=3.5327087263686012, lr=0.3393477155225925
2023-12-16 20:25:27   INFO  epoch: 8/24, acc_iter=29886, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:39:40, time_cost(all): 11:00:59/19:17:20, loss=0.485116457563771, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=4.000349899521687, lr=0.3389722755068941
2023-12-16 20:26:33   INFO  epoch: 8/24, acc_iter=29936, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:36:35, time_cost(all): 11:02:05/21:04:43, loss=0.484917600777722, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=0.894258507395381, lr=0.3385968354911957
2023-12-16 20:27:40   INFO  epoch: 8/24, acc_iter=29986, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:37:59, time_cost(all): 11:03:12/19:46:06, loss=0.484718743991673, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=3.977208481537413, lr=0.33822139547549734
2023-12-16 20:28:47   INFO  epoch: 8/24, acc_iter=30036, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:34:39, time_cost(all): 11:04:19/19:33:04, loss=0.484519887205624, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=3.086132913961613, lr=0.3378459554597989
2023-12-16 20:29:53   INFO  epoch: 8/24, acc_iter=30086, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:33:31, time_cost(all): 11:05:25/19:49:13, loss=0.484321030419575, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=4.197213339871256, lr=0.33747051544410056
2023-12-16 20:31:00   INFO  epoch: 8/24, acc_iter=30136, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:19, time_cost(all): 11:06:32/19:59:24, loss=0.484122173633526, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=2.898086891840002, lr=0.3370950754284021
2023-12-16 20:32:07   INFO  epoch: 8/24, acc_iter=30186, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:31:58, time_cost(all): 11:07:39/20:25:49, loss=0.483923316847477, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=1.555094794058702, lr=0.3367196354127037
2023-12-16 20:33:13   INFO  epoch: 8/24, acc_iter=30236, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:30:32, time_cost(all): 11:08:45/19:51:56, loss=0.483724460061428, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=1.0067902516680571, lr=0.33634419539700533
2023-12-16 20:34:20   INFO  epoch: 8/24, acc_iter=30286, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:57, time_cost(all): 11:09:52/20:05:00, loss=0.483525603275379, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=4.894561700072541, lr=0.33596875538130694
2023-12-16 20:35:27   INFO  epoch: 8/24, acc_iter=30336, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:27:57, time_cost(all): 11:10:59/19:27:18, loss=0.48332674648933, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=1.4915109737465286, lr=0.33559331536560855
2023-12-16 20:36:33   INFO  epoch: 8/24, acc_iter=30386, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:52, time_cost(all): 11:12:05/20:06:19, loss=0.483127889703281, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=4.002377104758285, lr=0.3352178753499101
2023-12-16 20:37:40   INFO  epoch: 8/24, acc_iter=30436, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:57, time_cost(all): 11:13:12/20:41:04, loss=0.482929032917232, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=3.0064743221957873, lr=0.33484243533421176
2023-12-16 20:38:47   INFO  epoch: 8/24, acc_iter=30486, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:26:17, time_cost(all): 11:14:19/19:15:30, loss=0.482730176131183, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=4.048866570143584, lr=0.3344669953185133
2023-12-16 20:39:53   INFO  epoch: 8/24, acc_iter=30536, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:23:47, time_cost(all): 11:15:25/19:41:01, loss=0.482531319345134, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=4.897452873277117, lr=0.3340915553028149
2023-12-16 20:41:00   INFO  epoch: 8/24, acc_iter=30586, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:31, time_cost(all): 11:16:32/19:27:07, loss=0.482332462559085, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=0.5805899393410465, lr=0.33371611528711653
2023-12-16 20:42:07   INFO  epoch: 8/24, acc_iter=30636, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:23:42, time_cost(all): 11:17:39/19:11:15, loss=0.482133605773036, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=4.700800401228695, lr=0.33334067527141814
2023-12-16 20:43:13   INFO  epoch: 8/24, acc_iter=30686, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:21:35, time_cost(all): 11:18:45/18:58:44, loss=0.481934748986987, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=2.4157149403741744, lr=0.33296523525571975
2023-12-16 20:44:20   INFO  epoch: 8/24, acc_iter=30736, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:58, time_cost(all): 11:19:52/19:30:09, loss=0.481735892200938, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.8118747303849405, lr=0.3325897952400213
2023-12-16 20:45:27   INFO  epoch: 8/24, acc_iter=30786, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:20:07, time_cost(all): 11:20:59/19:21:37, loss=0.481537035414889, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=4.7792536857678165, lr=0.33221435522432297
2023-12-16 20:46:33   INFO  epoch: 8/24, acc_iter=30836, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:52, time_cost(all): 11:22:05/19:54:21, loss=0.48133817862884, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.83(1.03), norm=1.4782513117652047, lr=0.3318389152086245
2023-12-16 20:47:40   INFO  epoch: 8/24, acc_iter=30886, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:26, time_cost(all): 11:23:12/19:16:22, loss=0.481139321842791, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=1.529341021336673, lr=0.3314634751929262
2023-12-16 20:48:47   INFO  epoch: 8/24, acc_iter=30936, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:35, time_cost(all): 11:24:19/19:25:17, loss=0.480940465056742, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=1.8164021813177593, lr=0.33108803517722774
2023-12-16 20:49:53   INFO  epoch: 8/24, acc_iter=30986, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:15, time_cost(all): 11:25:25/20:04:31, loss=0.480741608270693, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=1.9128807070022122, lr=0.33071259516152934
2023-12-16 20:51:00   INFO  epoch: 8/24, acc_iter=31036, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:46, time_cost(all): 11:26:32/18:53:00, loss=0.480542751484644, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=3.0189236535106265, lr=0.33033715514583095
2023-12-16 20:52:06   INFO  epoch: 8/24, acc_iter=31086, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:57, time_cost(all): 11:27:38/20:18:33, loss=0.480343894698595, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=3.1781190657731178, lr=0.32996171513013256
2023-12-16 20:53:13   INFO  epoch: 8/24, acc_iter=31136, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:43, time_cost(all): 11:28:45/19:45:17, loss=0.480145037912546, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=4.029446020626603, lr=0.32958627511443417
2023-12-16 20:54:20   INFO  epoch: 8/24, acc_iter=31186, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:15, time_cost(all): 11:29:52/20:38:59, loss=0.479946181126497, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=4.476784845945371, lr=0.3292108350987357
2023-12-16 20:55:26   INFO  epoch: 8/24, acc_iter=31236, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:43, time_cost(all): 11:30:58/20:00:30, loss=0.479747324340448, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=3.7073473327849076, lr=0.3288353950830374
2023-12-16 20:56:33   INFO  epoch: 8/24, acc_iter=31286, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:11, time_cost(all): 11:32:05/19:32:47, loss=0.479548467554399, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=1.1575138085908636, lr=0.32845995506733894
2023-12-16 20:57:40   INFO  epoch: 8/24, acc_iter=31336, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:20, time_cost(all): 11:33:12/19:01:47, loss=0.47934961076835, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=2.639445471301731, lr=0.32808451505164055
2023-12-16 20:58:46   INFO  epoch: 8/24, acc_iter=31386, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:46, time_cost(all): 11:34:18/20:25:55, loss=0.479150753982301, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=3.4137863885072717, lr=0.32770907503594215
2023-12-16 20:59:53   INFO  epoch: 8/24, acc_iter=31436, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:55, time_cost(all): 11:35:25/20:13:43, loss=0.478951897196252, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=3.3264614215457993, lr=0.32733363502024376
2023-12-16 21:01:00   INFO  epoch: 8/24, acc_iter=31486, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:36, time_cost(all): 11:36:32/20:35:08, loss=0.478753040410203, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=3.131986703911637, lr=0.32695819500454537
2023-12-16 21:02:06   INFO  epoch: 8/24, acc_iter=31536, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:36, time_cost(all): 11:37:38/20:11:53, loss=0.478554183624154, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=4.727618897187366, lr=0.3265827549888469
2023-12-16 21:03:13   INFO  epoch: 8/24, acc_iter=31586, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:26, time_cost(all): 11:38:45/18:41:09, loss=0.478355326838105, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=3.5194348829950313, lr=0.3262073149731486
2023-12-16 21:04:20   INFO  epoch: 8/24, acc_iter=31636, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 11:39:52/18:54:25, loss=0.478156470052056, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=2.1015077451283557, lr=0.3258318749574502
2023-12-16 21:05:26   INFO  epoch: 9/24, acc_iter=31703, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:17:26, time_cost(all): 11:40:58/19:24:06, loss=0.47789000195875, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=2.8385174248127623, lr=0.3253287853364143
2023-12-16 21:06:33   INFO  epoch: 9/24, acc_iter=31753, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:16:01, time_cost(all): 11:42:05/19:05:28, loss=0.477691145172701, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=1.7827580128136908, lr=0.3249533453207159
2023-12-16 21:07:40   INFO  epoch: 9/24, acc_iter=31803, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:12:10, time_cost(all): 11:43:12/19:51:29, loss=0.477492288386652, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=2.9277368451425865, lr=0.3245779053050175
2023-12-16 21:08:46   INFO  epoch: 9/24, acc_iter=31853, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:12:56, time_cost(all): 11:44:18/19:49:56, loss=0.477293431600603, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=1.216799766906583, lr=0.3242024652893191
2023-12-16 21:09:53   INFO  epoch: 9/24, acc_iter=31903, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:11:59, time_cost(all): 11:45:25/19:26:20, loss=0.477094574814554, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.98(1.03), norm=4.814410680816563, lr=0.3238270252736207
2023-12-16 21:11:00   INFO  epoch: 9/24, acc_iter=31953, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:11:49, time_cost(all): 11:46:32/18:48:16, loss=0.476895718028505, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.89(1.03), norm=0.7636349293625182, lr=0.3234515852579223
2023-12-16 21:12:06   INFO  epoch: 9/24, acc_iter=32003, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:10:14, time_cost(all): 11:47:38/18:53:39, loss=0.476696861242456, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=4.4388021123826285, lr=0.32307614524222394
2023-12-16 21:13:13   INFO  epoch: 9/24, acc_iter=32053, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:08:01, time_cost(all): 11:48:45/19:37:06, loss=0.476498004456407, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=3.3670536830538995, lr=0.32270070522652555
2023-12-16 21:14:20   INFO  epoch: 9/24, acc_iter=32103, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:11:05, time_cost(all): 11:49:52/20:15:25, loss=0.476299147670358, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=1.7234130171840343, lr=0.32232526521082716
2023-12-16 21:15:26   INFO  epoch: 9/24, acc_iter=32153, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:05:20, time_cost(all): 11:50:58/19:36:37, loss=0.476100290884309, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=0.9508476561184576, lr=0.3219498251951287
2023-12-16 21:16:33   INFO  epoch: 9/24, acc_iter=32203, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:07:26, time_cost(all): 11:52:05/18:41:28, loss=0.47590143409826, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=4.650848505830116, lr=0.3215743851794303
2023-12-16 21:17:40   INFO  epoch: 9/24, acc_iter=32253, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:02:17, time_cost(all): 11:53:12/19:21:56, loss=0.475702577312211, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=0.6994321568434709, lr=0.32119894516373193
2023-12-16 21:18:46   INFO  epoch: 9/24, acc_iter=32303, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:03:26, time_cost(all): 11:54:18/18:49:35, loss=0.475503720526162, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=2.0492547083416843, lr=0.32082350514803354
2023-12-16 21:19:53   INFO  epoch: 9/24, acc_iter=32353, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:01:47, time_cost(all): 11:55:25/18:36:25, loss=0.475304863740113, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=1.6441024265706266, lr=0.32044806513233515
2023-12-16 21:20:59   INFO  epoch: 9/24, acc_iter=32403, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:02:04, time_cost(all): 11:56:31/20:03:10, loss=0.475106006954064, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=2.6880888048016964, lr=0.3200726251166367
2023-12-16 21:22:06   INFO  epoch: 9/24, acc_iter=32453, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:01:47, time_cost(all): 11:57:38/19:19:33, loss=0.474907150168015, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=4.062290326619671, lr=0.31969718510093836
2023-12-16 21:23:13   INFO  epoch: 9/24, acc_iter=32503, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:58:56, time_cost(all): 11:58:45/19:58:33, loss=0.474708293381966, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=4.770043803199961, lr=0.3193217450852399
2023-12-16 21:24:19   INFO  epoch: 9/24, acc_iter=32553, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:57:34, time_cost(all): 11:59:51/19:02:13, loss=0.474509436595917, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=3.7825778465314457, lr=0.3189463050695416
2023-12-16 21:25:26   INFO  epoch: 9/24, acc_iter=32603, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:59:26, time_cost(all): 12:00:58/19:57:46, loss=0.474310579809868, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=0.9827340473295101, lr=0.31857086505384313
2023-12-16 21:26:33   INFO  epoch: 9/24, acc_iter=32653, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:53:30, time_cost(all): 12:02:05/20:06:01, loss=0.474111723023819, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=4.760089300306557, lr=0.3181954250381448
2023-12-16 21:27:39   INFO  epoch: 9/24, acc_iter=32703, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:54:00, time_cost(all): 12:03:11/19:17:29, loss=0.47391286623777, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=1.0707758971839538, lr=0.31781998502244635
2023-12-16 21:28:46   INFO  epoch: 9/24, acc_iter=32753, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:59, time_cost(all): 12:04:18/19:30:25, loss=0.473714009451721, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=2.1315175490010456, lr=0.31744454500674796
2023-12-16 21:29:53   INFO  epoch: 9/24, acc_iter=32803, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:52:18, time_cost(all): 12:05:25/18:27:15, loss=0.473515152665672, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.23(1.03), norm=2.6401264507300555, lr=0.31706910499104957
2023-12-16 21:30:59   INFO  epoch: 9/24, acc_iter=32853, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:53:06, time_cost(all): 12:06:31/20:05:30, loss=0.473316295879623, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=0.8123981925792022, lr=0.3166936649753512
2023-12-16 21:32:06   INFO  epoch: 9/24, acc_iter=32903, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:52:12, time_cost(all): 12:07:38/19:05:12, loss=0.473117439093574, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=1.8162014954306294, lr=0.3163182249596528
2023-12-16 21:33:13   INFO  epoch: 9/24, acc_iter=32953, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:47:42, time_cost(all): 12:08:45/18:17:02, loss=0.472918582307525, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=2.532551202700961, lr=0.31594278494395434
2023-12-16 21:34:19   INFO  epoch: 9/24, acc_iter=33003, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:46:18, time_cost(all): 12:09:51/19:34:52, loss=0.472719725521476, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=3.2330803640402834, lr=0.31556734492825594
2023-12-16 21:35:26   INFO  epoch: 9/24, acc_iter=33053, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:46:51, time_cost(all): 12:10:58/19:21:00, loss=0.472520868735427, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=0.6876054238260573, lr=0.31519190491255755
2023-12-16 21:36:33   INFO  epoch: 9/24, acc_iter=33103, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:47:51, time_cost(all): 12:12:05/19:48:12, loss=0.472322011949378, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=1.55299343460041, lr=0.31481646489685916
2023-12-16 21:37:39   INFO  epoch: 9/24, acc_iter=33153, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:43:46, time_cost(all): 12:13:11/18:28:53, loss=0.472123155163329, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.8952665297879567, lr=0.31444102488116077
2023-12-16 21:38:46   INFO  epoch: 9/24, acc_iter=33203, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:30, time_cost(all): 12:14:18/19:10:37, loss=0.47192429837728, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=4.954430737737874, lr=0.3140655848654623
2023-12-16 21:39:53   INFO  epoch: 9/24, acc_iter=33253, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:43:11, time_cost(all): 12:15:25/18:26:20, loss=0.471725441591231, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=2.005903438170243, lr=0.313690144849764
2023-12-16 21:40:59   INFO  epoch: 9/24, acc_iter=33303, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:42:25, time_cost(all): 12:16:31/19:00:31, loss=0.471526584805182, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=3.244995810557541, lr=0.31331470483406554
2023-12-16 21:42:06   INFO  epoch: 9/24, acc_iter=33353, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:41:05, time_cost(all): 12:17:38/19:38:31, loss=0.471327728019133, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=2.122153996717952, lr=0.3129392648183672
2023-12-16 21:43:13   INFO  epoch: 9/24, acc_iter=33403, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:40:26, time_cost(all): 12:18:45/19:48:53, loss=0.471128871233084, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=2.73397105188314, lr=0.31256382480266875
2023-12-16 21:44:19   INFO  epoch: 9/24, acc_iter=33453, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:37:50, time_cost(all): 12:19:51/19:17:52, loss=0.470930014447035, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=0.7863844191352324, lr=0.3121883847869704
2023-12-16 21:45:26   INFO  epoch: 9/24, acc_iter=33503, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:37:14, time_cost(all): 12:20:58/18:08:19, loss=0.470731157660986, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=1.7990953669604965, lr=0.31181294477127197
2023-12-16 21:46:33   INFO  epoch: 9/24, acc_iter=33553, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:35:30, time_cost(all): 12:22:05/18:22:21, loss=0.470532300874937, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.884705281525753, lr=0.3114375047555736
2023-12-16 21:47:39   INFO  epoch: 9/24, acc_iter=33603, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:34:47, time_cost(all): 12:23:11/19:44:21, loss=0.470333444088888, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=0.5871605946728025, lr=0.3110620647398752
2023-12-16 21:48:46   INFO  epoch: 9/24, acc_iter=33653, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:33:45, time_cost(all): 12:24:18/18:08:21, loss=0.470134587302839, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=4.766430501421167, lr=0.3106866247241768
2023-12-16 21:49:52   INFO  epoch: 9/24, acc_iter=33703, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:23, time_cost(all): 12:25:24/18:15:06, loss=0.46993573051679, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=3.7728590701025624, lr=0.3103111847084784
2023-12-16 21:50:59   INFO  epoch: 9/24, acc_iter=33753, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:30:15, time_cost(all): 12:26:31/19:10:22, loss=0.469736873730741, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=3.161046857514367, lr=0.30993574469277996
2023-12-16 21:52:06   INFO  epoch: 9/24, acc_iter=33803, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:41, time_cost(all): 12:27:38/18:54:37, loss=0.469538016944692, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=4.255160644287274, lr=0.30956030467708157
2023-12-16 21:53:12   INFO  epoch: 9/24, acc_iter=33853, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:28:35, time_cost(all): 12:28:44/17:53:55, loss=0.469339160158643, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=3.052007211962855, lr=0.3091848646613832
2023-12-16 21:54:19   INFO  epoch: 9/24, acc_iter=33903, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:46, time_cost(all): 12:29:51/18:00:57, loss=0.469140303372594, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=2.9215539094096687, lr=0.3088094246456848
2023-12-16 21:55:26   INFO  epoch: 9/24, acc_iter=33953, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:28, time_cost(all): 12:30:58/18:16:28, loss=0.468941446586545, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=0.8532996691014934, lr=0.3084339846299864
2023-12-16 21:56:32   INFO  epoch: 9/24, acc_iter=34003, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:15, time_cost(all): 12:32:04/18:09:05, loss=0.468742589800496, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=1.160267201567915, lr=0.30805854461428794
2023-12-16 21:57:39   INFO  epoch: 9/24, acc_iter=34053, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:59, time_cost(all): 12:33:11/19:08:52, loss=0.468543733014447, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=4.513954634297798, lr=0.3076831045985896
2023-12-16 21:58:46   INFO  epoch: 9/24, acc_iter=34103, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:54, time_cost(all): 12:34:18/19:19:51, loss=0.468344876228398, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=2.5858972004704355, lr=0.30730766458289116
2023-12-16 21:59:52   INFO  epoch: 9/24, acc_iter=34153, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:03, time_cost(all): 12:35:24/19:29:46, loss=0.468146019442349, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=4.558572811998718, lr=0.3069322245671928
2023-12-16 22:00:59   INFO  epoch: 9/24, acc_iter=34203, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:48, time_cost(all): 12:36:31/17:52:39, loss=0.4679471626563, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=2.0804834046320844, lr=0.3065567845514944
2023-12-16 22:02:06   INFO  epoch: 9/24, acc_iter=34253, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:34, time_cost(all): 12:37:38/18:32:54, loss=0.467748305870251, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=1.1765612288648826, lr=0.30618134453579604
2023-12-16 22:03:12   INFO  epoch: 9/24, acc_iter=34303, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:12, time_cost(all): 12:38:44/18:09:23, loss=0.467549449084202, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=1.2740570200477717, lr=0.3058059045200976
2023-12-16 22:04:19   INFO  epoch: 9/24, acc_iter=34353, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:39, time_cost(all): 12:39:51/19:09:40, loss=0.467350592298153, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=1.0443369907594418, lr=0.3054304645043992
2023-12-16 22:05:26   INFO  epoch: 9/24, acc_iter=34403, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:49, time_cost(all): 12:40:58/18:11:49, loss=0.467151735512104, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=4.660397829247536, lr=0.3050550244887008
2023-12-16 22:06:32   INFO  epoch: 9/24, acc_iter=34453, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:22, time_cost(all): 12:42:04/18:46:55, loss=0.466952878726055, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=2.317018686846385, lr=0.3046795844730024
2023-12-16 22:07:39   INFO  epoch: 9/24, acc_iter=34503, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:04, time_cost(all): 12:43:11/17:42:34, loss=0.466754021940006, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=4.130971786072104, lr=0.304304144457304
2023-12-16 22:08:46   INFO  epoch: 9/24, acc_iter=34553, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:16, time_cost(all): 12:44:18/19:07:55, loss=0.466555165153957, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.7111119507572896, lr=0.3039287044416056
2023-12-16 22:09:52   INFO  epoch: 9/24, acc_iter=34603, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:50, time_cost(all): 12:45:24/18:36:21, loss=0.466356308367908, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=4.729583747715367, lr=0.3035532644259072
2023-12-16 22:10:59   INFO  epoch: 9/24, acc_iter=34653, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:57, time_cost(all): 12:46:31/18:16:44, loss=0.466157451581859, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=1.176294638063812, lr=0.3031778244102088
2023-12-16 22:12:06   INFO  epoch: 9/24, acc_iter=34703, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:33, time_cost(all): 12:47:38/18:28:34, loss=0.46595859479581, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.98(1.03), norm=1.7329714461520245, lr=0.3028023843945104
2023-12-16 22:13:12   INFO  epoch: 9/24, acc_iter=34753, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:14, time_cost(all): 12:48:44/18:36:49, loss=0.465759738009761, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=2.276597125007303, lr=0.302426944378812
2023-12-16 22:14:19   INFO  epoch: 9/24, acc_iter=34803, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:12, time_cost(all): 12:49:51/18:18:41, loss=0.465560881223712, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=2.306636246075289, lr=0.30205150436311357
2023-12-16 22:15:26   INFO  epoch: 9/24, acc_iter=34853, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:55, time_cost(all): 12:50:58/18:24:00, loss=0.465362024437663, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=0.6297956555487972, lr=0.30167606434741523
2023-12-16 22:16:32   INFO  epoch: 9/24, acc_iter=34903, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:56, time_cost(all): 12:52:04/17:42:42, loss=0.465163167651614, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=0.5452448738229114, lr=0.3013006243317168
2023-12-16 22:17:39   INFO  epoch: 9/24, acc_iter=34953, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:56, time_cost(all): 12:53:11/18:58:26, loss=0.464964310865565, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=2.750826027559355, lr=0.30092518431601845
2023-12-16 22:18:45   INFO  epoch: 9/24, acc_iter=35003, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:41, time_cost(all): 12:54:17/18:45:25, loss=0.464765454079516, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=3.725601030521598, lr=0.30054974430032
2023-12-16 22:19:52   INFO  epoch: 9/24, acc_iter=35053, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:37, time_cost(all): 12:55:24/17:59:49, loss=0.464566597293467, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=0.50982586999929, lr=0.30017430428462166
2023-12-16 22:20:59   INFO  epoch: 9/24, acc_iter=35103, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:25, time_cost(all): 12:56:31/18:29:02, loss=0.464367740507418, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=2.576255190680277, lr=0.2997988642689232
2023-12-16 22:22:05   INFO  epoch: 9/24, acc_iter=35153, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 12:57:37/18:11:29, loss=0.464168883721369, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=4.5013581884243505, lr=0.2994234242532248
2023-12-16 22:23:12   INFO  epoch: 10/24, acc_iter=35220, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:18:42, time_cost(all): 12:58:44/18:07:08, loss=0.463902415628063, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=1.3214666939519486, lr=0.2989203346321889
2023-12-16 22:24:19   INFO  epoch: 10/24, acc_iter=35270, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:13:02, time_cost(all): 12:59:51/18:12:49, loss=0.463703558842014, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=3.16321073830148, lr=0.2985448946164906
2023-12-16 22:25:25   INFO  epoch: 10/24, acc_iter=35320, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:12:05, time_cost(all): 13:00:57/18:10:13, loss=0.463504702055965, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=3.024101083061918, lr=0.29816945460079214
2023-12-16 22:26:32   INFO  epoch: 10/24, acc_iter=35370, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:17:15, time_cost(all): 13:02:04/18:50:29, loss=0.463305845269916, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=3.9177501478084484, lr=0.2977940145850938
2023-12-16 22:27:39   INFO  epoch: 10/24, acc_iter=35420, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:11:39, time_cost(all): 13:03:11/18:19:03, loss=0.463106988483867, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=3.338683590800241, lr=0.29741857456939536
2023-12-16 22:28:45   INFO  epoch: 10/24, acc_iter=35470, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:14:20, time_cost(all): 13:04:17/17:59:09, loss=0.462908131697818, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=2.063745336324323, lr=0.297043134553697
2023-12-16 22:29:52   INFO  epoch: 10/24, acc_iter=35520, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:55, time_cost(all): 13:05:24/18:08:30, loss=0.462709274911769, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=4.622261246454362, lr=0.29666769453799857
2023-12-16 22:30:59   INFO  epoch: 10/24, acc_iter=35570, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:06:45, time_cost(all): 13:06:31/18:17:10, loss=0.46251041812572, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=3.2191052394181674, lr=0.2962922545223002
2023-12-16 22:32:05   INFO  epoch: 10/24, acc_iter=35620, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:08:25, time_cost(all): 13:07:37/17:36:16, loss=0.462311561339671, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=1.1604501689783886, lr=0.2959168145066018
2023-12-16 22:33:12   INFO  epoch: 10/24, acc_iter=35670, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:09:38, time_cost(all): 13:08:44/18:05:16, loss=0.462112704553622, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.032348597338449, lr=0.2955413744909034
2023-12-16 22:34:19   INFO  epoch: 10/24, acc_iter=35720, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:03:15, time_cost(all): 13:09:51/18:29:59, loss=0.461913847767573, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=4.227105289765527, lr=0.295165934475205
2023-12-16 22:35:25   INFO  epoch: 10/24, acc_iter=35770, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:02:26, time_cost(all): 13:10:57/18:19:23, loss=0.461714990981524, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=3.031711980189812, lr=0.29479049445950656
2023-12-16 22:36:32   INFO  epoch: 10/24, acc_iter=35820, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:06:34, time_cost(all): 13:12:04/17:12:02, loss=0.461516134195475, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=2.5509558394794754, lr=0.29441505444380817
2023-12-16 22:37:39   INFO  epoch: 10/24, acc_iter=35870, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:00:00, time_cost(all): 13:13:11/17:59:21, loss=0.461317277409426, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=3.2107390484351415, lr=0.2940396144281098
2023-12-16 22:38:45   INFO  epoch: 10/24, acc_iter=35920, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:58:36, time_cost(all): 13:14:17/18:36:01, loss=0.461118420623377, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=4.358928121697538, lr=0.2936641744124114
2023-12-16 22:39:52   INFO  epoch: 10/24, acc_iter=35970, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:01:34, time_cost(all): 13:15:24/18:52:02, loss=0.460919563837328, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=2.450299345101989, lr=0.293288734396713
2023-12-16 22:40:59   INFO  epoch: 10/24, acc_iter=36020, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/1:00:12, time_cost(all): 13:16:31/18:35:15, loss=0.460720707051279, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=1.7575972069694081, lr=0.29291329438101454
2023-12-16 22:42:05   INFO  epoch: 10/24, acc_iter=36070, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:57:07, time_cost(all): 13:17:37/18:36:16, loss=0.46052185026523, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=2.5290024147920707, lr=0.2925378543653162
2023-12-16 22:43:12   INFO  epoch: 10/24, acc_iter=36120, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:56:16, time_cost(all): 13:18:44/18:15:22, loss=0.460322993479181, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=3.4600228559962103, lr=0.29216241434961776
2023-12-16 22:44:19   INFO  epoch: 10/24, acc_iter=36170, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:53:07, time_cost(all): 13:19:51/17:41:41, loss=0.460124136693132, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=2.464412287959601, lr=0.2917869743339194
2023-12-16 22:45:25   INFO  epoch: 10/24, acc_iter=36220, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:56:05, time_cost(all): 13:20:57/17:20:40, loss=0.459925279907083, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=3.6104193145582735, lr=0.291411534318221
2023-12-16 22:46:32   INFO  epoch: 10/24, acc_iter=36270, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:08, time_cost(all): 13:22:04/18:19:40, loss=0.459726423121034, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=3.552910965597153, lr=0.29103609430252264
2023-12-16 22:47:39   INFO  epoch: 10/24, acc_iter=36320, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:52:53, time_cost(all): 13:23:11/18:00:58, loss=0.459527566334985, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=1.6053857733771058, lr=0.2906606542868242
2023-12-16 22:48:45   INFO  epoch: 10/24, acc_iter=36370, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:52:07, time_cost(all): 13:24:17/18:21:06, loss=0.459328709548936, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=0.9162516594908559, lr=0.2902852142711258
2023-12-16 22:49:52   INFO  epoch: 10/24, acc_iter=36420, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:47:51, time_cost(all): 13:25:24/18:05:08, loss=0.459129852762887, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=2.4204485342973587, lr=0.2899097742554274
2023-12-16 22:50:58   INFO  epoch: 10/24, acc_iter=36470, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:49:22, time_cost(all): 13:26:30/18:10:05, loss=0.458930995976838, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=4.317827907654471, lr=0.289534334239729
2023-12-16 22:52:05   INFO  epoch: 10/24, acc_iter=36520, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:02, time_cost(all): 13:27:37/18:37:33, loss=0.458732139190789, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=3.7992145054045867, lr=0.2891588942240306
2023-12-16 22:53:12   INFO  epoch: 10/24, acc_iter=36570, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:48:18, time_cost(all): 13:28:44/17:34:29, loss=0.45853328240474, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=4.09117433382407, lr=0.2887834542083322
2023-12-16 22:54:18   INFO  epoch: 10/24, acc_iter=36620, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:47:00, time_cost(all): 13:29:50/17:14:48, loss=0.458334425618691, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=4.371751508640738, lr=0.28840801419263384
2023-12-16 22:55:25   INFO  epoch: 10/24, acc_iter=36670, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:42:36, time_cost(all): 13:30:57/16:51:48, loss=0.458135568832642, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=1.5874294085019207, lr=0.2880325741769354
2023-12-16 22:56:32   INFO  epoch: 10/24, acc_iter=36720, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:42:21, time_cost(all): 13:32:04/17:13:15, loss=0.457936712046593, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=4.875624889098677, lr=0.28765713416123706
2023-12-16 22:57:38   INFO  epoch: 10/24, acc_iter=36770, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:42:01, time_cost(all): 13:33:10/16:53:13, loss=0.457737855260544, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=2.4378881542937614, lr=0.2872816941455386
2023-12-16 22:58:45   INFO  epoch: 10/24, acc_iter=36820, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:02, time_cost(all): 13:34:17/18:17:16, loss=0.457538998474495, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=2.517948353003784, lr=0.2869062541298402
2023-12-16 22:59:52   INFO  epoch: 10/24, acc_iter=36870, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:00, time_cost(all): 13:35:24/17:42:24, loss=0.457340141688446, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=3.5839986556042147, lr=0.28653081411414183
2023-12-16 23:00:58   INFO  epoch: 10/24, acc_iter=36920, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:39:56, time_cost(all): 13:36:30/17:41:02, loss=0.457141284902397, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=3.634788160132036, lr=0.28615537409844344
2023-12-16 23:02:05   INFO  epoch: 10/24, acc_iter=36970, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:38:14, time_cost(all): 13:37:37/17:40:18, loss=0.456942428116348, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=2.6986283759058804, lr=0.28577993408274505
2023-12-16 23:03:12   INFO  epoch: 10/24, acc_iter=37020, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:35:21, time_cost(all): 13:38:44/17:27:04, loss=0.456743571330299, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=4.1038300121827795, lr=0.2854044940670466
2023-12-16 23:04:18   INFO  epoch: 10/24, acc_iter=37070, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:37:14, time_cost(all): 13:39:50/18:07:47, loss=0.45654471454425, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=1.4176818729785894, lr=0.28502905405134826
2023-12-16 23:05:25   INFO  epoch: 10/24, acc_iter=37120, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:36:12, time_cost(all): 13:40:57/17:02:09, loss=0.456345857758201, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=0.9546112986612282, lr=0.2846536140356498
2023-12-16 23:06:32   INFO  epoch: 10/24, acc_iter=37170, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:35:12, time_cost(all): 13:42:04/16:56:54, loss=0.456147000972152, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=1.2036851111385396, lr=0.2842781740199514
2023-12-16 23:07:38   INFO  epoch: 10/24, acc_iter=37220, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:32:36, time_cost(all): 13:43:10/17:25:13, loss=0.455948144186103, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=3.9375603799490024, lr=0.28390273400425303
2023-12-16 23:08:45   INFO  epoch: 10/24, acc_iter=37270, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:30:16, time_cost(all): 13:44:17/17:36:09, loss=0.455749287400054, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=4.407346821709291, lr=0.28352729398855464
2023-12-16 23:09:52   INFO  epoch: 10/24, acc_iter=37320, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:28, time_cost(all): 13:45:24/17:57:09, loss=0.455550430614005, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=2.3482903433234483, lr=0.28315185397285625
2023-12-16 23:10:58   INFO  epoch: 10/24, acc_iter=37370, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:30:15, time_cost(all): 13:46:30/17:39:51, loss=0.455351573827956, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=1.4156638372587687, lr=0.2827764139571578
2023-12-16 23:12:05   INFO  epoch: 10/24, acc_iter=37420, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:27:20, time_cost(all): 13:47:37/17:17:55, loss=0.455152717041907, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=2.7309108025005573, lr=0.28240097394145947
2023-12-16 23:13:12   INFO  epoch: 10/24, acc_iter=37470, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:28:00, time_cost(all): 13:48:44/17:10:05, loss=0.454953860255858, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=4.839784900962209, lr=0.282025533925761
2023-12-16 23:14:18   INFO  epoch: 10/24, acc_iter=37520, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:24:38, time_cost(all): 13:49:50/17:32:26, loss=0.454755003469809, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=0.513238158261606, lr=0.2816500939100627
2023-12-16 23:15:25   INFO  epoch: 10/24, acc_iter=37570, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:24:57, time_cost(all): 13:50:57/17:06:24, loss=0.45455614668376, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=1.3893124706808162, lr=0.28127465389436423
2023-12-16 23:16:32   INFO  epoch: 10/24, acc_iter=37620, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:04, time_cost(all): 13:52:04/17:34:36, loss=0.454357289897711, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=4.0634851378092955, lr=0.28089921387866584
2023-12-16 23:17:38   INFO  epoch: 10/24, acc_iter=37670, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:48, time_cost(all): 13:53:10/17:51:57, loss=0.454158433111662, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=3.5298096072132816, lr=0.28052377386296745
2023-12-16 23:18:45   INFO  epoch: 10/24, acc_iter=37720, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:22:18, time_cost(all): 13:54:17/18:03:19, loss=0.453959576325613, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=0.8389414060131202, lr=0.28014833384726906
2023-12-16 23:19:51   INFO  epoch: 10/24, acc_iter=37770, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:38, time_cost(all): 13:55:23/17:33:51, loss=0.453760719539564, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=4.128114161172231, lr=0.27977289383157067
2023-12-16 23:20:58   INFO  epoch: 10/24, acc_iter=37820, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:43, time_cost(all): 13:56:30/17:57:20, loss=0.453561862753515, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=1.7181963144921992, lr=0.2793974538158722
2023-12-16 23:22:05   INFO  epoch: 10/24, acc_iter=37870, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:24, time_cost(all): 13:57:37/17:31:21, loss=0.453363005967466, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=2.808798828412524, lr=0.2790220138001739
2023-12-16 23:23:11   INFO  epoch: 10/24, acc_iter=37920, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:04, time_cost(all): 13:58:43/16:38:15, loss=0.453164149181417, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=4.492487238335597, lr=0.27864657378447544
2023-12-16 23:24:18   INFO  epoch: 10/24, acc_iter=37970, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:28, time_cost(all): 13:59:50/16:29:40, loss=0.452965292395368, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=1.489905366945792, lr=0.27827113376877705
2023-12-16 23:25:25   INFO  epoch: 10/24, acc_iter=38020, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:00, time_cost(all): 14:00:57/17:53:15, loss=0.452766435609319, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=2.4379628360237007, lr=0.27789569375307865
2023-12-16 23:26:31   INFO  epoch: 10/24, acc_iter=38070, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:11, time_cost(all): 14:02:03/17:24:23, loss=0.45256757882327, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=2.5232691014874007, lr=0.27752025373738026
2023-12-16 23:27:38   INFO  epoch: 10/24, acc_iter=38120, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:13:09, time_cost(all): 14:03:10/17:22:31, loss=0.452368722037221, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=4.489939568217174, lr=0.27714481372168187
2023-12-16 23:28:45   INFO  epoch: 10/24, acc_iter=38170, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:49, time_cost(all): 14:04:17/17:57:02, loss=0.452169865251172, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=0.7088443652901306, lr=0.2767693737059834
2023-12-16 23:29:51   INFO  epoch: 10/24, acc_iter=38220, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:06, time_cost(all): 14:05:23/17:17:00, loss=0.451971008465123, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=4.261571749676298, lr=0.2763939336902851
2023-12-16 23:30:58   INFO  epoch: 10/24, acc_iter=38270, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:00, time_cost(all): 14:06:30/17:23:56, loss=0.451772151679074, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.488029241811859, lr=0.27601849367458664
2023-12-16 23:32:05   INFO  epoch: 10/24, acc_iter=38320, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:10, time_cost(all): 14:07:37/16:40:46, loss=0.451573294893025, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=2.537750780941955, lr=0.2756430536588883
2023-12-16 23:33:11   INFO  epoch: 10/24, acc_iter=38370, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:47, time_cost(all): 14:08:43/16:51:24, loss=0.451374438106976, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=2.899626205799651, lr=0.27526761364318986
2023-12-16 23:34:18   INFO  epoch: 10/24, acc_iter=38420, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:38, time_cost(all): 14:09:50/17:00:31, loss=0.451175581320927, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=1.4168811672064674, lr=0.27489217362749147
2023-12-16 23:35:25   INFO  epoch: 10/24, acc_iter=38470, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:40, time_cost(all): 14:10:57/17:11:29, loss=0.450976724534878, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=1.4961341923480933, lr=0.2745167336117931
2023-12-16 23:36:31   INFO  epoch: 10/24, acc_iter=38520, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:32, time_cost(all): 14:12:03/17:15:04, loss=0.450777867748829, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=1.9711088299197426, lr=0.2741412935960946
2023-12-16 23:37:38   INFO  epoch: 10/24, acc_iter=38570, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:40, time_cost(all): 14:13:10/16:23:19, loss=0.45057901096278, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=4.212550951398677, lr=0.2737658535803963
2023-12-16 23:38:45   INFO  epoch: 10/24, acc_iter=38620, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:32, time_cost(all): 14:14:17/16:28:12, loss=0.450380154176731, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=3.93867852396251, lr=0.27339041356469784
2023-12-16 23:39:51   INFO  epoch: 10/24, acc_iter=38670, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 14:15:23/17:41:27, loss=0.450181297390682, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=0.8403452380217713, lr=0.2730149735489995
2023-12-16 23:40:58   INFO  epoch: 11/24, acc_iter=38737, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:17:32, time_cost(all): 14:16:30/17:16:01, loss=0.449914829297377, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=1.739531085343432, lr=0.2725118839279636
2023-12-16 23:42:05   INFO  epoch: 11/24, acc_iter=38787, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:12:49, time_cost(all): 14:17:37/16:20:56, loss=0.449715972511328, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=3.1905618694842897, lr=0.2721364439122652
2023-12-16 23:43:11   INFO  epoch: 11/24, acc_iter=38837, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:16:58, time_cost(all): 14:18:43/17:46:13, loss=0.449517115725279, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=2.5604376155771873, lr=0.2717610038965668
2023-12-16 23:44:18   INFO  epoch: 11/24, acc_iter=38887, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:15:21, time_cost(all): 14:19:50/17:18:39, loss=0.44931825893923, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=4.8443038775915195, lr=0.27138556388086843
2023-12-16 23:45:25   INFO  epoch: 11/24, acc_iter=38937, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:11:34, time_cost(all): 14:20:57/16:44:04, loss=0.449119402153181, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=4.091373154256958, lr=0.27101012386517
2023-12-16 23:46:31   INFO  epoch: 11/24, acc_iter=38987, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:12:50, time_cost(all): 14:22:03/16:17:31, loss=0.448920545367132, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=3.121079726520297, lr=0.27063468384947165
2023-12-16 23:47:38   INFO  epoch: 11/24, acc_iter=39037, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:13:03, time_cost(all): 14:23:10/16:52:35, loss=0.448721688581083, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=4.074601134540641, lr=0.2702592438337732
2023-12-16 23:48:44   INFO  epoch: 11/24, acc_iter=39087, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:11:46, time_cost(all): 14:24:16/16:18:05, loss=0.448522831795034, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=2.188948237214499, lr=0.26988380381807486
2023-12-16 23:49:51   INFO  epoch: 11/24, acc_iter=39137, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:08:38, time_cost(all): 14:25:23/17:03:28, loss=0.448323975008985, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=4.578529872421152, lr=0.2695083638023764
2023-12-16 23:50:58   INFO  epoch: 11/24, acc_iter=39187, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:10:16, time_cost(all): 14:26:30/16:45:55, loss=0.448125118222936, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=1.219878302350323, lr=0.269132923786678
2023-12-16 23:52:04   INFO  epoch: 11/24, acc_iter=39237, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:05:47, time_cost(all): 14:27:36/17:25:28, loss=0.447926261436887, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=1.9703866927514064, lr=0.26875748377097963
2023-12-16 23:53:11   INFO  epoch: 11/24, acc_iter=39287, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:04:42, time_cost(all): 14:28:43/16:28:53, loss=0.447727404650838, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=4.374739261039071, lr=0.26838204375528124
2023-12-16 23:54:18   INFO  epoch: 11/24, acc_iter=39337, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:02:19, time_cost(all): 14:29:50/17:35:40, loss=0.447528547864789, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=2.0250238665999287, lr=0.26800660373958285
2023-12-16 23:55:24   INFO  epoch: 11/24, acc_iter=39387, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:02:07, time_cost(all): 14:30:56/16:34:10, loss=0.44732969107874, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=3.1689252373894647, lr=0.2676311637238844
2023-12-16 23:56:31   INFO  epoch: 11/24, acc_iter=39437, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:03:33, time_cost(all): 14:32:03/16:23:42, loss=0.447130834292691, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=3.1128990570738564, lr=0.26725572370818607
2023-12-16 23:57:38   INFO  epoch: 11/24, acc_iter=39487, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:58:57, time_cost(all): 14:33:10/15:54:12, loss=0.446931977506642, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.03(1.03), norm=2.6147719250450074, lr=0.2668802836924876
2023-12-16 23:58:44   INFO  epoch: 11/24, acc_iter=39537, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:59:46, time_cost(all): 14:34:16/15:58:22, loss=0.446733120720593, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=4.188692236657257, lr=0.2665048436767892
2023-12-16 23:59:51   INFO  epoch: 11/24, acc_iter=39587, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:59:10, time_cost(all): 14:35:23/17:10:33, loss=0.446534263934544, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=0.9652645272168183, lr=0.26612940366109084
2023-12-17 00:00:58   INFO  epoch: 11/24, acc_iter=39637, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:55:20, time_cost(all): 14:36:30/17:10:10, loss=0.446335407148495, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=2.355849591135347, lr=0.26575396364539244
2023-12-17 00:02:04   INFO  epoch: 11/24, acc_iter=39687, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:54:39, time_cost(all): 14:37:36/17:03:27, loss=0.446136550362446, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=1.6412654416439916, lr=0.26537852362969405
2023-12-17 00:03:11   INFO  epoch: 11/24, acc_iter=39737, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:56:26, time_cost(all): 14:38:43/17:03:07, loss=0.445937693576397, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=3.259498627917046, lr=0.2650030836139956
2023-12-17 00:04:18   INFO  epoch: 11/24, acc_iter=39787, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:53:48, time_cost(all): 14:39:50/16:51:39, loss=0.445738836790348, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=4.438853947324002, lr=0.26462764359829727
2023-12-17 00:05:24   INFO  epoch: 11/24, acc_iter=39837, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:53:27, time_cost(all): 14:40:56/17:06:12, loss=0.445539980004299, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.9803012048616475, lr=0.2642522035825988
2023-12-17 00:06:31   INFO  epoch: 11/24, acc_iter=39887, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:18, time_cost(all): 14:42:03/15:46:12, loss=0.44534112321825, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=2.414086716290422, lr=0.2638767635669005
2023-12-17 00:07:38   INFO  epoch: 11/24, acc_iter=39937, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:50:31, time_cost(all): 14:43:10/17:05:39, loss=0.445142266432201, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=0.5900442283688698, lr=0.26350132355120204
2023-12-17 00:08:44   INFO  epoch: 11/24, acc_iter=39987, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:49:38, time_cost(all): 14:44:16/15:50:01, loss=0.444943409646152, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=4.906197196542871, lr=0.26312588353550365
2023-12-17 00:09:51   INFO  epoch: 11/24, acc_iter=40037, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:46:31, time_cost(all): 14:45:23/15:44:05, loss=0.444744552860103, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=2.0129797026861844, lr=0.26275044351980525
2023-12-17 00:10:58   INFO  epoch: 11/24, acc_iter=40087, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:46:43, time_cost(all): 14:46:30/16:26:26, loss=0.444545696074054, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=1.7348739240872084, lr=0.26237500350410686
2023-12-17 00:12:04   INFO  epoch: 11/24, acc_iter=40137, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:47:16, time_cost(all): 14:47:36/16:43:38, loss=0.444346839288005, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=1.972165341709331, lr=0.26199956348840847
2023-12-17 00:13:11   INFO  epoch: 11/24, acc_iter=40187, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:42:45, time_cost(all): 14:48:43/16:12:02, loss=0.444147982501956, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=4.4466106029140455, lr=0.26162412347271
2023-12-17 00:14:18   INFO  epoch: 11/24, acc_iter=40237, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:29, time_cost(all): 14:49:50/15:41:35, loss=0.443949125715907, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=2.275326191303733, lr=0.2612486834570117
2023-12-17 00:15:24   INFO  epoch: 11/24, acc_iter=40287, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:42:37, time_cost(all): 14:50:56/16:21:29, loss=0.443750268929858, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=1.8226409067501637, lr=0.26087324344131324
2023-12-17 00:16:31   INFO  epoch: 11/24, acc_iter=40337, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:43:15, time_cost(all): 14:52:03/16:26:18, loss=0.443551412143809, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=4.720145603472743, lr=0.26049780342561485
2023-12-17 00:17:37   INFO  epoch: 11/24, acc_iter=40387, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:36, time_cost(all): 14:53:09/15:53:21, loss=0.44335255535776, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=4.038219839411103, lr=0.26012236340991646
2023-12-17 00:18:44   INFO  epoch: 11/24, acc_iter=40437, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:38:35, time_cost(all): 14:54:16/16:12:05, loss=0.443153698571711, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=1.9768648158642175, lr=0.25974692339421807
2023-12-17 00:19:51   INFO  epoch: 11/24, acc_iter=40487, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:39:44, time_cost(all): 14:55:23/16:44:10, loss=0.442954841785662, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=4.861367065111883, lr=0.2593714833785197
2023-12-17 00:20:57   INFO  epoch: 11/24, acc_iter=40537, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:36:25, time_cost(all): 14:56:29/15:43:11, loss=0.442755984999613, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=4.245752896294599, lr=0.2589960433628212
2023-12-17 00:22:04   INFO  epoch: 11/24, acc_iter=40587, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:35:45, time_cost(all): 14:57:36/16:49:59, loss=0.442557128213564, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=3.0376283932130637, lr=0.2586206033471229
2023-12-17 00:23:11   INFO  epoch: 11/24, acc_iter=40637, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:33:50, time_cost(all): 14:58:43/15:52:28, loss=0.442358271427515, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=1.2796050058593256, lr=0.25824516333142444
2023-12-17 00:24:17   INFO  epoch: 11/24, acc_iter=40687, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:50, time_cost(all): 14:59:49/16:22:37, loss=0.442159414641466, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=0.7657638349060416, lr=0.2578697233157261
2023-12-17 00:25:24   INFO  epoch: 11/24, acc_iter=40737, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:31:36, time_cost(all): 15:00:56/16:25:58, loss=0.441960557855417, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=4.389345216705822, lr=0.2574942833000277
2023-12-17 00:26:31   INFO  epoch: 11/24, acc_iter=40787, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:26, time_cost(all): 15:02:03/15:50:10, loss=0.441761701069368, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=1.2246571933825239, lr=0.25711884328432927
2023-12-17 00:27:37   INFO  epoch: 11/24, acc_iter=40837, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:30:11, time_cost(all): 15:03:09/16:27:03, loss=0.441562844283319, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=2.801045296716905, lr=0.2567434032686309
2023-12-17 00:28:44   INFO  epoch: 11/24, acc_iter=40887, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:27:50, time_cost(all): 15:04:16/16:19:59, loss=0.44136398749727, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=4.709915626070521, lr=0.2563679632529325
2023-12-17 00:29:51   INFO  epoch: 11/24, acc_iter=40937, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:41, time_cost(all): 15:05:23/15:41:05, loss=0.441165130711221, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=0.8257835325164617, lr=0.2559925232372341
2023-12-17 00:30:57   INFO  epoch: 11/24, acc_iter=40987, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:28:03, time_cost(all): 15:06:29/16:40:01, loss=0.440966273925172, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=1.0939431643410378, lr=0.25561708322153565
2023-12-17 00:32:04   INFO  epoch: 11/24, acc_iter=41037, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:27:11, time_cost(all): 15:07:36/15:19:52, loss=0.440767417139123, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=1.1721311752243637, lr=0.2552416432058373
2023-12-17 00:33:11   INFO  epoch: 11/24, acc_iter=41087, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:23:55, time_cost(all): 15:08:43/16:33:35, loss=0.440568560353074, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=0.514201906581175, lr=0.25486620319013886
2023-12-17 00:34:17   INFO  epoch: 11/24, acc_iter=41137, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:35, time_cost(all): 15:09:49/15:21:06, loss=0.440369703567025, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=4.99717995732265, lr=0.2544907631744405
2023-12-17 00:35:24   INFO  epoch: 11/24, acc_iter=41187, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:50, time_cost(all): 15:10:56/16:15:42, loss=0.440170846780976, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=2.0921581651492023, lr=0.2541153231587421
2023-12-17 00:36:31   INFO  epoch: 11/24, acc_iter=41237, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:21:04, time_cost(all): 15:12:03/16:49:40, loss=0.439971989994927, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=3.08848537722143, lr=0.2537398831430437
2023-12-17 00:37:37   INFO  epoch: 11/24, acc_iter=41287, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:55, time_cost(all): 15:13:09/15:32:20, loss=0.439773133208878, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=2.238124158894169, lr=0.2533644431273453
2023-12-17 00:38:44   INFO  epoch: 11/24, acc_iter=41337, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:57, time_cost(all): 15:14:16/15:37:19, loss=0.439574276422829, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.83(1.03), norm=1.097904669280115, lr=0.25298900311164696
2023-12-17 00:39:51   INFO  epoch: 11/24, acc_iter=41387, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:46, time_cost(all): 15:15:23/16:15:51, loss=0.43937541963678, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=4.2370483484292585, lr=0.2526135630959485
2023-12-17 00:40:57   INFO  epoch: 11/24, acc_iter=41437, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:33, time_cost(all): 15:16:29/15:52:37, loss=0.439176562850731, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=1.9772619647997773, lr=0.25223812308025007
2023-12-17 00:42:04   INFO  epoch: 11/24, acc_iter=41487, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:59, time_cost(all): 15:17:36/15:54:03, loss=0.438977706064682, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=2.5646786940321777, lr=0.25186268306455173
2023-12-17 00:43:11   INFO  epoch: 11/24, acc_iter=41537, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:19, time_cost(all): 15:18:43/16:16:40, loss=0.438778849278633, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=2.677965842105658, lr=0.25148724304885334
2023-12-17 00:44:17   INFO  epoch: 11/24, acc_iter=41587, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:18, time_cost(all): 15:19:49/15:26:58, loss=0.438579992492584, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=1.2240401921732733, lr=0.2511118030331549
2023-12-17 00:45:24   INFO  epoch: 11/24, acc_iter=41637, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:13:02, time_cost(all): 15:20:56/16:23:15, loss=0.438381135706535, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=0.6545269461830229, lr=0.2507363630174565
2023-12-17 00:46:31   INFO  epoch: 11/24, acc_iter=41687, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:32, time_cost(all): 15:22:03/15:37:48, loss=0.438182278920486, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=2.8387443066204994, lr=0.2503609230017581
2023-12-17 00:47:37   INFO  epoch: 11/24, acc_iter=41737, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:09, time_cost(all): 15:23:09/15:31:42, loss=0.437983422134437, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=4.999788712725794, lr=0.24998548298605971
2023-12-17 00:48:44   INFO  epoch: 11/24, acc_iter=41787, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:07, time_cost(all): 15:24:16/15:52:51, loss=0.437784565348388, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=3.50973647426276, lr=0.24961004297036127
2023-12-17 00:49:50   INFO  epoch: 11/24, acc_iter=41837, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:28, time_cost(all): 15:25:22/16:22:43, loss=0.437585708562339, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=0.7322810226123327, lr=0.24923460295466293
2023-12-17 00:50:57   INFO  epoch: 11/24, acc_iter=41887, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:09, time_cost(all): 15:26:29/16:02:07, loss=0.43738685177629, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=1.5732531357727078, lr=0.24885916293896454
2023-12-17 00:52:04   INFO  epoch: 11/24, acc_iter=41937, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:06, time_cost(all): 15:27:36/15:22:22, loss=0.437187994990241, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=1.5623346628976043, lr=0.24848372292326615
2023-12-17 00:53:10   INFO  epoch: 11/24, acc_iter=41987, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:43, time_cost(all): 15:28:42/16:28:29, loss=0.436989138204192, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=3.794001999295229, lr=0.2481082829075677
2023-12-17 00:54:17   INFO  epoch: 11/24, acc_iter=42037, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:44, time_cost(all): 15:29:49/15:38:08, loss=0.436790281418143, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=2.80291004377144, lr=0.2477328428918693
2023-12-17 00:55:24   INFO  epoch: 11/24, acc_iter=42087, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:42, time_cost(all): 15:30:56/16:00:53, loss=0.436591424632094, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=0.989933143921385, lr=0.24735740287617092
2023-12-17 00:56:30   INFO  epoch: 11/24, acc_iter=42137, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:25, time_cost(all): 15:32:02/15:22:51, loss=0.436392567846045, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=0.5753948345933844, lr=0.24698196286047253
2023-12-17 00:57:37   INFO  epoch: 11/24, acc_iter=42187, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 15:33:09/15:55:03, loss=0.436193711059996, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.07(1.03), norm=0.9030535783340906, lr=0.24660652284477408
2023-12-17 00:58:44   INFO  epoch: 12/24, acc_iter=42254, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:17:13, time_cost(all): 15:34:16/15:54:04, loss=0.43592724296669, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=2.7512432576542896, lr=0.2461034332237383
2023-12-17 00:59:50   INFO  epoch: 12/24, acc_iter=42304, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:17:45, time_cost(all): 15:35:22/16:06:36, loss=0.435728386180641, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=1.6015997897881102, lr=0.2457279932080399
2023-12-17 01:00:57   INFO  epoch: 12/24, acc_iter=42354, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:13:38, time_cost(all): 15:36:29/15:59:53, loss=0.435529529394592, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=2.378683120068632, lr=0.2453525531923415
2023-12-17 01:02:04   INFO  epoch: 12/24, acc_iter=42404, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:11:50, time_cost(all): 15:37:36/15:08:18, loss=0.435330672608543, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=0.5095013357487029, lr=0.24497711317664306
2023-12-17 01:03:10   INFO  epoch: 12/24, acc_iter=42454, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:15:13, time_cost(all): 15:38:42/15:22:37, loss=0.435131815822494, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=1.7837223015659114, lr=0.24460167316094467
2023-12-17 01:04:17   INFO  epoch: 12/24, acc_iter=42504, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:12:14, time_cost(all): 15:39:49/15:48:51, loss=0.434932959036445, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=4.995824554688657, lr=0.24422623314524627
2023-12-17 01:05:24   INFO  epoch: 12/24, acc_iter=42554, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:13:34, time_cost(all): 15:40:56/16:17:43, loss=0.434734102250396, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=2.457029300561708, lr=0.24385079312954788
2023-12-17 01:06:30   INFO  epoch: 12/24, acc_iter=42604, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:08:17, time_cost(all): 15:42:02/15:05:27, loss=0.434535245464347, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=1.5999118317330137, lr=0.24347535311384944
2023-12-17 01:07:37   INFO  epoch: 12/24, acc_iter=42654, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:05:46, time_cost(all): 15:43:09/15:32:19, loss=0.434336388678298, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=1.4421828441051503, lr=0.2430999130981511
2023-12-17 01:08:44   INFO  epoch: 12/24, acc_iter=42704, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:04:42, time_cost(all): 15:44:16/15:32:41, loss=0.434137531892249, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=2.484618093499087, lr=0.2427244730824527
2023-12-17 01:09:50   INFO  epoch: 12/24, acc_iter=42754, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:05:52, time_cost(all): 15:45:22/15:29:37, loss=0.4339386751062, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=3.7097659603355213, lr=0.24234903306675432
2023-12-17 01:10:57   INFO  epoch: 12/24, acc_iter=42804, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:02:27, time_cost(all): 15:46:29/15:28:00, loss=0.433739818320151, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=4.324523271567772, lr=0.24197359305105587
2023-12-17 01:12:04   INFO  epoch: 12/24, acc_iter=42854, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:06:50, time_cost(all): 15:47:36/15:34:04, loss=0.433540961534102, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=1.89124839366526, lr=0.24159815303535748
2023-12-17 01:13:10   INFO  epoch: 12/24, acc_iter=42904, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:00:00, time_cost(all): 15:48:42/14:55:25, loss=0.433342104748053, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=0.5509499736372856, lr=0.24122271301965909
2023-12-17 01:14:17   INFO  epoch: 12/24, acc_iter=42954, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:03:25, time_cost(all): 15:49:49/15:11:27, loss=0.433143247962004, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=3.9048982938475514, lr=0.2408472730039607
2023-12-17 01:15:24   INFO  epoch: 12/24, acc_iter=43004, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:58:39, time_cost(all): 15:50:56/15:52:54, loss=0.432944391175955, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=0.5865354862879448, lr=0.24047183298826225
2023-12-17 01:16:30   INFO  epoch: 12/24, acc_iter=43054, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:56:57, time_cost(all): 15:52:02/16:05:52, loss=0.432745534389906, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.23(1.03), norm=4.9792578743899005, lr=0.2400963929725639
2023-12-17 01:17:37   INFO  epoch: 12/24, acc_iter=43104, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:59:18, time_cost(all): 15:53:09/14:50:51, loss=0.432546677603857, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=3.7244096610484267, lr=0.23972095295686552
2023-12-17 01:18:43   INFO  epoch: 12/24, acc_iter=43154, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:54:45, time_cost(all): 15:54:15/15:16:56, loss=0.432347820817808, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=4.748174631975472, lr=0.23934551294116713
2023-12-17 01:19:50   INFO  epoch: 12/24, acc_iter=43204, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:56:59, time_cost(all): 15:55:22/15:17:19, loss=0.432148964031759, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=4.876993430728699, lr=0.23897007292546868
2023-12-17 01:20:57   INFO  epoch: 12/24, acc_iter=43254, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:56:07, time_cost(all): 15:56:29/14:34:59, loss=0.43195010724571, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=4.697830114027881, lr=0.2385946329097703
2023-12-17 01:22:03   INFO  epoch: 12/24, acc_iter=43304, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:26, time_cost(all): 15:57:35/14:46:54, loss=0.431751250459661, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=4.998867817770327, lr=0.2382191928940719
2023-12-17 01:23:10   INFO  epoch: 12/24, acc_iter=43354, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:53:34, time_cost(all): 15:58:42/15:12:26, loss=0.431552393673612, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=1.6865203902244361, lr=0.2378437528783735
2023-12-17 01:24:17   INFO  epoch: 12/24, acc_iter=43404, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:18, time_cost(all): 15:59:49/15:24:03, loss=0.431353536887563, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=4.090303899962947, lr=0.23746831286267506
2023-12-17 01:25:23   INFO  epoch: 12/24, acc_iter=43454, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:47:57, time_cost(all): 16:00:55/15:19:39, loss=0.431154680101514, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=0.7739982416594983, lr=0.23709287284697672
2023-12-17 01:26:30   INFO  epoch: 12/24, acc_iter=43504, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:51:17, time_cost(all): 16:02:02/14:46:40, loss=0.430955823315465, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=2.395942725323837, lr=0.23671743283127833
2023-12-17 01:27:37   INFO  epoch: 12/24, acc_iter=43554, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:49:06, time_cost(all): 16:03:09/14:33:58, loss=0.430756966529416, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=3.789414513370983, lr=0.23634199281557994
2023-12-17 01:28:43   INFO  epoch: 12/24, acc_iter=43604, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:46:04, time_cost(all): 16:04:15/15:12:23, loss=0.430558109743367, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=2.72562593828943, lr=0.2359665527998815
2023-12-17 01:29:50   INFO  epoch: 12/24, acc_iter=43654, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:44:32, time_cost(all): 16:05:22/15:33:24, loss=0.430359252957318, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=1.9380647229263366, lr=0.2355911127841831
2023-12-17 01:30:57   INFO  epoch: 12/24, acc_iter=43704, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:46:13, time_cost(all): 16:06:29/15:04:18, loss=0.430160396171269, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=1.3311983132522927, lr=0.2352156727684847
2023-12-17 01:32:03   INFO  epoch: 12/24, acc_iter=43754, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:42:48, time_cost(all): 16:07:35/15:45:02, loss=0.42996153938522, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=3.798541464852672, lr=0.23484023275278632
2023-12-17 01:33:10   INFO  epoch: 12/24, acc_iter=43804, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:41:26, time_cost(all): 16:08:42/15:48:57, loss=0.429762682599171, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=4.250081968458616, lr=0.23446479273708787
2023-12-17 01:34:17   INFO  epoch: 12/24, acc_iter=43854, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:11, time_cost(all): 16:09:49/15:50:39, loss=0.429563825813122, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=1.0194938292864764, lr=0.23408935272138953
2023-12-17 01:35:23   INFO  epoch: 12/24, acc_iter=43904, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:41:36, time_cost(all): 16:10:55/14:56:22, loss=0.429364969027073, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=2.042517435147537, lr=0.23371391270569114
2023-12-17 01:36:30   INFO  epoch: 12/24, acc_iter=43954, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:39:28, time_cost(all): 16:12:02/15:37:58, loss=0.429166112241024, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=4.27871233502126, lr=0.23333847268999275
2023-12-17 01:37:37   INFO  epoch: 12/24, acc_iter=44004, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:40:00, time_cost(all): 16:13:09/15:34:03, loss=0.428967255454975, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=3.3808526162485126, lr=0.2329630326742943
2023-12-17 01:38:43   INFO  epoch: 12/24, acc_iter=44054, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:38:32, time_cost(all): 16:14:15/14:40:36, loss=0.428768398668926, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=4.0066424933667815, lr=0.2325875926585959
2023-12-17 01:39:50   INFO  epoch: 12/24, acc_iter=44104, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:34:56, time_cost(all): 16:15:22/15:37:04, loss=0.428569541882877, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=4.350637650551433, lr=0.23221215264289752
2023-12-17 01:40:57   INFO  epoch: 12/24, acc_iter=44154, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:35:20, time_cost(all): 16:16:29/14:22:03, loss=0.428370685096828, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=1.2621838317630392, lr=0.23183671262719913
2023-12-17 01:42:03   INFO  epoch: 12/24, acc_iter=44204, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:17, time_cost(all): 16:17:35/15:30:35, loss=0.428171828310779, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=1.607836756784331, lr=0.23146127261150068
2023-12-17 01:43:10   INFO  epoch: 12/24, acc_iter=44254, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:31:43, time_cost(all): 16:18:42/14:28:13, loss=0.42797297152473, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=2.41137423616114, lr=0.23108583259580234
2023-12-17 01:44:17   INFO  epoch: 12/24, acc_iter=44304, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:55, time_cost(all): 16:19:49/14:15:05, loss=0.427774114738681, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.9056121282585448, lr=0.23071039258010395
2023-12-17 01:45:23   INFO  epoch: 12/24, acc_iter=44354, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:35, time_cost(all): 16:20:55/14:12:46, loss=0.427575257952632, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=3.458657961572512, lr=0.23033495256440556
2023-12-17 01:46:30   INFO  epoch: 12/24, acc_iter=44404, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:58, time_cost(all): 16:22:02/14:16:19, loss=0.427376401166583, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=3.361892279545886, lr=0.2299595125487071
2023-12-17 01:47:36   INFO  epoch: 12/24, acc_iter=44454, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:29:06, time_cost(all): 16:23:08/15:18:09, loss=0.427177544380534, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=1.0837689605016072, lr=0.22958407253300872
2023-12-17 01:48:43   INFO  epoch: 12/24, acc_iter=44504, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:08, time_cost(all): 16:24:15/14:23:53, loss=0.426978687594485, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=1.821595408976809, lr=0.22920863251731033
2023-12-17 01:49:50   INFO  epoch: 12/24, acc_iter=44554, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:26:19, time_cost(all): 16:25:22/15:32:26, loss=0.426779830808436, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=2.3609815282417337, lr=0.22883319250161194
2023-12-17 01:50:56   INFO  epoch: 12/24, acc_iter=44604, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:12, time_cost(all): 16:26:28/14:28:56, loss=0.426580974022387, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=3.076019757049822, lr=0.2284577524859135
2023-12-17 01:52:03   INFO  epoch: 12/24, acc_iter=44654, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:14, time_cost(all): 16:27:35/15:12:24, loss=0.426382117236338, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=1.5704378115687476, lr=0.22808231247021515
2023-12-17 01:53:10   INFO  epoch: 12/24, acc_iter=44704, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:23:33, time_cost(all): 16:28:42/15:18:48, loss=0.426183260450289, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=1.6328766370849608, lr=0.22770687245451676
2023-12-17 01:54:16   INFO  epoch: 12/24, acc_iter=44754, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:26, time_cost(all): 16:29:48/14:54:10, loss=0.42598440366424, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=4.263847168306894, lr=0.22733143243881837
2023-12-17 01:55:23   INFO  epoch: 12/24, acc_iter=44804, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:59, time_cost(all): 16:30:55/15:12:58, loss=0.425785546878191, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=2.30699119260331, lr=0.22695599242311992
2023-12-17 01:56:30   INFO  epoch: 12/24, acc_iter=44854, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:22, time_cost(all): 16:32:02/14:22:21, loss=0.425586690092142, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=2.068658713655891, lr=0.22658055240742153
2023-12-17 01:57:36   INFO  epoch: 12/24, acc_iter=44904, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:17, time_cost(all): 16:33:08/14:52:02, loss=0.425387833306093, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=3.858472082260204, lr=0.22620511239172314
2023-12-17 01:58:43   INFO  epoch: 12/24, acc_iter=44954, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:17, time_cost(all): 16:34:15/13:59:52, loss=0.425188976520044, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=3.8993599876028555, lr=0.22582967237602475
2023-12-17 01:59:50   INFO  epoch: 12/24, acc_iter=45004, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:20, time_cost(all): 16:35:22/14:45:16, loss=0.424990119733995, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=3.9560254112582065, lr=0.2254542323603263
2023-12-17 02:00:56   INFO  epoch: 12/24, acc_iter=45054, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:27, time_cost(all): 16:36:28/14:43:45, loss=0.424791262947946, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=3.4807762086690026, lr=0.22507879234462796
2023-12-17 02:02:03   INFO  epoch: 12/24, acc_iter=45104, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:20, time_cost(all): 16:37:35/14:51:32, loss=0.424592406161897, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=4.67018061196724, lr=0.22470335232892957
2023-12-17 02:03:10   INFO  epoch: 12/24, acc_iter=45154, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:01, time_cost(all): 16:38:42/15:18:32, loss=0.424393549375848, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=1.8421045154166271, lr=0.22432791231323118
2023-12-17 02:04:16   INFO  epoch: 12/24, acc_iter=45204, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:23, time_cost(all): 16:39:48/14:57:08, loss=0.424194692589799, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=4.158269159311238, lr=0.2239524722975328
2023-12-17 02:05:23   INFO  epoch: 12/24, acc_iter=45254, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:18, time_cost(all): 16:40:55/14:48:58, loss=0.42399583580375, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=4.411917765089887, lr=0.22357703228183434
2023-12-17 02:06:30   INFO  epoch: 12/24, acc_iter=45304, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:19, time_cost(all): 16:42:02/14:19:56, loss=0.423796979017701, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.04(1.03), norm=2.6847383306483064, lr=0.22320159226613595
2023-12-17 02:07:36   INFO  epoch: 12/24, acc_iter=45354, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:07:49, time_cost(all): 16:43:08/14:50:20, loss=0.423598122231652, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=4.817102360937305, lr=0.22282615225043756
2023-12-17 02:08:43   INFO  epoch: 12/24, acc_iter=45404, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:16, time_cost(all): 16:44:15/14:22:18, loss=0.423399265445603, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=1.393521011176793, lr=0.22245071223473917
2023-12-17 02:09:50   INFO  epoch: 12/24, acc_iter=45454, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:13, time_cost(all): 16:45:22/13:56:24, loss=0.423200408659554, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=2.0182491512259784, lr=0.22207527221904078
2023-12-17 02:10:56   INFO  epoch: 12/24, acc_iter=45504, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:48, time_cost(all): 16:46:28/14:36:37, loss=0.423001551873505, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=4.459717340876306, lr=0.22169983220334238
2023-12-17 02:12:03   INFO  epoch: 12/24, acc_iter=45554, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:45, time_cost(all): 16:47:35/13:47:35, loss=0.422802695087456, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=2.6835091477753235, lr=0.221324392187644
2023-12-17 02:13:10   INFO  epoch: 12/24, acc_iter=45604, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:38, time_cost(all): 16:48:42/14:58:47, loss=0.422603838301407, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=0.6870791566468253, lr=0.2209489521719456
2023-12-17 02:14:16   INFO  epoch: 12/24, acc_iter=45654, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:31, time_cost(all): 16:49:48/14:56:15, loss=0.422404981515358, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.992844941587838, lr=0.22057351215624715
2023-12-17 02:15:23   INFO  epoch: 12/24, acc_iter=45704, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 16:50:55/14:52:54, loss=0.422206124729309, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=3.7448800221822336, lr=0.22019807214054876
2023-12-17 02:16:30   INFO  epoch: 13/24, acc_iter=45771, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:17:54, time_cost(all): 16:52:02/15:02:51, loss=0.421939656636003, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=3.297566669025015, lr=0.21969498251951292
2023-12-17 02:17:36   INFO  epoch: 13/24, acc_iter=45821, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:12:13, time_cost(all): 16:53:08/13:47:17, loss=0.421740799849954, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=3.899237776236334, lr=0.21931954250381452
2023-12-17 02:18:43   INFO  epoch: 13/24, acc_iter=45871, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:11:57, time_cost(all): 16:54:15/14:49:49, loss=0.421541943063905, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=3.0279588113210436, lr=0.21894410248811613
2023-12-17 02:19:49   INFO  epoch: 13/24, acc_iter=45921, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:11:05, time_cost(all): 16:55:21/14:54:52, loss=0.421343086277856, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=1.211959198813161, lr=0.21856866247241774
2023-12-17 02:20:56   INFO  epoch: 13/24, acc_iter=45971, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:12:33, time_cost(all): 16:56:28/13:38:22, loss=0.421144229491807, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=4.808314728497618, lr=0.21819322245671935
2023-12-17 02:22:03   INFO  epoch: 13/24, acc_iter=46021, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:11:37, time_cost(all): 16:57:35/13:46:06, loss=0.420945372705758, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=4.041558414667797, lr=0.21781778244102096
2023-12-17 02:23:09   INFO  epoch: 13/24, acc_iter=46071, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:09:27, time_cost(all): 16:58:41/13:35:10, loss=0.420746515919709, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=2.6351357613172235, lr=0.2174423424253225
2023-12-17 02:24:16   INFO  epoch: 13/24, acc_iter=46121, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:06:23, time_cost(all): 16:59:48/13:34:18, loss=0.42054765913366, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=2.237459620704068, lr=0.21706690240962412
2023-12-17 02:25:23   INFO  epoch: 13/24, acc_iter=46171, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:04:57, time_cost(all): 17:00:55/14:24:03, loss=0.420348802347611, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=2.2481738695336024, lr=0.21669146239392573
2023-12-17 02:26:29   INFO  epoch: 13/24, acc_iter=46221, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:06:59, time_cost(all): 17:02:01/14:45:24, loss=0.420149945561562, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=2.2932352799164484, lr=0.21631602237822734
2023-12-17 02:27:36   INFO  epoch: 13/24, acc_iter=46271, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:09:10, time_cost(all): 17:03:08/14:09:42, loss=0.419951088775513, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.8086718980736114, lr=0.21594058236252894
2023-12-17 02:28:43   INFO  epoch: 13/24, acc_iter=46321, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:02:59, time_cost(all): 17:04:15/14:08:01, loss=0.419752231989464, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=2.681689754129176, lr=0.21556514234683055
2023-12-17 02:29:49   INFO  epoch: 13/24, acc_iter=46371, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:06:18, time_cost(all): 17:05:21/13:38:52, loss=0.419553375203415, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=4.700251671268613, lr=0.21518970233113216
2023-12-17 02:30:56   INFO  epoch: 13/24, acc_iter=46421, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:05:41, time_cost(all): 17:06:28/14:05:35, loss=0.419354518417366, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=3.299139885245721, lr=0.21481426231543377
2023-12-17 02:32:03   INFO  epoch: 13/24, acc_iter=46471, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:03:40, time_cost(all): 17:07:35/13:59:24, loss=0.419155661631317, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=0.5519823994139853, lr=0.21443882229973532
2023-12-17 02:33:09   INFO  epoch: 13/24, acc_iter=46521, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:00:21, time_cost(all): 17:08:41/14:13:06, loss=0.418956804845268, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=3.287841215780749, lr=0.21406338228403693
2023-12-17 02:34:16   INFO  epoch: 13/24, acc_iter=46571, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/1:01:45, time_cost(all): 17:09:48/14:44:57, loss=0.418757948059219, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=4.209628277824236, lr=0.21368794226833854
2023-12-17 02:35:23   INFO  epoch: 13/24, acc_iter=46621, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:56:10, time_cost(all): 17:10:55/14:41:03, loss=0.41855909127317, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.15(1.03), norm=1.715696179874725, lr=0.21331250225264015
2023-12-17 02:36:29   INFO  epoch: 13/24, acc_iter=46671, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:56:33, time_cost(all): 17:12:01/13:47:20, loss=0.418360234487121, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=3.572312630144811, lr=0.21293706223694175
2023-12-17 02:37:36   INFO  epoch: 13/24, acc_iter=46721, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:54:06, time_cost(all): 17:13:08/14:33:20, loss=0.418161377701072, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=2.7202524096187126, lr=0.21256162222124336
2023-12-17 02:38:43   INFO  epoch: 13/24, acc_iter=46771, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:52:43, time_cost(all): 17:14:15/13:32:52, loss=0.417962520915023, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=4.480761966756774, lr=0.21218618220554497
2023-12-17 02:39:49   INFO  epoch: 13/24, acc_iter=46821, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:34, time_cost(all): 17:15:21/14:01:37, loss=0.417763664128974, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.4717370290432084, lr=0.21181074218984658
2023-12-17 02:40:56   INFO  epoch: 13/24, acc_iter=46871, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:53:48, time_cost(all): 17:16:28/14:18:53, loss=0.417564807342925, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=0.8146058810854773, lr=0.21143530217414813
2023-12-17 02:42:03   INFO  epoch: 13/24, acc_iter=46921, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:55, time_cost(all): 17:17:35/14:32:27, loss=0.417365950556876, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=1.4519683875619005, lr=0.21105986215844974
2023-12-17 02:43:09   INFO  epoch: 13/24, acc_iter=46971, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:52:32, time_cost(all): 17:18:41/14:16:50, loss=0.417167093770827, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=4.352936184405132, lr=0.21068442214275135
2023-12-17 02:44:16   INFO  epoch: 13/24, acc_iter=47021, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:51:28, time_cost(all): 17:19:48/14:17:24, loss=0.416968236984778, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=2.5999782742942816, lr=0.21030898212705296
2023-12-17 02:45:23   INFO  epoch: 13/24, acc_iter=47071, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:27, time_cost(all): 17:20:55/13:35:03, loss=0.416769380198729, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=2.073343061813338, lr=0.20993354211135457
2023-12-17 02:46:29   INFO  epoch: 13/24, acc_iter=47121, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:46:09, time_cost(all): 17:22:01/14:12:46, loss=0.41657052341268, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=2.9463813270009345, lr=0.20955810209565617
2023-12-17 02:47:36   INFO  epoch: 13/24, acc_iter=47171, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:46:06, time_cost(all): 17:23:08/13:22:59, loss=0.416371666626631, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.0(1.03), norm=1.8532911144993411, lr=0.20918266207995778
2023-12-17 02:48:42   INFO  epoch: 13/24, acc_iter=47221, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:46:48, time_cost(all): 17:24:14/13:35:42, loss=0.416172809840582, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=1.9846665225136748, lr=0.2088072220642594
2023-12-17 02:49:49   INFO  epoch: 13/24, acc_iter=47271, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:45:27, time_cost(all): 17:25:21/14:27:02, loss=0.415973953054533, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=3.993742716026067, lr=0.20843178204856094
2023-12-17 02:50:56   INFO  epoch: 13/24, acc_iter=47321, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:44:15, time_cost(all): 17:26:28/14:10:41, loss=0.415775096268484, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=4.26274285783083, lr=0.20805634203286255
2023-12-17 02:52:02   INFO  epoch: 13/24, acc_iter=47371, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:43:29, time_cost(all): 17:27:34/14:04:13, loss=0.415576239482435, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=1.093409850314051, lr=0.20768090201716416
2023-12-17 02:53:09   INFO  epoch: 13/24, acc_iter=47421, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:54, time_cost(all): 17:28:41/13:27:48, loss=0.415377382696386, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=1.1512942461498503, lr=0.20730546200146577
2023-12-17 02:54:16   INFO  epoch: 13/24, acc_iter=47471, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:37:52, time_cost(all): 17:29:48/14:11:52, loss=0.415178525910337, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=3.846214171461602, lr=0.20693002198576738
2023-12-17 02:55:22   INFO  epoch: 13/24, acc_iter=47521, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:36:46, time_cost(all): 17:30:54/14:15:21, loss=0.414979669124288, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=0.508941454534338, lr=0.20655458197006898
2023-12-17 02:56:29   INFO  epoch: 13/24, acc_iter=47571, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:37:42, time_cost(all): 17:32:01/13:07:31, loss=0.414780812338239, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=0.9138020354203418, lr=0.2061791419543706
2023-12-17 02:57:36   INFO  epoch: 13/24, acc_iter=47621, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:35:13, time_cost(all): 17:33:08/13:34:37, loss=0.41458195555219, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=4.3296516357525014, lr=0.2058037019386722
2023-12-17 02:58:42   INFO  epoch: 13/24, acc_iter=47671, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:36:03, time_cost(all): 17:34:14/14:01:03, loss=0.414383098766141, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=3.0743898327807657, lr=0.20542826192297375
2023-12-17 02:59:49   INFO  epoch: 13/24, acc_iter=47721, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:35:21, time_cost(all): 17:35:21/13:47:48, loss=0.414184241980092, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=0.9619398913375237, lr=0.20505282190727536
2023-12-17 03:00:56   INFO  epoch: 13/24, acc_iter=47771, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:00, time_cost(all): 17:36:28/13:59:13, loss=0.413985385194043, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=1.6532174819789263, lr=0.20467738189157697
2023-12-17 03:02:02   INFO  epoch: 13/24, acc_iter=47821, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:30:54, time_cost(all): 17:37:34/13:44:09, loss=0.413786528407994, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=2.9100516390477833, lr=0.20430194187587858
2023-12-17 03:03:09   INFO  epoch: 13/24, acc_iter=47871, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:30:14, time_cost(all): 17:38:41/14:05:45, loss=0.413587671621945, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=1.46774247312509, lr=0.2039265018601802
2023-12-17 03:04:16   INFO  epoch: 13/24, acc_iter=47921, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:38, time_cost(all): 17:39:48/13:47:55, loss=0.413388814835896, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=1.3437557689168926, lr=0.2035510618444818
2023-12-17 03:05:22   INFO  epoch: 13/24, acc_iter=47971, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:26:55, time_cost(all): 17:40:54/13:50:57, loss=0.413189958049847, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=3.259686871157827, lr=0.2031756218287834
2023-12-17 03:06:29   INFO  epoch: 13/24, acc_iter=48021, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:25:55, time_cost(all): 17:42:01/13:53:04, loss=0.412991101263798, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=3.711434434073205, lr=0.202800181813085
2023-12-17 03:07:36   INFO  epoch: 13/24, acc_iter=48071, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:44, time_cost(all): 17:43:08/13:48:17, loss=0.412792244477749, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=4.703628987833915, lr=0.20242474179738656
2023-12-17 03:08:42   INFO  epoch: 13/24, acc_iter=48121, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:23:39, time_cost(all): 17:44:14/13:09:38, loss=0.4125933876917, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=1.032238635388122, lr=0.20204930178168817
2023-12-17 03:09:49   INFO  epoch: 13/24, acc_iter=48171, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:16, time_cost(all): 17:45:21/13:49:09, loss=0.412394530905651, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=3.5283353982211985, lr=0.20167386176598978
2023-12-17 03:10:56   INFO  epoch: 13/24, acc_iter=48221, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:18, time_cost(all): 17:46:28/13:00:43, loss=0.412195674119602, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=3.7296074980377805, lr=0.2012984217502914
2023-12-17 03:12:02   INFO  epoch: 13/24, acc_iter=48271, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:21:06, time_cost(all): 17:47:34/13:27:41, loss=0.411996817333553, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.5773753677336382, lr=0.200922981734593
2023-12-17 03:13:09   INFO  epoch: 13/24, acc_iter=48321, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:55, time_cost(all): 17:48:41/13:12:18, loss=0.411797960547504, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=3.4094053135358635, lr=0.2005475417188946
2023-12-17 03:14:16   INFO  epoch: 13/24, acc_iter=48371, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:53, time_cost(all): 17:49:48/13:43:36, loss=0.411599103761455, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=2.0041760175343257, lr=0.20017210170319621
2023-12-17 03:15:22   INFO  epoch: 13/24, acc_iter=48421, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:42, time_cost(all): 17:50:54/13:19:27, loss=0.411400246975406, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=2.8570061413243364, lr=0.19979666168749782
2023-12-17 03:16:29   INFO  epoch: 13/24, acc_iter=48471, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:13, time_cost(all): 17:52:01/13:25:15, loss=0.411201390189357, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=2.8279031081553585, lr=0.19942122167179938
2023-12-17 03:17:35   INFO  epoch: 13/24, acc_iter=48521, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:15, time_cost(all): 17:53:07/13:39:34, loss=0.411002533403308, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=3.6059059233645057, lr=0.19904578165610098
2023-12-17 03:18:42   INFO  epoch: 13/24, acc_iter=48571, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:34, time_cost(all): 17:54:14/12:59:57, loss=0.410803676617259, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=3.4264097318866917, lr=0.1986703416404026
2023-12-17 03:19:49   INFO  epoch: 13/24, acc_iter=48621, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:47, time_cost(all): 17:55:21/13:40:01, loss=0.41060481983121, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=1.6631116518390254, lr=0.1982949016247042
2023-12-17 03:20:55   INFO  epoch: 13/24, acc_iter=48671, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:30, time_cost(all): 17:56:27/12:41:37, loss=0.410405963045161, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=3.76713214269201, lr=0.1979194616090058
2023-12-17 03:22:02   INFO  epoch: 13/24, acc_iter=48721, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:32, time_cost(all): 17:57:34/13:57:22, loss=0.410207106259112, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=1.5251422604346703, lr=0.19754402159330742
2023-12-17 03:23:09   INFO  epoch: 13/24, acc_iter=48771, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:41, time_cost(all): 17:58:41/13:55:16, loss=0.410008249473063, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=2.5259450640063164, lr=0.19716858157760903
2023-12-17 03:24:15   INFO  epoch: 13/24, acc_iter=48821, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:36, time_cost(all): 17:59:47/13:40:18, loss=0.409809392687014, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=3.6850031337893894, lr=0.19679314156191063
2023-12-17 03:25:22   INFO  epoch: 13/24, acc_iter=48871, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:07:59, time_cost(all): 18:00:54/13:19:04, loss=0.409610535900965, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=0.5048377655184275, lr=0.1964177015462122
2023-12-17 03:26:29   INFO  epoch: 13/24, acc_iter=48921, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:42, time_cost(all): 18:02:01/13:16:45, loss=0.409411679114916, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=0.7319625953617723, lr=0.1960422615305138
2023-12-17 03:27:35   INFO  epoch: 13/24, acc_iter=48971, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:11, time_cost(all): 18:03:07/12:33:59, loss=0.409212822328867, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=2.9549111158735, lr=0.1956668215148154
2023-12-17 03:28:42   INFO  epoch: 13/24, acc_iter=49021, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:54, time_cost(all): 18:04:14/12:58:13, loss=0.409013965542818, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=3.200468888083737, lr=0.195291381499117
2023-12-17 03:29:49   INFO  epoch: 13/24, acc_iter=49071, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:43, time_cost(all): 18:05:21/13:18:36, loss=0.408815108756769, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=2.3870293697383373, lr=0.19491594148341862
2023-12-17 03:30:55   INFO  epoch: 13/24, acc_iter=49121, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:43, time_cost(all): 18:06:27/12:31:20, loss=0.40861625197072, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=4.713120676903699, lr=0.19454050146772023
2023-12-17 03:32:02   INFO  epoch: 13/24, acc_iter=49171, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:27, time_cost(all): 18:07:34/13:43:18, loss=0.408417395184671, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=4.279304075960885, lr=0.19416506145202184
2023-12-17 03:33:09   INFO  epoch: 13/24, acc_iter=49221, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 18:08:41/13:43:28, loss=0.408218538398622, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=1.4308870199199974, lr=0.19378962143632344
2023-12-17 03:34:15   INFO  epoch: 14/24, acc_iter=49288, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:17:22, time_cost(all): 18:09:47/13:00:46, loss=0.407952070305316, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.05(1.03), norm=1.324731498903248, lr=0.19328653181528754
2023-12-17 03:35:22   INFO  epoch: 14/24, acc_iter=49338, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:13:37, time_cost(all): 18:10:54/12:32:37, loss=0.407753213519267, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=1.841169989391052, lr=0.19291109179958915
2023-12-17 03:36:29   INFO  epoch: 14/24, acc_iter=49388, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:13:12, time_cost(all): 18:12:01/13:06:55, loss=0.407554356733218, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=2.4942323128949506, lr=0.19253565178389076
2023-12-17 03:37:35   INFO  epoch: 14/24, acc_iter=49438, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:11:28, time_cost(all): 18:13:07/13:39:41, loss=0.407355499947169, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=3.393510624400255, lr=0.19216021176819237
2023-12-17 03:38:42   INFO  epoch: 14/24, acc_iter=49488, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:11:12, time_cost(all): 18:14:14/12:46:59, loss=0.40715664316112, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=4.158452376735102, lr=0.19178477175249398
2023-12-17 03:39:49   INFO  epoch: 14/24, acc_iter=49538, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:14:06, time_cost(all): 18:15:21/12:49:15, loss=0.406957786375071, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=4.356777183858086, lr=0.19140933173679558
2023-12-17 03:40:55   INFO  epoch: 14/24, acc_iter=49588, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:09:22, time_cost(all): 18:16:27/13:02:53, loss=0.406758929589022, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=3.661609736389091, lr=0.1910338917210972
2023-12-17 03:42:02   INFO  epoch: 14/24, acc_iter=49638, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:11:32, time_cost(all): 18:17:34/12:36:19, loss=0.406560072802973, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=1.1222799147192934, lr=0.1906584517053988
2023-12-17 03:43:09   INFO  epoch: 14/24, acc_iter=49688, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:08:05, time_cost(all): 18:18:41/12:26:43, loss=0.406361216016924, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=2.2595598270127457, lr=0.19028301168970035
2023-12-17 03:44:15   INFO  epoch: 14/24, acc_iter=49738, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:05:30, time_cost(all): 18:19:47/12:23:16, loss=0.406162359230875, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=3.8889780231217808, lr=0.18990757167400196
2023-12-17 03:45:22   INFO  epoch: 14/24, acc_iter=49788, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:08:10, time_cost(all): 18:20:54/12:19:13, loss=0.405963502444826, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.049207839896859, lr=0.18953213165830357
2023-12-17 03:46:28   INFO  epoch: 14/24, acc_iter=49838, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:04:51, time_cost(all): 18:22:00/13:22:48, loss=0.405764645658777, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=0.775683964126435, lr=0.18915669164260518
2023-12-17 03:47:35   INFO  epoch: 14/24, acc_iter=49888, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:01:03, time_cost(all): 18:23:07/12:14:52, loss=0.405565788872728, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.600819132436004, lr=0.1887812516269068
2023-12-17 03:48:42   INFO  epoch: 14/24, acc_iter=49938, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:04:45, time_cost(all): 18:24:14/12:45:38, loss=0.405366932086679, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=0.5338539416419732, lr=0.1884058116112084
2023-12-17 03:49:48   INFO  epoch: 14/24, acc_iter=49988, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:01:55, time_cost(all): 18:25:20/13:08:32, loss=0.40516807530063, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=2.2284399157242474, lr=0.18803037159551
2023-12-17 03:50:55   INFO  epoch: 14/24, acc_iter=50038, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:59:29, time_cost(all): 18:26:27/12:23:24, loss=0.404969218514581, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=2.2890652839688914, lr=0.1876549315798116
2023-12-17 03:52:02   INFO  epoch: 14/24, acc_iter=50088, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:58:07, time_cost(all): 18:27:34/13:21:06, loss=0.404770361728532, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=3.5751086340938167, lr=0.18727949156411322
2023-12-17 03:53:08   INFO  epoch: 14/24, acc_iter=50138, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:58:25, time_cost(all): 18:28:40/12:59:35, loss=0.404571504942483, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=3.547613662482998, lr=0.18690405154841477
2023-12-17 03:54:15   INFO  epoch: 14/24, acc_iter=50188, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:59:25, time_cost(all): 18:29:47/13:23:02, loss=0.404372648156434, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=4.554498322632001, lr=0.18652861153271638
2023-12-17 03:55:22   INFO  epoch: 14/24, acc_iter=50238, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:58:26, time_cost(all): 18:30:54/12:33:00, loss=0.404173791370385, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=3.5373317308836456, lr=0.186153171517018
2023-12-17 03:56:28   INFO  epoch: 14/24, acc_iter=50288, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:57:14, time_cost(all): 18:32:00/12:15:00, loss=0.403974934584336, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=0.7911286872429542, lr=0.1857777315013196
2023-12-17 03:57:35   INFO  epoch: 14/24, acc_iter=50338, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:54:35, time_cost(all): 18:33:07/13:06:58, loss=0.403776077798287, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=2.061077134488949, lr=0.1854022914856212
2023-12-17 03:58:42   INFO  epoch: 14/24, acc_iter=50388, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:50:50, time_cost(all): 18:34:14/12:32:43, loss=0.403577221012238, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=0.59218350821233, lr=0.18502685146992282
2023-12-17 03:59:48   INFO  epoch: 14/24, acc_iter=50438, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:52:36, time_cost(all): 18:35:20/12:41:56, loss=0.403378364226189, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=1.4776258733933634, lr=0.18465141145422442
2023-12-17 04:00:55   INFO  epoch: 14/24, acc_iter=50488, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:49:25, time_cost(all): 18:36:27/12:12:59, loss=0.40317950744014, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.024586115939658, lr=0.18427597143852603
2023-12-17 04:02:02   INFO  epoch: 14/24, acc_iter=50538, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:50:21, time_cost(all): 18:37:34/12:26:09, loss=0.402980650654091, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=3.7219253144170112, lr=0.18390053142282758
2023-12-17 04:03:08   INFO  epoch: 14/24, acc_iter=50588, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:46:00, time_cost(all): 18:38:40/12:16:39, loss=0.402781793868042, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=1.0017564274189477, lr=0.1835250914071292
2023-12-17 04:04:15   INFO  epoch: 14/24, acc_iter=50638, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:46:13, time_cost(all): 18:39:47/13:05:22, loss=0.402582937081993, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=2.561372405570355, lr=0.1831496513914308
2023-12-17 04:05:22   INFO  epoch: 14/24, acc_iter=50688, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:43:43, time_cost(all): 18:40:54/12:42:34, loss=0.402384080295944, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=2.366578413097982, lr=0.1827742113757324
2023-12-17 04:06:28   INFO  epoch: 14/24, acc_iter=50738, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:43:19, time_cost(all): 18:42:00/12:35:32, loss=0.402185223509895, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=1.3329022162212874, lr=0.18239877136003402
2023-12-17 04:07:35   INFO  epoch: 14/24, acc_iter=50788, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:43:26, time_cost(all): 18:43:07/11:58:29, loss=0.401986366723846, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=2.6000578599027815, lr=0.18202333134433563
2023-12-17 04:08:42   INFO  epoch: 14/24, acc_iter=50838, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:44:28, time_cost(all): 18:44:14/12:40:20, loss=0.401787509937797, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=3.6646033945304337, lr=0.18164789132863723
2023-12-17 04:09:48   INFO  epoch: 14/24, acc_iter=50888, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:42:41, time_cost(all): 18:45:20/12:09:04, loss=0.401588653151748, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=3.972350767800412, lr=0.18127245131293884
2023-12-17 04:10:55   INFO  epoch: 14/24, acc_iter=50938, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:42:17, time_cost(all): 18:46:27/12:13:33, loss=0.401389796365699, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=4.587148621334796, lr=0.1808970112972404
2023-12-17 04:12:02   INFO  epoch: 14/24, acc_iter=50988, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:37:53, time_cost(all): 18:47:34/12:37:03, loss=0.40119093957965, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.4573360700661695, lr=0.180521571281542
2023-12-17 04:13:08   INFO  epoch: 14/24, acc_iter=51038, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:39:14, time_cost(all): 18:48:40/12:33:58, loss=0.400992082793601, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=3.8752357846757692, lr=0.1801461312658436
2023-12-17 04:14:15   INFO  epoch: 14/24, acc_iter=51088, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:35:35, time_cost(all): 18:49:47/12:33:02, loss=0.400793226007552, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=2.2555059527694103, lr=0.17977069125014522
2023-12-17 04:15:22   INFO  epoch: 14/24, acc_iter=51138, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:37:08, time_cost(all): 18:50:54/12:50:56, loss=0.400594369221503, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=1.4325946953196604, lr=0.17939525123444683
2023-12-17 04:16:28   INFO  epoch: 14/24, acc_iter=51188, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:34:04, time_cost(all): 18:52:00/11:54:21, loss=0.400395512435454, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=2.396896042038125, lr=0.17901981121874844
2023-12-17 04:17:35   INFO  epoch: 14/24, acc_iter=51238, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:25, time_cost(all): 18:53:07/12:01:38, loss=0.400196655649405, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=1.4872148501662712, lr=0.17864437120305005
2023-12-17 04:18:41   INFO  epoch: 14/24, acc_iter=51288, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:03, time_cost(all): 18:54:13/11:59:00, loss=0.399997798863356, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=1.6436197031536721, lr=0.17826893118735165
2023-12-17 04:19:48   INFO  epoch: 14/24, acc_iter=51338, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:32:22, time_cost(all): 18:55:20/12:02:05, loss=0.399798942077307, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=1.7745245852891562, lr=0.1778934911716532
2023-12-17 04:20:55   INFO  epoch: 14/24, acc_iter=51388, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:31:10, time_cost(all): 18:56:27/12:11:30, loss=0.399600085291258, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=2.581412766362263, lr=0.17751805115595481
2023-12-17 04:22:01   INFO  epoch: 14/24, acc_iter=51438, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:37, time_cost(all): 18:57:33/12:50:03, loss=0.399401228505209, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=4.186875678763519, lr=0.17714261114025642
2023-12-17 04:23:08   INFO  epoch: 14/24, acc_iter=51488, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:29:28, time_cost(all): 18:58:40/11:53:40, loss=0.39920237171916, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=4.2395919370490915, lr=0.17676717112455803
2023-12-17 04:24:15   INFO  epoch: 14/24, acc_iter=51538, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:08, time_cost(all): 18:59:47/11:39:29, loss=0.399003514933111, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=3.607998488199417, lr=0.17639173110885964
2023-12-17 04:25:21   INFO  epoch: 14/24, acc_iter=51588, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:27:08, time_cost(all): 19:00:53/11:58:24, loss=0.398804658147062, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=3.4040164176001992, lr=0.17601629109316125
2023-12-17 04:26:28   INFO  epoch: 14/24, acc_iter=51638, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:24:37, time_cost(all): 19:02:00/11:50:25, loss=0.398605801361013, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=4.1329735596603285, lr=0.17564085107746286
2023-12-17 04:27:35   INFO  epoch: 14/24, acc_iter=51688, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:18, time_cost(all): 19:03:07/11:52:07, loss=0.398406944574964, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=3.485260122322224, lr=0.17526541106176446
2023-12-17 04:28:41   INFO  epoch: 14/24, acc_iter=51738, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:08, time_cost(all): 19:04:13/12:18:44, loss=0.398208087788915, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=4.5388479710347225, lr=0.17488997104606602
2023-12-17 04:29:48   INFO  epoch: 14/24, acc_iter=51788, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:22:33, time_cost(all): 19:05:20/11:53:57, loss=0.398009231002866, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=3.2408578760466806, lr=0.17451453103036763
2023-12-17 04:30:55   INFO  epoch: 14/24, acc_iter=51838, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:33, time_cost(all): 19:06:27/11:59:23, loss=0.397810374216817, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=2.1040100900992917, lr=0.17413909101466923
2023-12-17 04:32:01   INFO  epoch: 14/24, acc_iter=51888, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:29, time_cost(all): 19:07:33/11:40:32, loss=0.397611517430768, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=2.667076218853012, lr=0.17376365099897084
2023-12-17 04:33:08   INFO  epoch: 14/24, acc_iter=51938, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:54, time_cost(all): 19:08:40/11:40:10, loss=0.397412660644719, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=4.878456610700463, lr=0.17338821098327245
2023-12-17 04:34:15   INFO  epoch: 14/24, acc_iter=51988, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:55, time_cost(all): 19:09:47/12:19:42, loss=0.39721380385867, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=0.9721903486010259, lr=0.17301277096757406
2023-12-17 04:35:21   INFO  epoch: 14/24, acc_iter=52038, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:34, time_cost(all): 19:10:53/12:09:40, loss=0.397014947072621, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=2.930054031025334, lr=0.17263733095187567
2023-12-17 04:36:28   INFO  epoch: 14/24, acc_iter=52088, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:40, time_cost(all): 19:12:00/12:31:17, loss=0.396816090286572, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=0.9422165681100556, lr=0.17226189093617728
2023-12-17 04:37:35   INFO  epoch: 14/24, acc_iter=52138, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:51, time_cost(all): 19:13:07/12:05:56, loss=0.396617233500523, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=3.1076955356934843, lr=0.17188645092047883
2023-12-17 04:38:41   INFO  epoch: 14/24, acc_iter=52188, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:04, time_cost(all): 19:14:13/12:28:04, loss=0.396418376714474, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=1.6675734579328936, lr=0.17151101090478044
2023-12-17 04:39:48   INFO  epoch: 14/24, acc_iter=52238, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:37, time_cost(all): 19:15:20/11:55:19, loss=0.396219519928425, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=0.5936113835350358, lr=0.17113557088908204
2023-12-17 04:40:55   INFO  epoch: 14/24, acc_iter=52288, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:12, time_cost(all): 19:16:27/12:29:13, loss=0.396020663142376, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=2.0311125892637425, lr=0.17076013087338365
2023-12-17 04:42:01   INFO  epoch: 14/24, acc_iter=52338, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:05, time_cost(all): 19:17:33/12:24:01, loss=0.395821806356327, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=1.432646680136331, lr=0.17038469085768526
2023-12-17 04:43:08   INFO  epoch: 14/24, acc_iter=52388, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:23, time_cost(all): 19:18:40/12:32:18, loss=0.395622949570278, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=2.098656345066497, lr=0.17000925084198687
2023-12-17 04:44:15   INFO  epoch: 14/24, acc_iter=52438, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:16, time_cost(all): 19:19:47/12:17:22, loss=0.395424092784229, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=2.7230810746670437, lr=0.16963381082628848
2023-12-17 04:45:21   INFO  epoch: 14/24, acc_iter=52488, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:09, time_cost(all): 19:20:53/11:57:36, loss=0.39522523599818, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.058178022371526, lr=0.1692583708105901
2023-12-17 04:46:28   INFO  epoch: 14/24, acc_iter=52538, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:45, time_cost(all): 19:22:00/12:13:17, loss=0.395026379212131, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=4.0919624778196635, lr=0.16888293079489164
2023-12-17 04:47:34   INFO  epoch: 14/24, acc_iter=52588, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:45, time_cost(all): 19:23:06/11:44:51, loss=0.394827522426082, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=3.1257165769455155, lr=0.16850749077919325
2023-12-17 04:48:41   INFO  epoch: 14/24, acc_iter=52638, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:31, time_cost(all): 19:24:13/12:20:35, loss=0.394628665640033, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=3.8172019900432153, lr=0.16813205076349486
2023-12-17 04:49:48   INFO  epoch: 14/24, acc_iter=52688, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:32, time_cost(all): 19:25:20/12:17:34, loss=0.394429808853984, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=0.8446829390549537, lr=0.16775661074779646
2023-12-17 04:50:54   INFO  epoch: 14/24, acc_iter=52738, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:21, time_cost(all): 19:26:26/11:46:28, loss=0.394230952067935, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=2.3925435538750297, lr=0.16738117073209807
2023-12-17 04:52:01   INFO  epoch: 15/24, acc_iter=52805, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:13:23, time_cost(all): 19:27:33/12:11:11, loss=0.39396448397463, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=3.2678581579578156, lr=0.16687808111106223
2023-12-17 04:53:08   INFO  epoch: 15/24, acc_iter=52855, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:12:50, time_cost(all): 19:28:40/11:29:34, loss=0.393765627188581, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=4.676463203611892, lr=0.16650264109536383
2023-12-17 04:54:14   INFO  epoch: 15/24, acc_iter=52905, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:14:28, time_cost(all): 19:29:46/11:31:11, loss=0.393566770402532, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.2169079634658972, lr=0.16612720107966544
2023-12-17 04:55:21   INFO  epoch: 15/24, acc_iter=52955, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:17:06, time_cost(all): 19:30:53/12:02:17, loss=0.393367913616483, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=3.6059717618050895, lr=0.165751761063967
2023-12-17 04:56:28   INFO  epoch: 15/24, acc_iter=53005, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:12:38, time_cost(all): 19:32:00/11:54:52, loss=0.393169056830434, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=1.912428334883352, lr=0.1653763210482686
2023-12-17 04:57:34   INFO  epoch: 15/24, acc_iter=53055, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:12:17, time_cost(all): 19:33:06/12:03:22, loss=0.392970200044385, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=3.755513208804404, lr=0.1650008810325702
2023-12-17 04:58:41   INFO  epoch: 15/24, acc_iter=53105, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:16, time_cost(all): 19:34:13/12:15:20, loss=0.392771343258336, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=4.318081826011651, lr=0.16462544101687182
2023-12-17 04:59:48   INFO  epoch: 15/24, acc_iter=53155, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:08:58, time_cost(all): 19:35:20/11:52:58, loss=0.392572486472287, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=2.5215728751326494, lr=0.16425000100117343
2023-12-17 05:00:54   INFO  epoch: 15/24, acc_iter=53205, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:10:50, time_cost(all): 19:36:26/11:09:01, loss=0.392373629686238, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=2.099499496677617, lr=0.16387456098547504
2023-12-17 05:02:01   INFO  epoch: 15/24, acc_iter=53255, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:07:32, time_cost(all): 19:37:33/12:11:41, loss=0.392174772900189, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=0.9759787464671876, lr=0.16349912096977665
2023-12-17 05:03:08   INFO  epoch: 15/24, acc_iter=53305, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:07:04, time_cost(all): 19:38:40/11:09:10, loss=0.39197591611414, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=3.1033501152566796, lr=0.16312368095407825
2023-12-17 05:04:14   INFO  epoch: 15/24, acc_iter=53355, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:05:25, time_cost(all): 19:39:46/11:50:09, loss=0.391777059328091, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=4.11413664094282, lr=0.1627482409383798
2023-12-17 05:05:21   INFO  epoch: 15/24, acc_iter=53405, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:01:52, time_cost(all): 19:40:53/11:51:02, loss=0.391578202542042, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.428353200987992, lr=0.16237280092268142
2023-12-17 05:06:28   INFO  epoch: 15/24, acc_iter=53455, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:03:35, time_cost(all): 19:42:00/11:59:12, loss=0.391379345755993, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=2.6496463335902707, lr=0.16199736090698302
2023-12-17 05:07:34   INFO  epoch: 15/24, acc_iter=53505, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:59:03, time_cost(all): 19:43:06/12:01:13, loss=0.391180488969944, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=4.862714951735604, lr=0.16162192089128463
2023-12-17 05:08:41   INFO  epoch: 15/24, acc_iter=53555, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:59:47, time_cost(all): 19:44:13/11:05:33, loss=0.390981632183895, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=4.8239138848293175, lr=0.16124648087558624
2023-12-17 05:09:48   INFO  epoch: 15/24, acc_iter=53605, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/1:01:19, time_cost(all): 19:45:20/11:55:14, loss=0.390782775397846, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=2.681570283539088, lr=0.16087104085988785
2023-12-17 05:10:54   INFO  epoch: 15/24, acc_iter=53655, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:58:59, time_cost(all): 19:46:26/11:27:06, loss=0.390583918611797, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=1.909106476304385, lr=0.16049560084418946
2023-12-17 05:12:01   INFO  epoch: 15/24, acc_iter=53705, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:58:18, time_cost(all): 19:47:33/11:15:38, loss=0.390385061825748, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=4.108550660408264, lr=0.16012016082849106
2023-12-17 05:13:08   INFO  epoch: 15/24, acc_iter=53755, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:53:46, time_cost(all): 19:48:40/11:38:38, loss=0.390186205039699, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=3.5077654536467056, lr=0.15974472081279262
2023-12-17 05:14:14   INFO  epoch: 15/24, acc_iter=53805, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:52:28, time_cost(all): 19:49:46/10:59:00, loss=0.38998734825365, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=4.296458800184156, lr=0.15936928079709423
2023-12-17 05:15:21   INFO  epoch: 15/24, acc_iter=53855, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:55:03, time_cost(all): 19:50:53/11:57:46, loss=0.389788491467601, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=3.773365692875652, lr=0.15899384078139583
2023-12-17 05:16:27   INFO  epoch: 15/24, acc_iter=53905, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:51:24, time_cost(all): 19:51:59/11:51:09, loss=0.389589634681552, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=4.9835346322872836, lr=0.15861840076569744
2023-12-17 05:17:34   INFO  epoch: 15/24, acc_iter=53955, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:50:20, time_cost(all): 19:53:06/11:22:27, loss=0.389390777895503, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=1.908165493099928, lr=0.15824296074999905
2023-12-17 05:18:41   INFO  epoch: 15/24, acc_iter=54005, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:52:18, time_cost(all): 19:54:13/11:31:12, loss=0.389191921109454, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=2.4137014780045885, lr=0.15786752073430066
2023-12-17 05:19:47   INFO  epoch: 15/24, acc_iter=54055, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:46:55, time_cost(all): 19:55:19/11:45:03, loss=0.388993064323405, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=4.332524536658692, lr=0.15749208071860227
2023-12-17 05:20:54   INFO  epoch: 15/24, acc_iter=54105, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:49:09, time_cost(all): 19:56:26/11:44:01, loss=0.388794207537356, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=3.785624162945105, lr=0.15711664070290388
2023-12-17 05:22:01   INFO  epoch: 15/24, acc_iter=54155, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:45:18, time_cost(all): 19:57:33/10:49:57, loss=0.388595350751307, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=4.592701217411956, lr=0.15674120068720543
2023-12-17 05:23:07   INFO  epoch: 15/24, acc_iter=54205, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:45:07, time_cost(all): 19:58:39/11:36:17, loss=0.388396493965258, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=0.9208636894037738, lr=0.15636576067150704
2023-12-17 05:24:14   INFO  epoch: 15/24, acc_iter=54255, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:46:55, time_cost(all): 19:59:46/11:35:07, loss=0.388197637179209, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=4.047938568526469, lr=0.15599032065580865
2023-12-17 05:25:21   INFO  epoch: 15/24, acc_iter=54305, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:41:56, time_cost(all): 20:00:53/10:56:00, loss=0.38799878039316, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=3.7246045344361085, lr=0.15561488064011025
2023-12-17 05:26:27   INFO  epoch: 15/24, acc_iter=54355, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:41:52, time_cost(all): 20:01:59/10:56:41, loss=0.387799923607111, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=1.5424165644949002, lr=0.15523944062441192
2023-12-17 05:27:34   INFO  epoch: 15/24, acc_iter=54405, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:37, time_cost(all): 20:03:06/11:05:35, loss=0.387601066821062, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=3.628241940266559, lr=0.15486400060871347
2023-12-17 05:28:41   INFO  epoch: 15/24, acc_iter=54455, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:55, time_cost(all): 20:04:13/11:19:54, loss=0.387402210035013, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=4.669533039219073, lr=0.15448856059301508
2023-12-17 05:29:47   INFO  epoch: 15/24, acc_iter=54505, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:38:33, time_cost(all): 20:05:19/10:36:47, loss=0.387203353248964, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.4344887341688994, lr=0.1541131205773167
2023-12-17 05:30:54   INFO  epoch: 15/24, acc_iter=54555, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:39:43, time_cost(all): 20:06:26/10:45:58, loss=0.387004496462915, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=1.0847497680824847, lr=0.1537376805616183
2023-12-17 05:32:01   INFO  epoch: 15/24, acc_iter=54605, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:38:20, time_cost(all): 20:07:33/11:37:49, loss=0.386805639676866, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=3.604494142844243, lr=0.15336224054591985
2023-12-17 05:33:07   INFO  epoch: 15/24, acc_iter=54655, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:36:46, time_cost(all): 20:08:39/11:24:47, loss=0.386606782890817, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=4.98875101965674, lr=0.15298680053022146
2023-12-17 05:34:14   INFO  epoch: 15/24, acc_iter=54705, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:35:36, time_cost(all): 20:09:46/10:57:20, loss=0.386407926104768, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=4.780864367263735, lr=0.15261136051452306
2023-12-17 05:35:21   INFO  epoch: 15/24, acc_iter=54755, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:25, time_cost(all): 20:10:53/11:35:57, loss=0.386209069318719, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=3.0980080995075334, lr=0.15223592049882473
2023-12-17 05:36:27   INFO  epoch: 15/24, acc_iter=54805, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:32:01, time_cost(all): 20:11:59/10:44:23, loss=0.38601021253267, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=0.5415111478169826, lr=0.15186048048312628
2023-12-17 05:37:34   INFO  epoch: 15/24, acc_iter=54855, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:51, time_cost(all): 20:13:06/10:56:22, loss=0.385811355746621, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=4.278811410606783, lr=0.1514850404674279
2023-12-17 05:38:41   INFO  epoch: 15/24, acc_iter=54905, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:21, time_cost(all): 20:14:13/10:38:06, loss=0.385612498960572, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=2.9068367098021315, lr=0.1511096004517295
2023-12-17 05:39:47   INFO  epoch: 15/24, acc_iter=54955, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:02, time_cost(all): 20:15:19/10:39:36, loss=0.385413642174523, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=4.648336040748464, lr=0.1507341604360311
2023-12-17 05:40:54   INFO  epoch: 15/24, acc_iter=55005, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:29:10, time_cost(all): 20:16:26/11:20:28, loss=0.385214785388474, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=3.0802432663322867, lr=0.15035872042033266
2023-12-17 05:42:01   INFO  epoch: 15/24, acc_iter=55055, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:28:11, time_cost(all): 20:17:33/11:02:41, loss=0.385015928602425, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=4.637230931955132, lr=0.14998328040463427
2023-12-17 05:43:07   INFO  epoch: 15/24, acc_iter=55105, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:24:58, time_cost(all): 20:18:39/10:57:58, loss=0.384817071816376, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=4.609403611329812, lr=0.14960784038893588
2023-12-17 05:44:14   INFO  epoch: 15/24, acc_iter=55155, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:24:53, time_cost(all): 20:19:46/10:53:15, loss=0.384618215030327, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=2.0334758351024544, lr=0.14923240037323754
2023-12-17 05:45:20   INFO  epoch: 15/24, acc_iter=55205, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:26, time_cost(all): 20:20:52/10:47:41, loss=0.384419358244278, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=3.3436178055677317, lr=0.1488569603575391
2023-12-17 05:46:27   INFO  epoch: 15/24, acc_iter=55255, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:53, time_cost(all): 20:21:59/10:24:06, loss=0.384220501458229, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=1.8818302953550532, lr=0.1484815203418407
2023-12-17 05:47:34   INFO  epoch: 15/24, acc_iter=55305, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:25, time_cost(all): 20:23:06/10:53:03, loss=0.38402164467218, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=1.2542063862296284, lr=0.1481060803261423
2023-12-17 05:48:40   INFO  epoch: 15/24, acc_iter=55355, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:41, time_cost(all): 20:24:12/10:43:45, loss=0.383822787886131, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=2.213291552115489, lr=0.14773064031044392
2023-12-17 05:49:47   INFO  epoch: 15/24, acc_iter=55405, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:18, time_cost(all): 20:25:19/11:05:50, loss=0.383623931100082, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=3.63444125601073, lr=0.14735520029474547
2023-12-17 05:50:54   INFO  epoch: 15/24, acc_iter=55455, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:56, time_cost(all): 20:26:26/10:20:38, loss=0.383425074314033, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=4.241710576487214, lr=0.14697976027904708
2023-12-17 05:52:00   INFO  epoch: 15/24, acc_iter=55505, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:47, time_cost(all): 20:27:32/11:08:41, loss=0.383226217527984, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=2.2225292158613392, lr=0.1466043202633487
2023-12-17 05:53:07   INFO  epoch: 15/24, acc_iter=55555, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:50, time_cost(all): 20:28:39/10:18:57, loss=0.383027360741935, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=2.610545202469662, lr=0.14622888024765035
2023-12-17 05:54:14   INFO  epoch: 15/24, acc_iter=55605, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:07, time_cost(all): 20:29:46/10:26:49, loss=0.382828503955886, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=3.899677424669306, lr=0.1458534402319519
2023-12-17 05:55:20   INFO  epoch: 15/24, acc_iter=55655, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:04, time_cost(all): 20:30:52/11:13:19, loss=0.382629647169837, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=1.3621997140689945, lr=0.1454780002162535
2023-12-17 05:56:27   INFO  epoch: 15/24, acc_iter=55705, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:28, time_cost(all): 20:31:59/10:49:13, loss=0.382430790383788, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.629374906549935, lr=0.14510256020055512
2023-12-17 05:57:34   INFO  epoch: 15/24, acc_iter=55755, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:49, time_cost(all): 20:33:06/11:04:59, loss=0.382231933597739, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=4.183954853298337, lr=0.14472712018485673
2023-12-17 05:58:40   INFO  epoch: 15/24, acc_iter=55805, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:09:59, time_cost(all): 20:34:12/10:57:16, loss=0.38203307681169, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=1.6294595601355268, lr=0.14435168016915828
2023-12-17 05:59:47   INFO  epoch: 15/24, acc_iter=55855, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:30, time_cost(all): 20:35:19/10:24:04, loss=0.381834220025641, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=4.089238658502072, lr=0.1439762401534599
2023-12-17 06:00:54   INFO  epoch: 15/24, acc_iter=55905, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:29, time_cost(all): 20:36:26/11:05:29, loss=0.381635363239592, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=0.9943596721650105, lr=0.1436008001377615
2023-12-17 06:02:00   INFO  epoch: 15/24, acc_iter=55955, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:45, time_cost(all): 20:37:32/11:06:25, loss=0.381436506453543, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=3.2775288435199954, lr=0.14322536012206316
2023-12-17 06:03:07   INFO  epoch: 15/24, acc_iter=56005, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:57, time_cost(all): 20:38:39/10:40:29, loss=0.381237649667494, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=3.139205963802164, lr=0.14284992010636471
2023-12-17 06:04:14   INFO  epoch: 15/24, acc_iter=56055, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:57, time_cost(all): 20:39:46/10:56:29, loss=0.381038792881445, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=0.9149043294987003, lr=0.14247448009066632
2023-12-17 06:05:20   INFO  epoch: 15/24, acc_iter=56105, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:38, time_cost(all): 20:40:52/10:56:24, loss=0.380839936095396, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=1.2267202488418338, lr=0.14209904007496793
2023-12-17 06:06:27   INFO  epoch: 15/24, acc_iter=56155, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:41, time_cost(all): 20:41:59/10:50:32, loss=0.380641079309347, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=1.3048372590108008, lr=0.14172360005926954
2023-12-17 06:07:34   INFO  epoch: 15/24, acc_iter=56205, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:28, time_cost(all): 20:43:06/10:58:46, loss=0.380442222523298, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.5959984901654773, lr=0.1413481600435711
2023-12-17 06:08:40   INFO  epoch: 15/24, acc_iter=56255, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 20:44:12/10:31:36, loss=0.380243365737249, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=1.3060741355037115, lr=0.1409727200278727
2023-12-17 06:09:47   INFO  epoch: 16/24, acc_iter=56322, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:16:35, time_cost(all): 20:45:19/10:05:39, loss=0.379976897643943, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=0.8815773588876086, lr=0.14046963040683685
2023-12-17 06:10:54   INFO  epoch: 16/24, acc_iter=56372, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:14:03, time_cost(all): 20:46:26/10:19:13, loss=0.379778040857894, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=1.695871379689106, lr=0.14009419039113852
2023-12-17 06:12:00   INFO  epoch: 16/24, acc_iter=56422, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:15:45, time_cost(all): 20:47:32/10:23:44, loss=0.379579184071845, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=2.580433317629149, lr=0.13971875037544007
2023-12-17 06:13:07   INFO  epoch: 16/24, acc_iter=56472, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:15:57, time_cost(all): 20:48:39/10:07:16, loss=0.379380327285796, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=4.607145935682017, lr=0.13934331035974168
2023-12-17 06:14:14   INFO  epoch: 16/24, acc_iter=56522, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:09:08, time_cost(all): 20:49:46/10:25:54, loss=0.379181470499747, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=2.6980498735652296, lr=0.1389678703440433
2023-12-17 06:15:20   INFO  epoch: 16/24, acc_iter=56572, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:11:04, time_cost(all): 20:50:52/10:35:33, loss=0.378982613713698, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=3.471225011666925, lr=0.1385924303283449
2023-12-17 06:16:27   INFO  epoch: 16/24, acc_iter=56622, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:09:23, time_cost(all): 20:51:59/9:55:09, loss=0.378783756927649, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=1.939040664537481, lr=0.13821699031264645
2023-12-17 06:17:33   INFO  epoch: 16/24, acc_iter=56672, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:06:59, time_cost(all): 20:53:05/10:08:22, loss=0.3785849001416, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=1.2678983497076084, lr=0.13784155029694806
2023-12-17 06:18:40   INFO  epoch: 16/24, acc_iter=56722, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:11:12, time_cost(all): 20:54:12/10:38:59, loss=0.378386043355551, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=3.7687643589960067, lr=0.13746611028124966
2023-12-17 06:19:47   INFO  epoch: 16/24, acc_iter=56772, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:08:15, time_cost(all): 20:55:19/10:38:56, loss=0.378187186569502, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=3.906063975860246, lr=0.13709067026555133
2023-12-17 06:20:53   INFO  epoch: 16/24, acc_iter=56822, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:06:17, time_cost(all): 20:56:25/10:46:23, loss=0.377988329783453, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=3.9810817344915628, lr=0.13671523024985288
2023-12-17 06:22:00   INFO  epoch: 16/24, acc_iter=56872, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:07:59, time_cost(all): 20:57:32/10:27:05, loss=0.377789472997404, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.9(1.03), norm=4.18397269918472, lr=0.1363397902341545
2023-12-17 06:23:07   INFO  epoch: 16/24, acc_iter=56922, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:02:06, time_cost(all): 20:58:39/10:28:13, loss=0.377590616211355, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.83(1.03), norm=2.307709465594049, lr=0.1359643502184561
2023-12-17 06:24:13   INFO  epoch: 16/24, acc_iter=56972, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:03:58, time_cost(all): 20:59:45/10:33:40, loss=0.377391759425306, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=3.2782179562326133, lr=0.1355889102027577
2023-12-17 06:25:20   INFO  epoch: 16/24, acc_iter=57022, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:04:02, time_cost(all): 21:00:52/9:59:04, loss=0.377192902639257, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=2.9895530580644247, lr=0.13521347018705926
2023-12-17 06:26:27   INFO  epoch: 16/24, acc_iter=57072, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:57:21, time_cost(all): 21:01:59/10:00:08, loss=0.376994045853208, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=2.147184313308829, lr=0.13483803017136087
2023-12-17 06:27:33   INFO  epoch: 16/24, acc_iter=57122, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/1:00:17, time_cost(all): 21:03:05/9:51:16, loss=0.376795189067159, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=0.8718225910022745, lr=0.13446259015566248
2023-12-17 06:28:40   INFO  epoch: 16/24, acc_iter=57172, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:58:43, time_cost(all): 21:04:12/10:07:11, loss=0.37659633228111, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=4.425816245332506, lr=0.13408715013996414
2023-12-17 06:29:47   INFO  epoch: 16/24, acc_iter=57222, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:57:17, time_cost(all): 21:05:19/9:45:34, loss=0.376397475495061, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=3.1243758614079304, lr=0.1337117101242657
2023-12-17 06:30:53   INFO  epoch: 16/24, acc_iter=57272, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:55:14, time_cost(all): 21:06:25/10:16:44, loss=0.376198618709012, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=0.6491461553708479, lr=0.1333362701085673
2023-12-17 06:32:00   INFO  epoch: 16/24, acc_iter=57322, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:54:29, time_cost(all): 21:07:32/9:42:53, loss=0.375999761922963, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=2.9478062510166896, lr=0.1329608300928689
2023-12-17 06:33:07   INFO  epoch: 16/24, acc_iter=57372, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:04, time_cost(all): 21:08:39/10:21:45, loss=0.375800905136914, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=4.005857032105498, lr=0.13258539007717052
2023-12-17 06:34:13   INFO  epoch: 16/24, acc_iter=57422, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:50:49, time_cost(all): 21:09:45/10:24:37, loss=0.375602048350865, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=1.6460544992067108, lr=0.13220995006147207
2023-12-17 06:35:20   INFO  epoch: 16/24, acc_iter=57472, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:52:50, time_cost(all): 21:10:52/10:26:16, loss=0.375403191564816, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=1.1729487868639485, lr=0.13183451004577368
2023-12-17 06:36:27   INFO  epoch: 16/24, acc_iter=57522, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:49:08, time_cost(all): 21:11:59/10:08:21, loss=0.375204334778767, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=0.9209994528019824, lr=0.1314590700300753
2023-12-17 06:37:33   INFO  epoch: 16/24, acc_iter=57572, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:46:50, time_cost(all): 21:13:05/10:15:04, loss=0.375005477992718, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=3.498696750879926, lr=0.13108363001437695
2023-12-17 06:38:40   INFO  epoch: 16/24, acc_iter=57622, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:48:33, time_cost(all): 21:14:12/9:47:41, loss=0.374806621206669, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=1.4524125232540601, lr=0.1307081899986785
2023-12-17 06:39:47   INFO  epoch: 16/24, acc_iter=57672, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:46:52, time_cost(all): 21:15:19/9:50:00, loss=0.37460776442062, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=1.73354963510882, lr=0.1303327499829801
2023-12-17 06:40:53   INFO  epoch: 16/24, acc_iter=57722, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:43:40, time_cost(all): 21:16:25/10:08:19, loss=0.374408907634571, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=4.1764159565349965, lr=0.12995730996728172
2023-12-17 06:42:00   INFO  epoch: 16/24, acc_iter=57772, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:43:52, time_cost(all): 21:17:32/10:07:07, loss=0.374210050848522, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=0.7231597957601271, lr=0.12958186995158333
2023-12-17 06:43:07   INFO  epoch: 16/24, acc_iter=57822, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:41:31, time_cost(all): 21:18:39/10:06:06, loss=0.374011194062473, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=4.667834134958425, lr=0.12920642993588488
2023-12-17 06:44:13   INFO  epoch: 16/24, acc_iter=57872, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:43:15, time_cost(all): 21:19:45/9:55:54, loss=0.373812337276424, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=1.6133803573675638, lr=0.1288309899201865
2023-12-17 06:45:20   INFO  epoch: 16/24, acc_iter=57922, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:32, time_cost(all): 21:20:52/9:28:49, loss=0.373613480490375, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=3.239365105345683, lr=0.1284555499044881
2023-12-17 06:46:26   INFO  epoch: 16/24, acc_iter=57972, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:39:23, time_cost(all): 21:21:58/10:18:52, loss=0.373414623704326, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=3.917745854492689, lr=0.12808010988878976
2023-12-17 06:47:33   INFO  epoch: 16/24, acc_iter=58022, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:37:55, time_cost(all): 21:23:05/10:14:27, loss=0.373215766918277, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=2.404369251116518, lr=0.12770466987309131
2023-12-17 06:48:40   INFO  epoch: 16/24, acc_iter=58072, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:37:29, time_cost(all): 21:24:12/9:53:01, loss=0.373016910132228, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=4.197022011266407, lr=0.12732922985739292
2023-12-17 06:49:46   INFO  epoch: 16/24, acc_iter=58122, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:36:54, time_cost(all): 21:25:18/9:29:14, loss=0.372818053346179, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=1.1494858813980893, lr=0.12695378984169453
2023-12-17 06:50:53   INFO  epoch: 16/24, acc_iter=58172, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:34:31, time_cost(all): 21:26:25/9:30:22, loss=0.37261919656013, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=1.0472716154197796, lr=0.12657834982599614
2023-12-17 06:52:00   INFO  epoch: 16/24, acc_iter=58222, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:34:36, time_cost(all): 21:27:32/9:43:44, loss=0.372420339774081, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.2783317732227584, lr=0.1262029098102977
2023-12-17 06:53:06   INFO  epoch: 16/24, acc_iter=58272, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:33:28, time_cost(all): 21:28:38/9:27:37, loss=0.372221482988032, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=3.863299796177391, lr=0.1258274697945993
2023-12-17 06:54:13   INFO  epoch: 16/24, acc_iter=58322, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:44, time_cost(all): 21:29:45/9:32:50, loss=0.372022626201983, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=3.7125325101834803, lr=0.1254520297789009
2023-12-17 06:55:20   INFO  epoch: 16/24, acc_iter=58372, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:06, time_cost(all): 21:30:52/9:54:00, loss=0.371823769415934, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=4.400643326475326, lr=0.12507658976320257
2023-12-17 06:56:26   INFO  epoch: 16/24, acc_iter=58422, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:31:18, time_cost(all): 21:31:58/10:12:07, loss=0.371624912629885, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=4.876559165797917, lr=0.12470114974750413
2023-12-17 06:57:33   INFO  epoch: 16/24, acc_iter=58472, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:54, time_cost(all): 21:33:05/9:50:32, loss=0.371426055843836, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=1.0568152413654919, lr=0.12432570973180573
2023-12-17 06:58:40   INFO  epoch: 16/24, acc_iter=58522, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:50, time_cost(all): 21:34:12/9:39:57, loss=0.371227199057787, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=1.6817627937578896, lr=0.12395026971610734
2023-12-17 06:59:46   INFO  epoch: 16/24, acc_iter=58572, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:28:03, time_cost(all): 21:35:18/9:12:24, loss=0.371028342271738, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=1.0685922123680918, lr=0.12357482970040895
2023-12-17 07:00:53   INFO  epoch: 16/24, acc_iter=58622, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:57, time_cost(all): 21:36:25/9:32:41, loss=0.370829485485689, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=1.5873330028965464, lr=0.1231993896847105
2023-12-17 07:02:00   INFO  epoch: 16/24, acc_iter=58672, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:23:53, time_cost(all): 21:37:32/9:24:39, loss=0.37063062869964, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=4.213868816272154, lr=0.12282394966901211
2023-12-17 07:03:06   INFO  epoch: 16/24, acc_iter=58722, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:22, time_cost(all): 21:38:38/9:27:26, loss=0.370431771913591, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=2.9734648293331407, lr=0.12244850965331372
2023-12-17 07:04:13   INFO  epoch: 16/24, acc_iter=58772, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:03, time_cost(all): 21:39:45/9:59:33, loss=0.370232915127542, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.6018780988332089, lr=0.12207306963761538
2023-12-17 07:05:20   INFO  epoch: 16/24, acc_iter=58822, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:22:01, time_cost(all): 21:40:52/9:37:57, loss=0.370034058341493, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=2.0365861050886567, lr=0.12169762962191699
2023-12-17 07:06:26   INFO  epoch: 16/24, acc_iter=58872, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:38, time_cost(all): 21:41:58/9:37:24, loss=0.369835201555444, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=3.3095912027757612, lr=0.12132218960621854
2023-12-17 07:07:33   INFO  epoch: 16/24, acc_iter=58922, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:39, time_cost(all): 21:43:05/9:29:56, loss=0.369636344769395, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=3.0553690162957494, lr=0.12094674959052015
2023-12-17 07:08:40   INFO  epoch: 16/24, acc_iter=58972, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:39, time_cost(all): 21:44:12/9:27:26, loss=0.369437487983346, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=2.549189221313414, lr=0.12057130957482176
2023-12-17 07:09:46   INFO  epoch: 16/24, acc_iter=59022, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:25, time_cost(all): 21:45:18/9:02:09, loss=0.369238631197297, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=4.443927269130727, lr=0.12019586955912337
2023-12-17 07:10:53   INFO  epoch: 16/24, acc_iter=59072, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:06, time_cost(all): 21:46:25/9:00:58, loss=0.369039774411248, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=4.999129069404463, lr=0.11982042954342492
2023-12-17 07:12:00   INFO  epoch: 16/24, acc_iter=59122, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:26, time_cost(all): 21:47:32/9:23:27, loss=0.368840917625199, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=3.3328569916238533, lr=0.11944498952772653
2023-12-17 07:13:06   INFO  epoch: 16/24, acc_iter=59172, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:06, time_cost(all): 21:48:38/9:34:46, loss=0.36864206083915, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=1.402605263955921, lr=0.1190695495120282
2023-12-17 07:14:13   INFO  epoch: 16/24, acc_iter=59222, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:50, time_cost(all): 21:49:45/8:57:50, loss=0.368443204053101, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.23(1.03), norm=1.3147042048225053, lr=0.1186941094963298
2023-12-17 07:15:19   INFO  epoch: 16/24, acc_iter=59272, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:38, time_cost(all): 21:50:51/9:06:06, loss=0.368244347267052, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=3.913109540493493, lr=0.11831866948063136
2023-12-17 07:16:26   INFO  epoch: 16/24, acc_iter=59322, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:13, time_cost(all): 21:51:58/9:06:23, loss=0.368045490481003, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=2.438005820798697, lr=0.11794322946493296
2023-12-17 07:17:33   INFO  epoch: 16/24, acc_iter=59372, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:00, time_cost(all): 21:53:05/9:34:14, loss=0.367846633694954, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=4.418254152506082, lr=0.11756778944923457
2023-12-17 07:18:39   INFO  epoch: 16/24, acc_iter=59422, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:24, time_cost(all): 21:54:11/9:41:53, loss=0.367647776908905, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=2.3979698049851907, lr=0.11719234943353618
2023-12-17 07:19:46   INFO  epoch: 16/24, acc_iter=59472, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:16, time_cost(all): 21:55:18/9:01:25, loss=0.367448920122856, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=1.2264186758911517, lr=0.11681690941783773
2023-12-17 07:20:53   INFO  epoch: 16/24, acc_iter=59522, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:11, time_cost(all): 21:56:25/9:21:34, loss=0.367250063336807, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=3.564619284283104, lr=0.11644146940213934
2023-12-17 07:21:59   INFO  epoch: 16/24, acc_iter=59572, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:49, time_cost(all): 21:57:31/9:20:49, loss=0.367051206550758, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=1.9930325049814663, lr=0.116066029386441
2023-12-17 07:23:06   INFO  epoch: 16/24, acc_iter=59622, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:35, time_cost(all): 21:58:38/8:58:55, loss=0.366852349764709, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.9688810933886782, lr=0.11569058937074261
2023-12-17 07:24:13   INFO  epoch: 16/24, acc_iter=59672, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:31, time_cost(all): 21:59:45/9:33:13, loss=0.36665349297866, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=3.5488417500520075, lr=0.11531514935504417
2023-12-17 07:25:19   INFO  epoch: 16/24, acc_iter=59722, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:28, time_cost(all): 22:00:51/9:23:21, loss=0.366454636192611, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=2.717424328759863, lr=0.11493970933934577
2023-12-17 07:26:26   INFO  epoch: 16/24, acc_iter=59772, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 22:01:58/9:39:34, loss=0.366255779406562, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=4.594369305880127, lr=0.11456426932364738
2023-12-17 07:27:33   INFO  epoch: 17/24, acc_iter=59839, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:15:19, time_cost(all): 22:03:05/9:03:54, loss=0.365989311313256, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=2.869536553223691, lr=0.11406117970261154
2023-12-17 07:28:39   INFO  epoch: 17/24, acc_iter=59889, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:15:02, time_cost(all): 22:04:11/8:54:34, loss=0.365790454527207, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.6557056520643714, lr=0.11368573968691309
2023-12-17 07:29:46   INFO  epoch: 17/24, acc_iter=59939, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:14:50, time_cost(all): 22:05:18/9:09:21, loss=0.365591597741158, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.94(1.03), norm=0.8209216937718584, lr=0.1133102996712147
2023-12-17 07:30:53   INFO  epoch: 17/24, acc_iter=59989, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:16:22, time_cost(all): 22:06:25/9:08:08, loss=0.365392740955109, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=0.8700971853072607, lr=0.11293485965551636
2023-12-17 07:31:59   INFO  epoch: 17/24, acc_iter=60039, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:10:52, time_cost(all): 22:07:31/8:49:48, loss=0.36519388416906, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=3.604511869778663, lr=0.11255941963981797
2023-12-17 07:33:06   INFO  epoch: 17/24, acc_iter=60089, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:08:54, time_cost(all): 22:08:38/9:28:27, loss=0.364995027383011, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=1.0950079361731162, lr=0.11218397962411952
2023-12-17 07:34:13   INFO  epoch: 17/24, acc_iter=60139, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:30, time_cost(all): 22:09:45/9:15:02, loss=0.364796170596962, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=1.3673502913288238, lr=0.11180853960842113
2023-12-17 07:35:19   INFO  epoch: 17/24, acc_iter=60189, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:06:48, time_cost(all): 22:10:51/8:56:41, loss=0.364597313810913, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=4.3164727665730664, lr=0.11143309959272274
2023-12-17 07:36:26   INFO  epoch: 17/24, acc_iter=60239, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:06:10, time_cost(all): 22:11:58/8:38:35, loss=0.364398457024864, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=1.2563111709056656, lr=0.11105765957702435
2023-12-17 07:37:33   INFO  epoch: 17/24, acc_iter=60289, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:05:10, time_cost(all): 22:13:05/8:59:22, loss=0.364199600238815, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=3.742280934246252, lr=0.1106822195613259
2023-12-17 07:38:39   INFO  epoch: 17/24, acc_iter=60339, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:04:02, time_cost(all): 22:14:11/9:13:06, loss=0.364000743452766, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=0.7687965933780189, lr=0.11030677954562751
2023-12-17 07:39:46   INFO  epoch: 17/24, acc_iter=60389, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:05:21, time_cost(all): 22:15:18/8:42:27, loss=0.363801886666717, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=2.935013897645743, lr=0.10993133952992917
2023-12-17 07:40:53   INFO  epoch: 17/24, acc_iter=60439, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:04:26, time_cost(all): 22:16:25/8:38:44, loss=0.363603029880668, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.18(1.03), norm=0.817134312595436, lr=0.10955589951423078
2023-12-17 07:41:59   INFO  epoch: 17/24, acc_iter=60489, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:03:15, time_cost(all): 22:17:31/9:14:32, loss=0.363404173094619, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=3.0612416258069537, lr=0.10918045949853233
2023-12-17 07:43:06   INFO  epoch: 17/24, acc_iter=60539, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:00:06, time_cost(all): 22:18:38/8:52:59, loss=0.36320531630857, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=2.0751223061999866, lr=0.10880501948283394
2023-12-17 07:44:12   INFO  epoch: 17/24, acc_iter=60589, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:02:20, time_cost(all): 22:19:44/8:28:57, loss=0.363006459522521, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=2.3980520642059098, lr=0.10842957946713555
2023-12-17 07:45:19   INFO  epoch: 17/24, acc_iter=60639, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/1:02:07, time_cost(all): 22:20:51/8:36:04, loss=0.362807602736472, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=3.9262747039390913, lr=0.10805413945143716
2023-12-17 07:46:26   INFO  epoch: 17/24, acc_iter=60689, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:57:03, time_cost(all): 22:21:58/9:07:56, loss=0.362608745950423, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.3107051420008884, lr=0.10767869943573871
2023-12-17 07:47:32   INFO  epoch: 17/24, acc_iter=60739, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:56:20, time_cost(all): 22:23:04/9:11:36, loss=0.362409889164374, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=4.273305940976211, lr=0.10730325942004032
2023-12-17 07:48:39   INFO  epoch: 17/24, acc_iter=60789, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:57:29, time_cost(all): 22:24:11/9:11:21, loss=0.362211032378325, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=4.323606162871776, lr=0.10692781940434198
2023-12-17 07:49:46   INFO  epoch: 17/24, acc_iter=60839, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:54:21, time_cost(all): 22:25:18/8:57:04, loss=0.362012175592276, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.05(1.03), norm=0.6540132922269093, lr=0.10655237938864359
2023-12-17 07:50:52   INFO  epoch: 17/24, acc_iter=60889, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:51:23, time_cost(all): 22:26:24/8:24:04, loss=0.361813318806227, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=0.619436683065022, lr=0.10617693937294514
2023-12-17 07:51:59   INFO  epoch: 17/24, acc_iter=60939, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:51:07, time_cost(all): 22:27:31/8:26:21, loss=0.361614462020178, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=3.566637918903226, lr=0.10580149935724675
2023-12-17 07:53:06   INFO  epoch: 17/24, acc_iter=60989, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:35, time_cost(all): 22:28:38/8:52:04, loss=0.361415605234129, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=3.2424819096450728, lr=0.10542605934154836
2023-12-17 07:54:12   INFO  epoch: 17/24, acc_iter=61039, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:48:33, time_cost(all): 22:29:44/8:47:15, loss=0.36121674844808, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=3.83761331234539, lr=0.10505061932584997
2023-12-17 07:55:19   INFO  epoch: 17/24, acc_iter=61089, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:50:56, time_cost(all): 22:30:51/8:37:06, loss=0.361017891662031, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=1.1657626604540292, lr=0.10467517931015152
2023-12-17 07:56:26   INFO  epoch: 17/24, acc_iter=61139, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:49:30, time_cost(all): 22:31:58/8:47:32, loss=0.360819034875982, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.031653138301798, lr=0.10429973929445313
2023-12-17 07:57:32   INFO  epoch: 17/24, acc_iter=61189, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:46:23, time_cost(all): 22:33:04/8:22:39, loss=0.360620178089933, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=2.190944849439676, lr=0.1039242992787548
2023-12-17 07:58:39   INFO  epoch: 17/24, acc_iter=61239, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:44:04, time_cost(all): 22:34:11/8:42:29, loss=0.360421321303884, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=0.5383513616105488, lr=0.1035488592630564
2023-12-17 07:59:46   INFO  epoch: 17/24, acc_iter=61289, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:44:05, time_cost(all): 22:35:18/8:38:12, loss=0.360222464517835, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=0.908509949382708, lr=0.10317341924735796
2023-12-17 08:00:52   INFO  epoch: 17/24, acc_iter=61339, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:34, time_cost(all): 22:36:24/8:28:49, loss=0.360023607731786, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=2.519957158497883, lr=0.10279797923165956
2023-12-17 08:01:59   INFO  epoch: 17/24, acc_iter=61389, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:43:12, time_cost(all): 22:37:31/8:18:18, loss=0.359824750945737, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=1.9608327947002187, lr=0.10242253921596117
2023-12-17 08:03:06   INFO  epoch: 17/24, acc_iter=61439, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:43:31, time_cost(all): 22:38:38/8:51:34, loss=0.359625894159688, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=2.562058568010677, lr=0.10204709920026278
2023-12-17 08:04:12   INFO  epoch: 17/24, acc_iter=61489, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:19, time_cost(all): 22:39:44/8:09:59, loss=0.359427037373639, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=2.2555738973782624, lr=0.10167165918456433
2023-12-17 08:05:19   INFO  epoch: 17/24, acc_iter=61539, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:39:48, time_cost(all): 22:40:51/8:36:33, loss=0.35922818058759, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=2.136158589618803, lr=0.10129621916886594
2023-12-17 08:06:26   INFO  epoch: 17/24, acc_iter=61589, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:37:59, time_cost(all): 22:41:58/8:15:28, loss=0.359029323801541, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=2.6921565167570716, lr=0.1009207791531676
2023-12-17 08:07:32   INFO  epoch: 17/24, acc_iter=61639, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:36:05, time_cost(all): 22:43:04/8:26:21, loss=0.358830467015492, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=2.7645543320165187, lr=0.10054533913746921
2023-12-17 08:08:39   INFO  epoch: 17/24, acc_iter=61689, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:36:50, time_cost(all): 22:44:11/8:15:29, loss=0.358631610229443, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=1.9941296275957248, lr=0.10016989912177077
2023-12-17 08:09:46   INFO  epoch: 17/24, acc_iter=61739, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:35:40, time_cost(all): 22:45:18/8:09:36, loss=0.358432753443394, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=2.6215722341599763, lr=0.09979445910607238
2023-12-17 08:10:52   INFO  epoch: 17/24, acc_iter=61789, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:33:56, time_cost(all): 22:46:24/8:22:03, loss=0.358233896657345, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=1.0532489550839481, lr=0.09941901909037398
2023-12-17 08:11:59   INFO  epoch: 17/24, acc_iter=61839, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:31:11, time_cost(all): 22:47:31/8:09:52, loss=0.358035039871296, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.2(1.03), norm=1.572705047078656, lr=0.09904357907467559
2023-12-17 08:13:06   INFO  epoch: 17/24, acc_iter=61889, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:32:16, time_cost(all): 22:48:38/8:23:13, loss=0.357836183085247, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=4.217066321944184, lr=0.09866813905897714
2023-12-17 08:14:12   INFO  epoch: 17/24, acc_iter=61939, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:30:59, time_cost(all): 22:49:44/8:02:14, loss=0.357637326299198, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=1.7653489237810573, lr=0.09829269904327875
2023-12-17 08:15:19   INFO  epoch: 17/24, acc_iter=61989, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:06, time_cost(all): 22:50:51/8:02:05, loss=0.357438469513149, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=0.7127894462465876, lr=0.09791725902758042
2023-12-17 08:16:25   INFO  epoch: 17/24, acc_iter=62039, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:33, time_cost(all): 22:51:57/8:30:15, loss=0.3572396127271, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=3.21646303436935, lr=0.09754181901188202
2023-12-17 08:17:32   INFO  epoch: 17/24, acc_iter=62089, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:27:28, time_cost(all): 22:53:04/8:19:21, loss=0.357040755941051, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=1.2732862172598645, lr=0.09716637899618358
2023-12-17 08:18:39   INFO  epoch: 17/24, acc_iter=62139, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:26:17, time_cost(all): 22:54:11/8:19:50, loss=0.356841899155002, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=3.882601352627581, lr=0.09679093898048519
2023-12-17 08:19:45   INFO  epoch: 17/24, acc_iter=62189, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:29, time_cost(all): 22:55:17/8:04:43, loss=0.356643042368953, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=4.520841279412406, lr=0.0964154989647868
2023-12-17 08:20:52   INFO  epoch: 17/24, acc_iter=62239, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:22:43, time_cost(all): 22:56:24/8:15:21, loss=0.356444185582904, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=4.503395616252195, lr=0.0960400589490884
2023-12-17 08:21:59   INFO  epoch: 17/24, acc_iter=62289, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:23:20, time_cost(all): 22:57:31/7:55:38, loss=0.356245328796855, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=4.4196502577245615, lr=0.09566461893338996
2023-12-17 08:23:05   INFO  epoch: 17/24, acc_iter=62339, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:21:07, time_cost(all): 22:58:37/8:37:14, loss=0.356046472010806, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=1.8240656138875455, lr=0.09528917891769156
2023-12-17 08:24:12   INFO  epoch: 17/24, acc_iter=62389, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:24, time_cost(all): 22:59:44/8:26:46, loss=0.355847615224757, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=1.21570826548675, lr=0.09491373890199323
2023-12-17 08:25:19   INFO  epoch: 17/24, acc_iter=62439, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:26, time_cost(all): 23:00:51/8:31:14, loss=0.355648758438708, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=2.9693452318852263, lr=0.09453829888629484
2023-12-17 08:26:25   INFO  epoch: 17/24, acc_iter=62489, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:30, time_cost(all): 23:01:57/7:54:43, loss=0.355449901652659, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=2.42563595469304, lr=0.09416285887059639
2023-12-17 08:27:32   INFO  epoch: 17/24, acc_iter=62539, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:58, time_cost(all): 23:03:04/8:06:23, loss=0.35525104486661, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=3.343345906949395, lr=0.093787418854898
2023-12-17 08:28:39   INFO  epoch: 17/24, acc_iter=62589, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:18, time_cost(all): 23:04:11/8:29:25, loss=0.355052188080561, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=4.154959550173701, lr=0.0934119788391996
2023-12-17 08:29:45   INFO  epoch: 17/24, acc_iter=62639, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:49, time_cost(all): 23:05:17/8:09:16, loss=0.354853331294512, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=2.344152087170164, lr=0.09303653882350121
2023-12-17 08:30:52   INFO  epoch: 17/24, acc_iter=62689, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:16, time_cost(all): 23:06:24/7:58:56, loss=0.354654474508463, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=2.266026793797144, lr=0.09266109880780277
2023-12-17 08:31:59   INFO  epoch: 17/24, acc_iter=62739, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:41, time_cost(all): 23:07:31/8:10:34, loss=0.354455617722414, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=3.00065680276306, lr=0.09228565879210437
2023-12-17 08:33:05   INFO  epoch: 17/24, acc_iter=62789, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:10:57, time_cost(all): 23:08:37/7:47:07, loss=0.354256760936365, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=4.977019333674544, lr=0.09191021877640604
2023-12-17 08:34:12   INFO  epoch: 17/24, acc_iter=62839, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:10, time_cost(all): 23:09:44/7:55:27, loss=0.354057904150316, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=1.5640098384468228, lr=0.09153477876070765
2023-12-17 08:35:19   INFO  epoch: 17/24, acc_iter=62889, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:14, time_cost(all): 23:10:51/7:41:15, loss=0.353859047364267, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=2.499919195812575, lr=0.0911593387450092
2023-12-17 08:36:25   INFO  epoch: 17/24, acc_iter=62939, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:30, time_cost(all): 23:11:57/7:52:22, loss=0.353660190578218, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=3.386232186831539, lr=0.09078389872931081
2023-12-17 08:37:32   INFO  epoch: 17/24, acc_iter=62989, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:50, time_cost(all): 23:13:04/7:42:42, loss=0.353461333792169, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=3.4783627912127777, lr=0.09040845871361242
2023-12-17 08:38:39   INFO  epoch: 17/24, acc_iter=63039, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:08, time_cost(all): 23:14:11/7:46:57, loss=0.35326247700612, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=3.7994784827565735, lr=0.09003301869791402
2023-12-17 08:39:45   INFO  epoch: 17/24, acc_iter=63089, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:05:02, time_cost(all): 23:15:17/8:11:11, loss=0.353063620220071, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=3.6199527035720074, lr=0.08965757868221563
2023-12-17 08:40:52   INFO  epoch: 17/24, acc_iter=63139, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:45, time_cost(all): 23:16:24/8:13:23, loss=0.352864763434022, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=3.0012102588688148, lr=0.08928213866651719
2023-12-17 08:41:59   INFO  epoch: 17/24, acc_iter=63189, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:39, time_cost(all): 23:17:31/7:44:24, loss=0.352665906647973, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=0.7840804891930926, lr=0.08890669865081885
2023-12-17 08:43:05   INFO  epoch: 17/24, acc_iter=63239, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:25, time_cost(all): 23:18:37/8:13:13, loss=0.352467049861924, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=2.4551489035194454, lr=0.08853125863512046
2023-12-17 08:44:12   INFO  epoch: 17/24, acc_iter=63289, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 23:19:44/8:18:08, loss=0.352268193075875, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=1.4989924884766914, lr=0.08815581861942207
2023-12-17 08:45:18   INFO  epoch: 18/24, acc_iter=63356, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:17:27, time_cost(all): 23:20:50/7:39:31, loss=0.352001724982569, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.0354890431104393, lr=0.08765272899838616
2023-12-17 08:46:25   INFO  epoch: 18/24, acc_iter=63406, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:16:48, time_cost(all): 23:21:57/8:06:22, loss=0.35180286819652, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=3.8300627586531264, lr=0.08727728898268777
2023-12-17 08:47:32   INFO  epoch: 18/24, acc_iter=63456, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:15:07, time_cost(all): 23:23:04/8:03:00, loss=0.351604011410471, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=1.8276385227556917, lr=0.08690184896698938
2023-12-17 08:48:38   INFO  epoch: 18/24, acc_iter=63506, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:17:09, time_cost(all): 23:24:10/7:50:31, loss=0.351405154624422, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.23(1.03), norm=1.6322705873993377, lr=0.08652640895129099
2023-12-17 08:49:45   INFO  epoch: 18/24, acc_iter=63556, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:09:07, time_cost(all): 23:25:17/7:59:49, loss=0.351206297838373, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=4.992598870379896, lr=0.08615096893559254
2023-12-17 08:50:52   INFO  epoch: 18/24, acc_iter=63606, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:13:36, time_cost(all): 23:26:24/7:58:08, loss=0.351007441052324, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=2.6547093311743053, lr=0.0857755289198942
2023-12-17 08:51:58   INFO  epoch: 18/24, acc_iter=63656, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:13:15, time_cost(all): 23:27:30/8:01:13, loss=0.350808584266275, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=4.4078633492337325, lr=0.08540008890419581
2023-12-17 08:53:05   INFO  epoch: 18/24, acc_iter=63706, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:10:19, time_cost(all): 23:28:37/7:56:55, loss=0.350609727480226, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.16(1.03), norm=1.6066031373955436, lr=0.08502464888849742
2023-12-17 08:54:12   INFO  epoch: 18/24, acc_iter=63756, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:11:32, time_cost(all): 23:29:44/7:50:23, loss=0.350410870694177, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=3.373414927105353, lr=0.08464920887279898
2023-12-17 08:55:18   INFO  epoch: 18/24, acc_iter=63806, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:10:15, time_cost(all): 23:30:50/7:46:22, loss=0.350212013908128, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=0.5908018576726041, lr=0.08427376885710058
2023-12-17 08:56:25   INFO  epoch: 18/24, acc_iter=63856, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:06:13, time_cost(all): 23:31:57/8:06:24, loss=0.350013157122079, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=4.04456651300993, lr=0.08389832884140219
2023-12-17 08:57:32   INFO  epoch: 18/24, acc_iter=63906, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:07:51, time_cost(all): 23:33:04/7:53:02, loss=0.34981430033603, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=1.4335070393732943, lr=0.0835228888257038
2023-12-17 08:58:38   INFO  epoch: 18/24, acc_iter=63956, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:04:24, time_cost(all): 23:34:10/7:45:18, loss=0.349615443549981, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=3.6561340841825607, lr=0.08314744881000535
2023-12-17 08:59:45   INFO  epoch: 18/24, acc_iter=64006, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:05:03, time_cost(all): 23:35:17/7:24:29, loss=0.349416586763932, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=4.010220056489169, lr=0.08277200879430702
2023-12-17 09:00:52   INFO  epoch: 18/24, acc_iter=64056, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:58:29, time_cost(all): 23:36:24/7:36:49, loss=0.349217729977883, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=4.698858575562691, lr=0.08239656877860863
2023-12-17 09:01:58   INFO  epoch: 18/24, acc_iter=64106, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:02:13, time_cost(all): 23:37:30/7:39:02, loss=0.349018873191834, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=1.8463995461384428, lr=0.08202112876291023
2023-12-17 09:03:05   INFO  epoch: 18/24, acc_iter=64156, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:58:02, time_cost(all): 23:38:37/7:44:11, loss=0.348820016405785, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=0.7142595092687162, lr=0.08164568874721179
2023-12-17 09:04:12   INFO  epoch: 18/24, acc_iter=64206, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/1:00:59, time_cost(all): 23:39:44/7:17:02, loss=0.348621159619736, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=4.855873269194798, lr=0.0812702487315134
2023-12-17 09:05:18   INFO  epoch: 18/24, acc_iter=64256, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:58:46, time_cost(all): 23:40:50/7:24:07, loss=0.348422302833687, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.8709791810880705, lr=0.080894808715815
2023-12-17 09:06:25   INFO  epoch: 18/24, acc_iter=64306, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:58:31, time_cost(all): 23:41:57/7:38:31, loss=0.348223446047638, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=0.5054257308929397, lr=0.08051936870011661
2023-12-17 09:07:32   INFO  epoch: 18/24, acc_iter=64356, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:52:46, time_cost(all): 23:43:04/7:34:37, loss=0.348024589261589, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=2.412787168556966, lr=0.08014392868441816
2023-12-17 09:08:38   INFO  epoch: 18/24, acc_iter=64406, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:51:41, time_cost(all): 23:44:10/7:23:18, loss=0.34782573247554, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=1.3085781358854538, lr=0.07976848866871983
2023-12-17 09:09:45   INFO  epoch: 18/24, acc_iter=64456, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:54:04, time_cost(all): 23:45:17/7:33:47, loss=0.347626875689491, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=2.0300267942719943, lr=0.07939304865302144
2023-12-17 09:10:52   INFO  epoch: 18/24, acc_iter=64506, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:53:39, time_cost(all): 23:46:24/7:24:57, loss=0.347428018903442, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=1.7046285321705776, lr=0.07901760863732304
2023-12-17 09:11:58   INFO  epoch: 18/24, acc_iter=64556, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:51:45, time_cost(all): 23:47:30/7:47:04, loss=0.347229162117393, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=2.9136865894661113, lr=0.0786421686216246
2023-12-17 09:13:05   INFO  epoch: 18/24, acc_iter=64606, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:47:17, time_cost(all): 23:48:37/7:42:09, loss=0.347030305331344, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=3.4057453209062887, lr=0.0782667286059262
2023-12-17 09:14:11   INFO  epoch: 18/24, acc_iter=64656, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:50:27, time_cost(all): 23:49:43/7:20:20, loss=0.346831448545295, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=2.0710534310886937, lr=0.07789128859022781
2023-12-17 09:15:18   INFO  epoch: 18/24, acc_iter=64706, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:44:52, time_cost(all): 23:50:50/7:05:49, loss=0.346632591759246, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=4.056403099079738, lr=0.07751584857452942
2023-12-17 09:16:25   INFO  epoch: 18/24, acc_iter=64756, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:47:03, time_cost(all): 23:51:57/7:13:25, loss=0.346433734973197, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=1.6509619328919092, lr=0.07714040855883098
2023-12-17 09:17:31   INFO  epoch: 18/24, acc_iter=64806, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:45:36, time_cost(all): 23:53:03/7:26:47, loss=0.346234878187148, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=3.061764868594215, lr=0.07676496854313264
2023-12-17 09:18:38   INFO  epoch: 18/24, acc_iter=64856, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:45:48, time_cost(all): 23:54:10/7:28:02, loss=0.346036021401099, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=1.2432153529555947, lr=0.07638952852743425
2023-12-17 09:19:45   INFO  epoch: 18/24, acc_iter=64906, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:44:39, time_cost(all): 23:55:17/6:58:12, loss=0.34583716461505, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=3.0037033921881395, lr=0.07601408851173586
2023-12-17 09:20:51   INFO  epoch: 18/24, acc_iter=64956, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:40:23, time_cost(all): 23:56:23/7:39:20, loss=0.345638307829001, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=3.028879166360579, lr=0.07563864849603741
2023-12-17 09:21:58   INFO  epoch: 18/24, acc_iter=65006, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:39:31, time_cost(all): 23:57:30/7:26:42, loss=0.345439451042952, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.9(1.03), norm=2.9095626909820917, lr=0.07526320848033902
2023-12-17 09:23:05   INFO  epoch: 18/24, acc_iter=65056, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:37:32, time_cost(all): 23:58:37/7:32:40, loss=0.345240594256903, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=4.373450619485279, lr=0.07488776846464062
2023-12-17 09:24:11   INFO  epoch: 18/24, acc_iter=65106, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:37:51, time_cost(all): 23:59:43/7:17:32, loss=0.345041737470854, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=0.8770720324802433, lr=0.07451232844894223
2023-12-17 09:25:18   INFO  epoch: 18/24, acc_iter=65156, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:37:36, time_cost(all): 1 day, 0:00:50/7:01:51, loss=0.344842880684805, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=4.645132317968241, lr=0.07413688843324379
2023-12-17 09:26:25   INFO  epoch: 18/24, acc_iter=65206, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:36:44, time_cost(all): 1 day, 0:01:57/7:10:48, loss=0.344644023898756, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=0.7594120707826637, lr=0.07376144841754545
2023-12-17 09:27:31   INFO  epoch: 18/24, acc_iter=65256, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:34:52, time_cost(all): 1 day, 0:03:03/7:19:14, loss=0.344445167112707, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=4.6260445903064245, lr=0.07338600840184706
2023-12-17 09:28:38   INFO  epoch: 18/24, acc_iter=65306, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:33:32, time_cost(all): 1 day, 0:04:10/6:58:23, loss=0.344246310326658, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=1.0898077118860483, lr=0.07301056838614867
2023-12-17 09:29:45   INFO  epoch: 18/24, acc_iter=65356, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:42, time_cost(all): 1 day, 0:05:17/7:09:00, loss=0.344047453540609, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=4.784829891323295, lr=0.07263512837045022
2023-12-17 09:30:51   INFO  epoch: 18/24, acc_iter=65406, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:46, time_cost(all): 1 day, 0:06:23/6:52:39, loss=0.34384859675456, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=1.531770540716398, lr=0.07225968835475183
2023-12-17 09:31:58   INFO  epoch: 18/24, acc_iter=65456, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:30:42, time_cost(all): 1 day, 0:07:30/7:09:49, loss=0.343649739968511, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=2.856622227846245, lr=0.07188424833905344
2023-12-17 09:33:05   INFO  epoch: 18/24, acc_iter=65506, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:13, time_cost(all): 1 day, 0:08:37/7:08:26, loss=0.343450883182462, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=3.6393952568989865, lr=0.07150880832335504
2023-12-17 09:34:11   INFO  epoch: 18/24, acc_iter=65556, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:37, time_cost(all): 1 day, 0:09:43/7:04:14, loss=0.343252026396413, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=2.5510750635938493, lr=0.0711333683076566
2023-12-17 09:35:18   INFO  epoch: 18/24, acc_iter=65606, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:29, time_cost(all): 1 day, 0:10:50/6:57:10, loss=0.343053169610364, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=2.369399098184619, lr=0.07075792829195826
2023-12-17 09:36:25   INFO  epoch: 18/24, acc_iter=65656, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:27:05, time_cost(all): 1 day, 0:11:57/6:59:36, loss=0.342854312824315, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=0.5663704142012829, lr=0.07038248827625987
2023-12-17 09:37:31   INFO  epoch: 18/24, acc_iter=65706, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:24:05, time_cost(all): 1 day, 0:13:03/6:51:29, loss=0.342655456038266, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=4.363386178968938, lr=0.07000704826056148
2023-12-17 09:38:38   INFO  epoch: 18/24, acc_iter=65756, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:51, time_cost(all): 1 day, 0:14:10/6:50:59, loss=0.342456599252217, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.080391446872055, lr=0.06963160824486303
2023-12-17 09:39:45   INFO  epoch: 18/24, acc_iter=65806, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:09, time_cost(all): 1 day, 0:15:17/6:57:08, loss=0.342257742466168, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=1.6663351903193084, lr=0.06925616822916464
2023-12-17 09:40:51   INFO  epoch: 18/24, acc_iter=65856, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:30, time_cost(all): 1 day, 0:16:23/7:13:43, loss=0.342058885680119, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=2.9078155551855804, lr=0.06888072821346625
2023-12-17 09:41:58   INFO  epoch: 18/24, acc_iter=65906, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:04, time_cost(all): 1 day, 0:17:30/7:02:44, loss=0.34186002889407, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=1.3841647589403463, lr=0.06850528819776786
2023-12-17 09:43:05   INFO  epoch: 18/24, acc_iter=65956, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:40, time_cost(all): 1 day, 0:18:37/6:49:56, loss=0.341661172108021, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=2.4883265100487906, lr=0.06812984818206941
2023-12-17 09:44:11   INFO  epoch: 18/24, acc_iter=66006, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:17:19, time_cost(all): 1 day, 0:19:43/6:51:59, loss=0.341462315321972, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=0.7947615436541641, lr=0.06775440816637107
2023-12-17 09:45:18   INFO  epoch: 18/24, acc_iter=66056, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:17, time_cost(all): 1 day, 0:20:50/7:06:22, loss=0.341263458535923, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=2.3379542650300777, lr=0.06737896815067268
2023-12-17 09:46:24   INFO  epoch: 18/24, acc_iter=66106, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:22, time_cost(all): 1 day, 0:21:56/7:06:10, loss=0.341064601749874, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=0.5702184542301962, lr=0.06700352813497429
2023-12-17 09:47:31   INFO  epoch: 18/24, acc_iter=66156, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:25, time_cost(all): 1 day, 0:23:03/6:48:53, loss=0.340865744963825, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=3.3007636990499365, lr=0.06662808811927584
2023-12-17 09:48:38   INFO  epoch: 18/24, acc_iter=66206, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:34, time_cost(all): 1 day, 0:24:10/6:31:10, loss=0.340666888177776, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=3.1171059516697186, lr=0.06625264810357745
2023-12-17 09:49:44   INFO  epoch: 18/24, acc_iter=66256, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:13:09, time_cost(all): 1 day, 0:25:16/7:08:08, loss=0.340468031391727, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=4.1175852100627335, lr=0.06587720808787906
2023-12-17 09:50:51   INFO  epoch: 18/24, acc_iter=66306, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:05, time_cost(all): 1 day, 0:26:23/6:44:06, loss=0.340269174605678, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=4.476369757924563, lr=0.06550176807218067
2023-12-17 09:51:58   INFO  epoch: 18/24, acc_iter=66356, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:01, time_cost(all): 1 day, 0:27:30/7:04:00, loss=0.340070317819629, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=4.57519760916604, lr=0.06512632805648222
2023-12-17 09:53:04   INFO  epoch: 18/24, acc_iter=66406, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:41, time_cost(all): 1 day, 0:28:36/6:54:17, loss=0.33987146103358, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=3.1616777480148976, lr=0.06475088804078388
2023-12-17 09:54:11   INFO  epoch: 18/24, acc_iter=66456, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:07:46, time_cost(all): 1 day, 0:29:43/6:48:03, loss=0.339672604247531, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=4.589520690737662, lr=0.06437544802508549
2023-12-17 09:55:18   INFO  epoch: 18/24, acc_iter=66506, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:54, time_cost(all): 1 day, 0:30:50/6:35:54, loss=0.339473747461482, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=3.9212265370942267, lr=0.0640000080093871
2023-12-17 09:56:24   INFO  epoch: 18/24, acc_iter=66556, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:06, time_cost(all): 1 day, 0:31:56/6:35:58, loss=0.339274890675433, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=3.550041080292336, lr=0.06362456799368865
2023-12-17 09:57:31   INFO  epoch: 18/24, acc_iter=66606, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:40, time_cost(all): 1 day, 0:33:03/6:36:29, loss=0.339076033889384, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=2.141310271925806, lr=0.06324912797799026
2023-12-17 09:58:38   INFO  epoch: 18/24, acc_iter=66656, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:53, time_cost(all): 1 day, 0:34:10/6:31:28, loss=0.338877177103335, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=3.971771151944597, lr=0.06287368796229187
2023-12-17 09:59:44   INFO  epoch: 18/24, acc_iter=66706, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:31, time_cost(all): 1 day, 0:35:16/6:30:22, loss=0.338678320317286, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=2.3476820973895816, lr=0.06249824794659348
2023-12-17 10:00:51   INFO  epoch: 18/24, acc_iter=66756, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:26, time_cost(all): 1 day, 0:36:23/6:58:30, loss=0.338479463531237, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.23(1.03), norm=1.1772468652601331, lr=0.06212280793089503
2023-12-17 10:01:58   INFO  epoch: 18/24, acc_iter=66806, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 1 day, 0:37:30/6:52:16, loss=0.338280606745188, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=3.7882360145052023, lr=0.061747367915196694
2023-12-17 10:03:04   INFO  epoch: 19/24, acc_iter=66873, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:20:24, time_cost(all): 1 day, 0:38:36/6:48:50, loss=0.338014138651883, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=3.9740073107764244, lr=0.06124427829416085
2023-12-17 10:04:11   INFO  epoch: 19/24, acc_iter=66923, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:18:44, time_cost(all): 1 day, 0:39:43/6:40:21, loss=0.337815281865834, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=1.787228368257039, lr=0.060868838278462456
2023-12-17 10:05:18   INFO  epoch: 19/24, acc_iter=66973, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:16:24, time_cost(all): 1 day, 0:40:50/6:53:26, loss=0.337616425079785, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=0.8960797283587344, lr=0.06049339826276401
2023-12-17 10:06:24   INFO  epoch: 19/24, acc_iter=67023, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:12:21, time_cost(all): 1 day, 0:41:56/6:28:16, loss=0.337417568293736, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=0.9385996245885073, lr=0.06011795824706562
2023-12-17 10:07:31   INFO  epoch: 19/24, acc_iter=67073, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:14:12, time_cost(all): 1 day, 0:43:03/6:33:42, loss=0.337218711507687, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=4.98165121927218, lr=0.059742518231367225
2023-12-17 10:08:38   INFO  epoch: 19/24, acc_iter=67123, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:10:17, time_cost(all): 1 day, 0:44:10/6:49:51, loss=0.337019854721638, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=2.1092382325915295, lr=0.059367078215668834
2023-12-17 10:09:44   INFO  epoch: 19/24, acc_iter=67173, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:46, time_cost(all): 1 day, 0:45:16/6:46:37, loss=0.336820997935589, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=0.6373772856308948, lr=0.058991638199970386
2023-12-17 10:10:51   INFO  epoch: 19/24, acc_iter=67223, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:09:05, time_cost(all): 1 day, 0:46:23/6:36:56, loss=0.33662214114954, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=3.6091489858147905, lr=0.05861619818427205
2023-12-17 10:11:58   INFO  epoch: 19/24, acc_iter=67273, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:09:33, time_cost(all): 1 day, 0:47:30/6:40:54, loss=0.336423284363491, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=4.683471887211267, lr=0.05824075816857366
2023-12-17 10:13:04   INFO  epoch: 19/24, acc_iter=67323, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:08:39, time_cost(all): 1 day, 0:48:36/6:35:29, loss=0.336224427577442, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=2.651303462237463, lr=0.05786531815287527
2023-12-17 10:14:11   INFO  epoch: 19/24, acc_iter=67373, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:07:25, time_cost(all): 1 day, 0:49:43/6:11:42, loss=0.336025570791393, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=1.3007699688099366, lr=0.05748987813717682
2023-12-17 10:15:17   INFO  epoch: 19/24, acc_iter=67423, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:05:21, time_cost(all): 1 day, 0:50:49/6:23:56, loss=0.335826714005344, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.8675736071575697, lr=0.05711443812147843
2023-12-17 10:16:24   INFO  epoch: 19/24, acc_iter=67473, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:04:05, time_cost(all): 1 day, 0:51:56/6:39:43, loss=0.335627857219295, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=4.4728457857255135, lr=0.056738998105780036
2023-12-17 10:17:31   INFO  epoch: 19/24, acc_iter=67523, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:01:25, time_cost(all): 1 day, 0:53:03/6:31:39, loss=0.335429000433246, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.6293170488161366, lr=0.056363558090081645
2023-12-17 10:18:37   INFO  epoch: 19/24, acc_iter=67573, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:59:40, time_cost(all): 1 day, 0:54:09/6:07:11, loss=0.335230143647197, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=3.826062940991582, lr=0.0559881180743832
2023-12-17 10:19:44   INFO  epoch: 19/24, acc_iter=67623, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:00:27, time_cost(all): 1 day, 0:55:16/6:36:06, loss=0.335031286861148, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=4.56019721528448, lr=0.05561267805868486
2023-12-17 10:20:51   INFO  epoch: 19/24, acc_iter=67673, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:59:01, time_cost(all): 1 day, 0:56:23/6:15:21, loss=0.334832430075099, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=3.3589352883214127, lr=0.05523723804298647
2023-12-17 10:21:57   INFO  epoch: 19/24, acc_iter=67723, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:59:41, time_cost(all): 1 day, 0:57:29/6:01:27, loss=0.33463357328905, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=3.6489366655967945, lr=0.05486179802728808
2023-12-17 10:23:04   INFO  epoch: 19/24, acc_iter=67773, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:59:25, time_cost(all): 1 day, 0:58:36/6:02:22, loss=0.334434716503001, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.752433592828185, lr=0.05448635801158963
2023-12-17 10:24:11   INFO  epoch: 19/24, acc_iter=67823, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:58:07, time_cost(all): 1 day, 0:59:43/6:19:39, loss=0.334235859716952, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=3.457573417856284, lr=0.05411091799589124
2023-12-17 10:25:17   INFO  epoch: 19/24, acc_iter=67873, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:53:26, time_cost(all): 1 day, 1:00:49/6:09:33, loss=0.334037002930903, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=4.186628505828842, lr=0.05373547798019285
2023-12-17 10:26:24   INFO  epoch: 19/24, acc_iter=67923, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:54:26, time_cost(all): 1 day, 1:01:56/6:09:11, loss=0.333838146144854, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=4.715195865841354, lr=0.053360037964494456
2023-12-17 10:27:31   INFO  epoch: 19/24, acc_iter=67973, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:51:52, time_cost(all): 1 day, 1:03:03/6:23:59, loss=0.333639289358805, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=4.151060139009136, lr=0.052984597948796064
2023-12-17 10:28:37   INFO  epoch: 19/24, acc_iter=68023, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:51, time_cost(all): 1 day, 1:04:09/6:05:18, loss=0.333440432572756, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=4.9329241233851775, lr=0.05260915793309767
2023-12-17 10:29:44   INFO  epoch: 19/24, acc_iter=68073, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:51:50, time_cost(all): 1 day, 1:05:16/6:07:37, loss=0.333241575786707, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=4.3962010385189565, lr=0.05223371791739928
2023-12-17 10:30:51   INFO  epoch: 19/24, acc_iter=68123, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:50:10, time_cost(all): 1 day, 1:06:23/6:25:02, loss=0.333042719000658, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=3.339804298841045, lr=0.05185827790170089
2023-12-17 10:31:57   INFO  epoch: 19/24, acc_iter=68173, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:46:42, time_cost(all): 1 day, 1:07:29/5:57:38, loss=0.332843862214609, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=2.7717787874076585, lr=0.0514828378860025
2023-12-17 10:33:04   INFO  epoch: 19/24, acc_iter=68223, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:48:29, time_cost(all): 1 day, 1:08:36/5:51:34, loss=0.33264500542856, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=4.819374241792534, lr=0.05110739787030405
2023-12-17 10:34:11   INFO  epoch: 19/24, acc_iter=68273, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:46:16, time_cost(all): 1 day, 1:09:43/5:59:18, loss=0.332446148642511, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=1.017289235798633, lr=0.05073195785460566
2023-12-17 10:35:17   INFO  epoch: 19/24, acc_iter=68323, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:44:26, time_cost(all): 1 day, 1:10:49/6:19:51, loss=0.332247291856462, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=4.146042126798605, lr=0.050356517838907267
2023-12-17 10:36:24   INFO  epoch: 19/24, acc_iter=68373, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:14, time_cost(all): 1 day, 1:11:56/5:59:52, loss=0.332048435070413, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=1.5157321644601363, lr=0.049990121510645816
2023-12-17 10:37:31   INFO  epoch: 19/24, acc_iter=68423, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:42:47, time_cost(all): 1 day, 1:13:03/5:59:01, loss=0.331849578284364, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=1.8786886325908223, lr=0.04979411973774445
2023-12-17 10:38:37   INFO  epoch: 19/24, acc_iter=68473, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:42:22, time_cost(all): 1 day, 1:14:09/5:47:52, loss=0.331650721498315, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=4.523679699466243, lr=0.04959811796484308
2023-12-17 10:39:44   INFO  epoch: 19/24, acc_iter=68523, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:42:16, time_cost(all): 1 day, 1:15:16/5:42:27, loss=0.331451864712266, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=0.5610192987661461, lr=0.0494021161919417
2023-12-17 10:40:51   INFO  epoch: 19/24, acc_iter=68573, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:39:03, time_cost(all): 1 day, 1:16:23/5:44:04, loss=0.331253007926217, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=3.969112219671503, lr=0.04920611441904033
2023-12-17 10:41:57   INFO  epoch: 19/24, acc_iter=68623, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:38:46, time_cost(all): 1 day, 1:17:29/5:54:19, loss=0.331054151140168, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=2.6062538485507645, lr=0.04901011264613896
2023-12-17 10:43:04   INFO  epoch: 19/24, acc_iter=68673, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:36:12, time_cost(all): 1 day, 1:18:36/5:55:42, loss=0.330855294354119, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=2.8790437035049035, lr=0.048814110873237594
2023-12-17 10:44:10   INFO  epoch: 19/24, acc_iter=68723, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:34:39, time_cost(all): 1 day, 1:19:42/6:08:27, loss=0.33065643756807, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=2.7474506640408323, lr=0.048618109100336225
2023-12-17 10:45:17   INFO  epoch: 19/24, acc_iter=68773, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:36:14, time_cost(all): 1 day, 1:20:49/5:58:48, loss=0.330457580782021, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=3.547043352069009, lr=0.048422107327434856
2023-12-17 10:46:24   INFO  epoch: 19/24, acc_iter=68823, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:35:06, time_cost(all): 1 day, 1:21:56/5:51:36, loss=0.330258723995972, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=3.480203004761977, lr=0.04822610555453349
2023-12-17 10:47:30   INFO  epoch: 19/24, acc_iter=68873, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:31:00, time_cost(all): 1 day, 1:23:02/5:46:09, loss=0.330059867209923, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=3.3767508241219195, lr=0.04803010378163211
2023-12-17 10:48:37   INFO  epoch: 19/24, acc_iter=68923, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:32:35, time_cost(all): 1 day, 1:24:09/6:01:21, loss=0.329861010423874, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=1.329651802872322, lr=0.04783410200873074
2023-12-17 10:49:44   INFO  epoch: 19/24, acc_iter=68973, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:59, time_cost(all): 1 day, 1:25:16/5:34:48, loss=0.329662153637825, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=2.006856254538523, lr=0.04763810023582937
2023-12-17 10:50:50   INFO  epoch: 19/24, acc_iter=69023, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:31, time_cost(all): 1 day, 1:26:22/6:00:09, loss=0.329463296851776, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=2.3230477408773353, lr=0.047442098462928003
2023-12-17 10:51:57   INFO  epoch: 19/24, acc_iter=69073, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:14, time_cost(all): 1 day, 1:27:29/5:51:49, loss=0.329264440065727, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=2.396660828006916, lr=0.047246096690026634
2023-12-17 10:53:04   INFO  epoch: 19/24, acc_iter=69123, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:25:58, time_cost(all): 1 day, 1:28:36/5:42:49, loss=0.329065583279678, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=2.336549784075714, lr=0.047050094917125265
2023-12-17 10:54:10   INFO  epoch: 19/24, acc_iter=69173, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:30, time_cost(all): 1 day, 1:29:42/5:31:02, loss=0.328866726493629, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=4.285727102942971, lr=0.046854093144223896
2023-12-17 10:55:17   INFO  epoch: 19/24, acc_iter=69223, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:23:40, time_cost(all): 1 day, 1:30:49/5:41:17, loss=0.32866786970758, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=0.9750774632027626, lr=0.04665809137132252
2023-12-17 10:56:24   INFO  epoch: 19/24, acc_iter=69273, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:24:40, time_cost(all): 1 day, 1:31:56/5:59:47, loss=0.328469012921531, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=3.264792482532232, lr=0.04646208959842115
2023-12-17 10:57:30   INFO  epoch: 19/24, acc_iter=69323, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:21:54, time_cost(all): 1 day, 1:33:02/5:38:48, loss=0.328270156135482, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=3.6351428275176825, lr=0.04626608782551978
2023-12-17 10:58:37   INFO  epoch: 19/24, acc_iter=69373, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:49, time_cost(all): 1 day, 1:34:09/5:54:48, loss=0.328071299349433, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=1.9275502068492085, lr=0.04607008605261841
2023-12-17 10:59:44   INFO  epoch: 19/24, acc_iter=69423, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:46, time_cost(all): 1 day, 1:35:16/5:49:04, loss=0.327872442563384, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=1.8981820522727204, lr=0.045874084279717044
2023-12-17 11:00:50   INFO  epoch: 19/24, acc_iter=69473, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:46, time_cost(all): 1 day, 1:36:22/5:37:06, loss=0.327673585777335, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=1.887878309085249, lr=0.045678082506815675
2023-12-17 11:01:57   INFO  epoch: 19/24, acc_iter=69523, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:37, time_cost(all): 1 day, 1:37:29/5:35:20, loss=0.327474728991286, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=3.679570402264993, lr=0.0454820807339143
2023-12-17 11:03:04   INFO  epoch: 19/24, acc_iter=69573, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:53, time_cost(all): 1 day, 1:38:36/5:34:30, loss=0.327275872205237, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=4.130614751258788, lr=0.04528607896101293
2023-12-17 11:04:10   INFO  epoch: 19/24, acc_iter=69623, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:27, time_cost(all): 1 day, 1:39:42/5:46:20, loss=0.327077015419188, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=3.595353298764442, lr=0.04509007718811156
2023-12-17 11:05:17   INFO  epoch: 19/24, acc_iter=69673, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:56, time_cost(all): 1 day, 1:40:49/5:49:03, loss=0.326878158633139, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=4.2149285149238285, lr=0.04489407541521019
2023-12-17 11:06:24   INFO  epoch: 19/24, acc_iter=69723, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:07, time_cost(all): 1 day, 1:41:56/5:21:34, loss=0.32667930184709, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=3.937978384282572, lr=0.04469807364230882
2023-12-17 11:07:30   INFO  epoch: 19/24, acc_iter=69773, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:13:08, time_cost(all): 1 day, 1:43:02/5:19:58, loss=0.326480445061041, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=2.867657367367767, lr=0.04450207186940745
2023-12-17 11:08:37   INFO  epoch: 19/24, acc_iter=69823, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:37, time_cost(all): 1 day, 1:44:09/5:37:45, loss=0.326281588274992, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=2.1010351542306775, lr=0.044306070096506084
2023-12-17 11:09:44   INFO  epoch: 19/24, acc_iter=69873, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:09:55, time_cost(all): 1 day, 1:45:16/5:40:51, loss=0.326082731488943, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.488916077675898, lr=0.044110068323604715
2023-12-17 11:10:50   INFO  epoch: 19/24, acc_iter=69923, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:08:49, time_cost(all): 1 day, 1:46:22/5:28:08, loss=0.325883874702894, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=3.5514403468765026, lr=0.04391406655070334
2023-12-17 11:11:57   INFO  epoch: 19/24, acc_iter=69973, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:07:45, time_cost(all): 1 day, 1:47:29/5:34:52, loss=0.325685017916845, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=2.846310344203527, lr=0.04371806477780197
2023-12-17 11:13:03   INFO  epoch: 19/24, acc_iter=70023, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:22, time_cost(all): 1 day, 1:48:35/5:32:14, loss=0.325486161130796, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=0.9363218750903897, lr=0.0435220630049006
2023-12-17 11:14:10   INFO  epoch: 19/24, acc_iter=70073, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:46, time_cost(all): 1 day, 1:49:42/5:14:01, loss=0.325287304344747, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=0.9163312040808317, lr=0.04332606123199923
2023-12-17 11:15:17   INFO  epoch: 19/24, acc_iter=70123, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:35, time_cost(all): 1 day, 1:50:49/5:40:04, loss=0.325088447558698, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=4.220247647562788, lr=0.04313005945909786
2023-12-17 11:16:23   INFO  epoch: 19/24, acc_iter=70173, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:32, time_cost(all): 1 day, 1:51:55/5:35:28, loss=0.324889590772649, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=4.732935598868515, lr=0.042934057686196486
2023-12-17 11:17:30   INFO  epoch: 19/24, acc_iter=70223, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:29, time_cost(all): 1 day, 1:53:02/5:31:28, loss=0.3246907339866, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=1.0949203038128834, lr=0.04273805591329512
2023-12-17 11:18:37   INFO  epoch: 19/24, acc_iter=70273, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:27, time_cost(all): 1 day, 1:54:09/5:05:16, loss=0.324491877200551, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=0.9424594568649516, lr=0.04254205414039375
2023-12-17 11:19:43   INFO  epoch: 19/24, acc_iter=70323, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:23, time_cost(all): 1 day, 1:55:15/5:32:25, loss=0.324293020414502, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=3.0797936652535585, lr=0.04234605236749238
2023-12-17 11:20:50   INFO  epoch: 20/24, acc_iter=70390, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:15:40, time_cost(all): 1 day, 1:56:22/5:11:44, loss=0.324026552321196, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=4.197651875402283, lr=0.04208340999180454
2023-12-17 11:21:57   INFO  epoch: 20/24, acc_iter=70440, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:19:34, time_cost(all): 1 day, 1:57:29/5:13:12, loss=0.323827695535147, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=3.759908887585598, lr=0.041887408218903174
2023-12-17 11:23:03   INFO  epoch: 20/24, acc_iter=70490, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:16:21, time_cost(all): 1 day, 1:58:35/5:29:17, loss=0.323628838749098, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=3.465273819001239, lr=0.041691406446001805
2023-12-17 11:24:10   INFO  epoch: 20/24, acc_iter=70540, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:16:41, time_cost(all): 1 day, 1:59:42/5:21:31, loss=0.323429981963049, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=3.2600166399993866, lr=0.041495404673100436
2023-12-17 11:25:17   INFO  epoch: 20/24, acc_iter=70590, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:12:18, time_cost(all): 1 day, 2:00:49/5:23:32, loss=0.323231125177, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=1.031378698359634, lr=0.04129940290019906
2023-12-17 11:26:23   INFO  epoch: 20/24, acc_iter=70640, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:11:43, time_cost(all): 1 day, 2:01:55/5:04:05, loss=0.323032268390951, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=3.9665537220391744, lr=0.04110340112729769
2023-12-17 11:27:30   INFO  epoch: 20/24, acc_iter=70690, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:59, time_cost(all): 1 day, 2:03:02/5:15:07, loss=0.322833411604902, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=2.61535079316495, lr=0.04090739935439632
2023-12-17 11:28:37   INFO  epoch: 20/24, acc_iter=70740, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:07:26, time_cost(all): 1 day, 2:04:09/5:02:08, loss=0.322634554818853, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=3.647327327353558, lr=0.04071139758149495
2023-12-17 11:29:43   INFO  epoch: 20/24, acc_iter=70790, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:05:06, time_cost(all): 1 day, 2:05:15/5:07:26, loss=0.322435698032804, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=1.2418940238342429, lr=0.040515395808593584
2023-12-17 11:30:50   INFO  epoch: 20/24, acc_iter=70840, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:08:30, time_cost(all): 1 day, 2:06:22/5:04:32, loss=0.322236841246755, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=1.6325150452046289, lr=0.040319394035692215
2023-12-17 11:31:57   INFO  epoch: 20/24, acc_iter=70890, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:07:21, time_cost(all): 1 day, 2:07:29/5:02:40, loss=0.322037984460706, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=1.099718454152741, lr=0.040123392262790845
2023-12-17 11:33:03   INFO  epoch: 20/24, acc_iter=70940, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:04:07, time_cost(all): 1 day, 2:08:35/4:53:54, loss=0.321839127674657, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=3.9542361013202005, lr=0.03992739048988947
2023-12-17 11:34:10   INFO  epoch: 20/24, acc_iter=70990, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:05:39, time_cost(all): 1 day, 2:09:42/5:08:25, loss=0.321640270888608, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=1.109759593631718, lr=0.0397313887169881
2023-12-17 11:35:17   INFO  epoch: 20/24, acc_iter=71040, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:00:28, time_cost(all): 1 day, 2:10:49/5:10:19, loss=0.321441414102559, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=1.3247053186445283, lr=0.03953538694408673
2023-12-17 11:36:23   INFO  epoch: 20/24, acc_iter=71090, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:58:39, time_cost(all): 1 day, 2:11:55/4:50:48, loss=0.32124255731651, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=3.7044278255132728, lr=0.03933938517118536
2023-12-17 11:37:30   INFO  epoch: 20/24, acc_iter=71140, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:00:06, time_cost(all): 1 day, 2:13:02/5:06:52, loss=0.321043700530461, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=3.1008636082452297, lr=0.03914338339828399
2023-12-17 11:38:37   INFO  epoch: 20/24, acc_iter=71190, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/1:00:23, time_cost(all): 1 day, 2:14:09/4:49:11, loss=0.320844843744412, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=4.074374665859054, lr=0.038947381625382624
2023-12-17 11:39:43   INFO  epoch: 20/24, acc_iter=71240, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:58:13, time_cost(all): 1 day, 2:15:15/4:52:35, loss=0.320645986958363, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.93(1.03), norm=3.7257221236905766, lr=0.038751379852481255
2023-12-17 11:40:50   INFO  epoch: 20/24, acc_iter=71290, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:56:28, time_cost(all): 1 day, 2:16:22/5:06:15, loss=0.320447130172314, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=0.7306478132163281, lr=0.038555378079579886
2023-12-17 11:41:57   INFO  epoch: 20/24, acc_iter=71340, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:57:49, time_cost(all): 1 day, 2:17:29/5:02:24, loss=0.320248273386265, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=2.946594559358698, lr=0.03835937630667851
2023-12-17 11:43:03   INFO  epoch: 20/24, acc_iter=71390, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:55:44, time_cost(all): 1 day, 2:18:35/5:10:38, loss=0.320049416600216, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=0.7291658804465462, lr=0.03816337453377714
2023-12-17 11:44:10   INFO  epoch: 20/24, acc_iter=71440, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:53:50, time_cost(all): 1 day, 2:19:42/5:06:02, loss=0.319850559814167, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=4.627122049222542, lr=0.03796737276087577
2023-12-17 11:45:16   INFO  epoch: 20/24, acc_iter=71490, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:52:14, time_cost(all): 1 day, 2:20:48/4:57:48, loss=0.319651703028118, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=2.253219450779869, lr=0.0377713709879744
2023-12-17 11:46:23   INFO  epoch: 20/24, acc_iter=71540, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:49:07, time_cost(all): 1 day, 2:21:55/4:44:15, loss=0.319452846242069, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=1.116730296219462, lr=0.037575369215073026
2023-12-17 11:47:30   INFO  epoch: 20/24, acc_iter=71590, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:51:22, time_cost(all): 1 day, 2:23:02/4:38:08, loss=0.31925398945602, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=4.286849685307944, lr=0.037379367442171664
2023-12-17 11:48:36   INFO  epoch: 20/24, acc_iter=71640, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:47:49, time_cost(all): 1 day, 2:24:08/4:55:29, loss=0.319055132669971, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=2.534422961232431, lr=0.03718336566927029
2023-12-17 11:49:43   INFO  epoch: 20/24, acc_iter=71690, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:10, time_cost(all): 1 day, 2:25:15/4:59:58, loss=0.318856275883922, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=1.7842230042129494, lr=0.03698736389636892
2023-12-17 11:50:50   INFO  epoch: 20/24, acc_iter=71740, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:45:42, time_cost(all): 1 day, 2:26:22/4:34:54, loss=0.318657419097873, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=1.6326207825579508, lr=0.03679136212346755
2023-12-17 11:51:56   INFO  epoch: 20/24, acc_iter=71790, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:45:18, time_cost(all): 1 day, 2:27:28/4:57:02, loss=0.318458562311824, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=1.3105049762579992, lr=0.03659536035056618
2023-12-17 11:53:03   INFO  epoch: 20/24, acc_iter=71840, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:42:42, time_cost(all): 1 day, 2:28:35/4:59:03, loss=0.318259705525775, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=2.200798598499764, lr=0.03639935857766481
2023-12-17 11:54:10   INFO  epoch: 20/24, acc_iter=71890, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:13, time_cost(all): 1 day, 2:29:42/4:52:13, loss=0.318060848739726, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=3.249194762512189, lr=0.03620335680476344
2023-12-17 11:55:16   INFO  epoch: 20/24, acc_iter=71940, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:42:46, time_cost(all): 1 day, 2:30:48/4:56:40, loss=0.317861991953677, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=3.0337305700479438, lr=0.036007355031862066
2023-12-17 11:56:23   INFO  epoch: 20/24, acc_iter=71990, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:43:26, time_cost(all): 1 day, 2:31:55/4:57:22, loss=0.317663135167628, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=3.5302196516777284, lr=0.035811353258960704
2023-12-17 11:57:30   INFO  epoch: 20/24, acc_iter=72040, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:38:47, time_cost(all): 1 day, 2:33:02/4:40:44, loss=0.317464278381579, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.03(1.03), norm=0.804340066742402, lr=0.03561535148605933
2023-12-17 11:58:36   INFO  epoch: 20/24, acc_iter=72090, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:41:03, time_cost(all): 1 day, 2:34:08/4:40:29, loss=0.31726542159553, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=1.8199726371736478, lr=0.03541934971315796
2023-12-17 11:59:43   INFO  epoch: 20/24, acc_iter=72140, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:36:46, time_cost(all): 1 day, 2:35:15/4:38:18, loss=0.317066564809481, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=4.587338811445255, lr=0.03522334794025659
2023-12-17 12:00:50   INFO  epoch: 20/24, acc_iter=72190, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:38:42, time_cost(all): 1 day, 2:36:22/4:49:12, loss=0.316867708023432, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=2.3963329495885657, lr=0.03502734616735522
2023-12-17 12:01:56   INFO  epoch: 20/24, acc_iter=72240, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:36:58, time_cost(all): 1 day, 2:37:28/4:42:01, loss=0.316668851237383, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=2.8871834029044345, lr=0.034831344394453845
2023-12-17 12:03:03   INFO  epoch: 20/24, acc_iter=72290, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:35:21, time_cost(all): 1 day, 2:38:35/4:26:20, loss=0.316469994451334, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=2.7275625432017003, lr=0.034635342621552476
2023-12-17 12:04:10   INFO  epoch: 20/24, acc_iter=72340, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:32:20, time_cost(all): 1 day, 2:39:42/4:39:09, loss=0.316271137665285, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=3.4410473060242714, lr=0.03443934084865111
2023-12-17 12:05:16   INFO  epoch: 20/24, acc_iter=72390, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:32:16, time_cost(all): 1 day, 2:40:48/4:25:17, loss=0.316072280879236, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=0.6473854768232452, lr=0.03424333907574974
2023-12-17 12:06:23   INFO  epoch: 20/24, acc_iter=72440, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:32:12, time_cost(all): 1 day, 2:41:55/4:25:40, loss=0.315873424093187, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=0.8406592028963774, lr=0.03404733730284837
2023-12-17 12:07:30   INFO  epoch: 20/24, acc_iter=72490, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:31:21, time_cost(all): 1 day, 2:43:02/4:38:03, loss=0.315674567307138, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=3.032041534491866, lr=0.033851335529947
2023-12-17 12:08:36   INFO  epoch: 20/24, acc_iter=72540, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:28:01, time_cost(all): 1 day, 2:44:08/4:24:51, loss=0.315475710521089, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=1.1786936454652204, lr=0.03365533375704563
2023-12-17 12:09:43   INFO  epoch: 20/24, acc_iter=72590, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:29:13, time_cost(all): 1 day, 2:45:15/4:18:09, loss=0.31527685373504, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=2.6615598955872453, lr=0.03345933198414426
2023-12-17 12:10:50   INFO  epoch: 20/24, acc_iter=72640, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:27:10, time_cost(all): 1 day, 2:46:22/4:38:45, loss=0.315077996948991, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=2.680400377901588, lr=0.03326333021124289
2023-12-17 12:11:56   INFO  epoch: 20/24, acc_iter=72690, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:08, time_cost(all): 1 day, 2:47:28/4:22:58, loss=0.314879140162942, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=2.164861742978741, lr=0.033067328438341516
2023-12-17 12:13:03   INFO  epoch: 20/24, acc_iter=72740, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:08, time_cost(all): 1 day, 2:48:35/4:35:13, loss=0.314680283376893, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=0.6196945035936298, lr=0.03287132666544015
2023-12-17 12:14:09   INFO  epoch: 20/24, acc_iter=72790, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:07, time_cost(all): 1 day, 2:49:41/4:18:07, loss=0.314481426590844, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=0.9720690317808547, lr=0.03267532489253878
2023-12-17 12:15:16   INFO  epoch: 20/24, acc_iter=72840, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:42, time_cost(all): 1 day, 2:50:48/4:21:54, loss=0.314282569804795, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=3.1224087507163403, lr=0.0324793231196374
2023-12-17 12:16:23   INFO  epoch: 20/24, acc_iter=72890, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:21:58, time_cost(all): 1 day, 2:51:55/4:30:55, loss=0.314083713018746, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=0.9073364594354841, lr=0.03228332134673604
2023-12-17 12:17:29   INFO  epoch: 20/24, acc_iter=72940, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:13, time_cost(all): 1 day, 2:53:01/4:10:42, loss=0.313884856232697, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=1.2316451457155366, lr=0.03208731957383466
2023-12-17 12:18:36   INFO  epoch: 20/24, acc_iter=72990, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:19, time_cost(all): 1 day, 2:54:08/4:10:42, loss=0.313685999446648, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=4.64901086236102, lr=0.031891317800933294
2023-12-17 12:19:43   INFO  epoch: 20/24, acc_iter=73040, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:52, time_cost(all): 1 day, 2:55:15/4:11:35, loss=0.313487142660599, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=2.9940359057180213, lr=0.031695316028031925
2023-12-17 12:20:49   INFO  epoch: 20/24, acc_iter=73090, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:47, time_cost(all): 1 day, 2:56:21/4:17:04, loss=0.31328828587455, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=4.373636212952787, lr=0.031499314255130556
2023-12-17 12:21:56   INFO  epoch: 20/24, acc_iter=73140, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:46, time_cost(all): 1 day, 2:57:28/4:14:17, loss=0.313089429088501, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=2.1648888107623403, lr=0.03130331248222919
2023-12-17 12:23:03   INFO  epoch: 20/24, acc_iter=73190, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:19, time_cost(all): 1 day, 2:58:35/4:11:18, loss=0.312890572302452, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=0.5722805419567744, lr=0.031107310709327818
2023-12-17 12:24:09   INFO  epoch: 20/24, acc_iter=73240, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:54, time_cost(all): 1 day, 2:59:41/4:08:13, loss=0.312691715516403, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=2.0781135663665324, lr=0.030911308936426445
2023-12-17 12:25:16   INFO  epoch: 20/24, acc_iter=73290, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:13:01, time_cost(all): 1 day, 3:00:48/4:23:35, loss=0.312492858730354, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=2.0691616483680164, lr=0.030715307163525076
2023-12-17 12:26:23   INFO  epoch: 20/24, acc_iter=73340, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:53, time_cost(all): 1 day, 3:01:55/4:25:35, loss=0.312294001944305, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=2.0587635834747466, lr=0.030519305390623707
2023-12-17 12:27:29   INFO  epoch: 20/24, acc_iter=73390, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:00, time_cost(all): 1 day, 3:03:01/4:06:23, loss=0.312095145158256, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=4.872074560031613, lr=0.030323303617722334
2023-12-17 12:28:36   INFO  epoch: 20/24, acc_iter=73440, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:39, time_cost(all): 1 day, 3:04:08/4:15:43, loss=0.311896288372207, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=2.075388562456597, lr=0.030127301844820965
2023-12-17 12:29:43   INFO  epoch: 20/24, acc_iter=73490, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:07:49, time_cost(all): 1 day, 3:05:15/4:09:03, loss=0.311697431586158, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=3.520439344457756, lr=0.029931300071919596
2023-12-17 12:30:49   INFO  epoch: 20/24, acc_iter=73540, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:01, time_cost(all): 1 day, 3:06:21/4:09:05, loss=0.311498574800109, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=3.86802613244123, lr=0.029735298299018227
2023-12-17 12:31:56   INFO  epoch: 20/24, acc_iter=73590, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:56, time_cost(all): 1 day, 3:07:28/4:19:20, loss=0.31129971801406, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=4.011783489107986, lr=0.029539296526116855
2023-12-17 12:33:03   INFO  epoch: 20/24, acc_iter=73640, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:42, time_cost(all): 1 day, 3:08:35/4:15:35, loss=0.311100861228011, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=3.47721044182877, lr=0.029343294753215485
2023-12-17 12:34:09   INFO  epoch: 20/24, acc_iter=73690, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:32, time_cost(all): 1 day, 3:09:41/3:57:32, loss=0.310902004441962, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=2.0223186673866094, lr=0.029147292980314116
2023-12-17 12:35:16   INFO  epoch: 20/24, acc_iter=73740, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:41, time_cost(all): 1 day, 3:10:48/4:14:45, loss=0.310703147655913, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=4.208277493618682, lr=0.028951291207412744
2023-12-17 12:36:23   INFO  epoch: 20/24, acc_iter=73790, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:25, time_cost(all): 1 day, 3:11:55/3:59:07, loss=0.310504290869864, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=4.262489760138011, lr=0.028755289434511375
2023-12-17 12:37:29   INFO  epoch: 20/24, acc_iter=73840, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 1 day, 3:13:01/4:11:20, loss=0.310305434083815, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=4.437231767610463, lr=0.028559287661610006
2023-12-17 12:38:36   INFO  epoch: 21/24, acc_iter=73907, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:19:01, time_cost(all): 1 day, 3:14:08/4:08:29, loss=0.310038965990509, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=2.939956412623688, lr=0.028296645285922167
2023-12-17 12:39:43   INFO  epoch: 21/24, acc_iter=73957, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:17:09, time_cost(all): 1 day, 3:15:15/4:07:13, loss=0.30984010920446, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=1.5597262524858047, lr=0.0281006435130208
2023-12-17 12:40:49   INFO  epoch: 21/24, acc_iter=74007, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:16:57, time_cost(all): 1 day, 3:16:21/3:58:15, loss=0.309641252418411, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=3.341373249723034, lr=0.02790464174011943
2023-12-17 12:41:56   INFO  epoch: 21/24, acc_iter=74057, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:16:09, time_cost(all): 1 day, 3:17:28/4:00:19, loss=0.309442395632362, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=1.9176883972151946, lr=0.02770863996721806
2023-12-17 12:43:02   INFO  epoch: 21/24, acc_iter=74107, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:14:22, time_cost(all): 1 day, 3:18:34/4:02:06, loss=0.309243538846313, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=1.4973332933604488, lr=0.027512638194316687
2023-12-17 12:44:09   INFO  epoch: 21/24, acc_iter=74157, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:13:38, time_cost(all): 1 day, 3:19:41/4:03:09, loss=0.309044682060264, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=1.1787180763952185, lr=0.027316636421415318
2023-12-17 12:45:16   INFO  epoch: 21/24, acc_iter=74207, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:50, time_cost(all): 1 day, 3:20:48/3:47:50, loss=0.308845825274215, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.7593327992842251, lr=0.027120634648513945
2023-12-17 12:46:22   INFO  epoch: 21/24, acc_iter=74257, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:10:56, time_cost(all): 1 day, 3:21:54/3:48:08, loss=0.308646968488166, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=4.235528765368671, lr=0.02692463287561258
2023-12-17 12:47:29   INFO  epoch: 21/24, acc_iter=74307, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:08:07, time_cost(all): 1 day, 3:23:01/3:41:13, loss=0.308448111702117, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=2.9173270328901517, lr=0.02672863110271121
2023-12-17 12:48:36   INFO  epoch: 21/24, acc_iter=74357, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:07:59, time_cost(all): 1 day, 3:24:08/3:59:05, loss=0.308249254916068, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=4.959320657297798, lr=0.026532629329809838
2023-12-17 12:49:42   INFO  epoch: 21/24, acc_iter=74407, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:06:50, time_cost(all): 1 day, 3:25:14/3:41:29, loss=0.308050398130019, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=4.830182640544032, lr=0.02633662755690847
2023-12-17 12:50:49   INFO  epoch: 21/24, acc_iter=74457, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:07:48, time_cost(all): 1 day, 3:26:21/3:45:20, loss=0.30785154134397, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=3.7652652860578586, lr=0.026140625784007096
2023-12-17 12:51:56   INFO  epoch: 21/24, acc_iter=74507, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:05:30, time_cost(all): 1 day, 3:27:28/3:50:33, loss=0.307652684557921, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=1.8341365963011014, lr=0.025944624011105727
2023-12-17 12:53:02   INFO  epoch: 21/24, acc_iter=74557, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:02:59, time_cost(all): 1 day, 3:28:34/3:53:34, loss=0.307453827771872, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=0.6501881138127539, lr=0.025748622238204354
2023-12-17 12:54:09   INFO  epoch: 21/24, acc_iter=74607, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:01:55, time_cost(all): 1 day, 3:29:41/3:54:00, loss=0.307254970985823, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=2.5877664729151912, lr=0.02555262046530299
2023-12-17 12:55:16   INFO  epoch: 21/24, acc_iter=74657, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:01:36, time_cost(all): 1 day, 3:30:48/3:45:25, loss=0.307056114199774, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=0.9281578616930113, lr=0.02535661869240162
2023-12-17 12:56:22   INFO  epoch: 21/24, acc_iter=74707, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:57:47, time_cost(all): 1 day, 3:31:54/3:46:50, loss=0.306857257413725, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=4.340378189592041, lr=0.025160616919500247
2023-12-17 12:57:29   INFO  epoch: 21/24, acc_iter=74757, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:55:16, time_cost(all): 1 day, 3:33:01/3:33:43, loss=0.306658400627676, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=4.991265597950422, lr=0.024964615146598878
2023-12-17 12:58:36   INFO  epoch: 21/24, acc_iter=74807, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:55:22, time_cost(all): 1 day, 3:34:08/3:49:21, loss=0.306459543841627, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=4.739236379786924, lr=0.024768613373697505
2023-12-17 12:59:42   INFO  epoch: 21/24, acc_iter=74857, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:57:45, time_cost(all): 1 day, 3:35:14/3:31:55, loss=0.306260687055578, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=0.7273062358250642, lr=0.024572611600796136
2023-12-17 13:00:49   INFO  epoch: 21/24, acc_iter=74907, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:54:26, time_cost(all): 1 day, 3:36:21/3:34:52, loss=0.306061830269529, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=0.581798520085447, lr=0.024376609827894764
2023-12-17 13:01:56   INFO  epoch: 21/24, acc_iter=74957, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:51:12, time_cost(all): 1 day, 3:37:28/3:33:58, loss=0.30586297348348, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=3.249955421012373, lr=0.024180608054993395
2023-12-17 13:03:02   INFO  epoch: 21/24, acc_iter=75007, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:55:03, time_cost(all): 1 day, 3:38:34/3:43:06, loss=0.305664116697431, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=3.378314467150576, lr=0.02398460628209203
2023-12-17 13:04:09   INFO  epoch: 21/24, acc_iter=75057, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:53:19, time_cost(all): 1 day, 3:39:41/3:25:39, loss=0.305465259911382, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=0.6245460595131314, lr=0.023788604509190656
2023-12-17 13:05:16   INFO  epoch: 21/24, acc_iter=75107, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:51:00, time_cost(all): 1 day, 3:40:48/3:24:13, loss=0.305266403125333, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=4.417863525305091, lr=0.023592602736289287
2023-12-17 13:06:22   INFO  epoch: 21/24, acc_iter=75157, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:50:59, time_cost(all): 1 day, 3:41:54/3:39:06, loss=0.305067546339284, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=3.705983947130985, lr=0.023396600963387915
2023-12-17 13:07:29   INFO  epoch: 21/24, acc_iter=75207, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:28, time_cost(all): 1 day, 3:43:01/3:33:24, loss=0.304868689553235, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=2.6814083704821665, lr=0.023200599190486546
2023-12-17 13:08:36   INFO  epoch: 21/24, acc_iter=75257, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:45:27, time_cost(all): 1 day, 3:44:08/3:25:08, loss=0.304669832767186, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=4.306274656537759, lr=0.023004597417585173
2023-12-17 13:09:42   INFO  epoch: 21/24, acc_iter=75307, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:43:43, time_cost(all): 1 day, 3:45:14/3:36:30, loss=0.304470975981137, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=2.211366246195402, lr=0.022808595644683804
2023-12-17 13:10:49   INFO  epoch: 21/24, acc_iter=75357, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:46:09, time_cost(all): 1 day, 3:46:21/3:21:27, loss=0.304272119195088, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=1.8392212272774577, lr=0.022612593871782438
2023-12-17 13:11:55   INFO  epoch: 21/24, acc_iter=75407, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:54, time_cost(all): 1 day, 3:47:27/3:24:21, loss=0.304073262409039, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=2.6531229755988437, lr=0.022416592098881066
2023-12-17 13:13:02   INFO  epoch: 21/24, acc_iter=75457, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:43:53, time_cost(all): 1 day, 3:48:34/3:19:28, loss=0.30387440562299, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=1.8075333513452052, lr=0.022220590325979696
2023-12-17 13:14:09   INFO  epoch: 21/24, acc_iter=75507, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:42:52, time_cost(all): 1 day, 3:49:41/3:21:03, loss=0.303675548836941, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=3.4241818601269896, lr=0.022024588553078324
2023-12-17 13:15:15   INFO  epoch: 21/24, acc_iter=75557, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:12, time_cost(all): 1 day, 3:50:47/3:18:19, loss=0.303476692050892, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=3.239087542221105, lr=0.021828586780176955
2023-12-17 13:16:22   INFO  epoch: 21/24, acc_iter=75607, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:38:04, time_cost(all): 1 day, 3:51:54/3:25:37, loss=0.303277835264843, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=0.6423491503863914, lr=0.021632585007275582
2023-12-17 13:17:29   INFO  epoch: 21/24, acc_iter=75657, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:37:40, time_cost(all): 1 day, 3:53:01/3:25:55, loss=0.303078978478794, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=4.8934765104500055, lr=0.021436583234374213
2023-12-17 13:18:35   INFO  epoch: 21/24, acc_iter=75707, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:35:50, time_cost(all): 1 day, 3:54:07/3:29:03, loss=0.302880121692745, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=0.6232845905723501, lr=0.02124058146147284
2023-12-17 13:19:42   INFO  epoch: 21/24, acc_iter=75757, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:37:05, time_cost(all): 1 day, 3:55:14/3:21:45, loss=0.302681264906696, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=1.924007955139474, lr=0.021044579688571475
2023-12-17 13:20:49   INFO  epoch: 21/24, acc_iter=75807, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:34:18, time_cost(all): 1 day, 3:56:21/3:27:59, loss=0.302482408120647, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=2.262744740015225, lr=0.020848577915670106
2023-12-17 13:21:55   INFO  epoch: 21/24, acc_iter=75857, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:53, time_cost(all): 1 day, 3:57:27/3:12:18, loss=0.302283551334598, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=4.00309535231049, lr=0.020652576142768733
2023-12-17 13:23:02   INFO  epoch: 21/24, acc_iter=75907, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:32:50, time_cost(all): 1 day, 3:58:34/3:23:27, loss=0.302084694548549, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=4.622830060861229, lr=0.020456574369867364
2023-12-17 13:24:09   INFO  epoch: 21/24, acc_iter=75957, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:30:55, time_cost(all): 1 day, 3:59:41/3:14:10, loss=0.3018858377625, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.83(1.03), norm=3.0727974207834614, lr=0.02026057259696599
2023-12-17 13:25:15   INFO  epoch: 21/24, acc_iter=76007, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:31:02, time_cost(all): 1 day, 4:00:47/3:11:16, loss=0.301686980976451, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=4.157767895741935, lr=0.020064570824064622
2023-12-17 13:26:22   INFO  epoch: 21/24, acc_iter=76057, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:30:27, time_cost(all): 1 day, 4:01:54/3:13:54, loss=0.301488124190402, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=4.788435078054781, lr=0.01986856905116325
2023-12-17 13:27:29   INFO  epoch: 21/24, acc_iter=76107, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:46, time_cost(all): 1 day, 4:03:01/3:15:38, loss=0.301289267404353, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=3.8181987639942756, lr=0.019672567278261884
2023-12-17 13:28:35   INFO  epoch: 21/24, acc_iter=76157, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:50, time_cost(all): 1 day, 4:04:07/3:06:07, loss=0.301090410618304, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=0.7471284933511076, lr=0.019476565505360515
2023-12-17 13:29:42   INFO  epoch: 21/24, acc_iter=76207, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:25:22, time_cost(all): 1 day, 4:05:14/3:05:56, loss=0.300891553832255, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=2.7131666527706058, lr=0.019280563732459143
2023-12-17 13:30:49   INFO  epoch: 21/24, acc_iter=76257, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:23:43, time_cost(all): 1 day, 4:06:21/3:00:57, loss=0.300692697046206, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=4.609824535281879, lr=0.019084561959557773
2023-12-17 13:31:55   INFO  epoch: 21/24, acc_iter=76307, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:05, time_cost(all): 1 day, 4:07:27/3:08:16, loss=0.300493840260157, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=1.713085766437517, lr=0.0188885601866564
2023-12-17 13:33:02   INFO  epoch: 21/24, acc_iter=76357, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:23:00, time_cost(all): 1 day, 4:08:34/3:15:57, loss=0.300294983474108, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.1668896811003393, lr=0.018692558413755028
2023-12-17 13:34:09   INFO  epoch: 21/24, acc_iter=76407, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:21:47, time_cost(all): 1 day, 4:09:41/3:00:38, loss=0.300096126688059, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=0.6827897478117132, lr=0.01849655664085366
2023-12-17 13:35:15   INFO  epoch: 21/24, acc_iter=76457, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:20:43, time_cost(all): 1 day, 4:10:47/3:01:10, loss=0.29989726990201, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=3.435681024511318, lr=0.01830055486795229
2023-12-17 13:36:22   INFO  epoch: 21/24, acc_iter=76507, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:38, time_cost(all): 1 day, 4:11:54/3:00:16, loss=0.299698413115961, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=2.503601239340498, lr=0.01810455309505092
2023-12-17 13:37:29   INFO  epoch: 21/24, acc_iter=76557, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:07, time_cost(all): 1 day, 4:13:01/2:56:17, loss=0.299499556329912, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=1.4759074751113244, lr=0.017908551322149552
2023-12-17 13:38:35   INFO  epoch: 21/24, acc_iter=76607, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:37, time_cost(all): 1 day, 4:14:07/3:06:54, loss=0.299300699543863, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=4.459256260352803, lr=0.017712549549248183
2023-12-17 13:39:42   INFO  epoch: 21/24, acc_iter=76657, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:15:30, time_cost(all): 1 day, 4:15:14/2:56:36, loss=0.299101842757814, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.23(1.03), norm=1.0348284418374212, lr=0.017516547776346814
2023-12-17 13:40:49   INFO  epoch: 21/24, acc_iter=76707, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:11, time_cost(all): 1 day, 4:16:21/2:53:00, loss=0.298902985971765, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=3.594456693385765, lr=0.017320546003445438
2023-12-17 13:41:55   INFO  epoch: 21/24, acc_iter=76757, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:09, time_cost(all): 1 day, 4:17:27/2:52:37, loss=0.298704129185716, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=3.6110235734972234, lr=0.01712454423054407
2023-12-17 13:43:02   INFO  epoch: 21/24, acc_iter=76807, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:40, time_cost(all): 1 day, 4:18:34/3:03:55, loss=0.298505272399667, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=2.639645781181584, lr=0.0169285424576427
2023-12-17 13:44:08   INFO  epoch: 21/24, acc_iter=76857, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:10:59, time_cost(all): 1 day, 4:19:40/2:49:37, loss=0.298306415613618, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=2.0615702844778045, lr=0.01673254068474133
2023-12-17 13:45:15   INFO  epoch: 21/24, acc_iter=76907, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:53, time_cost(all): 1 day, 4:20:47/2:55:46, loss=0.298107558827569, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.1014557921357895, lr=0.01653653891183996
2023-12-17 13:46:22   INFO  epoch: 21/24, acc_iter=76957, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:21, time_cost(all): 1 day, 4:21:54/2:55:41, loss=0.29790870204152, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=1.0019897077295614, lr=0.016340537138938592
2023-12-17 13:47:28   INFO  epoch: 21/24, acc_iter=77007, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:02, time_cost(all): 1 day, 4:23:00/2:55:21, loss=0.297709845255471, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=2.7142739427758285, lr=0.016144535366037223
2023-12-17 13:48:35   INFO  epoch: 21/24, acc_iter=77057, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:06:54, time_cost(all): 1 day, 4:24:07/2:50:51, loss=0.297510988469422, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=3.988359458703103, lr=0.015948533593135847
2023-12-17 13:49:42   INFO  epoch: 21/24, acc_iter=77107, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:05:46, time_cost(all): 1 day, 4:25:14/2:41:49, loss=0.297312131683373, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=4.914467371996896, lr=0.015752531820234478
2023-12-17 13:50:48   INFO  epoch: 21/24, acc_iter=77157, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:05:00, time_cost(all): 1 day, 4:26:20/2:51:32, loss=0.297113274897324, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=1.583361105219931, lr=0.015556530047333109
2023-12-17 13:51:55   INFO  epoch: 21/24, acc_iter=77207, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:42, time_cost(all): 1 day, 4:27:27/2:49:48, loss=0.296914418111275, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=4.112713751594155, lr=0.01536052827443174
2023-12-17 13:53:02   INFO  epoch: 21/24, acc_iter=77257, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:39, time_cost(all): 1 day, 4:28:34/2:45:18, loss=0.296715561325226, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=4.689041073692292, lr=0.01516452650153037
2023-12-17 13:54:08   INFO  epoch: 21/24, acc_iter=77307, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:30, time_cost(all): 1 day, 4:29:40/2:44:48, loss=0.296516704539177, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=0.9391106573302243, lr=0.014968524728629001
2023-12-17 13:55:15   INFO  epoch: 21/24, acc_iter=77357, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:21, time_cost(all): 1 day, 4:30:47/2:45:29, loss=0.296317847753128, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=3.101528574636636, lr=0.014772522955727632
2023-12-17 13:56:22   INFO  epoch: 22/24, acc_iter=77424, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:13:30, time_cost(all): 1 day, 4:31:54/2:35:35, loss=0.296051379659822, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=3.8688711962853666, lr=0.014509880580039797
2023-12-17 13:57:28   INFO  epoch: 22/24, acc_iter=77474, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:17:28, time_cost(all): 1 day, 4:33:00/2:49:48, loss=0.295852522873773, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=4.5310320874564605, lr=0.01431387880713842
2023-12-17 13:58:35   INFO  epoch: 22/24, acc_iter=77524, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:14:40, time_cost(all): 1 day, 4:34:07/2:45:07, loss=0.295653666087724, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=0.5630407889605278, lr=0.014117877034237052
2023-12-17 13:59:42   INFO  epoch: 22/24, acc_iter=77574, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:17:02, time_cost(all): 1 day, 4:35:14/2:43:08, loss=0.295454809301675, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=2.4505402306818533, lr=0.013921875261335682
2023-12-17 14:00:48   INFO  epoch: 22/24, acc_iter=77624, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:11:15, time_cost(all): 1 day, 4:36:20/2:46:04, loss=0.295255952515626, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=3.55992448229132, lr=0.013725873488434313
2023-12-17 14:01:55   INFO  epoch: 22/24, acc_iter=77674, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:11:29, time_cost(all): 1 day, 4:37:27/2:31:59, loss=0.295057095729577, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=4.821909463895949, lr=0.013529871715532944
2023-12-17 14:03:02   INFO  epoch: 22/24, acc_iter=77724, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:08:14, time_cost(all): 1 day, 4:38:34/2:36:58, loss=0.294858238943528, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.83(1.03), norm=3.881631699026735, lr=0.013333869942631575
2023-12-17 14:04:08   INFO  epoch: 22/24, acc_iter=77774, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:08:35, time_cost(all): 1 day, 4:39:40/2:37:41, loss=0.294659382157479, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=2.8280930614600055, lr=0.013137868169730206
2023-12-17 14:05:15   INFO  epoch: 22/24, acc_iter=77824, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:06:54, time_cost(all): 1 day, 4:40:47/2:40:28, loss=0.29446052537143, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.7180859862874436, lr=0.01294186639682883
2023-12-17 14:06:22   INFO  epoch: 22/24, acc_iter=77874, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:04:05, time_cost(all): 1 day, 4:41:54/2:33:02, loss=0.294261668585381, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=4.551244977971989, lr=0.012745864623927461
2023-12-17 14:07:28   INFO  epoch: 22/24, acc_iter=77924, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:03:21, time_cost(all): 1 day, 4:43:00/2:30:45, loss=0.294062811799332, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=2.3869656923590488, lr=0.012549862851026092
2023-12-17 14:08:35   INFO  epoch: 22/24, acc_iter=77974, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:07:16, time_cost(all): 1 day, 4:44:07/2:25:01, loss=0.293863955013283, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=3.8595747120302804, lr=0.012353861078124723
2023-12-17 14:09:42   INFO  epoch: 22/24, acc_iter=78024, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:06:01, time_cost(all): 1 day, 4:45:14/2:29:56, loss=0.293665098227234, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=1.3150860904885056, lr=0.012157859305223354
2023-12-17 14:10:48   INFO  epoch: 22/24, acc_iter=78074, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:00:55, time_cost(all): 1 day, 4:46:20/2:34:27, loss=0.293466241441185, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=4.145628792033062, lr=0.011961857532321984
2023-12-17 14:11:55   INFO  epoch: 22/24, acc_iter=78124, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/0:58:48, time_cost(all): 1 day, 4:47:27/2:23:12, loss=0.293267384655136, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=3.1522328670097584, lr=0.011765855759420615
2023-12-17 14:13:01   INFO  epoch: 22/24, acc_iter=78174, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/1:00:45, time_cost(all): 1 day, 4:48:33/2:30:10, loss=0.293068527869087, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=1.0240234134157902, lr=0.011569853986519246
2023-12-17 14:14:08   INFO  epoch: 22/24, acc_iter=78224, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/1:00:42, time_cost(all): 1 day, 4:49:40/2:27:59, loss=0.292869671083038, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=1.5892063185763763, lr=0.01137385221361787
2023-12-17 14:15:15   INFO  epoch: 22/24, acc_iter=78274, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:57:24, time_cost(all): 1 day, 4:50:47/2:23:35, loss=0.292670814296989, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=2.936237101970648, lr=0.011177850440716501
2023-12-17 14:16:21   INFO  epoch: 22/24, acc_iter=78324, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:58:52, time_cost(all): 1 day, 4:51:53/2:28:19, loss=0.29247195751094, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=3.441362982960696, lr=0.010981848667815132
2023-12-17 14:17:28   INFO  epoch: 22/24, acc_iter=78374, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:56:02, time_cost(all): 1 day, 4:53:00/2:16:15, loss=0.292273100724891, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=2.748560174103819, lr=0.010785846894913763
2023-12-17 14:18:35   INFO  epoch: 22/24, acc_iter=78424, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:52:25, time_cost(all): 1 day, 4:54:07/2:25:44, loss=0.292074243938842, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=2.4042500718932733, lr=0.010589845122012387
2023-12-17 14:19:41   INFO  epoch: 22/24, acc_iter=78474, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:52:49, time_cost(all): 1 day, 4:55:13/2:26:54, loss=0.291875387152793, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=1.0650143333863586, lr=0.010393843349111025
2023-12-17 14:20:48   INFO  epoch: 22/24, acc_iter=78524, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:52:45, time_cost(all): 1 day, 4:56:20/2:12:21, loss=0.291676530366744, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=3.076106351998312, lr=0.010197841576209656
2023-12-17 14:21:55   INFO  epoch: 22/24, acc_iter=78574, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:51:27, time_cost(all): 1 day, 4:57:27/2:16:44, loss=0.291477673580695, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=1.6477117740239051, lr=0.01000183980330828
2023-12-17 14:23:01   INFO  epoch: 22/24, acc_iter=78624, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:51:37, time_cost(all): 1 day, 4:58:33/2:16:11, loss=0.291278816794646, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.8367847547090874, lr=0.00980583803040691
2023-12-17 14:24:08   INFO  epoch: 22/24, acc_iter=78674, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:50:42, time_cost(all): 1 day, 4:59:40/2:18:17, loss=0.291079960008597, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=4.478136607686661, lr=0.009609836257505541
2023-12-17 14:25:15   INFO  epoch: 22/24, acc_iter=78724, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:47, time_cost(all): 1 day, 5:00:47/2:12:16, loss=0.290881103222548, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.23(1.03), norm=3.7101503736446464, lr=0.009413834484604172
2023-12-17 14:26:21   INFO  epoch: 22/24, acc_iter=78774, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:48:38, time_cost(all): 1 day, 5:01:53/2:15:37, loss=0.290682246436499, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=2.719754698557417, lr=0.009217832711702796
2023-12-17 14:27:28   INFO  epoch: 22/24, acc_iter=78824, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:46:48, time_cost(all): 1 day, 5:03:00/2:09:06, loss=0.29048338965045, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.790935115894718, lr=0.009021830938801434
2023-12-17 14:28:35   INFO  epoch: 22/24, acc_iter=78874, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:46:32, time_cost(all): 1 day, 5:04:07/2:16:28, loss=0.290284532864401, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=2.2483245751045198, lr=0.008825829165900065
2023-12-17 14:29:41   INFO  epoch: 22/24, acc_iter=78924, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:42:23, time_cost(all): 1 day, 5:05:13/2:10:45, loss=0.290085676078352, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=4.246580643642945, lr=0.008629827392998689
2023-12-17 14:30:48   INFO  epoch: 22/24, acc_iter=78974, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:41:26, time_cost(all): 1 day, 5:06:20/2:13:46, loss=0.289886819292303, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=4.981716843127871, lr=0.00843382562009732
2023-12-17 14:31:55   INFO  epoch: 22/24, acc_iter=79024, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:41:45, time_cost(all): 1 day, 5:07:27/2:07:59, loss=0.289687962506254, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=2.6626358959675973, lr=0.00823782384719595
2023-12-17 14:33:01   INFO  epoch: 22/24, acc_iter=79074, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:02, time_cost(all): 1 day, 5:08:33/2:08:32, loss=0.289489105720205, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=3.818255825246031, lr=0.008041822074294581
2023-12-17 14:34:08   INFO  epoch: 22/24, acc_iter=79124, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:39:05, time_cost(all): 1 day, 5:09:40/2:11:21, loss=0.289290248934156, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=1.9882508801446213, lr=0.007845820301393205
2023-12-17 14:35:15   INFO  epoch: 22/24, acc_iter=79174, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:39:44, time_cost(all): 1 day, 5:10:47/2:06:23, loss=0.289091392148107, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=0.6507847029139169, lr=0.007649818528491836
2023-12-17 14:36:21   INFO  epoch: 22/24, acc_iter=79224, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:36:02, time_cost(all): 1 day, 5:11:53/1:57:21, loss=0.288892535362058, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=3.6555388835105296, lr=0.007453816755590474
2023-12-17 14:37:28   INFO  epoch: 22/24, acc_iter=79274, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:35:55, time_cost(all): 1 day, 5:13:00/1:57:28, loss=0.288693678576009, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=2.9915356933424975, lr=0.007257814982689098
2023-12-17 14:38:35   INFO  epoch: 22/24, acc_iter=79324, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:34:13, time_cost(all): 1 day, 5:14:07/1:57:51, loss=0.28849482178996, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=2.813944205275304, lr=0.007061813209787729
2023-12-17 14:39:41   INFO  epoch: 22/24, acc_iter=79374, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:32:51, time_cost(all): 1 day, 5:15:13/2:01:25, loss=0.288295965003911, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=4.933328754874842, lr=0.00686581143688636
2023-12-17 14:40:48   INFO  epoch: 22/24, acc_iter=79424, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:31:36, time_cost(all): 1 day, 5:16:20/1:59:33, loss=0.288097108217862, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=3.27609624489603, lr=0.006669809663984991
2023-12-17 14:41:54   INFO  epoch: 22/24, acc_iter=79474, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:31:20, time_cost(all): 1 day, 5:17:26/1:55:36, loss=0.287898251431813, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=1.9095293482929552, lr=0.006473807891083615
2023-12-17 14:43:01   INFO  epoch: 22/24, acc_iter=79524, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:29:30, time_cost(all): 1 day, 5:18:33/1:59:16, loss=0.287699394645764, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.764795279520516, lr=0.006277806118182246
2023-12-17 14:44:08   INFO  epoch: 22/24, acc_iter=79574, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:28:28, time_cost(all): 1 day, 5:19:40/1:51:53, loss=0.287500537859715, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=2.2785404711807797, lr=0.006081804345280883
2023-12-17 14:45:14   INFO  epoch: 22/24, acc_iter=79624, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:05, time_cost(all): 1 day, 5:20:46/1:53:17, loss=0.287301681073666, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=2.711958363006661, lr=0.005885802572379507
2023-12-17 14:46:21   INFO  epoch: 22/24, acc_iter=79674, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:25:52, time_cost(all): 1 day, 5:21:53/1:48:48, loss=0.287102824287617, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=4.733166779098449, lr=0.005689800799478138
2023-12-17 14:47:28   INFO  epoch: 22/24, acc_iter=79724, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:26:52, time_cost(all): 1 day, 5:23:00/1:52:50, loss=0.286903967501568, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=4.711693948386059, lr=0.005493799026576769
2023-12-17 14:48:34   INFO  epoch: 22/24, acc_iter=79774, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:25:48, time_cost(all): 1 day, 5:24:06/1:56:02, loss=0.286705110715519, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=0.7940663119011162, lr=0.0052977972536754
2023-12-17 14:49:41   INFO  epoch: 22/24, acc_iter=79824, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:22:43, time_cost(all): 1 day, 5:25:13/1:55:10, loss=0.28650625392947, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=1.502053447404324, lr=0.005101795480774024
2023-12-17 14:50:48   INFO  epoch: 22/24, acc_iter=79874, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:07, time_cost(all): 1 day, 5:26:20/1:53:08, loss=0.286307397143421, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=4.167423936245521, lr=0.004973640381638385
2023-12-17 14:51:54   INFO  epoch: 22/24, acc_iter=79924, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:22:04, time_cost(all): 1 day, 5:27:26/1:51:26, loss=0.286108540357372, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=0.5462182489576969, lr=0.004918797633715596
2023-12-17 14:53:01   INFO  epoch: 22/24, acc_iter=79974, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:54, time_cost(all): 1 day, 5:28:33/1:46:32, loss=0.285909683571323, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.8344961414895748, lr=0.004863954885792809
2023-12-17 14:54:08   INFO  epoch: 22/24, acc_iter=80024, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:19:19, time_cost(all): 1 day, 5:29:40/1:45:31, loss=0.285710826785274, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=4.422115918546659, lr=0.004809112137870022
2023-12-17 14:55:14   INFO  epoch: 22/24, acc_iter=80074, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:28, time_cost(all): 1 day, 5:30:46/1:47:34, loss=0.285511969999225, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=1.3286575902597637, lr=0.004754269389947234
2023-12-17 14:56:21   INFO  epoch: 22/24, acc_iter=80124, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:17:20, time_cost(all): 1 day, 5:31:53/1:48:15, loss=0.285313113213176, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=2.283767774870749, lr=0.004699426642024446
2023-12-17 14:57:28   INFO  epoch: 22/24, acc_iter=80174, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:38, time_cost(all): 1 day, 5:33:00/1:44:17, loss=0.285114256427127, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=4.605244882693166, lr=0.004644583894101658
2023-12-17 14:58:34   INFO  epoch: 22/24, acc_iter=80224, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:15:04, time_cost(all): 1 day, 5:34:06/1:42:48, loss=0.284915399641078, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=1.8762861132703934, lr=0.004589741146178871
2023-12-17 14:59:41   INFO  epoch: 22/24, acc_iter=80274, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:14:00, time_cost(all): 1 day, 5:35:13/1:43:49, loss=0.284716542855029, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=1.377177375032914, lr=0.004534898398256083
2023-12-17 15:00:48   INFO  epoch: 22/24, acc_iter=80324, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:13:02, time_cost(all): 1 day, 5:36:20/1:35:38, loss=0.28451768606898, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=4.153459013035665, lr=0.004480055650333295
2023-12-17 15:01:54   INFO  epoch: 22/24, acc_iter=80374, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:11:59, time_cost(all): 1 day, 5:37:26/1:40:22, loss=0.284318829282931, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=0.8644959216124317, lr=0.004425212902410508
2023-12-17 15:03:01   INFO  epoch: 22/24, acc_iter=80424, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:35, time_cost(all): 1 day, 5:38:33/1:35:46, loss=0.284119972496882, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=2.6775609480560814, lr=0.00437037015448772
2023-12-17 15:04:08   INFO  epoch: 22/24, acc_iter=80474, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:36, time_cost(all): 1 day, 5:39:40/1:39:42, loss=0.283921115710833, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.5975304940772685, lr=0.004315527406564932
2023-12-17 15:05:14   INFO  epoch: 22/24, acc_iter=80524, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:29, time_cost(all): 1 day, 5:40:46/1:35:30, loss=0.283722258924784, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=4.550372947777691, lr=0.004260684658642145
2023-12-17 15:06:21   INFO  epoch: 22/24, acc_iter=80574, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:01, time_cost(all): 1 day, 5:41:53/1:33:14, loss=0.283523402138735, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=3.863545612122529, lr=0.004205841910719357
2023-12-17 15:07:28   INFO  epoch: 22/24, acc_iter=80624, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:07, time_cost(all): 1 day, 5:43:00/1:27:58, loss=0.283324545352686, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=0.9642971192568831, lr=0.004150999162796569
2023-12-17 15:08:34   INFO  epoch: 22/24, acc_iter=80674, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:04:47, time_cost(all): 1 day, 5:44:06/1:30:02, loss=0.283125688566637, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=3.444174310148717, lr=0.004096156414873782
2023-12-17 15:09:41   INFO  epoch: 22/24, acc_iter=80724, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:31, time_cost(all): 1 day, 5:45:13/1:33:29, loss=0.282926831780588, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=0.9550756541950988, lr=0.004041313666950994
2023-12-17 15:10:47   INFO  epoch: 22/24, acc_iter=80774, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:41, time_cost(all): 1 day, 5:46:19/1:27:53, loss=0.282727974994539, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=4.393178640616635, lr=0.003986470919028207
2023-12-17 15:11:54   INFO  epoch: 22/24, acc_iter=80824, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:33, time_cost(all): 1 day, 5:47:26/1:27:29, loss=0.28252911820849, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=2.171389932226272, lr=0.003931628171105419
2023-12-17 15:13:01   INFO  epoch: 22/24, acc_iter=80874, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 1 day, 5:48:33/1:29:56, loss=0.282330261422441, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=3.4610917197521784, lr=0.003876785423182631
2023-12-17 15:14:07   INFO  epoch: 23/24, acc_iter=80941, cur_iter=50/3517, batch_size=8, time_cost(epoch): 0:01:06/1:18:59, time_cost(all): 1 day, 5:49:39/1:26:11, loss=0.282063793329136, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=2.7329395086973522, lr=0.003803296140966096
2023-12-17 15:15:14   INFO  epoch: 23/24, acc_iter=80991, cur_iter=100/3517, batch_size=8, time_cost(epoch): 0:02:13/1:12:26, time_cost(all): 1 day, 5:50:46/1:26:25, loss=0.281864936543087, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=2.017336724011969, lr=0.003748453393043308
2023-12-17 15:16:21   INFO  epoch: 23/24, acc_iter=81041, cur_iter=150/3517, batch_size=8, time_cost(epoch): 0:03:19/1:18:10, time_cost(all): 1 day, 5:51:53/1:26:44, loss=0.281666079757038, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=2.4962922519489235, lr=0.00369361064512052
2023-12-17 15:17:27   INFO  epoch: 23/24, acc_iter=81091, cur_iter=200/3517, batch_size=8, time_cost(epoch): 0:04:26/1:12:46, time_cost(all): 1 day, 5:52:59/1:26:17, loss=0.281467222970989, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=0.8175530585495226, lr=0.003638767897197733
2023-12-17 15:18:34   INFO  epoch: 23/24, acc_iter=81141, cur_iter=250/3517, batch_size=8, time_cost(epoch): 0:05:33/1:09:20, time_cost(all): 1 day, 5:54:06/1:22:56, loss=0.28126836618494, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=2.0195615873187664, lr=0.003583925149274945
2023-12-17 15:19:41   INFO  epoch: 23/24, acc_iter=81191, cur_iter=300/3517, batch_size=8, time_cost(epoch): 0:06:39/1:14:58, time_cost(all): 1 day, 5:55:13/1:22:50, loss=0.281069509398891, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=2.2085498664671177, lr=0.003529082401352157
2023-12-17 15:20:47   INFO  epoch: 23/24, acc_iter=81241, cur_iter=350/3517, batch_size=8, time_cost(epoch): 0:07:46/1:07:05, time_cost(all): 1 day, 5:56:19/1:16:44, loss=0.280870652612842, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=4.910683781169036, lr=0.00347423965342937
2023-12-17 15:21:54   INFO  epoch: 23/24, acc_iter=81291, cur_iter=400/3517, batch_size=8, time_cost(epoch): 0:08:53/1:06:27, time_cost(all): 1 day, 5:57:26/1:20:44, loss=0.280671795826793, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=3.9833028065279503, lr=0.003419396905506582
2023-12-17 15:23:01   INFO  epoch: 23/24, acc_iter=81341, cur_iter=450/3517, batch_size=8, time_cost(epoch): 0:09:59/1:08:53, time_cost(all): 1 day, 5:58:33/1:15:34, loss=0.280472939040744, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=1.0429877664460088, lr=0.003364554157583795
2023-12-17 15:24:07   INFO  epoch: 23/24, acc_iter=81391, cur_iter=500/3517, batch_size=8, time_cost(epoch): 0:11:06/1:05:45, time_cost(all): 1 day, 5:59:39/1:15:06, loss=0.280274082254695, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.0048214961274202, lr=0.003309711409661007
2023-12-17 15:25:14   INFO  epoch: 23/24, acc_iter=81441, cur_iter=550/3517, batch_size=8, time_cost(epoch): 0:12:13/1:03:46, time_cost(all): 1 day, 6:00:46/1:17:52, loss=0.280075225468646, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=4.673692788851022, lr=0.003254868661738219
2023-12-17 15:26:21   INFO  epoch: 23/24, acc_iter=81491, cur_iter=600/3517, batch_size=8, time_cost(epoch): 0:13:19/1:01:37, time_cost(all): 1 day, 6:01:53/1:16:44, loss=0.279876368682597, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=2.0388794426986028, lr=0.003200025913815432
2023-12-17 15:27:27   INFO  epoch: 23/24, acc_iter=81541, cur_iter=650/3517, batch_size=8, time_cost(epoch): 0:14:26/1:06:39, time_cost(all): 1 day, 6:02:59/1:10:01, loss=0.279677511896548, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=2.85601665374523, lr=0.003145183165892644
2023-12-17 15:28:34   INFO  epoch: 23/24, acc_iter=81591, cur_iter=700/3517, batch_size=8, time_cost(epoch): 0:15:33/1:00:14, time_cost(all): 1 day, 6:04:06/1:10:41, loss=0.279478655110499, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=4.245265673674753, lr=0.003090340417969856
2023-12-17 15:29:41   INFO  epoch: 23/24, acc_iter=81641, cur_iter=750/3517, batch_size=8, time_cost(epoch): 0:16:39/1:00:28, time_cost(all): 1 day, 6:05:13/1:08:38, loss=0.27927979832445, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=4.249951302050305, lr=0.003035497670047068
2023-12-17 15:30:47   INFO  epoch: 23/24, acc_iter=81691, cur_iter=800/3517, batch_size=8, time_cost(epoch): 0:17:46/0:58:14, time_cost(all): 1 day, 6:06:19/1:11:25, loss=0.279080941538401, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=1.5099039783723434, lr=0.002980654922124281
2023-12-17 15:31:54   INFO  epoch: 23/24, acc_iter=81741, cur_iter=850/3517, batch_size=8, time_cost(epoch): 0:18:53/0:56:42, time_cost(all): 1 day, 6:07:26/1:10:45, loss=0.278882084752352, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=3.4628561611771342, lr=0.002925812174201493
2023-12-17 15:33:01   INFO  epoch: 23/24, acc_iter=81791, cur_iter=900/3517, batch_size=8, time_cost(epoch): 0:19:59/0:56:14, time_cost(all): 1 day, 6:08:33/1:09:40, loss=0.278683227966303, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=1.5219386623109457, lr=0.002870969426278706
2023-12-17 15:34:07   INFO  epoch: 23/24, acc_iter=81841, cur_iter=950/3517, batch_size=8, time_cost(epoch): 0:21:06/0:55:45, time_cost(all): 1 day, 6:09:39/1:02:43, loss=0.278484371180254, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=0.674712466030277, lr=0.002816126678355918
2023-12-17 15:35:14   INFO  epoch: 23/24, acc_iter=81891, cur_iter=1000/3517, batch_size=8, time_cost(epoch): 0:22:13/0:53:15, time_cost(all): 1 day, 6:10:46/1:05:32, loss=0.278285514394205, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=3.658635276730829, lr=0.00276128393043313
2023-12-17 15:36:21   INFO  epoch: 23/24, acc_iter=81941, cur_iter=1050/3517, batch_size=8, time_cost(epoch): 0:23:19/0:57:21, time_cost(all): 1 day, 6:11:53/1:04:21, loss=0.278086657608156, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=2.5398694302282125, lr=0.002706441182510343
2023-12-17 15:37:27   INFO  epoch: 23/24, acc_iter=81991, cur_iter=1100/3517, batch_size=8, time_cost(epoch): 0:24:26/0:54:02, time_cost(all): 1 day, 6:12:59/0:59:46, loss=0.277887800822107, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=0.5657419588057157, lr=0.002651598434587555
2023-12-17 15:38:34   INFO  epoch: 23/24, acc_iter=82041, cur_iter=1150/3517, batch_size=8, time_cost(epoch): 0:25:33/0:53:17, time_cost(all): 1 day, 6:14:06/1:03:51, loss=0.277688944036058, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=1.8865097383965086, lr=0.002596755686664767
2023-12-17 15:39:41   INFO  epoch: 23/24, acc_iter=82091, cur_iter=1200/3517, batch_size=8, time_cost(epoch): 0:26:39/0:49:54, time_cost(all): 1 day, 6:15:13/1:00:19, loss=0.277490087250009, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=3.348496919378323, lr=0.002541912938741979
2023-12-17 15:40:47   INFO  epoch: 23/24, acc_iter=82141, cur_iter=1250/3517, batch_size=8, time_cost(epoch): 0:27:46/0:52:46, time_cost(all): 1 day, 6:16:19/0:57:19, loss=0.27729123046396, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=3.8948201898009382, lr=0.002487070190819192
2023-12-17 15:41:54   INFO  epoch: 23/24, acc_iter=82191, cur_iter=1300/3517, batch_size=8, time_cost(epoch): 0:28:53/0:49:59, time_cost(all): 1 day, 6:17:26/1:00:42, loss=0.277092373677911, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=2.823338549438194, lr=0.002432227442896404
2023-12-17 15:43:00   INFO  epoch: 23/24, acc_iter=82241, cur_iter=1350/3517, batch_size=8, time_cost(epoch): 0:29:59/0:47:15, time_cost(all): 1 day, 6:18:32/0:58:11, loss=0.276893516891862, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=4.905697289151874, lr=0.002377384694973617
2023-12-17 15:44:07   INFO  epoch: 23/24, acc_iter=82291, cur_iter=1400/3517, batch_size=8, time_cost(epoch): 0:31:06/0:47:25, time_cost(all): 1 day, 6:19:39/0:54:26, loss=0.276694660105813, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=4.117694939560378, lr=0.002322541947050829
2023-12-17 15:45:14   INFO  epoch: 23/24, acc_iter=82341, cur_iter=1450/3517, batch_size=8, time_cost(epoch): 0:32:12/0:46:41, time_cost(all): 1 day, 6:20:46/0:55:24, loss=0.276495803319764, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=1.3661108256501233, lr=0.002267699199128042
2023-12-17 15:46:20   INFO  epoch: 23/24, acc_iter=82391, cur_iter=1500/3517, batch_size=8, time_cost(epoch): 0:33:19/0:44:31, time_cost(all): 1 day, 6:21:52/0:54:39, loss=0.276296946533715, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=4.841050459779943, lr=0.002212856451205254
2023-12-17 15:47:27   INFO  epoch: 23/24, acc_iter=82441, cur_iter=1550/3517, batch_size=8, time_cost(epoch): 0:34:26/0:44:02, time_cost(all): 1 day, 6:22:59/0:54:55, loss=0.276098089747666, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=1.587714175148614, lr=0.002158013703282466
2023-12-17 15:48:34   INFO  epoch: 23/24, acc_iter=82491, cur_iter=1600/3517, batch_size=8, time_cost(epoch): 0:35:32/0:42:44, time_cost(all): 1 day, 6:24:06/0:49:59, loss=0.275899232961617, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=4.039514478078874, lr=0.002103170955359679
2023-12-17 15:49:40   INFO  epoch: 23/24, acc_iter=82541, cur_iter=1650/3517, batch_size=8, time_cost(epoch): 0:36:39/0:39:24, time_cost(all): 1 day, 6:25:12/0:48:07, loss=0.275700376175568, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=4.670339713453974, lr=0.00204832820743689
2023-12-17 15:50:47   INFO  epoch: 23/24, acc_iter=82591, cur_iter=1700/3517, batch_size=8, time_cost(epoch): 0:37:46/0:40:16, time_cost(all): 1 day, 6:26:19/0:48:06, loss=0.275501519389519, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=4.580869647681852, lr=0.001993485459514103
2023-12-17 15:51:54   INFO  epoch: 23/24, acc_iter=82641, cur_iter=1750/3517, batch_size=8, time_cost(epoch): 0:38:52/0:40:27, time_cost(all): 1 day, 6:27:26/0:46:16, loss=0.27530266260347, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=1.0369193659497524, lr=0.001938642711591315
2023-12-17 15:53:00   INFO  epoch: 23/24, acc_iter=82691, cur_iter=1800/3517, batch_size=8, time_cost(epoch): 0:39:59/0:39:56, time_cost(all): 1 day, 6:28:32/0:45:12, loss=0.275103805817421, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=4.193344210029938, lr=0.001883799963668528
2023-12-17 15:54:07   INFO  epoch: 23/24, acc_iter=82741, cur_iter=1850/3517, batch_size=8, time_cost(epoch): 0:41:06/0:37:54, time_cost(all): 1 day, 6:29:39/0:45:24, loss=0.274904949031372, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=4.474065909568291, lr=0.00182895721574574
2023-12-17 15:55:14   INFO  epoch: 23/24, acc_iter=82791, cur_iter=1900/3517, batch_size=8, time_cost(epoch): 0:42:12/0:34:35, time_cost(all): 1 day, 6:30:46/0:44:50, loss=0.274706092245323, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=4.944218384347382, lr=0.001774114467822952
2023-12-17 15:56:20   INFO  epoch: 23/24, acc_iter=82841, cur_iter=1950/3517, batch_size=8, time_cost(epoch): 0:43:19/0:33:07, time_cost(all): 1 day, 6:31:52/0:45:10, loss=0.274507235459274, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=4.164221652907691, lr=0.001719271719900165
2023-12-17 15:57:27   INFO  epoch: 23/24, acc_iter=82891, cur_iter=2000/3517, batch_size=8, time_cost(epoch): 0:44:26/0:34:50, time_cost(all): 1 day, 6:32:59/0:43:40, loss=0.274308378673225, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=1.1040240321387575, lr=0.001664428971977377
2023-12-17 15:58:34   INFO  epoch: 23/24, acc_iter=82941, cur_iter=2050/3517, batch_size=8, time_cost(epoch): 0:45:32/0:33:11, time_cost(all): 1 day, 6:34:06/0:43:20, loss=0.274109521887176, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=2.895677789465875, lr=0.001609586224054589
2023-12-17 15:59:40   INFO  epoch: 23/24, acc_iter=82991, cur_iter=2100/3517, batch_size=8, time_cost(epoch): 0:46:39/0:32:27, time_cost(all): 1 day, 6:35:12/0:38:18, loss=0.273910665101127, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=0.634551367028511, lr=0.001554743476131802
2023-12-17 16:00:47   INFO  epoch: 23/24, acc_iter=83041, cur_iter=2150/3517, batch_size=8, time_cost(epoch): 0:47:46/0:31:44, time_cost(all): 1 day, 6:36:19/0:40:08, loss=0.273711808315078, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=3.1899143273888866, lr=0.001499900728209014
2023-12-17 16:01:54   INFO  epoch: 23/24, acc_iter=83091, cur_iter=2200/3517, batch_size=8, time_cost(epoch): 0:48:52/0:29:53, time_cost(all): 1 day, 6:37:26/0:38:51, loss=0.273512951529029, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=2.132278065513721, lr=0.001445057980286227
2023-12-17 16:03:00   INFO  epoch: 23/24, acc_iter=83141, cur_iter=2250/3517, batch_size=8, time_cost(epoch): 0:49:59/0:28:22, time_cost(all): 1 day, 6:38:32/0:38:38, loss=0.27331409474298, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=4.474978440537807, lr=0.001390215232363439
2023-12-17 16:04:07   INFO  epoch: 23/24, acc_iter=83191, cur_iter=2300/3517, batch_size=8, time_cost(epoch): 0:51:06/0:26:08, time_cost(all): 1 day, 6:39:39/0:37:05, loss=0.273115237956931, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=2.3132744233205353, lr=0.001335372484440651
2023-12-17 16:05:14   INFO  epoch: 23/24, acc_iter=83241, cur_iter=2350/3517, batch_size=8, time_cost(epoch): 0:52:12/0:24:51, time_cost(all): 1 day, 6:40:46/0:32:59, loss=0.272916381170882, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=1.6712472208654183, lr=0.001280529736517863
2023-12-17 16:06:20   INFO  epoch: 23/24, acc_iter=83291, cur_iter=2400/3517, batch_size=8, time_cost(epoch): 0:53:19/0:24:14, time_cost(all): 1 day, 6:41:52/0:33:42, loss=0.272717524384833, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=1.552976064693556, lr=0.001225686988595076
2023-12-17 16:07:27   INFO  epoch: 23/24, acc_iter=83341, cur_iter=2450/3517, batch_size=8, time_cost(epoch): 0:54:26/0:23:02, time_cost(all): 1 day, 6:42:59/0:32:00, loss=0.272518667598784, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.23(1.03), norm=2.816332811065992, lr=0.001170844240672288
2023-12-17 16:08:34   INFO  epoch: 23/24, acc_iter=83391, cur_iter=2500/3517, batch_size=8, time_cost(epoch): 0:55:32/0:22:21, time_cost(all): 1 day, 6:44:06/0:31:27, loss=0.272319810812735, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=3.918627195181045, lr=0.001116001492749501
2023-12-17 16:09:40   INFO  epoch: 23/24, acc_iter=83441, cur_iter=2550/3517, batch_size=8, time_cost(epoch): 0:56:39/0:20:41, time_cost(all): 1 day, 6:45:12/0:28:50, loss=0.272120954026686, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=1.0575468805592894, lr=0.001061158744826713
2023-12-17 16:10:47   INFO  epoch: 23/24, acc_iter=83491, cur_iter=2600/3517, batch_size=8, time_cost(epoch): 0:57:46/0:19:34, time_cost(all): 1 day, 6:46:19/0:29:55, loss=0.271922097240637, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=0.7920661915594116, lr=0.001006315996903926
2023-12-17 16:11:53   INFO  epoch: 23/24, acc_iter=83541, cur_iter=2650/3517, batch_size=8, time_cost(epoch): 0:58:52/0:18:42, time_cost(all): 1 day, 6:47:25/0:27:38, loss=0.271723240454588, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=0.5515757025838748, lr=0.000951473248981137
2023-12-17 16:13:00   INFO  epoch: 23/24, acc_iter=83591, cur_iter=2700/3517, batch_size=8, time_cost(epoch): 0:59:59/0:18:39, time_cost(all): 1 day, 6:48:32/0:28:09, loss=0.271524383668539, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=3.378738030808126, lr=0.00089663050105835
2023-12-17 16:14:07   INFO  epoch: 23/24, acc_iter=83641, cur_iter=2750/3517, batch_size=8, time_cost(epoch): 1:01:05/0:16:47, time_cost(all): 1 day, 6:49:39/0:25:50, loss=0.27132552688249, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=1.3951942241168271, lr=0.000841787753135562
2023-12-17 16:15:13   INFO  epoch: 23/24, acc_iter=83691, cur_iter=2800/3517, batch_size=8, time_cost(epoch): 1:02:12/0:16:13, time_cost(all): 1 day, 6:50:45/0:24:00, loss=0.271126670096441, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=4.400031603935171, lr=0.000786945005212774
2023-12-17 16:16:20   INFO  epoch: 23/24, acc_iter=83741, cur_iter=2850/3517, batch_size=8, time_cost(epoch): 1:03:19/0:14:55, time_cost(all): 1 day, 6:51:52/0:24:23, loss=0.270927813310392, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=3.613147991114472, lr=0.000732102257289987
2023-12-17 16:17:27   INFO  epoch: 23/24, acc_iter=83791, cur_iter=2900/3517, batch_size=8, time_cost(epoch): 1:04:25/0:13:10, time_cost(all): 1 day, 6:52:59/0:21:17, loss=0.270728956524343, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=4.659363843125652, lr=0.000677259509367199
2023-12-17 16:18:33   INFO  epoch: 23/24, acc_iter=83841, cur_iter=2950/3517, batch_size=8, time_cost(epoch): 1:05:32/0:12:22, time_cost(all): 1 day, 6:54:05/0:20:30, loss=0.270530099738294, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=3.2324106859814012, lr=0.000622416761444412
2023-12-17 16:19:40   INFO  epoch: 23/24, acc_iter=83891, cur_iter=3000/3517, batch_size=8, time_cost(epoch): 1:06:39/0:10:57, time_cost(all): 1 day, 6:55:12/0:19:23, loss=0.270331242952245, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=4.160706077600548, lr=0.000567574013521624
2023-12-17 16:20:47   INFO  epoch: 23/24, acc_iter=83941, cur_iter=3050/3517, batch_size=8, time_cost(epoch): 1:07:45/0:10:33, time_cost(all): 1 day, 6:56:19/0:18:23, loss=0.270132386166196, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=4.895422240247494, lr=0.000512731265598836
2023-12-17 16:21:53   INFO  epoch: 23/24, acc_iter=83991, cur_iter=3100/3517, batch_size=8, time_cost(epoch): 1:08:52/0:09:06, time_cost(all): 1 day, 6:57:25/0:18:48, loss=0.269933529380147, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=1.8783581380605485, lr=0.000457888517676049
2023-12-17 16:23:00   INFO  epoch: 23/24, acc_iter=84041, cur_iter=3150/3517, batch_size=8, time_cost(epoch): 1:09:59/0:08:16, time_cost(all): 1 day, 6:58:32/0:17:10, loss=0.269734672594098, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=1.2524695698321306, lr=0.000403045769753261
2023-12-17 16:24:07   INFO  epoch: 23/24, acc_iter=84091, cur_iter=3200/3517, batch_size=8, time_cost(epoch): 1:11:05/0:07:19, time_cost(all): 1 day, 6:59:39/0:15:51, loss=0.269535815808049, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=4.130082187845632, lr=0.000348203021830473
2023-12-17 16:25:13   INFO  epoch: 23/24, acc_iter=84141, cur_iter=3250/3517, batch_size=8, time_cost(epoch): 1:12:12/0:06:08, time_cost(all): 1 day, 7:00:45/0:13:55, loss=0.269336959022, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=2.5137288610639428, lr=0.000293360273907686
2023-12-17 16:26:20   INFO  epoch: 23/24, acc_iter=84191, cur_iter=3300/3517, batch_size=8, time_cost(epoch): 1:13:19/0:05:03, time_cost(all): 1 day, 7:01:52/0:13:50, loss=0.269138102235951, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=4.0352064806809915, lr=0.000238517525984898
2023-12-17 16:27:27   INFO  epoch: 23/24, acc_iter=84241, cur_iter=3350/3517, batch_size=8, time_cost(epoch): 1:14:25/0:03:35, time_cost(all): 1 day, 7:02:59/0:12:28, loss=0.268939245449902, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=4.996225873707785, lr=0.00018367477806211
2023-12-17 16:28:33   INFO  epoch: 23/24, acc_iter=84291, cur_iter=3400/3517, batch_size=8, time_cost(epoch): 1:15:32/0:02:37, time_cost(all): 1 day, 7:04:05/0:11:36, loss=0.268740388663853, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=1.1198586135381583, lr=0.000128832030139323
2023-12-17 16:29:40   INFO  epoch: 23/24, acc_iter=84341, cur_iter=3450/3517, batch_size=8, time_cost(epoch): 1:16:39/0:01:30, time_cost(all): 1 day, 7:05:12/0:10:30, loss=0.268541531877804, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=1.2426414416144653, lr=7.3989282216536e-05
2023-12-17 16:30:47   INFO  epoch: 23/24, acc_iter=84391, cur_iter=3500/3517, batch_size=8, time_cost(epoch): 1:17:45/0:00:22, time_cost(all): 1 day, 7:06:19/0:08:45, loss=0.268342675091755, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=2.4460099024888433, lr=1.9146534293748e-05
2023-12-17 16:30:47   INFO  **********************End training cfgs/picture_models/picture_nuscenes_segmentation(default)**********************



2023-12-17 16:32:12   INFO  **********************Start evaluation cfgs/picture_models/picture_nuscenes_segmentation(default)**********************
+---------+-------+---------+---------+--------+--------+----------------------+------------+------------+--------------+---------+--------+-------------------+------------+----------+---------+---------+------------+--------+
| classes | noise | barrier | bicycle | bus    | car    | construction_vehicle | motorcycle | pedestrian | traffic_cone | trailer | truck  | driveable_surface | other_flat | sidewalk | terrain | manmade | vegetation | miou   |
+---------+-------+---------+---------+--------+--------+----------------------+------------+------------+--------------+---------+--------+-------------------+------------+----------+---------+---------+------------+--------+
| results | nan   | 0.8467  | 0.4318  | 0.9447 | 0.9631 | 0.6884               | 0.8063     | 0.8408     | 0.5268       | 0.6568  | 0.8751 | 0.9638            | 0.7624     | 0.7591   | 0.7758  | 0.9481  | 0.9142     | 0.7939 |
+---------+-------+---------+---------+--------+--------+----------------------+------------+------------+--------------+---------+--------+-------------------+------------+----------+---------+---------+------------+--------+
2023-12-17 16:32:12   INFO  noise: nan  barrier: 0.8467  bicycle: 0.4318  bus: 0.9447  car: 0.9631  construction_vehicle: 0.6884  motorcycle: 0.8063  pedestrian: 0.8408  traffic_cone: 0.5268  trailer: 0.6568  truck: 0.8751  driveable_surface: 0.9638  other_flat: 0.7624  sidewalk: 0.7591  terrain: 0.7758  manmade: 0.9481  vegetation: 0.9142  miou: 0.7939

2023-12-17 16:32:12   INFO  Result is saved to xxxxxxxxxxxxxxx
2023-12-17 16:32:12   INFO  ****************Evaluation done.*****************
2023-12-17 16:32:12   INFO  Epoch 24 has been evaluated
2023-12-17 16:32:12   INFO  **********************End evaluation cfgs/picture_models/picture_nuscenes_segmentation(default)**********************
