2023-11-21 08:23:18   INFO  **********************Start logging**********************
2023-11-21 08:23:18   INFO  CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
2023-11-21 08:23:18   INFO  cfg_file         cfgs/picture_models/picture_waymo_ssl_seal_decoder_mask.yaml
2023-11-21 08:23:18   INFO  batch_size       3
2023-11-21 08:23:18   INFO  epochs           30
2023-11-21 08:23:18   INFO  workers          4
2023-11-21 08:23:18   INFO  extra_tag        offline_30e
2023-11-21 08:23:18   INFO  ckpt             None
2023-11-21 08:23:18   INFO  pretrained_model None
2023-11-21 08:23:18   INFO  launcher         none
2023-11-21 08:23:18   INFO  tcp_port         18888
2023-11-21 08:23:18   INFO  sync_bn          False
2023-11-21 08:23:18   INFO  fix_random_seed  False
2023-11-21 08:23:18   INFO  ckpt_save_interval 1
2023-11-21 08:23:18   INFO  local_rank       0
2023-11-21 08:23:18   INFO  max_ckpt_save_num 1
2023-11-21 08:23:18   INFO  merge_all_iters_to_one_epoch False
2023-11-21 08:23:18   INFO  set_cfgs         None
2023-11-21 08:23:18   INFO  max_waiting_mins 0
2023-11-21 08:23:18   INFO  start_epoch      0
2023-11-21 08:23:18   INFO  num_epochs_to_eval 0
2023-11-21 08:23:18   INFO  save_to_file     False
2023-11-21 08:23:18   INFO  use_tqdm_to_record False
2023-11-21 08:23:18   INFO  logger_iter_interval 50
2023-11-21 08:23:18   INFO  ckpt_save_time_interval 300
2023-11-21 08:23:18   INFO  wo_gpu_stat      False
2023-11-21 08:23:18   INFO  fp16             False
2023-11-21 08:23:18   INFO  cfg.LOCAL_RANK: 0
2023-11-21 08:23:18   INFO  
cfg.DATA_CONFIG = edict()
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.DATASET: WaymoDataset
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.DATA_PATH: ../data/waymo
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.PROCESSED_DATA_TAG: waymo_processed_data_v0_5_0
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.POINT_CLOUD_RANGE: [-74.88, -74.88, -2, 74.88, 74.88, 4.0]
2023-11-21 08:23:18   INFO  
cfg.DATA_CONFIG.DATA_SPLIT = edict()
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.DATA_SPLIT.train: train
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.DATA_SPLIT.test: val
2023-11-21 08:23:18   INFO  
cfg.DATA_CONFIG.SAMPLED_INTERVAL = edict()
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.SAMPLED_INTERVAL.train: 1
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.SAMPLED_INTERVAL.test: 1
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.FILTER_EMPTY_BOXES_FOR_TRAIN: True
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.DISABLE_NLZ_FLAG_ON_POINTS: True
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.USE_SHARED_MEMORY: False
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.SHARED_MEMORY_FILE_LIMIT: 35000
2023-11-21 08:23:18   INFO  
cfg.DATA_CONFIG.DATA_AUGMENTOR = edict()
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.DISABLE_AUG_LIST: ['placeholder']
2023-11-21 08:23:18   INFO  
cfg.DATA_CONFIG.POINT_FEATURE_ENCODING = edict()
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.encoding_type: absolute_coordinates_encoding
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.used_feature_list: ['x', 'y', 'z', 'intensity', 'elongation']
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.src_feature_list: ['x', 'y', 'z', 'intensity', 'elongation']
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG.DATA_PROCESSOR: [{'NAME': 'mask_points_and_boxes_outside_range', 'REMOVE_OUTSIDE_BOXES': True}, {'NAME': 'shuffle_points', 'SHUFFLE_ENABLED': {'train': True, 'test': False}}, {'NAME': 'transform_points_to_voxels_placeholder', 'VOXEL_SIZE': [0.32, 0.32, 6]}]
2023-11-21 08:23:18   INFO  cfg.DATA_CONFIG._BASE_CONFIG_: cfgs/dataset_configs/waymo_dataset.yaml
2023-11-21 08:23:18   INFO  
cfg.MODEL = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.NAME: PICTURE
2023-11-21 08:23:18   INFO  
cfg.MODEL.VFE = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.VFE.NAME: DynPillarVFE3D
2023-11-21 08:23:18   INFO  cfg.MODEL.VFE.WITH_DISTANCE: False
2023-11-21 08:23:18   INFO  cfg.MODEL.VFE.USE_ABSLOTE_XYZ: True
2023-11-21 08:23:18   INFO  cfg.MODEL.VFE.USE_NORM: True
2023-11-21 08:23:18   INFO  cfg.MODEL.VFE.NUM_FILTERS: [192, 192]
2023-11-21 08:23:18   INFO  
cfg.MODEL.BACKBONE_3D = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.NAME: DSVTBackboneMAE
2023-11-21 08:23:18   INFO  
cfg.MODEL.BACKBONE_3D.INPUT_LAYER = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.sparse_shape: [ 468, 468, 32 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.downsample_stride: [ [ 1, 1, 4 ], [ 1, 1, 4 ], [ 1, 1, 2 ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.d_model: [ 192, 192, 192, 192 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.window_shape: [ [ 12, 12, 32 ], [ 12, 12, 8 ], [ 12, 12, 2 ], [ 12, 12, 1 ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.hybrid_factor: [2, 2, 1]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.shifts_list: [ [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.normalize_pos: False
2023-11-21 08:23:18   INFO  
cfg.MODEL.BACKBONE_3D.MASK_CONFIG = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.n_clusters: 8
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.n_partition: [3, 3, 2]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.lambda_threshold: 0.6
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.base_mask_ratio: [0.9, 0.45, 0]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.NUM_SEAL_FEATURES: 64
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.GENERATE_MODE: offline
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.block_name: [ 'DSVTBlock','DSVTBlock','DSVTBlock','DSVTBlock' ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.d_model: [ 192, 192, 192, 192 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.nhead: [ 8, 8, 8, 8 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [ 384, 384, 384, 384 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [468, 468]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 192
2023-11-21 08:23:18   INFO  
cfg.MODEL.BACKBONE_2D = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.NAME: LightDecoder
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.sparse_shape: [ 468, 468, 32 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.downsample_stride: [ [ 1, 1, 4 ], [ 1, 1, 4 ], [ 1, 1, 2 ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.d_model: [ 192, 192 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.set_info: [ [ 48, 1 ], [ 48, 1 ]]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.window_shape: [ [ 12, 12, 32 ], [ 12, 12, 1 ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.hybrid_factor: [ 2, 2, 1 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.shifts_list: [ [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.shifts_list: normalize_pos: False
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.INPUT_SHAPE: [ 468, 468, 32 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_2D.NUM_BEV_FEATURES: 192
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.block_name: [ 'DSVTBlock','DSVTBlock' ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.set_info: [ [ 48, 1 ], [ 48, 1 ] ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.d_model: [ 192, 192 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.nhead: [ 8, 8 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [ 384, 384 ]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [468, 468]
2023-11-21 08:23:18   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 192
2023-11-21 08:23:18   INFO  
cfg.MODEL.DENSE_HEAD = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.DENSE_HEAD.NAME: PretrainHead3D
2023-11-21 08:23:18   INFO  cfg.MODEL.DENSE_HEAD.CLASS_AGNOSTIC: False
2023-11-21 08:23:18   INFO  
cfg.MODEL.DENSE_HEAD.MASK_CONFIG = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.DENSE_HEAD.MASK_CONFIG.NUM_PRD_POINTS: 16
2023-11-21 08:23:18   INFO  cfg.MODEL.DENSE_HEAD.MASK_CONFIG.NUM_GT_POINTS: 64
2023-11-21 08:23:18   INFO  cfg.MODEL.DENSE_HEAD.INPUT_SHAPE: [468, 468, 32]
2023-11-21 08:23:18   INFO  cfg.MODEL.DENSE_HEAD.NUM_MINK_FEATURES: 64
2023-11-21 08:23:18   INFO  cfg.MODEL.DENSE_HEAD.LOSS_WEIGHT: [1.0, 3.0]
2023-11-21 08:23:18   INFO  cfg.MODEL.DENSE_HEAD.GENERATE_MODE: offline
2023-11-21 08:23:18   INFO  
cfg.MODEL.POST_PROCESSING = edict()
2023-11-21 08:23:18   INFO  cfg.MODEL.POST_PROCESSING: None
2023-11-21 08:23:18   INFO  
cfg.OPTIMIZATION = edict()
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.BATCH_SIZE_PER_GPU: 3
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.NUM_EPOCHS: 30
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.OPTIMIZER: adamw
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.LR: 0.001
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.WEIGHT_DECAY: 0.05
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.MOMENTUM: 0.9
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.MOMS: [0.95, 0.85]
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.PCT_START: 0.1
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.DIV_FACTOR: 100
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.DECAY_STEP_LIST: [35, 45]
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.LR_DECAY: 0.1
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.LR_CLIP: 1e-07
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.LR_WARMUP: False
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.WARMUP_EPOCH: 1
2023-11-21 08:23:18   INFO  cfg.OPTIMIZATION.GRAD_NORM_CLIP: 10
2023-11-21 08:23:18   INFO  cfg.TAG: picture_waymo_ssl_seal_decoder_mask
2023-11-21 08:23:18   INFO  cfg.EXP_GROUP_PATH: picture_models
2023-11-21 08:23:18   INFO  Loading Waymo dataset
2023-11-21 08:23:24   INFO  Total skipped info 0
2023-11-21 08:23:24   INFO  Total samples for Waymo dataset: 158081
2023-11-21 08:23:24   INFO  Total sampled samples for Waymo dataset: 158081
2023-11-21 08:23:27   INFO  PICTURE(
  (vfe): DynamicPillarVFE_3d(
    (pfn_layers): ModuleList(
      (0): PFNLayerV2(
        (linear): Linear(in_features=11, out_features=64, bias=False)
        (norm): BatchNorm1d(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (relu): ReLU()
      )
      (1): PFNLayerV2(
        (linear): Linear(in_features=192, out_features=192, bias=False)
        (norm): BatchNorm1d(192, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (relu): ReLU()
      )
    )
  )
  (backbone_3d): DSVTBackboneMAE(
    (input_layer): DSVTInputLayer(
      (posembed_layers): ModuleList(
        (0): ModuleList(
          (0): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (1): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (2): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (3): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
        )
      )
    )
    (stage_0): ModuleList(
      (0): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (1): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (2): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (3): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
    )
    (residual_norm_stage_0): ModuleList(
      (0): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (3): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
    )
  )
  (map_to_bev_module): None
  (pfe): None
  (backbone_2d): LightDecoder(
    (input_layer): DSVTInputLayer(
      (posembed_layers): ModuleList(
        (0): ModuleList(
          (0): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (1): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
        )
      )
    )
    (stage_0): ModuleList(
      (0): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (1): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
    )
    (residual_norm_stage_0): ModuleList(
      (0): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
    )
  )
  (dense_head): PretrainHead3D(
    (decoder_pred): Linear(in_features=192, out_features=48, bias=True)
    (decoder_seal): Linear(in_features=192, out_features=64, bias=True)
    (seal_loss): SmoothL1Loss()
  )
  (point_head): None
  (roi_head): None
)
2023-11-21 08:24:51   INFO  Total number of parameters: 6514608
2023-11-21 08:24:51   INFO  **********************Start training picture_models/picture_waymo_ssl_seal_decoder_mask(offline_30e)**********************
2023-11-21 08:25:20   INFO  epoch: 0/30, acc_iter=50, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:50:49, time_cost(all): 0:00:49/2 days, 7:47:43, loss=3.135674308603753, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=1.5696450819999839, lr=1.1138606345833e-05
2023-11-21 08:26:09   INFO  epoch: 0/30, acc_iter=100, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:44:29, time_cost(all): 0:01:38/2 days, 7:42:06, loss=3.002217766045661, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=2.8402810500802294, lr=1.2277212691665e-05
2023-11-21 08:26:58   INFO  epoch: 0/30, acc_iter=150, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:48:27, time_cost(all): 0:02:27/2 days, 7:52:10, loss=2.868761223487569, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=4.142486976124051, lr=1.3415819037498e-05
2023-11-21 08:27:47   INFO  epoch: 0/30, acc_iter=200, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:42:51, time_cost(all): 0:03:16/2 days, 3:53:03, loss=2.735304680929477, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=4.008548212467337, lr=1.4554425383331e-05
2023-11-21 08:28:36   INFO  epoch: 0/30, acc_iter=250, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:44:53, time_cost(all): 0:04:05/2 days, 6:23:12, loss=2.601848138371384, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=1.9113882261138082, lr=1.5693031729164e-05
2023-11-21 08:29:25   INFO  epoch: 0/30, acc_iter=300, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:40:32, time_cost(all): 0:04:54/2 days, 3:39:32, loss=2.468391595813292, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=2.7312406447311015, lr=1.6831638074996e-05
2023-11-21 08:30:14   INFO  epoch: 0/30, acc_iter=350, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:43:03, time_cost(all): 0:05:43/2 days, 4:09:59, loss=2.3349350532552, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=2.943052518422528, lr=1.7970244420829e-05
2023-11-21 08:31:03   INFO  epoch: 0/30, acc_iter=400, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:39:45, time_cost(all): 0:06:32/2 days, 3:22:26, loss=2.201478510697108, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=1.0762411002467702, lr=1.9108850766662e-05
2023-11-21 08:31:53   INFO  epoch: 0/30, acc_iter=450, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:36:32, time_cost(all): 0:07:22/2 days, 3:58:00, loss=2.068021968139015, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=3.8912868224368378, lr=2.0247457112494e-05
2023-11-21 08:32:42   INFO  epoch: 0/30, acc_iter=500, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:41:59, time_cost(all): 0:08:11/2 days, 4:11:53, loss=1.934565425580923, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=4.406637336941177, lr=2.1386063458327e-05
2023-11-21 08:33:31   INFO  epoch: 0/30, acc_iter=550, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:40:38, time_cost(all): 0:09:00/2 days, 7:14:56, loss=1.80110888302283, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=2.6382355317630233, lr=2.252466980416e-05
2023-11-21 08:34:20   INFO  epoch: 0/30, acc_iter=600, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:35:40, time_cost(all): 0:09:49/2 days, 3:55:39, loss=1.667652340464738, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=4.8921846457771405, lr=2.3663276149992e-05
2023-11-21 08:35:09   INFO  epoch: 0/30, acc_iter=650, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:33:39, time_cost(all): 0:10:38/2 days, 5:51:23, loss=1.534195797906646, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=2.2551751597421807, lr=2.4801882495825e-05
2023-11-21 08:35:58   INFO  epoch: 0/30, acc_iter=700, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:35:55, time_cost(all): 0:11:27/2 days, 3:41:43, loss=1.400739255348554, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=2.123299026073647, lr=2.5940488841658e-05
2023-11-21 08:36:47   INFO  epoch: 0/30, acc_iter=750, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:34:10, time_cost(all): 0:12:16/2 days, 4:20:47, loss=1.267282712790462, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=2.615732339919659, lr=2.7079095187491e-05
2023-11-21 08:37:36   INFO  epoch: 0/30, acc_iter=800, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:35:10, time_cost(all): 0:13:05/2 days, 3:42:53, loss=1.133826170232369, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=4.788377785984305, lr=2.8217701533323e-05
2023-11-21 08:38:25   INFO  epoch: 0/30, acc_iter=850, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:34:42, time_cost(all): 0:13:54/2 days, 5:30:21, loss=1.000369627674277, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=2.345185549432604, lr=2.9356307879156e-05
2023-11-21 08:39:15   INFO  epoch: 0/30, acc_iter=900, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:31:56, time_cost(all): 0:14:44/2 days, 3:12:11, loss=0.866913085116185, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=2.941188798813134, lr=3.0494914224989e-05
2023-11-21 08:40:04   INFO  epoch: 0/30, acc_iter=950, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:32:45, time_cost(all): 0:15:33/2 days, 7:15:47, loss=0.733456542558093, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=3.9539276924036666, lr=3.1633520570821e-05
2023-11-21 08:40:53   INFO  epoch: 0/30, acc_iter=1000, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:27:46, time_cost(all): 0:16:22/2 days, 6:15:13, loss=0.659394201052072, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=2.4123928229412877, lr=3.2772126916654e-05
2023-11-21 08:41:42   INFO  epoch: 0/30, acc_iter=1050, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:32:38, time_cost(all): 0:17:11/2 days, 5:29:59, loss=0.599916886013691, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=0.6041153655555894, lr=3.3910733262487e-05
2023-11-21 08:42:31   INFO  epoch: 0/30, acc_iter=1100, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:29:48, time_cost(all): 0:18:00/2 days, 5:07:49, loss=0.599833772027382, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=4.499310836924016, lr=3.5049339608319e-05
2023-11-21 08:43:20   INFO  epoch: 0/30, acc_iter=1150, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:29:40, time_cost(all): 0:18:49/2 days, 4:28:03, loss=0.599750658041073, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=3.804450691714214, lr=3.6187945954152e-05
2023-11-21 08:44:09   INFO  epoch: 0/30, acc_iter=1200, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:31:19, time_cost(all): 0:19:38/2 days, 4:23:11, loss=0.599667544054764, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=3.939102382039244, lr=3.7326552299985e-05
2023-11-21 08:44:58   INFO  epoch: 0/30, acc_iter=1250, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:24:48, time_cost(all): 0:20:27/2 days, 7:26:06, loss=0.599584430068455, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=2.8862100803722077, lr=3.8465158645818e-05
2023-11-21 08:45:48   INFO  epoch: 0/30, acc_iter=1300, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:29:29, time_cost(all): 0:21:17/2 days, 7:26:42, loss=0.599501316082146, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=4.370798567277446, lr=3.960376499165e-05
2023-11-21 08:46:37   INFO  epoch: 0/30, acc_iter=1350, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:25:26, time_cost(all): 0:22:06/2 days, 5:57:51, loss=0.599418202095837, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=0.8963356751660546, lr=4.0742371337483e-05
2023-11-21 08:47:26   INFO  epoch: 0/30, acc_iter=1400, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:25:44, time_cost(all): 0:22:55/2 days, 6:59:34, loss=0.599335088109528, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=3.3384788259581675, lr=4.1880977683316e-05
2023-11-21 08:48:15   INFO  epoch: 0/30, acc_iter=1450, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:26:34, time_cost(all): 0:23:44/2 days, 5:54:33, loss=0.599251974123218, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=1.4446982135978148, lr=4.3019584029148e-05
2023-11-21 08:49:04   INFO  epoch: 0/30, acc_iter=1500, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:23:54, time_cost(all): 0:24:33/2 days, 2:54:55, loss=0.599168860136909, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=3.041897109252311, lr=4.4158190374981e-05
2023-11-21 08:49:53   INFO  epoch: 0/30, acc_iter=1550, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:21:26, time_cost(all): 0:25:22/2 days, 3:41:20, loss=0.5990857461506, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=1.7811298481990574, lr=4.5296796720814e-05
2023-11-21 08:50:42   INFO  epoch: 0/30, acc_iter=1600, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:22:23, time_cost(all): 0:26:11/2 days, 3:27:09, loss=0.599002632164291, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=3.093367062897806, lr=4.6435403066646e-05
2023-11-21 08:51:31   INFO  epoch: 0/30, acc_iter=1650, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:22:11, time_cost(all): 0:27:00/2 days, 6:27:22, loss=0.598919518177982, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=1.1318831779034777, lr=4.7574009412479e-05
2023-11-21 08:52:20   INFO  epoch: 0/30, acc_iter=1700, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:16:35, time_cost(all): 0:27:49/2 days, 6:46:14, loss=0.598836404191673, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=1.3421387907200037, lr=4.8712615758312e-05
2023-11-21 08:53:10   INFO  epoch: 0/30, acc_iter=1750, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:20:11, time_cost(all): 0:28:39/2 days, 6:02:08, loss=0.598753290205364, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=2.3050827035164936, lr=4.9851222104145e-05
2023-11-21 08:53:59   INFO  epoch: 0/30, acc_iter=1800, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:19:21, time_cost(all): 0:29:28/2 days, 6:04:23, loss=0.598670176219055, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=3.528468537152197, lr=5.0989828449977e-05
2023-11-21 08:54:48   INFO  epoch: 0/30, acc_iter=1850, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:14:54, time_cost(all): 0:30:17/2 days, 7:53:35, loss=0.598587062232746, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=4.392303706906163, lr=5.212843479581e-05
2023-11-21 08:55:37   INFO  epoch: 0/30, acc_iter=1900, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:15:06, time_cost(all): 0:31:06/2 days, 3:15:56, loss=0.598503948246437, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=0.8122540489538684, lr=5.3267041141643e-05
2023-11-21 08:56:26   INFO  epoch: 0/30, acc_iter=1950, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:16:57, time_cost(all): 0:31:55/2 days, 2:48:19, loss=0.598420834260128, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=2.5394163887911243, lr=5.4405647487475e-05
2023-11-21 08:57:15   INFO  epoch: 0/30, acc_iter=2000, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:18:38, time_cost(all): 0:32:44/2 days, 4:04:15, loss=0.598337720273819, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=0.8152402704716304, lr=5.5544253833308e-05
2023-11-21 08:58:04   INFO  epoch: 0/30, acc_iter=2050, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:12:42, time_cost(all): 0:33:33/2 days, 7:05:58, loss=0.59825460628751, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=2.558772154639512, lr=5.6682860179141e-05
2023-11-21 08:58:53   INFO  epoch: 0/30, acc_iter=2100, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:09:52, time_cost(all): 0:34:22/2 days, 4:17:33, loss=0.598171492301201, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=2.2411956362376206, lr=5.7821466524973e-05
2023-11-21 08:59:43   INFO  epoch: 0/30, acc_iter=2150, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:13:38, time_cost(all): 0:35:12/2 days, 5:22:58, loss=0.598088378314891, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=3.684878810557178, lr=5.8960072870806e-05
2023-11-21 09:00:32   INFO  epoch: 0/30, acc_iter=2200, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:11:37, time_cost(all): 0:36:01/2 days, 4:53:48, loss=0.598005264328582, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=1.724406105432216, lr=6.0098679216639e-05
2023-11-21 09:01:21   INFO  epoch: 0/30, acc_iter=2250, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:07:37, time_cost(all): 0:36:50/2 days, 3:36:37, loss=0.597922150342273, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=3.6811950688748123, lr=6.1237285562472e-05
2023-11-21 09:02:10   INFO  epoch: 0/30, acc_iter=2300, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:08:54, time_cost(all): 0:37:39/2 days, 4:13:11, loss=0.597839036355964, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=4.487042832020864, lr=6.2375891908304e-05
2023-11-21 09:02:59   INFO  epoch: 0/30, acc_iter=2350, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:11:43, time_cost(all): 0:38:28/2 days, 6:09:31, loss=0.597755922369655, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=2.6233434807173817, lr=6.3514498254137e-05
2023-11-21 09:03:48   INFO  epoch: 0/30, acc_iter=2400, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:08:33, time_cost(all): 0:39:17/2 days, 7:22:52, loss=0.597672808383346, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=3.691308443321398, lr=6.465310459997e-05
2023-11-21 09:04:37   INFO  epoch: 0/30, acc_iter=2450, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:08:26, time_cost(all): 0:40:06/2 days, 4:39:48, loss=0.597589694397037, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=0.5250032818529469, lr=6.5791710945802e-05
2023-11-21 09:05:26   INFO  epoch: 0/30, acc_iter=2500, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:07:42, time_cost(all): 0:40:55/2 days, 3:56:21, loss=0.597506580410728, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=1.8011059326684635, lr=6.6930317291635e-05
2023-11-21 09:06:15   INFO  epoch: 0/30, acc_iter=2550, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:06:43, time_cost(all): 0:41:44/2 days, 7:17:32, loss=0.597423466424419, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=2.587740515338278, lr=6.8068923637468e-05
2023-11-21 09:07:05   INFO  epoch: 0/30, acc_iter=2600, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:03:26, time_cost(all): 0:42:34/2 days, 7:33:33, loss=0.59734035243811, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=2.839171341462575, lr=6.92075299833e-05
2023-11-21 09:07:54   INFO  epoch: 0/30, acc_iter=2650, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:06:29, time_cost(all): 0:43:23/2 days, 5:53:11, loss=0.597257238451801, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=2.7217434130093885, lr=7.0346136329133e-05
2023-11-21 09:08:43   INFO  epoch: 0/30, acc_iter=2700, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:00:39, time_cost(all): 0:44:12/2 days, 7:39:15, loss=0.597174124465492, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=4.123147410331604, lr=7.1484742674966e-05
2023-11-21 09:09:32   INFO  epoch: 0/30, acc_iter=2750, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:00:56, time_cost(all): 0:45:01/2 days, 5:53:33, loss=0.597091010479183, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=2.5350124880794596, lr=7.2623349020799e-05
2023-11-21 09:10:21   INFO  epoch: 0/30, acc_iter=2800, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:02:12, time_cost(all): 0:45:50/2 days, 3:18:37, loss=0.597007896492874, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=2.9733741592675242, lr=7.3761955366631e-05
2023-11-21 09:11:10   INFO  epoch: 0/30, acc_iter=2850, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:01:31, time_cost(all): 0:46:39/2 days, 6:24:11, loss=0.596924782506565, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=2.4569369383582123, lr=7.4900561712464e-05
2023-11-21 09:11:59   INFO  epoch: 0/30, acc_iter=2900, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:58:37, time_cost(all): 0:47:28/2 days, 5:28:15, loss=0.596841668520256, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=3.732076403268079, lr=7.6039168058297e-05
2023-11-21 09:12:48   INFO  epoch: 0/30, acc_iter=2950, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:58:53, time_cost(all): 0:48:17/2 days, 6:30:33, loss=0.596758554533947, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=0.9359324021818366, lr=7.7177774404129e-05
2023-11-21 09:13:38   INFO  epoch: 0/30, acc_iter=3000, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:59:44, time_cost(all): 0:49:07/2 days, 7:30:09, loss=0.596675440547637, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=4.048409340174693, lr=7.8316380749962e-05
2023-11-21 09:14:27   INFO  epoch: 0/30, acc_iter=3050, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:55:55, time_cost(all): 0:49:56/2 days, 7:01:18, loss=0.596592326561328, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=2.2806952700218104, lr=7.9454987095795e-05
2023-11-21 09:15:16   INFO  epoch: 0/30, acc_iter=3100, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:05, time_cost(all): 0:50:45/2 days, 6:19:24, loss=0.596509212575019, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.511846729643406, lr=8.0593593441627e-05
2023-11-21 09:16:05   INFO  epoch: 0/30, acc_iter=3150, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:12, time_cost(all): 0:51:34/2 days, 7:34:20, loss=0.59642609858871, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=2.21769590579588, lr=8.173219978746e-05
2023-11-21 09:16:54   INFO  epoch: 0/30, acc_iter=3200, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:52:47, time_cost(all): 0:52:23/2 days, 6:17:31, loss=0.596342984602401, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=0.8077068609878952, lr=8.2870806133293e-05
2023-11-21 09:17:43   INFO  epoch: 0/30, acc_iter=3250, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:54:49, time_cost(all): 0:53:12/2 days, 2:39:10, loss=0.596259870616092, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.23(1.03), norm=3.8188932059956437, lr=8.4009412479126e-05
2023-11-21 09:18:32   INFO  epoch: 0/30, acc_iter=3300, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:52:11, time_cost(all): 0:54:01/2 days, 4:10:17, loss=0.596176756629783, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=1.0386403946984275, lr=8.5148018824958e-05
2023-11-21 09:19:21   INFO  epoch: 0/30, acc_iter=3350, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:54:01, time_cost(all): 0:54:50/2 days, 7:39:20, loss=0.596093642643474, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=3.836004463419619, lr=8.6286625170791e-05
2023-11-21 09:20:10   INFO  epoch: 0/30, acc_iter=3400, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:50:25, time_cost(all): 0:55:39/2 days, 4:31:19, loss=0.596010528657165, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.92(1.03), norm=3.2178783750166664, lr=8.7425231516624e-05
2023-11-21 09:21:00   INFO  epoch: 0/30, acc_iter=3450, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:48, time_cost(all): 0:56:29/2 days, 7:02:19, loss=0.595927414670856, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.05(1.03), norm=3.0216637859111732, lr=8.8563837862456e-05
2023-11-21 09:21:49   INFO  epoch: 0/30, acc_iter=3500, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:52:22, time_cost(all): 0:57:18/2 days, 3:07:18, loss=0.595844300684547, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=3.4061361428225423, lr=8.9702444208289e-05
2023-11-21 09:22:38   INFO  epoch: 0/30, acc_iter=3550, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:51:22, time_cost(all): 0:58:07/2 days, 6:25:52, loss=0.595761186698238, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=1.580374743697785, lr=9.0841050554122e-05
2023-11-21 09:23:27   INFO  epoch: 0/30, acc_iter=3600, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:47:12, time_cost(all): 0:58:56/2 days, 6:08:01, loss=0.595678072711929, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=3.01840633432452, lr=9.1979656899954e-05
2023-11-21 09:24:16   INFO  epoch: 0/30, acc_iter=3650, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:46:05, time_cost(all): 0:59:45/2 days, 6:59:43, loss=0.59559495872562, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=4.8864115870788, lr=9.3118263245787e-05
2023-11-21 09:25:05   INFO  epoch: 0/30, acc_iter=3700, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:46:39, time_cost(all): 1:00:34/2 days, 3:00:45, loss=0.595511844739311, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=0.9063216670232465, lr=9.425686959162e-05
2023-11-21 09:25:54   INFO  epoch: 0/30, acc_iter=3750, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:44:35, time_cost(all): 1:01:23/2 days, 3:41:31, loss=0.595428730753002, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=1.0713824668197043, lr=9.5395475937453e-05
2023-11-21 09:26:43   INFO  epoch: 0/30, acc_iter=3800, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:45:32, time_cost(all): 1:02:12/2 days, 5:51:15, loss=0.595345616766693, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=3.837588903085734, lr=9.6534082283285e-05
2023-11-21 09:27:33   INFO  epoch: 0/30, acc_iter=3850, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:44:00, time_cost(all): 1:03:02/2 days, 3:57:52, loss=0.595262502780383, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=2.018367833908765, lr=9.7672688629118e-05
2023-11-21 09:28:22   INFO  epoch: 0/30, acc_iter=3900, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:16, time_cost(all): 1:03:51/2 days, 7:20:18, loss=0.595179388794074, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=1.0380735339189988, lr=9.8811294974951e-05
2023-11-21 09:29:11   INFO  epoch: 0/30, acc_iter=3950, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:43:27, time_cost(all): 1:04:40/2 days, 6:36:50, loss=0.595096274807765, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=1.6611713794257363, lr=9.9949901320783e-05
2023-11-21 09:30:00   INFO  epoch: 0/30, acc_iter=4000, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:40:49, time_cost(all): 1:05:29/2 days, 7:14:48, loss=0.595013160821456, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=3.3484022777733933, lr=0.00010272126916654
2023-11-21 09:30:49   INFO  epoch: 0/30, acc_iter=4050, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:40:22, time_cost(all): 1:06:18/2 days, 5:21:26, loss=0.594930046835147, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=4.344411418847425, lr=0.000105567785031122
2023-11-21 09:31:38   INFO  epoch: 0/30, acc_iter=4100, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:39:40, time_cost(all): 1:07:07/2 days, 4:02:24, loss=0.594846932848838, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=0.9899295638273082, lr=0.000108414300895704
2023-11-21 09:32:27   INFO  epoch: 0/30, acc_iter=4150, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:28, time_cost(all): 1:07:56/2 days, 3:22:58, loss=0.594763818862529, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=3.4670187777131614, lr=0.000111260816760285
2023-11-21 09:33:16   INFO  epoch: 0/30, acc_iter=4200, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:38:47, time_cost(all): 1:08:45/2 days, 7:24:18, loss=0.59468070487622, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=2.5436910259093963, lr=0.000114107332624867
2023-11-21 09:34:05   INFO  epoch: 0/30, acc_iter=4250, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:33, time_cost(all): 1:09:34/2 days, 3:40:52, loss=0.594597590889911, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=1.889032399306615, lr=0.000116953848489449
2023-11-21 09:34:55   INFO  epoch: 0/30, acc_iter=4300, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:37, time_cost(all): 1:10:24/2 days, 2:33:26, loss=0.594514476903602, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=4.505156390356472, lr=0.000119800364354031
2023-11-21 09:35:44   INFO  epoch: 0/30, acc_iter=4350, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:26, time_cost(all): 1:11:13/2 days, 3:00:43, loss=0.594431362917293, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=3.3978204048189076, lr=0.000122646880218612
2023-11-21 09:36:33   INFO  epoch: 0/30, acc_iter=4400, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:35, time_cost(all): 1:12:02/2 days, 2:15:54, loss=0.594348248930984, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=3.389236172802948, lr=0.000125493396083194
2023-11-21 09:37:22   INFO  epoch: 0/30, acc_iter=4450, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:39, time_cost(all): 1:12:51/2 days, 5:19:48, loss=0.594265134944675, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=4.105902205639977, lr=0.000128339911947776
2023-11-21 09:38:11   INFO  epoch: 0/30, acc_iter=4500, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:19, time_cost(all): 1:13:40/2 days, 6:55:14, loss=0.594182020958366, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=0.8335327223148863, lr=0.000131186427812358
2023-11-21 09:39:00   INFO  epoch: 0/30, acc_iter=4550, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:30, time_cost(all): 1:14:29/2 days, 3:27:44, loss=0.594098906972056, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=4.697250690641236, lr=0.000134032943676939
2023-11-21 09:39:49   INFO  epoch: 0/30, acc_iter=4600, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:58, time_cost(all): 1:15:18/2 days, 2:36:56, loss=0.594015792985747, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=2.431305500403279, lr=0.000136879459541521
2023-11-21 09:40:38   INFO  epoch: 0/30, acc_iter=4650, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:37, time_cost(all): 1:16:07/2 days, 3:08:41, loss=0.593932678999438, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=3.715847327393116, lr=0.000139725975406103
2023-11-21 09:41:28   INFO  epoch: 0/30, acc_iter=4700, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:32:23, time_cost(all): 1:16:57/2 days, 6:43:25, loss=0.593849565013129, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=3.6585209949877875, lr=0.000142572491270685
2023-11-21 09:42:17   INFO  epoch: 0/30, acc_iter=4750, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:07, time_cost(all): 1:17:46/2 days, 7:13:25, loss=0.59376645102682, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=1.7055501810963196, lr=0.000145419007135266
2023-11-21 09:43:06   INFO  epoch: 0/30, acc_iter=4800, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:35, time_cost(all): 1:18:35/2 days, 4:19:17, loss=0.593683337040511, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=2.0931044413599347, lr=0.000148265522999848
2023-11-21 09:43:55   INFO  epoch: 0/30, acc_iter=4850, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:45, time_cost(all): 1:19:24/2 days, 6:09:12, loss=0.593600223054202, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=4.273337257857058, lr=0.00015111203886443
2023-11-21 09:44:44   INFO  epoch: 0/30, acc_iter=4900, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:09, time_cost(all): 1:20:13/2 days, 6:27:52, loss=0.593517109067893, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=4.971464811300947, lr=0.000153958554729012
2023-11-21 09:45:33   INFO  epoch: 0/30, acc_iter=4950, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:33, time_cost(all): 1:21:02/2 days, 6:43:05, loss=0.593433995081584, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=0.5247055503479952, lr=0.000156805070593593
2023-11-21 09:46:22   INFO  epoch: 0/30, acc_iter=5000, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:25, time_cost(all): 1:21:51/2 days, 5:07:38, loss=0.593350881095275, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=0.9788802258418006, lr=0.000159651586458175
2023-11-21 09:47:11   INFO  epoch: 0/30, acc_iter=5050, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:05, time_cost(all): 1:22:40/2 days, 6:13:15, loss=0.593267767108966, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=4.538299749671498, lr=0.000162498102322757
2023-11-21 09:48:00   INFO  epoch: 0/30, acc_iter=5100, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:25:24, time_cost(all): 1:23:29/2 days, 6:06:01, loss=0.593184653122657, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=0.9625289889333297, lr=0.000165344618187339
2023-11-21 09:48:50   INFO  epoch: 0/30, acc_iter=5150, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:46, time_cost(all): 1:24:19/2 days, 3:12:29, loss=0.593101539136348, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=2.3438470667967835, lr=0.00016819113405192
2023-11-21 09:49:39   INFO  epoch: 0/30, acc_iter=5200, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:39, time_cost(all): 1:25:08/2 days, 5:29:43, loss=0.593018425150039, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=1.0769949955248057, lr=0.000171037649916502
2023-11-21 09:50:28   INFO  epoch: 0/30, acc_iter=5250, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:25, time_cost(all): 1:25:57/2 days, 1:58:41, loss=0.59293531116373, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=4.96668244413241, lr=0.000173884165781084
2023-11-21 09:51:17   INFO  epoch: 0/30, acc_iter=5300, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:33, time_cost(all): 1:26:46/2 days, 2:33:05, loss=0.592852197177421, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=2.882146548736667, lr=0.000176730681645666
2023-11-21 09:52:06   INFO  epoch: 0/30, acc_iter=5350, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:21:00, time_cost(all): 1:27:35/2 days, 5:55:36, loss=0.592769083191112, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=4.531556624797033, lr=0.000179577197510247
2023-11-21 09:52:55   INFO  epoch: 0/30, acc_iter=5400, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:14, time_cost(all): 1:28:24/2 days, 3:21:16, loss=0.592685969204802, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=2.533025809463747, lr=0.000182423713374829
2023-11-21 09:53:44   INFO  epoch: 0/30, acc_iter=5450, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:17:56, time_cost(all): 1:29:13/2 days, 2:48:14, loss=0.592602855218493, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=3.5098501785567704, lr=0.000185270229239411
2023-11-21 09:54:33   INFO  epoch: 0/30, acc_iter=5500, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:05, time_cost(all): 1:30:02/2 days, 2:21:26, loss=0.592519741232184, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=0.5608344237366428, lr=0.000188116745103993
2023-11-21 09:55:23   INFO  epoch: 0/30, acc_iter=5550, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:27, time_cost(all): 1:30:52/2 days, 4:13:12, loss=0.592436627245875, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.2669581070549762, lr=0.000190963260968574
2023-11-21 09:56:12   INFO  epoch: 0/30, acc_iter=5600, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:24, time_cost(all): 1:31:41/2 days, 6:51:52, loss=0.592353513259566, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=0.8352378982745573, lr=0.000193809776833156
2023-11-21 09:57:01   INFO  epoch: 0/30, acc_iter=5650, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:14:38, time_cost(all): 1:32:30/2 days, 3:56:27, loss=0.592270399273257, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=2.7480404395226947, lr=0.000196656292697738
2023-11-21 09:57:50   INFO  epoch: 0/30, acc_iter=5700, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:26, time_cost(all): 1:33:19/2 days, 2:57:18, loss=0.592187285286948, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=3.5838636613252315, lr=0.00019950280856232
2023-11-21 09:58:39   INFO  epoch: 0/30, acc_iter=5750, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:16, time_cost(all): 1:34:08/2 days, 2:09:29, loss=0.592104171300639, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.236978737578861, lr=0.000202349324426901
2023-11-21 09:59:28   INFO  epoch: 0/30, acc_iter=5800, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:41, time_cost(all): 1:34:57/2 days, 3:33:28, loss=0.59202105731433, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=0.6707534159664956, lr=0.000205195840291483
2023-11-21 10:00:17   INFO  epoch: 0/30, acc_iter=5850, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:47, time_cost(all): 1:35:46/2 days, 3:49:46, loss=0.591937943328021, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=1.8670851750769533, lr=0.000208042356156065
2023-11-21 10:01:06   INFO  epoch: 0/30, acc_iter=5900, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:46, time_cost(all): 1:36:35/2 days, 1:57:07, loss=0.591854829341712, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=2.92214051893763, lr=0.000210888872020647
2023-11-21 10:01:55   INFO  epoch: 0/30, acc_iter=5950, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:36, time_cost(all): 1:37:24/2 days, 5:33:12, loss=0.591771715355403, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=3.540786553933639, lr=0.000213735387885228
2023-11-21 10:02:45   INFO  epoch: 0/30, acc_iter=6000, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:20, time_cost(all): 1:38:14/2 days, 1:44:18, loss=0.591688601369094, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=3.8651118722482227, lr=0.00021658190374981
2023-11-21 10:03:34   INFO  epoch: 0/30, acc_iter=6050, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:59, time_cost(all): 1:39:03/2 days, 5:14:39, loss=0.591605487382785, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=3.003062553709305, lr=0.000219428419614392
2023-11-21 10:04:23   INFO  epoch: 0/30, acc_iter=6100, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:05, time_cost(all): 1:39:52/2 days, 6:51:46, loss=0.591522373396476, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=2.2401853170872856, lr=0.000222274935478974
2023-11-21 10:05:12   INFO  epoch: 0/30, acc_iter=6150, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:29, time_cost(all): 1:40:41/2 days, 2:22:30, loss=0.591439259410167, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=4.959288604995528, lr=0.000225121451343555
2023-11-21 10:06:01   INFO  epoch: 0/30, acc_iter=6200, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:11, time_cost(all): 1:41:30/2 days, 3:06:25, loss=0.591356145423857, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=4.6815290371624885, lr=0.000227967967208137
2023-11-21 10:06:50   INFO  epoch: 0/30, acc_iter=6250, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:36, time_cost(all): 1:42:19/2 days, 3:19:43, loss=0.591273031437548, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=4.997976391958783, lr=0.000230814483072719
2023-11-21 10:07:39   INFO  epoch: 0/30, acc_iter=6300, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:49, time_cost(all): 1:43:08/2 days, 3:40:16, loss=0.591189917451239, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=4.093869390479508, lr=0.000233660998937301
2023-11-21 10:08:28   INFO  epoch: 0/30, acc_iter=6350, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:46, time_cost(all): 1:43:57/2 days, 6:46:40, loss=0.59110680346493, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=4.218935752702765, lr=0.000236507514801882
2023-11-21 10:09:18   INFO  epoch: 0/30, acc_iter=6400, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:06, time_cost(all): 1:44:47/2 days, 3:37:03, loss=0.591023689478621, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=1.750603257984524, lr=0.000239354030666464
2023-11-21 10:10:07   INFO  epoch: 0/30, acc_iter=6450, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:20, time_cost(all): 1:45:36/2 days, 5:47:02, loss=0.590940575492312, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=1.5458687638338342, lr=0.000242200546531046
2023-11-21 10:10:56   INFO  epoch: 0/30, acc_iter=6500, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:21, time_cost(all): 1:46:25/2 days, 3:27:58, loss=0.590857461506003, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.9991502588199555, lr=0.000245047062395628
2023-11-21 10:11:45   INFO  epoch: 0/30, acc_iter=6550, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:36, time_cost(all): 1:47:14/2 days, 1:33:20, loss=0.590774347519694, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=4.521175616617061, lr=0.000247893578260209
2023-11-21 10:12:34   INFO  epoch: 1/30, acc_iter=6637, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:48:59, time_cost(all): 1:48:03/2 days, 5:34:11, loss=0.590629729183516, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=0.6771711803093629, lr=0.000252846515864582
2023-11-21 10:13:23   INFO  epoch: 1/30, acc_iter=6687, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:49:49, time_cost(all): 1:48:52/2 days, 5:10:31, loss=0.590546615197207, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=4.971427922449675, lr=0.000255693031729163
2023-11-21 10:14:12   INFO  epoch: 1/30, acc_iter=6737, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:48:16, time_cost(all): 1:49:41/2 days, 5:38:30, loss=0.590463501210898, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=2.534669042621627, lr=0.000258539547593745
2023-11-21 10:15:01   INFO  epoch: 1/30, acc_iter=6787, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:46:38, time_cost(all): 1:50:30/2 days, 4:04:12, loss=0.590380387224589, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=0.8645513364826657, lr=0.000261386063458327
2023-11-21 10:15:50   INFO  epoch: 1/30, acc_iter=6837, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:44:47, time_cost(all): 1:51:19/2 days, 5:36:34, loss=0.59029727323828, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=1.2688501462810242, lr=0.000264232579322909
2023-11-21 10:16:40   INFO  epoch: 1/30, acc_iter=6887, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:42:01, time_cost(all): 1:52:09/2 days, 1:30:57, loss=0.590214159251971, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=1.411406663400228, lr=0.000267079095187491
2023-11-21 10:17:29   INFO  epoch: 1/30, acc_iter=6937, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:43:41, time_cost(all): 1:52:58/2 days, 3:35:33, loss=0.590131045265662, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=3.638667839195136, lr=0.000269925611052072
2023-11-21 10:18:18   INFO  epoch: 1/30, acc_iter=6987, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:40:03, time_cost(all): 1:53:47/2 days, 3:36:35, loss=0.590047931279353, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=3.871806021725037, lr=0.000272772126916654
2023-11-21 10:19:07   INFO  epoch: 1/30, acc_iter=7037, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:41:45, time_cost(all): 1:54:36/2 days, 6:35:33, loss=0.589964817293044, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=4.858592455072674, lr=0.000275618642781236
2023-11-21 10:19:56   INFO  epoch: 1/30, acc_iter=7087, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:36:12, time_cost(all): 1:55:25/2 days, 4:44:16, loss=0.589881703306735, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=2.3842535754809684, lr=0.000278465158645817
2023-11-21 10:20:45   INFO  epoch: 1/30, acc_iter=7137, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:35:17, time_cost(all): 1:56:14/2 days, 3:28:43, loss=0.589798589320426, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=2.201015915526526, lr=0.000281311674510399
2023-11-21 10:21:34   INFO  epoch: 1/30, acc_iter=7187, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:39:13, time_cost(all): 1:57:03/2 days, 6:17:52, loss=0.589715475334117, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=4.251252654057909, lr=0.000284158190374981
2023-11-21 10:22:23   INFO  epoch: 1/30, acc_iter=7237, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:33:05, time_cost(all): 1:57:52/2 days, 6:24:20, loss=0.589632361347807, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=2.7608808902132385, lr=0.000287004706239563
2023-11-21 10:23:12   INFO  epoch: 1/30, acc_iter=7287, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:32:27, time_cost(all): 1:58:41/2 days, 3:57:39, loss=0.589549247361498, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=1.260759719288764, lr=0.000289851222104145
2023-11-21 10:24:02   INFO  epoch: 1/30, acc_iter=7337, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:39:05, time_cost(all): 1:59:31/2 days, 2:47:41, loss=0.589466133375189, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=1.6365756333095716, lr=0.000292697737968726
2023-11-21 10:24:51   INFO  epoch: 1/30, acc_iter=7387, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:37:19, time_cost(all): 2:00:20/2 days, 5:55:30, loss=0.58938301938888, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=2.593589641081145, lr=0.000295544253833308
2023-11-21 10:25:40   INFO  epoch: 1/30, acc_iter=7437, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:36:03, time_cost(all): 2:01:09/2 days, 6:13:42, loss=0.589299905402571, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=1.7157226668338925, lr=0.00029839076969789
2023-11-21 10:26:29   INFO  epoch: 1/30, acc_iter=7487, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:34:05, time_cost(all): 2:01:58/2 days, 4:30:22, loss=0.589216791416262, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=1.133282549608436, lr=0.000301237285562471
2023-11-21 10:27:18   INFO  epoch: 1/30, acc_iter=7537, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:29:03, time_cost(all): 2:02:47/2 days, 2:37:58, loss=0.589133677429953, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=4.827414978642154, lr=0.000304083801427053
2023-11-21 10:28:07   INFO  epoch: 1/30, acc_iter=7587, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:30:49, time_cost(all): 2:03:36/2 days, 6:05:24, loss=0.589050563443644, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=3.937831207054681, lr=0.000306930317291635
2023-11-21 10:28:56   INFO  epoch: 1/30, acc_iter=7637, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:34:43, time_cost(all): 2:04:25/2 days, 1:36:49, loss=0.588967449457335, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=3.482807198952718, lr=0.000309776833156217
2023-11-21 10:29:45   INFO  epoch: 1/30, acc_iter=7687, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:30:16, time_cost(all): 2:05:14/2 days, 3:11:23, loss=0.588884335471026, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=3.4656000650749954, lr=0.000312623349020799
2023-11-21 10:30:35   INFO  epoch: 1/30, acc_iter=7737, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:31:03, time_cost(all): 2:06:04/2 days, 1:29:20, loss=0.588801221484717, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=3.216949759743603, lr=0.00031546986488538
2023-11-21 10:31:24   INFO  epoch: 1/30, acc_iter=7787, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:26:16, time_cost(all): 2:06:53/2 days, 4:37:11, loss=0.588718107498408, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=0.5796740078893283, lr=0.000318316380749962
2023-11-21 10:32:13   INFO  epoch: 1/30, acc_iter=7837, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:29:32, time_cost(all): 2:07:42/2 days, 1:30:08, loss=0.588634993512099, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=0.5490645625035411, lr=0.000321162896614544
2023-11-21 10:33:02   INFO  epoch: 1/30, acc_iter=7887, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:30:46, time_cost(all): 2:08:31/2 days, 3:51:50, loss=0.58855187952579, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=4.300677050744559, lr=0.000324009412479126
2023-11-21 10:33:51   INFO  epoch: 1/30, acc_iter=7937, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:23:53, time_cost(all): 2:09:20/2 days, 3:19:37, loss=0.588468765539481, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=3.404662511302271, lr=0.000326855928343707
2023-11-21 10:34:40   INFO  epoch: 1/30, acc_iter=7987, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:25:52, time_cost(all): 2:10:09/2 days, 5:54:53, loss=0.588385651553172, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=4.337824397153051, lr=0.000329702444208289
2023-11-21 10:35:29   INFO  epoch: 1/30, acc_iter=8037, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:20:15, time_cost(all): 2:10:58/2 days, 1:09:52, loss=0.588302537566862, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=3.9953237259125904, lr=0.000332548960072871
2023-11-21 10:36:18   INFO  epoch: 1/30, acc_iter=8087, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:22:53, time_cost(all): 2:11:47/2 days, 6:03:34, loss=0.588219423580553, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=2.6123116018582833, lr=0.000335395475937452
2023-11-21 10:37:07   INFO  epoch: 1/30, acc_iter=8137, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:21:39, time_cost(all): 2:12:36/2 days, 1:12:57, loss=0.588136309594244, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=0.7946303331882357, lr=0.000338241991802034
2023-11-21 10:37:57   INFO  epoch: 1/30, acc_iter=8187, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:23:49, time_cost(all): 2:13:26/2 days, 6:02:27, loss=0.588053195607935, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=4.096817918912217, lr=0.000341088507666616
2023-11-21 10:38:46   INFO  epoch: 1/30, acc_iter=8237, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:24:21, time_cost(all): 2:14:15/2 days, 4:35:41, loss=0.587970081621626, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=1.631228012321541, lr=0.000343935023531198
2023-11-21 10:39:35   INFO  epoch: 1/30, acc_iter=8287, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:16:03, time_cost(all): 2:15:04/2 days, 4:20:46, loss=0.587886967635317, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=4.63277422969119, lr=0.00034678153939578
2023-11-21 10:40:24   INFO  epoch: 1/30, acc_iter=8337, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:15:36, time_cost(all): 2:15:53/2 days, 4:42:42, loss=0.587803853649008, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=0.9698862595356126, lr=0.000349628055260361
2023-11-21 10:41:13   INFO  epoch: 1/30, acc_iter=8387, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:22:14, time_cost(all): 2:16:42/2 days, 5:15:27, loss=0.587720739662699, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=0.6256591038993067, lr=0.000352474571124943
2023-11-21 10:42:02   INFO  epoch: 1/30, acc_iter=8437, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:18:07, time_cost(all): 2:17:31/2 days, 1:41:46, loss=0.58763762567639, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=2.6452279561814893, lr=0.000355321086989525
2023-11-21 10:42:51   INFO  epoch: 1/30, acc_iter=8487, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:18:06, time_cost(all): 2:18:20/2 days, 4:10:03, loss=0.587554511690081, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=1.757555960834504, lr=0.000358167602854107
2023-11-21 10:43:40   INFO  epoch: 1/30, acc_iter=8537, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:19:15, time_cost(all): 2:19:09/2 days, 4:55:16, loss=0.587471397703772, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=4.104280182355241, lr=0.000361014118718688
2023-11-21 10:44:30   INFO  epoch: 1/30, acc_iter=8587, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:14:53, time_cost(all): 2:19:59/2 days, 4:18:35, loss=0.587388283717463, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=2.6911522346697643, lr=0.00036386063458327
2023-11-21 10:45:19   INFO  epoch: 1/30, acc_iter=8637, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:10:49, time_cost(all): 2:20:48/2 days, 1:52:19, loss=0.587305169731154, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=2.733230044013135, lr=0.000366707150447852
2023-11-21 10:46:08   INFO  epoch: 1/30, acc_iter=8687, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:15:17, time_cost(all): 2:21:37/2 days, 5:39:46, loss=0.587222055744845, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=3.904595733324524, lr=0.000369553666312434
2023-11-21 10:46:57   INFO  epoch: 1/30, acc_iter=8737, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:09:36, time_cost(all): 2:22:26/2 days, 3:37:05, loss=0.587138941758536, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=3.427679331268619, lr=0.000372400182177015
2023-11-21 10:47:46   INFO  epoch: 1/30, acc_iter=8787, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:09:07, time_cost(all): 2:23:15/2 days, 2:35:37, loss=0.587055827772227, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=4.265653415483948, lr=0.000375246698041597
2023-11-21 10:48:35   INFO  epoch: 1/30, acc_iter=8837, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:13:27, time_cost(all): 2:24:04/2 days, 2:26:24, loss=0.586972713785917, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=2.4780488478064084, lr=0.000378093213906179
2023-11-21 10:49:24   INFO  epoch: 1/30, acc_iter=8887, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:13:13, time_cost(all): 2:24:53/2 days, 4:17:04, loss=0.586889599799608, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=1.6501490869880078, lr=0.000380939729770761
2023-11-21 10:50:13   INFO  epoch: 1/30, acc_iter=8937, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:07:23, time_cost(all): 2:25:42/2 days, 5:01:26, loss=0.586806485813299, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=2.548100078228148, lr=0.000383786245635342
2023-11-21 10:51:02   INFO  epoch: 1/30, acc_iter=8987, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:06:13, time_cost(all): 2:26:31/2 days, 4:26:21, loss=0.58672337182699, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=2.341429189404887, lr=0.000386632761499924
2023-11-21 10:51:52   INFO  epoch: 1/30, acc_iter=9037, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:05:35, time_cost(all): 2:27:21/2 days, 0:58:53, loss=0.586640257840681, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=0.6116840044911394, lr=0.000389479277364506
2023-11-21 10:52:41   INFO  epoch: 1/30, acc_iter=9087, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:03:45, time_cost(all): 2:28:10/2 days, 1:05:32, loss=0.586557143854372, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=4.256263093865239, lr=0.000392325793229088
2023-11-21 10:53:30   INFO  epoch: 1/30, acc_iter=9137, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:07:25, time_cost(all): 2:28:59/2 days, 2:26:26, loss=0.586474029868063, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=1.9031416094999984, lr=0.000395172309093669
2023-11-21 10:54:19   INFO  epoch: 1/30, acc_iter=9187, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:03:25, time_cost(all): 2:29:48/2 days, 3:23:40, loss=0.586390915881754, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=4.305992548893039, lr=0.000398018824958251
2023-11-21 10:55:08   INFO  epoch: 1/30, acc_iter=9237, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:06:14, time_cost(all): 2:30:37/2 days, 0:54:09, loss=0.586307801895445, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=4.71418004687941, lr=0.000400865340822833
2023-11-21 10:55:57   INFO  epoch: 1/30, acc_iter=9287, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:04:20, time_cost(all): 2:31:26/2 days, 3:07:08, loss=0.586224687909136, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=3.3383994706377065, lr=0.000403711856687415
2023-11-21 10:56:46   INFO  epoch: 1/30, acc_iter=9337, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:00:30, time_cost(all): 2:32:15/2 days, 1:42:55, loss=0.586141573922827, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=3.4393272067646095, lr=0.000406558372551996
2023-11-21 10:57:35   INFO  epoch: 1/30, acc_iter=9387, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:04:17, time_cost(all): 2:33:04/2 days, 1:35:44, loss=0.586058459936518, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=1.4969869048441846, lr=0.000409404888416578
2023-11-21 10:58:25   INFO  epoch: 1/30, acc_iter=9437, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:00:01, time_cost(all): 2:33:54/2 days, 2:00:58, loss=0.585975345950209, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=0.8668781796037872, lr=0.00041225140428116
2023-11-21 10:59:14   INFO  epoch: 1/30, acc_iter=9487, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:02:01, time_cost(all): 2:34:43/2 days, 4:31:07, loss=0.5858922319639, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=4.7916017305306955, lr=0.000415097920145742
2023-11-21 11:00:03   INFO  epoch: 1/30, acc_iter=9537, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:59:03, time_cost(all): 2:35:32/2 days, 1:40:20, loss=0.585809117977591, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=4.522661428376905, lr=0.000417944436010323
2023-11-21 11:00:52   INFO  epoch: 1/30, acc_iter=9587, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:57:46, time_cost(all): 2:36:21/2 days, 2:46:17, loss=0.585726003991281, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=2.0309362017864907, lr=0.000420790951874905
2023-11-21 11:01:41   INFO  epoch: 1/30, acc_iter=9637, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:59:30, time_cost(all): 2:37:10/2 days, 5:43:49, loss=0.585642890004972, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=0.7169437508349354, lr=0.000423637467739487
2023-11-21 11:02:30   INFO  epoch: 1/30, acc_iter=9687, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:57:54, time_cost(all): 2:37:59/2 days, 4:21:03, loss=0.585559776018663, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=2.402471435249336, lr=0.000426483983604069
2023-11-21 11:03:19   INFO  epoch: 1/30, acc_iter=9737, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:54:01, time_cost(all): 2:38:48/2 days, 1:40:54, loss=0.585476662032354, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=1.2624483016304375, lr=0.00042933049946865
2023-11-21 11:04:08   INFO  epoch: 1/30, acc_iter=9787, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:55:56, time_cost(all): 2:39:37/2 days, 1:00:24, loss=0.585393548046045, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=3.3330070459396617, lr=0.000432177015333232
2023-11-21 11:04:57   INFO  epoch: 1/30, acc_iter=9837, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:56:30, time_cost(all): 2:40:26/2 days, 1:49:13, loss=0.585310434059736, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=1.9740938013861267, lr=0.000435023531197814
2023-11-21 11:05:47   INFO  epoch: 1/30, acc_iter=9887, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:55:02, time_cost(all): 2:41:16/2 days, 4:00:18, loss=0.585227320073427, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=1.6401845156366026, lr=0.000437870047062396
2023-11-21 11:06:36   INFO  epoch: 1/30, acc_iter=9937, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:54:14, time_cost(all): 2:42:05/2 days, 3:43:53, loss=0.585144206087118, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.2(1.03), norm=4.256757418716674, lr=0.000440716562926977
2023-11-21 11:07:25   INFO  epoch: 1/30, acc_iter=9987, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:52:28, time_cost(all): 2:42:54/2 days, 4:15:18, loss=0.585061092100809, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=4.183614707213233, lr=0.000443563078791559
2023-11-21 11:08:14   INFO  epoch: 1/30, acc_iter=10037, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:26, time_cost(all): 2:43:43/2 days, 4:37:07, loss=0.5849779781145, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=4.847676509074102, lr=0.000446409594656141
2023-11-21 11:09:03   INFO  epoch: 1/30, acc_iter=10087, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:51:32, time_cost(all): 2:44:32/2 days, 5:22:57, loss=0.584894864128191, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=3.578414163970544, lr=0.000449256110520723
2023-11-21 11:09:52   INFO  epoch: 1/30, acc_iter=10137, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:51:26, time_cost(all): 2:45:21/2 days, 0:57:12, loss=0.584811750141882, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=3.859580314907492, lr=0.000452102626385304
2023-11-21 11:10:41   INFO  epoch: 1/30, acc_iter=10187, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:46:46, time_cost(all): 2:46:10/2 days, 5:09:50, loss=0.584728636155573, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=2.0465494823570696, lr=0.000454949142249886
2023-11-21 11:11:30   INFO  epoch: 1/30, acc_iter=10237, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:44, time_cost(all): 2:46:59/2 days, 4:54:41, loss=0.584645522169264, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=3.689902877716685, lr=0.000457795658114468
2023-11-21 11:12:20   INFO  epoch: 1/30, acc_iter=10287, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:49:30, time_cost(all): 2:47:49/2 days, 4:47:49, loss=0.584562408182955, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=1.1325966091133621, lr=0.00046064217397905
2023-11-21 11:13:09   INFO  epoch: 1/30, acc_iter=10337, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:48:41, time_cost(all): 2:48:38/2 days, 1:44:55, loss=0.584479294196646, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=2.1114766241427634, lr=0.000463488689843631
2023-11-21 11:13:58   INFO  epoch: 1/30, acc_iter=10387, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:56, time_cost(all): 2:49:27/2 days, 4:45:31, loss=0.584396180210337, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=2.313736264084008, lr=0.000466335205708213
2023-11-21 11:14:47   INFO  epoch: 1/30, acc_iter=10437, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:46:08, time_cost(all): 2:50:16/2 days, 2:10:49, loss=0.584313066224027, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=0.8903183288943828, lr=0.000469181721572795
2023-11-21 11:15:36   INFO  epoch: 1/30, acc_iter=10487, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:27, time_cost(all): 2:51:05/2 days, 4:03:15, loss=0.584229952237718, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=3.7303706702488304, lr=0.000472028237437377
2023-11-21 11:16:25   INFO  epoch: 1/30, acc_iter=10537, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:43:33, time_cost(all): 2:51:54/2 days, 1:59:56, loss=0.584146838251409, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=3.0463932956683495, lr=0.000474874753301958
2023-11-21 11:17:14   INFO  epoch: 1/30, acc_iter=10587, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:42:41, time_cost(all): 2:52:43/2 days, 1:55:00, loss=0.5840637242651, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=3.428256376727553, lr=0.00047772126916654
2023-11-21 11:18:03   INFO  epoch: 1/30, acc_iter=10637, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:48, time_cost(all): 2:53:32/2 days, 4:44:33, loss=0.583980610278791, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=2.859590230607768, lr=0.000480567785031122
2023-11-21 11:18:52   INFO  epoch: 1/30, acc_iter=10687, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:40:39, time_cost(all): 2:54:21/2 days, 5:24:27, loss=0.583897496292482, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=0.8242308727811462, lr=0.000483414300895704
2023-11-21 11:19:42   INFO  epoch: 1/30, acc_iter=10737, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:00, time_cost(all): 2:55:11/2 days, 2:25:08, loss=0.583814382306173, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=4.5054585432475935, lr=0.000486260816760285
2023-11-21 11:20:31   INFO  epoch: 1/30, acc_iter=10787, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:39:31, time_cost(all): 2:56:00/2 days, 4:22:56, loss=0.583731268319864, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=1.7993523973891072, lr=0.000489107332624867
2023-11-21 11:21:20   INFO  epoch: 1/30, acc_iter=10837, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:21, time_cost(all): 2:56:49/2 days, 4:57:27, loss=0.583648154333555, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=1.3523483146736563, lr=0.000491953848489449
2023-11-21 11:22:09   INFO  epoch: 1/30, acc_iter=10887, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:38:15, time_cost(all): 2:57:38/2 days, 5:23:18, loss=0.583565040347246, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=1.1505110421961864, lr=0.000494800364354031
2023-11-21 11:22:58   INFO  epoch: 1/30, acc_iter=10937, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:51, time_cost(all): 2:58:27/2 days, 1:04:33, loss=0.583481926360937, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=3.3598429899376607, lr=0.000497646880218612
2023-11-21 11:23:47   INFO  epoch: 1/30, acc_iter=10987, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:35:06, time_cost(all): 2:59:16/2 days, 2:10:45, loss=0.583398812374628, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=4.766652808754483, lr=0.000500493396083194
2023-11-21 11:24:36   INFO  epoch: 1/30, acc_iter=11037, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:33:23, time_cost(all): 3:00:05/2 days, 1:32:33, loss=0.583315698388319, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=2.7849994158558693, lr=0.000503339911947776
2023-11-21 11:25:25   INFO  epoch: 1/30, acc_iter=11087, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:22, time_cost(all): 3:00:54/2 days, 3:11:29, loss=0.58323258440201, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=3.7320031394667503, lr=0.000506186427812358
2023-11-21 11:26:15   INFO  epoch: 1/30, acc_iter=11137, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:31:44, time_cost(all): 3:01:44/2 days, 3:47:30, loss=0.583149470415701, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.23(1.03), norm=2.0337631708877684, lr=0.000509032943676939
2023-11-21 11:27:04   INFO  epoch: 1/30, acc_iter=11187, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:52, time_cost(all): 3:02:33/2 days, 2:41:42, loss=0.583066356429392, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=2.687489568072523, lr=0.000511879459541521
2023-11-21 11:27:53   INFO  epoch: 1/30, acc_iter=11237, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:48, time_cost(all): 3:03:22/2 days, 4:07:40, loss=0.582983242443082, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=0.9778928277724764, lr=0.000514725975406103
2023-11-21 11:28:42   INFO  epoch: 1/30, acc_iter=11287, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:30:49, time_cost(all): 3:04:11/2 days, 2:30:24, loss=0.582900128456773, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.7015903463323525, lr=0.000517572491270685
2023-11-21 11:29:31   INFO  epoch: 1/30, acc_iter=11337, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:28:44, time_cost(all): 3:05:00/2 days, 2:22:11, loss=0.582817014470464, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.3127680885273503, lr=0.000520419007135266
2023-11-21 11:30:20   INFO  epoch: 1/30, acc_iter=11387, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:13, time_cost(all): 3:05:49/2 days, 0:28:40, loss=0.582733900484155, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=4.704393759331663, lr=0.000523265522999848
2023-11-21 11:31:09   INFO  epoch: 1/30, acc_iter=11437, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:16, time_cost(all): 3:06:38/2 days, 4:45:20, loss=0.582650786497846, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=2.8762783691124296, lr=0.00052611203886443
2023-11-21 11:31:58   INFO  epoch: 1/30, acc_iter=11487, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:09, time_cost(all): 3:07:27/2 days, 1:16:45, loss=0.582567672511537, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=1.5935751649100367, lr=0.000528958554729012
2023-11-21 11:32:47   INFO  epoch: 1/30, acc_iter=11537, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:08, time_cost(all): 3:08:16/2 days, 3:02:05, loss=0.582484558525228, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=3.405889953726079, lr=0.000531805070593593
2023-11-21 11:33:37   INFO  epoch: 1/30, acc_iter=11587, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:04, time_cost(all): 3:09:06/2 days, 2:35:18, loss=0.582401444538919, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=1.4572776663848601, lr=0.000534651586458175
2023-11-21 11:34:26   INFO  epoch: 1/30, acc_iter=11637, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:56, time_cost(all): 3:09:55/2 days, 0:30:44, loss=0.58231833055261, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=1.1014339251323848, lr=0.000537498102322757
2023-11-21 11:35:15   INFO  epoch: 1/30, acc_iter=11687, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:19, time_cost(all): 3:10:44/2 days, 3:45:55, loss=0.582235216566301, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=4.921627735212528, lr=0.000540344618187339
2023-11-21 11:36:04   INFO  epoch: 1/30, acc_iter=11737, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:24:12, time_cost(all): 3:11:33/2 days, 0:54:00, loss=0.582152102579992, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=4.956577596072835, lr=0.00054319113405192
2023-11-21 11:36:53   INFO  epoch: 1/30, acc_iter=11787, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:20, time_cost(all): 3:12:22/2 days, 1:08:06, loss=0.582068988593683, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=3.782459575791154, lr=0.000546037649916502
2023-11-21 11:37:42   INFO  epoch: 1/30, acc_iter=11837, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:53, time_cost(all): 3:13:11/2 days, 1:36:06, loss=0.581985874607374, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=3.4552162409829172, lr=0.000548884165781084
2023-11-21 11:38:31   INFO  epoch: 1/30, acc_iter=11887, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:23, time_cost(all): 3:14:00/2 days, 4:18:16, loss=0.581902760621065, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=1.0058165795154588, lr=0.000551730681645666
2023-11-21 11:39:20   INFO  epoch: 1/30, acc_iter=11937, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:58, time_cost(all): 3:14:49/2 days, 4:32:37, loss=0.581819646634756, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=4.2076921918148305, lr=0.000554577197510248
2023-11-21 11:40:10   INFO  epoch: 1/30, acc_iter=11987, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:48, time_cost(all): 3:15:39/2 days, 3:04:28, loss=0.581736532648446, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=2.5681742692600253, lr=0.000557423713374829
2023-11-21 11:40:59   INFO  epoch: 1/30, acc_iter=12037, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:15, time_cost(all): 3:16:28/2 days, 2:10:51, loss=0.581653418662137, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=4.875120963272436, lr=0.000560270229239411
2023-11-21 11:41:48   INFO  epoch: 1/30, acc_iter=12087, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:59, time_cost(all): 3:17:17/2 days, 0:08:08, loss=0.581570304675828, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=1.2526360603455808, lr=0.000563116745103993
2023-11-21 11:42:37   INFO  epoch: 1/30, acc_iter=12137, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:36, time_cost(all): 3:18:06/2 days, 3:07:09, loss=0.581487190689519, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.23(1.03), norm=4.830614065964153, lr=0.000565963260968574
2023-11-21 11:43:26   INFO  epoch: 1/30, acc_iter=12187, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:49, time_cost(all): 3:18:55/2 days, 1:57:55, loss=0.58140407670321, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.83(1.03), norm=1.1614857584947451, lr=0.000568809776833156
2023-11-21 11:44:15   INFO  epoch: 1/30, acc_iter=12237, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:01, time_cost(all): 3:19:44/2 days, 2:24:15, loss=0.581320962716901, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=0.7424782526596889, lr=0.000571656292697738
2023-11-21 11:45:04   INFO  epoch: 1/30, acc_iter=12287, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:57, time_cost(all): 3:20:33/2 days, 0:31:27, loss=0.581237848730592, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=3.2420333487068125, lr=0.00057450280856232
2023-11-21 11:45:53   INFO  epoch: 1/30, acc_iter=12337, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:39, time_cost(all): 3:21:22/2 days, 2:36:25, loss=0.581154734744283, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=1.646418148457217, lr=0.000577349324426901
2023-11-21 11:46:42   INFO  epoch: 1/30, acc_iter=12387, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:05, time_cost(all): 3:22:11/2 days, 2:06:38, loss=0.581071620757974, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=3.175578994122393, lr=0.000580195840291483
2023-11-21 11:47:32   INFO  epoch: 1/30, acc_iter=12437, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:58, time_cost(all): 3:23:01/2 days, 0:29:56, loss=0.580988506771665, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=0.5072325847040802, lr=0.000583042356156065
2023-11-21 11:48:21   INFO  epoch: 1/30, acc_iter=12487, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:39, time_cost(all): 3:23:50/2 days, 4:19:22, loss=0.580905392785356, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=2.711019757630423, lr=0.000585888872020647
2023-11-21 11:49:10   INFO  epoch: 1/30, acc_iter=12537, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:28, time_cost(all): 3:24:39/2 days, 3:48:37, loss=0.580822278799047, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=3.409928353130289, lr=0.000588735387885229
2023-11-21 11:49:59   INFO  epoch: 1/30, acc_iter=12587, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:37, time_cost(all): 3:25:28/2 days, 4:37:11, loss=0.580739164812738, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=2.1369259221265446, lr=0.00059158190374981
2023-11-21 11:50:48   INFO  epoch: 1/30, acc_iter=12637, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:35, time_cost(all): 3:26:17/2 days, 0:33:03, loss=0.580656050826429, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=3.376022538902766, lr=0.000594428419614392
2023-11-21 11:51:37   INFO  epoch: 1/30, acc_iter=12687, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:20, time_cost(all): 3:27:06/2 days, 2:18:31, loss=0.58057293684012, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=3.874277459567954, lr=0.000597274935478974
2023-11-21 11:52:26   INFO  epoch: 1/30, acc_iter=12737, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:06, time_cost(all): 3:27:55/2 days, 4:28:35, loss=0.580489822853811, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=2.6456419588364444, lr=0.000600121451343556
2023-11-21 11:53:15   INFO  epoch: 1/30, acc_iter=12787, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:23, time_cost(all): 3:28:44/2 days, 1:26:36, loss=0.580406708867501, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=4.422430351089851, lr=0.000602967967208137
2023-11-21 11:54:05   INFO  epoch: 1/30, acc_iter=12837, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:19, time_cost(all): 3:29:34/2 days, 1:21:04, loss=0.580323594881192, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=1.4256192022253265, lr=0.000605814483072719
2023-11-21 11:54:54   INFO  epoch: 1/30, acc_iter=12887, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:30, time_cost(all): 3:30:23/2 days, 4:12:08, loss=0.580240480894883, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=1.5483550480005759, lr=0.000608660998937301
2023-11-21 11:55:43   INFO  epoch: 1/30, acc_iter=12937, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:44, time_cost(all): 3:31:12/2 days, 1:14:56, loss=0.580157366908574, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=2.7597272912271107, lr=0.000611507514801883
2023-11-21 11:56:32   INFO  epoch: 1/30, acc_iter=12987, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:09, time_cost(all): 3:32:01/2 days, 1:58:04, loss=0.580074252922265, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.93(1.03), norm=2.1123298591259063, lr=0.000614354030666464
2023-11-21 11:57:21   INFO  epoch: 1/30, acc_iter=13037, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:10, time_cost(all): 3:32:50/2 days, 4:26:41, loss=0.579991138935956, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=3.0743840210614213, lr=0.000617200546531046
2023-11-21 11:58:10   INFO  epoch: 1/30, acc_iter=13087, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:25, time_cost(all): 3:33:39/2 days, 0:11:11, loss=0.579908024949647, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=4.002665694470222, lr=0.000620047062395628
2023-11-21 11:58:59   INFO  epoch: 1/30, acc_iter=13137, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 3:34:28/2 days, 3:30:49, loss=0.579824910963338, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.23(1.03), norm=2.7854717494857053, lr=0.00062289357826021
2023-11-21 11:59:48   INFO  epoch: 2/30, acc_iter=13224, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:46:01, time_cost(all): 3:35:17/2 days, 2:27:58, loss=0.57968029262716, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=2.0788726920339675, lr=0.000627846515864582
2023-11-21 12:00:37   INFO  epoch: 2/30, acc_iter=13274, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:49:18, time_cost(all): 3:36:06/2 days, 3:50:24, loss=0.579597178640851, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=3.791370807522943, lr=0.000630693031729163
2023-11-21 12:01:27   INFO  epoch: 2/30, acc_iter=13324, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:40:54, time_cost(all): 3:36:56/2 days, 4:32:30, loss=0.579514064654542, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=4.8438453568594975, lr=0.000633539547593745
2023-11-21 12:02:16   INFO  epoch: 2/30, acc_iter=13374, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:43:07, time_cost(all): 3:37:45/2 days, 4:47:31, loss=0.579430950668233, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=0.9633557621523887, lr=0.000636386063458327
2023-11-21 12:03:05   INFO  epoch: 2/30, acc_iter=13424, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:42:27, time_cost(all): 3:38:34/2 days, 4:15:42, loss=0.579347836681924, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=1.838767738869209, lr=0.000639232579322909
2023-11-21 12:03:54   INFO  epoch: 2/30, acc_iter=13474, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:43:17, time_cost(all): 3:39:23/2 days, 2:21:43, loss=0.579264722695615, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=4.972461130017192, lr=0.000642079095187491
2023-11-21 12:04:43   INFO  epoch: 2/30, acc_iter=13524, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:45:10, time_cost(all): 3:40:12/2 days, 4:10:16, loss=0.579181608709306, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=2.9531472076601872, lr=0.000644925611052072
2023-11-21 12:05:32   INFO  epoch: 2/30, acc_iter=13574, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:38:03, time_cost(all): 3:41:01/2 days, 3:59:13, loss=0.579098494722997, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=1.8642318580899004, lr=0.000647772126916654
2023-11-21 12:06:21   INFO  epoch: 2/30, acc_iter=13624, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:45:08, time_cost(all): 3:41:50/2 days, 3:55:43, loss=0.579015380736688, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=1.1245021369362234, lr=0.000650618642781236
2023-11-21 12:07:10   INFO  epoch: 2/30, acc_iter=13674, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:35:53, time_cost(all): 3:42:39/2 days, 1:44:11, loss=0.578932266750379, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.2822667108749277, lr=0.000653465158645818
2023-11-21 12:07:59   INFO  epoch: 2/30, acc_iter=13724, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:39:10, time_cost(all): 3:43:28/2 days, 2:54:33, loss=0.57884915276407, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=3.2028889047368603, lr=0.000656311674510399
2023-11-21 12:08:49   INFO  epoch: 2/30, acc_iter=13774, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:37:06, time_cost(all): 3:44:18/2 days, 1:54:30, loss=0.578766038777761, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=3.7501123548366224, lr=0.000659158190374981
2023-11-21 12:09:38   INFO  epoch: 2/30, acc_iter=13824, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:39:18, time_cost(all): 3:45:07/2 days, 1:55:34, loss=0.578682924791452, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=2.944531618182096, lr=0.000662004706239563
2023-11-21 12:10:27   INFO  epoch: 2/30, acc_iter=13874, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:35:05, time_cost(all): 3:45:56/2 days, 1:24:58, loss=0.578599810805142, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=1.627772422866568, lr=0.000664851222104144
2023-11-21 12:11:16   INFO  epoch: 2/30, acc_iter=13924, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:36:53, time_cost(all): 3:46:45/2 days, 4:04:39, loss=0.578516696818833, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=1.0023187862269018, lr=0.000667697737968726
2023-11-21 12:12:05   INFO  epoch: 2/30, acc_iter=13974, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:38:50, time_cost(all): 3:47:34/2 days, 1:42:07, loss=0.578433582832524, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=1.2048091095511009, lr=0.000670544253833308
2023-11-21 12:12:54   INFO  epoch: 2/30, acc_iter=14024, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:35:08, time_cost(all): 3:48:23/2 days, 0:30:09, loss=0.578350468846215, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=4.823778742349689, lr=0.00067339076969789
2023-11-21 12:13:43   INFO  epoch: 2/30, acc_iter=14074, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:33:34, time_cost(all): 3:49:12/2 days, 1:41:26, loss=0.578267354859906, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=1.0788627936230566, lr=0.000676237285562472
2023-11-21 12:14:32   INFO  epoch: 2/30, acc_iter=14124, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:36:26, time_cost(all): 3:50:01/2 days, 3:20:42, loss=0.578184240873597, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=3.82314478383279, lr=0.000679083801427053
2023-11-21 12:15:22   INFO  epoch: 2/30, acc_iter=14174, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:28:31, time_cost(all): 3:50:51/2 days, 3:42:33, loss=0.578101126887288, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=2.267440435845153, lr=0.000681930317291635
2023-11-21 12:16:11   INFO  epoch: 2/30, acc_iter=14224, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:29:37, time_cost(all): 3:51:40/2 days, 3:47:55, loss=0.578018012900979, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=4.936075737203135, lr=0.000684776833156217
2023-11-21 12:17:00   INFO  epoch: 2/30, acc_iter=14274, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:29:20, time_cost(all): 3:52:29/2 days, 1:03:11, loss=0.57793489891467, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=3.9265839796882744, lr=0.000687623349020799
2023-11-21 12:17:49   INFO  epoch: 2/30, acc_iter=14324, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:29:38, time_cost(all): 3:53:18/2 days, 1:27:30, loss=0.577851784928361, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=4.36025091890571, lr=0.00069046986488538
2023-11-21 12:18:38   INFO  epoch: 2/30, acc_iter=14374, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:26:01, time_cost(all): 3:54:07/2 days, 2:02:34, loss=0.577768670942052, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=3.142309598097644, lr=0.000693316380749962
2023-11-21 12:19:27   INFO  epoch: 2/30, acc_iter=14424, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:24:09, time_cost(all): 3:54:56/2 days, 3:43:28, loss=0.577685556955743, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=2.197931161737794, lr=0.000696162896614544
2023-11-21 12:20:16   INFO  epoch: 2/30, acc_iter=14474, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:29:25, time_cost(all): 3:55:45/2 days, 3:34:32, loss=0.577602442969434, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=2.5320653239876414, lr=0.000699009412479126
2023-11-21 12:21:05   INFO  epoch: 2/30, acc_iter=14524, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:27:02, time_cost(all): 3:56:34/2 days, 0:20:47, loss=0.577519328983125, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=0.9902967270430376, lr=0.000701855928343707
2023-11-21 12:21:54   INFO  epoch: 2/30, acc_iter=14574, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:27:42, time_cost(all): 3:57:23/2 days, 2:08:52, loss=0.577436214996816, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=1.5978999609648172, lr=0.000704702444208289
2023-11-21 12:22:44   INFO  epoch: 2/30, acc_iter=14624, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:20:12, time_cost(all): 3:58:13/2 days, 0:05:35, loss=0.577353101010507, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=2.8719791097964333, lr=0.000707548960072871
2023-11-21 12:23:33   INFO  epoch: 2/30, acc_iter=14674, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:24:49, time_cost(all): 3:59:02/2 days, 2:05:59, loss=0.577269987024198, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=1.711718088279177, lr=0.000710395475937453
2023-11-21 12:24:22   INFO  epoch: 2/30, acc_iter=14724, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:21:20, time_cost(all): 3:59:51/2 days, 1:56:36, loss=0.577186873037888, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=4.035848546738812, lr=0.000713241991802034
2023-11-21 12:25:11   INFO  epoch: 2/30, acc_iter=14774, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:24:09, time_cost(all): 4:00:40/2 days, 3:53:02, loss=0.577103759051579, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.9783203701028977, lr=0.000716088507666616
2023-11-21 12:26:00   INFO  epoch: 2/30, acc_iter=14824, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:17:11, time_cost(all): 4:01:29/2 days, 4:06:54, loss=0.57702064506527, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=0.965178406751839, lr=0.000718935023531198
2023-11-21 12:26:49   INFO  epoch: 2/30, acc_iter=14874, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:18:39, time_cost(all): 4:02:18/2 days, 1:26:49, loss=0.576937531078961, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=3.3453500020172044, lr=0.00072178153939578
2023-11-21 12:27:38   INFO  epoch: 2/30, acc_iter=14924, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:20:13, time_cost(all): 4:03:07/2 days, 3:47:23, loss=0.576854417092652, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=4.793886384859732, lr=0.000724628055260361
2023-11-21 12:28:27   INFO  epoch: 2/30, acc_iter=14974, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:21:48, time_cost(all): 4:03:56/2 days, 0:33:15, loss=0.576771303106343, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=3.0502977968490277, lr=0.000727474571124943
2023-11-21 12:29:17   INFO  epoch: 2/30, acc_iter=15024, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:18:04, time_cost(all): 4:04:46/2 days, 1:15:08, loss=0.576688189120034, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=4.663263691742482, lr=0.000730321086989525
2023-11-21 12:30:06   INFO  epoch: 2/30, acc_iter=15074, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:17:31, time_cost(all): 4:05:35/2 days, 4:06:06, loss=0.576605075133725, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=0.6131497964275594, lr=0.000733167602854107
2023-11-21 12:30:55   INFO  epoch: 2/30, acc_iter=15124, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:40, time_cost(all): 4:06:24/2 days, 0:52:14, loss=0.576521961147416, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=0.8303615528186776, lr=0.000736014118718688
2023-11-21 12:31:44   INFO  epoch: 2/30, acc_iter=15174, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:15:30, time_cost(all): 4:07:13/2 days, 0:08:14, loss=0.576438847161107, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=4.903713207433572, lr=0.00073886063458327
2023-11-21 12:32:33   INFO  epoch: 2/30, acc_iter=15224, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:13:03, time_cost(all): 4:08:02/2 days, 3:39:08, loss=0.576355733174798, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=1.2192138630814047, lr=0.000741707150447852
2023-11-21 12:33:22   INFO  epoch: 2/30, acc_iter=15274, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:09:49, time_cost(all): 4:08:51/1 day, 23:59:13, loss=0.576272619188489, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=4.100774087092812, lr=0.000744553666312434
2023-11-21 12:34:11   INFO  epoch: 2/30, acc_iter=15324, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:09:02, time_cost(all): 4:09:40/2 days, 1:18:14, loss=0.57618950520218, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=3.041621543213058, lr=0.000747400182177015
2023-11-21 12:35:00   INFO  epoch: 2/30, acc_iter=15374, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:11:40, time_cost(all): 4:10:29/2 days, 1:12:33, loss=0.576106391215871, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=2.838683756844948, lr=0.000750246698041597
2023-11-21 12:35:49   INFO  epoch: 2/30, acc_iter=15424, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:11:27, time_cost(all): 4:11:18/1 day, 23:39:59, loss=0.576023277229561, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=2.803382967925121, lr=0.000753093213906179
2023-11-21 12:36:39   INFO  epoch: 2/30, acc_iter=15474, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:07:17, time_cost(all): 4:12:08/1 day, 23:55:58, loss=0.575940163243252, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=4.4510781945461755, lr=0.000755939729770761
2023-11-21 12:37:28   INFO  epoch: 2/30, acc_iter=15524, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:10:33, time_cost(all): 4:12:57/1 day, 23:25:38, loss=0.575857049256943, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=2.2905356009580355, lr=0.000758786245635342
2023-11-21 12:38:17   INFO  epoch: 2/30, acc_iter=15574, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:05:16, time_cost(all): 4:13:46/1 day, 23:46:28, loss=0.575773935270634, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=2.9062755217748983, lr=0.000761632761499924
2023-11-21 12:39:06   INFO  epoch: 2/30, acc_iter=15624, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:14, time_cost(all): 4:14:35/2 days, 0:16:28, loss=0.575690821284325, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.7239323971567393, lr=0.000764479277364506
2023-11-21 12:39:55   INFO  epoch: 2/30, acc_iter=15674, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:05:23, time_cost(all): 4:15:24/2 days, 1:10:02, loss=0.575607707298016, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=4.068471210389522, lr=0.000767325793229088
2023-11-21 12:40:44   INFO  epoch: 2/30, acc_iter=15724, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:04:23, time_cost(all): 4:16:13/2 days, 4:04:42, loss=0.575524593311707, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=0.5345726598962963, lr=0.000770172309093669
2023-11-21 12:41:33   INFO  epoch: 2/30, acc_iter=15774, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:07:05, time_cost(all): 4:17:02/2 days, 0:21:32, loss=0.575441479325398, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=2.025979127605312, lr=0.000773018824958251
2023-11-21 12:42:22   INFO  epoch: 2/30, acc_iter=15824, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:24, time_cost(all): 4:17:51/2 days, 0:30:03, loss=0.575358365339089, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=2.6816355895478736, lr=0.000775865340822833
2023-11-21 12:43:12   INFO  epoch: 2/30, acc_iter=15874, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:05:07, time_cost(all): 4:18:41/2 days, 1:09:40, loss=0.57527525135278, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=3.1550430382723826, lr=0.000778711856687415
2023-11-21 12:44:01   INFO  epoch: 2/30, acc_iter=15924, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:15, time_cost(all): 4:19:30/2 days, 2:05:11, loss=0.575192137366471, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=2.6613244068611945, lr=0.000781558372551996
2023-11-21 12:44:50   INFO  epoch: 2/30, acc_iter=15974, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:02:31, time_cost(all): 4:20:19/2 days, 1:14:57, loss=0.575109023380162, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=4.4582629831656835, lr=0.000784404888416578
2023-11-21 12:45:39   INFO  epoch: 2/30, acc_iter=16024, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:01:56, time_cost(all): 4:21:08/2 days, 1:10:11, loss=0.575025909393853, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=1.7333672809401972, lr=0.00078725140428116
2023-11-21 12:46:28   INFO  epoch: 2/30, acc_iter=16074, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:59:29, time_cost(all): 4:21:57/2 days, 2:07:55, loss=0.574942795407544, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=0.9478540096521442, lr=0.000790097920145742
2023-11-21 12:47:17   INFO  epoch: 2/30, acc_iter=16124, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:57:09, time_cost(all): 4:22:46/1 day, 23:51:34, loss=0.574859681421235, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.287820616200817, lr=0.000792944436010323
2023-11-21 12:48:06   INFO  epoch: 2/30, acc_iter=16174, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:56:42, time_cost(all): 4:23:35/2 days, 0:20:48, loss=0.574776567434926, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=1.3014928854480938, lr=0.000795790951874905
2023-11-21 12:48:55   INFO  epoch: 2/30, acc_iter=16224, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:58:29, time_cost(all): 4:24:24/2 days, 3:50:41, loss=0.574693453448617, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.823612507529703, lr=0.000798637467739487
2023-11-21 12:49:44   INFO  epoch: 2/30, acc_iter=16274, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:01, time_cost(all): 4:25:13/1 day, 23:54:38, loss=0.574610339462307, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=4.0970845565666405, lr=0.000801483983604069
2023-11-21 12:50:34   INFO  epoch: 2/30, acc_iter=16324, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:54, time_cost(all): 4:26:03/2 days, 2:31:38, loss=0.574527225475998, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=2.336815777012963, lr=0.00080433049946865
2023-11-21 12:51:23   INFO  epoch: 2/30, acc_iter=16374, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:57:02, time_cost(all): 4:26:52/2 days, 3:06:12, loss=0.574444111489689, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=1.2111331511608747, lr=0.000807177015333232
2023-11-21 12:52:12   INFO  epoch: 2/30, acc_iter=16424, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:53:29, time_cost(all): 4:27:41/2 days, 1:13:11, loss=0.57436099750338, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=2.643969676139113, lr=0.000810023531197814
2023-11-21 12:53:01   INFO  epoch: 2/30, acc_iter=16474, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:51:59, time_cost(all): 4:28:30/1 day, 22:59:20, loss=0.574277883517071, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=4.896461384166939, lr=0.000812870047062396
2023-11-21 12:53:50   INFO  epoch: 2/30, acc_iter=16524, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:51:18, time_cost(all): 4:29:19/1 day, 22:59:55, loss=0.574194769530762, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=3.5383271120843784, lr=0.000815716562926977
2023-11-21 12:54:39   INFO  epoch: 2/30, acc_iter=16574, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:52:59, time_cost(all): 4:30:08/2 days, 0:36:22, loss=0.574111655544453, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=1.1488497340226496, lr=0.000818563078791559
2023-11-21 12:55:28   INFO  epoch: 2/30, acc_iter=16624, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:11, time_cost(all): 4:30:57/2 days, 3:15:10, loss=0.574028541558144, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=2.1528382808213045, lr=0.000821409594656141
2023-11-21 12:56:17   INFO  epoch: 2/30, acc_iter=16674, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:53:03, time_cost(all): 4:31:46/2 days, 0:03:05, loss=0.573945427571835, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=2.493604918216892, lr=0.000824256110520723
2023-11-21 12:57:07   INFO  epoch: 2/30, acc_iter=16724, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:42, time_cost(all): 4:32:36/2 days, 1:32:31, loss=0.573862313585526, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=1.3844522106616877, lr=0.000827102626385304
2023-11-21 12:57:56   INFO  epoch: 2/30, acc_iter=16774, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:48:34, time_cost(all): 4:33:25/2 days, 1:55:33, loss=0.573779199599217, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=4.727278362114822, lr=0.000829949142249886
2023-11-21 12:58:45   INFO  epoch: 2/30, acc_iter=16824, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:56, time_cost(all): 4:34:14/2 days, 1:05:21, loss=0.573696085612908, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=4.882717305624195, lr=0.000832795658114468
2023-11-21 12:59:34   INFO  epoch: 2/30, acc_iter=16874, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:45:36, time_cost(all): 4:35:03/1 day, 23:49:16, loss=0.573612971626599, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=1.2280323806220863, lr=0.00083564217397905
2023-11-21 13:00:23   INFO  epoch: 2/30, acc_iter=16924, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:48:11, time_cost(all): 4:35:52/2 days, 3:25:47, loss=0.57352985764029, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=2.038915071347551, lr=0.000838488689843631
2023-11-21 13:01:12   INFO  epoch: 2/30, acc_iter=16974, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:34, time_cost(all): 4:36:41/2 days, 1:18:42, loss=0.573446743653981, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=0.536277926739603, lr=0.000841335205708213
2023-11-21 13:02:01   INFO  epoch: 2/30, acc_iter=17024, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:46:31, time_cost(all): 4:37:30/1 day, 23:53:03, loss=0.573363629667672, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=4.0406077454336415, lr=0.000844181721572795
2023-11-21 13:02:50   INFO  epoch: 2/30, acc_iter=17074, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:00, time_cost(all): 4:38:19/2 days, 0:22:04, loss=0.573280515681362, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=2.7181741098644427, lr=0.000847028237437377
2023-11-21 13:03:39   INFO  epoch: 2/30, acc_iter=17124, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:44:13, time_cost(all): 4:39:08/2 days, 3:29:04, loss=0.573197401695053, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=0.5714883016911136, lr=0.000849874753301958
2023-11-21 13:04:29   INFO  epoch: 2/30, acc_iter=17174, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:40:15, time_cost(all): 4:39:58/1 day, 22:59:08, loss=0.573114287708744, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=1.139103351100772, lr=0.00085272126916654
2023-11-21 13:05:18   INFO  epoch: 2/30, acc_iter=17224, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:16, time_cost(all): 4:40:47/2 days, 2:29:14, loss=0.573031173722435, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=1.5065820896368731, lr=0.000855567785031122
2023-11-21 13:06:07   INFO  epoch: 2/30, acc_iter=17274, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:38:47, time_cost(all): 4:41:36/2 days, 1:33:02, loss=0.572948059736126, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=2.0588802903428065, lr=0.000858414300895704
2023-11-21 13:06:56   INFO  epoch: 2/30, acc_iter=17324, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:28, time_cost(all): 4:42:25/2 days, 3:01:01, loss=0.572864945749817, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.293468704987082, lr=0.000861260816760285
2023-11-21 13:07:45   INFO  epoch: 2/30, acc_iter=17374, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:23, time_cost(all): 4:43:14/2 days, 3:03:38, loss=0.572781831763508, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=2.0519874923166954, lr=0.000864107332624867
2023-11-21 13:08:34   INFO  epoch: 2/30, acc_iter=17424, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:11, time_cost(all): 4:44:03/2 days, 1:00:55, loss=0.572698717777199, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=2.7929155749291517, lr=0.000866953848489449
2023-11-21 13:09:23   INFO  epoch: 2/30, acc_iter=17474, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:41, time_cost(all): 4:44:52/2 days, 2:07:54, loss=0.57261560379089, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=3.393985870928815, lr=0.000869800364354031
2023-11-21 13:10:12   INFO  epoch: 2/30, acc_iter=17524, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:19, time_cost(all): 4:45:41/1 day, 23:47:26, loss=0.572532489804581, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=2.1585330550240656, lr=0.000872646880218612
2023-11-21 13:11:02   INFO  epoch: 2/30, acc_iter=17574, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:26, time_cost(all): 4:46:31/1 day, 22:43:27, loss=0.572449375818272, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=3.984940842790314, lr=0.000875493396083194
2023-11-21 13:11:51   INFO  epoch: 2/30, acc_iter=17624, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:36:36, time_cost(all): 4:47:20/2 days, 1:40:42, loss=0.572366261831963, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=4.292152037711446, lr=0.000878339911947776
2023-11-21 13:12:40   INFO  epoch: 2/30, acc_iter=17674, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:35:11, time_cost(all): 4:48:09/2 days, 2:34:03, loss=0.572283147845654, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=4.56102484423021, lr=0.000881186427812358
2023-11-21 13:13:29   INFO  epoch: 2/30, acc_iter=17724, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:32:58, time_cost(all): 4:48:58/1 day, 22:47:30, loss=0.572200033859345, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=4.975339311940127, lr=0.00088403294367694
2023-11-21 13:14:18   INFO  epoch: 2/30, acc_iter=17774, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:25, time_cost(all): 4:49:47/2 days, 2:10:36, loss=0.572116919873036, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=3.0464889120542953, lr=0.000886879459541521
2023-11-21 13:15:07   INFO  epoch: 2/30, acc_iter=17824, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:35, time_cost(all): 4:50:36/2 days, 0:26:33, loss=0.572033805886726, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=3.939475701368225, lr=0.000889725975406103
2023-11-21 13:15:56   INFO  epoch: 2/30, acc_iter=17874, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:30:36, time_cost(all): 4:51:25/2 days, 1:44:09, loss=0.571950691900417, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=2.851631604920326, lr=0.000892572491270685
2023-11-21 13:16:45   INFO  epoch: 2/30, acc_iter=17924, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:31:27, time_cost(all): 4:52:14/2 days, 0:50:37, loss=0.571867577914108, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=1.2580346350429654, lr=0.000895419007135266
2023-11-21 13:17:34   INFO  epoch: 2/30, acc_iter=17974, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:27:49, time_cost(all): 4:53:03/1 day, 23:42:40, loss=0.571784463927799, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=2.1126604753330627, lr=0.000898265522999848
2023-11-21 13:18:24   INFO  epoch: 2/30, acc_iter=18024, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:27:13, time_cost(all): 4:53:53/1 day, 23:40:05, loss=0.57170134994149, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=0.8855033066224534, lr=0.00090111203886443
2023-11-21 13:19:13   INFO  epoch: 2/30, acc_iter=18074, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:26:51, time_cost(all): 4:54:42/2 days, 2:31:57, loss=0.571618235955181, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=1.2883681443008306, lr=0.000903958554729012
2023-11-21 13:20:02   INFO  epoch: 2/30, acc_iter=18124, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:11, time_cost(all): 4:55:31/1 day, 23:58:16, loss=0.571535121968872, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=1.2822368838060723, lr=0.000906805070593593
2023-11-21 13:20:51   INFO  epoch: 2/30, acc_iter=18174, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:57, time_cost(all): 4:56:20/2 days, 1:49:44, loss=0.571452007982563, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=4.031153854438434, lr=0.000909651586458175
2023-11-21 13:21:40   INFO  epoch: 2/30, acc_iter=18224, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:23:58, time_cost(all): 4:57:09/2 days, 1:01:41, loss=0.571368893996254, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=2.069975715746686, lr=0.000912498102322757
2023-11-21 13:22:29   INFO  epoch: 2/30, acc_iter=18274, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:59, time_cost(all): 4:57:58/1 day, 22:42:06, loss=0.571285780009945, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=1.6705891765226935, lr=0.000915344618187339
2023-11-21 13:23:18   INFO  epoch: 2/30, acc_iter=18324, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:34, time_cost(all): 4:58:47/2 days, 2:14:17, loss=0.571202666023636, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=4.433506323335836, lr=0.000918191134051921
2023-11-21 13:24:07   INFO  epoch: 2/30, acc_iter=18374, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:09, time_cost(all): 4:59:36/2 days, 0:13:28, loss=0.571119552037327, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=4.109051160229805, lr=0.000921037649916502
2023-11-21 13:24:57   INFO  epoch: 2/30, acc_iter=18424, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:59, time_cost(all): 5:00:26/2 days, 2:24:30, loss=0.571036438051018, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=1.927737695528813, lr=0.000923884165781084
2023-11-21 13:25:46   INFO  epoch: 2/30, acc_iter=18474, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:03, time_cost(all): 5:01:15/2 days, 1:45:06, loss=0.570953324064709, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=4.37843783970068, lr=0.000926730681645666
2023-11-21 13:26:35   INFO  epoch: 2/30, acc_iter=18524, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:10, time_cost(all): 5:02:04/1 day, 23:46:00, loss=0.5708702100784, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=4.718334862754434, lr=0.000929577197510247
2023-11-21 13:27:24   INFO  epoch: 2/30, acc_iter=18574, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:37, time_cost(all): 5:02:53/1 day, 22:33:09, loss=0.570787096092091, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=4.649096834957772, lr=0.000932423713374829
2023-11-21 13:28:13   INFO  epoch: 2/30, acc_iter=18624, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:02, time_cost(all): 5:03:42/2 days, 2:13:14, loss=0.570703982105782, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=4.275367640261624, lr=0.000935270229239411
2023-11-21 13:29:02   INFO  epoch: 2/30, acc_iter=18674, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:14, time_cost(all): 5:04:31/1 day, 22:28:19, loss=0.570620868119472, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=1.3388192212421999, lr=0.000938116745103993
2023-11-21 13:29:51   INFO  epoch: 2/30, acc_iter=18724, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:30, time_cost(all): 5:05:20/2 days, 1:56:40, loss=0.570537754133163, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=2.942836245409777, lr=0.000940963260968575
2023-11-21 13:30:40   INFO  epoch: 2/30, acc_iter=18774, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:53, time_cost(all): 5:06:09/1 day, 23:58:51, loss=0.570454640146854, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=1.0830168648822196, lr=0.000943809776833156
2023-11-21 13:31:29   INFO  epoch: 2/30, acc_iter=18824, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:14:48, time_cost(all): 5:06:58/2 days, 1:21:43, loss=0.570371526160545, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=0.7259779432392914, lr=0.000946656292697738
2023-11-21 13:32:19   INFO  epoch: 2/30, acc_iter=18874, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:18, time_cost(all): 5:07:48/1 day, 22:47:57, loss=0.570288412174236, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=3.1181208370310767, lr=0.00094950280856232
2023-11-21 13:33:08   INFO  epoch: 2/30, acc_iter=18924, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:14:14, time_cost(all): 5:08:37/2 days, 1:23:52, loss=0.570205298187927, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=0.6602407530777218, lr=0.000952349324426901
2023-11-21 13:33:57   INFO  epoch: 2/30, acc_iter=18974, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:25, time_cost(all): 5:09:26/1 day, 23:43:21, loss=0.570122184201618, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=4.994488942856112, lr=0.000955195840291483
2023-11-21 13:34:46   INFO  epoch: 2/30, acc_iter=19024, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:28, time_cost(all): 5:10:15/2 days, 1:14:19, loss=0.570039070215309, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=4.640283179794989, lr=0.000958042356156065
2023-11-21 13:35:35   INFO  epoch: 2/30, acc_iter=19074, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:11, time_cost(all): 5:11:04/2 days, 0:43:01, loss=0.569955956229, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=3.3088739748085487, lr=0.000960888872020647
2023-11-21 13:36:24   INFO  epoch: 2/30, acc_iter=19124, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:20, time_cost(all): 5:11:53/2 days, 0:55:37, loss=0.569872842242691, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=2.3999637725663785, lr=0.000963735387885228
2023-11-21 13:37:13   INFO  epoch: 2/30, acc_iter=19174, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:58, time_cost(all): 5:12:42/1 day, 23:16:34, loss=0.569789728256382, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=1.2010763122151247, lr=0.00096658190374981
2023-11-21 13:38:02   INFO  epoch: 2/30, acc_iter=19224, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:57, time_cost(all): 5:13:31/2 days, 2:34:47, loss=0.569706614270073, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=4.914388443867722, lr=0.000969428419614392
2023-11-21 13:38:52   INFO  epoch: 2/30, acc_iter=19274, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:44, time_cost(all): 5:14:21/2 days, 1:21:13, loss=0.569623500283764, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=2.044911672075991, lr=0.000972274935478974
2023-11-21 13:39:41   INFO  epoch: 2/30, acc_iter=19324, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:14, time_cost(all): 5:15:10/1 day, 22:14:16, loss=0.569540386297455, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=0.6225450190459179, lr=0.000975121451343556
2023-11-21 13:40:30   INFO  epoch: 2/30, acc_iter=19374, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:19, time_cost(all): 5:15:59/2 days, 2:20:20, loss=0.569457272311146, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=0.7262589366065462, lr=0.000977967967208137
2023-11-21 13:41:19   INFO  epoch: 2/30, acc_iter=19424, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:24, time_cost(all): 5:16:48/2 days, 1:52:24, loss=0.569374158324837, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=4.08560214262795, lr=0.000980814483072719
2023-11-21 13:42:08   INFO  epoch: 2/30, acc_iter=19474, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:28, time_cost(all): 5:17:37/2 days, 0:37:00, loss=0.569291044338527, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=2.5219412850738085, lr=0.000983660998937301
2023-11-21 13:42:57   INFO  epoch: 2/30, acc_iter=19524, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:52, time_cost(all): 5:18:26/1 day, 22:36:03, loss=0.569207930352218, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.967155080263771, lr=0.000986507514801883
2023-11-21 13:43:46   INFO  epoch: 2/30, acc_iter=19574, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:11, time_cost(all): 5:19:15/2 days, 2:57:31, loss=0.569124816365909, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=2.102395527979803, lr=0.000989354030666464
2023-11-21 13:44:35   INFO  epoch: 2/30, acc_iter=19624, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:11, time_cost(all): 5:20:04/1 day, 23:49:47, loss=0.5690417023796, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=2.1340505793574414, lr=0.000992200546531046
2023-11-21 13:45:24   INFO  epoch: 2/30, acc_iter=19674, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:29, time_cost(all): 5:20:53/2 days, 2:16:21, loss=0.568958588393291, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=4.286116025080834, lr=0.000995047062395628
2023-11-21 13:46:14   INFO  epoch: 2/30, acc_iter=19724, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 5:21:43/1 day, 22:22:33, loss=0.568875474406982, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=4.438611494233999, lr=0.000997893578260209
2023-11-21 13:47:03   INFO  epoch: 3/30, acc_iter=19811, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:46:40, time_cost(all): 5:22:32/1 day, 22:50:49, loss=0.568730856070804, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=1.0462100561073324, lr=0.000999679265818075
2023-11-21 13:47:52   INFO  epoch: 3/30, acc_iter=19861, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:49:15, time_cost(all): 5:23:21/1 day, 22:45:33, loss=0.568647742084495, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=3.049469388086228, lr=0.000999358531636151
2023-11-21 13:48:41   INFO  epoch: 3/30, acc_iter=19911, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:43:19, time_cost(all): 5:24:10/2 days, 2:05:37, loss=0.568564628098186, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=3.2157061584634374, lr=0.000999037797454226
2023-11-21 13:49:30   INFO  epoch: 3/30, acc_iter=19961, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:44:02, time_cost(all): 5:24:59/1 day, 22:58:37, loss=0.568481514111877, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=2.761138821023268, lr=0.000998717063272301
2023-11-21 13:50:19   INFO  epoch: 3/30, acc_iter=20011, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:47:19, time_cost(all): 5:25:48/2 days, 0:34:15, loss=0.568398400125568, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=4.756166309274823, lr=0.000998396329090376
2023-11-21 13:51:08   INFO  epoch: 3/30, acc_iter=20061, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:40:12, time_cost(all): 5:26:37/1 day, 22:59:23, loss=0.568315286139259, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=2.0078637093476734, lr=0.000998075594908452
2023-11-21 13:51:57   INFO  epoch: 3/30, acc_iter=20111, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:37:02, time_cost(all): 5:27:26/2 days, 2:12:23, loss=0.56823217215295, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=3.369400078274181, lr=0.000997754860726527
2023-11-21 13:52:46   INFO  epoch: 3/30, acc_iter=20161, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:42:04, time_cost(all): 5:28:15/2 days, 1:24:28, loss=0.568149058166641, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=4.854624212838905, lr=0.000997434126544602
2023-11-21 13:53:36   INFO  epoch: 3/30, acc_iter=20211, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:39:21, time_cost(all): 5:29:05/2 days, 2:30:03, loss=0.568065944180332, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=0.9797233000995949, lr=0.000997113392362678
2023-11-21 13:54:25   INFO  epoch: 3/30, acc_iter=20261, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:41:06, time_cost(all): 5:29:54/2 days, 2:32:56, loss=0.567982830194023, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=4.031728769586946, lr=0.000996792658180753
2023-11-21 13:55:14   INFO  epoch: 3/30, acc_iter=20311, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:34:12, time_cost(all): 5:30:43/1 day, 22:57:57, loss=0.567899716207714, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=4.837957470810138, lr=0.000996471923998828
2023-11-21 13:56:03   INFO  epoch: 3/30, acc_iter=20361, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:41:50, time_cost(all): 5:31:32/2 days, 2:18:23, loss=0.567816602221405, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=3.078375353601639, lr=0.000996151189816904
2023-11-21 13:56:52   INFO  epoch: 3/30, acc_iter=20411, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:40:23, time_cost(all): 5:32:21/2 days, 2:17:03, loss=0.567733488235096, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=3.8028074952807773, lr=0.000995830455634979
2023-11-21 13:57:41   INFO  epoch: 3/30, acc_iter=20461, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:40:10, time_cost(all): 5:33:10/1 day, 23:38:21, loss=0.567650374248787, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=4.007964052313442, lr=0.000995509721453054
2023-11-21 13:58:30   INFO  epoch: 3/30, acc_iter=20511, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:39:32, time_cost(all): 5:33:59/2 days, 2:03:00, loss=0.567567260262478, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=2.542612140247417, lr=0.000995188987271129
2023-11-21 13:59:19   INFO  epoch: 3/30, acc_iter=20561, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:35:36, time_cost(all): 5:34:48/2 days, 0:49:12, loss=0.567484146276168, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=4.9414545753158485, lr=0.000994868253089205
2023-11-21 14:00:09   INFO  epoch: 3/30, acc_iter=20611, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:35:38, time_cost(all): 5:35:38/2 days, 2:32:08, loss=0.567401032289859, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=1.6072490183119104, lr=0.00099454751890728
2023-11-21 14:00:58   INFO  epoch: 3/30, acc_iter=20661, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:34:08, time_cost(all): 5:36:27/1 day, 23:01:12, loss=0.56731791830355, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=2.4024795169326785, lr=0.000994226784725355
2023-11-21 14:01:47   INFO  epoch: 3/30, acc_iter=20711, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:29:38, time_cost(all): 5:37:16/2 days, 0:43:49, loss=0.567234804317241, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=1.422027235211173, lr=0.000993906050543431
2023-11-21 14:02:36   INFO  epoch: 3/30, acc_iter=20761, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:32:09, time_cost(all): 5:38:05/2 days, 0:23:16, loss=0.567151690330932, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=4.577702452656021, lr=0.000993585316361506
2023-11-21 14:03:25   INFO  epoch: 3/30, acc_iter=20811, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:32:05, time_cost(all): 5:38:54/2 days, 1:02:43, loss=0.567068576344623, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=3.567012392538683, lr=0.000993264582179581
2023-11-21 14:04:14   INFO  epoch: 3/30, acc_iter=20861, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:30:25, time_cost(all): 5:39:43/1 day, 21:52:59, loss=0.566985462358314, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=4.27706680444712, lr=0.000992943847997657
2023-11-21 14:05:03   INFO  epoch: 3/30, acc_iter=20911, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:25:46, time_cost(all): 5:40:32/2 days, 1:56:05, loss=0.566902348372005, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=3.765208135308793, lr=0.000992623113815732
2023-11-21 14:05:52   INFO  epoch: 3/30, acc_iter=20961, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:24:14, time_cost(all): 5:41:21/2 days, 2:13:04, loss=0.566819234385696, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=2.7132591659061767, lr=0.000992302379633807
2023-11-21 14:06:41   INFO  epoch: 3/30, acc_iter=21011, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:24:33, time_cost(all): 5:42:10/2 days, 1:12:00, loss=0.566736120399387, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=3.127356230701997, lr=0.000991981645451882
2023-11-21 14:07:31   INFO  epoch: 3/30, acc_iter=21061, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:26:47, time_cost(all): 5:43:00/2 days, 0:43:44, loss=0.566653006413078, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=1.0110686320670514, lr=0.000991660911269958
2023-11-21 14:08:20   INFO  epoch: 3/30, acc_iter=21111, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:29:40, time_cost(all): 5:43:49/2 days, 1:08:51, loss=0.566569892426769, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=4.732563365684266, lr=0.000991340177088033
2023-11-21 14:09:09   INFO  epoch: 3/30, acc_iter=21161, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:26:40, time_cost(all): 5:44:38/2 days, 0:01:45, loss=0.56648677844046, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=1.7729356410670258, lr=0.000991019442906108
2023-11-21 14:09:58   INFO  epoch: 3/30, acc_iter=21211, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:23:24, time_cost(all): 5:45:27/2 days, 1:51:55, loss=0.566403664454151, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=4.447104197682693, lr=0.000990698708724184
2023-11-21 14:10:47   INFO  epoch: 3/30, acc_iter=21261, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:25:41, time_cost(all): 5:46:16/1 day, 22:20:42, loss=0.566320550467841, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.995357602300282, lr=0.000990377974542259
2023-11-21 14:11:36   INFO  epoch: 3/30, acc_iter=21311, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:19:07, time_cost(all): 5:47:05/1 day, 21:53:59, loss=0.566237436481532, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=2.0541871637873377, lr=0.000990057240360334
2023-11-21 14:12:25   INFO  epoch: 3/30, acc_iter=21361, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:24:41, time_cost(all): 5:47:54/2 days, 2:10:35, loss=0.566154322495223, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=0.8673308945677547, lr=0.000989736506178409
2023-11-21 14:13:14   INFO  epoch: 3/30, acc_iter=21411, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:19:04, time_cost(all): 5:48:43/2 days, 1:11:10, loss=0.566071208508914, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=3.278754451581186, lr=0.000989415771996485
2023-11-21 14:14:04   INFO  epoch: 3/30, acc_iter=21461, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:21:16, time_cost(all): 5:49:33/1 day, 23:53:14, loss=0.565988094522605, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=2.6193722562374337, lr=0.00098909503781456
2023-11-21 14:14:53   INFO  epoch: 3/30, acc_iter=21511, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:19:16, time_cost(all): 5:50:22/1 day, 23:39:19, loss=0.565904980536296, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=4.129790788344188, lr=0.000988774303632635
2023-11-21 14:15:42   INFO  epoch: 3/30, acc_iter=21561, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:21:38, time_cost(all): 5:51:11/2 days, 0:19:32, loss=0.565821866549987, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=2.0327012104458038, lr=0.000988453569450711
2023-11-21 14:16:31   INFO  epoch: 3/30, acc_iter=21611, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:15:20, time_cost(all): 5:52:00/2 days, 0:47:57, loss=0.565738752563678, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=4.620127809980079, lr=0.000988132835268786
2023-11-21 14:17:20   INFO  epoch: 3/30, acc_iter=21661, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:19:35, time_cost(all): 5:52:49/1 day, 21:47:33, loss=0.565655638577369, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=1.4621930690116165, lr=0.000987812101086861
2023-11-21 14:18:09   INFO  epoch: 3/30, acc_iter=21711, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:14:47, time_cost(all): 5:53:38/2 days, 1:22:09, loss=0.56557252459106, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=4.287338174652072, lr=0.000987491366904937
2023-11-21 14:18:58   INFO  epoch: 3/30, acc_iter=21761, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:11:24, time_cost(all): 5:54:27/1 day, 21:59:36, loss=0.565489410604751, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=2.1541033905329363, lr=0.000987170632723012
2023-11-21 14:19:47   INFO  epoch: 3/30, acc_iter=21811, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:11:24, time_cost(all): 5:55:16/2 days, 1:55:04, loss=0.565406296618442, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=2.358961845253458, lr=0.000986849898541087
2023-11-21 14:20:36   INFO  epoch: 3/30, acc_iter=21861, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:15:37, time_cost(all): 5:56:05/1 day, 22:59:03, loss=0.565323182632133, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=3.3444766101276553, lr=0.000986529164359162
2023-11-21 14:21:26   INFO  epoch: 3/30, acc_iter=21911, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:10:08, time_cost(all): 5:56:55/1 day, 21:40:56, loss=0.565240068645824, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=3.949274685296385, lr=0.000986208430177238
2023-11-21 14:22:15   INFO  epoch: 3/30, acc_iter=21961, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:09:12, time_cost(all): 5:57:44/1 day, 23:12:07, loss=0.565156954659515, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=1.318818123945002, lr=0.000985887695995313
2023-11-21 14:23:04   INFO  epoch: 3/30, acc_iter=22011, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:08:51, time_cost(all): 5:58:33/2 days, 1:59:30, loss=0.565073840673206, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=2.0094776877345986, lr=0.000985566961813388
2023-11-21 14:23:53   INFO  epoch: 3/30, acc_iter=22061, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:10:46, time_cost(all): 5:59:22/1 day, 22:10:11, loss=0.564990726686897, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=2.7902623217810594, lr=0.000985246227631464
2023-11-21 14:24:42   INFO  epoch: 3/30, acc_iter=22111, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:09:27, time_cost(all): 6:00:11/1 day, 22:20:51, loss=0.564907612700587, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=3.157733332587044, lr=0.000984925493449539
2023-11-21 14:25:31   INFO  epoch: 3/30, acc_iter=22161, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:05:56, time_cost(all): 6:01:00/2 days, 1:48:57, loss=0.564824498714278, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=2.938049538101565, lr=0.000984604759267614
2023-11-21 14:26:20   INFO  epoch: 3/30, acc_iter=22211, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:04:21, time_cost(all): 6:01:49/2 days, 1:07:53, loss=0.564741384727969, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=4.673369003693898, lr=0.00098428402508569
2023-11-21 14:27:09   INFO  epoch: 3/30, acc_iter=22261, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:07:42, time_cost(all): 6:02:38/2 days, 1:57:14, loss=0.56465827074166, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=1.996232866613076, lr=0.000983963290903765
2023-11-21 14:27:59   INFO  epoch: 3/30, acc_iter=22311, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:06:09, time_cost(all): 6:03:28/2 days, 2:14:19, loss=0.564575156755351, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.618732526518984, lr=0.00098364255672184
2023-11-21 14:28:48   INFO  epoch: 3/30, acc_iter=22361, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:04:22, time_cost(all): 6:04:17/2 days, 0:17:22, loss=0.564492042769042, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=3.63644812516906, lr=0.000983321822539915
2023-11-21 14:29:37   INFO  epoch: 3/30, acc_iter=22411, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:22, time_cost(all): 6:05:06/1 day, 23:06:56, loss=0.564408928782733, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=4.088442012775397, lr=0.000983001088357991
2023-11-21 14:30:26   INFO  epoch: 3/30, acc_iter=22461, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:03:15, time_cost(all): 6:05:55/1 day, 23:17:54, loss=0.564325814796424, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=2.351110655458607, lr=0.000982680354176066
2023-11-21 14:31:15   INFO  epoch: 3/30, acc_iter=22511, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:00:15, time_cost(all): 6:06:44/2 days, 0:53:44, loss=0.564242700810115, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=3.868655515669439, lr=0.000982359619994141
2023-11-21 14:32:04   INFO  epoch: 3/30, acc_iter=22561, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/0:59:08, time_cost(all): 6:07:33/1 day, 23:51:37, loss=0.564159586823806, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=2.6024931582548017, lr=0.000982038885812217
2023-11-21 14:32:53   INFO  epoch: 3/30, acc_iter=22611, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:02:39, time_cost(all): 6:08:22/2 days, 0:53:31, loss=0.564076472837497, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=3.239927000991693, lr=0.000981718151630292
2023-11-21 14:33:42   INFO  epoch: 3/30, acc_iter=22661, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:01:59, time_cost(all): 6:09:11/1 day, 23:29:41, loss=0.563993358851188, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=1.6882000741832563, lr=0.000981397417448367
2023-11-21 14:34:31   INFO  epoch: 3/30, acc_iter=22711, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:00:12, time_cost(all): 6:10:00/2 days, 1:23:24, loss=0.563910244864879, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=2.90423124009439, lr=0.000981076683266443
2023-11-21 14:35:21   INFO  epoch: 3/30, acc_iter=22761, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:56:49, time_cost(all): 6:10:50/2 days, 0:01:46, loss=0.56382713087857, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=1.2457691759657052, lr=0.000980755949084518
2023-11-21 14:36:10   INFO  epoch: 3/30, acc_iter=22811, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:57:29, time_cost(all): 6:11:39/1 day, 23:05:38, loss=0.563744016892261, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=4.916064968296633, lr=0.000980435214902593
2023-11-21 14:36:59   INFO  epoch: 3/30, acc_iter=22861, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:34, time_cost(all): 6:12:28/1 day, 21:34:04, loss=0.563660902905951, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=0.9964128609204892, lr=0.000980114480720668
2023-11-21 14:37:48   INFO  epoch: 3/30, acc_iter=22911, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:58:34, time_cost(all): 6:13:17/2 days, 1:30:54, loss=0.563577788919642, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=3.1448644176216742, lr=0.000979793746538744
2023-11-21 14:38:37   INFO  epoch: 3/30, acc_iter=22961, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:54:29, time_cost(all): 6:14:06/1 day, 23:20:44, loss=0.563494674933333, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.4397436655317923, lr=0.000979473012356819
2023-11-21 14:39:26   INFO  epoch: 3/30, acc_iter=23011, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:52:54, time_cost(all): 6:14:55/1 day, 23:38:41, loss=0.563411560947024, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=1.096168291535092, lr=0.000979152278174894
2023-11-21 14:40:15   INFO  epoch: 3/30, acc_iter=23061, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:54:03, time_cost(all): 6:15:44/1 day, 22:18:21, loss=0.563328446960715, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=2.6254812794168374, lr=0.000978831543992969
2023-11-21 14:41:04   INFO  epoch: 3/30, acc_iter=23111, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:55:04, time_cost(all): 6:16:33/1 day, 21:47:48, loss=0.563245332974406, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=3.723643772724611, lr=0.000978510809811045
2023-11-21 14:41:54   INFO  epoch: 3/30, acc_iter=23161, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:50:28, time_cost(all): 6:17:23/1 day, 23:38:12, loss=0.563162218988097, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=1.3320112601089942, lr=0.00097819007562912
2023-11-21 14:42:43   INFO  epoch: 3/30, acc_iter=23211, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:53:34, time_cost(all): 6:18:12/1 day, 23:58:53, loss=0.563079105001788, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=4.998649909107762, lr=0.000977869341447196
2023-11-21 14:43:32   INFO  epoch: 3/30, acc_iter=23261, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:50:48, time_cost(all): 6:19:01/1 day, 21:54:38, loss=0.562995991015479, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.91(1.03), norm=4.813043274722379, lr=0.000977548607265271
2023-11-21 14:44:21   INFO  epoch: 3/30, acc_iter=23311, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:09, time_cost(all): 6:19:50/2 days, 0:24:36, loss=0.56291287702917, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=3.9765376058034025, lr=0.000977227873083346
2023-11-21 14:45:10   INFO  epoch: 3/30, acc_iter=23361, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:49:26, time_cost(all): 6:20:39/1 day, 23:49:22, loss=0.562829763042861, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=4.816055011859332, lr=0.000976907138901421
2023-11-21 14:45:59   INFO  epoch: 3/30, acc_iter=23411, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:25, time_cost(all): 6:21:28/2 days, 0:56:29, loss=0.562746649056552, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=1.812797883560638, lr=0.000976586404719497
2023-11-21 14:46:48   INFO  epoch: 3/30, acc_iter=23461, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:48:21, time_cost(all): 6:22:17/1 day, 22:35:05, loss=0.562663535070243, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.92(1.03), norm=3.8690002749154737, lr=0.000976265670537572
2023-11-21 14:47:37   INFO  epoch: 3/30, acc_iter=23511, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:28, time_cost(all): 6:23:06/1 day, 21:47:39, loss=0.562580421083934, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=4.5201782053322015, lr=0.000975944936355647
2023-11-21 14:48:26   INFO  epoch: 3/30, acc_iter=23561, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:50, time_cost(all): 6:23:55/1 day, 22:33:45, loss=0.562497307097625, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=2.2797806347307192, lr=0.000975624202173722
2023-11-21 14:49:16   INFO  epoch: 3/30, acc_iter=23611, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:45:47, time_cost(all): 6:24:45/2 days, 0:15:01, loss=0.562414193111316, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=0.9415727320731179, lr=0.000975303467991798
2023-11-21 14:50:05   INFO  epoch: 3/30, acc_iter=23661, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:09, time_cost(all): 6:25:34/1 day, 23:38:52, loss=0.562331079125006, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=3.8915127235033524, lr=0.000974982733809873
2023-11-21 14:50:54   INFO  epoch: 3/30, acc_iter=23711, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:42, time_cost(all): 6:26:23/1 day, 22:17:28, loss=0.562247965138697, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=3.7820219638287513, lr=0.000974661999627948
2023-11-21 14:51:43   INFO  epoch: 3/30, acc_iter=23761, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:44:07, time_cost(all): 6:27:12/1 day, 23:05:10, loss=0.562164851152388, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.6583502717601515, lr=0.000974341265446024
2023-11-21 14:52:32   INFO  epoch: 3/30, acc_iter=23811, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:43:05, time_cost(all): 6:28:01/1 day, 23:05:44, loss=0.562081737166079, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=4.648310431150857, lr=0.000974020531264099
2023-11-21 14:53:21   INFO  epoch: 3/30, acc_iter=23861, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:39:02, time_cost(all): 6:28:50/1 day, 22:07:28, loss=0.56199862317977, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=3.2943082377160557, lr=0.000973699797082174
2023-11-21 14:54:10   INFO  epoch: 3/30, acc_iter=23911, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:34, time_cost(all): 6:29:39/1 day, 22:43:53, loss=0.561915509193461, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=2.409641882602309, lr=0.00097337906290025
2023-11-21 14:54:59   INFO  epoch: 3/30, acc_iter=23961, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:39:58, time_cost(all): 6:30:28/1 day, 22:48:02, loss=0.561832395207152, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.741685141967793, lr=0.000973058328718325
2023-11-21 14:55:49   INFO  epoch: 3/30, acc_iter=24011, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:21, time_cost(all): 6:31:18/1 day, 22:21:17, loss=0.561749281220843, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=3.944987031694266, lr=0.0009727375945364
2023-11-21 14:56:38   INFO  epoch: 3/30, acc_iter=24061, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:35, time_cost(all): 6:32:07/1 day, 22:42:34, loss=0.561666167234534, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=1.416559161469293, lr=0.000972416860354475
2023-11-21 14:57:27   INFO  epoch: 3/30, acc_iter=24111, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:35:34, time_cost(all): 6:32:56/1 day, 22:59:56, loss=0.561583053248225, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=3.857305152941505, lr=0.000972096126172551
2023-11-21 14:58:16   INFO  epoch: 3/30, acc_iter=24161, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:18, time_cost(all): 6:33:45/1 day, 23:30:37, loss=0.561499939261916, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=4.986202058164937, lr=0.000971775391990626
2023-11-21 14:59:05   INFO  epoch: 3/30, acc_iter=24211, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:33:54, time_cost(all): 6:34:34/1 day, 22:24:34, loss=0.561416825275607, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=4.965759512414702, lr=0.000971454657808701
2023-11-21 14:59:54   INFO  epoch: 3/30, acc_iter=24261, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:19, time_cost(all): 6:35:23/1 day, 23:14:41, loss=0.561333711289298, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=1.743792816223178, lr=0.000971133923626777
2023-11-21 15:00:43   INFO  epoch: 3/30, acc_iter=24311, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:15, time_cost(all): 6:36:12/1 day, 22:12:42, loss=0.561250597302989, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=1.636020395614009, lr=0.000970813189444852
2023-11-21 15:01:32   INFO  epoch: 3/30, acc_iter=24361, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:30:59, time_cost(all): 6:37:01/1 day, 23:22:24, loss=0.56116748331668, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=3.206277773274208, lr=0.000970492455262927
2023-11-21 15:02:21   INFO  epoch: 3/30, acc_iter=24411, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:49, time_cost(all): 6:37:50/1 day, 21:26:25, loss=0.561084369330371, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=0.718652173568781, lr=0.000970171721081003
2023-11-21 15:03:11   INFO  epoch: 3/30, acc_iter=24461, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:50, time_cost(all): 6:38:40/1 day, 23:39:53, loss=0.561001255344062, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=1.2177509706662573, lr=0.000969850986899078
2023-11-21 15:04:00   INFO  epoch: 3/30, acc_iter=24511, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:30:44, time_cost(all): 6:39:29/2 days, 0:22:06, loss=0.560918141357752, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=2.7767154718373335, lr=0.000969530252717153
2023-11-21 15:04:49   INFO  epoch: 3/30, acc_iter=24561, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:15, time_cost(all): 6:40:18/1 day, 21:21:29, loss=0.560835027371443, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=1.2310759507088729, lr=0.000969209518535228
2023-11-21 15:05:38   INFO  epoch: 3/30, acc_iter=24611, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:53, time_cost(all): 6:41:07/1 day, 21:20:53, loss=0.560751913385134, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=4.332628977857967, lr=0.000968888784353304
2023-11-21 15:06:27   INFO  epoch: 3/30, acc_iter=24661, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:26:39, time_cost(all): 6:41:56/1 day, 20:55:58, loss=0.560668799398825, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=4.741963278875753, lr=0.000968568050171379
2023-11-21 15:07:16   INFO  epoch: 3/30, acc_iter=24711, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:25:35, time_cost(all): 6:42:45/1 day, 22:53:00, loss=0.560585685412516, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=3.647626691451916, lr=0.000968247315989454
2023-11-21 15:08:05   INFO  epoch: 3/30, acc_iter=24761, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:38, time_cost(all): 6:43:34/1 day, 21:33:09, loss=0.560502571426207, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=1.1133769629703638, lr=0.00096792658180753
2023-11-21 15:08:54   INFO  epoch: 3/30, acc_iter=24811, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:28, time_cost(all): 6:44:23/1 day, 22:24:35, loss=0.560419457439898, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=1.810528590884429, lr=0.000967605847625605
2023-11-21 15:09:44   INFO  epoch: 3/30, acc_iter=24861, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:31, time_cost(all): 6:45:13/1 day, 22:22:52, loss=0.560336343453589, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=2.1670467573308105, lr=0.00096728511344368
2023-11-21 15:10:33   INFO  epoch: 3/30, acc_iter=24911, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:16, time_cost(all): 6:46:02/2 days, 0:51:19, loss=0.56025322946728, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=1.2899120274294458, lr=0.000966964379261755
2023-11-21 15:11:22   INFO  epoch: 3/30, acc_iter=24961, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:13, time_cost(all): 6:46:51/1 day, 21:40:05, loss=0.560170115480971, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=1.7538221339614626, lr=0.000966643645079831
2023-11-21 15:12:11   INFO  epoch: 3/30, acc_iter=25011, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:06, time_cost(all): 6:47:40/1 day, 23:46:53, loss=0.560087001494662, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=1.1294502331665794, lr=0.000966322910897906
2023-11-21 15:13:00   INFO  epoch: 3/30, acc_iter=25061, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:57, time_cost(all): 6:48:29/2 days, 0:57:47, loss=0.560003887508353, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=2.761272693237579, lr=0.000966002176715981
2023-11-21 15:13:49   INFO  epoch: 3/30, acc_iter=25111, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:21:06, time_cost(all): 6:49:18/1 day, 22:24:30, loss=0.559920773522044, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=3.7192269683462293, lr=0.000965681442534057
2023-11-21 15:14:38   INFO  epoch: 3/30, acc_iter=25161, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:12, time_cost(all): 6:50:07/2 days, 1:17:12, loss=0.559837659535735, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=1.3695955568261957, lr=0.000965360708352132
2023-11-21 15:15:27   INFO  epoch: 3/30, acc_iter=25211, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:21, time_cost(all): 6:50:56/1 day, 20:50:38, loss=0.559754545549426, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=4.284167013388146, lr=0.000965039974170207
2023-11-21 15:16:16   INFO  epoch: 3/30, acc_iter=25261, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:09, time_cost(all): 6:51:45/2 days, 1:18:47, loss=0.559671431563116, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=2.3619863596077195, lr=0.000964719239988283
2023-11-21 15:17:06   INFO  epoch: 3/30, acc_iter=25311, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:03, time_cost(all): 6:52:35/1 day, 23:15:28, loss=0.559588317576807, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=4.780530107719022, lr=0.000964398505806358
2023-11-21 15:17:55   INFO  epoch: 3/30, acc_iter=25361, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:52, time_cost(all): 6:53:24/2 days, 0:45:33, loss=0.559505203590498, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=4.377507080130443, lr=0.000964077771624433
2023-11-21 15:18:44   INFO  epoch: 3/30, acc_iter=25411, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:58, time_cost(all): 6:54:13/1 day, 21:58:31, loss=0.559422089604189, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=3.2107484215996838, lr=0.000963757037442508
2023-11-21 15:19:33   INFO  epoch: 3/30, acc_iter=25461, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:15:13, time_cost(all): 6:55:02/1 day, 21:47:22, loss=0.55933897561788, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=4.601287868587612, lr=0.000963436303260584
2023-11-21 15:20:22   INFO  epoch: 3/30, acc_iter=25511, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:27, time_cost(all): 6:55:51/1 day, 23:28:23, loss=0.559255861631571, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=2.9531418119650468, lr=0.000963115569078659
2023-11-21 15:21:11   INFO  epoch: 3/30, acc_iter=25561, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:38, time_cost(all): 6:56:40/2 days, 0:00:25, loss=0.559172747645262, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=4.704388498501953, lr=0.000962794834896734
2023-11-21 15:22:00   INFO  epoch: 3/30, acc_iter=25611, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:19, time_cost(all): 6:57:29/1 day, 20:38:41, loss=0.559089633658953, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=0.8948752577524174, lr=0.00096247410071481
2023-11-21 15:22:49   INFO  epoch: 3/30, acc_iter=25661, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:42, time_cost(all): 6:58:18/2 days, 0:53:05, loss=0.559006519672644, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=2.7777696602016384, lr=0.000962153366532885
2023-11-21 15:23:39   INFO  epoch: 3/30, acc_iter=25711, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:33, time_cost(all): 6:59:08/1 day, 21:49:55, loss=0.558923405686335, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=4.483790427272668, lr=0.00096183263235096
2023-11-21 15:24:28   INFO  epoch: 3/30, acc_iter=25761, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:54, time_cost(all): 6:59:57/1 day, 23:31:29, loss=0.558840291700026, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=1.9251690934509458, lr=0.000961511898169036
2023-11-21 15:25:17   INFO  epoch: 3/30, acc_iter=25811, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:51, time_cost(all): 7:00:46/1 day, 22:04:38, loss=0.558757177713717, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=4.540400789613364, lr=0.000961191163987111
2023-11-21 15:26:06   INFO  epoch: 3/30, acc_iter=25861, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:42, time_cost(all): 7:01:35/1 day, 22:58:46, loss=0.558674063727408, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=1.951437674831662, lr=0.000960870429805186
2023-11-21 15:26:55   INFO  epoch: 3/30, acc_iter=25911, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:06:53, time_cost(all): 7:02:24/1 day, 21:41:05, loss=0.558590949741099, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=1.2405951891874878, lr=0.000960549695623261
2023-11-21 15:27:44   INFO  epoch: 3/30, acc_iter=25961, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:21, time_cost(all): 7:03:13/1 day, 21:06:59, loss=0.55850783575479, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=3.4234717982206275, lr=0.000960228961441337
2023-11-21 15:28:33   INFO  epoch: 3/30, acc_iter=26011, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:32, time_cost(all): 7:04:02/1 day, 21:10:00, loss=0.558424721768481, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=2.602006608919436, lr=0.000959908227259412
2023-11-21 15:29:22   INFO  epoch: 3/30, acc_iter=26061, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:36, time_cost(all): 7:04:51/1 day, 23:10:56, loss=0.558341607782171, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=2.0483967673680405, lr=0.000959587493077487
2023-11-21 15:30:11   INFO  epoch: 3/30, acc_iter=26111, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:04:00, time_cost(all): 7:05:40/2 days, 0:43:46, loss=0.558258493795862, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=4.16081579848106, lr=0.000959266758895563
2023-11-21 15:31:01   INFO  epoch: 3/30, acc_iter=26161, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:06, time_cost(all): 7:06:30/2 days, 0:58:03, loss=0.558175379809553, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=1.9155049745705517, lr=0.000958946024713638
2023-11-21 15:31:50   INFO  epoch: 3/30, acc_iter=26211, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:14, time_cost(all): 7:07:19/1 day, 22:02:36, loss=0.558092265823244, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=1.3595502822890912, lr=0.000958625290531713
2023-11-21 15:32:39   INFO  epoch: 3/30, acc_iter=26261, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:24, time_cost(all): 7:08:08/1 day, 22:28:07, loss=0.558009151836935, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=3.9970958893245827, lr=0.000958304556349788
2023-11-21 15:33:28   INFO  epoch: 3/30, acc_iter=26311, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:36, time_cost(all): 7:08:57/1 day, 23:11:10, loss=0.557926037850626, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=4.864069315508175, lr=0.000957983822167864
2023-11-21 15:34:17   INFO  epoch: 4/30, acc_iter=26398, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:47:48, time_cost(all): 7:09:46/2 days, 1:05:21, loss=0.557781419514448, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=4.948830351295952, lr=0.000957425744691315
2023-11-21 15:35:06   INFO  epoch: 4/30, acc_iter=26448, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:44:03, time_cost(all): 7:10:35/1 day, 21:45:10, loss=0.557698305528139, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=4.328432273984557, lr=0.00095710501050939
2023-11-21 15:35:55   INFO  epoch: 4/30, acc_iter=26498, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:46:17, time_cost(all): 7:11:24/1 day, 22:28:46, loss=0.55761519154183, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=3.82417674924998, lr=0.000956784276327465
2023-11-21 15:36:44   INFO  epoch: 4/30, acc_iter=26548, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:45:12, time_cost(all): 7:12:13/1 day, 21:15:13, loss=0.557532077555521, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=3.370628363992315, lr=0.000956463542145541
2023-11-21 15:37:33   INFO  epoch: 4/30, acc_iter=26598, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:39:05, time_cost(all): 7:13:02/1 day, 22:01:57, loss=0.557448963569212, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=0.649863166194016, lr=0.000956142807963616
2023-11-21 15:38:23   INFO  epoch: 4/30, acc_iter=26648, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:46:55, time_cost(all): 7:13:52/1 day, 20:58:40, loss=0.557365849582903, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=2.0346858931287013, lr=0.000955822073781691
2023-11-21 15:39:12   INFO  epoch: 4/30, acc_iter=26698, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:39:22, time_cost(all): 7:14:41/1 day, 23:29:18, loss=0.557282735596594, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=0.7390417647119089, lr=0.000955501339599767
2023-11-21 15:40:01   INFO  epoch: 4/30, acc_iter=26748, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:41:33, time_cost(all): 7:15:30/1 day, 21:41:27, loss=0.557199621610285, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=2.7488314062694648, lr=0.000955180605417842
2023-11-21 15:40:50   INFO  epoch: 4/30, acc_iter=26798, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:40:00, time_cost(all): 7:16:19/2 days, 0:15:53, loss=0.557116507623976, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=4.5482288413942715, lr=0.000954859871235917
2023-11-21 15:41:39   INFO  epoch: 4/30, acc_iter=26848, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:39:21, time_cost(all): 7:17:08/1 day, 21:56:25, loss=0.557033393637667, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=3.7989184758410586, lr=0.000954539137053992
2023-11-21 15:42:28   INFO  epoch: 4/30, acc_iter=26898, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:34:23, time_cost(all): 7:17:57/1 day, 22:52:14, loss=0.556950279651358, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=1.1615687310910348, lr=0.000954218402872068
2023-11-21 15:43:17   INFO  epoch: 4/30, acc_iter=26948, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:36:14, time_cost(all): 7:18:46/1 day, 20:47:23, loss=0.556867165665049, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=3.4107043688816416, lr=0.000953897668690143
2023-11-21 15:44:06   INFO  epoch: 4/30, acc_iter=26998, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:36:11, time_cost(all): 7:19:35/1 day, 21:18:34, loss=0.55678405167874, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=3.7166100687717933, lr=0.000953576934508218
2023-11-21 15:44:56   INFO  epoch: 4/30, acc_iter=27048, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:40:39, time_cost(all): 7:20:25/1 day, 22:15:42, loss=0.556700937692431, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=0.6043962717823583, lr=0.000953256200326294
2023-11-21 15:45:45   INFO  epoch: 4/30, acc_iter=27098, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:35:41, time_cost(all): 7:21:14/1 day, 21:03:44, loss=0.556617823706122, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=4.128626915515672, lr=0.000952935466144369
2023-11-21 15:46:34   INFO  epoch: 4/30, acc_iter=27148, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:31:53, time_cost(all): 7:22:03/1 day, 23:35:32, loss=0.556534709719812, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=3.600389840061533, lr=0.000952614731962444
2023-11-21 15:47:23   INFO  epoch: 4/30, acc_iter=27198, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:37:14, time_cost(all): 7:22:52/1 day, 22:24:07, loss=0.556451595733503, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=1.5963341794637955, lr=0.000952293997780519
2023-11-21 15:48:12   INFO  epoch: 4/30, acc_iter=27248, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:32:38, time_cost(all): 7:23:41/1 day, 23:45:28, loss=0.556368481747194, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=4.3796299797915115, lr=0.000951973263598595
2023-11-21 15:49:01   INFO  epoch: 4/30, acc_iter=27298, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:36:00, time_cost(all): 7:24:30/1 day, 23:54:46, loss=0.556285367760885, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=3.7285983192606804, lr=0.00095165252941667
2023-11-21 15:49:50   INFO  epoch: 4/30, acc_iter=27348, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:28:49, time_cost(all): 7:25:19/1 day, 21:05:13, loss=0.556202253774576, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=0.8633255835213358, lr=0.000951331795234745
2023-11-21 15:50:39   INFO  epoch: 4/30, acc_iter=27398, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:33:01, time_cost(all): 7:26:08/1 day, 22:42:09, loss=0.556119139788267, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=3.7969599059215122, lr=0.000951011061052821
2023-11-21 15:51:28   INFO  epoch: 4/30, acc_iter=27448, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:30:22, time_cost(all): 7:26:57/1 day, 21:54:20, loss=0.556036025801958, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=2.1295140296085577, lr=0.000950690326870896
2023-11-21 15:52:18   INFO  epoch: 4/30, acc_iter=27498, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:27:40, time_cost(all): 7:27:47/1 day, 23:42:23, loss=0.555952911815649, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=3.9209982727094066, lr=0.000950369592688971
2023-11-21 15:53:07   INFO  epoch: 4/30, acc_iter=27548, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:28:25, time_cost(all): 7:28:36/1 day, 22:25:31, loss=0.55586979782934, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=3.204094383591908, lr=0.000950048858507047
2023-11-21 15:53:56   INFO  epoch: 4/30, acc_iter=27598, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:27:00, time_cost(all): 7:29:25/1 day, 20:16:30, loss=0.555786683843031, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=1.7419034593065872, lr=0.000949728124325122
2023-11-21 15:54:45   INFO  epoch: 4/30, acc_iter=27648, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:28:39, time_cost(all): 7:30:14/1 day, 21:27:59, loss=0.555703569856722, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=2.4735973476226905, lr=0.000949407390143197
2023-11-21 15:55:34   INFO  epoch: 4/30, acc_iter=27698, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:24:09, time_cost(all): 7:31:03/2 days, 0:33:06, loss=0.555620455870413, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=3.2152404303029676, lr=0.000949086655961272
2023-11-21 15:56:23   INFO  epoch: 4/30, acc_iter=27748, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:27:31, time_cost(all): 7:31:52/1 day, 22:52:24, loss=0.555537341884104, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=4.055516851034394, lr=0.000948765921779348
2023-11-21 15:57:12   INFO  epoch: 4/30, acc_iter=27798, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:23:04, time_cost(all): 7:32:41/2 days, 0:20:43, loss=0.555454227897795, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=1.4562106910746877, lr=0.000948445187597423
2023-11-21 15:58:01   INFO  epoch: 4/30, acc_iter=27848, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:25:15, time_cost(all): 7:33:30/1 day, 22:12:08, loss=0.555371113911486, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.868162004907234, lr=0.000948124453415498
2023-11-21 15:58:51   INFO  epoch: 4/30, acc_iter=27898, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:24:28, time_cost(all): 7:34:20/1 day, 20:50:49, loss=0.555287999925177, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.09(1.03), norm=4.051748917115505, lr=0.000947803719233574
2023-11-21 15:59:40   INFO  epoch: 4/30, acc_iter=27948, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:25:27, time_cost(all): 7:35:09/1 day, 23:21:27, loss=0.555204885938867, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=4.056492691298308, lr=0.000947482985051649
2023-11-21 16:00:29   INFO  epoch: 4/30, acc_iter=27998, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:17:55, time_cost(all): 7:35:58/1 day, 22:57:46, loss=0.555121771952558, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=3.955530321833948, lr=0.000947162250869724
2023-11-21 16:01:18   INFO  epoch: 4/30, acc_iter=28048, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:23:18, time_cost(all): 7:36:47/1 day, 21:44:20, loss=0.555038657966249, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=2.446275345200723, lr=0.0009468415166878
2023-11-21 16:02:07   INFO  epoch: 4/30, acc_iter=28098, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:17:50, time_cost(all): 7:37:36/1 day, 23:01:24, loss=0.55495554397994, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=0.5517421906131171, lr=0.000946520782505875
2023-11-21 16:02:56   INFO  epoch: 4/30, acc_iter=28148, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:17:39, time_cost(all): 7:38:25/1 day, 21:49:10, loss=0.554872429993631, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=2.4810905314299174, lr=0.00094620004832395
2023-11-21 16:03:45   INFO  epoch: 4/30, acc_iter=28198, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:21:17, time_cost(all): 7:39:14/2 days, 0:29:43, loss=0.554789316007322, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.93(1.03), norm=4.354804837840736, lr=0.000945879314142025
2023-11-21 16:04:34   INFO  epoch: 4/30, acc_iter=28248, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:18:25, time_cost(all): 7:40:03/1 day, 20:19:37, loss=0.554706202021013, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=2.898866990100733, lr=0.000945558579960101
2023-11-21 16:05:23   INFO  epoch: 4/30, acc_iter=28298, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:15:28, time_cost(all): 7:40:52/1 day, 20:07:23, loss=0.554623088034704, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=0.54515256264977, lr=0.000945237845778176
2023-11-21 16:06:13   INFO  epoch: 4/30, acc_iter=28348, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:14:23, time_cost(all): 7:41:42/1 day, 21:15:27, loss=0.554539974048395, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=3.8240831186347273, lr=0.000944917111596251
2023-11-21 16:07:02   INFO  epoch: 4/30, acc_iter=28398, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:17:42, time_cost(all): 7:42:31/1 day, 22:50:43, loss=0.554456860062086, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=2.9381674652062, lr=0.000944596377414327
2023-11-21 16:07:51   INFO  epoch: 4/30, acc_iter=28448, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:09:56, time_cost(all): 7:43:20/1 day, 22:22:59, loss=0.554373746075777, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=3.8240540580763644, lr=0.000944275643232402
2023-11-21 16:08:40   INFO  epoch: 4/30, acc_iter=28498, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:11:07, time_cost(all): 7:44:09/1 day, 20:50:53, loss=0.554290632089468, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=3.2468487486268405, lr=0.000943954909050477
2023-11-21 16:09:29   INFO  epoch: 4/30, acc_iter=28548, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:10:15, time_cost(all): 7:44:58/2 days, 0:23:53, loss=0.554207518103159, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=3.2270708836718933, lr=0.000943634174868552
2023-11-21 16:10:18   INFO  epoch: 4/30, acc_iter=28598, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:13:43, time_cost(all): 7:45:47/1 day, 20:57:39, loss=0.55412440411685, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.07(1.03), norm=0.8685367361171863, lr=0.000943313440686628
2023-11-21 16:11:07   INFO  epoch: 4/30, acc_iter=28648, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:07:42, time_cost(all): 7:46:36/1 day, 22:45:12, loss=0.554041290130541, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=3.0169199779683415, lr=0.000942992706504703
2023-11-21 16:11:56   INFO  epoch: 4/30, acc_iter=28698, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:12:40, time_cost(all): 7:47:25/1 day, 23:16:11, loss=0.553958176144231, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=4.931028100141953, lr=0.000942671972322778
2023-11-21 16:12:46   INFO  epoch: 4/30, acc_iter=28748, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:08:13, time_cost(all): 7:48:15/2 days, 0:14:42, loss=0.553875062157922, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=0.979329300526725, lr=0.000942351238140854
2023-11-21 16:13:35   INFO  epoch: 4/30, acc_iter=28798, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:04:52, time_cost(all): 7:49:04/1 day, 22:53:39, loss=0.553791948171613, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=3.45759134577066, lr=0.000942030503958929
2023-11-21 16:14:24   INFO  epoch: 4/30, acc_iter=28848, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:06:32, time_cost(all): 7:49:53/1 day, 22:06:03, loss=0.553708834185304, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=1.6108908034894844, lr=0.000941709769777004
2023-11-21 16:15:13   INFO  epoch: 4/30, acc_iter=28898, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:04:44, time_cost(all): 7:50:42/1 day, 20:09:49, loss=0.553625720198995, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=2.800000679335252, lr=0.00094138903559508
2023-11-21 16:16:02   INFO  epoch: 4/30, acc_iter=28948, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:04:37, time_cost(all): 7:51:31/1 day, 20:18:09, loss=0.553542606212686, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=1.7500343368258977, lr=0.000941068301413155
2023-11-21 16:16:51   INFO  epoch: 4/30, acc_iter=28998, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:02:22, time_cost(all): 7:52:20/1 day, 20:45:53, loss=0.553459492226377, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=2.725502660301503, lr=0.00094074756723123
2023-11-21 16:17:40   INFO  epoch: 4/30, acc_iter=29048, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:00:54, time_cost(all): 7:53:09/1 day, 21:10:46, loss=0.553376378240068, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=3.711362080570756, lr=0.000940426833049305
2023-11-21 16:18:29   INFO  epoch: 4/30, acc_iter=29098, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:02:02, time_cost(all): 7:53:58/1 day, 23:48:58, loss=0.553293264253759, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=1.299509558377399, lr=0.000940106098867381
2023-11-21 16:19:18   INFO  epoch: 4/30, acc_iter=29148, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/0:59:37, time_cost(all): 7:54:47/1 day, 19:51:52, loss=0.55321015026745, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.4274316595140248, lr=0.000939785364685456
2023-11-21 16:20:08   INFO  epoch: 4/30, acc_iter=29198, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:03:28, time_cost(all): 7:55:37/1 day, 21:08:50, loss=0.553127036281141, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=1.299444525882173, lr=0.000939464630503531
2023-11-21 16:20:57   INFO  epoch: 4/30, acc_iter=29248, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:03:21, time_cost(all): 7:56:26/1 day, 20:30:42, loss=0.553043922294832, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.23(1.03), norm=2.232182956567916, lr=0.000939143896321607
2023-11-21 16:21:46   INFO  epoch: 4/30, acc_iter=29298, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:59:26, time_cost(all): 7:57:15/1 day, 21:22:55, loss=0.552960808308523, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=1.945500658070953, lr=0.000938823162139682
2023-11-21 16:22:35   INFO  epoch: 4/30, acc_iter=29348, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:55:53, time_cost(all): 7:58:04/1 day, 23:17:11, loss=0.552877694322214, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=3.6719703723706827, lr=0.000938502427957757
2023-11-21 16:23:24   INFO  epoch: 4/30, acc_iter=29398, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:58:45, time_cost(all): 7:58:53/2 days, 0:07:53, loss=0.552794580335905, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=2.042884719988527, lr=0.000938181693775833
2023-11-21 16:24:13   INFO  epoch: 4/30, acc_iter=29448, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:55:50, time_cost(all): 7:59:42/1 day, 20:53:28, loss=0.552711466349596, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.566212316759055, lr=0.000937860959593908
2023-11-21 16:25:02   INFO  epoch: 4/30, acc_iter=29498, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:55:31, time_cost(all): 8:00:31/1 day, 23:08:25, loss=0.552628352363287, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=2.3610756110734523, lr=0.000937540225411983
2023-11-21 16:25:51   INFO  epoch: 4/30, acc_iter=29548, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:53:26, time_cost(all): 8:01:20/1 day, 22:41:48, loss=0.552545238376977, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=0.8159493458113687, lr=0.000937219491230058
2023-11-21 16:26:41   INFO  epoch: 4/30, acc_iter=29598, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:54:39, time_cost(all): 8:02:10/1 day, 20:40:19, loss=0.552462124390668, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=4.641150634453345, lr=0.000936898757048134
2023-11-21 16:27:30   INFO  epoch: 4/30, acc_iter=29648, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:28, time_cost(all): 8:02:59/1 day, 20:28:51, loss=0.552379010404359, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=4.2537757088133095, lr=0.000936578022866209
2023-11-21 16:28:19   INFO  epoch: 4/30, acc_iter=29698, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:52:35, time_cost(all): 8:03:48/1 day, 21:15:21, loss=0.55229589641805, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=1.1139153645362927, lr=0.000936257288684284
2023-11-21 16:29:08   INFO  epoch: 4/30, acc_iter=29748, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:52:14, time_cost(all): 8:04:37/1 day, 23:21:59, loss=0.552212782431741, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=1.9236394536962083, lr=0.00093593655450236
2023-11-21 16:29:57   INFO  epoch: 4/30, acc_iter=29798, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:57, time_cost(all): 8:05:26/1 day, 22:18:00, loss=0.552129668445432, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=4.14764398083407, lr=0.000935615820320435
2023-11-21 16:30:46   INFO  epoch: 4/30, acc_iter=29848, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:51:22, time_cost(all): 8:06:15/1 day, 21:17:44, loss=0.552046554459123, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=2.247737270335471, lr=0.00093529508613851
2023-11-21 16:31:35   INFO  epoch: 4/30, acc_iter=29898, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:48:04, time_cost(all): 8:07:04/1 day, 23:33:48, loss=0.551963440472814, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=2.6507628857643937, lr=0.000934974351956585
2023-11-21 16:32:24   INFO  epoch: 4/30, acc_iter=29948, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:50:08, time_cost(all): 8:07:53/1 day, 20:34:09, loss=0.551880326486505, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=2.1870501279057235, lr=0.000934653617774661
2023-11-21 16:33:13   INFO  epoch: 4/30, acc_iter=29998, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:04, time_cost(all): 8:08:42/1 day, 20:42:09, loss=0.551797212500196, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=0.859256631641306, lr=0.000934332883592736
2023-11-21 16:34:03   INFO  epoch: 4/30, acc_iter=30048, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:46:47, time_cost(all): 8:09:32/1 day, 22:05:25, loss=0.551714098513887, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=4.727637318194317, lr=0.000934012149410811
2023-11-21 16:34:52   INFO  epoch: 4/30, acc_iter=30098, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:28, time_cost(all): 8:10:21/1 day, 19:40:46, loss=0.551630984527578, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=1.2893882317567993, lr=0.000933691415228887
2023-11-21 16:35:41   INFO  epoch: 4/30, acc_iter=30148, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:45:05, time_cost(all): 8:11:10/1 day, 20:09:21, loss=0.551547870541269, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=3.939156587259607, lr=0.000933370681046962
2023-11-21 16:36:30   INFO  epoch: 4/30, acc_iter=30198, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:44:13, time_cost(all): 8:11:59/1 day, 22:46:25, loss=0.55146475655496, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=0.5910447283551008, lr=0.000933049946865037
2023-11-21 16:37:19   INFO  epoch: 4/30, acc_iter=30248, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:43:39, time_cost(all): 8:12:48/1 day, 22:49:44, loss=0.551381642568651, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.0379117907534536, lr=0.000932729212683113
2023-11-21 16:38:08   INFO  epoch: 4/30, acc_iter=30298, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:42:43, time_cost(all): 8:13:37/1 day, 22:50:13, loss=0.551298528582342, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=1.7527751411677648, lr=0.000932408478501188
2023-11-21 16:38:57   INFO  epoch: 4/30, acc_iter=30348, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:44:13, time_cost(all): 8:14:26/1 day, 19:56:55, loss=0.551215414596032, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=3.9659321062791895, lr=0.000932087744319263
2023-11-21 16:39:46   INFO  epoch: 4/30, acc_iter=30398, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:02, time_cost(all): 8:15:15/1 day, 20:37:31, loss=0.551132300609723, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=4.92534792233488, lr=0.000931767010137338
2023-11-21 16:40:36   INFO  epoch: 4/30, acc_iter=30448, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:39:19, time_cost(all): 8:16:05/1 day, 20:43:41, loss=0.551049186623414, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=2.284513053007803, lr=0.000931446275955414
2023-11-21 16:41:25   INFO  epoch: 4/30, acc_iter=30498, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:40:51, time_cost(all): 8:16:54/1 day, 19:22:00, loss=0.550966072637105, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=1.630328614855302, lr=0.000931125541773489
2023-11-21 16:42:14   INFO  epoch: 4/30, acc_iter=30548, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:38:25, time_cost(all): 8:17:43/1 day, 21:53:00, loss=0.550882958650796, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=2.94646818569013, lr=0.000930804807591564
2023-11-21 16:43:03   INFO  epoch: 4/30, acc_iter=30598, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:02, time_cost(all): 8:18:32/1 day, 21:14:02, loss=0.550799844664487, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=0.6425713323725895, lr=0.00093048407340964
2023-11-21 16:43:52   INFO  epoch: 4/30, acc_iter=30648, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:54, time_cost(all): 8:19:21/1 day, 22:10:23, loss=0.550716730678178, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=3.2329915102904687, lr=0.000930163339227715
2023-11-21 16:44:41   INFO  epoch: 4/30, acc_iter=30698, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:56, time_cost(all): 8:20:10/1 day, 23:10:41, loss=0.550633616691869, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=2.7066558817854225, lr=0.00092984260504579
2023-11-21 16:45:30   INFO  epoch: 4/30, acc_iter=30748, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:30, time_cost(all): 8:20:59/1 day, 20:26:24, loss=0.55055050270556, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=0.706555987215596, lr=0.000929521870863865
2023-11-21 16:46:19   INFO  epoch: 4/30, acc_iter=30798, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:33, time_cost(all): 8:21:48/1 day, 23:24:24, loss=0.550467388719251, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=0.8480700942420898, lr=0.000929201136681941
2023-11-21 16:47:08   INFO  epoch: 4/30, acc_iter=30848, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:10, time_cost(all): 8:22:37/1 day, 20:10:07, loss=0.550384274732942, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.05(1.03), norm=2.1090008333378436, lr=0.000928880402500016
2023-11-21 16:47:58   INFO  epoch: 4/30, acc_iter=30898, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:31:42, time_cost(all): 8:23:27/1 day, 23:33:02, loss=0.550301160746633, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=2.5948179924415165, lr=0.000928559668318091
2023-11-21 16:48:47   INFO  epoch: 4/30, acc_iter=30948, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:45, time_cost(all): 8:24:16/1 day, 19:17:28, loss=0.550218046760324, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=3.31243200880091, lr=0.000928238934136167
2023-11-21 16:49:36   INFO  epoch: 4/30, acc_iter=30998, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:30, time_cost(all): 8:25:05/1 day, 20:39:38, loss=0.550134932774015, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=1.0858890070710192, lr=0.000927918199954242
2023-11-21 16:50:25   INFO  epoch: 4/30, acc_iter=31048, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:32:04, time_cost(all): 8:25:54/1 day, 19:19:40, loss=0.550051818787706, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=2.9236635456108684, lr=0.000927597465772317
2023-11-21 16:51:14   INFO  epoch: 4/30, acc_iter=31098, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:16, time_cost(all): 8:26:43/1 day, 23:07:37, loss=0.549968704801396, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=1.0101382417202693, lr=0.000927276731590393
2023-11-21 16:52:03   INFO  epoch: 4/30, acc_iter=31148, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:28:33, time_cost(all): 8:27:32/1 day, 20:16:30, loss=0.549885590815087, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.562717110379158, lr=0.000926955997408468
2023-11-21 16:52:52   INFO  epoch: 4/30, acc_iter=31198, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:27:27, time_cost(all): 8:28:21/1 day, 21:58:13, loss=0.549802476828778, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=3.082927112205358, lr=0.000926635263226543
2023-11-21 16:53:41   INFO  epoch: 4/30, acc_iter=31248, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:51, time_cost(all): 8:29:10/1 day, 22:49:37, loss=0.549719362842469, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=3.2476207319392123, lr=0.000926314529044618
2023-11-21 16:54:31   INFO  epoch: 4/30, acc_iter=31298, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:14, time_cost(all): 8:30:00/1 day, 19:54:49, loss=0.54963624885616, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.618991296587903, lr=0.000925993794862694
2023-11-21 16:55:20   INFO  epoch: 4/30, acc_iter=31348, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:00, time_cost(all): 8:30:49/1 day, 22:40:10, loss=0.549553134869851, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=1.1251393297493717, lr=0.000925673060680769
2023-11-21 16:56:09   INFO  epoch: 4/30, acc_iter=31398, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:06, time_cost(all): 8:31:38/1 day, 22:32:52, loss=0.549470020883542, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=1.553312151340014, lr=0.000925352326498844
2023-11-21 16:56:58   INFO  epoch: 4/30, acc_iter=31448, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:30, time_cost(all): 8:32:27/1 day, 22:57:17, loss=0.549386906897233, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.15(1.03), norm=4.960989780447194, lr=0.00092503159231692
2023-11-21 16:57:47   INFO  epoch: 4/30, acc_iter=31498, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:24, time_cost(all): 8:33:16/1 day, 19:36:10, loss=0.549303792910924, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=3.9182793969507808, lr=0.000924710858134995
2023-11-21 16:58:36   INFO  epoch: 4/30, acc_iter=31548, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:04, time_cost(all): 8:34:05/1 day, 20:39:12, loss=0.549220678924615, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=4.784438177326529, lr=0.00092439012395307
2023-11-21 16:59:25   INFO  epoch: 4/30, acc_iter=31598, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:01, time_cost(all): 8:34:54/1 day, 23:26:46, loss=0.549137564938306, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=2.93647129785806, lr=0.000924069389771145
2023-11-21 17:00:14   INFO  epoch: 4/30, acc_iter=31648, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:10, time_cost(all): 8:35:43/1 day, 21:16:46, loss=0.549054450951997, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=4.620222819446705, lr=0.000923748655589221
2023-11-21 17:01:03   INFO  epoch: 4/30, acc_iter=31698, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:28, time_cost(all): 8:36:32/1 day, 19:44:15, loss=0.548971336965688, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=0.5945312273907404, lr=0.000923427921407296
2023-11-21 17:01:53   INFO  epoch: 4/30, acc_iter=31748, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:45, time_cost(all): 8:37:22/1 day, 19:16:29, loss=0.548888222979379, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=4.702512478242466, lr=0.000923107187225371
2023-11-21 17:02:42   INFO  epoch: 4/30, acc_iter=31798, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:22, time_cost(all): 8:38:11/1 day, 22:39:14, loss=0.54880510899307, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=3.080865336766192, lr=0.000922786453043447
2023-11-21 17:03:31   INFO  epoch: 4/30, acc_iter=31848, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:28, time_cost(all): 8:39:00/1 day, 22:46:52, loss=0.548721995006761, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=3.69243565963613, lr=0.000922465718861522
2023-11-21 17:04:20   INFO  epoch: 4/30, acc_iter=31898, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:22, time_cost(all): 8:39:49/1 day, 20:56:07, loss=0.548638881020451, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=0.625180899515809, lr=0.000922144984679597
2023-11-21 17:05:09   INFO  epoch: 4/30, acc_iter=31948, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:40, time_cost(all): 8:40:38/1 day, 19:07:05, loss=0.548555767034142, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=3.099540198478245, lr=0.000921824250497673
2023-11-21 17:05:58   INFO  epoch: 4/30, acc_iter=31998, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:00, time_cost(all): 8:41:27/1 day, 21:45:25, loss=0.548472653047833, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=4.674252603923034, lr=0.000921503516315748
2023-11-21 17:06:47   INFO  epoch: 4/30, acc_iter=32048, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:02, time_cost(all): 8:42:16/1 day, 22:02:17, loss=0.548389539061524, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=2.1178303121096977, lr=0.000921182782133823
2023-11-21 17:07:36   INFO  epoch: 4/30, acc_iter=32098, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:01, time_cost(all): 8:43:05/1 day, 21:21:20, loss=0.548306425075215, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=1.1947811565767386, lr=0.000920862047951898
2023-11-21 17:08:26   INFO  epoch: 4/30, acc_iter=32148, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:14, time_cost(all): 8:43:55/1 day, 21:59:13, loss=0.548223311088906, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=2.512450019929727, lr=0.000920541313769974
2023-11-21 17:09:15   INFO  epoch: 4/30, acc_iter=32198, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:17, time_cost(all): 8:44:44/1 day, 20:40:27, loss=0.548140197102597, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=4.603481050779229, lr=0.000920220579588049
2023-11-21 17:10:04   INFO  epoch: 4/30, acc_iter=32248, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:52, time_cost(all): 8:45:33/1 day, 22:44:03, loss=0.548057083116288, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=2.907776664759131, lr=0.000919899845406124
2023-11-21 17:10:53   INFO  epoch: 4/30, acc_iter=32298, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:09:58, time_cost(all): 8:46:22/1 day, 21:36:48, loss=0.547973969129979, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=3.665893181422377, lr=0.0009195791112242
2023-11-21 17:11:42   INFO  epoch: 4/30, acc_iter=32348, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:27, time_cost(all): 8:47:11/1 day, 23:10:16, loss=0.54789085514367, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=0.5735315038383215, lr=0.000919258377042275
2023-11-21 17:12:31   INFO  epoch: 4/30, acc_iter=32398, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:42, time_cost(all): 8:48:00/1 day, 20:42:40, loss=0.547807741157361, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=2.4727455749809786, lr=0.00091893764286035
2023-11-21 17:13:20   INFO  epoch: 4/30, acc_iter=32448, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:40, time_cost(all): 8:48:49/1 day, 23:16:44, loss=0.547724627171052, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=1.8648010444658136, lr=0.000918616908678426
2023-11-21 17:14:09   INFO  epoch: 4/30, acc_iter=32498, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:01, time_cost(all): 8:49:38/1 day, 21:25:46, loss=0.547641513184743, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=4.267115958300568, lr=0.000918296174496501
2023-11-21 17:14:58   INFO  epoch: 4/30, acc_iter=32548, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:04, time_cost(all): 8:50:27/1 day, 19:53:43, loss=0.547558399198434, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=3.357107541137235, lr=0.000917975440314576
2023-11-21 17:15:48   INFO  epoch: 4/30, acc_iter=32598, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:46, time_cost(all): 8:51:17/1 day, 20:23:44, loss=0.547475285212125, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=3.1717939003446407, lr=0.000917654706132651
2023-11-21 17:16:37   INFO  epoch: 4/30, acc_iter=32648, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:49, time_cost(all): 8:52:06/1 day, 22:57:30, loss=0.547392171225816, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.12(1.03), norm=0.6177249680280503, lr=0.000917333971950727
2023-11-21 17:17:26   INFO  epoch: 4/30, acc_iter=32698, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:04:03, time_cost(all): 8:52:55/1 day, 20:23:21, loss=0.547309057239507, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=1.3761594484517654, lr=0.000917013237768802
2023-11-21 17:18:15   INFO  epoch: 4/30, acc_iter=32748, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:05, time_cost(all): 8:53:44/1 day, 19:02:48, loss=0.547225943253197, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=2.9608314858800697, lr=0.000916692503586877
2023-11-21 17:19:04   INFO  epoch: 4/30, acc_iter=32798, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:17, time_cost(all): 8:54:33/1 day, 21:46:19, loss=0.547142829266888, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=2.837748777613677, lr=0.000916371769404953
2023-11-21 17:19:53   INFO  epoch: 4/30, acc_iter=32848, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:26, time_cost(all): 8:55:22/1 day, 19:38:35, loss=0.547059715280579, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=3.1273151395299896, lr=0.000916051035223028
2023-11-21 17:20:42   INFO  epoch: 4/30, acc_iter=32898, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:34, time_cost(all): 8:56:11/1 day, 21:59:55, loss=0.54697660129427, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=1.8687443460751347, lr=0.000915730301041103
2023-11-21 17:21:31   INFO  epoch: 5/30, acc_iter=32985, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:41:52, time_cost(all): 8:57:00/1 day, 19:55:15, loss=0.546831982958092, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=1.6312447434000248, lr=0.000915172223564554
2023-11-21 17:22:21   INFO  epoch: 5/30, acc_iter=33035, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:46:07, time_cost(all): 8:57:50/1 day, 19:02:00, loss=0.546748868971783, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=0.59601449634476, lr=0.00091485148938263
2023-11-21 17:23:10   INFO  epoch: 5/30, acc_iter=33085, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:46:29, time_cost(all): 8:58:39/1 day, 23:01:36, loss=0.546665754985474, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=2.1547810213105336, lr=0.000914530755200705
2023-11-21 17:23:59   INFO  epoch: 5/30, acc_iter=33135, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:47:09, time_cost(all): 8:59:28/1 day, 19:50:28, loss=0.546582640999165, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=0.7487489597582027, lr=0.00091421002101878
2023-11-21 17:24:48   INFO  epoch: 5/30, acc_iter=33185, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:43:04, time_cost(all): 9:00:17/1 day, 21:54:48, loss=0.546499527012856, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=2.598351897208471, lr=0.000913889286836855
2023-11-21 17:25:37   INFO  epoch: 5/30, acc_iter=33235, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:41:36, time_cost(all): 9:01:06/1 day, 22:17:48, loss=0.546416413026547, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=2.8163583356308184, lr=0.000913568552654931
2023-11-21 17:26:26   INFO  epoch: 5/30, acc_iter=33285, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:42:32, time_cost(all): 9:01:55/1 day, 19:24:24, loss=0.546333299040238, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=4.827676607732273, lr=0.000913247818473006
2023-11-21 17:27:15   INFO  epoch: 5/30, acc_iter=33335, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:45:57, time_cost(all): 9:02:44/1 day, 20:34:49, loss=0.546250185053929, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=3.400170725696225, lr=0.000912927084291081
2023-11-21 17:28:04   INFO  epoch: 5/30, acc_iter=33385, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:44:39, time_cost(all): 9:03:33/1 day, 21:09:23, loss=0.54616707106762, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=4.192890247336729, lr=0.000912606350109157
2023-11-21 17:28:53   INFO  epoch: 5/30, acc_iter=33435, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:43:43, time_cost(all): 9:04:22/1 day, 21:29:43, loss=0.546083957081311, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=0.5782145214648609, lr=0.000912285615927232
2023-11-21 17:29:43   INFO  epoch: 5/30, acc_iter=33485, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:42:46, time_cost(all): 9:05:12/1 day, 19:39:34, loss=0.546000843095002, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=3.8872826531376927, lr=0.000911964881745307
2023-11-21 17:30:32   INFO  epoch: 5/30, acc_iter=33535, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:35:23, time_cost(all): 9:06:01/1 day, 22:44:16, loss=0.545917729108693, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=1.6835111985037903, lr=0.000911644147563382
2023-11-21 17:31:21   INFO  epoch: 5/30, acc_iter=33585, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:38:43, time_cost(all): 9:06:50/1 day, 20:25:25, loss=0.545834615122384, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=2.4791552740946403, lr=0.000911323413381458
2023-11-21 17:32:10   INFO  epoch: 5/30, acc_iter=33635, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:32:32, time_cost(all): 9:07:39/1 day, 18:40:05, loss=0.545751501136075, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=3.780538725617074, lr=0.000911002679199533
2023-11-21 17:32:59   INFO  epoch: 5/30, acc_iter=33685, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:31:13, time_cost(all): 9:08:28/1 day, 19:45:40, loss=0.545668387149766, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=3.6825061849988865, lr=0.000910681945017608
2023-11-21 17:33:48   INFO  epoch: 5/30, acc_iter=33735, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:34:39, time_cost(all): 9:09:17/1 day, 21:32:23, loss=0.545585273163457, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=4.15760806476248, lr=0.000910361210835684
2023-11-21 17:34:37   INFO  epoch: 5/30, acc_iter=33785, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:36:34, time_cost(all): 9:10:06/1 day, 18:44:17, loss=0.545502159177148, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=1.9152180670971322, lr=0.000910040476653759
2023-11-21 17:35:26   INFO  epoch: 5/30, acc_iter=33835, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:31:11, time_cost(all): 9:10:55/1 day, 18:41:18, loss=0.545419045190838, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=2.135961131264449, lr=0.000909719742471834
2023-11-21 17:36:15   INFO  epoch: 5/30, acc_iter=33885, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:36:45, time_cost(all): 9:11:44/1 day, 22:41:18, loss=0.545335931204529, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=0.9146989775035544, lr=0.00090939900828991
2023-11-21 17:37:05   INFO  epoch: 5/30, acc_iter=33935, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:33:28, time_cost(all): 9:12:34/1 day, 19:12:17, loss=0.54525281721822, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=0.5259888796214155, lr=0.000909078274107985
2023-11-21 17:37:54   INFO  epoch: 5/30, acc_iter=33985, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:31:37, time_cost(all): 9:13:23/1 day, 22:00:04, loss=0.545169703231911, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=1.4963203764795687, lr=0.00090875753992606
2023-11-21 17:38:43   INFO  epoch: 5/30, acc_iter=34035, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:30:17, time_cost(all): 9:14:12/1 day, 20:55:10, loss=0.545086589245602, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=2.7958318436609035, lr=0.000908436805744135
2023-11-21 17:39:32   INFO  epoch: 5/30, acc_iter=34085, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:33:04, time_cost(all): 9:15:01/1 day, 21:10:40, loss=0.545003475259293, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=0.7710972112766377, lr=0.000908116071562211
2023-11-21 17:40:21   INFO  epoch: 5/30, acc_iter=34135, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:25:46, time_cost(all): 9:15:50/1 day, 19:45:23, loss=0.544920361272984, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=0.5654702329837326, lr=0.000907795337380286
2023-11-21 17:41:10   INFO  epoch: 5/30, acc_iter=34185, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:28:04, time_cost(all): 9:16:39/1 day, 19:30:08, loss=0.544837247286675, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=4.84092755891733, lr=0.000907474603198361
2023-11-21 17:41:59   INFO  epoch: 5/30, acc_iter=34235, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:23:42, time_cost(all): 9:17:28/1 day, 18:35:13, loss=0.544754133300366, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=0.7919370278307624, lr=0.000907153869016437
2023-11-21 17:42:48   INFO  epoch: 5/30, acc_iter=34285, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:26:53, time_cost(all): 9:18:17/1 day, 19:46:17, loss=0.544671019314057, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=4.134569742566752, lr=0.000906833134834512
2023-11-21 17:43:38   INFO  epoch: 5/30, acc_iter=34335, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:24:31, time_cost(all): 9:19:07/1 day, 19:51:14, loss=0.544587905327748, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=2.1196068528784062, lr=0.000906512400652587
2023-11-21 17:44:27   INFO  epoch: 5/30, acc_iter=34385, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:24:55, time_cost(all): 9:19:56/1 day, 19:59:18, loss=0.544504791341439, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=0.9612829588133117, lr=0.000906191666470662
2023-11-21 17:45:16   INFO  epoch: 5/30, acc_iter=34435, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:24:34, time_cost(all): 9:20:45/1 day, 21:46:44, loss=0.54442167735513, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=1.5399803753074832, lr=0.000905870932288738
2023-11-21 17:46:05   INFO  epoch: 5/30, acc_iter=34485, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:21:15, time_cost(all): 9:21:34/1 day, 20:34:19, loss=0.544338563368821, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=3.124805045813724, lr=0.000905550198106813
2023-11-21 17:46:54   INFO  epoch: 5/30, acc_iter=34535, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:19:12, time_cost(all): 9:22:23/1 day, 19:58:36, loss=0.544255449382511, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=2.973647661763421, lr=0.000905229463924888
2023-11-21 17:47:43   INFO  epoch: 5/30, acc_iter=34585, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:22:40, time_cost(all): 9:23:12/1 day, 21:38:47, loss=0.544172335396202, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=3.7033378667752492, lr=0.000904908729742964
2023-11-21 17:48:32   INFO  epoch: 5/30, acc_iter=34635, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:18:26, time_cost(all): 9:24:01/1 day, 19:28:22, loss=0.544089221409893, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=4.936657739502246, lr=0.000904587995561039
2023-11-21 17:49:21   INFO  epoch: 5/30, acc_iter=34685, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:21:27, time_cost(all): 9:24:50/1 day, 18:52:31, loss=0.544006107423584, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=3.5135798429031184, lr=0.000904267261379114
2023-11-21 17:50:10   INFO  epoch: 5/30, acc_iter=34735, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:22:14, time_cost(all): 9:25:39/1 day, 20:34:54, loss=0.543922993437275, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=3.885196394480061, lr=0.00090394652719719
2023-11-21 17:51:00   INFO  epoch: 5/30, acc_iter=34785, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:14:45, time_cost(all): 9:26:29/1 day, 20:14:27, loss=0.543839879450966, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=2.070095174544245, lr=0.000903625793015265
2023-11-21 17:51:49   INFO  epoch: 5/30, acc_iter=34835, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:20:20, time_cost(all): 9:27:18/1 day, 18:16:58, loss=0.543756765464657, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=4.558593593876886, lr=0.00090330505883334
2023-11-21 17:52:38   INFO  epoch: 5/30, acc_iter=34885, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:16, time_cost(all): 9:28:07/1 day, 19:01:26, loss=0.543673651478348, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=0.6938664736222697, lr=0.000902984324651415
2023-11-21 17:53:27   INFO  epoch: 5/30, acc_iter=34935, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:12:29, time_cost(all): 9:28:56/1 day, 21:08:32, loss=0.543590537492039, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=4.135551082655767, lr=0.000902663590469491
2023-11-21 17:54:16   INFO  epoch: 5/30, acc_iter=34985, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:11:51, time_cost(all): 9:29:45/1 day, 19:48:16, loss=0.54350742350573, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=3.9436053353691385, lr=0.000902342856287566
2023-11-21 17:55:05   INFO  epoch: 5/30, acc_iter=35035, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:13:26, time_cost(all): 9:30:34/1 day, 19:56:24, loss=0.543424309519421, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=1.1524887765130014, lr=0.000902022122105641
2023-11-21 17:55:54   INFO  epoch: 5/30, acc_iter=35085, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:11:43, time_cost(all): 9:31:23/1 day, 19:52:13, loss=0.543341195533112, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=0.957884280845128, lr=0.000901701387923717
2023-11-21 17:56:43   INFO  epoch: 5/30, acc_iter=35135, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:13:55, time_cost(all): 9:32:12/1 day, 22:18:08, loss=0.543258081546803, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=2.48154255256945, lr=0.000901380653741792
2023-11-21 17:57:33   INFO  epoch: 5/30, acc_iter=35185, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:14:18, time_cost(all): 9:33:02/1 day, 19:01:18, loss=0.543174967560494, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=2.017051601111576, lr=0.000901059919559867
2023-11-21 17:58:22   INFO  epoch: 5/30, acc_iter=35235, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:11:57, time_cost(all): 9:33:51/1 day, 21:19:54, loss=0.543091853574185, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=2.4582937231500095, lr=0.000900739185377942
2023-11-21 17:59:11   INFO  epoch: 5/30, acc_iter=35285, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:11:06, time_cost(all): 9:34:40/1 day, 20:37:08, loss=0.543008739587876, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=2.4850586670571557, lr=0.000900418451196018
2023-11-21 18:00:00   INFO  epoch: 5/30, acc_iter=35335, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:05:59, time_cost(all): 9:35:29/1 day, 20:40:26, loss=0.542925625601567, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=4.2074139022654276, lr=0.000900097717014093
2023-11-21 18:00:49   INFO  epoch: 5/30, acc_iter=35385, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:22, time_cost(all): 9:36:18/1 day, 19:37:06, loss=0.542842511615257, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=2.10554265543187, lr=0.000899776982832168
2023-11-21 18:01:38   INFO  epoch: 5/30, acc_iter=35435, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:04:11, time_cost(all): 9:37:07/1 day, 19:02:39, loss=0.542759397628948, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=2.6390867588304885, lr=0.000899456248650244
2023-11-21 18:02:27   INFO  epoch: 5/30, acc_iter=35485, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:03:18, time_cost(all): 9:37:56/1 day, 21:45:18, loss=0.542676283642639, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=4.5101486451873525, lr=0.000899135514468319
2023-11-21 18:03:16   INFO  epoch: 5/30, acc_iter=35535, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:06:00, time_cost(all): 9:38:45/1 day, 21:40:02, loss=0.54259316965633, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=3.597542193281021, lr=0.000898814780286394
2023-11-21 18:04:05   INFO  epoch: 5/30, acc_iter=35585, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:03:09, time_cost(all): 9:39:34/1 day, 18:47:23, loss=0.542510055670021, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=1.379195141188325, lr=0.00089849404610447
2023-11-21 18:04:55   INFO  epoch: 5/30, acc_iter=35635, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:01:00, time_cost(all): 9:40:24/1 day, 18:36:00, loss=0.542426941683712, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=1.3445619137559546, lr=0.000898173311922545
2023-11-21 18:05:44   INFO  epoch: 5/30, acc_iter=35685, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:47, time_cost(all): 9:41:13/1 day, 20:56:53, loss=0.542343827697403, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=3.1814224865200655, lr=0.00089785257774062
2023-11-21 18:06:33   INFO  epoch: 5/30, acc_iter=35735, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:00:44, time_cost(all): 9:42:02/1 day, 21:59:41, loss=0.542260713711094, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=4.637367998810526, lr=0.000897531843558695
2023-11-21 18:07:22   INFO  epoch: 5/30, acc_iter=35785, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:02:09, time_cost(all): 9:42:51/1 day, 20:25:56, loss=0.542177599724785, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.8536607078560499, lr=0.000897211109376771
2023-11-21 18:08:11   INFO  epoch: 5/30, acc_iter=35835, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:00:34, time_cost(all): 9:43:40/1 day, 20:32:43, loss=0.542094485738476, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=1.3243490350808451, lr=0.000896890375194846
2023-11-21 18:09:00   INFO  epoch: 5/30, acc_iter=35885, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:58:15, time_cost(all): 9:44:29/1 day, 20:12:15, loss=0.542011371752167, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=1.185685534371324, lr=0.000896569641012921
2023-11-21 18:09:49   INFO  epoch: 5/30, acc_iter=35935, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:56:56, time_cost(all): 9:45:18/1 day, 20:20:49, loss=0.541928257765858, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.9(1.03), norm=4.826318205340113, lr=0.000896248906830997
2023-11-21 18:10:38   INFO  epoch: 5/30, acc_iter=35985, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:56:23, time_cost(all): 9:46:07/1 day, 21:04:11, loss=0.541845143779549, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=3.510320104678738, lr=0.000895928172649072
2023-11-21 18:11:28   INFO  epoch: 5/30, acc_iter=36035, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:54:29, time_cost(all): 9:46:57/1 day, 21:48:58, loss=0.54176202979324, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=1.1728263954589226, lr=0.000895607438467147
2023-11-21 18:12:17   INFO  epoch: 5/30, acc_iter=36085, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:15, time_cost(all): 9:47:46/1 day, 20:34:01, loss=0.541678915806931, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=4.155929818188726, lr=0.000895286704285223
2023-11-21 18:13:06   INFO  epoch: 5/30, acc_iter=36135, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:57:54, time_cost(all): 9:48:35/1 day, 21:28:59, loss=0.541595801820622, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=3.2261437409046807, lr=0.000894965970103298
2023-11-21 18:13:55   INFO  epoch: 5/30, acc_iter=36185, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:52:51, time_cost(all): 9:49:24/1 day, 20:47:20, loss=0.541512687834312, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=3.0792761023046307, lr=0.000894645235921373
2023-11-21 18:14:44   INFO  epoch: 5/30, acc_iter=36235, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:09, time_cost(all): 9:50:13/1 day, 22:07:39, loss=0.541429573848003, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=2.5341709904471927, lr=0.000894324501739448
2023-11-21 18:15:33   INFO  epoch: 5/30, acc_iter=36285, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:52:45, time_cost(all): 9:51:02/1 day, 18:29:25, loss=0.541346459861694, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=3.0479636409503916, lr=0.000894003767557524
2023-11-21 18:16:22   INFO  epoch: 5/30, acc_iter=36335, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:53:27, time_cost(all): 9:51:51/1 day, 20:53:11, loss=0.541263345875385, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=4.528979041646255, lr=0.000893683033375599
2023-11-21 18:17:11   INFO  epoch: 5/30, acc_iter=36385, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:26, time_cost(all): 9:52:40/1 day, 20:29:23, loss=0.541180231889076, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=1.3211523295269432, lr=0.000893362299193674
2023-11-21 18:18:00   INFO  epoch: 5/30, acc_iter=36435, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:51:22, time_cost(all): 9:53:29/1 day, 22:00:38, loss=0.541097117902767, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=3.9380788098600314, lr=0.00089304156501175
2023-11-21 18:18:50   INFO  epoch: 5/30, acc_iter=36485, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:47:32, time_cost(all): 9:54:19/1 day, 21:22:56, loss=0.541014003916458, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=1.9513389986542373, lr=0.000892720830829825
2023-11-21 18:19:39   INFO  epoch: 5/30, acc_iter=36535, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:48:46, time_cost(all): 9:55:08/1 day, 19:01:57, loss=0.540930889930149, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=2.0611129503967804, lr=0.0008924000966479
2023-11-21 18:20:28   INFO  epoch: 5/30, acc_iter=36585, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:49:31, time_cost(all): 9:55:57/1 day, 20:11:58, loss=0.54084777594384, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=4.578892762527888, lr=0.000892079362465975
2023-11-21 18:21:17   INFO  epoch: 5/30, acc_iter=36635, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:46:04, time_cost(all): 9:56:46/1 day, 19:41:57, loss=0.540764661957531, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=2.366410389087857, lr=0.000891758628284051
2023-11-21 18:22:06   INFO  epoch: 5/30, acc_iter=36685, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:58, time_cost(all): 9:57:35/1 day, 19:59:47, loss=0.540681547971222, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=4.305990088989937, lr=0.000891437894102126
2023-11-21 18:22:55   INFO  epoch: 5/30, acc_iter=36735, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:43:49, time_cost(all): 9:58:24/1 day, 20:15:10, loss=0.540598433984913, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=2.473644238199982, lr=0.000891117159920201
2023-11-21 18:23:44   INFO  epoch: 5/30, acc_iter=36785, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:46:40, time_cost(all): 9:59:13/1 day, 18:46:02, loss=0.540515319998604, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=4.564700421962949, lr=0.000890796425738277
2023-11-21 18:24:33   INFO  epoch: 5/30, acc_iter=36835, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:32, time_cost(all): 10:00:02/1 day, 21:22:27, loss=0.540432206012295, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=0.7542180088086046, lr=0.000890475691556352
2023-11-21 18:25:23   INFO  epoch: 5/30, acc_iter=36885, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:40, time_cost(all): 10:00:52/1 day, 20:14:01, loss=0.540349092025986, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=4.366394136405959, lr=0.000890154957374427
2023-11-21 18:26:12   INFO  epoch: 5/30, acc_iter=36935, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:44:08, time_cost(all): 10:01:41/1 day, 18:19:49, loss=0.540265978039676, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=4.2350053763049385, lr=0.000889834223192503
2023-11-21 18:27:01   INFO  epoch: 5/30, acc_iter=36985, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:39:36, time_cost(all): 10:02:30/1 day, 19:39:22, loss=0.540182864053367, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=2.5080078520308344, lr=0.000889513489010578
2023-11-21 18:27:50   INFO  epoch: 5/30, acc_iter=37035, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:42:37, time_cost(all): 10:03:19/1 day, 19:34:25, loss=0.540099750067058, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=2.461561182524479, lr=0.000889192754828653
2023-11-21 18:28:39   INFO  epoch: 5/30, acc_iter=37085, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:40:49, time_cost(all): 10:04:08/1 day, 21:20:10, loss=0.540016636080749, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=1.9477847503434849, lr=0.000888872020646728
2023-11-21 18:29:28   INFO  epoch: 5/30, acc_iter=37135, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:59, time_cost(all): 10:04:57/1 day, 19:47:28, loss=0.53993352209444, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=2.562837419830156, lr=0.000888551286464804
2023-11-21 18:30:17   INFO  epoch: 5/30, acc_iter=37185, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:37:30, time_cost(all): 10:05:46/1 day, 18:05:44, loss=0.539850408108131, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=3.3288912467006035, lr=0.000888230552282879
2023-11-21 18:31:06   INFO  epoch: 5/30, acc_iter=37235, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:25, time_cost(all): 10:06:35/1 day, 21:51:40, loss=0.539767294121822, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=3.1310540818812793, lr=0.000887909818100954
2023-11-21 18:31:55   INFO  epoch: 5/30, acc_iter=37285, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:33, time_cost(all): 10:07:24/1 day, 19:38:16, loss=0.539684180135513, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.667387220082122, lr=0.00088758908391903
2023-11-21 18:32:45   INFO  epoch: 5/30, acc_iter=37335, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:36:16, time_cost(all): 10:08:14/1 day, 19:14:37, loss=0.539601066149204, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=4.074427859591563, lr=0.000887268349737105
2023-11-21 18:33:34   INFO  epoch: 5/30, acc_iter=37385, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:14, time_cost(all): 10:09:03/1 day, 19:49:09, loss=0.539517952162895, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=3.0387050484842897, lr=0.00088694761555518
2023-11-21 18:34:23   INFO  epoch: 5/30, acc_iter=37435, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:35:42, time_cost(all): 10:09:52/1 day, 21:51:33, loss=0.539434838176586, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=1.2722459850887513, lr=0.000886626881373255
2023-11-21 18:35:12   INFO  epoch: 5/30, acc_iter=37485, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:46, time_cost(all): 10:10:41/1 day, 20:58:43, loss=0.539351724190277, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=2.5929446113071952, lr=0.000886306147191331
2023-11-21 18:36:01   INFO  epoch: 5/30, acc_iter=37535, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:26, time_cost(all): 10:11:30/1 day, 21:05:40, loss=0.539268610203968, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=0.778314196884752, lr=0.000885985413009406
2023-11-21 18:36:50   INFO  epoch: 5/30, acc_iter=37585, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:33:01, time_cost(all): 10:12:19/1 day, 19:37:46, loss=0.539185496217659, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.725950741782229, lr=0.000885664678827481
2023-11-21 18:37:39   INFO  epoch: 5/30, acc_iter=37635, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:32:18, time_cost(all): 10:13:08/1 day, 18:27:37, loss=0.53910238223135, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=0.7972188027104177, lr=0.000885343944645557
2023-11-21 18:38:28   INFO  epoch: 5/30, acc_iter=37685, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:30:10, time_cost(all): 10:13:57/1 day, 18:17:21, loss=0.539019268245041, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=1.1854057905857718, lr=0.000885023210463632
2023-11-21 18:39:18   INFO  epoch: 5/30, acc_iter=37735, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:30:12, time_cost(all): 10:14:47/1 day, 20:04:55, loss=0.538936154258732, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=4.255238619662801, lr=0.000884702476281707
2023-11-21 18:40:07   INFO  epoch: 5/30, acc_iter=37785, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:47, time_cost(all): 10:15:36/1 day, 21:46:33, loss=0.538853040272422, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=2.2563916208569332, lr=0.000884381742099783
2023-11-21 18:40:56   INFO  epoch: 5/30, acc_iter=37835, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:19, time_cost(all): 10:16:25/1 day, 18:59:28, loss=0.538769926286113, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=4.798461549546063, lr=0.000884061007917858
2023-11-21 18:41:45   INFO  epoch: 5/30, acc_iter=37885, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:25:44, time_cost(all): 10:17:14/1 day, 17:43:24, loss=0.538686812299804, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=3.9622974747412187, lr=0.000883740273735933
2023-11-21 18:42:34   INFO  epoch: 5/30, acc_iter=37935, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:15, time_cost(all): 10:18:03/1 day, 20:47:33, loss=0.538603698313495, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=0.5816547383076132, lr=0.000883419539554008
2023-11-21 18:43:23   INFO  epoch: 5/30, acc_iter=37985, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:26:19, time_cost(all): 10:18:52/1 day, 18:03:45, loss=0.538520584327186, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=1.9865804193127352, lr=0.000883098805372084
2023-11-21 18:44:12   INFO  epoch: 5/30, acc_iter=38035, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:57, time_cost(all): 10:19:41/1 day, 20:18:33, loss=0.538437470340877, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=2.658513006440979, lr=0.000882778071190159
2023-11-21 18:45:01   INFO  epoch: 5/30, acc_iter=38085, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:24:07, time_cost(all): 10:20:30/1 day, 17:47:29, loss=0.538354356354568, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.2724490044132093, lr=0.000882457337008234
2023-11-21 18:45:50   INFO  epoch: 5/30, acc_iter=38135, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:31, time_cost(all): 10:21:19/1 day, 18:09:24, loss=0.538271242368259, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=3.803087708836872, lr=0.00088213660282631
2023-11-21 18:46:40   INFO  epoch: 5/30, acc_iter=38185, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:05, time_cost(all): 10:22:09/1 day, 19:31:23, loss=0.53818812838195, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=3.6532453452744473, lr=0.000881815868644385
2023-11-21 18:47:29   INFO  epoch: 5/30, acc_iter=38235, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:03, time_cost(all): 10:22:58/1 day, 18:06:44, loss=0.538105014395641, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.476393990546638, lr=0.00088149513446246
2023-11-21 18:48:18   INFO  epoch: 5/30, acc_iter=38285, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:14, time_cost(all): 10:23:47/1 day, 21:20:50, loss=0.538021900409332, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=2.1184400589204513, lr=0.000881174400280535
2023-11-21 18:49:07   INFO  epoch: 5/30, acc_iter=38335, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:57, time_cost(all): 10:24:36/1 day, 21:19:05, loss=0.537938786423023, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=1.4833735776687078, lr=0.000880853666098611
2023-11-21 18:49:56   INFO  epoch: 5/30, acc_iter=38385, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:06, time_cost(all): 10:25:25/1 day, 19:43:40, loss=0.537855672436714, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=4.898986276282918, lr=0.000880532931916686
2023-11-21 18:50:45   INFO  epoch: 5/30, acc_iter=38435, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:21, time_cost(all): 10:26:14/1 day, 20:02:02, loss=0.537772558450405, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.2056356059930255, lr=0.000880212197734761
2023-11-21 18:51:34   INFO  epoch: 5/30, acc_iter=38485, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:43, time_cost(all): 10:27:03/1 day, 20:39:20, loss=0.537689444464096, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=2.3749260458009207, lr=0.000879891463552837
2023-11-21 18:52:23   INFO  epoch: 5/30, acc_iter=38535, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:44, time_cost(all): 10:27:52/1 day, 18:42:10, loss=0.537606330477786, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=2.720821937753599, lr=0.000879570729370912
2023-11-21 18:53:13   INFO  epoch: 5/30, acc_iter=38585, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:14:51, time_cost(all): 10:28:42/1 day, 20:16:01, loss=0.537523216491477, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=3.726314871360729, lr=0.000879249995188987
2023-11-21 18:54:02   INFO  epoch: 5/30, acc_iter=38635, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:03, time_cost(all): 10:29:31/1 day, 20:39:04, loss=0.537440102505168, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=2.385913494978026, lr=0.000878929261007063
2023-11-21 18:54:51   INFO  epoch: 5/30, acc_iter=38685, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:39, time_cost(all): 10:30:20/1 day, 17:53:26, loss=0.537356988518859, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=2.4705119297679454, lr=0.000878608526825138
2023-11-21 18:55:40   INFO  epoch: 5/30, acc_iter=38735, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:24, time_cost(all): 10:31:09/1 day, 20:11:07, loss=0.53727387453255, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=3.771170151477946, lr=0.000878287792643213
2023-11-21 18:56:29   INFO  epoch: 5/30, acc_iter=38785, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:01, time_cost(all): 10:31:58/1 day, 19:02:25, loss=0.537190760546241, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=4.567831746749401, lr=0.000877967058461288
2023-11-21 18:57:18   INFO  epoch: 5/30, acc_iter=38835, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:07, time_cost(all): 10:32:47/1 day, 18:08:53, loss=0.537107646559932, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=3.059973868928102, lr=0.000877646324279364
2023-11-21 18:58:07   INFO  epoch: 5/30, acc_iter=38885, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:20, time_cost(all): 10:33:36/1 day, 20:20:41, loss=0.537024532573623, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=2.2463297320199644, lr=0.000877325590097439
2023-11-21 18:58:56   INFO  epoch: 5/30, acc_iter=38935, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:10:02, time_cost(all): 10:34:25/1 day, 21:25:58, loss=0.536941418587314, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=4.640004031820501, lr=0.000877004855915514
2023-11-21 18:59:45   INFO  epoch: 5/30, acc_iter=38985, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:35, time_cost(all): 10:35:14/1 day, 19:21:22, loss=0.536858304601005, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=4.46115271766722, lr=0.00087668412173359
2023-11-21 19:00:35   INFO  epoch: 5/30, acc_iter=39035, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:01, time_cost(all): 10:36:04/1 day, 18:42:55, loss=0.536775190614696, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=1.0156485186608304, lr=0.000876363387551665
2023-11-21 19:01:24   INFO  epoch: 5/30, acc_iter=39085, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:06:58, time_cost(all): 10:36:53/1 day, 20:35:06, loss=0.536692076628387, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=2.840339632256785, lr=0.00087604265336974
2023-11-21 19:02:13   INFO  epoch: 5/30, acc_iter=39135, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:32, time_cost(all): 10:37:42/1 day, 18:40:40, loss=0.536608962642078, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=2.669523195427849, lr=0.000875721919187816
2023-11-21 19:03:02   INFO  epoch: 5/30, acc_iter=39185, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:35, time_cost(all): 10:38:31/1 day, 20:22:32, loss=0.536525848655769, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=3.63710117495443, lr=0.000875401185005891
2023-11-21 19:03:51   INFO  epoch: 5/30, acc_iter=39235, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:44, time_cost(all): 10:39:20/1 day, 19:01:57, loss=0.53644273466946, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=2.296880396520767, lr=0.000875080450823966
2023-11-21 19:04:40   INFO  epoch: 5/30, acc_iter=39285, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:52, time_cost(all): 10:40:09/1 day, 19:00:08, loss=0.536359620683151, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=3.7747513323216815, lr=0.000874759716642041
2023-11-21 19:05:29   INFO  epoch: 5/30, acc_iter=39335, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:55, time_cost(all): 10:40:58/1 day, 19:01:37, loss=0.536276506696841, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=1.5857340521492915, lr=0.000874438982460117
2023-11-21 19:06:18   INFO  epoch: 5/30, acc_iter=39385, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:15, time_cost(all): 10:41:47/1 day, 17:47:06, loss=0.536193392710532, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=1.3440471459230041, lr=0.000874118248278192
2023-11-21 19:07:08   INFO  epoch: 5/30, acc_iter=39435, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:23, time_cost(all): 10:42:37/1 day, 20:43:35, loss=0.536110278724223, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=3.6907126078458603, lr=0.000873797514096267
2023-11-21 19:07:57   INFO  epoch: 5/30, acc_iter=39485, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 10:43:26/1 day, 21:01:53, loss=0.536027164737914, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.130463320023549, lr=0.000873476779914343
2023-11-21 19:08:46   INFO  epoch: 6/30, acc_iter=39572, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:50:18, time_cost(all): 10:44:15/1 day, 21:06:47, loss=0.535882546401736, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=2.3467661946417664, lr=0.000872918702437794
2023-11-21 19:09:35   INFO  epoch: 6/30, acc_iter=39622, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:48:55, time_cost(all): 10:45:04/1 day, 21:15:24, loss=0.535799432415427, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=0.8738735111663605, lr=0.000872597968255869
2023-11-21 19:10:24   INFO  epoch: 6/30, acc_iter=39672, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:46:21, time_cost(all): 10:45:53/1 day, 17:31:54, loss=0.535716318429118, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=1.7481592660805667, lr=0.000872277234073944
2023-11-21 19:11:13   INFO  epoch: 6/30, acc_iter=39722, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:47:59, time_cost(all): 10:46:42/1 day, 20:00:42, loss=0.535633204442809, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=0.8950654595816634, lr=0.00087195649989202
2023-11-21 19:12:02   INFO  epoch: 6/30, acc_iter=39772, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:45:16, time_cost(all): 10:47:31/1 day, 19:16:20, loss=0.5355500904565, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=4.405565742753915, lr=0.000871635765710095
2023-11-21 19:12:51   INFO  epoch: 6/30, acc_iter=39822, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:43:27, time_cost(all): 10:48:20/1 day, 17:05:40, loss=0.535466976470191, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=1.7191085491453038, lr=0.00087131503152817
2023-11-21 19:13:40   INFO  epoch: 6/30, acc_iter=39872, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:38:44, time_cost(all): 10:49:09/1 day, 17:19:22, loss=0.535383862483882, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=3.6634915427054, lr=0.000870994297346245
2023-11-21 19:14:30   INFO  epoch: 6/30, acc_iter=39922, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:39:41, time_cost(all): 10:49:59/1 day, 20:56:19, loss=0.535300748497573, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=4.136925620792939, lr=0.000870673563164321
2023-11-21 19:15:19   INFO  epoch: 6/30, acc_iter=39972, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:39:39, time_cost(all): 10:50:48/1 day, 20:49:47, loss=0.535217634511264, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=2.7399017655440834, lr=0.000870352828982396
2023-11-21 19:16:08   INFO  epoch: 6/30, acc_iter=40022, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:41:22, time_cost(all): 10:51:37/1 day, 20:49:24, loss=0.535134520524955, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=1.4790679024546654, lr=0.000870032094800471
2023-11-21 19:16:57   INFO  epoch: 6/30, acc_iter=40072, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:37:49, time_cost(all): 10:52:26/1 day, 19:56:18, loss=0.535051406538646, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=2.0402768206084043, lr=0.000869711360618547
2023-11-21 19:17:46   INFO  epoch: 6/30, acc_iter=40122, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:33:09, time_cost(all): 10:53:15/1 day, 19:47:16, loss=0.534968292552337, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=4.492557764707001, lr=0.000869390626436622
2023-11-21 19:18:35   INFO  epoch: 6/30, acc_iter=40172, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:38:04, time_cost(all): 10:54:04/1 day, 20:16:41, loss=0.534885178566028, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=3.03338916620447, lr=0.000869069892254697
2023-11-21 19:19:24   INFO  epoch: 6/30, acc_iter=40222, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:38:53, time_cost(all): 10:54:53/1 day, 18:23:11, loss=0.534802064579719, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=3.7573367166234033, lr=0.000868749158072772
2023-11-21 19:20:13   INFO  epoch: 6/30, acc_iter=40272, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:32:28, time_cost(all): 10:55:42/1 day, 20:02:29, loss=0.53471895059341, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=0.64235328739636, lr=0.000868428423890848
2023-11-21 19:21:02   INFO  epoch: 6/30, acc_iter=40322, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:37:11, time_cost(all): 10:56:31/1 day, 21:02:39, loss=0.534635836607101, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=4.73242745309761, lr=0.000868107689708923
2023-11-21 19:21:52   INFO  epoch: 6/30, acc_iter=40372, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:31:35, time_cost(all): 10:57:21/1 day, 19:42:31, loss=0.534552722620792, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=2.942951312344584, lr=0.000867786955526998
2023-11-21 19:22:41   INFO  epoch: 6/30, acc_iter=40422, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:34:41, time_cost(all): 10:58:10/1 day, 20:04:38, loss=0.534469608634482, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=4.770052296108892, lr=0.000867466221345074
2023-11-21 19:23:30   INFO  epoch: 6/30, acc_iter=40472, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:29:36, time_cost(all): 10:58:59/1 day, 21:00:11, loss=0.534386494648173, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=4.0472778049957, lr=0.000867145487163149
2023-11-21 19:24:19   INFO  epoch: 6/30, acc_iter=40522, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:30:58, time_cost(all): 10:59:48/1 day, 17:58:49, loss=0.534303380661864, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=0.9000821254278133, lr=0.000866824752981224
2023-11-21 19:25:08   INFO  epoch: 6/30, acc_iter=40572, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:32:43, time_cost(all): 11:00:37/1 day, 17:09:59, loss=0.534220266675555, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=3.8323614825538197, lr=0.0008665040187993
2023-11-21 19:25:57   INFO  epoch: 6/30, acc_iter=40622, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:30:23, time_cost(all): 11:01:26/1 day, 18:24:36, loss=0.534137152689246, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=2.7989261562866092, lr=0.000866183284617375
2023-11-21 19:26:46   INFO  epoch: 6/30, acc_iter=40672, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:29:15, time_cost(all): 11:02:15/1 day, 19:46:23, loss=0.534054038702937, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=2.2554045957275197, lr=0.00086586255043545
2023-11-21 19:27:35   INFO  epoch: 6/30, acc_iter=40722, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:26:07, time_cost(all): 11:03:04/1 day, 20:39:32, loss=0.533970924716628, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=3.773804771728006, lr=0.000865541816253525
2023-11-21 19:28:25   INFO  epoch: 6/30, acc_iter=40772, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:30:56, time_cost(all): 11:03:54/1 day, 16:55:15, loss=0.533887810730319, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=3.416370116300986, lr=0.000865221082071601
2023-11-21 19:29:14   INFO  epoch: 6/30, acc_iter=40822, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:29:30, time_cost(all): 11:04:43/1 day, 18:54:38, loss=0.53380469674401, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=0.5146532150355279, lr=0.000864900347889676
2023-11-21 19:30:03   INFO  epoch: 6/30, acc_iter=40872, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:24:03, time_cost(all): 11:05:32/1 day, 18:04:10, loss=0.533721582757701, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=3.7716651696352232, lr=0.000864579613707751
2023-11-21 19:30:52   INFO  epoch: 6/30, acc_iter=40922, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:22:18, time_cost(all): 11:06:21/1 day, 17:28:54, loss=0.533638468771392, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=3.496821759382111, lr=0.000864258879525827
2023-11-21 19:31:41   INFO  epoch: 6/30, acc_iter=40972, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:22:21, time_cost(all): 11:07:10/1 day, 17:51:48, loss=0.533555354785083, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=0.7821700594764588, lr=0.000863938145343902
2023-11-21 19:32:30   INFO  epoch: 6/30, acc_iter=41022, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:25:25, time_cost(all): 11:07:59/1 day, 19:27:24, loss=0.533472240798774, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=4.215369995485218, lr=0.000863617411161977
2023-11-21 19:33:19   INFO  epoch: 6/30, acc_iter=41072, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:24:07, time_cost(all): 11:08:48/1 day, 16:40:56, loss=0.533389126812465, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=2.0852212636117096, lr=0.000863296676980052
2023-11-21 19:34:08   INFO  epoch: 6/30, acc_iter=41122, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:25:32, time_cost(all): 11:09:37/1 day, 20:48:18, loss=0.533306012826156, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=1.8777143826955602, lr=0.000862975942798128
2023-11-21 19:34:57   INFO  epoch: 6/30, acc_iter=41172, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:23:46, time_cost(all): 11:10:26/1 day, 17:22:27, loss=0.533222898839847, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=2.1589805524330554, lr=0.000862655208616203
2023-11-21 19:35:47   INFO  epoch: 6/30, acc_iter=41222, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:18:03, time_cost(all): 11:11:16/1 day, 20:43:28, loss=0.533139784853537, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=4.276437860651916, lr=0.000862334474434278
2023-11-21 19:36:36   INFO  epoch: 6/30, acc_iter=41272, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:17:14, time_cost(all): 11:12:05/1 day, 19:14:21, loss=0.533056670867228, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=3.557895886988173, lr=0.000862013740252354
2023-11-21 19:37:25   INFO  epoch: 6/30, acc_iter=41322, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:21:28, time_cost(all): 11:12:54/1 day, 20:26:52, loss=0.532973556880919, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=1.198474315594086, lr=0.000861693006070429
2023-11-21 19:38:14   INFO  epoch: 6/30, acc_iter=41372, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:14:53, time_cost(all): 11:13:43/1 day, 17:43:33, loss=0.53289044289461, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=3.331897392806455, lr=0.000861372271888504
2023-11-21 19:39:03   INFO  epoch: 6/30, acc_iter=41422, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:17:42, time_cost(all): 11:14:32/1 day, 20:33:03, loss=0.532807328908301, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=3.481141929970033, lr=0.00086105153770658
2023-11-21 19:39:52   INFO  epoch: 6/30, acc_iter=41472, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:13:59, time_cost(all): 11:15:21/1 day, 18:16:56, loss=0.532724214921992, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=1.1977028912334982, lr=0.000860730803524655
2023-11-21 19:40:41   INFO  epoch: 6/30, acc_iter=41522, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:18:02, time_cost(all): 11:16:10/1 day, 17:13:40, loss=0.532641100935683, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=1.4174138533223581, lr=0.00086041006934273
2023-11-21 19:41:30   INFO  epoch: 6/30, acc_iter=41572, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:13:35, time_cost(all): 11:16:59/1 day, 20:09:36, loss=0.532557986949374, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=2.977601316061525, lr=0.000860089335160805
2023-11-21 19:42:20   INFO  epoch: 6/30, acc_iter=41622, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:15:28, time_cost(all): 11:17:49/1 day, 17:56:24, loss=0.532474872963065, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=3.113091611115832, lr=0.000859768600978881
2023-11-21 19:43:09   INFO  epoch: 6/30, acc_iter=41672, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:10:57, time_cost(all): 11:18:38/1 day, 19:39:49, loss=0.532391758976756, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=2.844230321403309, lr=0.000859447866796956
2023-11-21 19:43:58   INFO  epoch: 6/30, acc_iter=41722, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:14:18, time_cost(all): 11:19:27/1 day, 18:55:39, loss=0.532308644990447, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=1.5224604954256122, lr=0.000859127132615031
2023-11-21 19:44:47   INFO  epoch: 6/30, acc_iter=41772, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:12:20, time_cost(all): 11:20:16/1 day, 17:15:56, loss=0.532225531004138, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=2.944194915301055, lr=0.000858806398433107
2023-11-21 19:45:36   INFO  epoch: 6/30, acc_iter=41822, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:07:41, time_cost(all): 11:21:05/1 day, 18:18:15, loss=0.532142417017829, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.88(1.03), norm=1.545918240485308, lr=0.000858485664251182
2023-11-21 19:46:25   INFO  epoch: 6/30, acc_iter=41872, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:09:07, time_cost(all): 11:21:54/1 day, 16:58:00, loss=0.53205930303152, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=0.7917117770425681, lr=0.000858164930069257
2023-11-21 19:47:14   INFO  epoch: 6/30, acc_iter=41922, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:21, time_cost(all): 11:22:43/1 day, 19:30:03, loss=0.531976189045211, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=1.1060540082479566, lr=0.000857844195887333
2023-11-21 19:48:03   INFO  epoch: 6/30, acc_iter=41972, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:31, time_cost(all): 11:23:32/1 day, 17:18:23, loss=0.531893075058901, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=3.1229223398649304, lr=0.000857523461705408
2023-11-21 19:48:52   INFO  epoch: 6/30, acc_iter=42022, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:06:00, time_cost(all): 11:24:21/1 day, 16:49:06, loss=0.531809961072592, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=4.82537141641073, lr=0.000857202727523483
2023-11-21 19:49:42   INFO  epoch: 6/30, acc_iter=42072, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:07:42, time_cost(all): 11:25:11/1 day, 20:34:16, loss=0.531726847086283, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=4.99811928545604, lr=0.000856881993341558
2023-11-21 19:50:31   INFO  epoch: 6/30, acc_iter=42122, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:08:21, time_cost(all): 11:26:00/1 day, 19:37:20, loss=0.531643733099974, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=4.269429510106496, lr=0.000856561259159634
2023-11-21 19:51:20   INFO  epoch: 6/30, acc_iter=42172, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:04:50, time_cost(all): 11:26:49/1 day, 18:09:07, loss=0.531560619113665, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=0.8011099956305804, lr=0.000856240524977709
2023-11-21 19:52:09   INFO  epoch: 6/30, acc_iter=42222, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:00:56, time_cost(all): 11:27:38/1 day, 16:26:10, loss=0.531477505127356, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=1.5047892479961302, lr=0.000855919790795784
2023-11-21 19:52:58   INFO  epoch: 6/30, acc_iter=42272, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:51, time_cost(all): 11:28:27/1 day, 16:28:50, loss=0.531394391141047, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=0.5436979295705242, lr=0.00085559905661386
2023-11-21 19:53:47   INFO  epoch: 6/30, acc_iter=42322, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:01:31, time_cost(all): 11:29:16/1 day, 18:52:44, loss=0.531311277154738, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=1.9935710130598236, lr=0.000855278322431935
2023-11-21 19:54:36   INFO  epoch: 6/30, acc_iter=42372, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/0:58:11, time_cost(all): 11:30:05/1 day, 18:59:26, loss=0.531228163168429, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=3.0931950253605494, lr=0.00085495758825001
2023-11-21 19:55:25   INFO  epoch: 6/30, acc_iter=42422, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:58:03, time_cost(all): 11:30:54/1 day, 17:41:53, loss=0.53114504918212, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=4.817470788723884, lr=0.000854636854068086
2023-11-21 19:56:15   INFO  epoch: 6/30, acc_iter=42472, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:59:18, time_cost(all): 11:31:44/1 day, 20:11:36, loss=0.531061935195811, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=2.088687535265636, lr=0.000854316119886161
2023-11-21 19:57:04   INFO  epoch: 6/30, acc_iter=42522, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:58:11, time_cost(all): 11:32:33/1 day, 18:00:58, loss=0.530978821209502, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=4.166978544427645, lr=0.000853995385704236
2023-11-21 19:57:53   INFO  epoch: 6/30, acc_iter=42572, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:59:48, time_cost(all): 11:33:22/1 day, 16:54:47, loss=0.530895707223193, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=1.8173464001747242, lr=0.000853674651522311
2023-11-21 19:58:42   INFO  epoch: 6/30, acc_iter=42622, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:56:28, time_cost(all): 11:34:11/1 day, 17:02:01, loss=0.530812593236884, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=1.0531958281927265, lr=0.000853353917340387
2023-11-21 19:59:31   INFO  epoch: 6/30, acc_iter=42672, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:59:01, time_cost(all): 11:35:00/1 day, 18:29:37, loss=0.530729479250575, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=1.066851353508837, lr=0.000853033183158462
2023-11-21 20:00:20   INFO  epoch: 6/30, acc_iter=42722, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:55:13, time_cost(all): 11:35:49/1 day, 16:34:31, loss=0.530646365264266, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=4.096677432894502, lr=0.000852712448976537
2023-11-21 20:01:09   INFO  epoch: 6/30, acc_iter=42772, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:53:02, time_cost(all): 11:36:38/1 day, 17:06:20, loss=0.530563251277957, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.23(1.03), norm=2.7529172421442114, lr=0.000852391714794613
2023-11-21 20:01:58   INFO  epoch: 6/30, acc_iter=42822, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:37, time_cost(all): 11:37:27/1 day, 20:05:03, loss=0.530480137291647, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=3.0658397999448876, lr=0.000852070980612688
2023-11-21 20:02:47   INFO  epoch: 6/30, acc_iter=42872, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:53:38, time_cost(all): 11:38:16/1 day, 19:05:15, loss=0.530397023305338, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=1.0140918021250607, lr=0.000851750246430763
2023-11-21 20:03:37   INFO  epoch: 6/30, acc_iter=42922, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:51:41, time_cost(all): 11:39:06/1 day, 19:51:05, loss=0.530313909319029, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=2.847883974179412, lr=0.000851429512248839
2023-11-21 20:04:26   INFO  epoch: 6/30, acc_iter=42972, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:37, time_cost(all): 11:39:55/1 day, 19:24:32, loss=0.53023079533272, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=3.07264659668734, lr=0.000851108778066914
2023-11-21 20:05:15   INFO  epoch: 6/30, acc_iter=43022, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:55, time_cost(all): 11:40:44/1 day, 17:59:59, loss=0.530147681346411, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=1.20948834214451, lr=0.000850788043884989
2023-11-21 20:06:04   INFO  epoch: 6/30, acc_iter=43072, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:52:06, time_cost(all): 11:41:33/1 day, 17:32:59, loss=0.530064567360102, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.330497801526561, lr=0.000850467309703064
2023-11-21 20:06:53   INFO  epoch: 6/30, acc_iter=43122, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:51:10, time_cost(all): 11:42:22/1 day, 18:31:27, loss=0.529981453373793, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=3.415604034590408, lr=0.00085014657552114
2023-11-21 20:07:42   INFO  epoch: 6/30, acc_iter=43172, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:49:26, time_cost(all): 11:43:11/1 day, 17:13:48, loss=0.529898339387484, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=3.6899690864450427, lr=0.000849825841339215
2023-11-21 20:08:31   INFO  epoch: 6/30, acc_iter=43222, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:47, time_cost(all): 11:44:00/1 day, 16:34:29, loss=0.529815225401175, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=0.6969505159943945, lr=0.00084950510715729
2023-11-21 20:09:20   INFO  epoch: 6/30, acc_iter=43272, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:42, time_cost(all): 11:44:49/1 day, 19:15:20, loss=0.529732111414866, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=2.750096192434597, lr=0.000849184372975365
2023-11-21 20:10:10   INFO  epoch: 6/30, acc_iter=43322, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:47:02, time_cost(all): 11:45:39/1 day, 18:53:26, loss=0.529648997428557, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=2.353341101892068, lr=0.000848863638793441
2023-11-21 20:10:59   INFO  epoch: 6/30, acc_iter=43372, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:43:31, time_cost(all): 11:46:28/1 day, 18:39:20, loss=0.529565883442248, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=2.4186568445830203, lr=0.000848542904611516
2023-11-21 20:11:48   INFO  epoch: 6/30, acc_iter=43422, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:46:00, time_cost(all): 11:47:17/1 day, 17:39:56, loss=0.529482769455939, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=2.102624015764019, lr=0.000848222170429591
2023-11-21 20:12:37   INFO  epoch: 6/30, acc_iter=43472, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:41, time_cost(all): 11:48:06/1 day, 17:06:11, loss=0.52939965546963, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=1.50128763068338, lr=0.000847901436247667
2023-11-21 20:13:26   INFO  epoch: 6/30, acc_iter=43522, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:43:01, time_cost(all): 11:48:55/1 day, 18:39:56, loss=0.529316541483321, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=1.633332994365667, lr=0.000847580702065742
2023-11-21 20:14:15   INFO  epoch: 6/30, acc_iter=43572, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:41:53, time_cost(all): 11:49:44/1 day, 18:10:28, loss=0.529233427497012, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=1.5623566611292286, lr=0.000847259967883817
2023-11-21 20:15:04   INFO  epoch: 6/30, acc_iter=43622, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:41:37, time_cost(all): 11:50:33/1 day, 19:36:02, loss=0.529150313510702, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=2.7960597947911454, lr=0.000846939233701893
2023-11-21 20:15:53   INFO  epoch: 6/30, acc_iter=43672, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:45, time_cost(all): 11:51:22/1 day, 18:54:50, loss=0.529067199524393, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=4.399678533070639, lr=0.000846618499519968
2023-11-21 20:16:42   INFO  epoch: 6/30, acc_iter=43722, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:39:07, time_cost(all): 11:52:11/1 day, 18:35:13, loss=0.528984085538084, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=4.9720663340528635, lr=0.000846297765338043
2023-11-21 20:17:32   INFO  epoch: 6/30, acc_iter=43772, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:36:24, time_cost(all): 11:53:01/1 day, 19:20:30, loss=0.528900971551775, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=2.731854587355595, lr=0.000845977031156118
2023-11-21 20:18:21   INFO  epoch: 6/30, acc_iter=43822, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:39:14, time_cost(all): 11:53:50/1 day, 17:46:27, loss=0.528817857565466, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=3.777955448033271, lr=0.000845656296974194
2023-11-21 20:19:10   INFO  epoch: 6/30, acc_iter=43872, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:35:23, time_cost(all): 11:54:39/1 day, 17:52:37, loss=0.528734743579157, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=4.8375452484066175, lr=0.000845335562792269
2023-11-21 20:19:59   INFO  epoch: 6/30, acc_iter=43922, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:31, time_cost(all): 11:55:28/1 day, 19:42:27, loss=0.528651629592848, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=2.0786148335923516, lr=0.000845014828610344
2023-11-21 20:20:48   INFO  epoch: 6/30, acc_iter=43972, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:34:38, time_cost(all): 11:56:17/1 day, 17:40:02, loss=0.528568515606539, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=4.6686712808999635, lr=0.00084469409442842
2023-11-21 20:21:37   INFO  epoch: 6/30, acc_iter=44022, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:25, time_cost(all): 11:57:06/1 day, 19:19:51, loss=0.52848540162023, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=1.1129670000957983, lr=0.000844373360246495
2023-11-21 20:22:26   INFO  epoch: 6/30, acc_iter=44072, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:32:34, time_cost(all): 11:57:55/1 day, 16:02:53, loss=0.528402287633921, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=4.988094438082246, lr=0.00084405262606457
2023-11-21 20:23:15   INFO  epoch: 6/30, acc_iter=44122, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:37, time_cost(all): 11:58:44/1 day, 17:48:20, loss=0.528319173647612, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=2.6083640816274545, lr=0.000843731891882646
2023-11-21 20:24:05   INFO  epoch: 6/30, acc_iter=44172, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:10, time_cost(all): 11:59:34/1 day, 19:48:03, loss=0.528236059661303, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=1.5704797896600122, lr=0.000843411157700721
2023-11-21 20:24:54   INFO  epoch: 6/30, acc_iter=44222, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:32:04, time_cost(all): 12:00:23/1 day, 16:01:13, loss=0.528152945674994, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=2.2900662234179796, lr=0.000843090423518796
2023-11-21 20:25:43   INFO  epoch: 6/30, acc_iter=44272, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:01, time_cost(all): 12:01:12/1 day, 19:18:30, loss=0.528069831688685, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=4.3799857564819344, lr=0.000842769689336871
2023-11-21 20:26:32   INFO  epoch: 6/30, acc_iter=44322, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:45, time_cost(all): 12:02:01/1 day, 18:35:26, loss=0.527986717702376, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.4420449378023843, lr=0.000842448955154947
2023-11-21 20:27:21   INFO  epoch: 6/30, acc_iter=44372, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:05, time_cost(all): 12:02:50/1 day, 19:15:49, loss=0.527903603716066, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=1.0710011363038778, lr=0.000842128220973022
2023-11-21 20:28:10   INFO  epoch: 6/30, acc_iter=44422, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:08, time_cost(all): 12:03:39/1 day, 18:05:38, loss=0.527820489729757, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=2.984250336723776, lr=0.000841807486791097
2023-11-21 20:28:59   INFO  epoch: 6/30, acc_iter=44472, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:17, time_cost(all): 12:04:28/1 day, 18:43:03, loss=0.527737375743448, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=3.672852075433735, lr=0.000841486752609173
2023-11-21 20:29:48   INFO  epoch: 6/30, acc_iter=44522, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:48, time_cost(all): 12:05:17/1 day, 15:57:57, loss=0.527654261757139, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=4.17876733691712, lr=0.000841166018427248
2023-11-21 20:30:37   INFO  epoch: 6/30, acc_iter=44572, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:28, time_cost(all): 12:06:06/1 day, 19:22:41, loss=0.52757114777083, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.9917245654635636, lr=0.000840845284245323
2023-11-21 20:31:27   INFO  epoch: 6/30, acc_iter=44622, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:58, time_cost(all): 12:06:56/1 day, 17:39:25, loss=0.527488033784521, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=1.6670183005348105, lr=0.000840524550063399
2023-11-21 20:32:16   INFO  epoch: 6/30, acc_iter=44672, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:29, time_cost(all): 12:07:45/1 day, 16:11:45, loss=0.527404919798212, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=2.3767986584326453, lr=0.000840203815881474
2023-11-21 20:33:05   INFO  epoch: 6/30, acc_iter=44722, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:46, time_cost(all): 12:08:34/1 day, 16:32:09, loss=0.527321805811903, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=0.7690893707540933, lr=0.000839883081699549
2023-11-21 20:33:54   INFO  epoch: 6/30, acc_iter=44772, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:20, time_cost(all): 12:09:23/1 day, 17:56:05, loss=0.527238691825594, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=1.3222578515179264, lr=0.000839562347517624
2023-11-21 20:34:43   INFO  epoch: 6/30, acc_iter=44822, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:24, time_cost(all): 12:10:12/1 day, 18:49:54, loss=0.527155577839285, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=4.773909543993155, lr=0.0008392416133357
2023-11-21 20:35:32   INFO  epoch: 6/30, acc_iter=44872, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:44, time_cost(all): 12:11:01/1 day, 16:45:32, loss=0.527072463852976, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=4.345390541667092, lr=0.000838920879153775
2023-11-21 20:36:21   INFO  epoch: 6/30, acc_iter=44922, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:58, time_cost(all): 12:11:50/1 day, 17:22:11, loss=0.526989349866667, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=2.4441347829445714, lr=0.00083860014497185
2023-11-21 20:37:10   INFO  epoch: 6/30, acc_iter=44972, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:44, time_cost(all): 12:12:39/1 day, 16:01:25, loss=0.526906235880358, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=2.6447359014678833, lr=0.000838279410789926
2023-11-21 20:38:00   INFO  epoch: 6/30, acc_iter=45022, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:09, time_cost(all): 12:13:29/1 day, 19:31:22, loss=0.526823121894049, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=1.213459544282556, lr=0.000837958676608001
2023-11-21 20:38:49   INFO  epoch: 6/30, acc_iter=45072, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:29, time_cost(all): 12:14:18/1 day, 19:38:03, loss=0.52674000790774, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=4.248151793494595, lr=0.000837637942426076
2023-11-21 20:39:38   INFO  epoch: 6/30, acc_iter=45122, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:46, time_cost(all): 12:15:07/1 day, 17:16:47, loss=0.526656893921431, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=1.6671883645676455, lr=0.000837317208244151
2023-11-21 20:40:27   INFO  epoch: 6/30, acc_iter=45172, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:14:52, time_cost(all): 12:15:56/1 day, 18:59:41, loss=0.526573779935121, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=2.6965261234913216, lr=0.000836996474062227
2023-11-21 20:41:16   INFO  epoch: 6/30, acc_iter=45222, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:49, time_cost(all): 12:16:45/1 day, 16:15:04, loss=0.526490665948812, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=1.7303962406019682, lr=0.000836675739880302
2023-11-21 20:42:05   INFO  epoch: 6/30, acc_iter=45272, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:55, time_cost(all): 12:17:34/1 day, 19:03:19, loss=0.526407551962503, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.383818371603924, lr=0.000836355005698377
2023-11-21 20:42:54   INFO  epoch: 6/30, acc_iter=45322, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:13, time_cost(all): 12:18:23/1 day, 16:25:41, loss=0.526324437976194, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=4.897433452999771, lr=0.000836034271516453
2023-11-21 20:43:43   INFO  epoch: 6/30, acc_iter=45372, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:31, time_cost(all): 12:19:12/1 day, 18:23:11, loss=0.526241323989885, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=3.079651704178037, lr=0.000835713537334528
2023-11-21 20:44:32   INFO  epoch: 6/30, acc_iter=45422, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:38, time_cost(all): 12:20:01/1 day, 15:56:57, loss=0.526158210003576, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=1.9008676586209305, lr=0.000835392803152603
2023-11-21 20:45:22   INFO  epoch: 6/30, acc_iter=45472, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:42, time_cost(all): 12:20:51/1 day, 18:22:44, loss=0.526075096017267, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=3.2563150998836394, lr=0.000835072068970679
2023-11-21 20:46:11   INFO  epoch: 6/30, acc_iter=45522, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:39, time_cost(all): 12:21:40/1 day, 19:28:41, loss=0.525991982030958, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=2.216667440957822, lr=0.000834751334788754
2023-11-21 20:47:00   INFO  epoch: 6/30, acc_iter=45572, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:57, time_cost(all): 12:22:29/1 day, 16:23:04, loss=0.525908868044649, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=2.1192768274046383, lr=0.000834430600606829
2023-11-21 20:47:49   INFO  epoch: 6/30, acc_iter=45622, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:47, time_cost(all): 12:23:18/1 day, 16:00:13, loss=0.52582575405834, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=4.335320761894075, lr=0.000834109866424904
2023-11-21 20:48:38   INFO  epoch: 6/30, acc_iter=45672, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:08, time_cost(all): 12:24:07/1 day, 17:34:05, loss=0.525742640072031, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=4.367932905866873, lr=0.00083378913224298
2023-11-21 20:49:27   INFO  epoch: 6/30, acc_iter=45722, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:05, time_cost(all): 12:24:56/1 day, 16:13:57, loss=0.525659526085722, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.0151347171328473, lr=0.000833468398061055
2023-11-21 20:50:16   INFO  epoch: 6/30, acc_iter=45772, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:39, time_cost(all): 12:25:45/1 day, 16:38:09, loss=0.525576412099413, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=3.7709111876885717, lr=0.00083314766387913
2023-11-21 20:51:05   INFO  epoch: 6/30, acc_iter=45822, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:33, time_cost(all): 12:26:34/1 day, 16:48:02, loss=0.525493298113104, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=4.766519494168799, lr=0.000832826929697206
2023-11-21 20:51:55   INFO  epoch: 6/30, acc_iter=45872, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:44, time_cost(all): 12:27:24/1 day, 17:19:01, loss=0.525410184126795, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=1.532530862114543, lr=0.000832506195515281
2023-11-21 20:52:44   INFO  epoch: 6/30, acc_iter=45922, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:03, time_cost(all): 12:28:13/1 day, 16:56:22, loss=0.525327070140486, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=3.7387939190557793, lr=0.000832185461333356
2023-11-21 20:53:33   INFO  epoch: 6/30, acc_iter=45972, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:15, time_cost(all): 12:29:02/1 day, 15:41:35, loss=0.525243956154177, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=2.7221263233974677, lr=0.000831864727151431
2023-11-21 20:54:22   INFO  epoch: 6/30, acc_iter=46022, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:28, time_cost(all): 12:29:51/1 day, 18:55:59, loss=0.525160842167867, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=3.8525222140368025, lr=0.000831543992969507
2023-11-21 20:55:11   INFO  epoch: 6/30, acc_iter=46072, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:37, time_cost(all): 12:30:40/1 day, 15:27:39, loss=0.525077728181558, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=4.662041038414553, lr=0.000831223258787582
2023-11-21 20:56:00   INFO  epoch: 7/30, acc_iter=46159, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:49:04, time_cost(all): 12:31:29/1 day, 18:31:38, loss=0.524933109845381, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=1.8604926391888403, lr=0.000830665181311033
2023-11-21 20:56:49   INFO  epoch: 7/30, acc_iter=46209, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:50:20, time_cost(all): 12:32:18/1 day, 16:25:29, loss=0.524849995859072, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=2.980366985322731, lr=0.000830344447129108
2023-11-21 20:57:38   INFO  epoch: 7/30, acc_iter=46259, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:49:04, time_cost(all): 12:33:07/1 day, 17:28:53, loss=0.524766881872762, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=3.0348652984614404, lr=0.000830023712947184
2023-11-21 20:58:27   INFO  epoch: 7/30, acc_iter=46309, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:39:50, time_cost(all): 12:33:56/1 day, 16:21:21, loss=0.524683767886453, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=1.1547866815104355, lr=0.000829702978765259
2023-11-21 20:59:17   INFO  epoch: 7/30, acc_iter=46359, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:38:44, time_cost(all): 12:34:46/1 day, 16:29:12, loss=0.524600653900144, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=4.298966170021506, lr=0.000829382244583334
2023-11-21 21:00:06   INFO  epoch: 7/30, acc_iter=46409, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:38:16, time_cost(all): 12:35:35/1 day, 17:58:22, loss=0.524517539913835, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=2.3595642641398156, lr=0.00082906151040141
2023-11-21 21:00:55   INFO  epoch: 7/30, acc_iter=46459, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:42:59, time_cost(all): 12:36:24/1 day, 17:51:14, loss=0.524434425927526, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=1.2580534166229422, lr=0.000828740776219485
2023-11-21 21:01:44   INFO  epoch: 7/30, acc_iter=46509, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:37:15, time_cost(all): 12:37:13/1 day, 16:15:02, loss=0.524351311941217, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=2.16792725351753, lr=0.00082842004203756
2023-11-21 21:02:33   INFO  epoch: 7/30, acc_iter=46559, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:39:11, time_cost(all): 12:38:02/1 day, 18:41:13, loss=0.524268197954908, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=3.962523474045438, lr=0.000828099307855635
2023-11-21 21:03:22   INFO  epoch: 7/30, acc_iter=46609, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:41:29, time_cost(all): 12:38:51/1 day, 18:40:46, loss=0.524185083968599, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=2.909152208652144, lr=0.000827778573673711
2023-11-21 21:04:11   INFO  epoch: 7/30, acc_iter=46659, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:36:03, time_cost(all): 12:39:40/1 day, 19:03:43, loss=0.52410196998229, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=0.7858375384441469, lr=0.000827457839491786
2023-11-21 21:05:00   INFO  epoch: 7/30, acc_iter=46709, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:34:38, time_cost(all): 12:40:29/1 day, 16:12:54, loss=0.524018855995981, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=4.100279299887751, lr=0.000827137105309861
2023-11-21 21:05:49   INFO  epoch: 7/30, acc_iter=46759, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:35:46, time_cost(all): 12:41:18/1 day, 15:35:03, loss=0.523935742009672, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=0.8008520195157471, lr=0.000826816371127937
2023-11-21 21:06:39   INFO  epoch: 7/30, acc_iter=46809, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:37:04, time_cost(all): 12:42:08/1 day, 18:08:10, loss=0.523852628023363, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=2.3076915996729737, lr=0.000826495636946012
2023-11-21 21:07:28   INFO  epoch: 7/30, acc_iter=46859, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:33:10, time_cost(all): 12:42:57/1 day, 19:11:10, loss=0.523769514037054, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=4.142314696712697, lr=0.000826174902764087
2023-11-21 21:08:17   INFO  epoch: 7/30, acc_iter=46909, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:36:42, time_cost(all): 12:43:46/1 day, 15:58:30, loss=0.523686400050745, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=4.1655354442253145, lr=0.000825854168582163
2023-11-21 21:09:06   INFO  epoch: 7/30, acc_iter=46959, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:29:17, time_cost(all): 12:44:35/1 day, 16:49:49, loss=0.523603286064436, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=4.834741455951155, lr=0.000825533434400238
2023-11-21 21:09:55   INFO  epoch: 7/30, acc_iter=47009, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:28:53, time_cost(all): 12:45:24/1 day, 17:22:27, loss=0.523520172078127, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=2.546380525821872, lr=0.000825212700218313
2023-11-21 21:10:44   INFO  epoch: 7/30, acc_iter=47059, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:35:42, time_cost(all): 12:46:13/1 day, 16:13:54, loss=0.523437058091817, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=2.2252159181882916, lr=0.000824891966036388
2023-11-21 21:11:33   INFO  epoch: 7/30, acc_iter=47109, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:34:45, time_cost(all): 12:47:02/1 day, 17:51:22, loss=0.523353944105508, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=4.118852886983857, lr=0.000824571231854464
2023-11-21 21:12:22   INFO  epoch: 7/30, acc_iter=47159, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:29:56, time_cost(all): 12:47:51/1 day, 17:47:20, loss=0.523270830119199, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=0.9672753521810695, lr=0.000824250497672539
2023-11-21 21:13:12   INFO  epoch: 7/30, acc_iter=47209, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:27:18, time_cost(all): 12:48:41/1 day, 15:52:04, loss=0.52318771613289, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=2.875486696605409, lr=0.000823929763490614
2023-11-21 21:14:01   INFO  epoch: 7/30, acc_iter=47259, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:31:31, time_cost(all): 12:49:30/1 day, 16:32:01, loss=0.523104602146581, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=1.6322946328329326, lr=0.00082360902930869
2023-11-21 21:14:50   INFO  epoch: 7/30, acc_iter=47309, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:26:28, time_cost(all): 12:50:19/1 day, 18:27:29, loss=0.523021488160272, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=4.320883961165219, lr=0.000823288295126765
2023-11-21 21:15:39   INFO  epoch: 7/30, acc_iter=47359, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:26:44, time_cost(all): 12:51:08/1 day, 15:25:47, loss=0.522938374173963, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.83(1.03), norm=1.2647174888100443, lr=0.00082296756094484
2023-11-21 21:16:28   INFO  epoch: 7/30, acc_iter=47409, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:24:49, time_cost(all): 12:51:57/1 day, 16:07:29, loss=0.522855260187654, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=0.6352111274830634, lr=0.000822646826762916
2023-11-21 21:17:17   INFO  epoch: 7/30, acc_iter=47459, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:22:54, time_cost(all): 12:52:46/1 day, 16:53:47, loss=0.522772146201345, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=2.491159485957198, lr=0.000822326092580991
2023-11-21 21:18:06   INFO  epoch: 7/30, acc_iter=47509, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:29:08, time_cost(all): 12:53:35/1 day, 16:07:24, loss=0.522689032215036, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=1.9105376580612667, lr=0.000822005358399066
2023-11-21 21:18:55   INFO  epoch: 7/30, acc_iter=47559, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:21:32, time_cost(all): 12:54:24/1 day, 17:52:35, loss=0.522605918228727, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=4.704216202049573, lr=0.000821684624217141
2023-11-21 21:19:44   INFO  epoch: 7/30, acc_iter=47609, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:20:04, time_cost(all): 12:55:13/1 day, 18:38:29, loss=0.522522804242418, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=3.5216105840377043, lr=0.000821363890035217
2023-11-21 21:20:34   INFO  epoch: 7/30, acc_iter=47659, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:26:30, time_cost(all): 12:56:03/1 day, 16:06:15, loss=0.522439690256109, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=3.5606260725706322, lr=0.000821043155853292
2023-11-21 21:21:23   INFO  epoch: 7/30, acc_iter=47709, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:22:42, time_cost(all): 12:56:52/1 day, 18:00:59, loss=0.5223565762698, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=4.727123096060401, lr=0.000820722421671367
2023-11-21 21:22:12   INFO  epoch: 7/30, acc_iter=47759, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:23:19, time_cost(all): 12:57:41/1 day, 16:52:24, loss=0.522273462283491, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=4.847226558823773, lr=0.000820401687489443
2023-11-21 21:23:01   INFO  epoch: 7/30, acc_iter=47809, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:18:08, time_cost(all): 12:58:30/1 day, 18:03:27, loss=0.522190348297181, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=1.2955600460141001, lr=0.000820080953307518
2023-11-21 21:23:50   INFO  epoch: 7/30, acc_iter=47859, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:15:59, time_cost(all): 12:59:19/1 day, 16:49:30, loss=0.522107234310872, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=3.451731954794738, lr=0.000819760219125593
2023-11-21 21:24:39   INFO  epoch: 7/30, acc_iter=47909, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:18:42, time_cost(all): 13:00:08/1 day, 16:42:25, loss=0.522024120324563, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=2.840783543905273, lr=0.000819439484943668
2023-11-21 21:25:28   INFO  epoch: 7/30, acc_iter=47959, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:14:19, time_cost(all): 13:00:57/1 day, 17:42:43, loss=0.521941006338254, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=2.8606454534223085, lr=0.000819118750761744
2023-11-21 21:26:17   INFO  epoch: 7/30, acc_iter=48009, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:15:17, time_cost(all): 13:01:46/1 day, 17:24:13, loss=0.521857892351945, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=0.8532318318851825, lr=0.000818798016579819
2023-11-21 21:27:07   INFO  epoch: 7/30, acc_iter=48059, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:16:07, time_cost(all): 13:02:36/1 day, 16:04:50, loss=0.521774778365636, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=1.5807490949291911, lr=0.000818477282397894
2023-11-21 21:27:56   INFO  epoch: 7/30, acc_iter=48109, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:12:47, time_cost(all): 13:03:25/1 day, 14:59:04, loss=0.521691664379327, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=3.740382604692571, lr=0.00081815654821597
2023-11-21 21:28:45   INFO  epoch: 7/30, acc_iter=48159, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:13:54, time_cost(all): 13:04:14/1 day, 18:09:45, loss=0.521608550393018, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=2.226727265080957, lr=0.000817835814034045
2023-11-21 21:29:34   INFO  epoch: 7/30, acc_iter=48209, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:14:21, time_cost(all): 13:05:03/1 day, 17:14:35, loss=0.521525436406709, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=1.1939099269090259, lr=0.00081751507985212
2023-11-21 21:30:23   INFO  epoch: 7/30, acc_iter=48259, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:15:42, time_cost(all): 13:05:52/1 day, 15:59:15, loss=0.5214423224204, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=2.039104527676294, lr=0.000817194345670196
2023-11-21 21:31:12   INFO  epoch: 7/30, acc_iter=48309, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:13:50, time_cost(all): 13:06:41/1 day, 15:07:50, loss=0.521359208434091, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=0.600738498095905, lr=0.000816873611488271
2023-11-21 21:32:01   INFO  epoch: 7/30, acc_iter=48359, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:13:27, time_cost(all): 13:07:30/1 day, 18:34:25, loss=0.521276094447782, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=1.2426349578631979, lr=0.000816552877306346
2023-11-21 21:32:50   INFO  epoch: 7/30, acc_iter=48409, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:09:33, time_cost(all): 13:08:19/1 day, 15:03:29, loss=0.521192980461473, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=3.0234675741872397, lr=0.000816232143124421
2023-11-21 21:33:39   INFO  epoch: 7/30, acc_iter=48459, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:12:03, time_cost(all): 13:09:08/1 day, 17:57:44, loss=0.521109866475164, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=1.1846166485313756, lr=0.000815911408942497
2023-11-21 21:34:29   INFO  epoch: 7/30, acc_iter=48509, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:00, time_cost(all): 13:09:58/1 day, 15:41:13, loss=0.521026752488855, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=4.662075660649275, lr=0.000815590674760572
2023-11-21 21:35:18   INFO  epoch: 7/30, acc_iter=48559, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:10:09, time_cost(all): 13:10:47/1 day, 17:30:53, loss=0.520943638502546, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=3.1347712650276476, lr=0.000815269940578647
2023-11-21 21:36:07   INFO  epoch: 7/30, acc_iter=48609, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:05:27, time_cost(all): 13:11:36/1 day, 15:24:37, loss=0.520860524516237, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=1.88364958944021, lr=0.000814949206396723
2023-11-21 21:36:56   INFO  epoch: 7/30, acc_iter=48659, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:05:48, time_cost(all): 13:12:25/1 day, 17:32:30, loss=0.520777410529927, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=3.8748236434494405, lr=0.000814628472214798
2023-11-21 21:37:45   INFO  epoch: 7/30, acc_iter=48709, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:05:44, time_cost(all): 13:13:14/1 day, 18:33:01, loss=0.520694296543618, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=3.8603234604388312, lr=0.000814307738032873
2023-11-21 21:38:34   INFO  epoch: 7/30, acc_iter=48759, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:07:29, time_cost(all): 13:14:03/1 day, 14:50:05, loss=0.520611182557309, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=4.543218937641785, lr=0.000813987003850948
2023-11-21 21:39:23   INFO  epoch: 7/30, acc_iter=48809, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:02:40, time_cost(all): 13:14:52/1 day, 17:44:06, loss=0.520528068571, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=2.8374595022664986, lr=0.000813666269669024
2023-11-21 21:40:12   INFO  epoch: 7/30, acc_iter=48859, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:04:41, time_cost(all): 13:15:41/1 day, 15:38:30, loss=0.520444954584691, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=2.0713252215637645, lr=0.000813345535487099
2023-11-21 21:41:02   INFO  epoch: 7/30, acc_iter=48909, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:00:30, time_cost(all): 13:16:31/1 day, 15:29:41, loss=0.520361840598382, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=4.535460014558275, lr=0.000813024801305174
2023-11-21 21:41:51   INFO  epoch: 7/30, acc_iter=48959, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:00:52, time_cost(all): 13:17:20/1 day, 17:46:21, loss=0.520278726612073, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.7057255704760297, lr=0.00081270406712325
2023-11-21 21:42:40   INFO  epoch: 7/30, acc_iter=49009, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:59:24, time_cost(all): 13:18:09/1 day, 16:57:51, loss=0.520195612625764, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=1.5776282087748188, lr=0.000812383332941325
2023-11-21 21:43:29   INFO  epoch: 7/30, acc_iter=49059, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:02:28, time_cost(all): 13:18:58/1 day, 16:01:50, loss=0.520112498639455, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=1.9483927172657878, lr=0.0008120625987594
2023-11-21 21:44:18   INFO  epoch: 7/30, acc_iter=49109, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/1:00:05, time_cost(all): 13:19:47/1 day, 15:54:47, loss=0.520029384653146, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=4.406382379875472, lr=0.000811741864577476
2023-11-21 21:45:07   INFO  epoch: 7/30, acc_iter=49159, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:58:36, time_cost(all): 13:20:36/1 day, 15:55:00, loss=0.519946270666837, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=2.133020661740933, lr=0.000811421130395551
2023-11-21 21:45:56   INFO  epoch: 7/30, acc_iter=49209, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:50, time_cost(all): 13:21:25/1 day, 15:20:00, loss=0.519863156680528, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=1.2490284146660557, lr=0.000811100396213626
2023-11-21 21:46:45   INFO  epoch: 7/30, acc_iter=49259, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:25, time_cost(all): 13:22:14/1 day, 17:13:39, loss=0.519780042694219, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=0.5979241171788499, lr=0.000810779662031701
2023-11-21 21:47:34   INFO  epoch: 7/30, acc_iter=49309, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:52:45, time_cost(all): 13:23:03/1 day, 16:24:00, loss=0.51969692870791, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=2.9835453246559105, lr=0.000810458927849777
2023-11-21 21:48:24   INFO  epoch: 7/30, acc_iter=49359, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:52:24, time_cost(all): 13:23:53/1 day, 15:39:51, loss=0.519613814721601, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=3.8237394278982664, lr=0.000810138193667852
2023-11-21 21:49:13   INFO  epoch: 7/30, acc_iter=49409, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:54:29, time_cost(all): 13:24:42/1 day, 18:25:00, loss=0.519530700735292, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=1.5219546935972736, lr=0.000809817459485927
2023-11-21 21:50:02   INFO  epoch: 7/30, acc_iter=49459, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:52:05, time_cost(all): 13:25:31/1 day, 18:30:48, loss=0.519447586748982, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=3.0303441546706997, lr=0.000809496725304003
2023-11-21 21:50:51   INFO  epoch: 7/30, acc_iter=49509, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:53:37, time_cost(all): 13:26:20/1 day, 15:50:25, loss=0.519364472762673, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=3.5592584948623105, lr=0.000809175991122078
2023-11-21 21:51:40   INFO  epoch: 7/30, acc_iter=49559, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:49, time_cost(all): 13:27:09/1 day, 16:50:34, loss=0.519281358776364, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=2.806768568721202, lr=0.000808855256940153
2023-11-21 21:52:29   INFO  epoch: 7/30, acc_iter=49609, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:49:26, time_cost(all): 13:27:58/1 day, 16:48:26, loss=0.519198244790055, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=4.488284631134003, lr=0.000808534522758228
2023-11-21 21:53:18   INFO  epoch: 7/30, acc_iter=49659, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:50:03, time_cost(all): 13:28:47/1 day, 16:21:52, loss=0.519115130803746, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=3.0250438736302208, lr=0.000808213788576304
2023-11-21 21:54:07   INFO  epoch: 7/30, acc_iter=49709, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:46:54, time_cost(all): 13:29:36/1 day, 17:08:31, loss=0.519032016817437, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=1.3697899714557122, lr=0.000807893054394379
2023-11-21 21:54:57   INFO  epoch: 7/30, acc_iter=49759, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:47:38, time_cost(all): 13:30:26/1 day, 15:08:12, loss=0.518948902831128, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=2.0598999175142256, lr=0.000807572320212454
2023-11-21 21:55:46   INFO  epoch: 7/30, acc_iter=49809, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:24, time_cost(all): 13:31:15/1 day, 17:51:39, loss=0.518865788844819, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=1.4626368870755229, lr=0.00080725158603053
2023-11-21 21:56:35   INFO  epoch: 7/30, acc_iter=49859, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:48:09, time_cost(all): 13:32:04/1 day, 16:44:13, loss=0.51878267485851, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=0.8023010024087396, lr=0.000806930851848605
2023-11-21 21:57:24   INFO  epoch: 7/30, acc_iter=49909, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:20, time_cost(all): 13:32:53/1 day, 18:13:48, loss=0.518699560872201, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=1.2763364115347562, lr=0.00080661011766668
2023-11-21 21:58:13   INFO  epoch: 7/30, acc_iter=49959, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:42:35, time_cost(all): 13:33:42/1 day, 16:57:31, loss=0.518616446885892, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=2.1602949225918846, lr=0.000806289383484756
2023-11-21 21:59:02   INFO  epoch: 7/30, acc_iter=50009, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:55, time_cost(all): 13:34:31/1 day, 16:08:25, loss=0.518533332899583, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=1.8469741628492837, lr=0.000805968649302831
2023-11-21 21:59:51   INFO  epoch: 7/30, acc_iter=50059, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:43:37, time_cost(all): 13:35:20/1 day, 16:05:17, loss=0.518450218913274, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=0.7747863783257911, lr=0.000805647915120906
2023-11-21 22:00:40   INFO  epoch: 7/30, acc_iter=50109, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:41:52, time_cost(all): 13:36:09/1 day, 14:41:59, loss=0.518367104926965, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=0.6965212718731183, lr=0.000805327180938981
2023-11-21 22:01:29   INFO  epoch: 7/30, acc_iter=50159, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:43:01, time_cost(all): 13:36:58/1 day, 15:21:46, loss=0.518283990940656, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=2.762083413617333, lr=0.000805006446757057
2023-11-21 22:02:19   INFO  epoch: 7/30, acc_iter=50209, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:38:41, time_cost(all): 13:37:48/1 day, 16:08:33, loss=0.518200876954346, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.3051865416800448, lr=0.000804685712575132
2023-11-21 22:03:08   INFO  epoch: 7/30, acc_iter=50259, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:41:05, time_cost(all): 13:38:37/1 day, 16:55:31, loss=0.518117762968037, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=3.4824436436524135, lr=0.000804364978393207
2023-11-21 22:03:57   INFO  epoch: 7/30, acc_iter=50309, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:39:42, time_cost(all): 13:39:26/1 day, 14:45:22, loss=0.518034648981728, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=1.9750543256883966, lr=0.000804044244211283
2023-11-21 22:04:46   INFO  epoch: 7/30, acc_iter=50359, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:37:09, time_cost(all): 13:40:15/1 day, 16:30:06, loss=0.517951534995419, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=2.3673779201041523, lr=0.000803723510029358
2023-11-21 22:05:35   INFO  epoch: 7/30, acc_iter=50409, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:35:37, time_cost(all): 13:41:04/1 day, 17:23:13, loss=0.51786842100911, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=3.089067099524968, lr=0.000803402775847433
2023-11-21 22:06:24   INFO  epoch: 7/30, acc_iter=50459, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:19, time_cost(all): 13:41:53/1 day, 15:15:56, loss=0.517785307022801, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=3.9230984693616144, lr=0.000803082041665508
2023-11-21 22:07:13   INFO  epoch: 7/30, acc_iter=50509, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:29, time_cost(all): 13:42:42/1 day, 15:31:34, loss=0.517702193036492, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=0.9548428265967058, lr=0.000802761307483584
2023-11-21 22:08:02   INFO  epoch: 7/30, acc_iter=50559, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:36:32, time_cost(all): 13:43:31/1 day, 16:25:56, loss=0.517619079050183, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.91(1.03), norm=3.089837483154088, lr=0.000802440573301659
2023-11-21 22:08:52   INFO  epoch: 7/30, acc_iter=50609, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:13, time_cost(all): 13:44:21/1 day, 17:58:52, loss=0.517535965063874, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=3.4805505923771713, lr=0.000802119839119734
2023-11-21 22:09:41   INFO  epoch: 7/30, acc_iter=50659, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:32:06, time_cost(all): 13:45:10/1 day, 15:25:45, loss=0.517452851077565, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=0.6945140352314547, lr=0.00080179910493781
2023-11-21 22:10:30   INFO  epoch: 7/30, acc_iter=50709, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:16, time_cost(all): 13:45:59/1 day, 16:20:11, loss=0.517369737091256, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=0.761964352367926, lr=0.000801478370755885
2023-11-21 22:11:19   INFO  epoch: 7/30, acc_iter=50759, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:32:25, time_cost(all): 13:46:48/1 day, 15:29:46, loss=0.517286623104947, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=0.5628304126586197, lr=0.00080115763657396
2023-11-21 22:12:08   INFO  epoch: 7/30, acc_iter=50809, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:24, time_cost(all): 13:47:37/1 day, 17:47:56, loss=0.517203509118638, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=4.229454519667556, lr=0.000800836902392036
2023-11-21 22:12:57   INFO  epoch: 7/30, acc_iter=50859, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:31:31, time_cost(all): 13:48:26/1 day, 15:00:26, loss=0.517120395132329, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=0.9338305140245189, lr=0.000800516168210111
2023-11-21 22:13:46   INFO  epoch: 7/30, acc_iter=50909, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:30:07, time_cost(all): 13:49:15/1 day, 15:22:02, loss=0.51703728114602, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=4.629937177994417, lr=0.000800195434028186
2023-11-21 22:14:35   INFO  epoch: 7/30, acc_iter=50959, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:20, time_cost(all): 13:50:04/1 day, 15:44:41, loss=0.516954167159711, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=4.258482904054563, lr=0.000799874699846261
2023-11-21 22:15:24   INFO  epoch: 7/30, acc_iter=51009, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:26:20, time_cost(all): 13:50:53/1 day, 14:40:03, loss=0.516871053173402, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=3.0230815222533183, lr=0.000799553965664337
2023-11-21 22:16:14   INFO  epoch: 7/30, acc_iter=51059, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:40, time_cost(all): 13:51:43/1 day, 16:04:53, loss=0.516787939187092, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=0.9514763308749273, lr=0.000799233231482412
2023-11-21 22:17:03   INFO  epoch: 7/30, acc_iter=51109, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:42, time_cost(all): 13:52:32/1 day, 14:43:28, loss=0.516704825200783, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=1.8190947055017777, lr=0.000798912497300487
2023-11-21 22:17:52   INFO  epoch: 7/30, acc_iter=51159, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:34, time_cost(all): 13:53:21/1 day, 17:19:46, loss=0.516621711214474, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=1.0512041666660936, lr=0.000798591763118563
2023-11-21 22:18:41   INFO  epoch: 7/30, acc_iter=51209, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:25:22, time_cost(all): 13:54:10/1 day, 14:05:18, loss=0.516538597228165, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=2.088377727159807, lr=0.000798271028936638
2023-11-21 22:19:30   INFO  epoch: 7/30, acc_iter=51259, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:36, time_cost(all): 13:54:59/1 day, 15:01:25, loss=0.516455483241856, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=2.8005875177854667, lr=0.000797950294754713
2023-11-21 22:20:19   INFO  epoch: 7/30, acc_iter=51309, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:30, time_cost(all): 13:55:48/1 day, 15:44:26, loss=0.516372369255547, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=2.0096732050935957, lr=0.000797629560572788
2023-11-21 22:21:08   INFO  epoch: 7/30, acc_iter=51359, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:39, time_cost(all): 13:56:37/1 day, 14:07:17, loss=0.516289255269238, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=1.4196293499947095, lr=0.000797308826390864
2023-11-21 22:21:57   INFO  epoch: 7/30, acc_iter=51409, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:01, time_cost(all): 13:57:26/1 day, 15:40:14, loss=0.516206141282929, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=1.0119229308804631, lr=0.000796988092208939
2023-11-21 22:22:47   INFO  epoch: 7/30, acc_iter=51459, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:30, time_cost(all): 13:58:16/1 day, 15:57:42, loss=0.51612302729662, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=1.0777044756044567, lr=0.000796667358027014
2023-11-21 22:23:36   INFO  epoch: 7/30, acc_iter=51509, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:50, time_cost(all): 13:59:05/1 day, 15:49:23, loss=0.516039913310311, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=0.7918727933490701, lr=0.00079634662384509
2023-11-21 22:24:25   INFO  epoch: 7/30, acc_iter=51559, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:28, time_cost(all): 13:59:54/1 day, 17:18:13, loss=0.515956799324002, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=2.3858187115229144, lr=0.000796025889663165
2023-11-21 22:25:14   INFO  epoch: 7/30, acc_iter=51609, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:04, time_cost(all): 14:00:43/1 day, 16:49:16, loss=0.515873685337693, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=1.8373288021995609, lr=0.00079570515548124
2023-11-21 22:26:03   INFO  epoch: 7/30, acc_iter=51659, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:40, time_cost(all): 14:01:32/1 day, 17:40:37, loss=0.515790571351384, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=1.2193022445613404, lr=0.000795384421299316
2023-11-21 22:26:52   INFO  epoch: 7/30, acc_iter=51709, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:46, time_cost(all): 14:02:21/1 day, 15:27:16, loss=0.515707457365075, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.853645956179685, lr=0.000795063687117391
2023-11-21 22:27:41   INFO  epoch: 7/30, acc_iter=51759, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:00, time_cost(all): 14:03:10/1 day, 14:25:02, loss=0.515624343378766, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.09(1.03), norm=3.0529058096265134, lr=0.000794742952935466
2023-11-21 22:28:30   INFO  epoch: 7/30, acc_iter=51809, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:52, time_cost(all): 14:03:59/1 day, 16:15:29, loss=0.515541229392457, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=3.3897469487158456, lr=0.000794422218753541
2023-11-21 22:29:19   INFO  epoch: 7/30, acc_iter=51859, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:41, time_cost(all): 14:04:48/1 day, 15:53:07, loss=0.515458115406147, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=3.660935869535811, lr=0.000794101484571617
2023-11-21 22:30:09   INFO  epoch: 7/30, acc_iter=51909, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:11, time_cost(all): 14:05:38/1 day, 17:26:44, loss=0.515375001419838, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=4.115719997676498, lr=0.000793780750389692
2023-11-21 22:30:58   INFO  epoch: 7/30, acc_iter=51959, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:30, time_cost(all): 14:06:27/1 day, 14:05:53, loss=0.515291887433529, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=3.2446610871120902, lr=0.000793460016207767
2023-11-21 22:31:47   INFO  epoch: 7/30, acc_iter=52009, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:11, time_cost(all): 14:07:16/1 day, 14:40:16, loss=0.51520877344722, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=3.108468259027892, lr=0.000793139282025843
2023-11-21 22:32:36   INFO  epoch: 7/30, acc_iter=52059, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:09:57, time_cost(all): 14:08:05/1 day, 14:07:46, loss=0.515125659460911, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=4.66119454715287, lr=0.000792818547843918
2023-11-21 22:33:25   INFO  epoch: 7/30, acc_iter=52109, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:56, time_cost(all): 14:08:54/1 day, 17:05:00, loss=0.515042545474602, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=3.2207058671071165, lr=0.000792497813661993
2023-11-21 22:34:14   INFO  epoch: 7/30, acc_iter=52159, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:25, time_cost(all): 14:09:43/1 day, 16:00:12, loss=0.514959431488293, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=1.455746115911864, lr=0.000792177079480069
2023-11-21 22:35:03   INFO  epoch: 7/30, acc_iter=52209, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:37, time_cost(all): 14:10:32/1 day, 17:19:28, loss=0.514876317501984, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=3.907173022326249, lr=0.000791856345298144
2023-11-21 22:35:52   INFO  epoch: 7/30, acc_iter=52259, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:12, time_cost(all): 14:11:21/1 day, 14:39:21, loss=0.514793203515675, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=2.6908023454465866, lr=0.000791535611116219
2023-11-21 22:36:42   INFO  epoch: 7/30, acc_iter=52309, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:26, time_cost(all): 14:12:11/1 day, 13:50:54, loss=0.514710089529366, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=2.0860233794377443, lr=0.000791214876934294
2023-11-21 22:37:31   INFO  epoch: 7/30, acc_iter=52359, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:34, time_cost(all): 14:13:00/1 day, 15:57:54, loss=0.514626975543057, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=4.305620068479788, lr=0.00079089414275237
2023-11-21 22:38:20   INFO  epoch: 7/30, acc_iter=52409, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:34, time_cost(all): 14:13:49/1 day, 14:54:17, loss=0.514543861556748, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=2.1743590067365415, lr=0.000790573408570445
2023-11-21 22:39:09   INFO  epoch: 7/30, acc_iter=52459, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:59, time_cost(all): 14:14:38/1 day, 16:30:36, loss=0.514460747570439, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=1.306951048135079, lr=0.00079025267438852
2023-11-21 22:39:58   INFO  epoch: 7/30, acc_iter=52509, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:56, time_cost(all): 14:15:27/1 day, 14:36:22, loss=0.51437763358413, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=3.437958076061851, lr=0.000789931940206596
2023-11-21 22:40:47   INFO  epoch: 7/30, acc_iter=52559, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:09, time_cost(all): 14:16:16/1 day, 15:36:55, loss=0.514294519597821, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=3.39045907375143, lr=0.000789611206024671
2023-11-21 22:41:36   INFO  epoch: 7/30, acc_iter=52609, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:26, time_cost(all): 14:17:05/1 day, 16:56:23, loss=0.514211405611511, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=2.825207588665994, lr=0.000789290471842746
2023-11-21 22:42:25   INFO  epoch: 7/30, acc_iter=52659, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 14:17:54/1 day, 15:25:09, loss=0.514128291625202, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=3.751341391566652, lr=0.000788969737660821
2023-11-21 22:43:14   INFO  epoch: 8/30, acc_iter=52746, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:48:33, time_cost(all): 14:18:43/1 day, 16:15:47, loss=0.513983673289025, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=3.839574401850489, lr=0.000788411660184273
2023-11-21 22:44:04   INFO  epoch: 8/30, acc_iter=52796, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:45:57, time_cost(all): 14:19:33/1 day, 16:28:42, loss=0.513900559302716, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=1.1947555128087322, lr=0.000788090926002348
2023-11-21 22:44:53   INFO  epoch: 8/30, acc_iter=52846, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:46:28, time_cost(all): 14:20:22/1 day, 13:49:20, loss=0.513817445316406, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=4.837559962681981, lr=0.000787770191820423
2023-11-21 22:45:42   INFO  epoch: 8/30, acc_iter=52896, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:49:04, time_cost(all): 14:21:11/1 day, 15:51:44, loss=0.513734331330097, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=2.666312619898433, lr=0.000787449457638498
2023-11-21 22:46:31   INFO  epoch: 8/30, acc_iter=52946, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:44:24, time_cost(all): 14:22:00/1 day, 14:33:54, loss=0.513651217343788, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=0.7380606104188223, lr=0.000787128723456574
2023-11-21 22:47:20   INFO  epoch: 8/30, acc_iter=52996, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:38:21, time_cost(all): 14:22:49/1 day, 17:23:02, loss=0.513568103357479, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=4.525589425033594, lr=0.000786807989274649
2023-11-21 22:48:09   INFO  epoch: 8/30, acc_iter=53046, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:41:10, time_cost(all): 14:23:38/1 day, 14:49:57, loss=0.51348498937117, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=0.6695485989698842, lr=0.000786487255092724
2023-11-21 22:48:58   INFO  epoch: 8/30, acc_iter=53096, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:39:22, time_cost(all): 14:24:27/1 day, 13:53:33, loss=0.513401875384861, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=1.7263757717672852, lr=0.0007861665209108
2023-11-21 22:49:47   INFO  epoch: 8/30, acc_iter=53146, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:41:17, time_cost(all): 14:25:16/1 day, 16:23:42, loss=0.513318761398552, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=4.004668721927345, lr=0.000785845786728875
2023-11-21 22:50:36   INFO  epoch: 8/30, acc_iter=53196, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:44:12, time_cost(all): 14:26:05/1 day, 15:33:45, loss=0.513235647412243, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=4.233716095434689, lr=0.00078552505254695
2023-11-21 22:51:26   INFO  epoch: 8/30, acc_iter=53246, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:41:50, time_cost(all): 14:26:55/1 day, 16:56:46, loss=0.513152533425934, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=0.5984522386765532, lr=0.000785204318365025
2023-11-21 22:52:15   INFO  epoch: 8/30, acc_iter=53296, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:35:38, time_cost(all): 14:27:44/1 day, 14:37:07, loss=0.513069419439625, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=1.860847882016664, lr=0.000784883584183101
2023-11-21 22:53:04   INFO  epoch: 8/30, acc_iter=53346, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:34:53, time_cost(all): 14:28:33/1 day, 16:19:26, loss=0.512986305453316, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=1.836097186115603, lr=0.000784562850001176
2023-11-21 22:53:53   INFO  epoch: 8/30, acc_iter=53396, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:35:24, time_cost(all): 14:29:22/1 day, 14:35:42, loss=0.512903191467007, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=3.3410466866211106, lr=0.000784242115819251
2023-11-21 22:54:42   INFO  epoch: 8/30, acc_iter=53446, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:33:01, time_cost(all): 14:30:11/1 day, 13:34:11, loss=0.512820077480698, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=4.594447138285369, lr=0.000783921381637327
2023-11-21 22:55:31   INFO  epoch: 8/30, acc_iter=53496, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:33:06, time_cost(all): 14:31:00/1 day, 15:18:51, loss=0.512736963494389, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=3.7366283974281043, lr=0.000783600647455402
2023-11-21 22:56:20   INFO  epoch: 8/30, acc_iter=53546, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:32:33, time_cost(all): 14:31:49/1 day, 17:19:22, loss=0.51265384950808, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=3.469801181500607, lr=0.000783279913273477
2023-11-21 22:57:09   INFO  epoch: 8/30, acc_iter=53596, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:32:35, time_cost(all): 14:32:38/1 day, 13:48:46, loss=0.512570735521771, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=0.6035549892848032, lr=0.000782959179091553
2023-11-21 22:57:59   INFO  epoch: 8/30, acc_iter=53646, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:34:45, time_cost(all): 14:33:28/1 day, 16:49:40, loss=0.512487621535461, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=2.207500159705233, lr=0.000782638444909628
2023-11-21 22:58:48   INFO  epoch: 8/30, acc_iter=53696, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:30:12, time_cost(all): 14:34:17/1 day, 14:52:25, loss=0.512404507549152, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=3.8576008119527576, lr=0.000782317710727703
2023-11-21 22:59:37   INFO  epoch: 8/30, acc_iter=53746, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:33:49, time_cost(all): 14:35:06/1 day, 15:00:13, loss=0.512321393562843, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=0.9666669229189742, lr=0.000781996976545778
2023-11-21 23:00:26   INFO  epoch: 8/30, acc_iter=53796, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:30:28, time_cost(all): 14:35:55/1 day, 14:24:09, loss=0.512238279576534, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=1.1163829173078796, lr=0.000781676242363854
2023-11-21 23:01:15   INFO  epoch: 8/30, acc_iter=53846, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:27:00, time_cost(all): 14:36:44/1 day, 16:06:29, loss=0.512155165590225, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=1.2195316937350729, lr=0.000781355508181929
2023-11-21 23:02:04   INFO  epoch: 8/30, acc_iter=53896, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:28:16, time_cost(all): 14:37:33/1 day, 15:00:47, loss=0.512072051603916, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=2.235824036852909, lr=0.000781034774000004
2023-11-21 23:02:53   INFO  epoch: 8/30, acc_iter=53946, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:31:38, time_cost(all): 14:38:22/1 day, 13:28:25, loss=0.511988937617607, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=1.9256282934669222, lr=0.00078071403981808
2023-11-21 23:03:42   INFO  epoch: 8/30, acc_iter=53996, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:29:58, time_cost(all): 14:39:11/1 day, 15:21:09, loss=0.511905823631298, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=4.989529709489315, lr=0.000780393305636155
2023-11-21 23:04:31   INFO  epoch: 8/30, acc_iter=54046, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:21:55, time_cost(all): 14:40:00/1 day, 15:36:27, loss=0.511822709644989, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=1.4363527462310957, lr=0.00078007257145423
2023-11-21 23:05:21   INFO  epoch: 8/30, acc_iter=54096, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:23:26, time_cost(all): 14:40:50/1 day, 17:02:35, loss=0.51173959565868, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=1.973221228519273, lr=0.000779751837272305
2023-11-21 23:06:10   INFO  epoch: 8/30, acc_iter=54146, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:22:50, time_cost(all): 14:41:39/1 day, 15:27:57, loss=0.511656481672371, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=3.564828726041581, lr=0.000779431103090381
2023-11-21 23:06:59   INFO  epoch: 8/30, acc_iter=54196, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:24:25, time_cost(all): 14:42:28/1 day, 13:41:48, loss=0.511573367686062, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=2.237227180655181, lr=0.000779110368908456
2023-11-21 23:07:48   INFO  epoch: 8/30, acc_iter=54246, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:24:23, time_cost(all): 14:43:17/1 day, 13:25:42, loss=0.511490253699753, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=2.548823785450233, lr=0.000778789634726531
2023-11-21 23:08:37   INFO  epoch: 8/30, acc_iter=54296, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:18:43, time_cost(all): 14:44:06/1 day, 16:15:20, loss=0.511407139713444, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=3.1251808407454797, lr=0.000778468900544607
2023-11-21 23:09:26   INFO  epoch: 8/30, acc_iter=54346, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:22:32, time_cost(all): 14:44:55/1 day, 14:22:30, loss=0.511324025727135, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=3.4967765003168783, lr=0.000778148166362682
2023-11-21 23:10:15   INFO  epoch: 8/30, acc_iter=54396, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:20:54, time_cost(all): 14:45:44/1 day, 13:16:29, loss=0.511240911740826, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=4.951201176996203, lr=0.000777827432180757
2023-11-21 23:11:04   INFO  epoch: 8/30, acc_iter=54446, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:19:06, time_cost(all): 14:46:33/1 day, 15:31:01, loss=0.511157797754517, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=0.7791034348243469, lr=0.000777506697998833
2023-11-21 23:11:54   INFO  epoch: 8/30, acc_iter=54496, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:22:01, time_cost(all): 14:47:23/1 day, 13:12:32, loss=0.511074683768207, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=4.2747621143876735, lr=0.000777185963816908
2023-11-21 23:12:43   INFO  epoch: 8/30, acc_iter=54546, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:15:59, time_cost(all): 14:48:12/1 day, 13:12:38, loss=0.510991569781898, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=0.5602617606007483, lr=0.000776865229634983
2023-11-21 23:13:32   INFO  epoch: 8/30, acc_iter=54596, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:14:54, time_cost(all): 14:49:01/1 day, 14:18:35, loss=0.510908455795589, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=1.9649218352199278, lr=0.000776544495453058
2023-11-21 23:14:21   INFO  epoch: 8/30, acc_iter=54646, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:16:03, time_cost(all): 14:49:50/1 day, 16:12:40, loss=0.51082534180928, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=2.746559696130952, lr=0.000776223761271134
2023-11-21 23:15:10   INFO  epoch: 8/30, acc_iter=54696, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:16:16, time_cost(all): 14:50:39/1 day, 14:48:13, loss=0.510742227822971, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=0.9261320816539975, lr=0.000775903027089209
2023-11-21 23:15:59   INFO  epoch: 8/30, acc_iter=54746, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:13:22, time_cost(all): 14:51:28/1 day, 16:35:19, loss=0.510659113836662, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=3.7851687688330906, lr=0.000775582292907284
2023-11-21 23:16:48   INFO  epoch: 8/30, acc_iter=54796, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:12:54, time_cost(all): 14:52:17/1 day, 13:22:32, loss=0.510575999850353, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=0.780672709391931, lr=0.00077526155872536
2023-11-21 23:17:37   INFO  epoch: 8/30, acc_iter=54846, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:11:03, time_cost(all): 14:53:06/1 day, 14:04:16, loss=0.510492885864044, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=1.1363820721634517, lr=0.000774940824543435
2023-11-21 23:18:26   INFO  epoch: 8/30, acc_iter=54896, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:11:25, time_cost(all): 14:53:55/1 day, 13:51:29, loss=0.510409771877735, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=2.523188741585301, lr=0.00077462009036151
2023-11-21 23:19:16   INFO  epoch: 8/30, acc_iter=54946, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:13:53, time_cost(all): 14:54:45/1 day, 13:20:38, loss=0.510326657891426, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=0.6305538652317801, lr=0.000774299356179585
2023-11-21 23:20:05   INFO  epoch: 8/30, acc_iter=54996, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:09:47, time_cost(all): 14:55:34/1 day, 14:26:15, loss=0.510243543905117, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=2.7858120940469364, lr=0.000773978621997661
2023-11-21 23:20:54   INFO  epoch: 8/30, acc_iter=55046, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:11:25, time_cost(all): 14:56:23/1 day, 14:33:53, loss=0.510160429918808, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=3.8235298476466353, lr=0.000773657887815736
2023-11-21 23:21:43   INFO  epoch: 8/30, acc_iter=55096, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:37, time_cost(all): 14:57:12/1 day, 15:01:53, loss=0.510077315932499, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=2.4180069339376113, lr=0.000773337153633811
2023-11-21 23:22:32   INFO  epoch: 8/30, acc_iter=55146, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:04:24, time_cost(all): 14:58:01/1 day, 16:04:10, loss=0.50999420194619, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=3.501924889014868, lr=0.000773016419451887
2023-11-21 23:23:21   INFO  epoch: 8/30, acc_iter=55196, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:04:21, time_cost(all): 14:58:50/1 day, 13:48:36, loss=0.50991108795988, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=2.0882829129137517, lr=0.000772695685269962
2023-11-21 23:24:10   INFO  epoch: 8/30, acc_iter=55246, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:02:49, time_cost(all): 14:59:39/1 day, 13:28:53, loss=0.509827973973571, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=1.2388639705165942, lr=0.000772374951088037
2023-11-21 23:24:59   INFO  epoch: 8/30, acc_iter=55296, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:04:04, time_cost(all): 15:00:28/1 day, 14:11:57, loss=0.509744859987262, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=1.5583343590682157, lr=0.000772054216906113
2023-11-21 23:25:49   INFO  epoch: 8/30, acc_iter=55346, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:04:36, time_cost(all): 15:01:18/1 day, 14:31:47, loss=0.509661746000953, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=0.7785824430668803, lr=0.000771733482724188
2023-11-21 23:26:38   INFO  epoch: 8/30, acc_iter=55396, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:05:02, time_cost(all): 15:02:07/1 day, 12:58:22, loss=0.509578632014644, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=4.293936102635887, lr=0.000771412748542263
2023-11-21 23:27:27   INFO  epoch: 8/30, acc_iter=55446, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:02:04, time_cost(all): 15:02:56/1 day, 13:19:37, loss=0.509495518028335, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=4.081693196129594, lr=0.000771092014360338
2023-11-21 23:28:16   INFO  epoch: 8/30, acc_iter=55496, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:01:05, time_cost(all): 15:03:45/1 day, 13:58:42, loss=0.509412404042026, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=1.1114682121589603, lr=0.000770771280178414
2023-11-21 23:29:05   INFO  epoch: 8/30, acc_iter=55546, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:02:30, time_cost(all): 15:04:34/1 day, 13:00:30, loss=0.509329290055717, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=2.0925050209570144, lr=0.000770450545996489
2023-11-21 23:29:54   INFO  epoch: 8/30, acc_iter=55596, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:59:22, time_cost(all): 15:05:23/1 day, 16:39:56, loss=0.509246176069408, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.5812164644252231, lr=0.000770129811814564
2023-11-21 23:30:43   INFO  epoch: 8/30, acc_iter=55646, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:00:48, time_cost(all): 15:06:12/1 day, 15:47:18, loss=0.509163062083099, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=4.968692799060388, lr=0.00076980907763264
2023-11-21 23:31:32   INFO  epoch: 8/30, acc_iter=55696, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:57:42, time_cost(all): 15:07:01/1 day, 14:42:01, loss=0.50907994809679, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=2.112687604406629, lr=0.000769488343450715
2023-11-21 23:32:21   INFO  epoch: 8/30, acc_iter=55746, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/1:00:42, time_cost(all): 15:07:50/1 day, 13:24:43, loss=0.508996834110481, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=2.074422247186272, lr=0.00076916760926879
2023-11-21 23:33:11   INFO  epoch: 8/30, acc_iter=55796, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:55:20, time_cost(all): 15:08:40/1 day, 14:35:41, loss=0.508913720124172, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=4.94985704903548, lr=0.000768846875086865
2023-11-21 23:34:00   INFO  epoch: 8/30, acc_iter=55846, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:58:38, time_cost(all): 15:09:29/1 day, 14:48:15, loss=0.508830606137863, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=3.419606549235259, lr=0.000768526140904941
2023-11-21 23:34:49   INFO  epoch: 8/30, acc_iter=55896, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:53:47, time_cost(all): 15:10:18/1 day, 16:07:34, loss=0.508747492151554, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.735819008018882, lr=0.000768205406723016
2023-11-21 23:35:38   INFO  epoch: 8/30, acc_iter=55946, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:54:44, time_cost(all): 15:11:07/1 day, 13:19:43, loss=0.508664378165245, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=2.5694923512028733, lr=0.000767884672541091
2023-11-21 23:36:27   INFO  epoch: 8/30, acc_iter=55996, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:55:33, time_cost(all): 15:11:56/1 day, 15:54:59, loss=0.508581264178936, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=2.93118709526057, lr=0.000767563938359167
2023-11-21 23:37:16   INFO  epoch: 8/30, acc_iter=56046, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:55:33, time_cost(all): 15:12:45/1 day, 14:47:38, loss=0.508498150192626, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=1.2854440301066596, lr=0.000767243204177242
2023-11-21 23:38:05   INFO  epoch: 8/30, acc_iter=56096, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:50:02, time_cost(all): 15:13:34/1 day, 13:57:24, loss=0.508415036206317, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=2.1234201741582517, lr=0.000766922469995317
2023-11-21 23:38:54   INFO  epoch: 8/30, acc_iter=56146, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:50:06, time_cost(all): 15:14:23/1 day, 15:44:07, loss=0.508331922220008, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=4.633827241056587, lr=0.000766601735813393
2023-11-21 23:39:44   INFO  epoch: 8/30, acc_iter=56196, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:53:01, time_cost(all): 15:15:13/1 day, 13:08:31, loss=0.508248808233699, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=1.2147534197798715, lr=0.000766281001631468
2023-11-21 23:40:33   INFO  epoch: 8/30, acc_iter=56246, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:42, time_cost(all): 15:16:02/1 day, 15:33:26, loss=0.50816569424739, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=4.929363212359288, lr=0.000765960267449543
2023-11-21 23:41:22   INFO  epoch: 8/30, acc_iter=56296, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:49:29, time_cost(all): 15:16:51/1 day, 13:00:58, loss=0.508082580261081, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=4.96629245594891, lr=0.000765639533267618
2023-11-21 23:42:11   INFO  epoch: 8/30, acc_iter=56346, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:21, time_cost(all): 15:17:40/1 day, 15:43:59, loss=0.507999466274772, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.8479015236686784, lr=0.000765318799085694
2023-11-21 23:43:00   INFO  epoch: 8/30, acc_iter=56396, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:46:43, time_cost(all): 15:18:29/1 day, 15:47:49, loss=0.507916352288463, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=2.730965989981201, lr=0.000764998064903769
2023-11-21 23:43:49   INFO  epoch: 8/30, acc_iter=56446, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:46:33, time_cost(all): 15:19:18/1 day, 14:03:12, loss=0.507833238302154, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=0.7766038113878508, lr=0.000764677330721844
2023-11-21 23:44:38   INFO  epoch: 8/30, acc_iter=56496, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:10, time_cost(all): 15:20:07/1 day, 16:25:41, loss=0.507750124315845, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=0.7633417681033823, lr=0.00076435659653992
2023-11-21 23:45:27   INFO  epoch: 8/30, acc_iter=56546, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:45:00, time_cost(all): 15:20:56/1 day, 14:58:00, loss=0.507667010329536, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=3.3709710498086385, lr=0.000764035862357995
2023-11-21 23:46:16   INFO  epoch: 8/30, acc_iter=56596, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:44:46, time_cost(all): 15:21:45/1 day, 16:10:50, loss=0.507583896343227, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=4.6707909990065914, lr=0.00076371512817607
2023-11-21 23:47:06   INFO  epoch: 8/30, acc_iter=56646, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:43:51, time_cost(all): 15:22:35/1 day, 16:12:35, loss=0.507500782356918, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=2.1279132696686682, lr=0.000763394393994146
2023-11-21 23:47:55   INFO  epoch: 8/30, acc_iter=56696, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:40:30, time_cost(all): 15:23:24/1 day, 16:22:30, loss=0.507417668370609, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=4.505654723424554, lr=0.000763073659812221
2023-11-21 23:48:44   INFO  epoch: 8/30, acc_iter=56746, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:39:53, time_cost(all): 15:24:13/1 day, 14:14:39, loss=0.5073345543843, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=0.5253060875072657, lr=0.000762752925630296
2023-11-21 23:49:33   INFO  epoch: 8/30, acc_iter=56796, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:39:48, time_cost(all): 15:25:02/1 day, 14:01:29, loss=0.507251440397991, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=2.5705957662993426, lr=0.000762432191448371
2023-11-21 23:50:22   INFO  epoch: 8/30, acc_iter=56846, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:40:27, time_cost(all): 15:25:51/1 day, 15:22:35, loss=0.507168326411682, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=0.5567517636411282, lr=0.000762111457266447
2023-11-21 23:51:11   INFO  epoch: 8/30, acc_iter=56896, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:38:55, time_cost(all): 15:26:40/1 day, 13:40:41, loss=0.507085212425372, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=1.3582887651246338, lr=0.000761790723084522
2023-11-21 23:52:00   INFO  epoch: 8/30, acc_iter=56946, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:29, time_cost(all): 15:27:29/1 day, 13:23:31, loss=0.507002098439063, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=3.7395678651499824, lr=0.000761469988902597
2023-11-21 23:52:49   INFO  epoch: 8/30, acc_iter=56996, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:39:05, time_cost(all): 15:28:18/1 day, 14:07:35, loss=0.506918984452754, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=4.084800570047271, lr=0.000761149254720673
2023-11-21 23:53:39   INFO  epoch: 8/30, acc_iter=57046, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:52, time_cost(all): 15:29:08/1 day, 15:01:34, loss=0.506835870466445, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=4.982665560108896, lr=0.000760828520538748
2023-11-21 23:54:28   INFO  epoch: 8/30, acc_iter=57096, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:03, time_cost(all): 15:29:57/1 day, 13:23:49, loss=0.506752756480136, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=3.3397149013623135, lr=0.000760507786356823
2023-11-21 23:55:17   INFO  epoch: 8/30, acc_iter=57146, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:34:05, time_cost(all): 15:30:46/1 day, 13:31:57, loss=0.506669642493827, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=3.8734520181096723, lr=0.000760187052174899
2023-11-21 23:56:06   INFO  epoch: 8/30, acc_iter=57196, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:58, time_cost(all): 15:31:35/1 day, 16:10:53, loss=0.506586528507518, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=0.8790882229293093, lr=0.000759866317992974
2023-11-21 23:56:55   INFO  epoch: 8/30, acc_iter=57246, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:11, time_cost(all): 15:32:24/1 day, 16:09:29, loss=0.506503414521209, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=2.267430298414774, lr=0.000759545583811049
2023-11-21 23:57:44   INFO  epoch: 8/30, acc_iter=57296, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:59, time_cost(all): 15:33:13/1 day, 15:38:11, loss=0.5064203005349, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=3.7228593300459782, lr=0.000759224849629124
2023-11-21 23:58:33   INFO  epoch: 8/30, acc_iter=57346, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:17, time_cost(all): 15:34:02/1 day, 15:57:15, loss=0.506337186548591, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=2.2797898991738825, lr=0.0007589041154472
2023-11-21 23:59:22   INFO  epoch: 8/30, acc_iter=57396, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:29:44, time_cost(all): 15:34:51/1 day, 13:21:10, loss=0.506254072562282, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=1.237251754770165, lr=0.000758583381265275
2023-11-22 00:00:11   INFO  epoch: 8/30, acc_iter=57446, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:00, time_cost(all): 15:35:40/1 day, 14:41:39, loss=0.506170958575973, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=0.5406764140294358, lr=0.00075826264708335
2023-11-22 00:01:01   INFO  epoch: 8/30, acc_iter=57496, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:27:53, time_cost(all): 15:36:30/1 day, 12:38:49, loss=0.506087844589664, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=0.7695378172178622, lr=0.000757941912901426
2023-11-22 00:01:50   INFO  epoch: 8/30, acc_iter=57546, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:48, time_cost(all): 15:37:19/1 day, 14:14:03, loss=0.506004730603355, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=4.404625379541638, lr=0.000757621178719501
2023-11-22 00:02:39   INFO  epoch: 8/30, acc_iter=57596, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:12, time_cost(all): 15:38:08/1 day, 15:36:25, loss=0.505921616617045, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=2.073476306377067, lr=0.000757300444537576
2023-11-22 00:03:28   INFO  epoch: 8/30, acc_iter=57646, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:12, time_cost(all): 15:38:57/1 day, 14:28:59, loss=0.505838502630736, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=1.1341809945111154, lr=0.000756979710355651
2023-11-22 00:04:17   INFO  epoch: 8/30, acc_iter=57696, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:48, time_cost(all): 15:39:46/1 day, 14:14:57, loss=0.505755388644427, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=4.258711226325856, lr=0.000756658976173727
2023-11-22 00:05:06   INFO  epoch: 8/30, acc_iter=57746, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:26:15, time_cost(all): 15:40:35/1 day, 14:09:13, loss=0.505672274658118, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=4.530217046306795, lr=0.000756338241991802
2023-11-22 00:05:55   INFO  epoch: 8/30, acc_iter=57796, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:40, time_cost(all): 15:41:24/1 day, 16:07:43, loss=0.505589160671809, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.2208757428255237, lr=0.000756017507809877
2023-11-22 00:06:44   INFO  epoch: 8/30, acc_iter=57846, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:49, time_cost(all): 15:42:13/1 day, 14:02:17, loss=0.5055060466855, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=3.7223180289149163, lr=0.000755696773627953
2023-11-22 00:07:34   INFO  epoch: 8/30, acc_iter=57896, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:24, time_cost(all): 15:43:03/1 day, 14:50:01, loss=0.505422932699191, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.83(1.03), norm=3.848107872539786, lr=0.000755376039446028
2023-11-22 00:08:23   INFO  epoch: 8/30, acc_iter=57946, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:53, time_cost(all): 15:43:52/1 day, 12:52:44, loss=0.505339818712882, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=4.891024278790462, lr=0.000755055305264103
2023-11-22 00:09:12   INFO  epoch: 8/30, acc_iter=57996, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:20, time_cost(all): 15:44:41/1 day, 14:20:02, loss=0.505256704726573, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=1.733868601416795, lr=0.000754734571082179
2023-11-22 00:10:01   INFO  epoch: 8/30, acc_iter=58046, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:22, time_cost(all): 15:45:30/1 day, 15:56:39, loss=0.505173590740264, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=0.5866935828721723, lr=0.000754413836900254
2023-11-22 00:10:50   INFO  epoch: 8/30, acc_iter=58096, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:20, time_cost(all): 15:46:19/1 day, 12:30:51, loss=0.505090476753955, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=3.882274730556852, lr=0.000754093102718329
2023-11-22 00:11:39   INFO  epoch: 8/30, acc_iter=58146, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:45, time_cost(all): 15:47:08/1 day, 13:57:26, loss=0.505007362767646, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=1.4251658134416483, lr=0.000753772368536405
2023-11-22 00:12:28   INFO  epoch: 8/30, acc_iter=58196, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:06, time_cost(all): 15:47:57/1 day, 12:33:05, loss=0.504924248781337, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=4.292853290699808, lr=0.00075345163435448
2023-11-22 00:13:17   INFO  epoch: 8/30, acc_iter=58246, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:22, time_cost(all): 15:48:46/1 day, 15:22:44, loss=0.504841134795028, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=0.5323905914872, lr=0.000753130900172555
2023-11-22 00:14:06   INFO  epoch: 8/30, acc_iter=58296, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:17, time_cost(all): 15:49:35/1 day, 12:53:29, loss=0.504758020808719, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=2.4013520294251, lr=0.00075281016599063
2023-11-22 00:14:56   INFO  epoch: 8/30, acc_iter=58346, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:58, time_cost(all): 15:50:25/1 day, 13:04:55, loss=0.50467490682241, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=4.708915675291878, lr=0.000752489431808706
2023-11-22 00:15:45   INFO  epoch: 8/30, acc_iter=58396, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:15:12, time_cost(all): 15:51:14/1 day, 12:24:18, loss=0.504591792836101, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=3.5199518969086085, lr=0.000752168697626781
2023-11-22 00:16:34   INFO  epoch: 8/30, acc_iter=58446, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:47, time_cost(all): 15:52:03/1 day, 15:16:31, loss=0.504508678849791, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=3.6789010882012754, lr=0.000751847963444856
2023-11-22 00:17:23   INFO  epoch: 8/30, acc_iter=58496, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:16, time_cost(all): 15:52:52/1 day, 12:22:14, loss=0.504425564863482, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=1.4827468722298396, lr=0.000751527229262931
2023-11-22 00:18:12   INFO  epoch: 8/30, acc_iter=58546, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:37, time_cost(all): 15:53:41/1 day, 15:10:46, loss=0.504342450877173, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=4.984200547933072, lr=0.000751206495081007
2023-11-22 00:19:01   INFO  epoch: 8/30, acc_iter=58596, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:46, time_cost(all): 15:54:30/1 day, 12:09:22, loss=0.504259336890864, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=4.113285051632531, lr=0.000750885760899082
2023-11-22 00:19:50   INFO  epoch: 8/30, acc_iter=58646, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:40, time_cost(all): 15:55:19/1 day, 13:08:25, loss=0.504176222904555, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.02(1.03), norm=4.858011213137153, lr=0.000750565026717157
2023-11-22 00:20:39   INFO  epoch: 8/30, acc_iter=58696, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:48, time_cost(all): 15:56:08/1 day, 14:40:14, loss=0.504093108918246, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=0.5578688712506444, lr=0.000750244292535233
2023-11-22 00:21:29   INFO  epoch: 8/30, acc_iter=58746, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:09:06, time_cost(all): 15:56:58/1 day, 14:50:53, loss=0.504009994931937, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=2.128022697294306, lr=0.000749923558353308
2023-11-22 00:22:18   INFO  epoch: 8/30, acc_iter=58796, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:55, time_cost(all): 15:57:47/1 day, 12:09:09, loss=0.503926880945628, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=1.574854290990074, lr=0.000749602824171383
2023-11-22 00:23:07   INFO  epoch: 8/30, acc_iter=58846, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:10, time_cost(all): 15:58:36/1 day, 13:44:01, loss=0.503843766959319, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=3.5218707740969077, lr=0.000749282089989459
2023-11-22 00:23:56   INFO  epoch: 8/30, acc_iter=58896, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:33, time_cost(all): 15:59:25/1 day, 14:11:43, loss=0.50376065297301, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=4.090857320139493, lr=0.000748961355807534
2023-11-22 00:24:45   INFO  epoch: 8/30, acc_iter=58946, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:44, time_cost(all): 16:00:14/1 day, 13:28:14, loss=0.503677538986701, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=3.8988382881255603, lr=0.000748640621625609
2023-11-22 00:25:34   INFO  epoch: 8/30, acc_iter=58996, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:30, time_cost(all): 16:01:03/1 day, 14:21:01, loss=0.503594425000392, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=4.8291424731382016, lr=0.000748319887443684
2023-11-22 00:26:23   INFO  epoch: 8/30, acc_iter=59046, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:45, time_cost(all): 16:01:52/1 day, 13:05:10, loss=0.503511311014083, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=2.9686795829958434, lr=0.00074799915326176
2023-11-22 00:27:12   INFO  epoch: 8/30, acc_iter=59096, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:54, time_cost(all): 16:02:41/1 day, 12:34:02, loss=0.503428197027774, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=1.813905163817397, lr=0.000747678419079835
2023-11-22 00:28:01   INFO  epoch: 8/30, acc_iter=59146, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:14, time_cost(all): 16:03:30/1 day, 13:10:14, loss=0.503345083041465, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=4.403451675337925, lr=0.00074735768489791
2023-11-22 00:28:51   INFO  epoch: 8/30, acc_iter=59196, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:27, time_cost(all): 16:04:20/1 day, 14:37:19, loss=0.503261969055156, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=2.086060003835935, lr=0.000747036950715986
2023-11-22 00:29:40   INFO  epoch: 8/30, acc_iter=59246, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:34, time_cost(all): 16:05:09/1 day, 14:02:59, loss=0.503178855068847, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=0.6777432602252571, lr=0.000746716216534061
2023-11-22 00:30:29   INFO  epoch: 9/30, acc_iter=59333, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:43:34, time_cost(all): 16:05:58/1 day, 13:23:24, loss=0.503034236732669, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=3.591326053323032, lr=0.000746158139057512
2023-11-22 00:31:18   INFO  epoch: 9/30, acc_iter=59383, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:44:27, time_cost(all): 16:06:47/1 day, 13:07:40, loss=0.50295112274636, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=4.744053854938523, lr=0.000745837404875587
2023-11-22 00:32:07   INFO  epoch: 9/30, acc_iter=59433, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:49:48, time_cost(all): 16:07:36/1 day, 13:40:53, loss=0.502868008760051, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=0.5154323480301235, lr=0.000745516670693663
2023-11-22 00:32:56   INFO  epoch: 9/30, acc_iter=59483, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:47:22, time_cost(all): 16:08:25/1 day, 13:00:18, loss=0.502784894773741, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=0.5807466219248834, lr=0.000745195936511738
2023-11-22 00:33:45   INFO  epoch: 9/30, acc_iter=59533, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:39:10, time_cost(all): 16:09:14/1 day, 13:41:04, loss=0.502701780787432, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=3.0377399864924053, lr=0.000744875202329813
2023-11-22 00:34:34   INFO  epoch: 9/30, acc_iter=59583, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:47:04, time_cost(all): 16:10:03/1 day, 13:03:15, loss=0.502618666801123, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=0.7512106694536206, lr=0.000744554468147888
2023-11-22 00:35:24   INFO  epoch: 9/30, acc_iter=59633, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:45:23, time_cost(all): 16:10:53/1 day, 13:44:52, loss=0.502535552814814, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=2.637077908053011, lr=0.000744233733965964
2023-11-22 00:36:13   INFO  epoch: 9/30, acc_iter=59683, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:44:04, time_cost(all): 16:11:42/1 day, 14:48:08, loss=0.502452438828505, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=1.2185580083143153, lr=0.000743912999784039
2023-11-22 00:37:02   INFO  epoch: 9/30, acc_iter=59733, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:45:27, time_cost(all): 16:12:31/1 day, 13:08:43, loss=0.502369324842196, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=4.826830970993947, lr=0.000743592265602114
2023-11-22 00:37:51   INFO  epoch: 9/30, acc_iter=59783, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:38:56, time_cost(all): 16:13:20/1 day, 13:52:14, loss=0.502286210855887, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=4.883679320445583, lr=0.00074327153142019
2023-11-22 00:38:40   INFO  epoch: 9/30, acc_iter=59833, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:37:52, time_cost(all): 16:14:09/1 day, 14:37:09, loss=0.502203096869578, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=0.9576979204393117, lr=0.000742950797238265
2023-11-22 00:39:29   INFO  epoch: 9/30, acc_iter=59883, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:42:03, time_cost(all): 16:14:58/1 day, 13:26:53, loss=0.502119982883269, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=3.0965603839976943, lr=0.00074263006305634
2023-11-22 00:40:18   INFO  epoch: 9/30, acc_iter=59933, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:38:45, time_cost(all): 16:15:47/1 day, 12:30:59, loss=0.50203686889696, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.9411052880262594, lr=0.000742309328874416
2023-11-22 00:41:07   INFO  epoch: 9/30, acc_iter=59983, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:33:52, time_cost(all): 16:16:36/1 day, 12:23:11, loss=0.501953754910651, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=0.9970329445929307, lr=0.000741988594692491
2023-11-22 00:41:56   INFO  epoch: 9/30, acc_iter=60033, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:35:37, time_cost(all): 16:17:25/1 day, 13:13:00, loss=0.501870640924342, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=2.108442507371216, lr=0.000741667860510566
2023-11-22 00:42:46   INFO  epoch: 9/30, acc_iter=60083, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:31:00, time_cost(all): 16:18:15/1 day, 12:41:31, loss=0.501787526938033, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=4.003474535853519, lr=0.000741347126328641
2023-11-22 00:43:35   INFO  epoch: 9/30, acc_iter=60133, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:35:54, time_cost(all): 16:19:04/1 day, 13:49:38, loss=0.501704412951724, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=4.636970786813559, lr=0.000741026392146717
2023-11-22 00:44:24   INFO  epoch: 9/30, acc_iter=60183, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:37:40, time_cost(all): 16:19:53/1 day, 12:35:11, loss=0.501621298965415, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=2.865669544743754, lr=0.000740705657964792
2023-11-22 00:45:13   INFO  epoch: 9/30, acc_iter=60233, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:28:50, time_cost(all): 16:20:42/1 day, 14:13:37, loss=0.501538184979106, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=4.87288084462888, lr=0.000740384923782867
2023-11-22 00:46:02   INFO  epoch: 9/30, acc_iter=60283, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:34:53, time_cost(all): 16:21:31/1 day, 14:48:06, loss=0.501455070992797, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=0.797501839958564, lr=0.000740064189600942
2023-11-22 00:46:51   INFO  epoch: 9/30, acc_iter=60333, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:29:02, time_cost(all): 16:22:20/1 day, 15:06:31, loss=0.501371957006487, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=3.450501587331543, lr=0.000739743455419018
2023-11-22 00:47:40   INFO  epoch: 9/30, acc_iter=60383, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:29:25, time_cost(all): 16:23:09/1 day, 14:50:21, loss=0.501288843020178, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=2.0944093320657524, lr=0.000739422721237093
2023-11-22 00:48:29   INFO  epoch: 9/30, acc_iter=60433, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:32:47, time_cost(all): 16:23:58/1 day, 14:46:44, loss=0.501205729033869, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=0.6792441132296316, lr=0.000739101987055168
2023-11-22 00:49:18   INFO  epoch: 9/30, acc_iter=60483, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:27:53, time_cost(all): 16:24:47/1 day, 15:05:26, loss=0.50112261504756, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.83(1.03), norm=0.6955259335836208, lr=0.000738781252873244
2023-11-22 00:50:08   INFO  epoch: 9/30, acc_iter=60533, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:31:28, time_cost(all): 16:25:37/1 day, 11:52:31, loss=0.501039501061251, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=2.031046561174182, lr=0.000738460518691319
2023-11-22 00:50:57   INFO  epoch: 9/30, acc_iter=60583, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:23:57, time_cost(all): 16:26:26/1 day, 12:58:16, loss=0.500956387074942, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=1.5175841282601559, lr=0.000738139784509394
2023-11-22 00:51:46   INFO  epoch: 9/30, acc_iter=60633, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:27:31, time_cost(all): 16:27:15/1 day, 15:11:38, loss=0.500873273088633, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=1.567496318194481, lr=0.00073781905032747
2023-11-22 00:52:35   INFO  epoch: 9/30, acc_iter=60683, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:28:29, time_cost(all): 16:28:04/1 day, 13:51:05, loss=0.500790159102324, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.6833162194478835, lr=0.000737498316145545
2023-11-22 00:53:24   INFO  epoch: 9/30, acc_iter=60733, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:21:21, time_cost(all): 16:28:53/1 day, 14:49:05, loss=0.500707045116015, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=3.461664278877639, lr=0.00073717758196362
2023-11-22 00:54:13   INFO  epoch: 9/30, acc_iter=60783, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:25:53, time_cost(all): 16:29:42/1 day, 13:30:20, loss=0.500623931129706, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=4.587836034012685, lr=0.000736856847781695
2023-11-22 00:55:02   INFO  epoch: 9/30, acc_iter=60833, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:19:44, time_cost(all): 16:30:31/1 day, 14:33:16, loss=0.500540817143397, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=0.8366486046632071, lr=0.000736536113599771
2023-11-22 00:55:51   INFO  epoch: 9/30, acc_iter=60883, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:17:34, time_cost(all): 16:31:20/1 day, 12:45:30, loss=0.500457703157088, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=4.147047103408523, lr=0.000736215379417846
2023-11-22 00:56:41   INFO  epoch: 9/30, acc_iter=60933, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:23:53, time_cost(all): 16:32:10/1 day, 12:23:16, loss=0.500374589170779, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=2.1336403373602986, lr=0.000735894645235921
2023-11-22 00:57:30   INFO  epoch: 9/30, acc_iter=60983, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:23:41, time_cost(all): 16:32:59/1 day, 12:41:50, loss=0.50029147518447, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=2.7799822843411173, lr=0.000735573911053997
2023-11-22 00:58:19   INFO  epoch: 9/30, acc_iter=61033, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:16:00, time_cost(all): 16:33:48/1 day, 11:55:34, loss=0.500208361198161, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=0.6465585124538202, lr=0.000735253176872072
2023-11-22 00:59:08   INFO  epoch: 9/30, acc_iter=61083, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:20:43, time_cost(all): 16:34:37/1 day, 13:59:09, loss=0.500125247211851, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=4.189937901846063, lr=0.000734932442690147
2023-11-22 00:59:57   INFO  epoch: 9/30, acc_iter=61133, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:17:49, time_cost(all): 16:35:26/1 day, 11:44:48, loss=0.500042133225542, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=1.0621028403543427, lr=0.000734611708508223
2023-11-22 01:00:46   INFO  epoch: 9/30, acc_iter=61183, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:13:13, time_cost(all): 16:36:15/1 day, 12:11:03, loss=0.499959019239233, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=2.193757427092921, lr=0.000734290974326298
2023-11-22 01:01:35   INFO  epoch: 9/30, acc_iter=61233, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:45, time_cost(all): 16:37:04/1 day, 11:42:10, loss=0.499875905252924, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=4.790069061073058, lr=0.000733970240144373
2023-11-22 01:02:24   INFO  epoch: 9/30, acc_iter=61283, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:12:22, time_cost(all): 16:37:53/1 day, 12:50:31, loss=0.499792791266615, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=4.40497987988821, lr=0.000733649505962448
2023-11-22 01:03:13   INFO  epoch: 9/30, acc_iter=61333, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:11:54, time_cost(all): 16:38:42/1 day, 13:39:55, loss=0.499709677280306, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=3.013933575433899, lr=0.000733328771780524
2023-11-22 01:04:03   INFO  epoch: 9/30, acc_iter=61383, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:11:28, time_cost(all): 16:39:32/1 day, 12:08:10, loss=0.499626563293997, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=1.4407900667198674, lr=0.000733008037598599
2023-11-22 01:04:52   INFO  epoch: 9/30, acc_iter=61433, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:14:40, time_cost(all): 16:40:21/1 day, 12:03:39, loss=0.499543449307688, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=1.3846846514704156, lr=0.000732687303416674
2023-11-22 01:05:41   INFO  epoch: 9/30, acc_iter=61483, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:13:54, time_cost(all): 16:41:10/1 day, 14:08:04, loss=0.499460335321379, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=0.5095887184605093, lr=0.00073236656923475
2023-11-22 01:06:30   INFO  epoch: 9/30, acc_iter=61533, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:11:07, time_cost(all): 16:41:59/1 day, 13:03:12, loss=0.49937722133507, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=2.900422489547263, lr=0.000732045835052825
2023-11-22 01:07:19   INFO  epoch: 9/30, acc_iter=61583, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:12:36, time_cost(all): 16:42:48/1 day, 11:36:52, loss=0.499294107348761, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=4.9488660186413345, lr=0.0007317251008709
2023-11-22 01:08:08   INFO  epoch: 9/30, acc_iter=61633, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:08:09, time_cost(all): 16:43:37/1 day, 14:19:44, loss=0.499210993362452, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=4.949167378569967, lr=0.000731404366688976
2023-11-22 01:08:57   INFO  epoch: 9/30, acc_iter=61683, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:10:44, time_cost(all): 16:44:26/1 day, 11:37:39, loss=0.499127879376143, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=1.4400650318615424, lr=0.000731083632507051
2023-11-22 01:09:46   INFO  epoch: 9/30, acc_iter=61733, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:07:18, time_cost(all): 16:45:15/1 day, 14:50:50, loss=0.499044765389834, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=3.587670856120752, lr=0.000730762898325126
2023-11-22 01:10:36   INFO  epoch: 9/30, acc_iter=61783, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:09:58, time_cost(all): 16:46:05/1 day, 14:23:45, loss=0.498961651403525, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=2.715758538251649, lr=0.000730442164143201
2023-11-22 01:11:25   INFO  epoch: 9/30, acc_iter=61833, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:05:18, time_cost(all): 16:46:54/1 day, 14:27:43, loss=0.498878537417216, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=4.342050117746157, lr=0.000730121429961277
2023-11-22 01:12:14   INFO  epoch: 9/30, acc_iter=61883, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:07:51, time_cost(all): 16:47:43/1 day, 13:57:09, loss=0.498795423430906, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=1.4319553660935167, lr=0.000729800695779352
2023-11-22 01:13:03   INFO  epoch: 9/30, acc_iter=61933, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:03:41, time_cost(all): 16:48:32/1 day, 14:57:31, loss=0.498712309444597, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.83(1.03), norm=3.244715994183276, lr=0.000729479961597427
2023-11-22 01:13:52   INFO  epoch: 9/30, acc_iter=61983, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:04:48, time_cost(all): 16:49:21/1 day, 12:59:05, loss=0.498629195458288, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=0.6308504935498354, lr=0.000729159227415503
2023-11-22 01:14:41   INFO  epoch: 9/30, acc_iter=62033, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/0:59:49, time_cost(all): 16:50:10/1 day, 12:58:33, loss=0.498546081471979, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=0.8284178978787832, lr=0.000728838493233578
2023-11-22 01:15:30   INFO  epoch: 9/30, acc_iter=62083, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:04:43, time_cost(all): 16:50:59/1 day, 13:35:44, loss=0.49846296748567, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=3.825792584635063, lr=0.000728517759051653
2023-11-22 01:16:19   INFO  epoch: 9/30, acc_iter=62133, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:03:25, time_cost(all): 16:51:48/1 day, 13:39:58, loss=0.498379853499361, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=2.76664115805249, lr=0.000728197024869729
2023-11-22 01:17:08   INFO  epoch: 9/30, acc_iter=62183, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:01:00, time_cost(all): 16:52:37/1 day, 12:43:44, loss=0.498296739513052, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=2.101248976675252, lr=0.000727876290687804
2023-11-22 01:17:58   INFO  epoch: 9/30, acc_iter=62233, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:58:59, time_cost(all): 16:53:27/1 day, 13:08:20, loss=0.498213625526743, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=4.563002565377625, lr=0.000727555556505879
2023-11-22 01:18:47   INFO  epoch: 9/30, acc_iter=62283, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:59:15, time_cost(all): 16:54:16/1 day, 11:14:36, loss=0.498130511540434, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=2.408267430510974, lr=0.000727234822323954
2023-11-22 01:19:36   INFO  epoch: 9/30, acc_iter=62333, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/1:00:47, time_cost(all): 16:55:05/1 day, 11:23:17, loss=0.498047397554125, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=0.6165406051631697, lr=0.00072691408814203
2023-11-22 01:20:25   INFO  epoch: 9/30, acc_iter=62383, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:55:34, time_cost(all): 16:55:54/1 day, 14:14:18, loss=0.497964283567816, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=3.6852322032869114, lr=0.000726593353960105
2023-11-22 01:21:14   INFO  epoch: 9/30, acc_iter=62433, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:57, time_cost(all): 16:56:43/1 day, 13:04:43, loss=0.497881169581507, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=2.2504547626863483, lr=0.00072627261977818
2023-11-22 01:22:03   INFO  epoch: 9/30, acc_iter=62483, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:55:10, time_cost(all): 16:57:32/1 day, 13:49:57, loss=0.497798055595198, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=1.5633558574167905, lr=0.000725951885596256
2023-11-22 01:22:52   INFO  epoch: 9/30, acc_iter=62533, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:55:17, time_cost(all): 16:58:21/1 day, 13:52:26, loss=0.497714941608889, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=4.932937149701397, lr=0.000725631151414331
2023-11-22 01:23:41   INFO  epoch: 9/30, acc_iter=62583, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:56:02, time_cost(all): 16:59:10/1 day, 14:09:49, loss=0.49763182762258, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=4.327852731373284, lr=0.000725310417232406
2023-11-22 01:24:31   INFO  epoch: 9/30, acc_iter=62633, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:54:42, time_cost(all): 17:00:00/1 day, 11:29:22, loss=0.497548713636271, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=4.373786103361163, lr=0.000724989683050482
2023-11-22 01:25:20   INFO  epoch: 9/30, acc_iter=62683, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:52:49, time_cost(all): 17:00:49/1 day, 12:08:15, loss=0.497465599649962, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=3.0121120917083983, lr=0.000724668948868557
2023-11-22 01:26:09   INFO  epoch: 9/30, acc_iter=62733, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:10, time_cost(all): 17:01:38/1 day, 13:08:56, loss=0.497382485663652, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=3.235348666989545, lr=0.000724348214686632
2023-11-22 01:26:58   INFO  epoch: 9/30, acc_iter=62783, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:45, time_cost(all): 17:02:27/1 day, 11:41:52, loss=0.497299371677343, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=0.8742712920263302, lr=0.000724027480504707
2023-11-22 01:27:47   INFO  epoch: 9/30, acc_iter=62833, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:51:12, time_cost(all): 17:03:16/1 day, 12:13:56, loss=0.497216257691034, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=2.7775730920834705, lr=0.000723706746322783
2023-11-22 01:28:36   INFO  epoch: 9/30, acc_iter=62883, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:50:45, time_cost(all): 17:04:05/1 day, 12:12:59, loss=0.497133143704725, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=1.8208625594771288, lr=0.000723386012140858
2023-11-22 01:29:25   INFO  epoch: 9/30, acc_iter=62933, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:46:38, time_cost(all): 17:04:54/1 day, 14:37:57, loss=0.497050029718416, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=2.004882025803384, lr=0.000723065277958933
2023-11-22 01:30:14   INFO  epoch: 9/30, acc_iter=62983, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:41, time_cost(all): 17:05:43/1 day, 13:20:01, loss=0.496966915732107, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=0.829032142137651, lr=0.000722744543777009
2023-11-22 01:31:03   INFO  epoch: 9/30, acc_iter=63033, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:45:40, time_cost(all): 17:06:32/1 day, 13:20:02, loss=0.496883801745798, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=0.6831136581901698, lr=0.000722423809595084
2023-11-22 01:31:53   INFO  epoch: 9/30, acc_iter=63083, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:45:39, time_cost(all): 17:07:22/1 day, 11:15:41, loss=0.496800687759489, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=2.543455784147811, lr=0.000722103075413159
2023-11-22 01:32:42   INFO  epoch: 9/30, acc_iter=63133, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:45:55, time_cost(all): 17:08:11/1 day, 11:14:31, loss=0.49671757377318, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=1.3941560332494765, lr=0.000721782341231234
2023-11-22 01:33:31   INFO  epoch: 9/30, acc_iter=63183, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:28, time_cost(all): 17:09:00/1 day, 10:58:03, loss=0.496634459786871, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=3.3602355157777, lr=0.00072146160704931
2023-11-22 01:34:20   INFO  epoch: 9/30, acc_iter=63233, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:25, time_cost(all): 17:09:49/1 day, 13:15:07, loss=0.496551345800562, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=4.0367927392515295, lr=0.000721140872867385
2023-11-22 01:35:09   INFO  epoch: 9/30, acc_iter=63283, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:41:07, time_cost(all): 17:10:38/1 day, 11:37:26, loss=0.496468231814253, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.23(1.03), norm=2.103469169065347, lr=0.00072082013868546
2023-11-22 01:35:58   INFO  epoch: 9/30, acc_iter=63333, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:15, time_cost(all): 17:11:27/1 day, 12:26:22, loss=0.496385117827944, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=2.061149289601234, lr=0.000720499404503536
2023-11-22 01:36:47   INFO  epoch: 9/30, acc_iter=63383, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:42:20, time_cost(all): 17:12:16/1 day, 12:36:31, loss=0.496302003841635, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=4.917637276289143, lr=0.000720178670321611
2023-11-22 01:37:36   INFO  epoch: 9/30, acc_iter=63433, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:13, time_cost(all): 17:13:05/1 day, 14:22:31, loss=0.496218889855326, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=2.612791070984608, lr=0.000719857936139686
2023-11-22 01:38:26   INFO  epoch: 9/30, acc_iter=63483, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:37:42, time_cost(all): 17:13:55/1 day, 11:50:54, loss=0.496135775869016, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=3.454714586522705, lr=0.000719537201957762
2023-11-22 01:39:15   INFO  epoch: 9/30, acc_iter=63533, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:56, time_cost(all): 17:14:44/1 day, 11:51:10, loss=0.496052661882707, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=2.915295578244727, lr=0.000719216467775837
2023-11-22 01:40:04   INFO  epoch: 9/30, acc_iter=63583, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:38:06, time_cost(all): 17:15:33/1 day, 12:43:49, loss=0.495969547896398, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=1.8566427449384013, lr=0.000718895733593912
2023-11-22 01:40:53   INFO  epoch: 9/30, acc_iter=63633, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:35:34, time_cost(all): 17:16:22/1 day, 13:54:38, loss=0.495886433910089, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=4.919164143437477, lr=0.000718574999411987
2023-11-22 01:41:42   INFO  epoch: 9/30, acc_iter=63683, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:55, time_cost(all): 17:17:11/1 day, 14:09:27, loss=0.49580331992378, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=1.6213644782908727, lr=0.000718254265230063
2023-11-22 01:42:31   INFO  epoch: 9/30, acc_iter=63733, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:33:47, time_cost(all): 17:18:00/1 day, 13:18:44, loss=0.495720205937471, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=1.249645810957666, lr=0.000717933531048138
2023-11-22 01:43:20   INFO  epoch: 9/30, acc_iter=63783, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:35:41, time_cost(all): 17:18:49/1 day, 13:48:19, loss=0.495637091951162, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=1.209041470069538, lr=0.000717612796866213
2023-11-22 01:44:09   INFO  epoch: 9/30, acc_iter=63833, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:32:05, time_cost(all): 17:19:38/1 day, 14:11:27, loss=0.495553977964853, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=1.7323837038238852, lr=0.000717292062684289
2023-11-22 01:44:58   INFO  epoch: 9/30, acc_iter=63883, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:17, time_cost(all): 17:20:27/1 day, 13:42:03, loss=0.495470863978544, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=4.005123394398412, lr=0.000716971328502364
2023-11-22 01:45:48   INFO  epoch: 9/30, acc_iter=63933, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:23, time_cost(all): 17:21:17/1 day, 14:11:01, loss=0.495387749992235, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=2.3776786285589826, lr=0.000716650594320439
2023-11-22 01:46:37   INFO  epoch: 9/30, acc_iter=63983, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:46, time_cost(all): 17:22:06/1 day, 13:29:53, loss=0.495304636005926, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=0.820231919921998, lr=0.000716329860138514
2023-11-22 01:47:26   INFO  epoch: 9/30, acc_iter=64033, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:47, time_cost(all): 17:22:55/1 day, 12:17:39, loss=0.495221522019617, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=3.432317950240006, lr=0.00071600912595659
2023-11-22 01:48:15   INFO  epoch: 9/30, acc_iter=64083, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:27:57, time_cost(all): 17:23:44/1 day, 12:23:15, loss=0.495138408033308, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=2.128739423231007, lr=0.000715688391774665
2023-11-22 01:49:04   INFO  epoch: 9/30, acc_iter=64133, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:11, time_cost(all): 17:24:33/1 day, 12:39:15, loss=0.495055294046999, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=0.8650247384787396, lr=0.00071536765759274
2023-11-22 01:49:53   INFO  epoch: 9/30, acc_iter=64183, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:26:57, time_cost(all): 17:25:22/1 day, 11:22:40, loss=0.49497218006069, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=2.0020791189611176, lr=0.000715046923410816
2023-11-22 01:50:42   INFO  epoch: 9/30, acc_iter=64233, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:25:49, time_cost(all): 17:26:11/1 day, 11:24:02, loss=0.494889066074381, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=3.643725856422807, lr=0.000714726189228891
2023-11-22 01:51:31   INFO  epoch: 9/30, acc_iter=64283, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:27:11, time_cost(all): 17:27:00/1 day, 14:10:27, loss=0.494805952088071, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=4.306570477285814, lr=0.000714405455046966
2023-11-22 01:52:21   INFO  epoch: 9/30, acc_iter=64333, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:26:09, time_cost(all): 17:27:50/1 day, 10:48:40, loss=0.494722838101762, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.433954316443785, lr=0.000714084720865042
2023-11-22 01:53:10   INFO  epoch: 9/30, acc_iter=64383, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:46, time_cost(all): 17:28:39/1 day, 13:39:14, loss=0.494639724115453, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.09(1.03), norm=1.334505891737179, lr=0.000713763986683117
2023-11-22 01:53:59   INFO  epoch: 9/30, acc_iter=64433, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:28, time_cost(all): 17:29:28/1 day, 11:17:12, loss=0.494556610129144, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=4.37903480310023, lr=0.000713443252501192
2023-11-22 01:54:48   INFO  epoch: 9/30, acc_iter=64483, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:27, time_cost(all): 17:30:17/1 day, 13:28:54, loss=0.494473496142835, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=3.2240786790646108, lr=0.000713122518319267
2023-11-22 01:55:37   INFO  epoch: 9/30, acc_iter=64533, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:24, time_cost(all): 17:31:06/1 day, 14:03:21, loss=0.494390382156526, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=4.998280860752766, lr=0.000712801784137343
2023-11-22 01:56:26   INFO  epoch: 9/30, acc_iter=64583, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:38, time_cost(all): 17:31:55/1 day, 13:07:04, loss=0.494307268170217, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=3.3785433336711144, lr=0.000712481049955418
2023-11-22 01:57:15   INFO  epoch: 9/30, acc_iter=64633, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:15, time_cost(all): 17:32:44/1 day, 12:37:15, loss=0.494224154183908, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=1.1376457166921599, lr=0.000712160315773493
2023-11-22 01:58:04   INFO  epoch: 9/30, acc_iter=64683, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:39, time_cost(all): 17:33:33/1 day, 10:44:37, loss=0.494141040197599, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=3.366360709367484, lr=0.000711839581591569
2023-11-22 01:58:53   INFO  epoch: 9/30, acc_iter=64733, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:57, time_cost(all): 17:34:22/1 day, 13:04:14, loss=0.49405792621129, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=2.9099780081233684, lr=0.000711518847409644
2023-11-22 01:59:43   INFO  epoch: 9/30, acc_iter=64783, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:49, time_cost(all): 17:35:12/1 day, 12:05:55, loss=0.493974812224981, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=4.379741516984924, lr=0.000711198113227719
2023-11-22 02:00:32   INFO  epoch: 9/30, acc_iter=64833, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:35, time_cost(all): 17:36:01/1 day, 13:49:42, loss=0.493891698238672, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=2.6348888275018836, lr=0.000710877379045794
2023-11-22 02:01:21   INFO  epoch: 9/30, acc_iter=64883, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:27, time_cost(all): 17:36:50/1 day, 12:08:11, loss=0.493808584252363, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=2.210831547102609, lr=0.00071055664486387
2023-11-22 02:02:10   INFO  epoch: 9/30, acc_iter=64933, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:14:40, time_cost(all): 17:37:39/1 day, 14:01:06, loss=0.493725470266054, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=3.106191115158407, lr=0.000710235910681945
2023-11-22 02:02:59   INFO  epoch: 9/30, acc_iter=64983, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:13:55, time_cost(all): 17:38:28/1 day, 10:54:47, loss=0.493642356279745, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=4.830970416305163, lr=0.00070991517650002
2023-11-22 02:03:48   INFO  epoch: 9/30, acc_iter=65033, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:06, time_cost(all): 17:39:17/1 day, 13:16:44, loss=0.493559242293436, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=1.3613867871855836, lr=0.000709594442318096
2023-11-22 02:04:37   INFO  epoch: 9/30, acc_iter=65083, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:58, time_cost(all): 17:40:06/1 day, 13:30:16, loss=0.493476128307126, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=1.7103903625465342, lr=0.000709273708136171
2023-11-22 02:05:26   INFO  epoch: 9/30, acc_iter=65133, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:11, time_cost(all): 17:40:55/1 day, 10:56:59, loss=0.493393014320817, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=4.389606193489687, lr=0.000708952973954246
2023-11-22 02:06:16   INFO  epoch: 9/30, acc_iter=65183, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:29, time_cost(all): 17:41:45/1 day, 11:31:26, loss=0.493309900334508, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=4.397772196742103, lr=0.000708632239772322
2023-11-22 02:07:05   INFO  epoch: 9/30, acc_iter=65233, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:32, time_cost(all): 17:42:34/1 day, 11:11:43, loss=0.493226786348199, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=4.420500769458962, lr=0.000708311505590397
2023-11-22 02:07:54   INFO  epoch: 9/30, acc_iter=65283, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:51, time_cost(all): 17:43:23/1 day, 10:38:10, loss=0.49314367236189, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=0.7227731159942028, lr=0.000707990771408472
2023-11-22 02:08:43   INFO  epoch: 9/30, acc_iter=65333, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:57, time_cost(all): 17:44:12/1 day, 11:39:55, loss=0.493060558375581, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=4.8358685475065695, lr=0.000707670037226547
2023-11-22 02:09:32   INFO  epoch: 9/30, acc_iter=65383, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:22, time_cost(all): 17:45:01/1 day, 10:33:33, loss=0.492977444389272, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=4.129774695304305, lr=0.000707349303044623
2023-11-22 02:10:21   INFO  epoch: 9/30, acc_iter=65433, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:00, time_cost(all): 17:45:50/1 day, 11:15:29, loss=0.492894330402963, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=0.7061990997127202, lr=0.000707028568862698
2023-11-22 02:11:10   INFO  epoch: 9/30, acc_iter=65483, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:27, time_cost(all): 17:46:39/1 day, 12:43:16, loss=0.492811216416654, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=0.9560982193539853, lr=0.000706707834680773
2023-11-22 02:11:59   INFO  epoch: 9/30, acc_iter=65533, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:19, time_cost(all): 17:47:28/1 day, 11:12:01, loss=0.492728102430345, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=2.683123468975703, lr=0.000706387100498849
2023-11-22 02:12:48   INFO  epoch: 9/30, acc_iter=65583, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:45, time_cost(all): 17:48:17/1 day, 10:25:46, loss=0.492644988444036, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=4.768457420338398, lr=0.000706066366316924
2023-11-22 02:13:38   INFO  epoch: 9/30, acc_iter=65633, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:44, time_cost(all): 17:49:07/1 day, 11:56:21, loss=0.492561874457727, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=4.176683747463558, lr=0.000705745632134999
2023-11-22 02:14:27   INFO  epoch: 9/30, acc_iter=65683, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:10, time_cost(all): 17:49:56/1 day, 12:44:39, loss=0.492478760471418, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=2.5079106479780613, lr=0.000705424897953074
2023-11-22 02:15:16   INFO  epoch: 9/30, acc_iter=65733, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:08, time_cost(all): 17:50:45/1 day, 12:51:20, loss=0.492395646485109, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=2.931444136307676, lr=0.00070510416377115
2023-11-22 02:16:05   INFO  epoch: 9/30, acc_iter=65783, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:21, time_cost(all): 17:51:34/1 day, 12:50:35, loss=0.4923125324988, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=2.2814439846825527, lr=0.000704783429589225
2023-11-22 02:16:54   INFO  epoch: 9/30, acc_iter=65833, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:34, time_cost(all): 17:52:23/1 day, 11:14:07, loss=0.492229418512491, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=2.4714252942511674, lr=0.0007044626954073
2023-11-22 02:17:43   INFO  epoch: 10/30, acc_iter=65920, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:46:12, time_cost(all): 17:53:12/1 day, 13:06:46, loss=0.492084800176313, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=1.3926404258337453, lr=0.000703904617930751
2023-11-22 02:18:32   INFO  epoch: 10/30, acc_iter=65970, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:43:15, time_cost(all): 17:54:01/1 day, 11:59:49, loss=0.492001686190004, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=0.9232500011135123, lr=0.000703583883748827
2023-11-22 02:19:21   INFO  epoch: 10/30, acc_iter=66020, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:48:31, time_cost(all): 17:54:50/1 day, 11:23:49, loss=0.491918572203695, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=4.7921490428156215, lr=0.000703263149566902
2023-11-22 02:20:11   INFO  epoch: 10/30, acc_iter=66070, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:46:33, time_cost(all): 17:55:40/1 day, 11:31:29, loss=0.491835458217386, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=1.8931510619252163, lr=0.000702942415384977
2023-11-22 02:21:00   INFO  epoch: 10/30, acc_iter=66120, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:43:23, time_cost(all): 17:56:29/1 day, 10:51:09, loss=0.491752344231076, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=4.752899350135491, lr=0.000702621681203053
2023-11-22 02:21:49   INFO  epoch: 10/30, acc_iter=66170, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:38:30, time_cost(all): 17:57:18/1 day, 11:37:09, loss=0.491669230244767, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=1.190181936516241, lr=0.000702300947021128
2023-11-22 02:22:38   INFO  epoch: 10/30, acc_iter=66220, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:46:59, time_cost(all): 17:58:07/1 day, 12:53:59, loss=0.491586116258458, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=0.9125059140118823, lr=0.000701980212839203
2023-11-22 02:23:27   INFO  epoch: 10/30, acc_iter=66270, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:38:23, time_cost(all): 17:58:56/1 day, 13:36:20, loss=0.491503002272149, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=0.7574646867573398, lr=0.000701659478657279
2023-11-22 02:24:16   INFO  epoch: 10/30, acc_iter=66320, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:37:33, time_cost(all): 17:59:45/1 day, 11:47:48, loss=0.49141988828584, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=3.5175191491471294, lr=0.000701338744475354
2023-11-22 02:25:05   INFO  epoch: 10/30, acc_iter=66370, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:40:52, time_cost(all): 18:00:34/1 day, 10:59:31, loss=0.491336774299531, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=2.8607651831530485, lr=0.000701018010293429
2023-11-22 02:25:54   INFO  epoch: 10/30, acc_iter=66420, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:33:57, time_cost(all): 18:01:23/1 day, 12:15:41, loss=0.491253660313222, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=4.376304304942017, lr=0.000700697276111504
2023-11-22 02:26:43   INFO  epoch: 10/30, acc_iter=66470, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:41:42, time_cost(all): 18:02:12/1 day, 10:43:49, loss=0.491170546326913, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=0.8596101058565848, lr=0.00070037654192958
2023-11-22 02:27:33   INFO  epoch: 10/30, acc_iter=66520, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:32:58, time_cost(all): 18:03:02/1 day, 11:41:05, loss=0.491087432340604, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=0.8392070876749149, lr=0.000700055807747655
2023-11-22 02:28:22   INFO  epoch: 10/30, acc_iter=66570, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:34:32, time_cost(all): 18:03:51/1 day, 11:30:15, loss=0.491004318354295, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=4.456346709470559, lr=0.00069973507356573
2023-11-22 02:29:11   INFO  epoch: 10/30, acc_iter=66620, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:31:05, time_cost(all): 18:04:40/1 day, 12:10:51, loss=0.490921204367986, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=4.790247539009997, lr=0.000699414339383806
2023-11-22 02:30:00   INFO  epoch: 10/30, acc_iter=66670, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:38:16, time_cost(all): 18:05:29/1 day, 11:02:52, loss=0.490838090381677, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=4.402371011265689, lr=0.000699093605201881
2023-11-22 02:30:49   INFO  epoch: 10/30, acc_iter=66720, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:33:31, time_cost(all): 18:06:18/1 day, 10:48:41, loss=0.490754976395368, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=2.37739221966054, lr=0.000698772871019956
2023-11-22 02:31:38   INFO  epoch: 10/30, acc_iter=66770, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:33:20, time_cost(all): 18:07:07/1 day, 10:52:03, loss=0.490671862409059, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=2.621708096305892, lr=0.000698452136838031
2023-11-22 02:32:27   INFO  epoch: 10/30, acc_iter=66820, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:30:52, time_cost(all): 18:07:56/1 day, 12:40:03, loss=0.49058874842275, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=1.4715213706530943, lr=0.000698131402656107
2023-11-22 02:33:16   INFO  epoch: 10/30, acc_iter=66870, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:29:42, time_cost(all): 18:08:45/1 day, 12:36:50, loss=0.490505634436441, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=3.2989737960841308, lr=0.000697810668474182
2023-11-22 02:34:05   INFO  epoch: 10/30, acc_iter=66920, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:29:49, time_cost(all): 18:09:34/1 day, 12:46:09, loss=0.490422520450132, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=3.038594609741397, lr=0.000697489934292257
2023-11-22 02:34:55   INFO  epoch: 10/30, acc_iter=66970, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:31:26, time_cost(all): 18:10:24/1 day, 10:44:52, loss=0.490339406463822, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=0.9083270269926713, lr=0.000697169200110333
2023-11-22 02:35:44   INFO  epoch: 10/30, acc_iter=67020, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:32:09, time_cost(all): 18:11:13/1 day, 10:35:37, loss=0.490256292477513, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=2.2979963161745873, lr=0.000696848465928408
2023-11-22 02:36:33   INFO  epoch: 10/30, acc_iter=67070, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:25:48, time_cost(all): 18:12:02/1 day, 10:01:01, loss=0.490173178491204, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=1.999395879711836, lr=0.000696527731746483
2023-11-22 02:37:22   INFO  epoch: 10/30, acc_iter=67120, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:24:17, time_cost(all): 18:12:51/1 day, 10:06:30, loss=0.490090064504895, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=4.265279434463253, lr=0.000696206997564559
2023-11-22 02:38:11   INFO  epoch: 10/30, acc_iter=67170, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:26:21, time_cost(all): 18:13:40/1 day, 11:49:42, loss=0.490006950518586, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=0.6112127203038897, lr=0.000695886263382634
2023-11-22 02:39:00   INFO  epoch: 10/30, acc_iter=67220, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:23:36, time_cost(all): 18:14:29/1 day, 11:08:52, loss=0.489923836532277, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=1.4746801897856456, lr=0.000695565529200709
2023-11-22 02:39:49   INFO  epoch: 10/30, acc_iter=67270, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:27:05, time_cost(all): 18:15:18/1 day, 10:31:54, loss=0.489840722545968, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=1.0176807077334165, lr=0.000695244795018784
2023-11-22 02:40:38   INFO  epoch: 10/30, acc_iter=67320, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:26:15, time_cost(all): 18:16:07/1 day, 12:14:58, loss=0.489757608559659, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=4.90855583667137, lr=0.00069492406083686
2023-11-22 02:41:28   INFO  epoch: 10/30, acc_iter=67370, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:23:00, time_cost(all): 18:16:57/1 day, 9:52:17, loss=0.48967449457335, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=0.9451212737658616, lr=0.000694603326654935
2023-11-22 02:42:17   INFO  epoch: 10/30, acc_iter=67420, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:25:44, time_cost(all): 18:17:46/1 day, 12:29:26, loss=0.489591380587041, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=2.1415507840096497, lr=0.00069428259247301
2023-11-22 02:43:06   INFO  epoch: 10/30, acc_iter=67470, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:18:55, time_cost(all): 18:18:35/1 day, 9:58:13, loss=0.489508266600732, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.036796618979757, lr=0.000693961858291086
2023-11-22 02:43:55   INFO  epoch: 10/30, acc_iter=67520, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:24:19, time_cost(all): 18:19:24/1 day, 10:55:26, loss=0.489425152614423, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=0.7694545164301541, lr=0.000693641124109161
2023-11-22 02:44:44   INFO  epoch: 10/30, acc_iter=67570, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:23:34, time_cost(all): 18:20:13/1 day, 10:42:51, loss=0.489342038628114, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=4.695340530217258, lr=0.000693320389927236
2023-11-22 02:45:33   INFO  epoch: 10/30, acc_iter=67620, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:21:26, time_cost(all): 18:21:02/1 day, 11:57:05, loss=0.489258924641805, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=3.032247217909218, lr=0.000692999655745311
2023-11-22 02:46:22   INFO  epoch: 10/30, acc_iter=67670, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:16:53, time_cost(all): 18:21:51/1 day, 10:34:09, loss=0.489175810655496, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.200595337253296, lr=0.000692678921563387
2023-11-22 02:47:11   INFO  epoch: 10/30, acc_iter=67720, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:16:51, time_cost(all): 18:22:40/1 day, 12:12:28, loss=0.489092696669186, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=4.271334484467948, lr=0.000692358187381462
2023-11-22 02:48:00   INFO  epoch: 10/30, acc_iter=67770, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:14:48, time_cost(all): 18:23:29/1 day, 10:49:57, loss=0.489009582682877, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=0.9042443185238624, lr=0.000692037453199537
2023-11-22 02:48:50   INFO  epoch: 10/30, acc_iter=67820, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:20, time_cost(all): 18:24:19/1 day, 10:00:27, loss=0.488926468696568, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=4.794369666863815, lr=0.000691716719017613
2023-11-22 02:49:39   INFO  epoch: 10/30, acc_iter=67870, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:17:53, time_cost(all): 18:25:08/1 day, 13:01:16, loss=0.488843354710259, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=2.09036325371942, lr=0.000691395984835688
2023-11-22 02:50:28   INFO  epoch: 10/30, acc_iter=67920, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:16:01, time_cost(all): 18:25:57/1 day, 11:52:35, loss=0.48876024072395, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=1.7150575068730967, lr=0.000691075250653763
2023-11-22 02:51:17   INFO  epoch: 10/30, acc_iter=67970, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:16:07, time_cost(all): 18:26:46/1 day, 11:58:51, loss=0.488677126737641, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=0.9989183341259908, lr=0.000690754516471839
2023-11-22 02:52:06   INFO  epoch: 10/30, acc_iter=68020, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:15:44, time_cost(all): 18:27:35/1 day, 10:36:18, loss=0.488594012751332, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=0.6079717574495962, lr=0.000690433782289914
2023-11-22 02:52:55   INFO  epoch: 10/30, acc_iter=68070, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:15:06, time_cost(all): 18:28:24/1 day, 10:42:47, loss=0.488510898765023, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=4.716590791688384, lr=0.000690113048107989
2023-11-22 02:53:44   INFO  epoch: 10/30, acc_iter=68120, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:09:52, time_cost(all): 18:29:13/1 day, 12:17:24, loss=0.488427784778714, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=1.371632451024572, lr=0.000689792313926064
2023-11-22 02:54:33   INFO  epoch: 10/30, acc_iter=68170, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:11:38, time_cost(all): 18:30:02/1 day, 12:16:18, loss=0.488344670792405, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=1.4959183197752455, lr=0.00068947157974414
2023-11-22 02:55:23   INFO  epoch: 10/30, acc_iter=68220, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:08:30, time_cost(all): 18:30:52/1 day, 12:49:39, loss=0.488261556806096, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=4.817818074823648, lr=0.000689150845562215
2023-11-22 02:56:12   INFO  epoch: 10/30, acc_iter=68270, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:06:51, time_cost(all): 18:31:41/1 day, 11:18:48, loss=0.488178442819787, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=1.3125942642002997, lr=0.00068883011138029
2023-11-22 02:57:01   INFO  epoch: 10/30, acc_iter=68320, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:30, time_cost(all): 18:32:30/1 day, 12:35:40, loss=0.488095328833478, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=0.5732656845559627, lr=0.000688509377198366
2023-11-22 02:57:50   INFO  epoch: 10/30, acc_iter=68370, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:06:37, time_cost(all): 18:33:19/1 day, 9:50:48, loss=0.488012214847169, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=0.9197846850620082, lr=0.000688188643016441
2023-11-22 02:58:39   INFO  epoch: 10/30, acc_iter=68420, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:05:07, time_cost(all): 18:34:08/1 day, 11:57:54, loss=0.48792910086086, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=0.7331241174663898, lr=0.000687867908834516
2023-11-22 02:59:28   INFO  epoch: 10/30, acc_iter=68470, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:06:59, time_cost(all): 18:34:57/1 day, 10:50:41, loss=0.487845986874551, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.2(1.03), norm=2.0417725909057625, lr=0.000687547174652591
2023-11-22 03:00:17   INFO  epoch: 10/30, acc_iter=68520, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:14, time_cost(all): 18:35:46/1 day, 9:36:55, loss=0.487762872888241, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=3.354129291497944, lr=0.000687226440470667
2023-11-22 03:01:06   INFO  epoch: 10/30, acc_iter=68570, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:00:57, time_cost(all): 18:36:35/1 day, 10:19:36, loss=0.487679758901932, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=2.635003228440243, lr=0.000686905706288742
2023-11-22 03:01:55   INFO  epoch: 10/30, acc_iter=68620, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:03:05, time_cost(all): 18:37:24/1 day, 9:36:50, loss=0.487596644915623, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=4.398685452570068, lr=0.000686584972106817
2023-11-22 03:02:45   INFO  epoch: 10/30, acc_iter=68670, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:03:18, time_cost(all): 18:38:14/1 day, 10:16:18, loss=0.487513530929314, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=0.8820786616843748, lr=0.000686264237924893
2023-11-22 03:03:34   INFO  epoch: 10/30, acc_iter=68720, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:01:15, time_cost(all): 18:39:03/1 day, 10:37:49, loss=0.487430416943005, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=1.5873696716012666, lr=0.000685943503742968
2023-11-22 03:04:23   INFO  epoch: 10/30, acc_iter=68770, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:58:50, time_cost(all): 18:39:52/1 day, 12:01:15, loss=0.487347302956696, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=1.1655594564868816, lr=0.000685622769561043
2023-11-22 03:05:12   INFO  epoch: 10/30, acc_iter=68820, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:58:14, time_cost(all): 18:40:41/1 day, 11:22:59, loss=0.487264188970387, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=1.3087211304930126, lr=0.000685302035379119
2023-11-22 03:06:01   INFO  epoch: 10/30, acc_iter=68870, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:55:56, time_cost(all): 18:41:30/1 day, 10:10:38, loss=0.487181074984078, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=2.4313165637850056, lr=0.000684981301197194
2023-11-22 03:06:50   INFO  epoch: 10/30, acc_iter=68920, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:55:08, time_cost(all): 18:42:19/1 day, 10:04:37, loss=0.487097960997769, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=1.0097423682127016, lr=0.000684660567015269
2023-11-22 03:07:39   INFO  epoch: 10/30, acc_iter=68970, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:56:35, time_cost(all): 18:43:08/1 day, 10:29:38, loss=0.48701484701146, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=0.8903538436856141, lr=0.000684339832833344
2023-11-22 03:08:28   INFO  epoch: 10/30, acc_iter=69020, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:55:08, time_cost(all): 18:43:57/1 day, 10:21:14, loss=0.486931733025151, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=3.2780469174788647, lr=0.00068401909865142
2023-11-22 03:09:18   INFO  epoch: 10/30, acc_iter=69070, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:57:46, time_cost(all): 18:44:47/1 day, 9:46:07, loss=0.486848619038842, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=0.7364666353443168, lr=0.000683698364469495
2023-11-22 03:10:07   INFO  epoch: 10/30, acc_iter=69120, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:56:35, time_cost(all): 18:45:36/1 day, 12:51:27, loss=0.486765505052533, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=2.5878388616453556, lr=0.00068337763028757
2023-11-22 03:10:56   INFO  epoch: 10/30, acc_iter=69170, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:52:57, time_cost(all): 18:46:25/1 day, 12:10:15, loss=0.486682391066224, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=3.5606626008439335, lr=0.000683056896105646
2023-11-22 03:11:45   INFO  epoch: 10/30, acc_iter=69220, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:53:06, time_cost(all): 18:47:14/1 day, 11:02:54, loss=0.486599277079915, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=4.257112996735996, lr=0.000682736161923721
2023-11-22 03:12:34   INFO  epoch: 10/30, acc_iter=69270, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:51:21, time_cost(all): 18:48:03/1 day, 12:49:24, loss=0.486516163093606, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=3.997364880578053, lr=0.000682415427741796
2023-11-22 03:13:23   INFO  epoch: 10/30, acc_iter=69320, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:25, time_cost(all): 18:48:52/1 day, 9:56:03, loss=0.486433049107296, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=0.8261084903171279, lr=0.000682094693559871
2023-11-22 03:14:12   INFO  epoch: 10/30, acc_iter=69370, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:52:37, time_cost(all): 18:49:41/1 day, 12:32:18, loss=0.486349935120987, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=4.103961865641779, lr=0.000681773959377947
2023-11-22 03:15:01   INFO  epoch: 10/30, acc_iter=69420, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:59, time_cost(all): 18:50:30/1 day, 11:52:06, loss=0.486266821134678, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=0.550055794673044, lr=0.000681453225196022
2023-11-22 03:15:50   INFO  epoch: 10/30, acc_iter=69470, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:50:26, time_cost(all): 18:51:19/1 day, 12:47:50, loss=0.486183707148369, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=1.3990553694069572, lr=0.000681132491014097
2023-11-22 03:16:40   INFO  epoch: 10/30, acc_iter=69520, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:49:54, time_cost(all): 18:52:09/1 day, 11:00:15, loss=0.48610059316206, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=3.4983368093085887, lr=0.000680811756832173
2023-11-22 03:17:29   INFO  epoch: 10/30, acc_iter=69570, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:48:14, time_cost(all): 18:52:58/1 day, 10:44:35, loss=0.486017479175751, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=2.5197929156532384, lr=0.000680491022650248
2023-11-22 03:18:18   INFO  epoch: 10/30, acc_iter=69620, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:48:03, time_cost(all): 18:53:47/1 day, 11:18:17, loss=0.485934365189442, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=2.460554753863695, lr=0.000680170288468323
2023-11-22 03:19:07   INFO  epoch: 10/30, acc_iter=69670, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:44:55, time_cost(all): 18:54:36/1 day, 10:07:10, loss=0.485851251203133, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=0.8317442812492981, lr=0.000679849554286399
2023-11-22 03:19:56   INFO  epoch: 10/30, acc_iter=69720, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:44:06, time_cost(all): 18:55:25/1 day, 10:53:14, loss=0.485768137216824, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=4.4585203267028, lr=0.000679528820104474
2023-11-22 03:20:45   INFO  epoch: 10/30, acc_iter=69770, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:44:49, time_cost(all): 18:56:14/1 day, 11:14:43, loss=0.485685023230515, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=1.9155965480760475, lr=0.000679208085922549
2023-11-22 03:21:34   INFO  epoch: 10/30, acc_iter=69820, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:20, time_cost(all): 18:57:03/1 day, 9:39:45, loss=0.485601909244206, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=3.7132228220391545, lr=0.000678887351740624
2023-11-22 03:22:23   INFO  epoch: 10/30, acc_iter=69870, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:43:26, time_cost(all): 18:57:52/1 day, 9:48:14, loss=0.485518795257897, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=1.40325275046434, lr=0.0006785666175587
2023-11-22 03:23:13   INFO  epoch: 10/30, acc_iter=69920, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:41:35, time_cost(all): 18:58:42/1 day, 12:19:24, loss=0.485435681271588, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=0.5865710922984613, lr=0.000678245883376775
2023-11-22 03:24:02   INFO  epoch: 10/30, acc_iter=69970, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:40:53, time_cost(all): 18:59:31/1 day, 12:26:29, loss=0.485352567285279, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=1.6625834192892421, lr=0.00067792514919485
2023-11-22 03:24:51   INFO  epoch: 10/30, acc_iter=70020, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:10, time_cost(all): 19:00:20/1 day, 10:36:46, loss=0.48526945329897, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=4.979292137111202, lr=0.000677604415012926
2023-11-22 03:25:40   INFO  epoch: 10/30, acc_iter=70070, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:26, time_cost(all): 19:01:09/1 day, 9:28:14, loss=0.485186339312661, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.13(1.03), norm=1.3563483013099011, lr=0.000677283680831001
2023-11-22 03:26:29   INFO  epoch: 10/30, acc_iter=70120, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:54, time_cost(all): 19:01:58/1 day, 12:18:59, loss=0.485103225326351, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=2.48254616053676, lr=0.000676962946649076
2023-11-22 03:27:18   INFO  epoch: 10/30, acc_iter=70170, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:38:22, time_cost(all): 19:02:47/1 day, 9:35:23, loss=0.485020111340042, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=0.8715199623023738, lr=0.000676642212467152
2023-11-22 03:28:07   INFO  epoch: 10/30, acc_iter=70220, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:45, time_cost(all): 19:03:36/1 day, 11:28:41, loss=0.484936997353733, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=2.175289162679427, lr=0.000676321478285227
2023-11-22 03:28:56   INFO  epoch: 10/30, acc_iter=70270, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:48, time_cost(all): 19:04:25/1 day, 12:09:43, loss=0.484853883367424, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=3.7951262850605545, lr=0.000676000744103302
2023-11-22 03:29:45   INFO  epoch: 10/30, acc_iter=70320, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:36:27, time_cost(all): 19:05:14/1 day, 10:06:04, loss=0.484770769381115, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=1.862082831547426, lr=0.000675680009921377
2023-11-22 03:30:35   INFO  epoch: 10/30, acc_iter=70370, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:32:55, time_cost(all): 19:06:04/1 day, 9:17:25, loss=0.484687655394806, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=2.6060717090481273, lr=0.000675359275739453
2023-11-22 03:31:24   INFO  epoch: 10/30, acc_iter=70420, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:32:40, time_cost(all): 19:06:53/1 day, 11:11:20, loss=0.484604541408497, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=4.97684238698142, lr=0.000675038541557528
2023-11-22 03:32:13   INFO  epoch: 10/30, acc_iter=70470, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:18, time_cost(all): 19:07:42/1 day, 10:13:42, loss=0.484521427422188, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=1.1208975625693531, lr=0.000674717807375603
2023-11-22 03:33:02   INFO  epoch: 10/30, acc_iter=70520, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:14, time_cost(all): 19:08:31/1 day, 11:46:02, loss=0.484438313435879, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=4.824393527044655, lr=0.000674397073193679
2023-11-22 03:33:51   INFO  epoch: 10/30, acc_iter=70570, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:45, time_cost(all): 19:09:20/1 day, 12:24:40, loss=0.48435519944957, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=3.4725393341534563, lr=0.000674076339011754
2023-11-22 03:34:40   INFO  epoch: 10/30, acc_iter=70620, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:11, time_cost(all): 19:10:09/1 day, 10:00:23, loss=0.484272085463261, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=2.565843913984732, lr=0.000673755604829829
2023-11-22 03:35:29   INFO  epoch: 10/30, acc_iter=70670, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:20, time_cost(all): 19:10:58/1 day, 11:00:11, loss=0.484188971476952, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=4.431444486643018, lr=0.000673434870647905
2023-11-22 03:36:18   INFO  epoch: 10/30, acc_iter=70720, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:27, time_cost(all): 19:11:47/1 day, 9:08:12, loss=0.484105857490643, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=2.7064571184527955, lr=0.00067311413646598
2023-11-22 03:37:08   INFO  epoch: 10/30, acc_iter=70770, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:53, time_cost(all): 19:12:37/1 day, 9:46:24, loss=0.484022743504334, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=0.8560210163279705, lr=0.000672793402284055
2023-11-22 03:37:57   INFO  epoch: 10/30, acc_iter=70820, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:25:28, time_cost(all): 19:13:26/1 day, 10:14:28, loss=0.483939629518025, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=2.0904523628837746, lr=0.00067247266810213
2023-11-22 03:38:46   INFO  epoch: 10/30, acc_iter=70870, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:04, time_cost(all): 19:14:15/1 day, 9:53:32, loss=0.483856515531715, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=3.3431302170813106, lr=0.000672151933920206
2023-11-22 03:39:35   INFO  epoch: 10/30, acc_iter=70920, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:55, time_cost(all): 19:15:04/1 day, 11:25:35, loss=0.483773401545406, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=4.207919640832229, lr=0.000671831199738281
2023-11-22 03:40:24   INFO  epoch: 10/30, acc_iter=70970, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:48, time_cost(all): 19:15:53/1 day, 9:06:48, loss=0.483690287559097, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=4.765750061999587, lr=0.000671510465556356
2023-11-22 03:41:13   INFO  epoch: 10/30, acc_iter=71020, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:24:29, time_cost(all): 19:16:42/1 day, 10:04:06, loss=0.483607173572788, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=3.8035729002867424, lr=0.000671189731374431
2023-11-22 03:42:02   INFO  epoch: 10/30, acc_iter=71070, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:24, time_cost(all): 19:17:31/1 day, 11:12:22, loss=0.483524059586479, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=2.404090080922333, lr=0.000670868997192507
2023-11-22 03:42:51   INFO  epoch: 10/30, acc_iter=71120, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:45, time_cost(all): 19:18:20/1 day, 11:21:43, loss=0.48344094560017, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=1.3517401886586666, lr=0.000670548263010582
2023-11-22 03:43:40   INFO  epoch: 10/30, acc_iter=71170, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:21, time_cost(all): 19:19:09/1 day, 11:12:46, loss=0.483357831613861, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=3.983346047351154, lr=0.000670227528828657
2023-11-22 03:44:30   INFO  epoch: 10/30, acc_iter=71220, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:47, time_cost(all): 19:19:59/1 day, 9:12:10, loss=0.483274717627552, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=3.6094902966232203, lr=0.000669906794646733
2023-11-22 03:45:19   INFO  epoch: 10/30, acc_iter=71270, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:06, time_cost(all): 19:20:48/1 day, 10:08:20, loss=0.483191603641243, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=4.122484390222224, lr=0.000669586060464808
2023-11-22 03:46:08   INFO  epoch: 10/30, acc_iter=71320, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:59, time_cost(all): 19:21:37/1 day, 9:03:44, loss=0.483108489654934, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=4.930368989040294, lr=0.000669265326282883
2023-11-22 03:46:57   INFO  epoch: 10/30, acc_iter=71370, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:17, time_cost(all): 19:22:26/1 day, 10:02:49, loss=0.483025375668625, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=1.2975905266688739, lr=0.000668944592100959
2023-11-22 03:47:46   INFO  epoch: 10/30, acc_iter=71420, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:40, time_cost(all): 19:23:15/1 day, 11:33:43, loss=0.482942261682316, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=1.9395467207045396, lr=0.000668623857919034
2023-11-22 03:48:35   INFO  epoch: 10/30, acc_iter=71470, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:43, time_cost(all): 19:24:04/1 day, 9:35:32, loss=0.482859147696007, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=2.915774291537372, lr=0.000668303123737109
2023-11-22 03:49:24   INFO  epoch: 10/30, acc_iter=71520, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:14:45, time_cost(all): 19:24:53/1 day, 8:56:37, loss=0.482776033709698, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=4.439798060677365, lr=0.000667982389555184
2023-11-22 03:50:13   INFO  epoch: 10/30, acc_iter=71570, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:37, time_cost(all): 19:25:42/1 day, 9:43:28, loss=0.482692919723389, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.4520954423975163, lr=0.00066766165537326
2023-11-22 03:51:03   INFO  epoch: 10/30, acc_iter=71620, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:46, time_cost(all): 19:26:32/1 day, 8:47:18, loss=0.48260980573708, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=3.525364029834807, lr=0.000667340921191335
2023-11-22 03:51:52   INFO  epoch: 10/30, acc_iter=71670, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:40, time_cost(all): 19:27:21/1 day, 11:32:28, loss=0.482526691750771, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=0.9029915469712777, lr=0.00066702018700941
2023-11-22 03:52:41   INFO  epoch: 10/30, acc_iter=71720, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:14, time_cost(all): 19:28:10/1 day, 11:28:34, loss=0.482443577764461, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=2.3774994243263206, lr=0.000666699452827486
2023-11-22 03:53:30   INFO  epoch: 10/30, acc_iter=71770, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:37, time_cost(all): 19:28:59/1 day, 9:32:31, loss=0.482360463778152, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=2.633002056222842, lr=0.000666378718645561
2023-11-22 03:54:19   INFO  epoch: 10/30, acc_iter=71820, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:48, time_cost(all): 19:29:48/1 day, 9:31:42, loss=0.482277349791843, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=4.4173545658650335, lr=0.000666057984463636
2023-11-22 03:55:08   INFO  epoch: 10/30, acc_iter=71870, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:10:00, time_cost(all): 19:30:37/1 day, 11:40:43, loss=0.482194235805534, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=4.654388070858436, lr=0.000665737250281712
2023-11-22 03:55:57   INFO  epoch: 10/30, acc_iter=71920, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:52, time_cost(all): 19:31:26/1 day, 9:47:51, loss=0.482111121819225, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=1.7139041084813744, lr=0.000665416516099787
2023-11-22 03:56:46   INFO  epoch: 10/30, acc_iter=71970, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:02, time_cost(all): 19:32:15/1 day, 12:04:24, loss=0.482028007832916, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=2.3540963783949955, lr=0.000665095781917862
2023-11-22 03:57:35   INFO  epoch: 10/30, acc_iter=72020, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:27, time_cost(all): 19:33:04/1 day, 8:51:51, loss=0.481944893846607, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=4.7609556384310405, lr=0.000664775047735937
2023-11-22 03:58:25   INFO  epoch: 10/30, acc_iter=72070, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:36, time_cost(all): 19:33:54/1 day, 11:42:18, loss=0.481861779860298, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=1.4497172949019688, lr=0.000664454313554013
2023-11-22 03:59:14   INFO  epoch: 10/30, acc_iter=72120, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:31, time_cost(all): 19:34:43/1 day, 10:29:36, loss=0.481778665873989, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=4.819085476114506, lr=0.000664133579372088
2023-11-22 04:00:03   INFO  epoch: 10/30, acc_iter=72170, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:33, time_cost(all): 19:35:32/1 day, 10:46:35, loss=0.48169555188768, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=2.5107026563802255, lr=0.000663812845190163
2023-11-22 04:00:52   INFO  epoch: 10/30, acc_iter=72220, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:44, time_cost(all): 19:36:21/1 day, 9:15:14, loss=0.481612437901371, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=3.501271710308797, lr=0.000663492111008239
2023-11-22 04:01:41   INFO  epoch: 10/30, acc_iter=72270, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:58, time_cost(all): 19:37:10/1 day, 11:09:09, loss=0.481529323915062, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=1.9425853616479674, lr=0.000663171376826314
2023-11-22 04:02:30   INFO  epoch: 10/30, acc_iter=72320, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:16, time_cost(all): 19:37:59/1 day, 9:23:24, loss=0.481446209928753, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=2.3519836038283013, lr=0.000662850642644389
2023-11-22 04:03:19   INFO  epoch: 10/30, acc_iter=72370, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:29, time_cost(all): 19:38:48/1 day, 11:16:04, loss=0.481363095942444, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=3.983582488173792, lr=0.000662529908462465
2023-11-22 04:04:08   INFO  epoch: 10/30, acc_iter=72420, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:36, time_cost(all): 19:39:37/1 day, 10:37:08, loss=0.481279981956135, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.126580251382925, lr=0.00066220917428054
2023-11-22 04:04:58   INFO  epoch: 11/30, acc_iter=72507, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:50:08, time_cost(all): 19:40:27/1 day, 11:02:06, loss=0.481135363619957, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=0.8842629385293237, lr=0.000661651096803991
2023-11-22 04:05:47   INFO  epoch: 11/30, acc_iter=72557, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:47:25, time_cost(all): 19:41:16/1 day, 10:26:57, loss=0.481052249633648, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=2.1446871256697193, lr=0.000661330362622066
2023-11-22 04:06:36   INFO  epoch: 11/30, acc_iter=72607, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:47:49, time_cost(all): 19:42:05/1 day, 10:52:48, loss=0.480969135647339, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=2.384762703823523, lr=0.000661009628440141
2023-11-22 04:07:25   INFO  epoch: 11/30, acc_iter=72657, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:49:20, time_cost(all): 19:42:54/1 day, 10:10:15, loss=0.48088602166103, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=0.6578274442328873, lr=0.000660688894258217
2023-11-22 04:08:14   INFO  epoch: 11/30, acc_iter=72707, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:48:06, time_cost(all): 19:43:43/1 day, 9:40:49, loss=0.480802907674721, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=3.239722216903365, lr=0.000660368160076292
2023-11-22 04:09:03   INFO  epoch: 11/30, acc_iter=72757, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:45:02, time_cost(all): 19:44:32/1 day, 11:14:59, loss=0.480719793688412, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=1.4989759982940087, lr=0.000660047425894367
2023-11-22 04:09:52   INFO  epoch: 11/30, acc_iter=72807, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:40:25, time_cost(all): 19:45:21/1 day, 8:42:20, loss=0.480636679702102, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=1.1881669492505265, lr=0.000659726691712443
2023-11-22 04:10:41   INFO  epoch: 11/30, acc_iter=72857, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:44:13, time_cost(all): 19:46:10/1 day, 8:56:14, loss=0.480553565715793, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=3.5790917371559443, lr=0.000659405957530518
2023-11-22 04:11:30   INFO  epoch: 11/30, acc_iter=72907, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:43:37, time_cost(all): 19:46:59/1 day, 9:07:28, loss=0.480470451729484, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=3.2627779317750343, lr=0.000659085223348593
2023-11-22 04:12:20   INFO  epoch: 11/30, acc_iter=72957, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:41:02, time_cost(all): 19:47:49/1 day, 10:03:23, loss=0.480387337743175, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=2.184567245085474, lr=0.000658764489166668
2023-11-22 04:13:09   INFO  epoch: 11/30, acc_iter=73007, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:38:46, time_cost(all): 19:48:38/1 day, 9:06:29, loss=0.480304223756866, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=4.257397908719841, lr=0.000658443754984744
2023-11-22 04:13:58   INFO  epoch: 11/30, acc_iter=73057, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:38:50, time_cost(all): 19:49:27/1 day, 9:54:52, loss=0.480221109770557, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=2.9304817581745652, lr=0.000658123020802819
2023-11-22 04:14:47   INFO  epoch: 11/30, acc_iter=73107, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:33:41, time_cost(all): 19:50:16/1 day, 11:16:14, loss=0.480137995784248, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=2.6208622033773685, lr=0.000657802286620894
2023-11-22 04:15:36   INFO  epoch: 11/30, acc_iter=73157, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:37:39, time_cost(all): 19:51:05/1 day, 9:39:48, loss=0.480054881797939, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=3.3454915640668754, lr=0.00065748155243897
2023-11-22 04:16:25   INFO  epoch: 11/30, acc_iter=73207, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:34:31, time_cost(all): 19:51:54/1 day, 8:25:26, loss=0.47997176781163, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=4.519700140721925, lr=0.000657160818257045
2023-11-22 04:17:14   INFO  epoch: 11/30, acc_iter=73257, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:35:07, time_cost(all): 19:52:43/1 day, 8:54:55, loss=0.479888653825321, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=4.10513342672719, lr=0.00065684008407512
2023-11-22 04:18:03   INFO  epoch: 11/30, acc_iter=73307, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:37:45, time_cost(all): 19:53:32/1 day, 9:00:11, loss=0.479805539839012, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=3.927144988904401, lr=0.000656519349893196
2023-11-22 04:18:52   INFO  epoch: 11/30, acc_iter=73357, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:30:12, time_cost(all): 19:54:21/1 day, 9:48:42, loss=0.479722425852703, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=0.8425364572623136, lr=0.000656198615711271
2023-11-22 04:19:42   INFO  epoch: 11/30, acc_iter=73407, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:31:28, time_cost(all): 19:55:11/1 day, 11:00:07, loss=0.479639311866394, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=3.0996075831613505, lr=0.000655877881529346
2023-11-22 04:20:31   INFO  epoch: 11/30, acc_iter=73457, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:31:05, time_cost(all): 19:56:00/1 day, 8:51:47, loss=0.479556197880085, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=1.272011905473611, lr=0.000655557147347421
2023-11-22 04:21:20   INFO  epoch: 11/30, acc_iter=73507, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:35:09, time_cost(all): 19:56:49/1 day, 10:13:45, loss=0.479473083893776, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=1.9401508091941788, lr=0.000655236413165497
2023-11-22 04:22:09   INFO  epoch: 11/30, acc_iter=73557, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:26:44, time_cost(all): 19:57:38/1 day, 9:04:54, loss=0.479389969907467, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=4.259511857610139, lr=0.000654915678983572
2023-11-22 04:22:58   INFO  epoch: 11/30, acc_iter=73607, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:28:32, time_cost(all): 19:58:27/1 day, 8:36:10, loss=0.479306855921157, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=2.732027479579922, lr=0.000654594944801647
2023-11-22 04:23:47   INFO  epoch: 11/30, acc_iter=73657, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:26:37, time_cost(all): 19:59:16/1 day, 9:07:50, loss=0.479223741934848, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=1.0013557817635306, lr=0.000654274210619723
2023-11-22 04:24:36   INFO  epoch: 11/30, acc_iter=73707, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:23:32, time_cost(all): 20:00:05/1 day, 11:06:23, loss=0.479140627948539, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=0.850353133928156, lr=0.000653953476437798
2023-11-22 04:25:25   INFO  epoch: 11/30, acc_iter=73757, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:29:54, time_cost(all): 20:00:54/1 day, 11:15:08, loss=0.47905751396223, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=4.035851574174549, lr=0.000653632742255873
2023-11-22 04:26:15   INFO  epoch: 11/30, acc_iter=73807, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:25:32, time_cost(all): 20:01:44/1 day, 9:21:55, loss=0.478974399975921, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=1.06462242453643, lr=0.000653312008073948
2023-11-22 04:27:04   INFO  epoch: 11/30, acc_iter=73857, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:27:52, time_cost(all): 20:02:33/1 day, 8:29:52, loss=0.478891285989612, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=4.248765306841177, lr=0.000652991273892024
2023-11-22 04:27:53   INFO  epoch: 11/30, acc_iter=73907, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:20:38, time_cost(all): 20:03:22/1 day, 9:22:46, loss=0.478808172003303, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=1.939942477393834, lr=0.000652670539710099
2023-11-22 04:28:42   INFO  epoch: 11/30, acc_iter=73957, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:21:42, time_cost(all): 20:04:11/1 day, 8:15:35, loss=0.478725058016994, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=1.6281239812209154, lr=0.000652349805528174
2023-11-22 04:29:31   INFO  epoch: 11/30, acc_iter=74007, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:24:14, time_cost(all): 20:05:00/1 day, 11:16:04, loss=0.478641944030685, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=4.701963646875145, lr=0.00065202907134625
2023-11-22 04:30:20   INFO  epoch: 11/30, acc_iter=74057, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:18:22, time_cost(all): 20:05:49/1 day, 8:54:30, loss=0.478558830044376, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=0.6985681204009071, lr=0.000651708337164325
2023-11-22 04:31:09   INFO  epoch: 11/30, acc_iter=74107, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:19:38, time_cost(all): 20:06:38/1 day, 8:17:25, loss=0.478475716058067, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=2.1213971228038657, lr=0.0006513876029824
2023-11-22 04:31:58   INFO  epoch: 11/30, acc_iter=74157, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:21:57, time_cost(all): 20:07:27/1 day, 10:02:36, loss=0.478392602071758, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=2.0046807812375462, lr=0.000651066868800476
2023-11-22 04:32:47   INFO  epoch: 11/30, acc_iter=74207, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:22:25, time_cost(all): 20:08:16/1 day, 10:37:39, loss=0.478309488085449, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=3.488688527092421, lr=0.000650746134618551
2023-11-22 04:33:37   INFO  epoch: 11/30, acc_iter=74257, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:19:56, time_cost(all): 20:09:06/1 day, 10:23:53, loss=0.47822637409914, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=3.5582630191186957, lr=0.000650425400436626
2023-11-22 04:34:26   INFO  epoch: 11/30, acc_iter=74307, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:18:55, time_cost(all): 20:09:55/1 day, 11:12:24, loss=0.478143260112831, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=2.0056997239508165, lr=0.000650104666254701
2023-11-22 04:35:15   INFO  epoch: 11/30, acc_iter=74357, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:14:35, time_cost(all): 20:10:44/1 day, 9:48:33, loss=0.478060146126521, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=3.885443568404675, lr=0.000649783932072777
2023-11-22 04:36:04   INFO  epoch: 11/30, acc_iter=74407, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:13:39, time_cost(all): 20:11:33/1 day, 8:18:16, loss=0.477977032140212, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=3.100998663522758, lr=0.000649463197890852
2023-11-22 04:36:53   INFO  epoch: 11/30, acc_iter=74457, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:13:38, time_cost(all): 20:12:22/1 day, 11:02:50, loss=0.477893918153903, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=4.312688514715026, lr=0.000649142463708927
2023-11-22 04:37:42   INFO  epoch: 11/30, acc_iter=74507, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:13:22, time_cost(all): 20:13:11/1 day, 9:47:22, loss=0.477810804167594, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=2.034639512138467, lr=0.000648821729527003
2023-11-22 04:38:31   INFO  epoch: 11/30, acc_iter=74557, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:15:21, time_cost(all): 20:14:00/1 day, 9:31:20, loss=0.477727690181285, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=2.2233977534930265, lr=0.000648500995345078
2023-11-22 04:39:20   INFO  epoch: 11/30, acc_iter=74607, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:14:32, time_cost(all): 20:14:49/1 day, 10:02:55, loss=0.477644576194976, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=4.06165056760721, lr=0.000648180261163153
2023-11-22 04:40:10   INFO  epoch: 11/30, acc_iter=74657, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:10:45, time_cost(all): 20:15:39/1 day, 8:02:27, loss=0.477561462208667, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=4.936737217702636, lr=0.000647859526981229
2023-11-22 04:40:59   INFO  epoch: 11/30, acc_iter=74707, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:14:00, time_cost(all): 20:16:28/1 day, 9:07:44, loss=0.477478348222358, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=1.426482234825681, lr=0.000647538792799304
2023-11-22 04:41:48   INFO  epoch: 11/30, acc_iter=74757, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:07:52, time_cost(all): 20:17:17/1 day, 8:03:31, loss=0.477395234236049, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=4.70963300461841, lr=0.000647218058617379
2023-11-22 04:42:37   INFO  epoch: 11/30, acc_iter=74807, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:09:06, time_cost(all): 20:18:06/1 day, 8:42:40, loss=0.47731212024974, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=0.6868103828487415, lr=0.000646897324435454
2023-11-22 04:43:26   INFO  epoch: 11/30, acc_iter=74857, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:08:44, time_cost(all): 20:18:55/1 day, 8:57:58, loss=0.477229006263431, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=2.019427121641181, lr=0.00064657659025353
2023-11-22 04:44:15   INFO  epoch: 11/30, acc_iter=74907, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:09:34, time_cost(all): 20:19:44/1 day, 10:54:36, loss=0.477145892277122, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=1.248208759377941, lr=0.000646255856071605
2023-11-22 04:45:04   INFO  epoch: 11/30, acc_iter=74957, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:08:17, time_cost(all): 20:20:33/1 day, 10:25:38, loss=0.477062778290813, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=2.6464088397974725, lr=0.00064593512188968
2023-11-22 04:45:53   INFO  epoch: 11/30, acc_iter=75007, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:08:39, time_cost(all): 20:21:22/1 day, 10:23:11, loss=0.476979664304504, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=4.385579953481688, lr=0.000645614387707756
2023-11-22 04:46:42   INFO  epoch: 11/30, acc_iter=75057, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:07:35, time_cost(all): 20:22:11/1 day, 8:34:30, loss=0.476896550318195, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=2.2229044528166186, lr=0.000645293653525831
2023-11-22 04:47:32   INFO  epoch: 11/30, acc_iter=75107, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:02:14, time_cost(all): 20:23:01/1 day, 9:24:52, loss=0.476813436331886, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=2.7990953428229215, lr=0.000644972919343906
2023-11-22 04:48:21   INFO  epoch: 11/30, acc_iter=75157, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:05:04, time_cost(all): 20:23:50/1 day, 10:39:31, loss=0.476730322345576, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=3.5029206736860306, lr=0.000644652185161982
2023-11-22 04:49:10   INFO  epoch: 11/30, acc_iter=75207, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:01:00, time_cost(all): 20:24:39/1 day, 8:18:51, loss=0.476647208359267, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=2.0808894014220796, lr=0.000644331450980057
2023-11-22 04:49:59   INFO  epoch: 11/30, acc_iter=75257, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:03:55, time_cost(all): 20:25:28/1 day, 9:22:37, loss=0.476564094372958, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=2.553795497838597, lr=0.000644010716798132
2023-11-22 04:50:48   INFO  epoch: 11/30, acc_iter=75307, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:02:03, time_cost(all): 20:26:17/1 day, 10:09:06, loss=0.476480980386649, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=4.929877310395996, lr=0.000643689982616207
2023-11-22 04:51:37   INFO  epoch: 11/30, acc_iter=75357, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:01:51, time_cost(all): 20:27:06/1 day, 9:29:57, loss=0.47639786640034, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=1.8409358693296516, lr=0.000643369248434283
2023-11-22 04:52:26   INFO  epoch: 11/30, acc_iter=75407, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:59:35, time_cost(all): 20:27:55/1 day, 8:18:23, loss=0.476314752414031, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=3.2946773462289043, lr=0.000643048514252358
2023-11-22 04:53:15   INFO  epoch: 11/30, acc_iter=75457, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:58:05, time_cost(all): 20:28:44/1 day, 10:09:50, loss=0.476231638427722, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=0.7770531384442999, lr=0.000642727780070433
2023-11-22 04:54:05   INFO  epoch: 11/30, acc_iter=75507, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:56:42, time_cost(all): 20:29:34/1 day, 10:10:57, loss=0.476148524441413, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=1.0300244766487008, lr=0.000642407045888509
2023-11-22 04:54:54   INFO  epoch: 11/30, acc_iter=75557, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:55:11, time_cost(all): 20:30:23/1 day, 8:01:34, loss=0.476065410455104, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=1.387330971971637, lr=0.000642086311706584
2023-11-22 04:55:43   INFO  epoch: 11/30, acc_iter=75607, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:57:31, time_cost(all): 20:31:12/1 day, 8:18:27, loss=0.475982296468795, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=2.892872595297311, lr=0.000641765577524659
2023-11-22 04:56:32   INFO  epoch: 11/30, acc_iter=75657, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:58:08, time_cost(all): 20:32:01/1 day, 7:49:50, loss=0.475899182482486, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.83(1.03), norm=4.954128673842741, lr=0.000641444843342735
2023-11-22 04:57:21   INFO  epoch: 11/30, acc_iter=75707, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:52:32, time_cost(all): 20:32:50/1 day, 10:57:22, loss=0.475816068496177, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=2.5658444882847533, lr=0.00064112410916081
2023-11-22 04:58:10   INFO  epoch: 11/30, acc_iter=75757, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:55:50, time_cost(all): 20:33:39/1 day, 9:55:57, loss=0.475732954509868, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=4.761225704366855, lr=0.000640803374978885
2023-11-22 04:58:59   INFO  epoch: 11/30, acc_iter=75807, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:50:38, time_cost(all): 20:34:28/1 day, 9:05:41, loss=0.475649840523559, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=1.5598367361411367, lr=0.00064048264079696
2023-11-22 04:59:48   INFO  epoch: 11/30, acc_iter=75857, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:51:22, time_cost(all): 20:35:17/1 day, 8:59:00, loss=0.47556672653725, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=0.729385355329883, lr=0.000640161906615036
2023-11-22 05:00:37   INFO  epoch: 11/30, acc_iter=75907, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:38, time_cost(all): 20:36:06/1 day, 8:07:35, loss=0.475483612550941, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=3.41928676598785, lr=0.000639841172433111
2023-11-22 05:01:27   INFO  epoch: 11/30, acc_iter=75957, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:52:22, time_cost(all): 20:36:56/1 day, 10:22:16, loss=0.475400498564631, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=0.797983409463155, lr=0.000639520438251186
2023-11-22 05:02:16   INFO  epoch: 11/30, acc_iter=76007, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:50:14, time_cost(all): 20:37:45/1 day, 10:04:03, loss=0.475317384578322, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=0.8801883952946821, lr=0.000639199704069261
2023-11-22 05:03:05   INFO  epoch: 11/30, acc_iter=76057, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:47:54, time_cost(all): 20:38:34/1 day, 8:41:04, loss=0.475234270592013, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=1.952163346608839, lr=0.000638878969887337
2023-11-22 05:03:54   INFO  epoch: 11/30, acc_iter=76107, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:46:01, time_cost(all): 20:39:23/1 day, 8:23:39, loss=0.475151156605704, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=4.1943880687194355, lr=0.000638558235705412
2023-11-22 05:04:43   INFO  epoch: 11/30, acc_iter=76157, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:27, time_cost(all): 20:40:12/1 day, 9:36:36, loss=0.475068042619395, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=4.700054563476206, lr=0.000638237501523487
2023-11-22 05:05:32   INFO  epoch: 11/30, acc_iter=76207, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:58, time_cost(all): 20:41:01/1 day, 9:31:30, loss=0.474984928633086, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=1.9143766916443798, lr=0.000637916767341563
2023-11-22 05:06:21   INFO  epoch: 11/30, acc_iter=76257, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:31, time_cost(all): 20:41:50/1 day, 8:19:02, loss=0.474901814646777, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.568709052964328, lr=0.000637596033159638
2023-11-22 05:07:10   INFO  epoch: 11/30, acc_iter=76307, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:44:23, time_cost(all): 20:42:39/1 day, 8:20:38, loss=0.474818700660468, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=1.4653023618639813, lr=0.000637275298977713
2023-11-22 05:08:00   INFO  epoch: 11/30, acc_iter=76357, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:52, time_cost(all): 20:43:29/1 day, 9:39:31, loss=0.474735586674159, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=1.2593031593642945, lr=0.000636954564795789
2023-11-22 05:08:49   INFO  epoch: 11/30, acc_iter=76407, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:44:55, time_cost(all): 20:44:18/1 day, 8:53:31, loss=0.47465247268785, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.715906155989142, lr=0.000636633830613864
2023-11-22 05:09:38   INFO  epoch: 11/30, acc_iter=76457, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:40:37, time_cost(all): 20:45:07/1 day, 8:32:03, loss=0.474569358701541, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=2.1673538433378, lr=0.000636313096431939
2023-11-22 05:10:27   INFO  epoch: 11/30, acc_iter=76507, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:41:34, time_cost(all): 20:45:56/1 day, 10:01:42, loss=0.474486244715232, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=4.18164916552819, lr=0.000635992362250014
2023-11-22 05:11:16   INFO  epoch: 11/30, acc_iter=76557, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:41:08, time_cost(all): 20:46:45/1 day, 7:58:28, loss=0.474403130728923, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=4.625566306624831, lr=0.00063567162806809
2023-11-22 05:12:05   INFO  epoch: 11/30, acc_iter=76607, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:22, time_cost(all): 20:47:34/1 day, 9:15:08, loss=0.474320016742614, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=0.7873817247372168, lr=0.000635350893886165
2023-11-22 05:12:54   INFO  epoch: 11/30, acc_iter=76657, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:50, time_cost(all): 20:48:23/1 day, 10:25:22, loss=0.474236902756305, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=1.9721220570987168, lr=0.00063503015970424
2023-11-22 05:13:43   INFO  epoch: 11/30, acc_iter=76707, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:35, time_cost(all): 20:49:12/1 day, 8:55:59, loss=0.474153788769996, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=3.0506391130194226, lr=0.000634709425522316
2023-11-22 05:14:32   INFO  epoch: 11/30, acc_iter=76757, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:46, time_cost(all): 20:50:01/1 day, 9:50:58, loss=0.474070674783686, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=1.7840436988131845, lr=0.000634388691340391
2023-11-22 05:15:22   INFO  epoch: 11/30, acc_iter=76807, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:32, time_cost(all): 20:50:51/1 day, 10:34:31, loss=0.473987560797377, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=1.4685624971613345, lr=0.000634067957158466
2023-11-22 05:16:11   INFO  epoch: 11/30, acc_iter=76857, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:27, time_cost(all): 20:51:40/1 day, 9:57:57, loss=0.473904446811068, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=3.8206383725656643, lr=0.000633747222976542
2023-11-22 05:17:00   INFO  epoch: 11/30, acc_iter=76907, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:39, time_cost(all): 20:52:29/1 day, 10:24:22, loss=0.473821332824759, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=0.8804203567319472, lr=0.000633426488794617
2023-11-22 05:17:49   INFO  epoch: 11/30, acc_iter=76957, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:18, time_cost(all): 20:53:18/1 day, 8:24:34, loss=0.47373821883845, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=4.121946155744765, lr=0.000633105754612692
2023-11-22 05:18:38   INFO  epoch: 11/30, acc_iter=77007, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:32:43, time_cost(all): 20:54:07/1 day, 7:31:42, loss=0.473655104852141, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=2.4138127261159674, lr=0.000632785020430767
2023-11-22 05:19:27   INFO  epoch: 11/30, acc_iter=77057, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:11, time_cost(all): 20:54:56/1 day, 7:31:17, loss=0.473571990865832, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=3.6255393283893786, lr=0.000632464286248843
2023-11-22 05:20:16   INFO  epoch: 11/30, acc_iter=77107, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:06, time_cost(all): 20:55:45/1 day, 8:24:22, loss=0.473488876879523, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=3.161333855079955, lr=0.000632143552066918
2023-11-22 05:21:05   INFO  epoch: 11/30, acc_iter=77157, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:30:24, time_cost(all): 20:56:34/1 day, 8:33:33, loss=0.473405762893214, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=3.3993435197063286, lr=0.000631822817884993
2023-11-22 05:21:55   INFO  epoch: 11/30, acc_iter=77207, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:28:39, time_cost(all): 20:57:24/1 day, 9:57:09, loss=0.473322648906905, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=2.1385789833277977, lr=0.000631502083703069
2023-11-22 05:22:44   INFO  epoch: 11/30, acc_iter=77257, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:30:35, time_cost(all): 20:58:13/1 day, 8:52:04, loss=0.473239534920596, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=1.8057523612646085, lr=0.000631181349521144
2023-11-22 05:23:33   INFO  epoch: 11/30, acc_iter=77307, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:33, time_cost(all): 20:59:02/1 day, 7:20:59, loss=0.473156420934287, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=2.663062018368281, lr=0.000630860615339219
2023-11-22 05:24:22   INFO  epoch: 11/30, acc_iter=77357, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:29, time_cost(all): 20:59:51/1 day, 8:01:50, loss=0.473073306947978, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=1.015606275518923, lr=0.000630539881157295
2023-11-22 05:25:11   INFO  epoch: 11/30, acc_iter=77407, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:06, time_cost(all): 21:00:40/1 day, 8:31:44, loss=0.472990192961669, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=4.9904727365354375, lr=0.00063021914697537
2023-11-22 05:26:00   INFO  epoch: 11/30, acc_iter=77457, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:28, time_cost(all): 21:01:29/1 day, 9:01:27, loss=0.47290707897536, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=1.3390585169755496, lr=0.000629898412793445
2023-11-22 05:26:49   INFO  epoch: 11/30, acc_iter=77507, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:14, time_cost(all): 21:02:18/1 day, 9:33:43, loss=0.472823964989051, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=3.3858407992189035, lr=0.00062957767861152
2023-11-22 05:27:38   INFO  epoch: 11/30, acc_iter=77557, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:55, time_cost(all): 21:03:07/1 day, 8:18:38, loss=0.472740851002741, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=2.723020586235495, lr=0.000629256944429596
2023-11-22 05:28:27   INFO  epoch: 11/30, acc_iter=77607, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:54, time_cost(all): 21:03:56/1 day, 10:06:51, loss=0.472657737016432, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=1.5082815335297555, lr=0.000628936210247671
2023-11-22 05:29:17   INFO  epoch: 11/30, acc_iter=77657, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:43, time_cost(all): 21:04:46/1 day, 8:25:24, loss=0.472574623030123, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=4.306576902508394, lr=0.000628615476065746
2023-11-22 05:30:06   INFO  epoch: 11/30, acc_iter=77707, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:20:54, time_cost(all): 21:05:35/1 day, 9:17:10, loss=0.472491509043814, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=1.8894432715274116, lr=0.000628294741883822
2023-11-22 05:30:55   INFO  epoch: 11/30, acc_iter=77757, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:44, time_cost(all): 21:06:24/1 day, 7:33:28, loss=0.472408395057505, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=3.4434147706662768, lr=0.000627974007701897
2023-11-22 05:31:44   INFO  epoch: 11/30, acc_iter=77807, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:21:00, time_cost(all): 21:07:13/1 day, 8:06:08, loss=0.472325281071196, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=1.4857925077009622, lr=0.000627653273519972
2023-11-22 05:32:33   INFO  epoch: 11/30, acc_iter=77857, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:44, time_cost(all): 21:08:02/1 day, 7:30:04, loss=0.472242167084887, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=4.107827985830447, lr=0.000627332539338048
2023-11-22 05:33:22   INFO  epoch: 11/30, acc_iter=77907, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:22, time_cost(all): 21:08:51/1 day, 10:03:18, loss=0.472159053098578, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=0.8736571373847695, lr=0.000627011805156123
2023-11-22 05:34:11   INFO  epoch: 11/30, acc_iter=77957, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:36, time_cost(all): 21:09:40/1 day, 8:03:45, loss=0.472075939112269, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=3.9464500040239296, lr=0.000626691070974198
2023-11-22 05:35:00   INFO  epoch: 11/30, acc_iter=78007, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:15, time_cost(all): 21:10:29/1 day, 9:05:54, loss=0.47199282512596, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=3.712732321001348, lr=0.000626370336792273
2023-11-22 05:35:50   INFO  epoch: 11/30, acc_iter=78057, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:12, time_cost(all): 21:11:19/1 day, 8:42:42, loss=0.471909711139651, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=1.2427979784293737, lr=0.000626049602610349
2023-11-22 05:36:39   INFO  epoch: 11/30, acc_iter=78107, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:38, time_cost(all): 21:12:08/1 day, 9:17:53, loss=0.471826597153342, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=4.725451980736633, lr=0.000625728868428424
2023-11-22 05:37:28   INFO  epoch: 11/30, acc_iter=78157, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:35, time_cost(all): 21:12:57/1 day, 7:18:48, loss=0.471743483167033, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=3.775354113770347, lr=0.000625408134246499
2023-11-22 05:38:17   INFO  epoch: 11/30, acc_iter=78207, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:21, time_cost(all): 21:13:46/1 day, 9:56:16, loss=0.471660369180724, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=2.4896066897025024, lr=0.000625087400064575
2023-11-22 05:39:06   INFO  epoch: 11/30, acc_iter=78257, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:51, time_cost(all): 21:14:35/1 day, 8:45:48, loss=0.471577255194415, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=2.9764250962396357, lr=0.00062476666588265
2023-11-22 05:39:55   INFO  epoch: 11/30, acc_iter=78307, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:31, time_cost(all): 21:15:24/1 day, 8:57:42, loss=0.471494141208106, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=2.7862978547089488, lr=0.000624445931700725
2023-11-22 05:40:44   INFO  epoch: 11/30, acc_iter=78357, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:30, time_cost(all): 21:16:13/1 day, 7:19:45, loss=0.471411027221796, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=0.9722012009646461, lr=0.0006241251975188
2023-11-22 05:41:33   INFO  epoch: 11/30, acc_iter=78407, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:04, time_cost(all): 21:17:02/1 day, 8:12:44, loss=0.471327913235487, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=1.9503802134597208, lr=0.000623804463336876
2023-11-22 05:42:22   INFO  epoch: 11/30, acc_iter=78457, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:43, time_cost(all): 21:17:51/1 day, 8:06:31, loss=0.471244799249178, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=1.2926702494136313, lr=0.000623483729154951
2023-11-22 05:43:12   INFO  epoch: 11/30, acc_iter=78507, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:37, time_cost(all): 21:18:41/1 day, 8:34:10, loss=0.471161685262869, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=2.925253398599899, lr=0.000623162994973026
2023-11-22 05:44:01   INFO  epoch: 11/30, acc_iter=78557, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:39, time_cost(all): 21:19:30/1 day, 9:49:54, loss=0.47107857127656, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=2.2723289439561323, lr=0.000622842260791102
2023-11-22 05:44:50   INFO  epoch: 11/30, acc_iter=78607, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:17, time_cost(all): 21:20:19/1 day, 9:56:46, loss=0.470995457290251, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=0.8935622763389887, lr=0.000622521526609177
2023-11-22 05:45:39   INFO  epoch: 11/30, acc_iter=78657, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:11, time_cost(all): 21:21:08/1 day, 7:23:45, loss=0.470912343303942, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=3.309003982011726, lr=0.000622200792427252
2023-11-22 05:46:28   INFO  epoch: 11/30, acc_iter=78707, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:19, time_cost(all): 21:21:57/1 day, 10:10:24, loss=0.470829229317633, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=2.7357588662798844, lr=0.000621880058245327
2023-11-22 05:47:17   INFO  epoch: 11/30, acc_iter=78757, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:28, time_cost(all): 21:22:46/1 day, 7:00:49, loss=0.470746115331324, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=4.400345068425878, lr=0.000621559324063403
2023-11-22 05:48:06   INFO  epoch: 11/30, acc_iter=78807, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:49, time_cost(all): 21:23:35/1 day, 8:26:15, loss=0.470663001345015, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=3.8255171811689648, lr=0.000621238589881478
2023-11-22 05:48:55   INFO  epoch: 11/30, acc_iter=78857, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:11, time_cost(all): 21:24:24/1 day, 9:27:41, loss=0.470579887358706, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=3.425594952097842, lr=0.000620917855699553
2023-11-22 05:49:45   INFO  epoch: 11/30, acc_iter=78907, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:15, time_cost(all): 21:25:14/1 day, 7:42:05, loss=0.470496773372397, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=3.098012031210622, lr=0.000620597121517629
2023-11-22 05:50:34   INFO  epoch: 11/30, acc_iter=78957, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:27, time_cost(all): 21:26:03/1 day, 8:35:06, loss=0.470413659386088, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=2.4136611799957515, lr=0.000620276387335704
2023-11-22 05:51:23   INFO  epoch: 11/30, acc_iter=79007, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 21:26:52/1 day, 7:38:28, loss=0.470330545399779, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=3.403946598435478, lr=0.000619955653153779
2023-11-22 05:52:12   INFO  epoch: 12/30, acc_iter=79094, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:49:40, time_cost(all): 21:27:41/1 day, 6:50:51, loss=0.470185927063601, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=4.444753352298993, lr=0.00061939757567723
2023-11-22 05:53:01   INFO  epoch: 12/30, acc_iter=79144, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:46:37, time_cost(all): 21:28:30/1 day, 9:44:34, loss=0.470102813077292, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=2.651313294420172, lr=0.000619076841495306
2023-11-22 05:53:50   INFO  epoch: 12/30, acc_iter=79194, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:50:13, time_cost(all): 21:29:19/1 day, 6:57:06, loss=0.470019699090983, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=0.8433404614278106, lr=0.000618756107313381
2023-11-22 05:54:39   INFO  epoch: 12/30, acc_iter=79244, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:43:29, time_cost(all): 21:30:08/1 day, 7:13:25, loss=0.469936585104674, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=4.299825487668744, lr=0.000618435373131456
2023-11-22 05:55:28   INFO  epoch: 12/30, acc_iter=79294, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:38:40, time_cost(all): 21:30:57/1 day, 7:51:56, loss=0.469853471118365, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=2.933276663762976, lr=0.000618114638949531
2023-11-22 05:56:17   INFO  epoch: 12/30, acc_iter=79344, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:40:29, time_cost(all): 21:31:46/1 day, 9:09:33, loss=0.469770357132056, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=4.116004304363026, lr=0.000617793904767607
2023-11-22 05:57:07   INFO  epoch: 12/30, acc_iter=79394, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:40:24, time_cost(all): 21:32:36/1 day, 8:48:10, loss=0.469687243145747, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=3.6110016990364415, lr=0.000617473170585682
2023-11-22 05:57:56   INFO  epoch: 12/30, acc_iter=79444, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:38:34, time_cost(all): 21:33:25/1 day, 8:40:39, loss=0.469604129159437, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=2.158534026200755, lr=0.000617152436403757
2023-11-22 05:58:45   INFO  epoch: 12/30, acc_iter=79494, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:36:25, time_cost(all): 21:34:14/1 day, 7:46:38, loss=0.469521015173128, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=2.6830233277579163, lr=0.000616831702221833
2023-11-22 05:59:34   INFO  epoch: 12/30, acc_iter=79544, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:34:40, time_cost(all): 21:35:03/1 day, 9:22:31, loss=0.469437901186819, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=0.7939310558415769, lr=0.000616510968039908
2023-11-22 06:00:23   INFO  epoch: 12/30, acc_iter=79594, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:42:19, time_cost(all): 21:35:52/1 day, 9:36:58, loss=0.46935478720051, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=1.6619534245044167, lr=0.000616190233857983
2023-11-22 06:01:12   INFO  epoch: 12/30, acc_iter=79644, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:37:19, time_cost(all): 21:36:41/1 day, 9:01:34, loss=0.469271673214201, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=2.435699901154859, lr=0.000615869499676059
2023-11-22 06:02:01   INFO  epoch: 12/30, acc_iter=79694, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:32:46, time_cost(all): 21:37:30/1 day, 7:32:11, loss=0.469188559227892, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=1.298708888057628, lr=0.000615548765494134
2023-11-22 06:02:50   INFO  epoch: 12/30, acc_iter=79744, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:34:22, time_cost(all): 21:38:19/1 day, 6:51:30, loss=0.469105445241583, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=4.880957923552724, lr=0.000615228031312209
2023-11-22 06:03:39   INFO  epoch: 12/30, acc_iter=79794, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:39:46, time_cost(all): 21:39:08/1 day, 9:08:41, loss=0.469022331255274, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=2.602986355676102, lr=0.000614907297130284
2023-11-22 06:04:29   INFO  epoch: 12/30, acc_iter=79844, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:32:18, time_cost(all): 21:39:58/1 day, 9:38:04, loss=0.468939217268965, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=2.795295625244224, lr=0.00061458656294836
2023-11-22 06:05:18   INFO  epoch: 12/30, acc_iter=79894, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:34:57, time_cost(all): 21:40:47/1 day, 8:09:12, loss=0.468856103282656, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=2.2747221795108974, lr=0.000614265828766435
2023-11-22 06:06:07   INFO  epoch: 12/30, acc_iter=79944, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:32:13, time_cost(all): 21:41:36/1 day, 6:42:30, loss=0.468772989296347, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=1.0529553256277726, lr=0.00061394509458451
2023-11-22 06:06:56   INFO  epoch: 12/30, acc_iter=79994, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:36:00, time_cost(all): 21:42:25/1 day, 8:54:47, loss=0.468689875310038, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.294776835768554, lr=0.000613624360402586
2023-11-22 06:07:45   INFO  epoch: 12/30, acc_iter=80044, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:26:57, time_cost(all): 21:43:14/1 day, 8:45:00, loss=0.468606761323729, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=1.3640996309953595, lr=0.000613303626220661
2023-11-22 06:08:34   INFO  epoch: 12/30, acc_iter=80094, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:33:09, time_cost(all): 21:44:03/1 day, 7:07:45, loss=0.46852364733742, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=3.205318633134903, lr=0.000612982892038736
2023-11-22 06:09:23   INFO  epoch: 12/30, acc_iter=80144, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:26:26, time_cost(all): 21:44:52/1 day, 9:32:59, loss=0.468440533351111, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=2.9425997170428064, lr=0.000612662157856812
2023-11-22 06:10:12   INFO  epoch: 12/30, acc_iter=80194, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:29:51, time_cost(all): 21:45:41/1 day, 7:34:24, loss=0.468357419364801, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=3.2064553783530063, lr=0.000612341423674887
2023-11-22 06:11:02   INFO  epoch: 12/30, acc_iter=80244, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:31:24, time_cost(all): 21:46:31/1 day, 7:44:31, loss=0.468274305378492, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=2.7783712857741905, lr=0.000612020689492962
2023-11-22 06:11:51   INFO  epoch: 12/30, acc_iter=80294, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:24:08, time_cost(all): 21:47:20/1 day, 7:10:46, loss=0.468191191392183, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=2.6629466908457182, lr=0.000611699955311037
2023-11-22 06:12:40   INFO  epoch: 12/30, acc_iter=80344, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:28:31, time_cost(all): 21:48:09/1 day, 9:41:13, loss=0.468108077405874, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=3.701031096378845, lr=0.000611379221129113
2023-11-22 06:13:29   INFO  epoch: 12/30, acc_iter=80394, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:27:58, time_cost(all): 21:48:58/1 day, 8:03:21, loss=0.468024963419565, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=4.87130357622406, lr=0.000611058486947188
2023-11-22 06:14:18   INFO  epoch: 12/30, acc_iter=80444, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:21:22, time_cost(all): 21:49:47/1 day, 9:29:30, loss=0.467941849433256, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=2.1596980491465314, lr=0.000610737752765263
2023-11-22 06:15:07   INFO  epoch: 12/30, acc_iter=80494, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:23:30, time_cost(all): 21:50:36/1 day, 9:36:44, loss=0.467858735446947, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=4.329569536053226, lr=0.000610417018583339
2023-11-22 06:15:56   INFO  epoch: 12/30, acc_iter=80544, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:21:02, time_cost(all): 21:51:25/1 day, 9:06:45, loss=0.467775621460638, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=0.9901857656336543, lr=0.000610096284401414
2023-11-22 06:16:45   INFO  epoch: 12/30, acc_iter=80594, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:21:26, time_cost(all): 21:52:14/1 day, 9:10:34, loss=0.467692507474329, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=1.1396710442816236, lr=0.000609775550219489
2023-11-22 06:17:34   INFO  epoch: 12/30, acc_iter=80644, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:24:34, time_cost(all): 21:53:03/1 day, 6:45:51, loss=0.46760939348802, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=1.6599225281232493, lr=0.000609454816037565
2023-11-22 06:18:24   INFO  epoch: 12/30, acc_iter=80694, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:17:30, time_cost(all): 21:53:53/1 day, 9:19:47, loss=0.467526279501711, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=4.429327106078329, lr=0.00060913408185564
2023-11-22 06:19:13   INFO  epoch: 12/30, acc_iter=80744, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:19:37, time_cost(all): 21:54:42/1 day, 6:55:02, loss=0.467443165515402, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=4.423529309833285, lr=0.000608813347673715
2023-11-22 06:20:02   INFO  epoch: 12/30, acc_iter=80794, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:21:50, time_cost(all): 21:55:31/1 day, 9:14:39, loss=0.467360051529093, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=2.6526992243286367, lr=0.00060849261349179
2023-11-22 06:20:51   INFO  epoch: 12/30, acc_iter=80844, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:18:41, time_cost(all): 21:56:20/1 day, 8:22:28, loss=0.467276937542784, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=3.4904855338591503, lr=0.000608171879309866
2023-11-22 06:21:40   INFO  epoch: 12/30, acc_iter=80894, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:15:39, time_cost(all): 21:57:09/1 day, 8:33:43, loss=0.467193823556475, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=2.4581527179255582, lr=0.000607851145127941
2023-11-22 06:22:29   INFO  epoch: 12/30, acc_iter=80944, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:16:29, time_cost(all): 21:57:58/1 day, 6:24:54, loss=0.467110709570166, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=4.117841681486615, lr=0.000607530410946016
2023-11-22 06:23:18   INFO  epoch: 12/30, acc_iter=80994, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:18:24, time_cost(all): 21:58:47/1 day, 9:08:36, loss=0.467027595583856, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=3.967861175379059, lr=0.000607209676764091
2023-11-22 06:24:07   INFO  epoch: 12/30, acc_iter=81044, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:13:42, time_cost(all): 21:59:36/1 day, 8:08:12, loss=0.466944481597547, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=2.6224181993459643, lr=0.000606888942582167
2023-11-22 06:24:57   INFO  epoch: 12/30, acc_iter=81094, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:12:53, time_cost(all): 22:00:26/1 day, 9:10:38, loss=0.466861367611238, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=3.1451377701951815, lr=0.000606568208400242
2023-11-22 06:25:46   INFO  epoch: 12/30, acc_iter=81144, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:14:11, time_cost(all): 22:01:15/1 day, 9:20:20, loss=0.466778253624929, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=0.9659974953490811, lr=0.000606247474218317
2023-11-22 06:26:35   INFO  epoch: 12/30, acc_iter=81194, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:15:33, time_cost(all): 22:02:04/1 day, 6:53:27, loss=0.46669513963862, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=4.3549093798588, lr=0.000605926740036393
2023-11-22 06:27:24   INFO  epoch: 12/30, acc_iter=81244, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:13:48, time_cost(all): 22:02:53/1 day, 8:09:56, loss=0.466612025652311, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.49207165741476, lr=0.000605606005854468
2023-11-22 06:28:13   INFO  epoch: 12/30, acc_iter=81294, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:14:32, time_cost(all): 22:03:42/1 day, 6:37:37, loss=0.466528911666002, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=2.259549986933882, lr=0.000605285271672543
2023-11-22 06:29:02   INFO  epoch: 12/30, acc_iter=81344, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:09:59, time_cost(all): 22:04:31/1 day, 7:20:01, loss=0.466445797679693, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=4.224987900310527, lr=0.000604964537490619
2023-11-22 06:29:51   INFO  epoch: 12/30, acc_iter=81394, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:12:41, time_cost(all): 22:05:20/1 day, 6:22:03, loss=0.466362683693384, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=1.1922604886887618, lr=0.000604643803308694
2023-11-22 06:30:40   INFO  epoch: 12/30, acc_iter=81444, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:11:50, time_cost(all): 22:06:09/1 day, 6:17:30, loss=0.466279569707075, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=2.680417382063491, lr=0.000604323069126769
2023-11-22 06:31:29   INFO  epoch: 12/30, acc_iter=81494, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:09:37, time_cost(all): 22:06:58/1 day, 6:48:08, loss=0.466196455720766, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=0.968791665054305, lr=0.000604002334944844
2023-11-22 06:32:19   INFO  epoch: 12/30, acc_iter=81544, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:05:56, time_cost(all): 22:07:48/1 day, 8:12:24, loss=0.466113341734457, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=3.4058369177318237, lr=0.00060368160076292
2023-11-22 06:33:08   INFO  epoch: 12/30, acc_iter=81594, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:03:07, time_cost(all): 22:08:37/1 day, 7:19:36, loss=0.466030227748148, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=1.7653786710227592, lr=0.000603360866580995
2023-11-22 06:33:57   INFO  epoch: 12/30, acc_iter=81644, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:06:30, time_cost(all): 22:09:26/1 day, 7:17:33, loss=0.465947113761839, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=2.858796311730375, lr=0.00060304013239907
2023-11-22 06:34:46   INFO  epoch: 12/30, acc_iter=81694, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:05, time_cost(all): 22:10:15/1 day, 8:08:42, loss=0.46586399977553, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=1.0300922230927714, lr=0.000602719398217146
2023-11-22 06:35:35   INFO  epoch: 12/30, acc_iter=81744, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:05:09, time_cost(all): 22:11:04/1 day, 8:20:17, loss=0.465780885789221, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=4.026694863813683, lr=0.000602398664035221
2023-11-22 06:36:24   INFO  epoch: 12/30, acc_iter=81794, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:18, time_cost(all): 22:11:53/1 day, 9:11:44, loss=0.465697771802911, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.723334674465199, lr=0.000602077929853296
2023-11-22 06:37:13   INFO  epoch: 12/30, acc_iter=81844, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:00:06, time_cost(all): 22:12:42/1 day, 6:51:23, loss=0.465614657816602, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=2.821580359253243, lr=0.000601757195671372
2023-11-22 06:38:02   INFO  epoch: 12/30, acc_iter=81894, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:02:35, time_cost(all): 22:13:31/1 day, 6:56:43, loss=0.465531543830293, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=2.2936201742000812, lr=0.000601436461489447
2023-11-22 06:38:52   INFO  epoch: 12/30, acc_iter=81944, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:58:17, time_cost(all): 22:14:21/1 day, 8:31:08, loss=0.465448429843984, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=0.7333003275282597, lr=0.000601115727307522
2023-11-22 06:39:41   INFO  epoch: 12/30, acc_iter=81994, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:01:19, time_cost(all): 22:15:10/1 day, 6:37:03, loss=0.465365315857675, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=1.2065672326528627, lr=0.000600794993125597
2023-11-22 06:40:30   INFO  epoch: 12/30, acc_iter=82044, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:56:09, time_cost(all): 22:15:59/1 day, 7:34:36, loss=0.465282201871366, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=2.2569366384878204, lr=0.000600474258943673
2023-11-22 06:41:19   INFO  epoch: 12/30, acc_iter=82094, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:57:32, time_cost(all): 22:16:48/1 day, 8:03:51, loss=0.465199087885057, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=2.86878463279226, lr=0.000600153524761748
2023-11-22 06:42:08   INFO  epoch: 12/30, acc_iter=82144, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:57:16, time_cost(all): 22:17:37/1 day, 8:19:30, loss=0.465115973898748, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=2.980871833574303, lr=0.000599832790579823
2023-11-22 06:42:57   INFO  epoch: 12/30, acc_iter=82194, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:55:30, time_cost(all): 22:18:26/1 day, 6:08:14, loss=0.465032859912439, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=4.663411325095774, lr=0.000599512056397899
2023-11-22 06:43:46   INFO  epoch: 12/30, acc_iter=82244, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:56:05, time_cost(all): 22:19:15/1 day, 7:45:26, loss=0.46494974592613, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=2.722553503271275, lr=0.000599191322215974
2023-11-22 06:44:35   INFO  epoch: 12/30, acc_iter=82294, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:52:06, time_cost(all): 22:20:04/1 day, 8:16:22, loss=0.464866631939821, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=3.4914976865285015, lr=0.000598870588034049
2023-11-22 06:45:24   INFO  epoch: 12/30, acc_iter=82344, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:26, time_cost(all): 22:20:53/1 day, 6:31:04, loss=0.464783517953512, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=1.269505044098538, lr=0.000598549853852125
2023-11-22 06:46:14   INFO  epoch: 12/30, acc_iter=82394, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:53:00, time_cost(all): 22:21:43/1 day, 7:02:25, loss=0.464700403967203, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=3.449564539064686, lr=0.0005982291196702
2023-11-22 06:47:03   INFO  epoch: 12/30, acc_iter=82444, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:54:08, time_cost(all): 22:22:32/1 day, 6:06:25, loss=0.464617289980894, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=3.314238105946592, lr=0.000597908385488275
2023-11-22 06:47:52   INFO  epoch: 12/30, acc_iter=82494, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:39, time_cost(all): 22:23:21/1 day, 6:34:12, loss=0.464534175994585, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=4.206230289034824, lr=0.00059758765130635
2023-11-22 06:48:41   INFO  epoch: 12/30, acc_iter=82544, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:08, time_cost(all): 22:24:10/1 day, 9:02:19, loss=0.464451062008276, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=4.650239439402142, lr=0.000597266917124426
2023-11-22 06:49:30   INFO  epoch: 12/30, acc_iter=82594, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:50:55, time_cost(all): 22:24:59/1 day, 8:49:39, loss=0.464367948021966, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=2.475328189016933, lr=0.000596946182942501
2023-11-22 06:50:19   INFO  epoch: 12/30, acc_iter=82644, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:49:58, time_cost(all): 22:25:48/1 day, 8:53:07, loss=0.464284834035657, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=1.848872710155911, lr=0.000596625448760576
2023-11-22 06:51:08   INFO  epoch: 12/30, acc_iter=82694, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:47:07, time_cost(all): 22:26:37/1 day, 6:08:49, loss=0.464201720049348, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=3.309995987542988, lr=0.000596304714578652
2023-11-22 06:51:57   INFO  epoch: 12/30, acc_iter=82744, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:48:51, time_cost(all): 22:27:26/1 day, 8:45:02, loss=0.464118606063039, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=4.949890199511417, lr=0.000595983980396727
2023-11-22 06:52:47   INFO  epoch: 12/30, acc_iter=82794, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:46:12, time_cost(all): 22:28:16/1 day, 7:57:53, loss=0.46403549207673, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=4.535799027004797, lr=0.000595663246214802
2023-11-22 06:53:36   INFO  epoch: 12/30, acc_iter=82844, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:43:37, time_cost(all): 22:29:05/1 day, 8:36:09, loss=0.463952378090421, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=4.839576384594755, lr=0.000595342512032877
2023-11-22 06:54:25   INFO  epoch: 12/30, acc_iter=82894, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:43:38, time_cost(all): 22:29:54/1 day, 6:49:05, loss=0.463869264104112, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=1.6230276390507061, lr=0.000595021777850953
2023-11-22 06:55:14   INFO  epoch: 12/30, acc_iter=82944, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:50, time_cost(all): 22:30:43/1 day, 8:01:15, loss=0.463786150117803, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=1.854523000400889, lr=0.000594701043669028
2023-11-22 06:56:03   INFO  epoch: 12/30, acc_iter=82994, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:43:50, time_cost(all): 22:31:32/1 day, 8:32:56, loss=0.463703036131494, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=1.7760103336042508, lr=0.000594380309487103
2023-11-22 06:56:52   INFO  epoch: 12/30, acc_iter=83044, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:44:25, time_cost(all): 22:32:21/1 day, 7:28:37, loss=0.463619922145185, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.257889125984722, lr=0.000594059575305179
2023-11-22 06:57:41   INFO  epoch: 12/30, acc_iter=83094, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:41:22, time_cost(all): 22:33:10/1 day, 6:41:14, loss=0.463536808158876, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=1.9467633143422192, lr=0.000593738841123254
2023-11-22 06:58:30   INFO  epoch: 12/30, acc_iter=83144, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:38:50, time_cost(all): 22:33:59/1 day, 8:42:08, loss=0.463453694172567, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.44258379979656, lr=0.000593418106941329
2023-11-22 06:59:19   INFO  epoch: 12/30, acc_iter=83194, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:31, time_cost(all): 22:34:48/1 day, 7:29:36, loss=0.463370580186258, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=1.0600996593123133, lr=0.000593097372759405
2023-11-22 07:00:09   INFO  epoch: 12/30, acc_iter=83244, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:38:40, time_cost(all): 22:35:38/1 day, 6:57:12, loss=0.463287466199949, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=2.4403551573089395, lr=0.00059277663857748
2023-11-22 07:00:58   INFO  epoch: 12/30, acc_iter=83294, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:36:35, time_cost(all): 22:36:27/1 day, 7:07:05, loss=0.46320435221364, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=0.8926190360007917, lr=0.000592455904395555
2023-11-22 07:01:47   INFO  epoch: 12/30, acc_iter=83344, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:43, time_cost(all): 22:37:16/1 day, 8:24:28, loss=0.463121238227331, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=3.134396004222706, lr=0.00059213517021363
2023-11-22 07:02:36   INFO  epoch: 12/30, acc_iter=83394, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:59, time_cost(all): 22:38:05/1 day, 6:59:33, loss=0.463038124241021, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=2.934955132781579, lr=0.000591814436031706
2023-11-22 07:03:25   INFO  epoch: 12/30, acc_iter=83444, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:38, time_cost(all): 22:38:54/1 day, 6:54:06, loss=0.462955010254712, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=4.375103854911973, lr=0.000591493701849781
2023-11-22 07:04:14   INFO  epoch: 12/30, acc_iter=83494, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:40, time_cost(all): 22:39:43/1 day, 6:42:59, loss=0.462871896268403, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.23(1.03), norm=4.323184855583811, lr=0.000591172967667856
2023-11-22 07:05:03   INFO  epoch: 12/30, acc_iter=83544, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:28, time_cost(all): 22:40:32/1 day, 6:18:16, loss=0.462788782282094, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=4.157584450695093, lr=0.000590852233485932
2023-11-22 07:05:52   INFO  epoch: 12/30, acc_iter=83594, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:17, time_cost(all): 22:41:21/1 day, 7:57:38, loss=0.462705668295785, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=3.98873821427874, lr=0.000590531499304007
2023-11-22 07:06:42   INFO  epoch: 12/30, acc_iter=83644, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:10, time_cost(all): 22:42:11/1 day, 6:51:51, loss=0.462622554309476, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=1.1249841811378816, lr=0.000590210765122082
2023-11-22 07:07:31   INFO  epoch: 12/30, acc_iter=83694, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:59, time_cost(all): 22:43:00/1 day, 6:39:54, loss=0.462539440323167, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=3.3322589579173263, lr=0.000589890030940157
2023-11-22 07:08:20   INFO  epoch: 12/30, acc_iter=83744, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:32:14, time_cost(all): 22:43:49/1 day, 5:53:33, loss=0.462456326336858, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=4.004165230511017, lr=0.000589569296758233
2023-11-22 07:09:09   INFO  epoch: 12/30, acc_iter=83794, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:31:08, time_cost(all): 22:44:38/1 day, 7:35:36, loss=0.462373212350549, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=3.8247903535813954, lr=0.000589248562576308
2023-11-22 07:09:58   INFO  epoch: 12/30, acc_iter=83844, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:35, time_cost(all): 22:45:27/1 day, 8:10:25, loss=0.46229009836424, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=0.9784967545648031, lr=0.000588927828394383
2023-11-22 07:10:47   INFO  epoch: 12/30, acc_iter=83894, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:46, time_cost(all): 22:46:16/1 day, 8:26:40, loss=0.462206984377931, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=3.794356239019902, lr=0.000588607094212459
2023-11-22 07:11:36   INFO  epoch: 12/30, acc_iter=83944, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:27, time_cost(all): 22:47:05/1 day, 5:53:31, loss=0.462123870391622, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=0.6227545238884911, lr=0.000588286360030534
2023-11-22 07:12:25   INFO  epoch: 12/30, acc_iter=83994, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:43, time_cost(all): 22:47:54/1 day, 5:42:39, loss=0.462040756405313, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=2.971704907945105, lr=0.000587965625848609
2023-11-22 07:13:14   INFO  epoch: 12/30, acc_iter=84044, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:24:54, time_cost(all): 22:48:43/1 day, 7:44:30, loss=0.461957642419004, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=0.9260417584270428, lr=0.000587644891666685
2023-11-22 07:14:04   INFO  epoch: 12/30, acc_iter=84094, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:18, time_cost(all): 22:49:33/1 day, 6:33:57, loss=0.461874528432695, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=4.209540130826248, lr=0.00058732415748476
2023-11-22 07:14:53   INFO  epoch: 12/30, acc_iter=84144, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:10, time_cost(all): 22:50:22/1 day, 6:54:11, loss=0.461791414446385, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=2.6170277360422283, lr=0.000587003423302835
2023-11-22 07:15:42   INFO  epoch: 12/30, acc_iter=84194, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:49, time_cost(all): 22:51:11/1 day, 6:50:47, loss=0.461708300460076, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=1.7006144425625904, lr=0.00058668268912091
2023-11-22 07:16:31   INFO  epoch: 12/30, acc_iter=84244, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:21:38, time_cost(all): 22:52:00/1 day, 8:25:28, loss=0.461625186473767, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=2.2902972341320558, lr=0.000586361954938986
2023-11-22 07:17:20   INFO  epoch: 12/30, acc_iter=84294, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:58, time_cost(all): 22:52:49/1 day, 6:47:20, loss=0.461542072487458, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=0.7765785716935298, lr=0.000586041220757061
2023-11-22 07:18:09   INFO  epoch: 12/30, acc_iter=84344, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:57, time_cost(all): 22:53:38/1 day, 7:45:21, loss=0.461458958501149, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=1.2053639043796707, lr=0.000585720486575136
2023-11-22 07:18:58   INFO  epoch: 12/30, acc_iter=84394, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:21:03, time_cost(all): 22:54:27/1 day, 8:04:24, loss=0.46137584451484, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=4.9702200270390495, lr=0.000585399752393212
2023-11-22 07:19:47   INFO  epoch: 12/30, acc_iter=84444, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:32, time_cost(all): 22:55:16/1 day, 7:03:37, loss=0.461292730528531, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=1.0751470527832883, lr=0.000585079018211287
2023-11-22 07:20:37   INFO  epoch: 12/30, acc_iter=84494, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:37, time_cost(all): 22:56:06/1 day, 6:55:58, loss=0.461209616542222, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=3.062532941209831, lr=0.000584758284029362
2023-11-22 07:21:26   INFO  epoch: 12/30, acc_iter=84544, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:18, time_cost(all): 22:56:55/1 day, 6:43:10, loss=0.461126502555913, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=1.126792175219456, lr=0.000584437549847437
2023-11-22 07:22:15   INFO  epoch: 12/30, acc_iter=84594, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:28, time_cost(all): 22:57:44/1 day, 6:51:21, loss=0.461043388569604, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=3.360344266709389, lr=0.000584116815665513
2023-11-22 07:23:04   INFO  epoch: 12/30, acc_iter=84644, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:47, time_cost(all): 22:58:33/1 day, 8:23:46, loss=0.460960274583295, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=0.6338531772306486, lr=0.000583796081483588
2023-11-22 07:23:53   INFO  epoch: 12/30, acc_iter=84694, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:05, time_cost(all): 22:59:22/1 day, 5:55:20, loss=0.460877160596986, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=3.384658769790648, lr=0.000583475347301663
2023-11-22 07:24:42   INFO  epoch: 12/30, acc_iter=84744, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:40, time_cost(all): 23:00:11/1 day, 8:19:35, loss=0.460794046610677, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=0.5420258089299216, lr=0.000583154613119739
2023-11-22 07:25:31   INFO  epoch: 12/30, acc_iter=84794, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:42, time_cost(all): 23:01:00/1 day, 7:21:30, loss=0.460710932624368, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=1.4416589780499758, lr=0.000582833878937814
2023-11-22 07:26:20   INFO  epoch: 12/30, acc_iter=84844, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:47, time_cost(all): 23:01:49/1 day, 6:49:33, loss=0.460627818638059, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=3.65353836610335, lr=0.000582513144755889
2023-11-22 07:27:09   INFO  epoch: 12/30, acc_iter=84894, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:06, time_cost(all): 23:02:38/1 day, 6:37:26, loss=0.46054470465175, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=3.2795578489994055, lr=0.000582192410573965
2023-11-22 07:27:59   INFO  epoch: 12/30, acc_iter=84944, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:34, time_cost(all): 23:03:28/1 day, 6:16:26, loss=0.460461590665441, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=2.279541392479064, lr=0.00058187167639204
2023-11-22 07:28:48   INFO  epoch: 12/30, acc_iter=84994, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:22, time_cost(all): 23:04:17/1 day, 5:49:20, loss=0.460378476679131, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=4.06380041793948, lr=0.000581550942210115
2023-11-22 07:29:37   INFO  epoch: 12/30, acc_iter=85044, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:39, time_cost(all): 23:05:06/1 day, 7:50:56, loss=0.460295362692822, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=4.513858966106945, lr=0.00058123020802819
2023-11-22 07:30:26   INFO  epoch: 12/30, acc_iter=85094, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:51, time_cost(all): 23:05:55/1 day, 6:10:19, loss=0.460212248706513, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=1.0854080495032037, lr=0.000580909473846266
2023-11-22 07:31:15   INFO  epoch: 12/30, acc_iter=85144, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:03, time_cost(all): 23:06:44/1 day, 8:15:32, loss=0.460129134720204, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=3.727506654333314, lr=0.000580588739664341
2023-11-22 07:32:04   INFO  epoch: 12/30, acc_iter=85194, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:14, time_cost(all): 23:07:33/1 day, 6:01:56, loss=0.460046020733895, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=1.1023514417860534, lr=0.000580268005482416
2023-11-22 07:32:53   INFO  epoch: 12/30, acc_iter=85244, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:26, time_cost(all): 23:08:22/1 day, 7:58:42, loss=0.459962906747586, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=1.8833039651801728, lr=0.000579947271300492
2023-11-22 07:33:42   INFO  epoch: 12/30, acc_iter=85294, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:24, time_cost(all): 23:09:11/1 day, 7:48:44, loss=0.459879792761277, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=4.0824113055492735, lr=0.000579626537118567
2023-11-22 07:34:32   INFO  epoch: 12/30, acc_iter=85344, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:46, time_cost(all): 23:10:01/1 day, 6:08:00, loss=0.459796678774968, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=0.5867082097459505, lr=0.000579305802936642
2023-11-22 07:35:21   INFO  epoch: 12/30, acc_iter=85394, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:43, time_cost(all): 23:10:50/1 day, 6:37:31, loss=0.459713564788659, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=2.6061669469845548, lr=0.000578985068754717
2023-11-22 07:36:10   INFO  epoch: 12/30, acc_iter=85444, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:54, time_cost(all): 23:11:39/1 day, 8:11:13, loss=0.45963045080235, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.89(1.03), norm=3.0492451741401276, lr=0.000578664334572793
2023-11-22 07:36:59   INFO  epoch: 12/30, acc_iter=85494, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:18, time_cost(all): 23:12:28/1 day, 5:45:16, loss=0.459547336816041, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=2.7487821953228875, lr=0.000578343600390868
2023-11-22 07:37:48   INFO  epoch: 12/30, acc_iter=85544, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:26, time_cost(all): 23:13:17/1 day, 6:23:20, loss=0.459464222829732, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=2.2771989061931324, lr=0.000578022866208943
2023-11-22 07:38:37   INFO  epoch: 12/30, acc_iter=85594, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:36, time_cost(all): 23:14:06/1 day, 5:14:32, loss=0.459381108843423, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=3.1784864277956917, lr=0.000577702132027019
2023-11-22 07:39:26   INFO  epoch: 13/30, acc_iter=85681, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:42:16, time_cost(all): 23:14:55/1 day, 7:16:39, loss=0.459236490507245, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=3.78931530174325, lr=0.00057714405455047
2023-11-22 07:40:15   INFO  epoch: 13/30, acc_iter=85731, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:47:46, time_cost(all): 23:15:44/1 day, 5:56:32, loss=0.459153376520936, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=1.0313977365532045, lr=0.000576823320368545
2023-11-22 07:41:04   INFO  epoch: 13/30, acc_iter=85781, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:47:58, time_cost(all): 23:16:33/1 day, 6:08:18, loss=0.459070262534627, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=4.534159511288539, lr=0.00057650258618662
2023-11-22 07:41:54   INFO  epoch: 13/30, acc_iter=85831, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:40:03, time_cost(all): 23:17:23/1 day, 7:33:59, loss=0.458987148548318, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=1.7158800060313897, lr=0.000576181852004696
2023-11-22 07:42:43   INFO  epoch: 13/30, acc_iter=85881, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:39:27, time_cost(all): 23:18:12/1 day, 6:45:31, loss=0.458904034562009, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=3.998423543364399, lr=0.000575861117822771
2023-11-22 07:43:32   INFO  epoch: 13/30, acc_iter=85931, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:42:05, time_cost(all): 23:19:01/1 day, 6:29:29, loss=0.4588209205757, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=2.2601001204355504, lr=0.000575540383640846
2023-11-22 07:44:21   INFO  epoch: 13/30, acc_iter=85981, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:45:28, time_cost(all): 23:19:50/1 day, 8:00:20, loss=0.458737806589391, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=1.7197428728004138, lr=0.000575219649458921
2023-11-22 07:45:10   INFO  epoch: 13/30, acc_iter=86031, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:36:57, time_cost(all): 23:20:39/1 day, 6:56:54, loss=0.458654692603081, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=1.5169996860074408, lr=0.000574898915276997
2023-11-22 07:45:59   INFO  epoch: 13/30, acc_iter=86081, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:42:13, time_cost(all): 23:21:28/1 day, 7:56:06, loss=0.458571578616772, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=3.668296075368653, lr=0.000574578181095072
2023-11-22 07:46:48   INFO  epoch: 13/30, acc_iter=86131, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:42:55, time_cost(all): 23:22:17/1 day, 7:26:32, loss=0.458488464630463, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=1.0263330226328073, lr=0.000574257446913147
2023-11-22 07:47:37   INFO  epoch: 13/30, acc_iter=86181, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:38:34, time_cost(all): 23:23:06/1 day, 5:16:47, loss=0.458405350644154, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.7418526666512757, lr=0.000573936712731223
2023-11-22 07:48:26   INFO  epoch: 13/30, acc_iter=86231, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:40:49, time_cost(all): 23:23:55/1 day, 7:35:13, loss=0.458322236657845, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=1.5117872148041742, lr=0.000573615978549298
2023-11-22 07:49:16   INFO  epoch: 13/30, acc_iter=86281, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:35:44, time_cost(all): 23:24:45/1 day, 5:33:29, loss=0.458239122671536, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=1.0230419400152746, lr=0.000573295244367373
2023-11-22 07:50:05   INFO  epoch: 13/30, acc_iter=86331, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:38:33, time_cost(all): 23:25:34/1 day, 6:18:15, loss=0.458156008685227, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=2.21818330308281, lr=0.000572974510185449
2023-11-22 07:50:54   INFO  epoch: 13/30, acc_iter=86381, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:38:06, time_cost(all): 23:26:23/1 day, 6:41:08, loss=0.458072894698918, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=2.4077051366957196, lr=0.000572653776003524
2023-11-22 07:51:43   INFO  epoch: 13/30, acc_iter=86431, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:35:19, time_cost(all): 23:27:12/1 day, 7:13:16, loss=0.457989780712609, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=0.8804028977643641, lr=0.000572333041821599
2023-11-22 07:52:32   INFO  epoch: 13/30, acc_iter=86481, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:37:52, time_cost(all): 23:28:01/1 day, 6:25:50, loss=0.4579066667263, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=4.184415200964274, lr=0.000572012307639674
2023-11-22 07:53:21   INFO  epoch: 13/30, acc_iter=86531, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:37:18, time_cost(all): 23:28:50/1 day, 5:16:39, loss=0.457823552739991, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=2.240719605392548, lr=0.00057169157345775
2023-11-22 07:54:10   INFO  epoch: 13/30, acc_iter=86581, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:30:14, time_cost(all): 23:29:39/1 day, 5:53:16, loss=0.457740438753682, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.0550756110024924, lr=0.000571370839275825
2023-11-22 07:54:59   INFO  epoch: 13/30, acc_iter=86631, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:27:07, time_cost(all): 23:30:28/1 day, 7:51:17, loss=0.457657324767373, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=0.9721410152574607, lr=0.0005710501050939
2023-11-22 07:55:49   INFO  epoch: 13/30, acc_iter=86681, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:30:23, time_cost(all): 23:31:18/1 day, 5:16:52, loss=0.457574210781064, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=0.8449482592980506, lr=0.000570729370911976
2023-11-22 07:56:38   INFO  epoch: 13/30, acc_iter=86731, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:32:32, time_cost(all): 23:32:07/1 day, 6:34:18, loss=0.457491096794755, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=3.9757368901846393, lr=0.000570408636730051
2023-11-22 07:57:27   INFO  epoch: 13/30, acc_iter=86781, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:31:21, time_cost(all): 23:32:56/1 day, 5:55:51, loss=0.457407982808446, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=4.206611866942545, lr=0.000570087902548126
2023-11-22 07:58:16   INFO  epoch: 13/30, acc_iter=86831, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:28:25, time_cost(all): 23:33:45/1 day, 6:50:56, loss=0.457324868822137, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=4.429847827925002, lr=0.000569767168366202
2023-11-22 07:59:05   INFO  epoch: 13/30, acc_iter=86881, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:25:28, time_cost(all): 23:34:34/1 day, 5:15:58, loss=0.457241754835827, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=4.313589992980001, lr=0.000569446434184277
2023-11-22 07:59:54   INFO  epoch: 13/30, acc_iter=86931, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:24:03, time_cost(all): 23:35:23/1 day, 6:12:18, loss=0.457158640849518, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.827717042230958, lr=0.000569125700002352
2023-11-22 08:00:43   INFO  epoch: 13/30, acc_iter=86981, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:27:16, time_cost(all): 23:36:12/1 day, 7:42:45, loss=0.457075526863209, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=3.9291030479156097, lr=0.000568804965820427
2023-11-22 08:01:32   INFO  epoch: 13/30, acc_iter=87031, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:25:47, time_cost(all): 23:37:01/1 day, 7:20:39, loss=0.4569924128769, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=1.2137500740737184, lr=0.000568484231638503
2023-11-22 08:02:21   INFO  epoch: 13/30, acc_iter=87081, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:20:35, time_cost(all): 23:37:50/1 day, 6:42:41, loss=0.456909298890591, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=3.4076184282791244, lr=0.000568163497456578
2023-11-22 08:03:11   INFO  epoch: 13/30, acc_iter=87131, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:19:14, time_cost(all): 23:38:40/1 day, 6:09:35, loss=0.456826184904282, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=2.708394376608762, lr=0.000567842763274653
2023-11-22 08:04:00   INFO  epoch: 13/30, acc_iter=87181, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:21:25, time_cost(all): 23:39:29/1 day, 7:23:53, loss=0.456743070917973, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=2.18843931040996, lr=0.000567522029092729
2023-11-22 08:04:49   INFO  epoch: 13/30, acc_iter=87231, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:24:26, time_cost(all): 23:40:18/1 day, 6:45:10, loss=0.456659956931664, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=2.06875012482873, lr=0.000567201294910804
2023-11-22 08:05:38   INFO  epoch: 13/30, acc_iter=87281, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:22:33, time_cost(all): 23:41:07/1 day, 6:33:29, loss=0.456576842945355, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=0.7067081904775894, lr=0.000566880560728879
2023-11-22 08:06:27   INFO  epoch: 13/30, acc_iter=87331, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:20:33, time_cost(all): 23:41:56/1 day, 4:43:04, loss=0.456493728959046, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=3.5582386129640278, lr=0.000566559826546954
2023-11-22 08:07:16   INFO  epoch: 13/30, acc_iter=87381, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:21:51, time_cost(all): 23:42:45/1 day, 6:46:47, loss=0.456410614972737, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=4.1194225991476, lr=0.00056623909236503
2023-11-22 08:08:05   INFO  epoch: 13/30, acc_iter=87431, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:21:54, time_cost(all): 23:43:34/1 day, 6:31:29, loss=0.456327500986428, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=4.036258748642284, lr=0.000565918358183105
2023-11-22 08:08:54   INFO  epoch: 13/30, acc_iter=87481, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:15:05, time_cost(all): 23:44:23/1 day, 5:39:05, loss=0.456244387000119, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=2.544058586229356, lr=0.00056559762400118
2023-11-22 08:09:44   INFO  epoch: 13/30, acc_iter=87531, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:15:14, time_cost(all): 23:45:13/1 day, 6:21:52, loss=0.45616127301381, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=4.919930758035071, lr=0.000565276889819256
2023-11-22 08:10:33   INFO  epoch: 13/30, acc_iter=87581, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:35, time_cost(all): 23:46:02/1 day, 6:34:02, loss=0.456078159027501, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=4.909869587370729, lr=0.000564956155637331
2023-11-22 08:11:22   INFO  epoch: 13/30, acc_iter=87631, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:16:33, time_cost(all): 23:46:51/1 day, 4:56:27, loss=0.455995045041191, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=4.62126673747956, lr=0.000564635421455406
2023-11-22 08:12:11   INFO  epoch: 13/30, acc_iter=87681, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:12:15, time_cost(all): 23:47:40/1 day, 7:22:15, loss=0.455911931054882, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=2.3983802887133914, lr=0.000564314687273482
2023-11-22 08:13:00   INFO  epoch: 13/30, acc_iter=87731, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:15:55, time_cost(all): 23:48:29/1 day, 5:19:14, loss=0.455828817068573, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=0.8573894160037934, lr=0.000563993953091557
2023-11-22 08:13:49   INFO  epoch: 13/30, acc_iter=87781, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:09:12, time_cost(all): 23:49:18/1 day, 6:53:27, loss=0.455745703082264, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=2.060228052944356, lr=0.000563673218909632
2023-11-22 08:14:38   INFO  epoch: 13/30, acc_iter=87831, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:10:57, time_cost(all): 23:50:07/1 day, 5:22:12, loss=0.455662589095955, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=1.9340696613285386, lr=0.000563352484727707
2023-11-22 08:15:27   INFO  epoch: 13/30, acc_iter=87881, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:10:48, time_cost(all): 23:50:56/1 day, 6:57:49, loss=0.455579475109646, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=2.376698412281252, lr=0.000563031750545783
2023-11-22 08:16:16   INFO  epoch: 13/30, acc_iter=87931, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:06:49, time_cost(all): 23:51:45/1 day, 5:31:07, loss=0.455496361123337, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=2.959319223423533, lr=0.000562711016363858
2023-11-22 08:17:06   INFO  epoch: 13/30, acc_iter=87981, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:12:19, time_cost(all): 23:52:35/1 day, 7:03:56, loss=0.455413247137028, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=1.4993914622979567, lr=0.000562390282181933
2023-11-22 08:17:55   INFO  epoch: 13/30, acc_iter=88031, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:21, time_cost(all): 23:53:24/1 day, 7:31:57, loss=0.455330133150719, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=4.364423521799156, lr=0.000562069548000009
2023-11-22 08:18:44   INFO  epoch: 13/30, acc_iter=88081, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:07:05, time_cost(all): 23:54:13/1 day, 5:38:50, loss=0.45524701916441, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=2.6299944014504044, lr=0.000561748813818084
2023-11-22 08:19:33   INFO  epoch: 13/30, acc_iter=88131, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:04:07, time_cost(all): 23:55:02/1 day, 5:50:56, loss=0.455163905178101, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=2.6122100815744735, lr=0.000561428079636159
2023-11-22 08:20:22   INFO  epoch: 13/30, acc_iter=88181, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:08:34, time_cost(all): 23:55:51/1 day, 4:59:01, loss=0.455080791191792, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=1.324164348108864, lr=0.000561107345454234
2023-11-22 08:21:11   INFO  epoch: 13/30, acc_iter=88231, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:02:17, time_cost(all): 23:56:40/1 day, 6:21:07, loss=0.454997677205483, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=2.0768802306924563, lr=0.00056078661127231
2023-11-22 08:22:00   INFO  epoch: 13/30, acc_iter=88281, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:04:36, time_cost(all): 23:57:29/1 day, 6:45:12, loss=0.454914563219174, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=2.2351128284257253, lr=0.000560465877090385
2023-11-22 08:22:49   INFO  epoch: 13/30, acc_iter=88331, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:05:39, time_cost(all): 23:58:18/1 day, 5:16:56, loss=0.454831449232865, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=1.1082949791047931, lr=0.00056014514290846
2023-11-22 08:23:39   INFO  epoch: 13/30, acc_iter=88381, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:03:57, time_cost(all): 23:59:08/1 day, 4:52:35, loss=0.454748335246556, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=1.102079405694223, lr=0.000559824408726536
2023-11-22 08:24:28   INFO  epoch: 13/30, acc_iter=88431, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:01:46, time_cost(all): 23:59:57/1 day, 5:24:53, loss=0.454665221260246, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=4.769619478179708, lr=0.000559503674544611
2023-11-22 08:25:17   INFO  epoch: 13/30, acc_iter=88481, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:03:43, time_cost(all): 1 day, 0:00:46/1 day, 5:30:57, loss=0.454582107273937, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=0.6417532349546266, lr=0.000559182940362686
2023-11-22 08:26:06   INFO  epoch: 13/30, acc_iter=88531, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:57:29, time_cost(all): 1 day, 0:01:35/1 day, 4:57:40, loss=0.454498993287628, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=4.663463496637322, lr=0.000558862206180762
2023-11-22 08:26:55   INFO  epoch: 13/30, acc_iter=88581, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:00:33, time_cost(all): 1 day, 0:02:24/1 day, 5:37:04, loss=0.454415879301319, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=0.8156754917865547, lr=0.000558541471998837
2023-11-22 08:27:44   INFO  epoch: 13/30, acc_iter=88631, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:59:07, time_cost(all): 1 day, 0:03:13/1 day, 5:01:10, loss=0.45433276531501, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=1.385295896470192, lr=0.000558220737816912
2023-11-22 08:28:33   INFO  epoch: 13/30, acc_iter=88681, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:59:53, time_cost(all): 1 day, 0:04:02/1 day, 5:42:30, loss=0.454249651328701, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=4.171623339934458, lr=0.000557900003634987
2023-11-22 08:29:22   INFO  epoch: 13/30, acc_iter=88731, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:57:10, time_cost(all): 1 day, 0:04:51/1 day, 6:18:39, loss=0.454166537342392, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=0.7245972073024698, lr=0.000557579269453063
2023-11-22 08:30:11   INFO  epoch: 13/30, acc_iter=88781, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:35, time_cost(all): 1 day, 0:05:40/1 day, 7:15:07, loss=0.454083423356083, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=0.6531837605354758, lr=0.000557258535271138
2023-11-22 08:31:01   INFO  epoch: 13/30, acc_iter=88831, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:56:35, time_cost(all): 1 day, 0:06:30/1 day, 5:30:25, loss=0.454000309369774, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=0.9101689163166972, lr=0.000556937801089213
2023-11-22 08:31:50   INFO  epoch: 13/30, acc_iter=88881, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:52:25, time_cost(all): 1 day, 0:07:19/1 day, 5:12:27, loss=0.453917195383465, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=2.7692491859290587, lr=0.000556617066907289
2023-11-22 08:32:39   INFO  epoch: 13/30, acc_iter=88931, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:02, time_cost(all): 1 day, 0:08:08/1 day, 5:53:29, loss=0.453834081397156, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=3.844519687598753, lr=0.000556296332725364
2023-11-22 08:33:28   INFO  epoch: 13/30, acc_iter=88981, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:52:58, time_cost(all): 1 day, 0:08:57/1 day, 7:08:12, loss=0.453750967410847, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=1.2038274282216543, lr=0.000555975598543439
2023-11-22 08:34:17   INFO  epoch: 13/30, acc_iter=89031, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:54:25, time_cost(all): 1 day, 0:09:46/1 day, 6:17:05, loss=0.453667853424538, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=3.970015871042687, lr=0.000555654864361514
2023-11-22 08:35:06   INFO  epoch: 13/30, acc_iter=89081, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:51:42, time_cost(all): 1 day, 0:10:35/1 day, 6:42:44, loss=0.453584739438229, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=3.348719312658235, lr=0.00055533413017959
2023-11-22 08:35:55   INFO  epoch: 13/30, acc_iter=89131, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:19, time_cost(all): 1 day, 0:11:24/1 day, 6:10:02, loss=0.45350162545192, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=0.7327690148159298, lr=0.000555013395997665
2023-11-22 08:36:44   INFO  epoch: 13/30, acc_iter=89181, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:46, time_cost(all): 1 day, 0:12:13/1 day, 5:25:48, loss=0.453418511465611, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=4.053194585174349, lr=0.00055469266181574
2023-11-22 08:37:34   INFO  epoch: 13/30, acc_iter=89231, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:46:30, time_cost(all): 1 day, 0:13:03/1 day, 4:24:54, loss=0.453335397479301, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=4.062407975551814, lr=0.000554371927633816
2023-11-22 08:38:23   INFO  epoch: 13/30, acc_iter=89281, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:47:56, time_cost(all): 1 day, 0:13:52/1 day, 4:43:36, loss=0.453252283492992, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=2.1590018977568466, lr=0.000554051193451891
2023-11-22 08:39:12   INFO  epoch: 13/30, acc_iter=89331, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:57, time_cost(all): 1 day, 0:14:41/1 day, 6:11:14, loss=0.453169169506683, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=2.895999279373556, lr=0.000553730459269966
2023-11-22 08:40:01   INFO  epoch: 13/30, acc_iter=89381, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:56, time_cost(all): 1 day, 0:15:30/1 day, 6:51:59, loss=0.453086055520374, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=2.319836734302328, lr=0.000553409725088042
2023-11-22 08:40:50   INFO  epoch: 13/30, acc_iter=89431, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:45:46, time_cost(all): 1 day, 0:16:19/1 day, 5:34:27, loss=0.453002941534065, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=2.7741261703684823, lr=0.000553088990906117
2023-11-22 08:41:39   INFO  epoch: 13/30, acc_iter=89481, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:44:18, time_cost(all): 1 day, 0:17:08/1 day, 7:00:30, loss=0.452919827547756, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=3.090166432514539, lr=0.000552768256724192
2023-11-22 08:42:28   INFO  epoch: 13/30, acc_iter=89531, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:15, time_cost(all): 1 day, 0:17:57/1 day, 5:10:38, loss=0.452836713561447, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=1.361541701711926, lr=0.000552447522542267
2023-11-22 08:43:17   INFO  epoch: 13/30, acc_iter=89581, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:45:15, time_cost(all): 1 day, 0:18:46/1 day, 5:02:52, loss=0.452753599575138, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=4.486301098700714, lr=0.000552126788360343
2023-11-22 08:44:06   INFO  epoch: 13/30, acc_iter=89631, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:40:38, time_cost(all): 1 day, 0:19:35/1 day, 4:43:19, loss=0.452670485588829, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=0.8005764980429144, lr=0.000551806054178418
2023-11-22 08:44:56   INFO  epoch: 13/30, acc_iter=89681, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:01, time_cost(all): 1 day, 0:20:25/1 day, 5:39:51, loss=0.45258737160252, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=1.7290300195213057, lr=0.000551485319996493
2023-11-22 08:45:45   INFO  epoch: 13/30, acc_iter=89731, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:40:13, time_cost(all): 1 day, 0:21:14/1 day, 4:37:00, loss=0.452504257616211, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=3.2022069158430257, lr=0.000551164585814569
2023-11-22 08:46:34   INFO  epoch: 13/30, acc_iter=89781, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:40:42, time_cost(all): 1 day, 0:22:03/1 day, 5:06:59, loss=0.452421143629902, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=0.5264366640554488, lr=0.000550843851632644
2023-11-22 08:47:23   INFO  epoch: 13/30, acc_iter=89831, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:31, time_cost(all): 1 day, 0:22:52/1 day, 4:48:01, loss=0.452338029643593, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=2.7235819398278758, lr=0.000550523117450719
2023-11-22 08:48:12   INFO  epoch: 13/30, acc_iter=89881, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:05, time_cost(all): 1 day, 0:23:41/1 day, 5:42:04, loss=0.452254915657284, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=3.517347867999639, lr=0.000550202383268795
2023-11-22 08:49:01   INFO  epoch: 13/30, acc_iter=89931, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:50, time_cost(all): 1 day, 0:24:30/1 day, 4:46:53, loss=0.452171801670975, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.23(1.03), norm=2.582659141727876, lr=0.00054988164908687
2023-11-22 08:49:50   INFO  epoch: 13/30, acc_iter=89981, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:35:37, time_cost(all): 1 day, 0:25:19/1 day, 6:23:37, loss=0.452088687684665, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=4.572033429960503, lr=0.000549560914904945
2023-11-22 08:50:39   INFO  epoch: 13/30, acc_iter=90031, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:39, time_cost(all): 1 day, 0:26:08/1 day, 4:33:05, loss=0.452005573698356, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=2.4411287115451614, lr=0.00054924018072302
2023-11-22 08:51:29   INFO  epoch: 13/30, acc_iter=90081, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:33:55, time_cost(all): 1 day, 0:26:58/1 day, 6:09:34, loss=0.451922459712047, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=4.467720502648584, lr=0.000548919446541096
2023-11-22 08:52:18   INFO  epoch: 13/30, acc_iter=90131, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:03, time_cost(all): 1 day, 0:27:47/1 day, 5:00:46, loss=0.451839345725738, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=4.008841395684015, lr=0.000548598712359171
2023-11-22 08:53:07   INFO  epoch: 13/30, acc_iter=90181, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:01, time_cost(all): 1 day, 0:28:36/1 day, 4:19:57, loss=0.451756231739429, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=2.2510105286517628, lr=0.000548277978177246
2023-11-22 08:53:56   INFO  epoch: 13/30, acc_iter=90231, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:19, time_cost(all): 1 day, 0:29:25/1 day, 4:02:52, loss=0.45167311775312, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=0.69091034750929, lr=0.000547957243995322
2023-11-22 08:54:45   INFO  epoch: 13/30, acc_iter=90281, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:32:26, time_cost(all): 1 day, 0:30:14/1 day, 5:02:21, loss=0.451590003766811, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.4602741228940126, lr=0.000547636509813397
2023-11-22 08:55:34   INFO  epoch: 13/30, acc_iter=90331, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:29:50, time_cost(all): 1 day, 0:31:03/1 day, 4:14:14, loss=0.451506889780502, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=4.010668373623968, lr=0.000547315775631472
2023-11-22 08:56:23   INFO  epoch: 13/30, acc_iter=90381, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:31:28, time_cost(all): 1 day, 0:31:52/1 day, 4:38:03, loss=0.451423775794193, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=3.0619778553113735, lr=0.000546995041449548
2023-11-22 08:57:12   INFO  epoch: 13/30, acc_iter=90431, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:28:18, time_cost(all): 1 day, 0:32:41/1 day, 5:57:28, loss=0.451340661807884, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=0.726412719344485, lr=0.000546674307267623
2023-11-22 08:58:01   INFO  epoch: 13/30, acc_iter=90481, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:17, time_cost(all): 1 day, 0:33:30/1 day, 6:15:05, loss=0.451257547821575, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=0.6798009460697014, lr=0.000546353573085698
2023-11-22 08:58:51   INFO  epoch: 13/30, acc_iter=90531, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:30, time_cost(all): 1 day, 0:34:20/1 day, 6:02:08, loss=0.451174433835266, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=4.003361854886968, lr=0.000546032838903773
2023-11-22 08:59:40   INFO  epoch: 13/30, acc_iter=90581, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:25:49, time_cost(all): 1 day, 0:35:09/1 day, 6:21:09, loss=0.451091319848957, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=4.94084353589585, lr=0.000545712104721849
2023-11-22 09:00:29   INFO  epoch: 13/30, acc_iter=90631, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:07, time_cost(all): 1 day, 0:35:58/1 day, 5:19:37, loss=0.451008205862648, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=4.579325362606172, lr=0.000545391370539924
2023-11-22 09:01:18   INFO  epoch: 13/30, acc_iter=90681, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:21, time_cost(all): 1 day, 0:36:47/1 day, 5:38:32, loss=0.450925091876339, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=2.554846941109836, lr=0.000545070636357999
2023-11-22 09:02:07   INFO  epoch: 13/30, acc_iter=90731, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:44, time_cost(all): 1 day, 0:37:36/1 day, 5:15:02, loss=0.45084197789003, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=4.176473533873041, lr=0.000544749902176074
2023-11-22 09:02:56   INFO  epoch: 13/30, acc_iter=90781, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:39, time_cost(all): 1 day, 0:38:25/1 day, 5:51:57, loss=0.450758863903721, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=2.698994066294127, lr=0.00054442916799415
2023-11-22 09:03:45   INFO  epoch: 13/30, acc_iter=90831, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:50, time_cost(all): 1 day, 0:39:14/1 day, 6:00:46, loss=0.450675749917411, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=3.207793095875537, lr=0.000544108433812225
2023-11-22 09:04:34   INFO  epoch: 13/30, acc_iter=90881, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:20:50, time_cost(all): 1 day, 0:40:03/1 day, 4:03:31, loss=0.450592635931102, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=2.3521328141796523, lr=0.000543787699630301
2023-11-22 09:05:24   INFO  epoch: 13/30, acc_iter=90931, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:48, time_cost(all): 1 day, 0:40:53/1 day, 4:19:02, loss=0.450509521944793, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=0.6655781272478558, lr=0.000543466965448376
2023-11-22 09:06:13   INFO  epoch: 13/30, acc_iter=90981, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:47, time_cost(all): 1 day, 0:41:42/1 day, 5:53:30, loss=0.450426407958484, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=3.7059914958588074, lr=0.000543146231266451
2023-11-22 09:07:02   INFO  epoch: 13/30, acc_iter=91031, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:59, time_cost(all): 1 day, 0:42:31/1 day, 6:36:22, loss=0.450343293972175, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=3.826996166091119, lr=0.000542825497084526
2023-11-22 09:07:51   INFO  epoch: 13/30, acc_iter=91081, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:29, time_cost(all): 1 day, 0:43:20/1 day, 5:09:49, loss=0.450260179985866, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=3.3484980121599275, lr=0.000542504762902602
2023-11-22 09:08:40   INFO  epoch: 13/30, acc_iter=91131, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:45, time_cost(all): 1 day, 0:44:09/1 day, 4:01:55, loss=0.450177065999557, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=3.8048434941339573, lr=0.000542184028720677
2023-11-22 09:09:29   INFO  epoch: 13/30, acc_iter=91181, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:36, time_cost(all): 1 day, 0:44:58/1 day, 5:56:42, loss=0.450093952013248, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=2.176788495370841, lr=0.000541863294538752
2023-11-22 09:10:18   INFO  epoch: 13/30, acc_iter=91231, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:48, time_cost(all): 1 day, 0:45:47/1 day, 5:07:43, loss=0.450010838026939, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=1.3637327285303902, lr=0.000541542560356827
2023-11-22 09:11:07   INFO  epoch: 13/30, acc_iter=91281, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:16:03, time_cost(all): 1 day, 0:46:36/1 day, 4:54:01, loss=0.44992772404063, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.03(1.03), norm=2.83512216886254, lr=0.000541221826174903
2023-11-22 09:11:56   INFO  epoch: 13/30, acc_iter=91331, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:38, time_cost(all): 1 day, 0:47:25/1 day, 6:09:53, loss=0.449844610054321, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=3.9481014500858276, lr=0.000540901091992978
2023-11-22 09:12:46   INFO  epoch: 13/30, acc_iter=91381, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:14:10, time_cost(all): 1 day, 0:48:15/1 day, 5:26:27, loss=0.449761496068012, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=3.8444362302339194, lr=0.000540580357811053
2023-11-22 09:13:35   INFO  epoch: 13/30, acc_iter=91431, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:46, time_cost(all): 1 day, 0:49:04/1 day, 6:22:50, loss=0.449678382081703, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=2.4282465754733367, lr=0.000540259623629129
2023-11-22 09:14:24   INFO  epoch: 13/30, acc_iter=91481, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:02, time_cost(all): 1 day, 0:49:53/1 day, 3:44:38, loss=0.449595268095394, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.126871943486033, lr=0.000539938889447204
2023-11-22 09:15:13   INFO  epoch: 13/30, acc_iter=91531, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:51, time_cost(all): 1 day, 0:50:42/1 day, 4:17:06, loss=0.449512154109085, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=0.9605292491641709, lr=0.000539618155265279
2023-11-22 09:16:02   INFO  epoch: 13/30, acc_iter=91581, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:10, time_cost(all): 1 day, 0:51:31/1 day, 4:29:24, loss=0.449429040122776, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=4.2280214181048255, lr=0.000539297421083355
2023-11-22 09:16:51   INFO  epoch: 13/30, acc_iter=91631, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:56, time_cost(all): 1 day, 0:52:20/1 day, 5:01:40, loss=0.449345926136466, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=4.248547366871078, lr=0.00053897668690143
2023-11-22 09:17:40   INFO  epoch: 13/30, acc_iter=91681, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:09:03, time_cost(all): 1 day, 0:53:09/1 day, 4:52:29, loss=0.449262812150157, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.2035973797042825, lr=0.000538655952719505
2023-11-22 09:18:29   INFO  epoch: 13/30, acc_iter=91731, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:41, time_cost(all): 1 day, 0:53:58/1 day, 6:02:54, loss=0.449179698163848, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=0.619546838010677, lr=0.00053833521853758
2023-11-22 09:19:19   INFO  epoch: 13/30, acc_iter=91781, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:06:49, time_cost(all): 1 day, 0:54:48/1 day, 5:42:40, loss=0.449096584177539, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=2.834690241938132, lr=0.000538014484355656
2023-11-22 09:20:08   INFO  epoch: 13/30, acc_iter=91831, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:28, time_cost(all): 1 day, 0:55:37/1 day, 3:36:18, loss=0.44901347019123, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=3.761204222175225, lr=0.000537693750173731
2023-11-22 09:20:57   INFO  epoch: 13/30, acc_iter=91881, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:16, time_cost(all): 1 day, 0:56:26/1 day, 4:56:44, loss=0.448930356204921, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=1.591558498871387, lr=0.000537373015991806
2023-11-22 09:21:46   INFO  epoch: 13/30, acc_iter=91931, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:35, time_cost(all): 1 day, 0:57:15/1 day, 4:43:03, loss=0.448847242218612, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=4.716934570207083, lr=0.000537052281809882
2023-11-22 09:22:35   INFO  epoch: 13/30, acc_iter=91981, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:04:00, time_cost(all): 1 day, 0:58:04/1 day, 4:05:54, loss=0.448764128232303, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=3.8676821617689794, lr=0.000536731547627957
2023-11-22 09:23:24   INFO  epoch: 13/30, acc_iter=92031, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:11, time_cost(all): 1 day, 0:58:53/1 day, 6:03:26, loss=0.448681014245994, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=4.144928925774185, lr=0.000536410813446032
2023-11-22 09:24:13   INFO  epoch: 13/30, acc_iter=92081, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:21, time_cost(all): 1 day, 0:59:42/1 day, 3:40:01, loss=0.448597900259685, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=3.4515148682784553, lr=0.000536090079264108
2023-11-22 09:25:02   INFO  epoch: 13/30, acc_iter=92131, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:26, time_cost(all): 1 day, 1:00:31/1 day, 5:22:19, loss=0.448514786273376, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=2.3905018595450196, lr=0.000535769345082183
2023-11-22 09:25:51   INFO  epoch: 13/30, acc_iter=92181, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:36, time_cost(all): 1 day, 1:01:20/1 day, 3:52:24, loss=0.448431672287067, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=0.8895043727072091, lr=0.000535448610900258
2023-11-22 09:26:41   INFO  epoch: 14/30, acc_iter=92268, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:43:40, time_cost(all): 1 day, 1:02:10/1 day, 5:04:47, loss=0.448287053950889, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=2.2716651793190983, lr=0.000534890533423709
2023-11-22 09:27:30   INFO  epoch: 14/30, acc_iter=92318, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:50:07, time_cost(all): 1 day, 1:02:59/1 day, 5:42:09, loss=0.44820393996458, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=3.252167297752515, lr=0.000534569799241784
2023-11-22 09:28:19   INFO  epoch: 14/30, acc_iter=92368, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:40:25, time_cost(all): 1 day, 1:03:48/1 day, 5:06:44, loss=0.448120825978271, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=0.8647268308578696, lr=0.00053424906505986
2023-11-22 09:29:08   INFO  epoch: 14/30, acc_iter=92418, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:48:41, time_cost(all): 1 day, 1:04:37/1 day, 4:34:04, loss=0.448037711991962, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=4.168599488270357, lr=0.000533928330877935
2023-11-22 09:29:57   INFO  epoch: 14/30, acc_iter=92468, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:40:59, time_cost(all): 1 day, 1:05:26/1 day, 5:22:07, loss=0.447954598005653, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=4.693748978472734, lr=0.00053360759669601
2023-11-22 09:30:46   INFO  epoch: 14/30, acc_iter=92518, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:40:42, time_cost(all): 1 day, 1:06:15/1 day, 6:11:52, loss=0.447871484019344, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=3.577319424428255, lr=0.000533286862514086
2023-11-22 09:31:35   INFO  epoch: 14/30, acc_iter=92568, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:41:46, time_cost(all): 1 day, 1:07:04/1 day, 3:46:30, loss=0.447788370033035, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.02(1.03), norm=4.464282670004797, lr=0.000532966128332161
2023-11-22 09:32:24   INFO  epoch: 14/30, acc_iter=92618, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:42:02, time_cost(all): 1 day, 1:07:53/1 day, 5:15:47, loss=0.447705256046726, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=1.1877724821202962, lr=0.000532645394150236
2023-11-22 09:33:14   INFO  epoch: 14/30, acc_iter=92668, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:36:41, time_cost(all): 1 day, 1:08:43/1 day, 4:09:30, loss=0.447622142060417, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=3.331890031104755, lr=0.000532324659968312
2023-11-22 09:34:03   INFO  epoch: 14/30, acc_iter=92718, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:43:40, time_cost(all): 1 day, 1:09:32/1 day, 6:01:53, loss=0.447539028074107, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=0.5727770695081864, lr=0.000532003925786387
2023-11-22 09:34:52   INFO  epoch: 14/30, acc_iter=92768, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:37:55, time_cost(all): 1 day, 1:10:21/1 day, 4:42:16, loss=0.447455914087798, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=1.400917092940618, lr=0.000531683191604462
2023-11-22 09:35:41   INFO  epoch: 14/30, acc_iter=92818, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:38:10, time_cost(all): 1 day, 1:11:10/1 day, 4:57:25, loss=0.447372800101489, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=0.8942661771218094, lr=0.000531362457422537
2023-11-22 09:36:30   INFO  epoch: 14/30, acc_iter=92868, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:41:45, time_cost(all): 1 day, 1:11:59/1 day, 3:25:42, loss=0.44728968611518, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=0.8230941188923859, lr=0.000531041723240613
2023-11-22 09:37:19   INFO  epoch: 14/30, acc_iter=92918, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:38:48, time_cost(all): 1 day, 1:12:48/1 day, 3:24:16, loss=0.447206572128871, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=0.7525714969277841, lr=0.000530720989058688
2023-11-22 09:38:08   INFO  epoch: 14/30, acc_iter=92968, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:36:42, time_cost(all): 1 day, 1:13:37/1 day, 5:22:47, loss=0.447123458142562, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=4.607632387536784, lr=0.000530400254876763
2023-11-22 09:38:57   INFO  epoch: 14/30, acc_iter=93018, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:38:43, time_cost(all): 1 day, 1:14:26/1 day, 3:56:28, loss=0.447040344156253, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=3.7929155071668212, lr=0.000530079520694839
2023-11-22 09:39:46   INFO  epoch: 14/30, acc_iter=93068, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:35:14, time_cost(all): 1 day, 1:15:15/1 day, 5:52:21, loss=0.446957230169944, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.05(1.03), norm=3.6038293487268307, lr=0.000529758786512914
2023-11-22 09:40:36   INFO  epoch: 14/30, acc_iter=93118, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:30:28, time_cost(all): 1 day, 1:16:05/1 day, 5:09:55, loss=0.446874116183635, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=2.1859379838259536, lr=0.000529438052330989
2023-11-22 09:41:25   INFO  epoch: 14/30, acc_iter=93168, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:34:55, time_cost(all): 1 day, 1:16:54/1 day, 5:00:20, loss=0.446791002197326, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=3.9638855036182585, lr=0.000529117318149065
2023-11-22 09:42:14   INFO  epoch: 14/30, acc_iter=93218, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:35:25, time_cost(all): 1 day, 1:17:43/1 day, 3:40:42, loss=0.446707888211017, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=1.9965949410621417, lr=0.00052879658396714
2023-11-22 09:43:03   INFO  epoch: 14/30, acc_iter=93268, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:28:19, time_cost(all): 1 day, 1:18:32/1 day, 3:43:24, loss=0.446624774224708, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=3.908324008363779, lr=0.000528475849785215
2023-11-22 09:43:52   INFO  epoch: 14/30, acc_iter=93318, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:28:49, time_cost(all): 1 day, 1:19:21/1 day, 5:27:57, loss=0.446541660238399, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=3.1056031285673997, lr=0.00052815511560329
2023-11-22 09:44:41   INFO  epoch: 14/30, acc_iter=93368, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:30:45, time_cost(all): 1 day, 1:20:10/1 day, 3:56:03, loss=0.44645854625209, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=0.5039205443815515, lr=0.000527834381421366
2023-11-22 09:45:30   INFO  epoch: 14/30, acc_iter=93418, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:25:39, time_cost(all): 1 day, 1:20:59/1 day, 5:46:12, loss=0.446375432265781, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=4.769320450884285, lr=0.000527513647239441
2023-11-22 09:46:19   INFO  epoch: 14/30, acc_iter=93468, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:30:48, time_cost(all): 1 day, 1:21:48/1 day, 3:16:54, loss=0.446292318279471, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=3.526355959083758, lr=0.000527192913057516
2023-11-22 09:47:08   INFO  epoch: 14/30, acc_iter=93518, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:27:40, time_cost(all): 1 day, 1:22:37/1 day, 4:40:54, loss=0.446209204293162, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=2.260945660010058, lr=0.000526872178875591
2023-11-22 09:47:58   INFO  epoch: 14/30, acc_iter=93568, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:28:42, time_cost(all): 1 day, 1:23:27/1 day, 4:05:06, loss=0.446126090306853, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=0.9829849658539467, lr=0.000526551444693667
2023-11-22 09:48:47   INFO  epoch: 14/30, acc_iter=93618, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:26:09, time_cost(all): 1 day, 1:24:16/1 day, 5:06:07, loss=0.446042976320544, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=2.973704289778594, lr=0.000526230710511742
2023-11-22 09:49:36   INFO  epoch: 14/30, acc_iter=93668, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:20:19, time_cost(all): 1 day, 1:25:05/1 day, 5:05:49, loss=0.445959862334235, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=4.585640549624524, lr=0.000525909976329818
2023-11-22 09:50:25   INFO  epoch: 14/30, acc_iter=93718, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:26:46, time_cost(all): 1 day, 1:25:54/1 day, 5:08:12, loss=0.445876748347926, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=4.560741699782927, lr=0.000525589242147893
2023-11-22 09:51:14   INFO  epoch: 14/30, acc_iter=93768, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:26:24, time_cost(all): 1 day, 1:26:43/1 day, 4:04:26, loss=0.445793634361617, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=2.5639293141612094, lr=0.000525268507965968
2023-11-22 09:52:03   INFO  epoch: 14/30, acc_iter=93818, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:18:09, time_cost(all): 1 day, 1:27:32/1 day, 5:26:04, loss=0.445710520375308, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=4.5858206545341735, lr=0.000524947773784043
2023-11-22 09:52:52   INFO  epoch: 14/30, acc_iter=93868, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:22:18, time_cost(all): 1 day, 1:28:21/1 day, 5:19:49, loss=0.445627406388999, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=0.9297935882265793, lr=0.000524627039602119
2023-11-22 09:53:41   INFO  epoch: 14/30, acc_iter=93918, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:20:22, time_cost(all): 1 day, 1:29:10/1 day, 3:42:56, loss=0.44554429240269, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=1.5980936864902862, lr=0.000524306305420194
2023-11-22 09:54:31   INFO  epoch: 14/30, acc_iter=93968, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:17:03, time_cost(all): 1 day, 1:30:00/1 day, 5:16:55, loss=0.445461178416381, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.013311351645024, lr=0.000523985571238269
2023-11-22 09:55:20   INFO  epoch: 14/30, acc_iter=94018, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:17:16, time_cost(all): 1 day, 1:30:49/1 day, 4:01:22, loss=0.445378064430072, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=4.917029434354616, lr=0.000523664837056344
2023-11-22 09:56:09   INFO  epoch: 14/30, acc_iter=94068, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:19:00, time_cost(all): 1 day, 1:31:38/1 day, 3:23:32, loss=0.445294950443763, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=2.1322900133272507, lr=0.00052334410287442
2023-11-22 09:56:58   INFO  epoch: 14/30, acc_iter=94118, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:16:10, time_cost(all): 1 day, 1:32:27/1 day, 4:59:49, loss=0.445211836457454, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=1.8897585594044215, lr=0.000523023368692495
2023-11-22 09:57:47   INFO  epoch: 14/30, acc_iter=94168, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:19:15, time_cost(all): 1 day, 1:33:16/1 day, 5:40:21, loss=0.445128722471145, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=2.4190182876208186, lr=0.00052270263451057
2023-11-22 09:58:36   INFO  epoch: 14/30, acc_iter=94218, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:18:11, time_cost(all): 1 day, 1:34:05/1 day, 3:42:51, loss=0.445045608484836, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=1.5401054025979803, lr=0.000522381900328646
2023-11-22 09:59:25   INFO  epoch: 14/30, acc_iter=94268, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:13:22, time_cost(all): 1 day, 1:34:54/1 day, 3:35:05, loss=0.444962494498526, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=1.5460573368725918, lr=0.000522061166146721
2023-11-22 10:00:14   INFO  epoch: 14/30, acc_iter=94318, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:11:45, time_cost(all): 1 day, 1:35:43/1 day, 3:11:59, loss=0.444879380512217, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=1.85205054684428, lr=0.000521740431964796
2023-11-22 10:01:03   INFO  epoch: 14/30, acc_iter=94368, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:12:22, time_cost(all): 1 day, 1:36:32/1 day, 3:57:36, loss=0.444796266525908, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.83(1.03), norm=0.920720934323492, lr=0.000521419697782872
2023-11-22 10:01:53   INFO  epoch: 14/30, acc_iter=94418, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:11:26, time_cost(all): 1 day, 1:37:22/1 day, 3:22:54, loss=0.444713152539599, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=2.4024644973365943, lr=0.000521098963600947
2023-11-22 10:02:42   INFO  epoch: 14/30, acc_iter=94468, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:10:06, time_cost(all): 1 day, 1:38:11/1 day, 3:54:09, loss=0.44463003855329, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=1.774292835810154, lr=0.000520778229419022
2023-11-22 10:03:31   INFO  epoch: 14/30, acc_iter=94518, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:08:36, time_cost(all): 1 day, 1:39:00/1 day, 4:33:50, loss=0.444546924566981, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=4.521552173448139, lr=0.000520457495237097
2023-11-22 10:04:20   INFO  epoch: 14/30, acc_iter=94568, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:07:04, time_cost(all): 1 day, 1:39:49/1 day, 5:11:31, loss=0.444463810580672, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=1.842351367044507, lr=0.000520136761055173
2023-11-22 10:05:09   INFO  epoch: 14/30, acc_iter=94618, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:10:55, time_cost(all): 1 day, 1:40:38/1 day, 4:05:12, loss=0.444380696594363, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=3.161169171265154, lr=0.000519816026873248
2023-11-22 10:05:58   INFO  epoch: 14/30, acc_iter=94668, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:08:23, time_cost(all): 1 day, 1:41:27/1 day, 4:20:42, loss=0.444297582608054, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=1.3018484111268207, lr=0.000519495292691323
2023-11-22 10:06:47   INFO  epoch: 14/30, acc_iter=94718, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:08:02, time_cost(all): 1 day, 1:42:16/1 day, 3:44:37, loss=0.444214468621745, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=4.720009849848954, lr=0.000519174558509399
2023-11-22 10:07:36   INFO  epoch: 14/30, acc_iter=94768, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:06:06, time_cost(all): 1 day, 1:43:05/1 day, 5:14:08, loss=0.444131354635436, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.4323552286597927, lr=0.000518853824327474
2023-11-22 10:08:26   INFO  epoch: 14/30, acc_iter=94818, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:04:41, time_cost(all): 1 day, 1:43:55/1 day, 4:03:15, loss=0.444048240649127, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=0.9967444871847966, lr=0.000518533090145549
2023-11-22 10:09:15   INFO  epoch: 14/30, acc_iter=94868, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:55, time_cost(all): 1 day, 1:44:44/1 day, 3:29:09, loss=0.443965126662818, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=0.5296177596478029, lr=0.000518212355963625
2023-11-22 10:10:04   INFO  epoch: 14/30, acc_iter=94918, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:04:24, time_cost(all): 1 day, 1:45:33/1 day, 4:17:21, loss=0.443882012676509, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=1.8341356077970679, lr=0.0005178916217817
2023-11-22 10:10:53   INFO  epoch: 14/30, acc_iter=94968, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:01:29, time_cost(all): 1 day, 1:46:22/1 day, 5:11:49, loss=0.4437988986902, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=4.662118590705472, lr=0.000517570887599775
2023-11-22 10:11:42   INFO  epoch: 14/30, acc_iter=95018, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:04:41, time_cost(all): 1 day, 1:47:11/1 day, 4:12:47, loss=0.443715784703891, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=0.7998374626920737, lr=0.00051725015341785
2023-11-22 10:12:31   INFO  epoch: 14/30, acc_iter=95068, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:02:33, time_cost(all): 1 day, 1:48:00/1 day, 2:56:03, loss=0.443632670717582, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=3.7580558628678897, lr=0.000516929419235926
2023-11-22 10:13:20   INFO  epoch: 14/30, acc_iter=95118, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:59:55, time_cost(all): 1 day, 1:48:49/1 day, 3:18:35, loss=0.443549556731272, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=4.30815055063645, lr=0.000516608685054001
2023-11-22 10:14:09   INFO  epoch: 14/30, acc_iter=95168, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:00:16, time_cost(all): 1 day, 1:49:38/1 day, 4:09:59, loss=0.443466442744963, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.86(1.03), norm=1.3687864118289448, lr=0.000516287950872076
2023-11-22 10:14:58   INFO  epoch: 14/30, acc_iter=95218, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:59:40, time_cost(all): 1 day, 1:50:27/1 day, 2:52:35, loss=0.443383328758654, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=1.7577525525118982, lr=0.000515967216690152
2023-11-22 10:15:48   INFO  epoch: 14/30, acc_iter=95268, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:57:56, time_cost(all): 1 day, 1:51:17/1 day, 4:18:04, loss=0.443300214772345, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=3.7949155642366024, lr=0.000515646482508227
2023-11-22 10:16:37   INFO  epoch: 14/30, acc_iter=95318, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:58, time_cost(all): 1 day, 1:52:06/1 day, 3:47:11, loss=0.443217100786036, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=1.1510119639872107, lr=0.000515325748326302
2023-11-22 10:17:26   INFO  epoch: 14/30, acc_iter=95368, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:54:34, time_cost(all): 1 day, 1:52:55/1 day, 4:08:22, loss=0.443133986799727, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=1.232736458825391, lr=0.000515005014144378
2023-11-22 10:18:15   INFO  epoch: 14/30, acc_iter=95418, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:57:15, time_cost(all): 1 day, 1:53:44/1 day, 3:31:52, loss=0.443050872813418, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=3.2867551524414016, lr=0.000514684279962453
2023-11-22 10:19:04   INFO  epoch: 14/30, acc_iter=95468, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:53:18, time_cost(all): 1 day, 1:54:33/1 day, 4:23:52, loss=0.442967758827109, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=2.6518257289568714, lr=0.000514363545780528
2023-11-22 10:19:53   INFO  epoch: 14/30, acc_iter=95518, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:52:17, time_cost(all): 1 day, 1:55:22/1 day, 4:45:12, loss=0.4428846448408, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=2.3983651969286637, lr=0.000514042811598603
2023-11-22 10:20:42   INFO  epoch: 14/30, acc_iter=95568, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:54:57, time_cost(all): 1 day, 1:56:11/1 day, 2:59:45, loss=0.442801530854491, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=2.459858213944581, lr=0.000513722077416679
2023-11-22 10:21:31   INFO  epoch: 14/30, acc_iter=95618, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:51:05, time_cost(all): 1 day, 1:57:00/1 day, 3:55:24, loss=0.442718416868182, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=0.7988651960585769, lr=0.000513401343234754
2023-11-22 10:22:21   INFO  epoch: 14/30, acc_iter=95668, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:14, time_cost(all): 1 day, 1:57:50/1 day, 5:03:42, loss=0.442635302881873, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=3.8509676566339066, lr=0.000513080609052829
2023-11-22 10:23:10   INFO  epoch: 14/30, acc_iter=95718, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:53:01, time_cost(all): 1 day, 1:58:39/1 day, 3:51:36, loss=0.442552188895564, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=1.8158578917094228, lr=0.000512759874870905
2023-11-22 10:23:59   INFO  epoch: 14/30, acc_iter=95768, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:47:57, time_cost(all): 1 day, 1:59:28/1 day, 5:18:42, loss=0.442469074909255, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=0.9976230256316349, lr=0.00051243914068898
2023-11-22 10:24:48   INFO  epoch: 14/30, acc_iter=95818, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:50:14, time_cost(all): 1 day, 2:00:17/1 day, 3:05:10, loss=0.442385960922946, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=3.0456559714454534, lr=0.000512118406507055
2023-11-22 10:25:37   INFO  epoch: 14/30, acc_iter=95868, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:49:13, time_cost(all): 1 day, 2:01:06/1 day, 3:13:16, loss=0.442302846936636, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=4.058043368395253, lr=0.00051179767232513
2023-11-22 10:26:26   INFO  epoch: 14/30, acc_iter=95918, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:07, time_cost(all): 1 day, 2:01:55/1 day, 4:02:07, loss=0.442219732950327, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=2.4248661414257233, lr=0.000511476938143206
2023-11-22 10:27:15   INFO  epoch: 14/30, acc_iter=95968, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:27, time_cost(all): 1 day, 2:02:44/1 day, 3:36:39, loss=0.442136618964018, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=2.968233654215127, lr=0.000511156203961281
2023-11-22 10:28:04   INFO  epoch: 14/30, acc_iter=96018, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:47:16, time_cost(all): 1 day, 2:03:33/1 day, 5:01:14, loss=0.442053504977709, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=1.5663265668960582, lr=0.000510835469779356
2023-11-22 10:28:53   INFO  epoch: 14/30, acc_iter=96068, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:43:31, time_cost(all): 1 day, 2:04:22/1 day, 3:06:36, loss=0.4419703909914, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=1.4940722815960112, lr=0.000510514735597432
2023-11-22 10:29:43   INFO  epoch: 14/30, acc_iter=96118, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:23, time_cost(all): 1 day, 2:05:12/1 day, 4:10:02, loss=0.441887277005091, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=3.0487468608885675, lr=0.000510194001415507
2023-11-22 10:30:32   INFO  epoch: 14/30, acc_iter=96168, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:25, time_cost(all): 1 day, 2:06:01/1 day, 3:45:20, loss=0.441804163018782, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=2.8160553998977016, lr=0.000509873267233582
2023-11-22 10:31:21   INFO  epoch: 14/30, acc_iter=96218, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:42:48, time_cost(all): 1 day, 2:06:50/1 day, 4:00:50, loss=0.441721049032473, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=0.9165770760248693, lr=0.000509552533051657
2023-11-22 10:32:10   INFO  epoch: 14/30, acc_iter=96268, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:39:33, time_cost(all): 1 day, 2:07:39/1 day, 4:19:13, loss=0.441637935046164, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=1.7337843390773977, lr=0.000509231798869733
2023-11-22 10:32:59   INFO  epoch: 14/30, acc_iter=96318, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:41:55, time_cost(all): 1 day, 2:08:28/1 day, 4:11:45, loss=0.441554821059855, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=0.9324941731092502, lr=0.000508911064687808
2023-11-22 10:33:48   INFO  epoch: 14/30, acc_iter=96368, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:09, time_cost(all): 1 day, 2:09:17/1 day, 3:40:38, loss=0.441471707073546, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=2.527794071310386, lr=0.000508590330505883
2023-11-22 10:34:37   INFO  epoch: 14/30, acc_iter=96418, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:38:52, time_cost(all): 1 day, 2:10:06/1 day, 4:09:29, loss=0.441388593087237, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=4.387000857897142, lr=0.000508269596323959
2023-11-22 10:35:26   INFO  epoch: 14/30, acc_iter=96468, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:58, time_cost(all): 1 day, 2:10:55/1 day, 4:57:47, loss=0.441305479100928, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=3.3565536034311094, lr=0.000507948862142034
2023-11-22 10:36:16   INFO  epoch: 14/30, acc_iter=96518, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:51, time_cost(all): 1 day, 2:11:45/1 day, 4:53:07, loss=0.441222365114619, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=2.882192662771335, lr=0.000507628127960109
2023-11-22 10:37:05   INFO  epoch: 14/30, acc_iter=96568, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:55, time_cost(all): 1 day, 2:12:34/1 day, 3:11:03, loss=0.44113925112831, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=3.386158566697231, lr=0.000507307393778185
2023-11-22 10:37:54   INFO  epoch: 14/30, acc_iter=96618, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:31, time_cost(all): 1 day, 2:13:23/1 day, 2:25:21, loss=0.441056137142001, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=3.383594849048984, lr=0.00050698665959626
2023-11-22 10:38:43   INFO  epoch: 14/30, acc_iter=96668, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:40, time_cost(all): 1 day, 2:14:12/1 day, 4:24:32, loss=0.440973023155691, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=1.0100003194720775, lr=0.000506665925414335
2023-11-22 10:39:32   INFO  epoch: 14/30, acc_iter=96718, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:35:13, time_cost(all): 1 day, 2:15:01/1 day, 2:32:45, loss=0.440889909169382, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=2.5673109633885427, lr=0.00050634519123241
2023-11-22 10:40:21   INFO  epoch: 14/30, acc_iter=96768, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:34:49, time_cost(all): 1 day, 2:15:50/1 day, 3:32:52, loss=0.440806795183073, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=1.8541978083097619, lr=0.000506024457050486
2023-11-22 10:41:10   INFO  epoch: 14/30, acc_iter=96818, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:16, time_cost(all): 1 day, 2:16:39/1 day, 4:30:02, loss=0.440723681196764, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=3.4428224853739007, lr=0.000505703722868561
2023-11-22 10:41:59   INFO  epoch: 14/30, acc_iter=96868, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:25, time_cost(all): 1 day, 2:17:28/1 day, 4:48:44, loss=0.440640567210455, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=1.5735806489464066, lr=0.000505382988686636
2023-11-22 10:42:48   INFO  epoch: 14/30, acc_iter=96918, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:29:21, time_cost(all): 1 day, 2:18:17/1 day, 3:58:03, loss=0.440557453224146, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.23(1.03), norm=1.158386504136423, lr=0.000505062254504712
2023-11-22 10:43:38   INFO  epoch: 14/30, acc_iter=96968, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:31:29, time_cost(all): 1 day, 2:19:07/1 day, 2:51:06, loss=0.440474339237837, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=1.6678370227544648, lr=0.000504741520322787
2023-11-22 10:44:27   INFO  epoch: 14/30, acc_iter=97018, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:30:19, time_cost(all): 1 day, 2:19:56/1 day, 2:44:58, loss=0.440391225251528, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=1.6525091715031541, lr=0.000504420786140862
2023-11-22 10:45:16   INFO  epoch: 14/30, acc_iter=97068, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:58, time_cost(all): 1 day, 2:20:45/1 day, 3:46:40, loss=0.440308111265219, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=1.0951746478501534, lr=0.000504100051958938
2023-11-22 10:46:05   INFO  epoch: 14/30, acc_iter=97118, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:25, time_cost(all): 1 day, 2:21:34/1 day, 3:32:08, loss=0.44022499727891, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=4.50711517730527, lr=0.000503779317777013
2023-11-22 10:46:54   INFO  epoch: 14/30, acc_iter=97168, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:45, time_cost(all): 1 day, 2:22:23/1 day, 4:04:22, loss=0.440141883292601, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=3.8412726508461574, lr=0.000503458583595088
2023-11-22 10:47:43   INFO  epoch: 14/30, acc_iter=97218, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:57, time_cost(all): 1 day, 2:23:12/1 day, 2:22:55, loss=0.440058769306292, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=2.1353323231129204, lr=0.000503137849413163
2023-11-22 10:48:32   INFO  epoch: 14/30, acc_iter=97268, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:46, time_cost(all): 1 day, 2:24:01/1 day, 2:56:14, loss=0.439975655319983, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=4.605038248633144, lr=0.000502817115231239
2023-11-22 10:49:21   INFO  epoch: 14/30, acc_iter=97318, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:40, time_cost(all): 1 day, 2:24:50/1 day, 2:37:57, loss=0.439892541333674, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=0.6587109077247224, lr=0.000502496381049314
2023-11-22 10:50:11   INFO  epoch: 14/30, acc_iter=97368, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:44, time_cost(all): 1 day, 2:25:40/1 day, 4:21:25, loss=0.439809427347365, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=0.6640227649406922, lr=0.000502175646867389
2023-11-22 10:51:00   INFO  epoch: 14/30, acc_iter=97418, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:46, time_cost(all): 1 day, 2:26:29/1 day, 2:22:50, loss=0.439726313361055, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=1.6963558203904894, lr=0.000501854912685465
2023-11-22 10:51:49   INFO  epoch: 14/30, acc_iter=97468, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:50, time_cost(all): 1 day, 2:27:18/1 day, 4:32:45, loss=0.439643199374746, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=2.251711074656124, lr=0.00050153417850354
2023-11-22 10:52:38   INFO  epoch: 14/30, acc_iter=97518, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:30, time_cost(all): 1 day, 2:28:07/1 day, 4:32:26, loss=0.439560085388437, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=1.1850545317363277, lr=0.000501213444321615
2023-11-22 10:53:27   INFO  epoch: 14/30, acc_iter=97568, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:33, time_cost(all): 1 day, 2:28:56/1 day, 4:47:02, loss=0.439476971402128, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=1.5970736138973707, lr=0.000500892710139691
2023-11-22 10:54:16   INFO  epoch: 14/30, acc_iter=97618, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:39, time_cost(all): 1 day, 2:29:45/1 day, 2:58:02, loss=0.439393857415819, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=0.6890486360089709, lr=0.000500571975957766
2023-11-22 10:55:05   INFO  epoch: 14/30, acc_iter=97668, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:17:59, time_cost(all): 1 day, 2:30:34/1 day, 2:34:59, loss=0.43931074342951, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=4.7271640289040215, lr=0.000500251241775841
2023-11-22 10:55:54   INFO  epoch: 14/30, acc_iter=97718, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:08, time_cost(all): 1 day, 2:31:23/1 day, 3:33:27, loss=0.439227629443201, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=0.7277969484426823, lr=0.000499930507593916
2023-11-22 10:56:43   INFO  epoch: 14/30, acc_iter=97768, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:39, time_cost(all): 1 day, 2:32:12/1 day, 4:21:54, loss=0.439144515456892, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=4.668914012844754, lr=0.000499609773411992
2023-11-22 10:57:33   INFO  epoch: 14/30, acc_iter=97818, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:04, time_cost(all): 1 day, 2:33:02/1 day, 2:01:01, loss=0.439061401470583, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.3773987487340573, lr=0.000499289039230067
2023-11-22 10:58:22   INFO  epoch: 14/30, acc_iter=97868, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:19, time_cost(all): 1 day, 2:33:51/1 day, 4:21:06, loss=0.438978287484274, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=3.5715365168850712, lr=0.000498968305048142
2023-11-22 10:59:11   INFO  epoch: 14/30, acc_iter=97918, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:39, time_cost(all): 1 day, 2:34:40/1 day, 3:48:16, loss=0.438895173497965, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=4.431259049847938, lr=0.000498647570866218
2023-11-22 11:00:00   INFO  epoch: 14/30, acc_iter=97968, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:41, time_cost(all): 1 day, 2:35:29/1 day, 3:31:46, loss=0.438812059511656, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=3.4982734494117147, lr=0.000498326836684293
2023-11-22 11:00:49   INFO  epoch: 14/30, acc_iter=98018, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:41, time_cost(all): 1 day, 2:36:18/1 day, 3:00:35, loss=0.438728945525347, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=0.5246912572209812, lr=0.000498006102502368
2023-11-22 11:01:38   INFO  epoch: 14/30, acc_iter=98068, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:03, time_cost(all): 1 day, 2:37:07/1 day, 3:41:44, loss=0.438645831539038, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=3.8740154670688014, lr=0.000497685368320443
2023-11-22 11:02:27   INFO  epoch: 14/30, acc_iter=98118, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:56, time_cost(all): 1 day, 2:37:56/1 day, 3:11:23, loss=0.438562717552729, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=3.503472005677202, lr=0.000497364634138519
2023-11-22 11:03:16   INFO  epoch: 14/30, acc_iter=98168, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:37, time_cost(all): 1 day, 2:38:45/1 day, 2:24:31, loss=0.43847960356642, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=0.5945838347430092, lr=0.000497043899956594
2023-11-22 11:04:06   INFO  epoch: 14/30, acc_iter=98218, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:37, time_cost(all): 1 day, 2:39:35/1 day, 2:03:30, loss=0.438396489580111, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=4.8100702630572, lr=0.000496723165774669
2023-11-22 11:04:55   INFO  epoch: 14/30, acc_iter=98268, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:32, time_cost(all): 1 day, 2:40:24/1 day, 4:23:06, loss=0.438313375593801, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=1.5396347266187382, lr=0.000496402431592745
2023-11-22 11:05:44   INFO  epoch: 14/30, acc_iter=98318, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:21, time_cost(all): 1 day, 2:41:13/1 day, 3:39:28, loss=0.438230261607492, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=0.7745322375519852, lr=0.00049608169741082
2023-11-22 11:06:33   INFO  epoch: 14/30, acc_iter=98368, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:19, time_cost(all): 1 day, 2:42:02/1 day, 3:31:33, loss=0.438147147621183, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=3.985399267914338, lr=0.000495760963228895
2023-11-22 11:07:22   INFO  epoch: 14/30, acc_iter=98418, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:12, time_cost(all): 1 day, 2:42:51/1 day, 3:48:05, loss=0.438064033634874, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=1.5066719806481033, lr=0.000495440229046971
2023-11-22 11:08:11   INFO  epoch: 14/30, acc_iter=98468, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:15, time_cost(all): 1 day, 2:43:40/1 day, 3:14:47, loss=0.437980919648565, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=3.3705687803042466, lr=0.000495119494865046
2023-11-22 11:09:00   INFO  epoch: 14/30, acc_iter=98518, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:35, time_cost(all): 1 day, 2:44:29/1 day, 1:51:33, loss=0.437897805662256, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=2.806352575868699, lr=0.000494798760683121
2023-11-22 11:09:49   INFO  epoch: 14/30, acc_iter=98568, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:51, time_cost(all): 1 day, 2:45:18/1 day, 1:59:02, loss=0.437814691675947, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=1.018742580024729, lr=0.000494478026501196
2023-11-22 11:10:38   INFO  epoch: 14/30, acc_iter=98618, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:12, time_cost(all): 1 day, 2:46:07/1 day, 4:02:14, loss=0.437731577689638, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=2.786342491708646, lr=0.000494157292319272
2023-11-22 11:11:28   INFO  epoch: 14/30, acc_iter=98668, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:08, time_cost(all): 1 day, 2:46:57/1 day, 3:31:37, loss=0.437648463703329, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=4.577374282862027, lr=0.000493836558137347
2023-11-22 11:12:17   INFO  epoch: 14/30, acc_iter=98718, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:26, time_cost(all): 1 day, 2:47:46/1 day, 3:09:16, loss=0.43756534971702, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.7416693542166954, lr=0.000493515823955422
2023-11-22 11:13:06   INFO  epoch: 14/30, acc_iter=98768, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:34, time_cost(all): 1 day, 2:48:35/1 day, 3:16:11, loss=0.437482235730711, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=1.1997705770816345, lr=0.000493195089773498
2023-11-22 11:13:55   INFO  epoch: 15/30, acc_iter=98855, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:51:13, time_cost(all): 1 day, 2:49:24/1 day, 2:56:35, loss=0.437337617394533, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=1.4681022986028198, lr=0.000492637012296949
2023-11-22 11:14:44   INFO  epoch: 15/30, acc_iter=98905, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:47:52, time_cost(all): 1 day, 2:50:13/1 day, 3:14:00, loss=0.437254503408224, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=2.333673784553518, lr=0.000492316278115024
2023-11-22 11:15:33   INFO  epoch: 15/30, acc_iter=98955, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:40:26, time_cost(all): 1 day, 2:51:02/1 day, 3:03:30, loss=0.437171389421915, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=3.0324453235150446, lr=0.000491995543933099
2023-11-22 11:16:22   INFO  epoch: 15/30, acc_iter=99005, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:44:55, time_cost(all): 1 day, 2:51:51/1 day, 2:50:13, loss=0.437088275435606, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=1.2778864103289194, lr=0.000491674809751174
2023-11-22 11:17:11   INFO  epoch: 15/30, acc_iter=99055, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:48:17, time_cost(all): 1 day, 2:52:40/1 day, 2:34:36, loss=0.437005161449297, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.3375247880841075, lr=0.00049135407556925
2023-11-22 11:18:01   INFO  epoch: 15/30, acc_iter=99105, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:42:39, time_cost(all): 1 day, 2:53:30/1 day, 1:55:52, loss=0.436922047462988, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=1.3601345188174156, lr=0.000491033341387325
2023-11-22 11:18:50   INFO  epoch: 15/30, acc_iter=99155, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:45:55, time_cost(all): 1 day, 2:54:19/1 day, 3:10:49, loss=0.436838933476679, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=4.608074454607838, lr=0.0004907126072054
2023-11-22 11:19:39   INFO  epoch: 15/30, acc_iter=99205, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:45:01, time_cost(all): 1 day, 2:55:08/1 day, 4:07:46, loss=0.43675581949037, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=3.6750971673933717, lr=0.000490391873023476
2023-11-22 11:20:28   INFO  epoch: 15/30, acc_iter=99255, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:43:44, time_cost(all): 1 day, 2:55:57/1 day, 3:54:35, loss=0.436672705504061, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=3.921040033287819, lr=0.000490071138841551
2023-11-22 11:21:17   INFO  epoch: 15/30, acc_iter=99305, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:38:04, time_cost(all): 1 day, 2:56:46/1 day, 1:54:21, loss=0.436589591517751, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=0.7440309268824598, lr=0.000489750404659626
2023-11-22 11:22:06   INFO  epoch: 15/30, acc_iter=99355, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:41:04, time_cost(all): 1 day, 2:57:35/1 day, 3:13:48, loss=0.436506477531442, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=1.8937614020517008, lr=0.000489429670477702
2023-11-22 11:22:55   INFO  epoch: 15/30, acc_iter=99405, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:40:58, time_cost(all): 1 day, 2:58:24/1 day, 2:12:52, loss=0.436423363545133, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.91(1.03), norm=0.7422866117675935, lr=0.000489108936295777
2023-11-22 11:23:44   INFO  epoch: 15/30, acc_iter=99455, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:33:41, time_cost(all): 1 day, 2:59:13/1 day, 3:25:18, loss=0.436340249558824, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=2.6807476439269258, lr=0.000488788202113852
2023-11-22 11:24:33   INFO  epoch: 15/30, acc_iter=99505, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:33:30, time_cost(all): 1 day, 3:00:02/1 day, 3:01:13, loss=0.436257135572515, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=3.6535215984709284, lr=0.000488467467931927
2023-11-22 11:25:23   INFO  epoch: 15/30, acc_iter=99555, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:36:04, time_cost(all): 1 day, 3:00:52/1 day, 3:45:08, loss=0.436174021586206, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=4.212752467958154, lr=0.000488146733750003
2023-11-22 11:26:12   INFO  epoch: 15/30, acc_iter=99605, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:38:03, time_cost(all): 1 day, 3:01:41/1 day, 2:16:36, loss=0.436090907599897, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=2.8175942520276207, lr=0.000487825999568078
2023-11-22 11:27:01   INFO  epoch: 15/30, acc_iter=99655, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:33:32, time_cost(all): 1 day, 3:02:30/1 day, 4:07:43, loss=0.436007793613588, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=2.210871266112015, lr=0.000487505265386153
2023-11-22 11:27:50   INFO  epoch: 15/30, acc_iter=99705, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:32:03, time_cost(all): 1 day, 3:03:19/1 day, 3:25:33, loss=0.435924679627279, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=1.0442827537460382, lr=0.000487184531204229
2023-11-22 11:28:39   INFO  epoch: 15/30, acc_iter=99755, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:34:49, time_cost(all): 1 day, 3:04:08/1 day, 3:40:28, loss=0.43584156564097, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=1.4749443943327354, lr=0.000486863797022304
2023-11-22 11:29:28   INFO  epoch: 15/30, acc_iter=99805, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:34:40, time_cost(all): 1 day, 3:04:57/1 day, 2:41:02, loss=0.435758451654661, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=1.7859940628591862, lr=0.000486543062840379
2023-11-22 11:30:17   INFO  epoch: 15/30, acc_iter=99855, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:28:22, time_cost(all): 1 day, 3:05:46/1 day, 4:08:48, loss=0.435675337668352, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=1.810177191602406, lr=0.000486222328658455
2023-11-22 11:31:06   INFO  epoch: 15/30, acc_iter=99905, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:25:50, time_cost(all): 1 day, 3:06:35/1 day, 2:41:27, loss=0.435592223682043, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=1.7081195593514098, lr=0.00048590159447653
2023-11-22 11:31:55   INFO  epoch: 15/30, acc_iter=99955, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:25:19, time_cost(all): 1 day, 3:07:24/1 day, 3:12:14, loss=0.435509109695734, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=3.1613496449463607, lr=0.000485580860294605
2023-11-22 11:32:45   INFO  epoch: 15/30, acc_iter=100005, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:27:09, time_cost(all): 1 day, 3:08:14/1 day, 4:05:41, loss=0.435425995709425, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=4.698508749028793, lr=0.00048526012611268
2023-11-22 11:33:34   INFO  epoch: 15/30, acc_iter=100055, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:27:12, time_cost(all): 1 day, 3:09:03/1 day, 3:26:07, loss=0.435342881723116, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=3.8756037819299736, lr=0.000484939391930756
2023-11-22 11:34:23   INFO  epoch: 15/30, acc_iter=100105, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:28:46, time_cost(all): 1 day, 3:09:52/1 day, 1:44:13, loss=0.435259767736806, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=2.0723408798149614, lr=0.000484618657748831
2023-11-22 11:35:12   INFO  epoch: 15/30, acc_iter=100155, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:26:05, time_cost(all): 1 day, 3:10:41/1 day, 4:01:15, loss=0.435176653750497, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=2.3840251635039627, lr=0.000484297923566906
2023-11-22 11:36:01   INFO  epoch: 15/30, acc_iter=100205, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:25:38, time_cost(all): 1 day, 3:11:30/1 day, 2:53:09, loss=0.435093539764188, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=3.1229851044262738, lr=0.000483977189384982
2023-11-22 11:36:50   INFO  epoch: 15/30, acc_iter=100255, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:22:41, time_cost(all): 1 day, 3:12:19/1 day, 3:23:29, loss=0.435010425777879, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=1.9365423885473925, lr=0.000483656455203057
2023-11-22 11:37:39   INFO  epoch: 15/30, acc_iter=100305, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:23:48, time_cost(all): 1 day, 3:13:08/1 day, 3:14:59, loss=0.43492731179157, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=2.8619345452036775, lr=0.000483335721021132
2023-11-22 11:38:28   INFO  epoch: 15/30, acc_iter=100355, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:23:44, time_cost(all): 1 day, 3:13:57/1 day, 1:41:02, loss=0.434844197805261, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=3.121047816451799, lr=0.000483014986839207
2023-11-22 11:39:18   INFO  epoch: 15/30, acc_iter=100405, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:18:07, time_cost(all): 1 day, 3:14:47/1 day, 2:59:06, loss=0.434761083818952, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=4.756482131002805, lr=0.000482694252657283
2023-11-22 11:40:07   INFO  epoch: 15/30, acc_iter=100455, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:22:36, time_cost(all): 1 day, 3:15:36/1 day, 3:55:16, loss=0.434677969832643, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=3.066228301818319, lr=0.000482373518475358
2023-11-22 11:40:56   INFO  epoch: 15/30, acc_iter=100505, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:20:44, time_cost(all): 1 day, 3:16:25/1 day, 2:53:00, loss=0.434594855846334, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=1.2040462181849445, lr=0.000482052784293433
2023-11-22 11:41:45   INFO  epoch: 15/30, acc_iter=100555, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:22:04, time_cost(all): 1 day, 3:17:14/1 day, 3:08:30, loss=0.434511741860025, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=3.6990276159441247, lr=0.000481732050111509
2023-11-22 11:42:34   INFO  epoch: 15/30, acc_iter=100605, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:21:19, time_cost(all): 1 day, 3:18:03/1 day, 2:06:14, loss=0.434428627873716, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=3.308836113245958, lr=0.000481411315929584
2023-11-22 11:43:23   INFO  epoch: 15/30, acc_iter=100655, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:15:12, time_cost(all): 1 day, 3:18:52/1 day, 3:44:34, loss=0.434345513887407, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=4.624907215746016, lr=0.000481090581747659
2023-11-22 11:44:12   INFO  epoch: 15/30, acc_iter=100705, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:18:45, time_cost(all): 1 day, 3:19:41/1 day, 3:13:00, loss=0.434262399901098, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=0.952560450408071, lr=0.000480769847565735
2023-11-22 11:45:01   INFO  epoch: 15/30, acc_iter=100755, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:19:30, time_cost(all): 1 day, 3:20:30/1 day, 1:37:03, loss=0.434179285914789, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=2.2438383448034678, lr=0.00048044911338381
2023-11-22 11:45:50   INFO  epoch: 15/30, acc_iter=100805, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:14:02, time_cost(all): 1 day, 3:21:19/1 day, 3:28:02, loss=0.43409617192848, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=2.977494913248624, lr=0.000480128379201885
2023-11-22 11:46:40   INFO  epoch: 15/30, acc_iter=100855, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:16:46, time_cost(all): 1 day, 3:22:09/1 day, 2:04:27, loss=0.43401305794217, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=3.620886424847577, lr=0.00047980764501996
2023-11-22 11:47:29   INFO  epoch: 15/30, acc_iter=100905, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:16:21, time_cost(all): 1 day, 3:22:58/1 day, 3:28:39, loss=0.433929943955861, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=4.938643730658272, lr=0.000479486910838036
2023-11-22 11:48:18   INFO  epoch: 15/30, acc_iter=100955, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:12:48, time_cost(all): 1 day, 3:23:47/1 day, 2:05:53, loss=0.433846829969552, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=3.7256747047676226, lr=0.000479166176656111
2023-11-22 11:49:07   INFO  epoch: 15/30, acc_iter=101005, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:13:17, time_cost(all): 1 day, 3:24:36/1 day, 3:44:22, loss=0.433763715983243, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.95(1.03), norm=3.5104976829706676, lr=0.000478845442474186
2023-11-22 11:49:56   INFO  epoch: 15/30, acc_iter=101055, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:08:55, time_cost(all): 1 day, 3:25:25/1 day, 1:23:48, loss=0.433680601996934, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=3.9654989969395578, lr=0.000478524708292262
2023-11-22 11:50:45   INFO  epoch: 15/30, acc_iter=101105, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:11:41, time_cost(all): 1 day, 3:26:14/1 day, 3:27:53, loss=0.433597488010625, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=3.100436090460544, lr=0.000478203974110337
2023-11-22 11:51:34   INFO  epoch: 15/30, acc_iter=101155, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:10:59, time_cost(all): 1 day, 3:27:03/1 day, 1:51:43, loss=0.433514374024316, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=1.1778287278476767, lr=0.000477883239928412
2023-11-22 11:52:23   INFO  epoch: 15/30, acc_iter=101205, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:07:08, time_cost(all): 1 day, 3:27:52/1 day, 2:41:12, loss=0.433431260038007, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=1.1934345451774744, lr=0.000477562505746488
2023-11-22 11:53:13   INFO  epoch: 15/30, acc_iter=101255, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:09:42, time_cost(all): 1 day, 3:28:42/1 day, 1:55:58, loss=0.433348146051698, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=1.4608280598494878, lr=0.000477241771564563
2023-11-22 11:54:02   INFO  epoch: 15/30, acc_iter=101305, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:04:15, time_cost(all): 1 day, 3:29:31/1 day, 1:54:25, loss=0.433265032065389, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=2.660506903991075, lr=0.000476921037382638
2023-11-22 11:54:51   INFO  epoch: 15/30, acc_iter=101355, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:06:06, time_cost(all): 1 day, 3:30:20/1 day, 2:01:07, loss=0.43318191807908, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=3.282530793079789, lr=0.000476600303200713
2023-11-22 11:55:40   INFO  epoch: 15/30, acc_iter=101405, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:04:06, time_cost(all): 1 day, 3:31:09/1 day, 2:37:50, loss=0.433098804092771, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=4.061310906982332, lr=0.000476279569018789
2023-11-22 11:56:29   INFO  epoch: 15/30, acc_iter=101455, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:54, time_cost(all): 1 day, 3:31:58/1 day, 3:27:02, loss=0.433015690106462, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=1.6730047685457259, lr=0.000475958834836864
2023-11-22 11:57:18   INFO  epoch: 15/30, acc_iter=101505, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:03:46, time_cost(all): 1 day, 3:32:47/1 day, 3:22:34, loss=0.432932576120153, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=1.5284132973414706, lr=0.000475638100654939
2023-11-22 11:58:07   INFO  epoch: 15/30, acc_iter=101555, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:00, time_cost(all): 1 day, 3:33:36/1 day, 1:04:39, loss=0.432849462133844, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=4.818307470507748, lr=0.000475317366473015
2023-11-22 11:58:56   INFO  epoch: 15/30, acc_iter=101605, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:02:56, time_cost(all): 1 day, 3:34:25/1 day, 2:22:05, loss=0.432766348147535, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=3.2360552223001684, lr=0.00047499663229109
2023-11-22 11:59:45   INFO  epoch: 15/30, acc_iter=101655, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:01:01, time_cost(all): 1 day, 3:35:14/1 day, 2:28:41, loss=0.432683234161226, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=3.843908226373841, lr=0.000474675898109165
2023-11-22 12:00:35   INFO  epoch: 15/30, acc_iter=101705, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:57:32, time_cost(all): 1 day, 3:36:04/1 day, 3:18:08, loss=0.432600120174916, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=1.3835648853200135, lr=0.00047435516392724
2023-11-22 12:01:24   INFO  epoch: 15/30, acc_iter=101755, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:59:28, time_cost(all): 1 day, 3:36:53/1 day, 2:16:54, loss=0.432517006188607, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=0.6799637416459829, lr=0.000474034429745316
2023-11-22 12:02:13   INFO  epoch: 15/30, acc_iter=101805, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/1:01:07, time_cost(all): 1 day, 3:37:42/1 day, 3:23:16, loss=0.432433892202298, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=4.191482343912731, lr=0.000473713695563391
2023-11-22 12:03:02   INFO  epoch: 15/30, acc_iter=101855, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:56:55, time_cost(all): 1 day, 3:38:31/1 day, 3:21:26, loss=0.432350778215989, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=1.7323237150310016, lr=0.000473392961381466
2023-11-22 12:03:51   INFO  epoch: 15/30, acc_iter=101905, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:55:46, time_cost(all): 1 day, 3:39:20/1 day, 2:06:12, loss=0.43226766422968, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.83(1.03), norm=4.459007662317533, lr=0.000473072227199542
2023-11-22 12:04:40   INFO  epoch: 15/30, acc_iter=101955, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:55:30, time_cost(all): 1 day, 3:40:09/1 day, 1:39:32, loss=0.432184550243371, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=3.15884284955278, lr=0.000472751493017617
2023-11-22 12:05:29   INFO  epoch: 15/30, acc_iter=102005, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:58:09, time_cost(all): 1 day, 3:40:58/1 day, 2:42:20, loss=0.432101436257062, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=0.7928433217223432, lr=0.000472430758835692
2023-11-22 12:06:18   INFO  epoch: 15/30, acc_iter=102055, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:56:55, time_cost(all): 1 day, 3:41:47/1 day, 0:59:46, loss=0.432018322270753, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=1.9075791255538816, lr=0.000472110024653768
2023-11-22 12:07:08   INFO  epoch: 15/30, acc_iter=102105, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:55:14, time_cost(all): 1 day, 3:42:37/1 day, 2:04:05, loss=0.431935208284444, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=3.2109143024000013, lr=0.000471789290471843
2023-11-22 12:07:57   INFO  epoch: 15/30, acc_iter=102155, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:52:37, time_cost(all): 1 day, 3:43:26/1 day, 3:15:28, loss=0.431852094298135, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=3.531348585781906, lr=0.000471468556289918
2023-11-22 12:08:46   INFO  epoch: 15/30, acc_iter=102205, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:54:40, time_cost(all): 1 day, 3:44:15/1 day, 2:01:31, loss=0.431768980311826, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=4.327858644179026, lr=0.000471147822107993
2023-11-22 12:09:35   INFO  epoch: 15/30, acc_iter=102255, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:51:02, time_cost(all): 1 day, 3:45:04/1 day, 2:08:23, loss=0.431685866325517, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=3.8311343312945003, lr=0.000470827087926069
2023-11-22 12:10:24   INFO  epoch: 15/30, acc_iter=102305, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:50:57, time_cost(all): 1 day, 3:45:53/1 day, 1:15:18, loss=0.431602752339208, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=3.007325743690669, lr=0.000470506353744144
2023-11-22 12:11:13   INFO  epoch: 15/30, acc_iter=102355, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:52:02, time_cost(all): 1 day, 3:46:42/1 day, 2:09:40, loss=0.431519638352899, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=1.2243010534227108, lr=0.000470185619562219
2023-11-22 12:12:02   INFO  epoch: 15/30, acc_iter=102405, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:50:34, time_cost(all): 1 day, 3:47:31/1 day, 3:12:36, loss=0.43143652436659, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=4.9594664298151745, lr=0.000469864885380295
2023-11-22 12:12:51   INFO  epoch: 15/30, acc_iter=102455, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:46:35, time_cost(all): 1 day, 3:48:20/1 day, 1:11:23, loss=0.431353410380281, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=0.570867618026715, lr=0.00046954415119837
2023-11-22 12:13:40   INFO  epoch: 15/30, acc_iter=102505, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:46:37, time_cost(all): 1 day, 3:49:09/1 day, 2:11:54, loss=0.431270296393971, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=3.1467161408173387, lr=0.000469223417016445
2023-11-22 12:14:30   INFO  epoch: 15/30, acc_iter=102555, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:46:07, time_cost(all): 1 day, 3:49:59/1 day, 1:54:35, loss=0.431187182407662, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=4.303201871680537, lr=0.00046890268283452
2023-11-22 12:15:19   INFO  epoch: 15/30, acc_iter=102605, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:47:19, time_cost(all): 1 day, 3:50:48/1 day, 2:19:51, loss=0.431104068421353, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=1.044731743053533, lr=0.000468581948652596
2023-11-22 12:16:08   INFO  epoch: 15/30, acc_iter=102655, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:46:53, time_cost(all): 1 day, 3:51:37/1 day, 3:16:41, loss=0.431020954435044, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.1514605175163815, lr=0.000468261214470671
2023-11-22 12:16:57   INFO  epoch: 15/30, acc_iter=102705, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:44:52, time_cost(all): 1 day, 3:52:26/1 day, 1:29:13, loss=0.430937840448735, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=2.1180205248509214, lr=0.000467940480288746
2023-11-22 12:17:46   INFO  epoch: 15/30, acc_iter=102755, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:42:52, time_cost(all): 1 day, 3:53:15/1 day, 2:56:37, loss=0.430854726462426, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=3.6505092411481743, lr=0.000467619746106822
2023-11-22 12:18:35   INFO  epoch: 15/30, acc_iter=102805, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:44:26, time_cost(all): 1 day, 3:54:04/1 day, 1:15:23, loss=0.430771612476117, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=3.234904285079741, lr=0.000467299011924897
2023-11-22 12:19:24   INFO  epoch: 15/30, acc_iter=102855, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:39:48, time_cost(all): 1 day, 3:54:53/1 day, 2:54:11, loss=0.430688498489808, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=1.2245874957299814, lr=0.000466978277742972
2023-11-22 12:20:13   INFO  epoch: 15/30, acc_iter=102905, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:42:23, time_cost(all): 1 day, 3:55:42/1 day, 2:30:20, loss=0.430605384503499, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=4.350491445865869, lr=0.000466657543561048
2023-11-22 12:21:03   INFO  epoch: 15/30, acc_iter=102955, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:50, time_cost(all): 1 day, 3:56:32/1 day, 0:59:14, loss=0.43052227051719, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=3.873261158041004, lr=0.000466336809379123
2023-11-22 12:21:52   INFO  epoch: 15/30, acc_iter=103005, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:39:07, time_cost(all): 1 day, 3:57:21/1 day, 1:28:30, loss=0.430439156530881, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=1.9719956219539438, lr=0.000466016075197198
2023-11-22 12:22:41   INFO  epoch: 15/30, acc_iter=103055, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:06, time_cost(all): 1 day, 3:58:10/1 day, 1:12:37, loss=0.430356042544572, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=3.260813902037507, lr=0.000465695341015273
2023-11-22 12:23:30   INFO  epoch: 15/30, acc_iter=103105, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:36:20, time_cost(all): 1 day, 3:58:59/1 day, 0:54:43, loss=0.430272928558263, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=4.654857674700699, lr=0.000465374606833349
2023-11-22 12:24:19   INFO  epoch: 15/30, acc_iter=103155, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:17, time_cost(all): 1 day, 3:59:48/1 day, 0:45:44, loss=0.430189814571954, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=3.8647575470063, lr=0.000465053872651424
2023-11-22 12:25:08   INFO  epoch: 15/30, acc_iter=103205, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:11, time_cost(all): 1 day, 4:00:37/1 day, 2:10:23, loss=0.430106700585645, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=1.8134443568857093, lr=0.000464733138469499
2023-11-22 12:25:57   INFO  epoch: 15/30, acc_iter=103255, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:36:21, time_cost(all): 1 day, 4:01:26/1 day, 0:40:47, loss=0.430023586599335, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=2.934016264998855, lr=0.000464412404287575
2023-11-22 12:26:46   INFO  epoch: 15/30, acc_iter=103305, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:32:45, time_cost(all): 1 day, 4:02:15/1 day, 2:39:04, loss=0.429940472613026, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=3.4050165443944804, lr=0.00046409167010565
2023-11-22 12:27:35   INFO  epoch: 15/30, acc_iter=103355, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:31:45, time_cost(all): 1 day, 4:03:04/1 day, 1:38:44, loss=0.429857358626717, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=1.702515373091435, lr=0.000463770935923725
2023-11-22 12:28:25   INFO  epoch: 15/30, acc_iter=103405, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:26, time_cost(all): 1 day, 4:03:54/1 day, 2:47:30, loss=0.429774244640408, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=4.73073510501056, lr=0.0004634502017418
2023-11-22 12:29:14   INFO  epoch: 15/30, acc_iter=103455, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:33, time_cost(all): 1 day, 4:04:43/1 day, 2:08:34, loss=0.429691130654099, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=2.1992120947131655, lr=0.000463129467559876
2023-11-22 12:30:03   INFO  epoch: 15/30, acc_iter=103505, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:32:03, time_cost(all): 1 day, 4:05:32/1 day, 0:38:29, loss=0.42960801666779, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=2.815144954451172, lr=0.000462808733377951
2023-11-22 12:30:52   INFO  epoch: 15/30, acc_iter=103555, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:28:54, time_cost(all): 1 day, 4:06:21/1 day, 2:18:41, loss=0.429524902681481, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=2.7551313802076827, lr=0.000462487999196026
2023-11-22 12:31:41   INFO  epoch: 15/30, acc_iter=103605, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:38, time_cost(all): 1 day, 4:07:10/1 day, 2:49:46, loss=0.429441788695172, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=2.1866507944040334, lr=0.000462167265014102
2023-11-22 12:32:30   INFO  epoch: 15/30, acc_iter=103655, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:27:44, time_cost(all): 1 day, 4:07:59/1 day, 0:31:18, loss=0.429358674708863, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=1.0733574235360905, lr=0.000461846530832177
2023-11-22 12:33:19   INFO  epoch: 15/30, acc_iter=103705, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:39, time_cost(all): 1 day, 4:08:48/1 day, 1:32:36, loss=0.429275560722554, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=4.526429202992423, lr=0.000461525796650252
2023-11-22 12:34:08   INFO  epoch: 15/30, acc_iter=103755, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:43, time_cost(all): 1 day, 4:09:37/1 day, 0:36:56, loss=0.429192446736245, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=2.0045998764264237, lr=0.000461205062468328
2023-11-22 12:34:58   INFO  epoch: 15/30, acc_iter=103805, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:24:41, time_cost(all): 1 day, 4:10:27/1 day, 0:44:40, loss=0.429109332749936, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=0.9159655511940167, lr=0.000460884328286403
2023-11-22 12:35:47   INFO  epoch: 15/30, acc_iter=103855, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:22, time_cost(all): 1 day, 4:11:16/1 day, 1:20:33, loss=0.429026218763627, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=4.2178546683215945, lr=0.000460563594104478
2023-11-22 12:36:36   INFO  epoch: 15/30, acc_iter=103905, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:30, time_cost(all): 1 day, 4:12:05/1 day, 0:45:29, loss=0.428943104777318, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=1.9363182810435675, lr=0.000460242859922553
2023-11-22 12:37:25   INFO  epoch: 15/30, acc_iter=103955, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:43, time_cost(all): 1 day, 4:12:54/1 day, 2:46:14, loss=0.428859990791009, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=0.9611542759623217, lr=0.000459922125740629
2023-11-22 12:38:14   INFO  epoch: 15/30, acc_iter=104005, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:39, time_cost(all): 1 day, 4:13:43/1 day, 0:33:01, loss=0.4287768768047, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=3.3951990311219165, lr=0.000459601391558704
2023-11-22 12:39:03   INFO  epoch: 15/30, acc_iter=104055, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:11, time_cost(all): 1 day, 4:14:32/1 day, 0:43:51, loss=0.428693762818391, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=2.586974238535027, lr=0.000459280657376779
2023-11-22 12:39:52   INFO  epoch: 15/30, acc_iter=104105, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:30, time_cost(all): 1 day, 4:15:21/1 day, 1:31:27, loss=0.428610648832081, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=2.814912743019024, lr=0.000458959923194855
2023-11-22 12:40:41   INFO  epoch: 15/30, acc_iter=104155, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:22, time_cost(all): 1 day, 4:16:10/1 day, 2:02:05, loss=0.428527534845772, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=3.9292550013243464, lr=0.00045863918901293
2023-11-22 12:41:30   INFO  epoch: 15/30, acc_iter=104205, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:15, time_cost(all): 1 day, 4:16:59/1 day, 0:56:49, loss=0.428444420859463, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=0.693005912316939, lr=0.000458318454831005
2023-11-22 12:42:20   INFO  epoch: 15/30, acc_iter=104255, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:47, time_cost(all): 1 day, 4:17:49/1 day, 0:45:35, loss=0.428361306873154, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=3.531621884054484, lr=0.000457997720649081
2023-11-22 12:43:09   INFO  epoch: 15/30, acc_iter=104305, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:28, time_cost(all): 1 day, 4:18:38/1 day, 1:52:27, loss=0.428278192886845, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=1.5732814693093766, lr=0.000457676986467156
2023-11-22 12:43:58   INFO  epoch: 15/30, acc_iter=104355, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:25, time_cost(all): 1 day, 4:19:27/1 day, 1:55:19, loss=0.428195078900536, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=4.220258689922729, lr=0.000457356252285231
2023-11-22 12:44:47   INFO  epoch: 15/30, acc_iter=104405, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:14, time_cost(all): 1 day, 4:20:16/1 day, 0:42:36, loss=0.428111964914227, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=3.6967386546874446, lr=0.000457035518103306
2023-11-22 12:45:36   INFO  epoch: 15/30, acc_iter=104455, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:01, time_cost(all): 1 day, 4:21:05/1 day, 1:22:54, loss=0.428028850927918, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=2.2829697650756064, lr=0.000456714783921382
2023-11-22 12:46:25   INFO  epoch: 15/30, acc_iter=104505, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:15:02, time_cost(all): 1 day, 4:21:54/1 day, 0:37:49, loss=0.427945736941609, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=0.6837649546015714, lr=0.000456394049739457
2023-11-22 12:47:14   INFO  epoch: 15/30, acc_iter=104555, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:43, time_cost(all): 1 day, 4:22:43/1 day, 2:34:43, loss=0.4278626229553, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=3.810418978390301, lr=0.000456073315557532
2023-11-22 12:48:03   INFO  epoch: 15/30, acc_iter=104605, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:24, time_cost(all): 1 day, 4:23:32/1 day, 2:06:28, loss=0.427779508968991, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=0.7594617599901323, lr=0.000455752581375608
2023-11-22 12:48:53   INFO  epoch: 15/30, acc_iter=104655, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:29, time_cost(all): 1 day, 4:24:22/1 day, 1:16:52, loss=0.427696394982682, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=1.883979640509676, lr=0.000455431847193683
2023-11-22 12:49:42   INFO  epoch: 15/30, acc_iter=104705, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:35, time_cost(all): 1 day, 4:25:11/1 day, 0:51:04, loss=0.427613280996373, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=1.4633596809265397, lr=0.000455111113011758
2023-11-22 12:50:31   INFO  epoch: 15/30, acc_iter=104755, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:36, time_cost(all): 1 day, 4:26:00/1 day, 0:58:28, loss=0.427530167010064, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=4.362351654518086, lr=0.000454790378829834
2023-11-22 12:51:20   INFO  epoch: 15/30, acc_iter=104805, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:51, time_cost(all): 1 day, 4:26:49/1 day, 0:20:10, loss=0.427447053023755, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=1.3862311784542702, lr=0.000454469644647909
2023-11-22 12:52:09   INFO  epoch: 15/30, acc_iter=104855, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:45, time_cost(all): 1 day, 4:27:38/1 day, 1:45:08, loss=0.427363939037446, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=1.868196384816505, lr=0.000454148910465984
2023-11-22 12:52:58   INFO  epoch: 15/30, acc_iter=104905, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:16, time_cost(all): 1 day, 4:28:27/1 day, 0:58:32, loss=0.427280825051136, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=0.5837853029795492, lr=0.000453828176284059
2023-11-22 12:53:47   INFO  epoch: 15/30, acc_iter=104955, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:28, time_cost(all): 1 day, 4:29:16/1 day, 2:19:23, loss=0.427197711064827, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=4.717494273591019, lr=0.000453507442102135
2023-11-22 12:54:36   INFO  epoch: 15/30, acc_iter=105005, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:07, time_cost(all): 1 day, 4:30:05/1 day, 2:13:59, loss=0.427114597078518, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=0.6426441365779724, lr=0.00045318670792021
2023-11-22 12:55:25   INFO  epoch: 15/30, acc_iter=105055, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:45, time_cost(all): 1 day, 4:30:54/1 day, 0:08:16, loss=0.427031483092209, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=1.4796840093647454, lr=0.000452865973738285
2023-11-22 12:56:15   INFO  epoch: 15/30, acc_iter=105105, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:35, time_cost(all): 1 day, 4:31:44/1 day, 1:00:43, loss=0.4269483691059, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=3.8460071269897393, lr=0.00045254523955636
2023-11-22 12:57:04   INFO  epoch: 15/30, acc_iter=105155, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:59, time_cost(all): 1 day, 4:32:33/1 day, 0:18:09, loss=0.426865255119591, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.5338824358116774, lr=0.000452224505374436
2023-11-22 12:57:53   INFO  epoch: 15/30, acc_iter=105205, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:05, time_cost(all): 1 day, 4:33:22/1 day, 2:18:33, loss=0.426782141133282, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.05(1.03), norm=1.7530851402509415, lr=0.000451903771192511
2023-11-22 12:58:42   INFO  epoch: 15/30, acc_iter=105255, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:11, time_cost(all): 1 day, 4:34:11/1 day, 0:06:01, loss=0.426699027146973, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=1.1903166230076143, lr=0.000451583037010587
2023-11-22 12:59:31   INFO  epoch: 15/30, acc_iter=105305, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:26, time_cost(all): 1 day, 4:35:00/1 day, 2:32:28, loss=0.426615913160664, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=1.0049538025547473, lr=0.000451262302828662
2023-11-22 13:00:20   INFO  epoch: 15/30, acc_iter=105355, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 1 day, 4:35:49/1 day, 1:10:47, loss=0.426532799174355, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=3.5357254110203753, lr=0.000450941568646737
2023-11-22 13:01:09   INFO  epoch: 16/30, acc_iter=105442, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:46:02, time_cost(all): 1 day, 4:36:38/1 day, 1:34:00, loss=0.426388180838177, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=3.7857773667136954, lr=0.000450383491170188
2023-11-22 13:01:58   INFO  epoch: 16/30, acc_iter=105492, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:43:56, time_cost(all): 1 day, 4:37:27/1 day, 1:20:12, loss=0.426305066851868, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.936096388109668, lr=0.000450062756988263
2023-11-22 13:02:48   INFO  epoch: 16/30, acc_iter=105542, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:44:32, time_cost(all): 1 day, 4:38:17/1 day, 2:27:20, loss=0.426221952865559, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=0.8599507776067905, lr=0.000449742022806339
2023-11-22 13:03:37   INFO  epoch: 16/30, acc_iter=105592, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:39:45, time_cost(all): 1 day, 4:39:06/1 day, 0:28:48, loss=0.42613883887925, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=4.588943446838706, lr=0.000449421288624414
2023-11-22 13:04:26   INFO  epoch: 16/30, acc_iter=105642, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:42:06, time_cost(all): 1 day, 4:39:55/1 day, 1:39:54, loss=0.426055724892941, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=0.9840342113916754, lr=0.000449100554442489
2023-11-22 13:05:15   INFO  epoch: 16/30, acc_iter=105692, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:38:32, time_cost(all): 1 day, 4:40:44/1 day, 0:31:19, loss=0.425972610906632, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=0.6676112482623154, lr=0.000448779820260565
2023-11-22 13:06:04   INFO  epoch: 16/30, acc_iter=105742, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:42:20, time_cost(all): 1 day, 4:41:33/1 day, 1:00:08, loss=0.425889496920323, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=0.9663011553378267, lr=0.00044845908607864
2023-11-22 13:06:53   INFO  epoch: 16/30, acc_iter=105792, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:38:24, time_cost(all): 1 day, 4:42:22/1 day, 0:05:34, loss=0.425806382934014, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=4.3811447937004875, lr=0.000448138351896715
2023-11-22 13:07:42   INFO  epoch: 16/30, acc_iter=105842, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:37:59, time_cost(all): 1 day, 4:43:11/1 day, 1:46:56, loss=0.425723268947705, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=2.3442547671276097, lr=0.00044781761771479
2023-11-22 13:08:31   INFO  epoch: 16/30, acc_iter=105892, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:37:26, time_cost(all): 1 day, 4:44:00/1 day, 0:04:55, loss=0.425640154961396, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=2.2855397611965347, lr=0.000447496883532866
2023-11-22 13:09:20   INFO  epoch: 16/30, acc_iter=105942, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:42:59, time_cost(all): 1 day, 4:44:49/1 day, 1:31:49, loss=0.425557040975086, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=2.311859903426682, lr=0.000447176149350941
2023-11-22 13:10:10   INFO  epoch: 16/30, acc_iter=105992, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:37:28, time_cost(all): 1 day, 4:45:39/1 day, 2:04:34, loss=0.425473926988777, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=0.5976445159618516, lr=0.000446855415169016
2023-11-22 13:10:59   INFO  epoch: 16/30, acc_iter=106042, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:36:53, time_cost(all): 1 day, 4:46:28/1 day, 2:07:21, loss=0.425390813002468, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=3.191845675083067, lr=0.000446534680987092
2023-11-22 13:11:48   INFO  epoch: 16/30, acc_iter=106092, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:38:50, time_cost(all): 1 day, 4:47:17/23:54:56, loss=0.425307699016159, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=0.6092674347960522, lr=0.000446213946805167
2023-11-22 13:12:37   INFO  epoch: 16/30, acc_iter=106142, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:34:27, time_cost(all): 1 day, 4:48:06/1 day, 0:04:08, loss=0.42522458502985, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=3.915074309724363, lr=0.000445893212623242
2023-11-22 13:13:26   INFO  epoch: 16/30, acc_iter=106192, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:30:13, time_cost(all): 1 day, 4:48:55/1 day, 1:50:48, loss=0.425141471043541, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=4.446394512710874, lr=0.000445572478441317
2023-11-22 13:14:15   INFO  epoch: 16/30, acc_iter=106242, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:36:39, time_cost(all): 1 day, 4:49:44/1 day, 0:44:54, loss=0.425058357057232, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=3.0077382382743116, lr=0.000445251744259393
2023-11-22 13:15:04   INFO  epoch: 16/30, acc_iter=106292, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:33:05, time_cost(all): 1 day, 4:50:33/1 day, 0:55:38, loss=0.424975243070923, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=3.123861864621689, lr=0.000444931010077468
2023-11-22 13:15:53   INFO  epoch: 16/30, acc_iter=106342, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:30:07, time_cost(all): 1 day, 4:51:22/1 day, 0:05:58, loss=0.424892129084614, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=0.7573995723848563, lr=0.000444610275895543
2023-11-22 13:16:42   INFO  epoch: 16/30, acc_iter=106392, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:27:12, time_cost(all): 1 day, 4:52:11/1 day, 0:16:26, loss=0.424809015098305, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=3.158237163444349, lr=0.000444289541713619
2023-11-22 13:17:32   INFO  epoch: 16/30, acc_iter=106442, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:30:38, time_cost(all): 1 day, 4:53:01/23:52:50, loss=0.424725901111996, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=4.462078493449093, lr=0.000443968807531694
2023-11-22 13:18:21   INFO  epoch: 16/30, acc_iter=106492, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:32:07, time_cost(all): 1 day, 4:53:50/1 day, 1:33:37, loss=0.424642787125687, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=4.82899866028564, lr=0.000443648073349769
2023-11-22 13:19:10   INFO  epoch: 16/30, acc_iter=106542, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:31:40, time_cost(all): 1 day, 4:54:39/1 day, 1:43:08, loss=0.424559673139378, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=3.449543597697308, lr=0.000443327339167845
2023-11-22 13:19:59   INFO  epoch: 16/30, acc_iter=106592, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:29:45, time_cost(all): 1 day, 4:55:28/1 day, 1:27:40, loss=0.424476559153069, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=3.8270376824773047, lr=0.00044300660498592
2023-11-22 13:20:48   INFO  epoch: 16/30, acc_iter=106642, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:24:17, time_cost(all): 1 day, 4:56:17/1 day, 1:46:52, loss=0.42439344516676, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=1.0418223953958536, lr=0.000442685870803995
2023-11-22 13:21:37   INFO  epoch: 16/30, acc_iter=106692, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:29:21, time_cost(all): 1 day, 4:57:06/1 day, 0:38:56, loss=0.424310331180451, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=3.5975218318471844, lr=0.00044236513662207
2023-11-22 13:22:26   INFO  epoch: 16/30, acc_iter=106742, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:29:54, time_cost(all): 1 day, 4:57:55/1 day, 1:25:38, loss=0.424227217194141, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=0.7265426503617404, lr=0.000442044402440146
2023-11-22 13:23:15   INFO  epoch: 16/30, acc_iter=106792, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:27:55, time_cost(all): 1 day, 4:58:44/1 day, 1:23:59, loss=0.424144103207832, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=2.694045909184607, lr=0.000441723668258221
2023-11-22 13:24:05   INFO  epoch: 16/30, acc_iter=106842, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:26:08, time_cost(all): 1 day, 4:59:34/1 day, 0:39:38, loss=0.424060989221523, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=1.5606692892619118, lr=0.000441402934076296
2023-11-22 13:24:54   INFO  epoch: 16/30, acc_iter=106892, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:25:42, time_cost(all): 1 day, 5:00:23/1 day, 0:59:44, loss=0.423977875235214, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=3.5438345543060343, lr=0.000441082199894372
2023-11-22 13:25:43   INFO  epoch: 16/30, acc_iter=106942, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:21:31, time_cost(all): 1 day, 5:01:12/23:42:40, loss=0.423894761248905, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=0.5126193258416212, lr=0.000440761465712447
2023-11-22 13:26:32   INFO  epoch: 16/30, acc_iter=106992, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:20:54, time_cost(all): 1 day, 5:02:01/1 day, 1:03:04, loss=0.423811647262596, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=4.031684146352225, lr=0.000440440731530522
2023-11-22 13:27:21   INFO  epoch: 16/30, acc_iter=107042, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:23:15, time_cost(all): 1 day, 5:02:50/1 day, 1:34:16, loss=0.423728533276287, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=2.834307032279112, lr=0.000440119997348597
2023-11-22 13:28:10   INFO  epoch: 16/30, acc_iter=107092, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:16:56, time_cost(all): 1 day, 5:03:39/1 day, 1:54:26, loss=0.423645419289978, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=2.317318456841246, lr=0.000439799263166673
2023-11-22 13:28:59   INFO  epoch: 16/30, acc_iter=107142, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:15:55, time_cost(all): 1 day, 5:04:28/1 day, 1:42:37, loss=0.423562305303669, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=2.3143665549278434, lr=0.000439478528984748
2023-11-22 13:29:48   INFO  epoch: 16/30, acc_iter=107192, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:21:17, time_cost(all): 1 day, 5:05:17/1 day, 0:55:41, loss=0.42347919131736, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=1.6137536626419176, lr=0.000439157794802823
2023-11-22 13:30:37   INFO  epoch: 16/30, acc_iter=107242, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:15:28, time_cost(all): 1 day, 5:06:06/1 day, 1:26:53, loss=0.423396077331051, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=1.648993961686641, lr=0.000438837060620899
2023-11-22 13:31:27   INFO  epoch: 16/30, acc_iter=107292, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:15:01, time_cost(all): 1 day, 5:06:56/1 day, 1:24:04, loss=0.423312963344742, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=4.43099830908793, lr=0.000438516326438974
2023-11-22 13:32:16   INFO  epoch: 16/30, acc_iter=107342, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:18:50, time_cost(all): 1 day, 5:07:45/1 day, 0:50:40, loss=0.423229849358433, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=1.2567329276492556, lr=0.000438195592257049
2023-11-22 13:33:05   INFO  epoch: 16/30, acc_iter=107392, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:17:55, time_cost(all): 1 day, 5:08:34/1 day, 0:00:59, loss=0.423146735372124, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=3.4795118622865715, lr=0.000437874858075125
2023-11-22 13:33:54   INFO  epoch: 16/30, acc_iter=107442, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:12:14, time_cost(all): 1 day, 5:09:23/23:35:00, loss=0.423063621385815, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=4.437949750533372, lr=0.0004375541238932
2023-11-22 13:34:43   INFO  epoch: 16/30, acc_iter=107492, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:15:00, time_cost(all): 1 day, 5:10:12/1 day, 1:06:47, loss=0.422980507399506, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=2.612109275689981, lr=0.000437233389711275
2023-11-22 13:35:32   INFO  epoch: 16/30, acc_iter=107542, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:12:13, time_cost(all): 1 day, 5:11:01/1 day, 0:42:48, loss=0.422897393413196, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.1052975648517356, lr=0.00043691265552935
2023-11-22 13:36:21   INFO  epoch: 16/30, acc_iter=107592, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:13:56, time_cost(all): 1 day, 5:11:50/1 day, 0:49:44, loss=0.422814279426887, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=4.931566744448134, lr=0.000436591921347426
2023-11-22 13:37:10   INFO  epoch: 16/30, acc_iter=107642, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:10:51, time_cost(all): 1 day, 5:12:39/23:54:58, loss=0.422731165440578, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.3766379730317604, lr=0.000436271187165501
2023-11-22 13:38:00   INFO  epoch: 16/30, acc_iter=107692, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:12:36, time_cost(all): 1 day, 5:13:29/1 day, 1:51:58, loss=0.422648051454269, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=3.695335294363597, lr=0.000435950452983576
2023-11-22 13:38:49   INFO  epoch: 16/30, acc_iter=107742, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:06:04, time_cost(all): 1 day, 5:14:18/1 day, 0:14:03, loss=0.42256493746796, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=4.235380876110751, lr=0.000435629718801652
2023-11-22 13:39:38   INFO  epoch: 16/30, acc_iter=107792, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:11:52, time_cost(all): 1 day, 5:15:07/1 day, 0:09:42, loss=0.422481823481651, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=3.0011005380961997, lr=0.000435308984619727
2023-11-22 13:40:27   INFO  epoch: 16/30, acc_iter=107842, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:08:55, time_cost(all): 1 day, 5:15:56/23:49:44, loss=0.422398709495342, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=1.2696305840132398, lr=0.000434988250437802
2023-11-22 13:41:16   INFO  epoch: 16/30, acc_iter=107892, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:09:47, time_cost(all): 1 day, 5:16:45/23:56:03, loss=0.422315595509033, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=1.576242903478518, lr=0.000434667516255877
2023-11-22 13:42:05   INFO  epoch: 16/30, acc_iter=107942, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:06:19, time_cost(all): 1 day, 5:17:34/1 day, 1:06:30, loss=0.422232481522724, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=2.2900989603182524, lr=0.000434346782073953
2023-11-22 13:42:54   INFO  epoch: 16/30, acc_iter=107992, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:05:18, time_cost(all): 1 day, 5:18:23/1 day, 1:39:27, loss=0.422149367536415, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=0.5988555220220502, lr=0.000434026047892028
2023-11-22 13:43:43   INFO  epoch: 16/30, acc_iter=108042, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:04:34, time_cost(all): 1 day, 5:19:12/1 day, 1:31:32, loss=0.422066253550106, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=1.039809375780938, lr=0.000433705313710103
2023-11-22 13:44:32   INFO  epoch: 16/30, acc_iter=108092, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:00:35, time_cost(all): 1 day, 5:20:01/1 day, 0:32:07, loss=0.421983139563797, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=3.2641099965804137, lr=0.000433384579528179
2023-11-22 13:45:22   INFO  epoch: 16/30, acc_iter=108142, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:00:51, time_cost(all): 1 day, 5:20:51/1 day, 0:53:30, loss=0.421900025577488, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=0.9623728663911629, lr=0.000433063845346254
2023-11-22 13:46:11   INFO  epoch: 16/30, acc_iter=108192, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:00:39, time_cost(all): 1 day, 5:21:40/1 day, 0:08:58, loss=0.421816911591179, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=4.8071020282770505, lr=0.000432743111164329
2023-11-22 13:47:00   INFO  epoch: 16/30, acc_iter=108242, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:00:34, time_cost(all): 1 day, 5:22:29/23:39:01, loss=0.42173379760487, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=2.2972987250374937, lr=0.000432422376982405
2023-11-22 13:47:49   INFO  epoch: 16/30, acc_iter=108292, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:01:53, time_cost(all): 1 day, 5:23:18/1 day, 0:27:02, loss=0.421650683618561, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=1.8464546916096032, lr=0.00043210164280048
2023-11-22 13:48:38   INFO  epoch: 16/30, acc_iter=108342, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:00:30, time_cost(all): 1 day, 5:24:07/1 day, 0:27:44, loss=0.421567569632251, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=1.6545640851060823, lr=0.000431780908618555
2023-11-22 13:49:27   INFO  epoch: 16/30, acc_iter=108392, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:59:20, time_cost(all): 1 day, 5:24:56/23:41:49, loss=0.421484455645942, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=0.569859377838716, lr=0.00043146017443663
2023-11-22 13:50:16   INFO  epoch: 16/30, acc_iter=108442, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:56:32, time_cost(all): 1 day, 5:25:45/1 day, 0:05:58, loss=0.421401341659633, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=1.543501213677457, lr=0.000431139440254706
2023-11-22 13:51:05   INFO  epoch: 16/30, acc_iter=108492, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:56:37, time_cost(all): 1 day, 5:26:34/1 day, 0:23:33, loss=0.421318227673324, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=4.324470523487028, lr=0.000430818706072781
2023-11-22 13:51:55   INFO  epoch: 16/30, acc_iter=108542, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:34, time_cost(all): 1 day, 5:27:24/1 day, 0:22:56, loss=0.421235113687015, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=0.5244261203850019, lr=0.000430497971890856
2023-11-22 13:52:44   INFO  epoch: 16/30, acc_iter=108592, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:58:06, time_cost(all): 1 day, 5:28:13/1 day, 0:46:43, loss=0.421151999700706, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=4.877958068242881, lr=0.000430177237708932
2023-11-22 13:53:33   INFO  epoch: 16/30, acc_iter=108642, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:55:58, time_cost(all): 1 day, 5:29:02/23:41:31, loss=0.421068885714397, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=3.560265064906786, lr=0.000429856503527007
2023-11-22 13:54:22   INFO  epoch: 16/30, acc_iter=108692, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:54:31, time_cost(all): 1 day, 5:29:51/23:33:49, loss=0.420985771728088, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=2.247890023768445, lr=0.000429535769345082
2023-11-22 13:55:11   INFO  epoch: 16/30, acc_iter=108742, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:51:04, time_cost(all): 1 day, 5:30:40/23:15:49, loss=0.420902657741779, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=0.9655744286537127, lr=0.000429215035163157
2023-11-22 13:56:00   INFO  epoch: 16/30, acc_iter=108792, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:50:18, time_cost(all): 1 day, 5:31:29/1 day, 0:34:52, loss=0.42081954375547, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.957717686433387, lr=0.000428894300981233
2023-11-22 13:56:49   INFO  epoch: 16/30, acc_iter=108842, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:51:02, time_cost(all): 1 day, 5:32:18/1 day, 1:06:08, loss=0.420736429769161, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=0.7825205008229195, lr=0.000428573566799308
2023-11-22 13:57:38   INFO  epoch: 16/30, acc_iter=108892, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:51:37, time_cost(all): 1 day, 5:33:07/23:35:57, loss=0.420653315782852, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=4.468319562750044, lr=0.000428252832617383
2023-11-22 13:58:27   INFO  epoch: 16/30, acc_iter=108942, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:50:30, time_cost(all): 1 day, 5:33:56/1 day, 1:23:38, loss=0.420570201796543, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=2.3467849924558273, lr=0.000427932098435459
2023-11-22 13:59:17   INFO  epoch: 16/30, acc_iter=108992, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:46:43, time_cost(all): 1 day, 5:34:46/1 day, 0:35:24, loss=0.420487087810234, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=3.789119621785164, lr=0.000427611364253534
2023-11-22 14:00:06   INFO  epoch: 16/30, acc_iter=109042, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:40, time_cost(all): 1 day, 5:35:35/1 day, 0:30:27, loss=0.420403973823925, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=2.576612178919673, lr=0.000427290630071609
2023-11-22 14:00:55   INFO  epoch: 16/30, acc_iter=109092, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:59, time_cost(all): 1 day, 5:36:24/1 day, 1:09:54, loss=0.420320859837615, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=0.8275292046149376, lr=0.000426969895889685
2023-11-22 14:01:44   INFO  epoch: 16/30, acc_iter=109142, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:46:00, time_cost(all): 1 day, 5:37:13/1 day, 1:05:03, loss=0.420237745851306, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=0.5576535346575062, lr=0.00042664916170776
2023-11-22 14:02:33   INFO  epoch: 16/30, acc_iter=109192, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:43:41, time_cost(all): 1 day, 5:38:02/1 day, 0:34:46, loss=0.420154631864997, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=4.38889119160757, lr=0.000426328427525835
2023-11-22 14:03:22   INFO  epoch: 16/30, acc_iter=109242, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:43:59, time_cost(all): 1 day, 5:38:51/23:58:40, loss=0.420071517878688, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=1.0840562509063483, lr=0.00042600769334391
2023-11-22 14:04:11   INFO  epoch: 16/30, acc_iter=109292, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:19, time_cost(all): 1 day, 5:39:40/1 day, 1:18:59, loss=0.419988403892379, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=1.624809170591942, lr=0.000425686959161986
2023-11-22 14:05:00   INFO  epoch: 16/30, acc_iter=109342, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:41, time_cost(all): 1 day, 5:40:29/1 day, 0:27:19, loss=0.41990528990607, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=1.9741819540211454, lr=0.000425366224980061
2023-11-22 14:05:50   INFO  epoch: 16/30, acc_iter=109392, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:43:16, time_cost(all): 1 day, 5:41:19/1 day, 0:47:09, loss=0.419822175919761, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=1.6065550525495986, lr=0.000425045490798136
2023-11-22 14:06:39   INFO  epoch: 16/30, acc_iter=109442, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:48, time_cost(all): 1 day, 5:42:08/1 day, 0:37:34, loss=0.419739061933452, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=4.536799997923351, lr=0.000424724756616212
2023-11-22 14:07:28   INFO  epoch: 16/30, acc_iter=109492, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:39:21, time_cost(all): 1 day, 5:42:57/23:16:20, loss=0.419655947947143, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=1.6511279956476388, lr=0.000424404022434287
2023-11-22 14:08:17   INFO  epoch: 16/30, acc_iter=109542, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:36, time_cost(all): 1 day, 5:43:46/23:15:42, loss=0.419572833960834, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=3.1394950024439217, lr=0.000424083288252362
2023-11-22 14:09:06   INFO  epoch: 16/30, acc_iter=109592, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:52, time_cost(all): 1 day, 5:44:35/1 day, 0:22:51, loss=0.419489719974525, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=1.6215588710475324, lr=0.000423762554070438
2023-11-22 14:09:55   INFO  epoch: 16/30, acc_iter=109642, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:40:00, time_cost(all): 1 day, 5:45:24/1 day, 0:29:54, loss=0.419406605988216, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=4.197108401239605, lr=0.000423441819888513
2023-11-22 14:10:44   INFO  epoch: 16/30, acc_iter=109692, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:38:41, time_cost(all): 1 day, 5:46:13/1 day, 0:30:15, loss=0.419323492001907, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=1.264465670772835, lr=0.000423121085706588
2023-11-22 14:11:33   INFO  epoch: 16/30, acc_iter=109742, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:18, time_cost(all): 1 day, 5:47:02/1 day, 1:18:55, loss=0.419240378015598, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=1.2433314396304804, lr=0.000422800351524663
2023-11-22 14:12:22   INFO  epoch: 16/30, acc_iter=109792, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:36:40, time_cost(all): 1 day, 5:47:51/1 day, 0:56:54, loss=0.419157264029289, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=4.247572661574273, lr=0.000422479617342739
2023-11-22 14:13:12   INFO  epoch: 16/30, acc_iter=109842, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:36:28, time_cost(all): 1 day, 5:48:41/1 day, 1:09:35, loss=0.41907415004298, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=4.265106397939688, lr=0.000422158883160814
2023-11-22 14:14:01   INFO  epoch: 16/30, acc_iter=109892, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:32:52, time_cost(all): 1 day, 5:49:30/1 day, 1:10:20, loss=0.418991036056671, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=4.615375241260948, lr=0.000421838148978889
2023-11-22 14:14:50   INFO  epoch: 16/30, acc_iter=109942, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:34:00, time_cost(all): 1 day, 5:50:19/1 day, 1:01:12, loss=0.418907922070361, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=2.866194056873554, lr=0.000421517414796965
2023-11-22 14:15:39   INFO  epoch: 16/30, acc_iter=109992, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:32, time_cost(all): 1 day, 5:51:08/1 day, 0:04:25, loss=0.418824808084052, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=4.803939248639195, lr=0.00042119668061504
2023-11-22 14:16:28   INFO  epoch: 16/30, acc_iter=110042, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:43, time_cost(all): 1 day, 5:51:57/23:51:13, loss=0.418741694097743, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=1.3709922740308143, lr=0.000420875946433115
2023-11-22 14:17:17   INFO  epoch: 16/30, acc_iter=110092, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:57, time_cost(all): 1 day, 5:52:46/1 day, 0:07:18, loss=0.418658580111434, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=3.6780776685969734, lr=0.000420555212251191
2023-11-22 14:18:06   INFO  epoch: 16/30, acc_iter=110142, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:06, time_cost(all): 1 day, 5:53:35/1 day, 0:00:04, loss=0.418575466125125, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=1.632359185502942, lr=0.000420234478069266
2023-11-22 14:18:55   INFO  epoch: 16/30, acc_iter=110192, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:30:33, time_cost(all): 1 day, 5:54:24/23:44:27, loss=0.418492352138816, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=3.143812242632015, lr=0.000419913743887341
2023-11-22 14:19:45   INFO  epoch: 16/30, acc_iter=110242, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:18, time_cost(all): 1 day, 5:55:14/1 day, 0:05:29, loss=0.418409238152507, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.9242620535352493, lr=0.000419593009705416
2023-11-22 14:20:34   INFO  epoch: 16/30, acc_iter=110292, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:17, time_cost(all): 1 day, 5:56:03/23:03:38, loss=0.418326124166198, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=3.718148191087836, lr=0.000419272275523492
2023-11-22 14:21:23   INFO  epoch: 16/30, acc_iter=110342, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:28:01, time_cost(all): 1 day, 5:56:52/1 day, 0:44:27, loss=0.418243010179889, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=3.8488258005330938, lr=0.000418951541341567
2023-11-22 14:22:12   INFO  epoch: 16/30, acc_iter=110392, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:27:14, time_cost(all): 1 day, 5:57:41/23:16:57, loss=0.41815989619358, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=4.774113511264644, lr=0.000418630807159642
2023-11-22 14:23:01   INFO  epoch: 16/30, acc_iter=110442, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:00, time_cost(all): 1 day, 5:58:30/1 day, 0:38:58, loss=0.418076782207271, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=0.6006243870480356, lr=0.000418310072977718
2023-11-22 14:23:50   INFO  epoch: 16/30, acc_iter=110492, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:11, time_cost(all): 1 day, 5:59:19/1 day, 0:42:22, loss=0.417993668220962, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=0.7191169278189651, lr=0.000417989338795793
2023-11-22 14:24:39   INFO  epoch: 16/30, acc_iter=110542, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:24:22, time_cost(all): 1 day, 6:00:08/23:27:02, loss=0.417910554234653, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=3.1032964009860575, lr=0.000417668604613868
2023-11-22 14:25:28   INFO  epoch: 16/30, acc_iter=110592, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:22, time_cost(all): 1 day, 6:00:57/1 day, 0:29:14, loss=0.417827440248344, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.21(1.03), norm=3.6479455599609376, lr=0.000417347870431943
2023-11-22 14:26:17   INFO  epoch: 16/30, acc_iter=110642, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:01, time_cost(all): 1 day, 6:01:46/23:29:57, loss=0.417744326262035, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=3.1363163018157403, lr=0.000417027136250019
2023-11-22 14:27:07   INFO  epoch: 16/30, acc_iter=110692, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:09, time_cost(all): 1 day, 6:02:36/1 day, 0:22:48, loss=0.417661212275726, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=4.824263695514406, lr=0.000416706402068094
2023-11-22 14:27:56   INFO  epoch: 16/30, acc_iter=110742, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:35, time_cost(all): 1 day, 6:03:25/23:08:06, loss=0.417578098289416, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=3.994526887931024, lr=0.000416385667886169
2023-11-22 14:28:45   INFO  epoch: 16/30, acc_iter=110792, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:31, time_cost(all): 1 day, 6:04:14/22:54:04, loss=0.417494984303107, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=2.0709402307411464, lr=0.000416064933704245
2023-11-22 14:29:34   INFO  epoch: 16/30, acc_iter=110842, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:40, time_cost(all): 1 day, 6:05:03/1 day, 0:16:20, loss=0.417411870316798, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=2.373950104102226, lr=0.00041574419952232
2023-11-22 14:30:23   INFO  epoch: 16/30, acc_iter=110892, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:00, time_cost(all): 1 day, 6:05:52/23:54:37, loss=0.417328756330489, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=3.2963877489543068, lr=0.000415423465340395
2023-11-22 14:31:12   INFO  epoch: 16/30, acc_iter=110942, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:58, time_cost(all): 1 day, 6:06:41/23:14:06, loss=0.41724564234418, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=0.7821731027228742, lr=0.000415102731158471
2023-11-22 14:32:01   INFO  epoch: 16/30, acc_iter=110992, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:16, time_cost(all): 1 day, 6:07:30/23:36:53, loss=0.417162528357871, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=4.855044785143075, lr=0.000414781996976546
2023-11-22 14:32:50   INFO  epoch: 16/30, acc_iter=111042, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:25, time_cost(all): 1 day, 6:08:19/23:18:29, loss=0.417079414371562, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=2.9792294025652173, lr=0.000414461262794621
2023-11-22 14:33:40   INFO  epoch: 16/30, acc_iter=111092, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:04, time_cost(all): 1 day, 6:09:09/22:50:16, loss=0.416996300385253, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.2282290462574994, lr=0.000414140528612696
2023-11-22 14:34:29   INFO  epoch: 16/30, acc_iter=111142, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:45, time_cost(all): 1 day, 6:09:58/23:24:56, loss=0.416913186398944, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=2.9991726490227433, lr=0.000413819794430772
2023-11-22 14:35:18   INFO  epoch: 16/30, acc_iter=111192, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:16, time_cost(all): 1 day, 6:10:47/1 day, 0:55:32, loss=0.416830072412635, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=1.8888044018771208, lr=0.000413499060248847
2023-11-22 14:36:07   INFO  epoch: 16/30, acc_iter=111242, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:50, time_cost(all): 1 day, 6:11:36/23:52:57, loss=0.416746958426326, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=4.128753433879893, lr=0.000413178326066922
2023-11-22 14:36:56   INFO  epoch: 16/30, acc_iter=111292, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:24, time_cost(all): 1 day, 6:12:25/1 day, 0:38:17, loss=0.416663844440017, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=3.8429400285323134, lr=0.000412857591884998
2023-11-22 14:37:45   INFO  epoch: 16/30, acc_iter=111342, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:48, time_cost(all): 1 day, 6:13:14/23:51:38, loss=0.416580730453708, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=4.86445310749787, lr=0.000412536857703073
2023-11-22 14:38:34   INFO  epoch: 16/30, acc_iter=111392, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:46, time_cost(all): 1 day, 6:14:03/22:40:27, loss=0.416497616467399, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=3.926858927992887, lr=0.000412216123521148
2023-11-22 14:39:23   INFO  epoch: 16/30, acc_iter=111442, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:46, time_cost(all): 1 day, 6:14:52/23:30:30, loss=0.41641450248109, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=3.5200961405597724, lr=0.000411895389339224
2023-11-22 14:40:12   INFO  epoch: 16/30, acc_iter=111492, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:42, time_cost(all): 1 day, 6:15:41/23:32:24, loss=0.41633138849478, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=0.6504540266024229, lr=0.000411574655157299
2023-11-22 14:41:02   INFO  epoch: 16/30, acc_iter=111542, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:28, time_cost(all): 1 day, 6:16:31/23:53:27, loss=0.416248274508471, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=0.9874895666782967, lr=0.000411253920975374
2023-11-22 14:41:51   INFO  epoch: 16/30, acc_iter=111592, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:03, time_cost(all): 1 day, 6:17:20/1 day, 0:02:37, loss=0.416165160522162, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=4.277119100443054, lr=0.000410933186793449
2023-11-22 14:42:40   INFO  epoch: 16/30, acc_iter=111642, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:26, time_cost(all): 1 day, 6:18:09/1 day, 0:22:35, loss=0.416082046535853, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=3.5413741752229386, lr=0.000410612452611525
2023-11-22 14:43:29   INFO  epoch: 16/30, acc_iter=111692, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:37, time_cost(all): 1 day, 6:18:58/1 day, 0:06:06, loss=0.415998932549544, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=4.108613527707396, lr=0.0004102917184296
2023-11-22 14:44:18   INFO  epoch: 16/30, acc_iter=111742, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:55, time_cost(all): 1 day, 6:19:47/1 day, 0:31:12, loss=0.415915818563235, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=4.174566529125047, lr=0.000409970984247675
2023-11-22 14:45:07   INFO  epoch: 16/30, acc_iter=111792, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:55, time_cost(all): 1 day, 6:20:36/22:42:28, loss=0.415832704576926, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=0.5165254842043205, lr=0.000409650250065751
2023-11-22 14:45:56   INFO  epoch: 16/30, acc_iter=111842, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:11, time_cost(all): 1 day, 6:21:25/23:45:46, loss=0.415749590590617, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=4.299481585435217, lr=0.000409329515883826
2023-11-22 14:46:45   INFO  epoch: 16/30, acc_iter=111892, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:21, time_cost(all): 1 day, 6:22:14/22:47:18, loss=0.415666476604308, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=1.7829278445168735, lr=0.000409008781701901
2023-11-22 14:47:35   INFO  epoch: 16/30, acc_iter=111942, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:38, time_cost(all): 1 day, 6:23:04/23:35:49, loss=0.415583362617999, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=4.294952853343315, lr=0.000408688047519976
2023-11-22 14:48:24   INFO  epoch: 17/30, acc_iter=112029, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:46:52, time_cost(all): 1 day, 6:23:53/22:54:16, loss=0.415438744281821, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=4.59893669590381, lr=0.000408129970043427
2023-11-22 14:49:13   INFO  epoch: 17/30, acc_iter=112079, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:50:57, time_cost(all): 1 day, 6:24:42/1 day, 0:33:16, loss=0.415355630295512, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=2.9614510776887, lr=0.000407809235861503
2023-11-22 14:50:02   INFO  epoch: 17/30, acc_iter=112129, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:47:17, time_cost(all): 1 day, 6:25:31/22:34:35, loss=0.415272516309203, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=3.7307928039476717, lr=0.000407488501679578
2023-11-22 14:50:51   INFO  epoch: 17/30, acc_iter=112179, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:42:37, time_cost(all): 1 day, 6:26:20/23:08:19, loss=0.415189402322894, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=4.966863318363098, lr=0.000407167767497653
2023-11-22 14:51:40   INFO  epoch: 17/30, acc_iter=112229, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:39:52, time_cost(all): 1 day, 6:27:09/22:20:16, loss=0.415106288336585, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=2.2865169057301467, lr=0.000406847033315729
2023-11-22 14:52:29   INFO  epoch: 17/30, acc_iter=112279, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:45:00, time_cost(all): 1 day, 6:27:58/1 day, 0:36:05, loss=0.415023174350276, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=3.1494135647016543, lr=0.000406526299133804
2023-11-22 14:53:18   INFO  epoch: 17/30, acc_iter=112329, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:42:56, time_cost(all): 1 day, 6:28:47/22:32:22, loss=0.414940060363967, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=3.036649606424257, lr=0.000406205564951879
2023-11-22 14:54:07   INFO  epoch: 17/30, acc_iter=112379, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:37:14, time_cost(all): 1 day, 6:29:36/1 day, 0:21:25, loss=0.414856946377658, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=3.902425147324526, lr=0.000405884830769955
2023-11-22 14:54:57   INFO  epoch: 17/30, acc_iter=112429, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:38:22, time_cost(all): 1 day, 6:30:26/22:28:05, loss=0.414773832391349, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=1.0835594507179422, lr=0.00040556409658803
2023-11-22 14:55:46   INFO  epoch: 17/30, acc_iter=112479, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:38:11, time_cost(all): 1 day, 6:31:15/1 day, 0:28:25, loss=0.41469071840504, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=1.5880355411869405, lr=0.000405243362406105
2023-11-22 14:56:35   INFO  epoch: 17/30, acc_iter=112529, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:38:43, time_cost(all): 1 day, 6:32:04/23:33:43, loss=0.414607604418731, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=2.625127372414184, lr=0.00040492262822418
2023-11-22 14:57:24   INFO  epoch: 17/30, acc_iter=112579, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:33:44, time_cost(all): 1 day, 6:32:53/1 day, 0:23:47, loss=0.414524490432421, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=2.3587005239447105, lr=0.000404601894042256
2023-11-22 14:58:13   INFO  epoch: 17/30, acc_iter=112629, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:39:17, time_cost(all): 1 day, 6:33:42/1 day, 0:03:19, loss=0.414441376446112, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=1.9601138992498561, lr=0.000404281159860331
2023-11-22 14:59:02   INFO  epoch: 17/30, acc_iter=112679, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:41:09, time_cost(all): 1 day, 6:34:31/22:49:18, loss=0.414358262459803, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=1.135479442005579, lr=0.000403960425678406
2023-11-22 14:59:51   INFO  epoch: 17/30, acc_iter=112729, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:33:45, time_cost(all): 1 day, 6:35:20/22:59:28, loss=0.414275148473494, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=1.0388874841970912, lr=0.000403639691496482
2023-11-22 15:00:40   INFO  epoch: 17/30, acc_iter=112779, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:31:01, time_cost(all): 1 day, 6:36:09/22:32:56, loss=0.414192034487185, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=3.8818637743756828, lr=0.000403318957314557
2023-11-22 15:01:29   INFO  epoch: 17/30, acc_iter=112829, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:35:54, time_cost(all): 1 day, 6:36:58/23:11:55, loss=0.414108920500876, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=4.5634198498922265, lr=0.000402998223132632
2023-11-22 15:02:19   INFO  epoch: 17/30, acc_iter=112879, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:29:25, time_cost(all): 1 day, 6:37:48/22:11:38, loss=0.414025806514567, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=2.617336100370741, lr=0.000402677488950708
2023-11-22 15:03:08   INFO  epoch: 17/30, acc_iter=112929, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:33:19, time_cost(all): 1 day, 6:38:37/22:11:49, loss=0.413942692528258, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=2.7797214899126113, lr=0.000402356754768783
2023-11-22 15:03:57   INFO  epoch: 17/30, acc_iter=112979, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:30:16, time_cost(all): 1 day, 6:39:26/22:07:04, loss=0.413859578541949, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=1.8810124524665874, lr=0.000402036020586858
2023-11-22 15:04:46   INFO  epoch: 17/30, acc_iter=113029, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:27:47, time_cost(all): 1 day, 6:40:15/22:33:19, loss=0.41377646455564, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=4.211250142914178, lr=0.000401715286404933
2023-11-22 15:05:35   INFO  epoch: 17/30, acc_iter=113079, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:31:15, time_cost(all): 1 day, 6:41:04/1 day, 0:00:38, loss=0.413693350569331, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=1.79870571568956, lr=0.000401394552223009
2023-11-22 15:06:24   INFO  epoch: 17/30, acc_iter=113129, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:29:52, time_cost(all): 1 day, 6:41:53/22:35:11, loss=0.413610236583022, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=3.7045238756218333, lr=0.000401073818041084
2023-11-22 15:07:13   INFO  epoch: 17/30, acc_iter=113179, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:32:03, time_cost(all): 1 day, 6:42:42/23:22:21, loss=0.413527122596713, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=3.520283916143083, lr=0.000400753083859159
2023-11-22 15:08:02   INFO  epoch: 17/30, acc_iter=113229, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:23:49, time_cost(all): 1 day, 6:43:31/23:34:40, loss=0.413444008610404, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=0.633228996105436, lr=0.000400432349677235
2023-11-22 15:08:52   INFO  epoch: 17/30, acc_iter=113279, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:27:39, time_cost(all): 1 day, 6:44:21/23:45:31, loss=0.413360894624095, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.15(1.03), norm=1.5085080425052064, lr=0.00040011161549531
2023-11-22 15:09:41   INFO  epoch: 17/30, acc_iter=113329, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:26:20, time_cost(all): 1 day, 6:45:10/22:59:08, loss=0.413277780637786, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=0.6458803752793205, lr=0.000399790881313385
2023-11-22 15:10:30   INFO  epoch: 17/30, acc_iter=113379, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:29:02, time_cost(all): 1 day, 6:45:59/23:38:57, loss=0.413194666651476, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=3.1237447004171948, lr=0.00039947014713146
2023-11-22 15:11:19   INFO  epoch: 17/30, acc_iter=113429, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:21:07, time_cost(all): 1 day, 6:46:48/1 day, 0:14:39, loss=0.413111552665167, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=4.5719730860869126, lr=0.000399149412949536
2023-11-22 15:12:08   INFO  epoch: 17/30, acc_iter=113479, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:27:07, time_cost(all): 1 day, 6:47:37/1 day, 0:08:59, loss=0.413028438678858, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.13(1.03), norm=0.6743104216715865, lr=0.000398828678767611
2023-11-22 15:12:57   INFO  epoch: 17/30, acc_iter=113529, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:20:02, time_cost(all): 1 day, 6:48:26/22:58:52, loss=0.412945324692549, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=0.6153860835903904, lr=0.000398507944585686
2023-11-22 15:13:46   INFO  epoch: 17/30, acc_iter=113579, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:24:47, time_cost(all): 1 day, 6:49:15/23:37:32, loss=0.41286221070624, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=1.8989219541318023, lr=0.000398187210403762
2023-11-22 15:14:35   INFO  epoch: 17/30, acc_iter=113629, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:19:54, time_cost(all): 1 day, 6:50:04/23:22:42, loss=0.412779096719931, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=2.5946056638875485, lr=0.000397866476221837
2023-11-22 15:15:24   INFO  epoch: 17/30, acc_iter=113679, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:16:11, time_cost(all): 1 day, 6:50:53/23:06:14, loss=0.412695982733622, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=4.667366579929765, lr=0.000397545742039912
2023-11-22 15:16:14   INFO  epoch: 17/30, acc_iter=113729, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:21:08, time_cost(all): 1 day, 6:51:43/23:42:18, loss=0.412612868747313, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=1.5167469634940012, lr=0.000397225007857988
2023-11-22 15:17:03   INFO  epoch: 17/30, acc_iter=113779, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:20:03, time_cost(all): 1 day, 6:52:32/22:38:36, loss=0.412529754761004, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=2.7646373054159965, lr=0.000396904273676063
2023-11-22 15:17:52   INFO  epoch: 17/30, acc_iter=113829, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:21:20, time_cost(all): 1 day, 6:53:21/22:59:06, loss=0.412446640774695, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=1.5925108060627746, lr=0.000396583539494138
2023-11-22 15:18:41   INFO  epoch: 17/30, acc_iter=113879, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:14:45, time_cost(all): 1 day, 6:54:10/23:28:25, loss=0.412363526788386, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=1.0317480690729222, lr=0.000396262805312213
2023-11-22 15:19:30   INFO  epoch: 17/30, acc_iter=113929, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:21, time_cost(all): 1 day, 6:54:59/21:58:40, loss=0.412280412802077, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=0.5917031949200657, lr=0.000395942071130289
2023-11-22 15:20:19   INFO  epoch: 17/30, acc_iter=113979, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:14:24, time_cost(all): 1 day, 6:55:48/22:32:56, loss=0.412197298815768, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=1.972788469593777, lr=0.000395621336948364
2023-11-22 15:21:08   INFO  epoch: 17/30, acc_iter=114029, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:11:34, time_cost(all): 1 day, 6:56:37/22:56:46, loss=0.412114184829459, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=1.9877787264599456, lr=0.000395300602766439
2023-11-22 15:21:57   INFO  epoch: 17/30, acc_iter=114079, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:10:58, time_cost(all): 1 day, 6:57:26/22:25:13, loss=0.41203107084315, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=0.5081153129009328, lr=0.000394979868584515
2023-11-22 15:22:47   INFO  epoch: 17/30, acc_iter=114129, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:12:25, time_cost(all): 1 day, 6:58:16/23:33:46, loss=0.41194795685684, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=4.031293785317455, lr=0.00039465913440259
2023-11-22 15:23:36   INFO  epoch: 17/30, acc_iter=114179, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:10:19, time_cost(all): 1 day, 6:59:05/22:08:18, loss=0.411864842870531, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=1.5740208534995328, lr=0.000394338400220665
2023-11-22 15:24:25   INFO  epoch: 17/30, acc_iter=114229, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:11:33, time_cost(all): 1 day, 6:59:54/23:09:50, loss=0.411781728884222, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=2.0487999294397685, lr=0.000394017666038741
2023-11-22 15:25:14   INFO  epoch: 17/30, acc_iter=114279, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:07:59, time_cost(all): 1 day, 7:00:43/21:49:21, loss=0.411698614897913, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=4.841130318778537, lr=0.000393696931856816
2023-11-22 15:26:03   INFO  epoch: 17/30, acc_iter=114329, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:12:08, time_cost(all): 1 day, 7:01:32/22:54:04, loss=0.411615500911604, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=2.209472377627898, lr=0.000393376197674891
2023-11-22 15:26:52   INFO  epoch: 17/30, acc_iter=114379, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:05:24, time_cost(all): 1 day, 7:02:21/23:02:02, loss=0.411532386925295, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=4.376393854682782, lr=0.000393055463492966
2023-11-22 15:27:41   INFO  epoch: 17/30, acc_iter=114429, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:05:14, time_cost(all): 1 day, 7:03:10/21:51:51, loss=0.411449272938986, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=1.7977049192622898, lr=0.000392734729311042
2023-11-22 15:28:30   INFO  epoch: 17/30, acc_iter=114479, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:05:30, time_cost(all): 1 day, 7:03:59/23:33:31, loss=0.411366158952677, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=1.7498831053495572, lr=0.000392413995129117
2023-11-22 15:29:19   INFO  epoch: 17/30, acc_iter=114529, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:08:54, time_cost(all): 1 day, 7:04:48/23:13:04, loss=0.411283044966368, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=3.6071125471957712, lr=0.000392093260947192
2023-11-22 15:30:09   INFO  epoch: 17/30, acc_iter=114579, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:05:52, time_cost(all): 1 day, 7:05:38/22:10:16, loss=0.411199930980059, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=2.3979588341674516, lr=0.000391772526765268
2023-11-22 15:30:58   INFO  epoch: 17/30, acc_iter=114629, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:23, time_cost(all): 1 day, 7:06:27/23:28:07, loss=0.41111681699375, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=3.9609708433600486, lr=0.000391451792583343
2023-11-22 15:31:47   INFO  epoch: 17/30, acc_iter=114679, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:05:35, time_cost(all): 1 day, 7:07:16/23:22:22, loss=0.411033703007441, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=3.559411279900857, lr=0.000391131058401418
2023-11-22 15:32:36   INFO  epoch: 17/30, acc_iter=114729, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:43, time_cost(all): 1 day, 7:08:05/22:14:46, loss=0.410950589021132, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=4.074071730345088, lr=0.000390810324219493
2023-11-22 15:33:25   INFO  epoch: 17/30, acc_iter=114779, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:03:29, time_cost(all): 1 day, 7:08:54/22:58:58, loss=0.410867475034823, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=2.045685398818454, lr=0.000390489590037569
2023-11-22 15:34:14   INFO  epoch: 17/30, acc_iter=114829, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:03:55, time_cost(all): 1 day, 7:09:43/21:59:39, loss=0.410784361048514, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=1.879645170766706, lr=0.000390168855855644
2023-11-22 15:35:03   INFO  epoch: 17/30, acc_iter=114879, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:58:40, time_cost(all): 1 day, 7:10:32/23:40:49, loss=0.410701247062205, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.0655078102940672, lr=0.000389848121673719
2023-11-22 15:35:52   INFO  epoch: 17/30, acc_iter=114929, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:56:48, time_cost(all): 1 day, 7:11:21/22:17:30, loss=0.410618133075896, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=1.5880232317456882, lr=0.000389527387491795
2023-11-22 15:36:42   INFO  epoch: 17/30, acc_iter=114979, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:55:58, time_cost(all): 1 day, 7:12:11/21:53:11, loss=0.410535019089586, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=3.246855350701136, lr=0.00038920665330987
2023-11-22 15:37:31   INFO  epoch: 17/30, acc_iter=115029, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:58:58, time_cost(all): 1 day, 7:13:00/23:38:06, loss=0.410451905103277, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=3.462570767099421, lr=0.000388885919127945
2023-11-22 15:38:20   INFO  epoch: 17/30, acc_iter=115079, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:59:13, time_cost(all): 1 day, 7:13:49/22:53:45, loss=0.410368791116968, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=1.4124351230092576, lr=0.000388565184946021
2023-11-22 15:39:09   INFO  epoch: 17/30, acc_iter=115129, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:58:47, time_cost(all): 1 day, 7:14:38/22:11:50, loss=0.410285677130659, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=4.207158224656127, lr=0.000388244450764096
2023-11-22 15:39:58   INFO  epoch: 17/30, acc_iter=115179, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:58:08, time_cost(all): 1 day, 7:15:27/23:32:35, loss=0.41020256314435, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=0.8834968909682483, lr=0.000387923716582171
2023-11-22 15:40:47   INFO  epoch: 17/30, acc_iter=115229, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:54:32, time_cost(all): 1 day, 7:16:16/21:52:41, loss=0.410119449158041, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=0.8931188562449418, lr=0.000387602982400246
2023-11-22 15:41:36   INFO  epoch: 17/30, acc_iter=115279, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:54:09, time_cost(all): 1 day, 7:17:05/22:09:51, loss=0.410036335171732, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=2.5667868123424484, lr=0.000387282248218322
2023-11-22 15:42:25   INFO  epoch: 17/30, acc_iter=115329, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:50:57, time_cost(all): 1 day, 7:17:54/23:32:17, loss=0.409953221185423, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=2.081163702539156, lr=0.000386961514036397
2023-11-22 15:43:14   INFO  epoch: 17/30, acc_iter=115379, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:51:09, time_cost(all): 1 day, 7:18:43/23:42:43, loss=0.409870107199114, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=3.8792311944332973, lr=0.000386640779854472
2023-11-22 15:44:04   INFO  epoch: 17/30, acc_iter=115429, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:50:45, time_cost(all): 1 day, 7:19:33/23:24:48, loss=0.409786993212805, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=1.8469357955455505, lr=0.000386320045672548
2023-11-22 15:44:53   INFO  epoch: 17/30, acc_iter=115479, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:51:41, time_cost(all): 1 day, 7:20:22/23:06:51, loss=0.409703879226496, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=1.9042061840687752, lr=0.000385999311490623
2023-11-22 15:45:42   INFO  epoch: 17/30, acc_iter=115529, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:47:15, time_cost(all): 1 day, 7:21:11/22:16:40, loss=0.409620765240187, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=3.789058988497675, lr=0.000385678577308698
2023-11-22 15:46:31   INFO  epoch: 17/30, acc_iter=115579, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:47:45, time_cost(all): 1 day, 7:22:00/22:05:21, loss=0.409537651253878, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=1.1406266238591254, lr=0.000385357843126773
2023-11-22 15:47:20   INFO  epoch: 17/30, acc_iter=115629, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:18, time_cost(all): 1 day, 7:22:49/23:10:13, loss=0.409454537267569, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=3.891785537172914, lr=0.000385037108944849
2023-11-22 15:48:09   INFO  epoch: 17/30, acc_iter=115679, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:46:10, time_cost(all): 1 day, 7:23:38/22:58:32, loss=0.40937142328126, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=1.9574202639596252, lr=0.000384716374762924
2023-11-22 15:48:58   INFO  epoch: 17/30, acc_iter=115729, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:46:09, time_cost(all): 1 day, 7:24:27/21:48:24, loss=0.409288309294951, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=4.555838615921219, lr=0.000384395640580999
2023-11-22 15:49:47   INFO  epoch: 17/30, acc_iter=115779, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:44:20, time_cost(all): 1 day, 7:25:16/22:10:43, loss=0.409205195308641, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.785558762852003, lr=0.000384074906399075
2023-11-22 15:50:37   INFO  epoch: 17/30, acc_iter=115829, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:43:58, time_cost(all): 1 day, 7:26:06/22:21:04, loss=0.409122081322332, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=3.1521086428515956, lr=0.00038375417221715
2023-11-22 15:51:26   INFO  epoch: 17/30, acc_iter=115879, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:46, time_cost(all): 1 day, 7:26:55/23:15:19, loss=0.409038967336023, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=4.3157288869384365, lr=0.000383433438035225
2023-11-22 15:52:15   INFO  epoch: 17/30, acc_iter=115929, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:44:34, time_cost(all): 1 day, 7:27:44/23:22:52, loss=0.408955853349714, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=3.770438256530787, lr=0.000383112703853301
2023-11-22 15:53:04   INFO  epoch: 17/30, acc_iter=115979, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:43:44, time_cost(all): 1 day, 7:28:33/21:54:36, loss=0.408872739363405, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=4.107964781434644, lr=0.000382791969671376
2023-11-22 15:53:53   INFO  epoch: 17/30, acc_iter=116029, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:31, time_cost(all): 1 day, 7:29:22/21:28:29, loss=0.408789625377096, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=1.1277081660149348, lr=0.000382471235489451
2023-11-22 15:54:42   INFO  epoch: 17/30, acc_iter=116079, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:42:01, time_cost(all): 1 day, 7:30:11/22:10:24, loss=0.408706511390787, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=3.022959008866365, lr=0.000382150501307526
2023-11-22 15:55:31   INFO  epoch: 17/30, acc_iter=116129, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:41:05, time_cost(all): 1 day, 7:31:00/21:19:45, loss=0.408623397404478, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=4.434889759831158, lr=0.000381829767125602
2023-11-22 15:56:20   INFO  epoch: 17/30, acc_iter=116179, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:37:13, time_cost(all): 1 day, 7:31:49/22:44:39, loss=0.408540283418169, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=3.2540702636317578, lr=0.000381509032943677
2023-11-22 15:57:09   INFO  epoch: 17/30, acc_iter=116229, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:43, time_cost(all): 1 day, 7:32:38/23:05:47, loss=0.40845716943186, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=4.445565134991876, lr=0.000381188298761752
2023-11-22 15:57:59   INFO  epoch: 17/30, acc_iter=116279, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:36:29, time_cost(all): 1 day, 7:33:28/23:00:49, loss=0.408374055445551, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=3.9615943523302626, lr=0.000380867564579828
2023-11-22 15:58:48   INFO  epoch: 17/30, acc_iter=116329, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:35:58, time_cost(all): 1 day, 7:34:17/21:59:26, loss=0.408290941459242, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=3.695983671881784, lr=0.000380546830397903
2023-11-22 15:59:37   INFO  epoch: 17/30, acc_iter=116379, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:35:18, time_cost(all): 1 day, 7:35:06/22:21:04, loss=0.408207827472933, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=2.2401218294374496, lr=0.000380226096215978
2023-11-22 16:00:26   INFO  epoch: 17/30, acc_iter=116429, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:33:35, time_cost(all): 1 day, 7:35:55/22:29:52, loss=0.408124713486624, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=2.016381660259288, lr=0.000379905362034053
2023-11-22 16:01:15   INFO  epoch: 17/30, acc_iter=116479, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:49, time_cost(all): 1 day, 7:36:44/23:22:31, loss=0.408041599500315, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=4.860817864977063, lr=0.000379584627852129
2023-11-22 16:02:04   INFO  epoch: 17/30, acc_iter=116529, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:26, time_cost(all): 1 day, 7:37:33/21:29:30, loss=0.407958485514005, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=3.5943926782254163, lr=0.000379263893670204
2023-11-22 16:02:53   INFO  epoch: 17/30, acc_iter=116579, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:31, time_cost(all): 1 day, 7:38:22/22:50:56, loss=0.407875371527696, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=0.7063710007009816, lr=0.000378943159488279
2023-11-22 16:03:42   INFO  epoch: 17/30, acc_iter=116629, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:32:35, time_cost(all): 1 day, 7:39:11/22:55:26, loss=0.407792257541387, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=2.2698996842170924, lr=0.000378622425306355
2023-11-22 16:04:32   INFO  epoch: 17/30, acc_iter=116679, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:29:32, time_cost(all): 1 day, 7:40:01/22:03:12, loss=0.407709143555078, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=1.2912066866909755, lr=0.00037830169112443
2023-11-22 16:05:21   INFO  epoch: 17/30, acc_iter=116729, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:30:35, time_cost(all): 1 day, 7:40:50/21:43:19, loss=0.407626029568769, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=3.612801072606784, lr=0.000377980956942505
2023-11-22 16:06:10   INFO  epoch: 17/30, acc_iter=116779, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:28:03, time_cost(all): 1 day, 7:41:39/23:13:20, loss=0.40754291558246, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=2.264735750332929, lr=0.000377660222760581
2023-11-22 16:06:59   INFO  epoch: 17/30, acc_iter=116829, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:36, time_cost(all): 1 day, 7:42:28/22:24:09, loss=0.407459801596151, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=4.2610122811269475, lr=0.000377339488578656
2023-11-22 16:07:48   INFO  epoch: 17/30, acc_iter=116879, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:29, time_cost(all): 1 day, 7:43:17/21:46:49, loss=0.407376687609842, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=3.7171411525936087, lr=0.000377018754396731
2023-11-22 16:08:37   INFO  epoch: 17/30, acc_iter=116929, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:27, time_cost(all): 1 day, 7:44:06/21:28:16, loss=0.407293573623533, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=3.224520234090578, lr=0.000376698020214806
2023-11-22 16:09:26   INFO  epoch: 17/30, acc_iter=116979, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:31, time_cost(all): 1 day, 7:44:55/21:04:28, loss=0.407210459637224, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=0.7833920402625854, lr=0.000376377286032882
2023-11-22 16:10:15   INFO  epoch: 17/30, acc_iter=117029, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:17, time_cost(all): 1 day, 7:45:44/22:51:05, loss=0.407127345650915, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=0.565765489457899, lr=0.000376056551850957
2023-11-22 16:11:04   INFO  epoch: 17/30, acc_iter=117079, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:43, time_cost(all): 1 day, 7:46:33/22:05:31, loss=0.407044231664606, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=4.14229917885784, lr=0.000375735817669032
2023-11-22 16:11:54   INFO  epoch: 17/30, acc_iter=117129, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:06, time_cost(all): 1 day, 7:47:23/21:07:06, loss=0.406961117678297, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=0.6106073107966097, lr=0.000375415083487108
2023-11-22 16:12:43   INFO  epoch: 17/30, acc_iter=117179, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:24, time_cost(all): 1 day, 7:48:12/21:33:05, loss=0.406878003691988, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=4.591397325072747, lr=0.000375094349305183
2023-11-22 16:13:32   INFO  epoch: 17/30, acc_iter=117229, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:20:55, time_cost(all): 1 day, 7:49:01/23:11:09, loss=0.406794889705679, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=1.7919871757110217, lr=0.000374773615123258
2023-11-22 16:14:21   INFO  epoch: 17/30, acc_iter=117279, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:35, time_cost(all): 1 day, 7:49:50/21:09:09, loss=0.40671177571937, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=3.3254365308002947, lr=0.000374452880941333
2023-11-22 16:15:10   INFO  epoch: 17/30, acc_iter=117329, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:27, time_cost(all): 1 day, 7:50:39/22:24:34, loss=0.40662866173306, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=2.494008581953275, lr=0.000374132146759409
2023-11-22 16:15:59   INFO  epoch: 17/30, acc_iter=117379, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:26, time_cost(all): 1 day, 7:51:28/21:59:52, loss=0.406545547746751, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=3.9769394912615468, lr=0.000373811412577484
2023-11-22 16:16:48   INFO  epoch: 17/30, acc_iter=117429, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:55, time_cost(all): 1 day, 7:52:17/21:28:40, loss=0.406462433760442, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=3.4540748697019423, lr=0.000373490678395559
2023-11-22 16:17:37   INFO  epoch: 17/30, acc_iter=117479, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:35, time_cost(all): 1 day, 7:53:06/22:34:49, loss=0.406379319774133, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=0.8522760666163859, lr=0.000373169944213635
2023-11-22 16:18:27   INFO  epoch: 17/30, acc_iter=117529, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:09, time_cost(all): 1 day, 7:53:56/21:54:47, loss=0.406296205787824, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=1.9628796028397721, lr=0.00037284921003171
2023-11-22 16:19:16   INFO  epoch: 17/30, acc_iter=117579, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:46, time_cost(all): 1 day, 7:54:45/21:08:52, loss=0.406213091801515, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=0.5244716176072801, lr=0.000372528475849785
2023-11-22 16:20:05   INFO  epoch: 17/30, acc_iter=117629, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:14:35, time_cost(all): 1 day, 7:55:34/22:45:56, loss=0.406129977815206, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=4.01371745402881, lr=0.000372207741667861
2023-11-22 16:20:54   INFO  epoch: 17/30, acc_iter=117679, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:22, time_cost(all): 1 day, 7:56:23/21:05:10, loss=0.406046863828897, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=4.492953447605432, lr=0.000371887007485936
2023-11-22 16:21:43   INFO  epoch: 17/30, acc_iter=117729, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:14:09, time_cost(all): 1 day, 7:57:12/22:23:51, loss=0.405963749842588, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=4.550395192420171, lr=0.000371566273304011
2023-11-22 16:22:32   INFO  epoch: 17/30, acc_iter=117779, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:15, time_cost(all): 1 day, 7:58:01/22:23:56, loss=0.405880635856279, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=4.527102354105136, lr=0.000371245539122086
2023-11-22 16:23:21   INFO  epoch: 17/30, acc_iter=117829, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:05, time_cost(all): 1 day, 7:58:50/21:15:45, loss=0.40579752186997, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=0.8541708327348996, lr=0.000370924804940162
2023-11-22 16:24:10   INFO  epoch: 17/30, acc_iter=117879, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:22, time_cost(all): 1 day, 7:59:39/21:22:05, loss=0.405714407883661, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=4.629447870894381, lr=0.000370604070758237
2023-11-22 16:24:59   INFO  epoch: 17/30, acc_iter=117929, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:09:57, time_cost(all): 1 day, 8:00:28/21:49:22, loss=0.405631293897352, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=2.350396647241688, lr=0.000370283336576312
2023-11-22 16:25:49   INFO  epoch: 17/30, acc_iter=117979, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:14, time_cost(all): 1 day, 8:01:18/21:39:40, loss=0.405548179911043, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=1.1186626431382964, lr=0.000369962602394388
2023-11-22 16:26:38   INFO  epoch: 17/30, acc_iter=118029, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:56, time_cost(all): 1 day, 8:02:07/20:59:23, loss=0.405465065924734, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=4.600607359580202, lr=0.000369641868212463
2023-11-22 16:27:27   INFO  epoch: 17/30, acc_iter=118079, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:50, time_cost(all): 1 day, 8:02:56/22:18:42, loss=0.405381951938425, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=2.0988014709300233, lr=0.000369321134030538
2023-11-22 16:28:16   INFO  epoch: 17/30, acc_iter=118129, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:15, time_cost(all): 1 day, 8:03:45/22:53:02, loss=0.405298837952116, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=4.261958456045203, lr=0.000369000399848613
2023-11-22 16:29:05   INFO  epoch: 17/30, acc_iter=118179, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:33, time_cost(all): 1 day, 8:04:34/22:06:51, loss=0.405215723965806, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=2.3344802069023327, lr=0.000368679665666689
2023-11-22 16:29:54   INFO  epoch: 17/30, acc_iter=118229, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:44, time_cost(all): 1 day, 8:05:23/22:24:56, loss=0.405132609979497, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=4.818239557500128, lr=0.000368358931484764
2023-11-22 16:30:43   INFO  epoch: 17/30, acc_iter=118279, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:53, time_cost(all): 1 day, 8:06:12/21:33:34, loss=0.405049495993188, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=3.113067538007272, lr=0.000368038197302839
2023-11-22 16:31:32   INFO  epoch: 17/30, acc_iter=118329, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:46, time_cost(all): 1 day, 8:07:01/21:07:02, loss=0.404966382006879, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=4.902077260392028, lr=0.000367717463120915
2023-11-22 16:32:22   INFO  epoch: 17/30, acc_iter=118379, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:08, time_cost(all): 1 day, 8:07:51/20:59:20, loss=0.40488326802057, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=3.1095970014926286, lr=0.00036739672893899
2023-11-22 16:33:11   INFO  epoch: 17/30, acc_iter=118429, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:09, time_cost(all): 1 day, 8:08:40/21:32:07, loss=0.404800154034261, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=2.191243056178108, lr=0.000367075994757065
2023-11-22 16:34:00   INFO  epoch: 17/30, acc_iter=118479, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:21, time_cost(all): 1 day, 8:09:29/22:25:39, loss=0.404717040047952, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=4.796898836694738, lr=0.000366755260575141
2023-11-22 16:34:49   INFO  epoch: 17/30, acc_iter=118529, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:38, time_cost(all): 1 day, 8:10:18/22:16:18, loss=0.404633926061643, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=4.545467938363549, lr=0.000366434526393216
2023-11-22 16:35:38   INFO  epoch: 18/30, acc_iter=118616, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:49:53, time_cost(all): 1 day, 8:11:07/20:42:35, loss=0.404489307725465, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=1.250609075566789, lr=0.000365876448916667
2023-11-22 16:36:27   INFO  epoch: 18/30, acc_iter=118666, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:41:45, time_cost(all): 1 day, 8:11:56/22:19:55, loss=0.404406193739156, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.66627286846655, lr=0.000365555714734742
2023-11-22 16:37:16   INFO  epoch: 18/30, acc_iter=118716, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:41:17, time_cost(all): 1 day, 8:12:45/20:41:45, loss=0.404323079752847, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=3.1960132738417375, lr=0.000365234980552818
2023-11-22 16:38:05   INFO  epoch: 18/30, acc_iter=118766, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:42:37, time_cost(all): 1 day, 8:13:34/21:13:46, loss=0.404239965766538, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=3.366382046969786, lr=0.000364914246370893
2023-11-22 16:38:54   INFO  epoch: 18/30, acc_iter=118816, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:46:00, time_cost(all): 1 day, 8:14:23/21:02:50, loss=0.404156851780229, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=0.5633638656328737, lr=0.000364593512188968
2023-11-22 16:39:44   INFO  epoch: 18/30, acc_iter=118866, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:48:03, time_cost(all): 1 day, 8:15:13/22:38:43, loss=0.40407373779392, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=0.798629896591075, lr=0.000364272778007043
2023-11-22 16:40:33   INFO  epoch: 18/30, acc_iter=118916, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:40:53, time_cost(all): 1 day, 8:16:02/20:35:38, loss=0.403990623807611, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=1.7146472848881769, lr=0.000363952043825119
2023-11-22 16:41:22   INFO  epoch: 18/30, acc_iter=118966, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:39:03, time_cost(all): 1 day, 8:16:51/21:20:16, loss=0.403907509821302, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=1.359880206818667, lr=0.000363631309643194
2023-11-22 16:42:11   INFO  epoch: 18/30, acc_iter=119016, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:42:04, time_cost(all): 1 day, 8:17:40/21:14:16, loss=0.403824395834993, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=1.22307079276998, lr=0.000363310575461269
2023-11-22 16:43:00   INFO  epoch: 18/30, acc_iter=119066, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:39:56, time_cost(all): 1 day, 8:18:29/21:44:19, loss=0.403741281848684, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=1.1336396496602967, lr=0.000362989841279345
2023-11-22 16:43:49   INFO  epoch: 18/30, acc_iter=119116, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:36:14, time_cost(all): 1 day, 8:19:18/21:27:37, loss=0.403658167862375, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=1.3207803257741046, lr=0.00036266910709742
2023-11-22 16:44:38   INFO  epoch: 18/30, acc_iter=119166, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:37:35, time_cost(all): 1 day, 8:20:07/20:41:03, loss=0.403575053876066, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=1.5019305895469768, lr=0.000362348372915495
2023-11-22 16:45:27   INFO  epoch: 18/30, acc_iter=119216, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:33:19, time_cost(all): 1 day, 8:20:56/20:55:10, loss=0.403491939889756, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=3.503348541268123, lr=0.00036202763873357
2023-11-22 16:46:17   INFO  epoch: 18/30, acc_iter=119266, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:39:03, time_cost(all): 1 day, 8:21:46/22:05:03, loss=0.403408825903447, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=2.3598194729765636, lr=0.000361706904551646
2023-11-22 16:47:06   INFO  epoch: 18/30, acc_iter=119316, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:36:51, time_cost(all): 1 day, 8:22:35/20:53:32, loss=0.403325711917138, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=4.619745127988074, lr=0.000361386170369721
2023-11-22 16:47:55   INFO  epoch: 18/30, acc_iter=119366, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:36:41, time_cost(all): 1 day, 8:23:24/20:42:54, loss=0.403242597930829, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=4.073025726174451, lr=0.000361065436187796
2023-11-22 16:48:44   INFO  epoch: 18/30, acc_iter=119416, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:30:52, time_cost(all): 1 day, 8:24:13/21:45:17, loss=0.40315948394452, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=0.7441823949474263, lr=0.000360744702005872
2023-11-22 16:49:33   INFO  epoch: 18/30, acc_iter=119466, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:30:13, time_cost(all): 1 day, 8:25:02/21:52:15, loss=0.403076369958211, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=3.8771563413395005, lr=0.000360423967823947
2023-11-22 16:50:22   INFO  epoch: 18/30, acc_iter=119516, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:28:23, time_cost(all): 1 day, 8:25:51/22:20:15, loss=0.402993255971902, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=0.8227398304820839, lr=0.000360103233642022
2023-11-22 16:51:11   INFO  epoch: 18/30, acc_iter=119566, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:31:56, time_cost(all): 1 day, 8:26:40/21:07:12, loss=0.402910141985593, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=4.625624761374241, lr=0.000359782499460098
2023-11-22 16:52:00   INFO  epoch: 18/30, acc_iter=119616, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:30:17, time_cost(all): 1 day, 8:27:29/21:55:36, loss=0.402827027999284, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=1.2272239724301044, lr=0.000359461765278173
2023-11-22 16:52:49   INFO  epoch: 18/30, acc_iter=119666, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:33:10, time_cost(all): 1 day, 8:28:18/21:47:04, loss=0.402743914012975, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=2.6210182235260513, lr=0.000359141031096248
2023-11-22 16:53:39   INFO  epoch: 18/30, acc_iter=119716, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:28:01, time_cost(all): 1 day, 8:29:08/20:29:14, loss=0.402660800026666, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=4.076954367073688, lr=0.000358820296914323
2023-11-22 16:54:28   INFO  epoch: 18/30, acc_iter=119766, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:28:15, time_cost(all): 1 day, 8:29:57/20:50:02, loss=0.402577686040357, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=3.748000034627677, lr=0.000358499562732399
2023-11-22 16:55:17   INFO  epoch: 18/30, acc_iter=119816, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:27:53, time_cost(all): 1 day, 8:30:46/21:22:39, loss=0.402494572054048, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=2.4153794578262184, lr=0.000358178828550474
2023-11-22 16:56:06   INFO  epoch: 18/30, acc_iter=119866, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:27:06, time_cost(all): 1 day, 8:31:35/21:03:39, loss=0.402411458067739, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=2.912768920649943, lr=0.000357858094368549
2023-11-22 16:56:55   INFO  epoch: 18/30, acc_iter=119916, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:25:07, time_cost(all): 1 day, 8:32:24/21:15:18, loss=0.40232834408143, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=3.5972790501382783, lr=0.000357537360186625
2023-11-22 16:57:44   INFO  epoch: 18/30, acc_iter=119966, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:22:25, time_cost(all): 1 day, 8:33:13/22:08:24, loss=0.40224523009512, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=0.8021737274757962, lr=0.0003572166260047
2023-11-22 16:58:33   INFO  epoch: 18/30, acc_iter=120016, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:22:53, time_cost(all): 1 day, 8:34:02/21:10:22, loss=0.402162116108811, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=0.7854012793767837, lr=0.000356895891822775
2023-11-22 16:59:22   INFO  epoch: 18/30, acc_iter=120066, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:26:22, time_cost(all): 1 day, 8:34:51/20:39:58, loss=0.402079002122502, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=2.3317717886580303, lr=0.00035657515764085
2023-11-22 17:00:11   INFO  epoch: 18/30, acc_iter=120116, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:25:48, time_cost(all): 1 day, 8:35:40/22:18:11, loss=0.401995888136193, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=0.8510410921564953, lr=0.000356254423458926
2023-11-22 17:01:01   INFO  epoch: 18/30, acc_iter=120166, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:21:25, time_cost(all): 1 day, 8:36:30/22:20:35, loss=0.401912774149884, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=4.562941013141365, lr=0.000355933689277001
2023-11-22 17:01:50   INFO  epoch: 18/30, acc_iter=120216, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:16:58, time_cost(all): 1 day, 8:37:19/21:24:51, loss=0.401829660163575, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=4.752435200106842, lr=0.000355612955095076
2023-11-22 17:02:39   INFO  epoch: 18/30, acc_iter=120266, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:18:56, time_cost(all): 1 day, 8:38:08/21:34:49, loss=0.401746546177266, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.0869619954460927, lr=0.000355292220913152
2023-11-22 17:03:28   INFO  epoch: 18/30, acc_iter=120316, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:23:08, time_cost(all): 1 day, 8:38:57/22:09:16, loss=0.401663432190957, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=4.2202593975084195, lr=0.000354971486731227
2023-11-22 17:04:17   INFO  epoch: 18/30, acc_iter=120366, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:22:04, time_cost(all): 1 day, 8:39:46/21:56:08, loss=0.401580318204648, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=3.2758945179872208, lr=0.000354650752549302
2023-11-22 17:05:06   INFO  epoch: 18/30, acc_iter=120416, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:20:44, time_cost(all): 1 day, 8:40:35/21:51:16, loss=0.401497204218339, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=3.069017195844475, lr=0.000354330018367378
2023-11-22 17:05:55   INFO  epoch: 18/30, acc_iter=120466, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:19:05, time_cost(all): 1 day, 8:41:24/20:57:36, loss=0.40141409023203, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=3.809433246127452, lr=0.000354009284185453
2023-11-22 17:06:44   INFO  epoch: 18/30, acc_iter=120516, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:15, time_cost(all): 1 day, 8:42:13/21:43:46, loss=0.401330976245721, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=0.9151481955132643, lr=0.000353688550003528
2023-11-22 17:07:34   INFO  epoch: 18/30, acc_iter=120566, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:17:38, time_cost(all): 1 day, 8:43:03/22:08:53, loss=0.401247862259412, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=1.516521064224943, lr=0.000353367815821603
2023-11-22 17:08:23   INFO  epoch: 18/30, acc_iter=120616, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:12:46, time_cost(all): 1 day, 8:43:52/21:51:48, loss=0.401164748273103, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=3.4442625116869103, lr=0.000353047081639679
2023-11-22 17:09:12   INFO  epoch: 18/30, acc_iter=120666, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:16:03, time_cost(all): 1 day, 8:44:41/22:10:00, loss=0.401081634286794, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=0.7699402293744173, lr=0.000352726347457754
2023-11-22 17:10:01   INFO  epoch: 18/30, acc_iter=120716, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:14:44, time_cost(all): 1 day, 8:45:30/21:57:29, loss=0.400998520300485, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=2.438295922763932, lr=0.000352405613275829
2023-11-22 17:10:50   INFO  epoch: 18/30, acc_iter=120766, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:11:55, time_cost(all): 1 day, 8:46:19/21:10:47, loss=0.400915406314176, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=1.691244360126011, lr=0.000352084879093905
2023-11-22 17:11:39   INFO  epoch: 18/30, acc_iter=120816, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:10:46, time_cost(all): 1 day, 8:47:08/21:06:04, loss=0.400832292327866, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=0.8657625596038825, lr=0.00035176414491198
2023-11-22 17:12:28   INFO  epoch: 18/30, acc_iter=120866, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:11:14, time_cost(all): 1 day, 8:47:57/22:09:16, loss=0.400749178341557, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=3.935238236748191, lr=0.000351443410730055
2023-11-22 17:13:17   INFO  epoch: 18/30, acc_iter=120916, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:09:53, time_cost(all): 1 day, 8:48:46/21:55:16, loss=0.400666064355248, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=4.572036609831358, lr=0.00035112267654813
2023-11-22 17:14:06   INFO  epoch: 18/30, acc_iter=120966, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:07:59, time_cost(all): 1 day, 8:49:35/20:18:11, loss=0.400582950368939, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=4.112459955548021, lr=0.000350801942366206
2023-11-22 17:14:56   INFO  epoch: 18/30, acc_iter=121016, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:08:09, time_cost(all): 1 day, 8:50:25/20:58:52, loss=0.40049983638263, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=4.084689574476082, lr=0.000350481208184281
2023-11-22 17:15:45   INFO  epoch: 18/30, acc_iter=121066, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:08:40, time_cost(all): 1 day, 8:51:14/20:23:42, loss=0.400416722396321, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.643328317735068, lr=0.000350160474002356
2023-11-22 17:16:34   INFO  epoch: 18/30, acc_iter=121116, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:08:45, time_cost(all): 1 day, 8:52:03/20:42:17, loss=0.400333608410012, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=4.3208466882085785, lr=0.000349839739820432
2023-11-22 17:17:23   INFO  epoch: 18/30, acc_iter=121166, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:06:25, time_cost(all): 1 day, 8:52:52/21:21:16, loss=0.400250494423703, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=4.692030326832951, lr=0.000349519005638507
2023-11-22 17:18:12   INFO  epoch: 18/30, acc_iter=121216, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:03:01, time_cost(all): 1 day, 8:53:41/20:46:42, loss=0.400167380437394, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=1.2349936479518187, lr=0.000349198271456582
2023-11-22 17:19:01   INFO  epoch: 18/30, acc_iter=121266, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:01:38, time_cost(all): 1 day, 8:54:30/21:28:13, loss=0.400084266451085, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=0.8783767393219777, lr=0.000348877537274658
2023-11-22 17:19:50   INFO  epoch: 18/30, acc_iter=121316, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:09, time_cost(all): 1 day, 8:55:19/22:02:53, loss=0.400001152464776, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=1.9098114199894578, lr=0.000348556803092733
2023-11-22 17:20:39   INFO  epoch: 18/30, acc_iter=121366, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:01:32, time_cost(all): 1 day, 8:56:08/21:09:04, loss=0.399918038478467, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=1.3135193123227902, lr=0.000348236068910808
2023-11-22 17:21:29   INFO  epoch: 18/30, acc_iter=121416, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:01:39, time_cost(all): 1 day, 8:56:58/21:15:39, loss=0.399834924492158, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.9(1.03), norm=3.461742015174265, lr=0.000347915334728883
2023-11-22 17:22:18   INFO  epoch: 18/30, acc_iter=121466, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:58:16, time_cost(all): 1 day, 8:57:47/21:04:58, loss=0.399751810505849, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.114202971632183, lr=0.000347594600546959
2023-11-22 17:23:07   INFO  epoch: 18/30, acc_iter=121516, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:01:19, time_cost(all): 1 day, 8:58:36/21:29:42, loss=0.39966869651954, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=3.853097295149346, lr=0.000347273866365034
2023-11-22 17:23:56   INFO  epoch: 18/30, acc_iter=121566, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/1:01:20, time_cost(all): 1 day, 8:59:25/20:46:28, loss=0.399585582533231, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=1.092297429815881, lr=0.000346953132183109
2023-11-22 17:24:45   INFO  epoch: 18/30, acc_iter=121616, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:57:38, time_cost(all): 1 day, 9:00:14/20:34:22, loss=0.399502468546921, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=1.0619017929418775, lr=0.000346632398001185
2023-11-22 17:25:34   INFO  epoch: 18/30, acc_iter=121666, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:09, time_cost(all): 1 day, 9:01:03/21:15:33, loss=0.399419354560612, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=4.672784246706331, lr=0.00034631166381926
2023-11-22 17:26:23   INFO  epoch: 18/30, acc_iter=121716, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:57:34, time_cost(all): 1 day, 9:01:52/21:37:31, loss=0.399336240574303, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=4.582530153412474, lr=0.000345990929637335
2023-11-22 17:27:12   INFO  epoch: 18/30, acc_iter=121766, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:55:52, time_cost(all): 1 day, 9:02:41/21:48:31, loss=0.399253126587994, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=1.9183344787679548, lr=0.00034567019545541
2023-11-22 17:28:01   INFO  epoch: 18/30, acc_iter=121816, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:52:12, time_cost(all): 1 day, 9:03:30/20:34:18, loss=0.399170012601685, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=1.218865412340358, lr=0.000345349461273486
2023-11-22 17:28:51   INFO  epoch: 18/30, acc_iter=121866, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:55:27, time_cost(all): 1 day, 9:04:20/20:46:39, loss=0.399086898615376, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=1.0202626332923401, lr=0.000345028727091561
2023-11-22 17:29:40   INFO  epoch: 18/30, acc_iter=121916, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:50:36, time_cost(all): 1 day, 9:05:09/21:36:28, loss=0.399003784629067, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=2.80351486298099, lr=0.000344707992909636
2023-11-22 17:30:29   INFO  epoch: 18/30, acc_iter=121966, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:54:27, time_cost(all): 1 day, 9:05:58/21:35:52, loss=0.398920670642758, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=1.2453050635024778, lr=0.000344387258727712
2023-11-22 17:31:18   INFO  epoch: 18/30, acc_iter=122016, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:23, time_cost(all): 1 day, 9:06:47/21:03:26, loss=0.398837556656449, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=1.4451734269509409, lr=0.000344066524545787
2023-11-22 17:32:07   INFO  epoch: 18/30, acc_iter=122066, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:09, time_cost(all): 1 day, 9:07:36/20:51:38, loss=0.39875444267014, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=3.1781075594266577, lr=0.000343745790363862
2023-11-22 17:32:56   INFO  epoch: 18/30, acc_iter=122116, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:51:52, time_cost(all): 1 day, 9:08:25/19:49:57, loss=0.398671328683831, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=4.885529363539995, lr=0.000343425056181938
2023-11-22 17:33:45   INFO  epoch: 18/30, acc_iter=122166, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:49:19, time_cost(all): 1 day, 9:09:14/20:26:15, loss=0.398588214697522, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=3.441074511807266, lr=0.000343104322000013
2023-11-22 17:34:34   INFO  epoch: 18/30, acc_iter=122216, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:46:03, time_cost(all): 1 day, 9:10:03/19:45:32, loss=0.398505100711213, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=4.288252761753711, lr=0.000342783587818088
2023-11-22 17:35:24   INFO  epoch: 18/30, acc_iter=122266, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:49:01, time_cost(all): 1 day, 9:10:53/20:44:46, loss=0.398421986724904, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=2.1884431120796113, lr=0.000342462853636163
2023-11-22 17:36:13   INFO  epoch: 18/30, acc_iter=122316, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:45:32, time_cost(all): 1 day, 9:11:42/19:46:05, loss=0.398338872738595, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=2.993882334080891, lr=0.000342142119454239
2023-11-22 17:37:02   INFO  epoch: 18/30, acc_iter=122366, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:47:33, time_cost(all): 1 day, 9:12:31/20:35:43, loss=0.398255758752285, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.02(1.03), norm=2.518566113523225, lr=0.000341821385272314
2023-11-22 17:37:51   INFO  epoch: 18/30, acc_iter=122416, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:44:00, time_cost(all): 1 day, 9:13:20/20:07:42, loss=0.398172644765976, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=1.1222550950218197, lr=0.000341500651090389
2023-11-22 17:38:40   INFO  epoch: 18/30, acc_iter=122466, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:11, time_cost(all): 1 day, 9:14:09/19:50:50, loss=0.398089530779667, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=3.811856265294697, lr=0.000341179916908465
2023-11-22 17:39:29   INFO  epoch: 18/30, acc_iter=122516, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:44:29, time_cost(all): 1 day, 9:14:58/21:37:44, loss=0.398006416793358, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=4.120864625373022, lr=0.00034085918272654
2023-11-22 17:40:18   INFO  epoch: 18/30, acc_iter=122566, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:42:23, time_cost(all): 1 day, 9:15:47/20:00:53, loss=0.397923302807049, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=1.5982689864365671, lr=0.000340538448544615
2023-11-22 17:41:07   INFO  epoch: 18/30, acc_iter=122616, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:43:03, time_cost(all): 1 day, 9:16:36/21:21:06, loss=0.39784018882074, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=4.843068980215528, lr=0.000340217714362691
2023-11-22 17:41:56   INFO  epoch: 18/30, acc_iter=122666, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:41:20, time_cost(all): 1 day, 9:17:25/19:53:40, loss=0.397757074834431, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=1.7485514323182756, lr=0.000339896980180766
2023-11-22 17:42:46   INFO  epoch: 18/30, acc_iter=122716, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:35, time_cost(all): 1 day, 9:18:15/19:43:13, loss=0.397673960848122, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=0.6403484182404608, lr=0.000339576245998841
2023-11-22 17:43:35   INFO  epoch: 18/30, acc_iter=122766, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:37:14, time_cost(all): 1 day, 9:19:04/20:23:33, loss=0.397590846861813, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=2.518409537576206, lr=0.000339255511816916
2023-11-22 17:44:24   INFO  epoch: 18/30, acc_iter=122816, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:02, time_cost(all): 1 day, 9:19:53/20:44:44, loss=0.397507732875504, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=0.8821085574105704, lr=0.000338934777634992
2023-11-22 17:45:13   INFO  epoch: 18/30, acc_iter=122866, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:35:40, time_cost(all): 1 day, 9:20:42/20:55:22, loss=0.397424618889195, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=0.9950358217660958, lr=0.000338614043453067
2023-11-22 17:46:02   INFO  epoch: 18/30, acc_iter=122916, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:38, time_cost(all): 1 day, 9:21:31/19:45:29, loss=0.397341504902886, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.9019278555402064, lr=0.000338293309271142
2023-11-22 17:46:51   INFO  epoch: 18/30, acc_iter=122966, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:36:31, time_cost(all): 1 day, 9:22:20/19:39:06, loss=0.397258390916577, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=4.356060124358052, lr=0.000337972575089218
2023-11-22 17:47:40   INFO  epoch: 18/30, acc_iter=123016, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:28, time_cost(all): 1 day, 9:23:09/19:34:06, loss=0.397175276930268, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=1.9644312029598332, lr=0.000337651840907293
2023-11-22 17:48:29   INFO  epoch: 18/30, acc_iter=123066, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:05, time_cost(all): 1 day, 9:23:58/19:48:51, loss=0.397092162943959, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=0.5043510215271114, lr=0.000337331106725368
2023-11-22 17:49:19   INFO  epoch: 18/30, acc_iter=123116, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:32:55, time_cost(all): 1 day, 9:24:48/20:57:28, loss=0.39700904895765, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=1.2088795847975384, lr=0.000337010372543443
2023-11-22 17:50:08   INFO  epoch: 18/30, acc_iter=123166, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:54, time_cost(all): 1 day, 9:25:37/19:34:30, loss=0.396925934971341, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=2.2720197690675485, lr=0.000336689638361519
2023-11-22 17:50:57   INFO  epoch: 18/30, acc_iter=123216, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:09, time_cost(all): 1 day, 9:26:26/19:44:13, loss=0.396842820985031, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=2.140424116434412, lr=0.000336368904179594
2023-11-22 17:51:46   INFO  epoch: 18/30, acc_iter=123266, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:30:16, time_cost(all): 1 day, 9:27:15/19:32:09, loss=0.396759706998722, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=2.625789066708265, lr=0.000336048169997669
2023-11-22 17:52:35   INFO  epoch: 18/30, acc_iter=123316, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:30:09, time_cost(all): 1 day, 9:28:04/21:10:51, loss=0.396676593012413, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=4.77092595695805, lr=0.000335727435815745
2023-11-22 17:53:24   INFO  epoch: 18/30, acc_iter=123366, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:53, time_cost(all): 1 day, 9:28:53/19:57:20, loss=0.396593479026104, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=4.9989771984673625, lr=0.00033540670163382
2023-11-22 17:54:13   INFO  epoch: 18/30, acc_iter=123416, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:12, time_cost(all): 1 day, 9:29:42/20:12:32, loss=0.396510365039795, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=4.1372527695021954, lr=0.000335085967451895
2023-11-22 17:55:02   INFO  epoch: 18/30, acc_iter=123466, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:26:32, time_cost(all): 1 day, 9:30:31/20:36:44, loss=0.396427251053486, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=2.0554438396479133, lr=0.000334765233269971
2023-11-22 17:55:51   INFO  epoch: 18/30, acc_iter=123516, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:30, time_cost(all): 1 day, 9:31:20/19:43:50, loss=0.396344137067177, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=2.104710354284567, lr=0.000334444499088046
2023-11-22 17:56:41   INFO  epoch: 18/30, acc_iter=123566, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:54, time_cost(all): 1 day, 9:32:10/20:06:26, loss=0.396261023080868, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=0.8304824686041069, lr=0.000334123764906121
2023-11-22 17:57:30   INFO  epoch: 18/30, acc_iter=123616, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:10, time_cost(all): 1 day, 9:32:59/21:14:32, loss=0.396177909094559, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=1.5534792740783498, lr=0.000333803030724196
2023-11-22 17:58:19   INFO  epoch: 18/30, acc_iter=123666, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:11, time_cost(all): 1 day, 9:33:48/21:16:58, loss=0.39609479510825, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=3.806418811212045, lr=0.000333482296542272
2023-11-22 17:59:08   INFO  epoch: 18/30, acc_iter=123716, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:03, time_cost(all): 1 day, 9:34:37/20:17:06, loss=0.396011681121941, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=2.3293443518910673, lr=0.000333161562360347
2023-11-22 17:59:57   INFO  epoch: 18/30, acc_iter=123766, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:23, time_cost(all): 1 day, 9:35:26/19:36:09, loss=0.395928567135632, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=2.950541087044621, lr=0.000332840828178422
2023-11-22 18:00:46   INFO  epoch: 18/30, acc_iter=123816, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:20:54, time_cost(all): 1 day, 9:36:15/19:54:17, loss=0.395845453149323, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=2.9682064195200533, lr=0.000332520093996498
2023-11-22 18:01:35   INFO  epoch: 18/30, acc_iter=123866, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:38, time_cost(all): 1 day, 9:37:04/20:10:28, loss=0.395762339163014, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=1.2284936642352027, lr=0.000332199359814573
2023-11-22 18:02:24   INFO  epoch: 18/30, acc_iter=123916, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:24, time_cost(all): 1 day, 9:37:53/20:02:41, loss=0.395679225176705, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=4.823624265571058, lr=0.000331878625632648
2023-11-22 18:03:14   INFO  epoch: 18/30, acc_iter=123966, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:44, time_cost(all): 1 day, 9:38:43/21:06:25, loss=0.395596111190396, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=2.6643211219157203, lr=0.000331557891450724
2023-11-22 18:04:03   INFO  epoch: 18/30, acc_iter=124016, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:26, time_cost(all): 1 day, 9:39:32/20:57:47, loss=0.395512997204086, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=3.0518093859375437, lr=0.000331237157268799
2023-11-22 18:04:52   INFO  epoch: 18/30, acc_iter=124066, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:25, time_cost(all): 1 day, 9:40:21/21:12:14, loss=0.395429883217777, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=2.706276160381786, lr=0.000330916423086874
2023-11-22 18:05:41   INFO  epoch: 18/30, acc_iter=124116, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:08, time_cost(all): 1 day, 9:41:10/20:26:48, loss=0.395346769231468, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=3.3027318906373258, lr=0.000330595688904949
2023-11-22 18:06:30   INFO  epoch: 18/30, acc_iter=124166, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:41, time_cost(all): 1 day, 9:41:59/20:37:02, loss=0.395263655245159, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=2.5230238587280276, lr=0.000330274954723025
2023-11-22 18:07:19   INFO  epoch: 18/30, acc_iter=124216, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:30, time_cost(all): 1 day, 9:42:48/19:47:52, loss=0.39518054125885, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=2.1557702593207577, lr=0.0003299542205411
2023-11-22 18:08:08   INFO  epoch: 18/30, acc_iter=124266, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:46, time_cost(all): 1 day, 9:43:37/19:33:28, loss=0.395097427272541, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=4.3614652356935855, lr=0.000329633486359175
2023-11-22 18:08:57   INFO  epoch: 18/30, acc_iter=124316, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:07, time_cost(all): 1 day, 9:44:26/19:19:31, loss=0.395014313286232, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=0.9677130143078454, lr=0.000329312752177251
2023-11-22 18:09:46   INFO  epoch: 18/30, acc_iter=124366, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:56, time_cost(all): 1 day, 9:45:15/19:17:47, loss=0.394931199299923, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=3.699770605311895, lr=0.000328992017995326
2023-11-22 18:10:36   INFO  epoch: 18/30, acc_iter=124416, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:38, time_cost(all): 1 day, 9:46:05/20:39:52, loss=0.394848085313614, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=4.271386097605479, lr=0.000328671283813401
2023-11-22 18:11:25   INFO  epoch: 18/30, acc_iter=124466, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:31, time_cost(all): 1 day, 9:46:54/19:25:53, loss=0.394764971327305, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=4.245367346978013, lr=0.000328350549631477
2023-11-22 18:12:14   INFO  epoch: 18/30, acc_iter=124516, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:53, time_cost(all): 1 day, 9:47:43/19:27:00, loss=0.394681857340996, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=3.283694855806265, lr=0.000328029815449552
2023-11-22 18:13:03   INFO  epoch: 18/30, acc_iter=124566, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:13, time_cost(all): 1 day, 9:48:32/20:36:29, loss=0.394598743354687, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=4.381828001448923, lr=0.000327709081267627
2023-11-22 18:13:52   INFO  epoch: 18/30, acc_iter=124616, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:09:10, time_cost(all): 1 day, 9:49:21/20:34:26, loss=0.394515629368378, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=2.1477542186983403, lr=0.000327388347085702
2023-11-22 18:14:41   INFO  epoch: 18/30, acc_iter=124666, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:36, time_cost(all): 1 day, 9:50:10/20:13:16, loss=0.394432515382069, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=2.9243342180278455, lr=0.000327067612903778
2023-11-22 18:15:30   INFO  epoch: 18/30, acc_iter=124716, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:16, time_cost(all): 1 day, 9:50:59/19:48:24, loss=0.39434940139576, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=0.6300302780415068, lr=0.000326746878721853
2023-11-22 18:16:19   INFO  epoch: 18/30, acc_iter=124766, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:26, time_cost(all): 1 day, 9:51:48/20:18:53, loss=0.39426628740945, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.975829603738947, lr=0.000326426144539928
2023-11-22 18:17:09   INFO  epoch: 18/30, acc_iter=124816, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:27, time_cost(all): 1 day, 9:52:38/19:17:41, loss=0.394183173423141, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=3.8823551784459465, lr=0.000326105410358004
2023-11-22 18:17:58   INFO  epoch: 18/30, acc_iter=124866, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:29, time_cost(all): 1 day, 9:53:27/20:51:13, loss=0.394100059436832, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=3.701940753892838, lr=0.000325784676176079
2023-11-22 18:18:47   INFO  epoch: 18/30, acc_iter=124916, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:04:04, time_cost(all): 1 day, 9:54:16/20:37:53, loss=0.394016945450523, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=4.742303098849166, lr=0.000325463941994154
2023-11-22 18:19:36   INFO  epoch: 18/30, acc_iter=124966, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:08, time_cost(all): 1 day, 9:55:05/19:03:57, loss=0.393933831464214, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=1.0088448868639102, lr=0.000325143207812229
2023-11-22 18:20:25   INFO  epoch: 18/30, acc_iter=125016, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:08, time_cost(all): 1 day, 9:55:54/20:35:27, loss=0.393850717477905, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=2.9034109723605526, lr=0.000324822473630305
2023-11-22 18:21:14   INFO  epoch: 18/30, acc_iter=125066, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:27, time_cost(all): 1 day, 9:56:43/20:57:35, loss=0.393767603491596, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=4.595551924336827, lr=0.00032450173944838
2023-11-22 18:22:03   INFO  epoch: 18/30, acc_iter=125116, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:36, time_cost(all): 1 day, 9:57:32/19:03:42, loss=0.393684489505287, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=2.6810585238675033, lr=0.000324181005266455
2023-11-22 18:22:52   INFO  epoch: 19/30, acc_iter=125203, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:50:56, time_cost(all): 1 day, 9:58:21/20:17:46, loss=0.393539871169109, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=1.1678490384488895, lr=0.000323622927789906
2023-11-22 18:23:41   INFO  epoch: 19/30, acc_iter=125253, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:47:31, time_cost(all): 1 day, 9:59:10/20:40:17, loss=0.3934567571828, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=2.182254752970425, lr=0.000323302193607982
2023-11-22 18:24:31   INFO  epoch: 19/30, acc_iter=125303, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:41:24, time_cost(all): 1 day, 10:00:00/19:27:32, loss=0.393373643196491, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=4.727915058460727, lr=0.000322981459426057
2023-11-22 18:25:20   INFO  epoch: 19/30, acc_iter=125353, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:42:15, time_cost(all): 1 day, 10:00:49/19:47:11, loss=0.393290529210182, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=2.2595242723194495, lr=0.000322660725244132
2023-11-22 18:26:09   INFO  epoch: 19/30, acc_iter=125403, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:46:15, time_cost(all): 1 day, 10:01:38/20:50:42, loss=0.393207415223873, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=0.642851513662375, lr=0.000322339991062207
2023-11-22 18:26:58   INFO  epoch: 19/30, acc_iter=125453, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:41:16, time_cost(all): 1 day, 10:02:27/19:38:11, loss=0.393124301237564, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=2.7241937443473776, lr=0.000322019256880283
2023-11-22 18:27:47   INFO  epoch: 19/30, acc_iter=125503, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:44:27, time_cost(all): 1 day, 10:03:16/19:06:21, loss=0.393041187251255, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=1.1437081936388773, lr=0.000321698522698358
2023-11-22 18:28:36   INFO  epoch: 19/30, acc_iter=125553, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:36:26, time_cost(all): 1 day, 10:04:05/19:04:10, loss=0.392958073264946, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=3.8398859935934713, lr=0.000321377788516433
2023-11-22 18:29:25   INFO  epoch: 19/30, acc_iter=125603, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:40:29, time_cost(all): 1 day, 10:04:54/18:56:54, loss=0.392874959278637, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=2.2332140711616755, lr=0.000321057054334509
2023-11-22 18:30:14   INFO  epoch: 19/30, acc_iter=125653, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:39:02, time_cost(all): 1 day, 10:05:43/19:35:11, loss=0.392791845292328, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=1.6056063431924106, lr=0.000320736320152584
2023-11-22 18:31:04   INFO  epoch: 19/30, acc_iter=125703, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:39:47, time_cost(all): 1 day, 10:06:33/19:54:25, loss=0.392708731306019, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=1.894807071175634, lr=0.000320415585970659
2023-11-22 18:31:53   INFO  epoch: 19/30, acc_iter=125753, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:34:15, time_cost(all): 1 day, 10:07:22/20:03:03, loss=0.39262561731971, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=4.915671116095163, lr=0.000320094851788735
2023-11-22 18:32:42   INFO  epoch: 19/30, acc_iter=125803, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:41:41, time_cost(all): 1 day, 10:08:11/19:39:08, loss=0.392542503333401, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=3.7165151081062726, lr=0.00031977411760681
2023-11-22 18:33:31   INFO  epoch: 19/30, acc_iter=125853, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:34:31, time_cost(all): 1 day, 10:09:00/19:34:20, loss=0.392459389347091, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=4.896516641254386, lr=0.000319453383424885
2023-11-22 18:34:20   INFO  epoch: 19/30, acc_iter=125903, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:35:05, time_cost(all): 1 day, 10:09:49/19:53:17, loss=0.392376275360782, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=1.6752698892413465, lr=0.00031913264924296
2023-11-22 18:35:09   INFO  epoch: 19/30, acc_iter=125953, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:38:44, time_cost(all): 1 day, 10:10:38/19:22:08, loss=0.392293161374473, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=1.2724477333119442, lr=0.000318811915061036
2023-11-22 18:35:58   INFO  epoch: 19/30, acc_iter=126003, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:35:46, time_cost(all): 1 day, 10:11:27/19:38:36, loss=0.392210047388164, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=1.6950908481619233, lr=0.000318491180879111
2023-11-22 18:36:47   INFO  epoch: 19/30, acc_iter=126053, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:36:30, time_cost(all): 1 day, 10:12:16/20:06:04, loss=0.392126933401855, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=3.7513391175485884, lr=0.000318170446697186
2023-11-22 18:37:36   INFO  epoch: 19/30, acc_iter=126103, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:35:34, time_cost(all): 1 day, 10:13:05/20:09:59, loss=0.392043819415546, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=4.371613530436559, lr=0.000317849712515262
2023-11-22 18:38:26   INFO  epoch: 19/30, acc_iter=126153, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:30:00, time_cost(all): 1 day, 10:13:55/19:32:29, loss=0.391960705429237, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=1.6261335446214429, lr=0.000317528978333337
2023-11-22 18:39:15   INFO  epoch: 19/30, acc_iter=126203, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:35:06, time_cost(all): 1 day, 10:14:44/19:09:42, loss=0.391877591442928, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=4.552221810327761, lr=0.000317208244151412
2023-11-22 18:40:04   INFO  epoch: 19/30, acc_iter=126253, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:26:01, time_cost(all): 1 day, 10:15:33/20:38:28, loss=0.391794477456619, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.565980825794399, lr=0.000316887509969488
2023-11-22 18:40:53   INFO  epoch: 19/30, acc_iter=126303, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:33:23, time_cost(all): 1 day, 10:16:22/19:15:39, loss=0.39171136347031, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=2.8687526544419315, lr=0.000316566775787563
2023-11-22 18:41:42   INFO  epoch: 19/30, acc_iter=126353, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:27:41, time_cost(all): 1 day, 10:17:11/19:07:23, loss=0.391628249484001, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=0.9332101273064932, lr=0.000316246041605638
2023-11-22 18:42:31   INFO  epoch: 19/30, acc_iter=126403, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:28:10, time_cost(all): 1 day, 10:18:00/19:49:33, loss=0.391545135497692, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=0.5150202373755507, lr=0.000315925307423713
2023-11-22 18:43:20   INFO  epoch: 19/30, acc_iter=126453, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:26:38, time_cost(all): 1 day, 10:18:49/20:10:20, loss=0.391462021511383, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=4.5638397250485045, lr=0.000315604573241789
2023-11-22 18:44:09   INFO  epoch: 19/30, acc_iter=126503, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:27:11, time_cost(all): 1 day, 10:19:38/19:56:09, loss=0.391378907525074, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=2.064376114377697, lr=0.000315283839059864
2023-11-22 18:44:58   INFO  epoch: 19/30, acc_iter=126553, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:22:16, time_cost(all): 1 day, 10:20:27/20:01:38, loss=0.391295793538765, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=3.7458765732302908, lr=0.000314963104877939
2023-11-22 18:45:48   INFO  epoch: 19/30, acc_iter=126603, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:23:44, time_cost(all): 1 day, 10:21:17/18:54:15, loss=0.391212679552456, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.2800619378275413, lr=0.000314642370696015
2023-11-22 18:46:37   INFO  epoch: 19/30, acc_iter=126653, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:21:07, time_cost(all): 1 day, 10:22:06/19:20:23, loss=0.391129565566146, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=4.496200839231282, lr=0.00031432163651409
2023-11-22 18:47:26   INFO  epoch: 19/30, acc_iter=126703, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:23:33, time_cost(all): 1 day, 10:22:55/18:51:31, loss=0.391046451579837, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=4.893071213841085, lr=0.000314000902332165
2023-11-22 18:48:15   INFO  epoch: 19/30, acc_iter=126753, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:17:43, time_cost(all): 1 day, 10:23:44/19:27:11, loss=0.390963337593528, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=4.031808456779931, lr=0.000313680168150241
2023-11-22 18:49:04   INFO  epoch: 19/30, acc_iter=126803, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:17:37, time_cost(all): 1 day, 10:24:33/19:47:39, loss=0.390880223607219, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=4.951621575442382, lr=0.000313359433968316
2023-11-22 18:49:53   INFO  epoch: 19/30, acc_iter=126853, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:19:53, time_cost(all): 1 day, 10:25:22/19:39:06, loss=0.39079710962091, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=0.8393328992315007, lr=0.000313038699786391
2023-11-22 18:50:42   INFO  epoch: 19/30, acc_iter=126903, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:22:53, time_cost(all): 1 day, 10:26:11/20:26:50, loss=0.390713995634601, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=0.8013925809772005, lr=0.000312717965604466
2023-11-22 18:51:31   INFO  epoch: 19/30, acc_iter=126953, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:21:30, time_cost(all): 1 day, 10:27:00/19:59:12, loss=0.390630881648292, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=4.842142679404043, lr=0.000312397231422542
2023-11-22 18:52:21   INFO  epoch: 19/30, acc_iter=127003, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:20:40, time_cost(all): 1 day, 10:27:50/19:38:44, loss=0.390547767661983, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=3.2045386184381597, lr=0.000312076497240617
2023-11-22 18:53:10   INFO  epoch: 19/30, acc_iter=127053, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:14:00, time_cost(all): 1 day, 10:28:39/19:43:32, loss=0.390464653675674, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=2.9085805750413725, lr=0.000311755763058692
2023-11-22 18:53:59   INFO  epoch: 19/30, acc_iter=127103, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:12:15, time_cost(all): 1 day, 10:29:28/19:32:50, loss=0.390381539689365, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=0.9368098846403585, lr=0.000311435028876768
2023-11-22 18:54:48   INFO  epoch: 19/30, acc_iter=127153, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:13:29, time_cost(all): 1 day, 10:30:17/18:37:47, loss=0.390298425703056, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.5208329247111454, lr=0.000311114294694843
2023-11-22 18:55:37   INFO  epoch: 19/30, acc_iter=127203, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:15:43, time_cost(all): 1 day, 10:31:06/18:41:44, loss=0.390215311716747, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=0.7363687307307656, lr=0.000310793560512918
2023-11-22 18:56:26   INFO  epoch: 19/30, acc_iter=127253, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:11:53, time_cost(all): 1 day, 10:31:55/19:59:42, loss=0.390132197730438, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=4.214336032967287, lr=0.000310472826330993
2023-11-22 18:57:15   INFO  epoch: 19/30, acc_iter=127303, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:14:44, time_cost(all): 1 day, 10:32:44/19:02:30, loss=0.390049083744129, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=3.9694645752488915, lr=0.000310152092149069
2023-11-22 18:58:04   INFO  epoch: 19/30, acc_iter=127353, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:12:20, time_cost(all): 1 day, 10:33:33/19:16:03, loss=0.38996596975782, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=4.117952518841636, lr=0.000309831357967144
2023-11-22 18:58:53   INFO  epoch: 19/30, acc_iter=127403, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:13:33, time_cost(all): 1 day, 10:34:22/19:28:04, loss=0.38988285577151, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.462140050744166, lr=0.000309510623785219
2023-11-22 18:59:43   INFO  epoch: 19/30, acc_iter=127453, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:11:04, time_cost(all): 1 day, 10:35:12/19:01:08, loss=0.389799741785201, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=2.7405775403194674, lr=0.000309189889603295
2023-11-22 19:00:32   INFO  epoch: 19/30, acc_iter=127503, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:12:48, time_cost(all): 1 day, 10:36:01/18:28:12, loss=0.389716627798892, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=1.7862351280599733, lr=0.00030886915542137
2023-11-22 19:01:21   INFO  epoch: 19/30, acc_iter=127553, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:08:00, time_cost(all): 1 day, 10:36:50/19:48:25, loss=0.389633513812583, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=4.5396105293519105, lr=0.000308548421239445
2023-11-22 19:02:10   INFO  epoch: 19/30, acc_iter=127603, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:09:05, time_cost(all): 1 day, 10:37:39/20:02:53, loss=0.389550399826274, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.23(1.03), norm=2.4372776930174265, lr=0.000308227687057521
2023-11-22 19:02:59   INFO  epoch: 19/30, acc_iter=127653, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:07:10, time_cost(all): 1 day, 10:38:28/18:48:25, loss=0.389467285839965, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=1.5338151828437068, lr=0.000307906952875596
2023-11-22 19:03:48   INFO  epoch: 19/30, acc_iter=127703, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:06:01, time_cost(all): 1 day, 10:39:17/19:01:47, loss=0.389384171853656, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=1.5986239460956189, lr=0.000307586218693671
2023-11-22 19:04:37   INFO  epoch: 19/30, acc_iter=127753, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:02:55, time_cost(all): 1 day, 10:40:06/18:45:49, loss=0.389301057867347, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=1.670042016198459, lr=0.000307265484511746
2023-11-22 19:05:26   INFO  epoch: 19/30, acc_iter=127803, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:04:12, time_cost(all): 1 day, 10:40:55/19:17:29, loss=0.389217943881038, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=0.538219878995688, lr=0.000306944750329822
2023-11-22 19:06:16   INFO  epoch: 19/30, acc_iter=127853, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:04:13, time_cost(all): 1 day, 10:41:45/19:38:27, loss=0.389134829894729, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=0.6157171212527481, lr=0.000306624016147897
2023-11-22 19:07:05   INFO  epoch: 19/30, acc_iter=127903, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:00:54, time_cost(all): 1 day, 10:42:34/18:59:58, loss=0.38905171590842, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.83(1.03), norm=1.0183601157778435, lr=0.000306303281965972
2023-11-22 19:07:54   INFO  epoch: 19/30, acc_iter=127953, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:03:19, time_cost(all): 1 day, 10:43:23/19:42:33, loss=0.388968601922111, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=2.1091297717094184, lr=0.000305982547784048
2023-11-22 19:08:43   INFO  epoch: 19/30, acc_iter=128003, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/0:58:49, time_cost(all): 1 day, 10:44:12/19:21:29, loss=0.388885487935802, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=2.8530753945760328, lr=0.000305661813602123
2023-11-22 19:09:32   INFO  epoch: 19/30, acc_iter=128053, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:59:44, time_cost(all): 1 day, 10:45:01/18:52:02, loss=0.388802373949493, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=2.8983009076692747, lr=0.000305341079420198
2023-11-22 19:10:21   INFO  epoch: 19/30, acc_iter=128103, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:02:23, time_cost(all): 1 day, 10:45:50/19:28:41, loss=0.388719259963184, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=2.7511070706983376, lr=0.000305020345238274
2023-11-22 19:11:10   INFO  epoch: 19/30, acc_iter=128153, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:56:02, time_cost(all): 1 day, 10:46:39/19:18:35, loss=0.388636145976875, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=4.390402305808111, lr=0.000304699611056349
2023-11-22 19:11:59   INFO  epoch: 19/30, acc_iter=128203, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:55:36, time_cost(all): 1 day, 10:47:28/18:33:44, loss=0.388553031990565, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=4.407865663654923, lr=0.000304378876874424
2023-11-22 19:12:48   INFO  epoch: 19/30, acc_iter=128253, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:55:19, time_cost(all): 1 day, 10:48:17/18:59:03, loss=0.388469918004256, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=2.471609434485825, lr=0.000304058142692499
2023-11-22 19:13:38   INFO  epoch: 19/30, acc_iter=128303, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:51, time_cost(all): 1 day, 10:49:07/19:53:51, loss=0.388386804017947, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=2.357098814025635, lr=0.000303737408510575
2023-11-22 19:14:27   INFO  epoch: 19/30, acc_iter=128353, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:57:58, time_cost(all): 1 day, 10:49:56/19:13:54, loss=0.388303690031638, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=2.602680348344687, lr=0.00030341667432865
2023-11-22 19:15:16   INFO  epoch: 19/30, acc_iter=128403, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:54:20, time_cost(all): 1 day, 10:50:45/19:18:25, loss=0.388220576045329, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=4.607302281884156, lr=0.000303095940146725
2023-11-22 19:16:05   INFO  epoch: 19/30, acc_iter=128453, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:01, time_cost(all): 1 day, 10:51:34/19:50:16, loss=0.38813746205902, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=4.3543172890156665, lr=0.000302775205964801
2023-11-22 19:16:54   INFO  epoch: 19/30, acc_iter=128503, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:51:32, time_cost(all): 1 day, 10:52:23/19:41:58, loss=0.388054348072711, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=0.7268797642965957, lr=0.000302454471782876
2023-11-22 19:17:43   INFO  epoch: 19/30, acc_iter=128553, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:51:32, time_cost(all): 1 day, 10:53:12/19:38:08, loss=0.387971234086402, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=3.071834220373025, lr=0.000302133737600951
2023-11-22 19:18:32   INFO  epoch: 19/30, acc_iter=128603, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:33, time_cost(all): 1 day, 10:54:01/19:55:45, loss=0.387888120100093, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=0.7257556850245248, lr=0.000301813003419027
2023-11-22 19:19:21   INFO  epoch: 19/30, acc_iter=128653, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:13, time_cost(all): 1 day, 10:54:50/19:56:57, loss=0.387805006113784, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=4.165256734130352, lr=0.000301492269237102
2023-11-22 19:20:11   INFO  epoch: 19/30, acc_iter=128703, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:47:26, time_cost(all): 1 day, 10:55:40/18:57:20, loss=0.387721892127475, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=2.596387174183524, lr=0.000301171535055177
2023-11-22 19:21:00   INFO  epoch: 19/30, acc_iter=128753, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:50:53, time_cost(all): 1 day, 10:56:29/18:48:28, loss=0.387638778141166, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=1.1079716015585894, lr=0.000300850800873252
2023-11-22 19:21:49   INFO  epoch: 19/30, acc_iter=128803, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:18, time_cost(all): 1 day, 10:57:18/18:49:44, loss=0.387555664154857, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.40298409301951, lr=0.000300530066691328
2023-11-22 19:22:38   INFO  epoch: 19/30, acc_iter=128853, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:45:45, time_cost(all): 1 day, 10:58:07/18:52:59, loss=0.387472550168548, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=4.378767291569488, lr=0.000300209332509403
2023-11-22 19:23:27   INFO  epoch: 19/30, acc_iter=128903, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:48:30, time_cost(all): 1 day, 10:58:56/18:52:38, loss=0.387389436182239, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=3.554724458708855, lr=0.000299888598327478
2023-11-22 19:24:16   INFO  epoch: 19/30, acc_iter=128953, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:47:36, time_cost(all): 1 day, 10:59:45/19:21:53, loss=0.38730632219593, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=1.1731966612971187, lr=0.000299567864145554
2023-11-22 19:25:05   INFO  epoch: 19/30, acc_iter=129003, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:45:20, time_cost(all): 1 day, 11:00:34/19:17:38, loss=0.387223208209621, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=2.5480891367239984, lr=0.000299247129963629
2023-11-22 19:25:54   INFO  epoch: 19/30, acc_iter=129053, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:15, time_cost(all): 1 day, 11:01:23/18:54:05, loss=0.387140094223311, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=4.781662507346141, lr=0.000298926395781704
2023-11-22 19:26:43   INFO  epoch: 19/30, acc_iter=129103, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:46, time_cost(all): 1 day, 11:02:12/18:18:31, loss=0.387056980237002, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=3.9769202735463365, lr=0.000298605661599779
2023-11-22 19:27:33   INFO  epoch: 19/30, acc_iter=129153, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:40:49, time_cost(all): 1 day, 11:03:02/18:41:06, loss=0.386973866250693, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=3.7753299009279635, lr=0.000298284927417855
2023-11-22 19:28:22   INFO  epoch: 19/30, acc_iter=129203, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:41:54, time_cost(all): 1 day, 11:03:51/19:11:45, loss=0.386890752264384, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=2.8370361382780733, lr=0.00029796419323593
2023-11-22 19:29:11   INFO  epoch: 19/30, acc_iter=129253, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:42:04, time_cost(all): 1 day, 11:04:40/18:58:45, loss=0.386807638278075, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=2.396394799243596, lr=0.000297643459054005
2023-11-22 19:30:00   INFO  epoch: 19/30, acc_iter=129303, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:41:38, time_cost(all): 1 day, 11:05:29/18:46:45, loss=0.386724524291766, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.243699297916179, lr=0.000297322724872081
2023-11-22 19:30:49   INFO  epoch: 19/30, acc_iter=129353, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:37:34, time_cost(all): 1 day, 11:06:18/19:18:44, loss=0.386641410305457, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=1.7748413740413311, lr=0.000297001990690156
2023-11-22 19:31:38   INFO  epoch: 19/30, acc_iter=129403, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:36:22, time_cost(all): 1 day, 11:07:07/19:23:03, loss=0.386558296319148, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.09(1.03), norm=3.592073716713005, lr=0.000296681256508231
2023-11-22 19:32:27   INFO  epoch: 19/30, acc_iter=129453, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:25, time_cost(all): 1 day, 11:07:56/18:03:52, loss=0.386475182332839, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=4.500630735718803, lr=0.000296360522326307
2023-11-22 19:33:16   INFO  epoch: 19/30, acc_iter=129503, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:58, time_cost(all): 1 day, 11:08:45/18:24:37, loss=0.38639206834653, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=1.9688323615956564, lr=0.000296039788144382
2023-11-22 19:34:06   INFO  epoch: 19/30, acc_iter=129553, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:35:44, time_cost(all): 1 day, 11:09:35/18:25:07, loss=0.386308954360221, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.7690501687514928, lr=0.000295719053962457
2023-11-22 19:34:55   INFO  epoch: 19/30, acc_iter=129603, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:42, time_cost(all): 1 day, 11:10:24/18:12:03, loss=0.386225840373912, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=0.7181053914611075, lr=0.000295398319780532
2023-11-22 19:35:44   INFO  epoch: 19/30, acc_iter=129653, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:32:31, time_cost(all): 1 day, 11:11:13/18:03:04, loss=0.386142726387603, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=2.285324262353431, lr=0.000295077585598608
2023-11-22 19:36:33   INFO  epoch: 19/30, acc_iter=129703, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:31:53, time_cost(all): 1 day, 11:12:02/17:58:00, loss=0.386059612401294, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=4.432610286543759, lr=0.000294756851416683
2023-11-22 19:37:22   INFO  epoch: 19/30, acc_iter=129753, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:54, time_cost(all): 1 day, 11:12:51/19:20:24, loss=0.385976498414985, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.23(1.03), norm=3.9814111423811855, lr=0.000294436117234758
2023-11-22 19:38:11   INFO  epoch: 19/30, acc_iter=129803, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:32:12, time_cost(all): 1 day, 11:13:40/18:39:03, loss=0.385893384428675, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.9447568917507643, lr=0.000294115383052833
2023-11-22 19:39:00   INFO  epoch: 19/30, acc_iter=129853, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:29:44, time_cost(all): 1 day, 11:14:29/19:19:09, loss=0.385810270442366, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=2.910856550303465, lr=0.000293794648870909
2023-11-22 19:39:49   INFO  epoch: 19/30, acc_iter=129903, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:02, time_cost(all): 1 day, 11:15:18/19:22:40, loss=0.385727156456057, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=4.755054752236782, lr=0.000293473914688984
2023-11-22 19:40:38   INFO  epoch: 19/30, acc_iter=129953, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:30:14, time_cost(all): 1 day, 11:16:07/18:47:12, loss=0.385644042469748, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=0.5588730557073587, lr=0.000293153180507059
2023-11-22 19:41:28   INFO  epoch: 19/30, acc_iter=130003, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:27, time_cost(all): 1 day, 11:16:57/18:19:27, loss=0.385560928483439, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=2.9493971693978427, lr=0.000292832446325135
2023-11-22 19:42:17   INFO  epoch: 19/30, acc_iter=130053, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:15, time_cost(all): 1 day, 11:17:46/19:27:40, loss=0.38547781449713, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=4.268639285634258, lr=0.00029251171214321
2023-11-22 19:43:06   INFO  epoch: 19/30, acc_iter=130103, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:25:41, time_cost(all): 1 day, 11:18:35/18:29:07, loss=0.385394700510821, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=0.7390075013367008, lr=0.000292190977961285
2023-11-22 19:43:55   INFO  epoch: 19/30, acc_iter=130153, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:27:07, time_cost(all): 1 day, 11:19:24/19:01:51, loss=0.385311586524512, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=4.847029111111998, lr=0.000291870243779361
2023-11-22 19:44:44   INFO  epoch: 19/30, acc_iter=130203, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:25, time_cost(all): 1 day, 11:20:13/17:49:13, loss=0.385228472538203, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=1.6716304347348787, lr=0.000291549509597436
2023-11-22 19:45:33   INFO  epoch: 19/30, acc_iter=130253, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:25:16, time_cost(all): 1 day, 11:21:02/19:00:48, loss=0.385145358551894, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=1.0555175358336384, lr=0.000291228775415511
2023-11-22 19:46:22   INFO  epoch: 19/30, acc_iter=130303, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:02, time_cost(all): 1 day, 11:21:51/17:45:01, loss=0.385062244565585, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=2.6811513194445933, lr=0.000290908041233586
2023-11-22 19:47:11   INFO  epoch: 19/30, acc_iter=130353, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:21:51, time_cost(all): 1 day, 11:22:40/17:49:36, loss=0.384979130579276, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=1.4311659269036585, lr=0.000290587307051662
2023-11-22 19:48:01   INFO  epoch: 19/30, acc_iter=130403, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:18, time_cost(all): 1 day, 11:23:30/18:03:31, loss=0.384896016592967, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=2.0645224577457486, lr=0.000290266572869737
2023-11-22 19:48:50   INFO  epoch: 19/30, acc_iter=130453, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:01, time_cost(all): 1 day, 11:24:19/17:43:00, loss=0.384812902606658, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=3.238832019359385, lr=0.000289945838687812
2023-11-22 19:49:39   INFO  epoch: 19/30, acc_iter=130503, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:39, time_cost(all): 1 day, 11:25:08/19:16:26, loss=0.384729788620349, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=2.37566878701286, lr=0.000289625104505888
2023-11-22 19:50:28   INFO  epoch: 19/30, acc_iter=130553, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:02, time_cost(all): 1 day, 11:25:57/18:28:42, loss=0.38464667463404, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=4.657654017342951, lr=0.000289304370323963
2023-11-22 19:51:17   INFO  epoch: 19/30, acc_iter=130603, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:13, time_cost(all): 1 day, 11:26:46/18:57:40, loss=0.38456356064773, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=3.1478670081846807, lr=0.000288983636142038
2023-11-22 19:52:06   INFO  epoch: 19/30, acc_iter=130653, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:15, time_cost(all): 1 day, 11:27:35/17:59:22, loss=0.384480446661421, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=3.1120898574488822, lr=0.000288662901960114
2023-11-22 19:52:55   INFO  epoch: 19/30, acc_iter=130703, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:33, time_cost(all): 1 day, 11:28:24/18:22:13, loss=0.384397332675112, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=3.162695386301697, lr=0.000288342167778189
2023-11-22 19:53:44   INFO  epoch: 19/30, acc_iter=130753, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:05, time_cost(all): 1 day, 11:29:13/18:07:45, loss=0.384314218688803, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=3.992642633142708, lr=0.000288021433596264
2023-11-22 19:54:33   INFO  epoch: 19/30, acc_iter=130803, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:18, time_cost(all): 1 day, 11:30:02/18:04:30, loss=0.384231104702494, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=3.2906386152065146, lr=0.000287700699414339
2023-11-22 19:55:23   INFO  epoch: 19/30, acc_iter=130853, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:47, time_cost(all): 1 day, 11:30:52/19:09:38, loss=0.384147990716185, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=2.3157776925197755, lr=0.000287379965232415
2023-11-22 19:56:12   INFO  epoch: 19/30, acc_iter=130903, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:14:09, time_cost(all): 1 day, 11:31:41/17:36:57, loss=0.384064876729876, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=4.857448039534701, lr=0.00028705923105049
2023-11-22 19:57:01   INFO  epoch: 19/30, acc_iter=130953, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:57, time_cost(all): 1 day, 11:32:30/19:13:13, loss=0.383981762743567, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=0.6756874504812018, lr=0.000286738496868565
2023-11-22 19:57:50   INFO  epoch: 19/30, acc_iter=131003, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:30, time_cost(all): 1 day, 11:33:19/18:45:14, loss=0.383898648757258, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=2.740183868097466, lr=0.000286417762686641
2023-11-22 19:58:39   INFO  epoch: 19/30, acc_iter=131053, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:47, time_cost(all): 1 day, 11:34:08/17:52:09, loss=0.383815534770949, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=1.1873059465050242, lr=0.000286097028504716
2023-11-22 19:59:28   INFO  epoch: 19/30, acc_iter=131103, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:55, time_cost(all): 1 day, 11:34:57/19:14:24, loss=0.38373242078464, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=4.306976792346507, lr=0.000285776294322791
2023-11-22 20:00:17   INFO  epoch: 19/30, acc_iter=131153, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:21, time_cost(all): 1 day, 11:35:46/18:41:35, loss=0.383649306798331, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=2.554389350788742, lr=0.000285455560140867
2023-11-22 20:01:06   INFO  epoch: 19/30, acc_iter=131203, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:39, time_cost(all): 1 day, 11:36:35/17:32:01, loss=0.383566192812022, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=2.0339783930749156, lr=0.000285134825958942
2023-11-22 20:01:56   INFO  epoch: 19/30, acc_iter=131253, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:08, time_cost(all): 1 day, 11:37:25/18:12:55, loss=0.383483078825713, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=1.7984942270230315, lr=0.000284814091777017
2023-11-22 20:02:45   INFO  epoch: 19/30, acc_iter=131303, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:15, time_cost(all): 1 day, 11:38:14/18:05:50, loss=0.383399964839404, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=2.735475000808046, lr=0.000284493357595092
2023-11-22 20:03:34   INFO  epoch: 19/30, acc_iter=131353, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:20, time_cost(all): 1 day, 11:39:03/18:07:45, loss=0.383316850853095, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=1.5278856396069331, lr=0.000284172623413168
2023-11-22 20:04:23   INFO  epoch: 19/30, acc_iter=131403, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:35, time_cost(all): 1 day, 11:39:52/17:43:36, loss=0.383233736866786, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=4.142659390797516, lr=0.000283851889231243
2023-11-22 20:05:12   INFO  epoch: 19/30, acc_iter=131453, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:52, time_cost(all): 1 day, 11:40:41/18:12:54, loss=0.383150622880476, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=0.6627725668492039, lr=0.000283531155049318
2023-11-22 20:06:01   INFO  epoch: 19/30, acc_iter=131503, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:04:03, time_cost(all): 1 day, 11:41:30/17:41:37, loss=0.383067508894167, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=4.590804219599023, lr=0.000283210420867394
2023-11-22 20:06:50   INFO  epoch: 19/30, acc_iter=131553, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:03, time_cost(all): 1 day, 11:42:19/18:31:52, loss=0.382984394907858, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=4.183151272628031, lr=0.000282889686685469
2023-11-22 20:07:39   INFO  epoch: 19/30, acc_iter=131603, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:13, time_cost(all): 1 day, 11:43:08/17:18:02, loss=0.382901280921549, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=2.8797321408410577, lr=0.000282568952503544
2023-11-22 20:08:28   INFO  epoch: 19/30, acc_iter=131653, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:26, time_cost(all): 1 day, 11:43:57/18:19:49, loss=0.38281816693524, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=4.614166471533969, lr=0.000282248218321619
2023-11-22 20:09:18   INFO  epoch: 19/30, acc_iter=131703, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:38, time_cost(all): 1 day, 11:44:47/18:20:40, loss=0.382735052948931, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=1.5912420607866775, lr=0.000281927484139695
2023-11-22 20:10:07   INFO  epoch: 20/30, acc_iter=131790, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:41:50, time_cost(all): 1 day, 11:45:36/18:34:58, loss=0.382590434612753, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=3.9199554580486144, lr=0.000281369406663146
2023-11-22 20:10:56   INFO  epoch: 20/30, acc_iter=131840, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:42:18, time_cost(all): 1 day, 11:46:25/18:51:33, loss=0.382507320626444, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=4.931387496974277, lr=0.000281048672481221
2023-11-22 20:11:45   INFO  epoch: 20/30, acc_iter=131890, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:49:50, time_cost(all): 1 day, 11:47:14/17:19:34, loss=0.382424206640135, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=0.855484818149621, lr=0.000280727938299296
2023-11-22 20:12:34   INFO  epoch: 20/30, acc_iter=131940, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:39:35, time_cost(all): 1 day, 11:48:03/17:35:10, loss=0.382341092653826, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=3.948551115425198, lr=0.000280407204117372
2023-11-22 20:13:23   INFO  epoch: 20/30, acc_iter=131990, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:39:04, time_cost(all): 1 day, 11:48:52/17:16:14, loss=0.382257978667517, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=3.2228192580957318, lr=0.000280086469935447
2023-11-22 20:14:12   INFO  epoch: 20/30, acc_iter=132040, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:38:35, time_cost(all): 1 day, 11:49:41/18:59:28, loss=0.382174864681208, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=2.8751249292900383, lr=0.000279765735753522
2023-11-22 20:15:01   INFO  epoch: 20/30, acc_iter=132090, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:38:10, time_cost(all): 1 day, 11:50:30/18:33:52, loss=0.382091750694899, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=1.0370741155436227, lr=0.000279445001571597
2023-11-22 20:15:51   INFO  epoch: 20/30, acc_iter=132140, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:40:07, time_cost(all): 1 day, 11:51:20/17:25:32, loss=0.38200863670859, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=4.349401142038872, lr=0.000279124267389673
2023-11-22 20:16:40   INFO  epoch: 20/30, acc_iter=132190, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:42:53, time_cost(all): 1 day, 11:52:09/18:47:23, loss=0.381925522722281, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=0.7838302417928482, lr=0.000278803533207748
2023-11-22 20:17:29   INFO  epoch: 20/30, acc_iter=132240, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:36:09, time_cost(all): 1 day, 11:52:58/17:45:21, loss=0.381842408735972, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=2.502358078431249, lr=0.000278482799025823
2023-11-22 20:18:18   INFO  epoch: 20/30, acc_iter=132290, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:39:06, time_cost(all): 1 day, 11:53:47/17:40:01, loss=0.381759294749663, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=3.1583807549570984, lr=0.000278162064843899
2023-11-22 20:19:07   INFO  epoch: 20/30, acc_iter=132340, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:36:31, time_cost(all): 1 day, 11:54:36/17:55:02, loss=0.381676180763354, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=4.670559381790146, lr=0.000277841330661974
2023-11-22 20:19:56   INFO  epoch: 20/30, acc_iter=132390, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:33:25, time_cost(all): 1 day, 11:55:25/18:14:38, loss=0.381593066777045, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=3.9011329380094626, lr=0.000277520596480049
2023-11-22 20:20:45   INFO  epoch: 20/30, acc_iter=132440, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:36:05, time_cost(all): 1 day, 11:56:14/18:40:18, loss=0.381509952790736, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=1.3895086707156359, lr=0.000277199862298125
2023-11-22 20:21:34   INFO  epoch: 20/30, acc_iter=132490, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:37:09, time_cost(all): 1 day, 11:57:03/17:50:33, loss=0.381426838804426, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=0.9595294791650767, lr=0.0002768791281162
2023-11-22 20:22:23   INFO  epoch: 20/30, acc_iter=132540, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:37:51, time_cost(all): 1 day, 11:57:52/18:14:05, loss=0.381343724818117, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=1.691421477682547, lr=0.000276558393934275
2023-11-22 20:23:13   INFO  epoch: 20/30, acc_iter=132590, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:36:50, time_cost(all): 1 day, 11:58:42/17:46:09, loss=0.381260610831808, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=2.4896282592042094, lr=0.00027623765975235
2023-11-22 20:24:02   INFO  epoch: 20/30, acc_iter=132640, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:31:05, time_cost(all): 1 day, 11:59:31/18:29:47, loss=0.381177496845499, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=2.0602215463180076, lr=0.000275916925570426
2023-11-22 20:24:51   INFO  epoch: 20/30, acc_iter=132690, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:33:12, time_cost(all): 1 day, 12:00:20/17:41:21, loss=0.38109438285919, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=2.5571390950564927, lr=0.000275596191388501
2023-11-22 20:25:40   INFO  epoch: 20/30, acc_iter=132740, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:33:12, time_cost(all): 1 day, 12:01:09/17:41:36, loss=0.381011268872881, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=3.707554989796454, lr=0.000275275457206576
2023-11-22 20:26:29   INFO  epoch: 20/30, acc_iter=132790, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:26:58, time_cost(all): 1 day, 12:01:58/17:37:50, loss=0.380928154886572, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=0.9291135368033636, lr=0.000274954723024652
2023-11-22 20:27:18   INFO  epoch: 20/30, acc_iter=132840, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:26:05, time_cost(all): 1 day, 12:02:47/18:26:07, loss=0.380845040900263, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=1.6298764805042687, lr=0.000274633988842727
2023-11-22 20:28:07   INFO  epoch: 20/30, acc_iter=132890, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:32:52, time_cost(all): 1 day, 12:03:36/17:37:06, loss=0.380761926913954, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=2.4666207236157343, lr=0.000274313254660802
2023-11-22 20:28:56   INFO  epoch: 20/30, acc_iter=132940, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:27:16, time_cost(all): 1 day, 12:04:25/18:30:28, loss=0.380678812927645, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=4.091220721577719, lr=0.000273992520478878
2023-11-22 20:29:45   INFO  epoch: 20/30, acc_iter=132990, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:23:36, time_cost(all): 1 day, 12:05:14/17:25:14, loss=0.380595698941336, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=1.2655423302848068, lr=0.000273671786296953
2023-11-22 20:30:35   INFO  epoch: 20/30, acc_iter=133040, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:26:56, time_cost(all): 1 day, 12:06:04/17:02:21, loss=0.380512584955027, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=4.671358437426204, lr=0.000273351052115028
2023-11-22 20:31:24   INFO  epoch: 20/30, acc_iter=133090, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:24:25, time_cost(all): 1 day, 12:06:53/18:36:03, loss=0.380429470968718, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=3.054780143163038, lr=0.000273030317933103
2023-11-22 20:32:13   INFO  epoch: 20/30, acc_iter=133140, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:22:37, time_cost(all): 1 day, 12:07:42/17:30:52, loss=0.380346356982409, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=1.8776555198720275, lr=0.000272709583751179
2023-11-22 20:33:02   INFO  epoch: 20/30, acc_iter=133190, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:23:18, time_cost(all): 1 day, 12:08:31/18:40:04, loss=0.3802632429961, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=1.8551007172434733, lr=0.000272388849569254
2023-11-22 20:33:51   INFO  epoch: 20/30, acc_iter=133240, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:19:54, time_cost(all): 1 day, 12:09:20/18:33:57, loss=0.38018012900979, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.23(1.03), norm=1.187642821886588, lr=0.000272068115387329
2023-11-22 20:34:40   INFO  epoch: 20/30, acc_iter=133290, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:22:16, time_cost(all): 1 day, 12:10:09/17:49:58, loss=0.380097015023481, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=0.6503873039813173, lr=0.000271747381205405
2023-11-22 20:35:29   INFO  epoch: 20/30, acc_iter=133340, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:25:03, time_cost(all): 1 day, 12:10:58/17:59:59, loss=0.380013901037172, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=4.85544203368018, lr=0.00027142664702348
2023-11-22 20:36:18   INFO  epoch: 20/30, acc_iter=133390, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:21:54, time_cost(all): 1 day, 12:11:47/18:21:07, loss=0.379930787050863, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=1.9796480481519616, lr=0.000271105912841555
2023-11-22 20:37:08   INFO  epoch: 20/30, acc_iter=133440, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:23:54, time_cost(all): 1 day, 12:12:37/18:23:53, loss=0.379847673064554, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=2.5083323573982867, lr=0.000270785178659631
2023-11-22 20:37:57   INFO  epoch: 20/30, acc_iter=133490, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:16:00, time_cost(all): 1 day, 12:13:26/17:46:39, loss=0.379764559078245, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=0.910189449006537, lr=0.000270464444477706
2023-11-22 20:38:46   INFO  epoch: 20/30, acc_iter=133540, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:18:14, time_cost(all): 1 day, 12:14:15/17:05:52, loss=0.379681445091936, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=2.250675352431888, lr=0.000270143710295781
2023-11-22 20:39:35   INFO  epoch: 20/30, acc_iter=133590, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:19:48, time_cost(all): 1 day, 12:15:04/16:51:37, loss=0.379598331105627, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=1.6838500958113711, lr=0.000269822976113856
2023-11-22 20:40:24   INFO  epoch: 20/30, acc_iter=133640, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:18:07, time_cost(all): 1 day, 12:15:53/18:20:31, loss=0.379515217119318, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=2.0626699295757196, lr=0.000269502241931932
2023-11-22 20:41:13   INFO  epoch: 20/30, acc_iter=133690, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:12:28, time_cost(all): 1 day, 12:16:42/17:36:26, loss=0.379432103133009, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=2.3569001165254204, lr=0.000269181507750007
2023-11-22 20:42:02   INFO  epoch: 20/30, acc_iter=133740, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:11:42, time_cost(all): 1 day, 12:17:31/17:24:00, loss=0.3793489891467, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=3.127367245344165, lr=0.000268860773568082
2023-11-22 20:42:51   INFO  epoch: 20/30, acc_iter=133790, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:10:58, time_cost(all): 1 day, 12:18:20/18:14:19, loss=0.379265875160391, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=3.486588141568722, lr=0.000268540039386158
2023-11-22 20:43:40   INFO  epoch: 20/30, acc_iter=133840, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:10:58, time_cost(all): 1 day, 12:19:09/18:18:49, loss=0.379182761174082, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=4.465259463338826, lr=0.000268219305204233
2023-11-22 20:44:30   INFO  epoch: 20/30, acc_iter=133890, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:14:01, time_cost(all): 1 day, 12:19:59/18:04:52, loss=0.379099647187773, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=4.393602981582903, lr=0.000267898571022308
2023-11-22 20:45:19   INFO  epoch: 20/30, acc_iter=133940, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:13:26, time_cost(all): 1 day, 12:20:48/17:20:18, loss=0.379016533201464, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=3.9180517819884098, lr=0.000267577836840384
2023-11-22 20:46:08   INFO  epoch: 20/30, acc_iter=133990, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:13:40, time_cost(all): 1 day, 12:21:37/17:53:19, loss=0.378933419215155, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=3.2419303148726226, lr=0.000267257102658459
2023-11-22 20:46:57   INFO  epoch: 20/30, acc_iter=134040, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:08:36, time_cost(all): 1 day, 12:22:26/17:11:02, loss=0.378850305228845, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=4.7022287663929605, lr=0.000266936368476534
2023-11-22 20:47:46   INFO  epoch: 20/30, acc_iter=134090, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:09:27, time_cost(all): 1 day, 12:23:15/18:24:16, loss=0.378767191242536, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=2.448655533180134, lr=0.000266615634294609
2023-11-22 20:48:35   INFO  epoch: 20/30, acc_iter=134140, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:08:56, time_cost(all): 1 day, 12:24:04/16:53:36, loss=0.378684077256227, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.093459914948143, lr=0.000266294900112685
2023-11-22 20:49:24   INFO  epoch: 20/30, acc_iter=134190, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:05:50, time_cost(all): 1 day, 12:24:53/18:14:07, loss=0.378600963269918, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=2.642845202656405, lr=0.00026597416593076
2023-11-22 20:50:13   INFO  epoch: 20/30, acc_iter=134240, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:07:48, time_cost(all): 1 day, 12:25:42/17:34:59, loss=0.378517849283609, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=1.4722970787189933, lr=0.000265653431748835
2023-11-22 20:51:03   INFO  epoch: 20/30, acc_iter=134290, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:03:05, time_cost(all): 1 day, 12:26:32/17:54:14, loss=0.3784347352973, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=4.117925313892723, lr=0.000265332697566911
2023-11-22 20:51:52   INFO  epoch: 20/30, acc_iter=134340, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:06:08, time_cost(all): 1 day, 12:27:21/17:39:49, loss=0.378351621310991, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=3.051122208219644, lr=0.000265011963384986
2023-11-22 20:52:41   INFO  epoch: 20/30, acc_iter=134390, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:04:47, time_cost(all): 1 day, 12:28:10/17:39:17, loss=0.378268507324682, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=3.9353706177421515, lr=0.000264691229203061
2023-11-22 20:53:30   INFO  epoch: 20/30, acc_iter=134440, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:04:30, time_cost(all): 1 day, 12:28:59/17:15:05, loss=0.378185393338373, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.85(1.03), norm=1.8698217438445075, lr=0.000264370495021136
2023-11-22 20:54:19   INFO  epoch: 20/30, acc_iter=134490, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:01:31, time_cost(all): 1 day, 12:29:48/16:49:56, loss=0.378102279352064, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=1.560496308056846, lr=0.000264049760839212
2023-11-22 20:55:08   INFO  epoch: 20/30, acc_iter=134540, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/0:59:42, time_cost(all): 1 day, 12:30:37/16:57:59, loss=0.378019165365755, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=0.682164408176037, lr=0.000263729026657287
2023-11-22 20:55:57   INFO  epoch: 20/30, acc_iter=134590, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:00:42, time_cost(all): 1 day, 12:31:26/17:42:45, loss=0.377936051379446, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=4.9594301908282095, lr=0.000263408292475362
2023-11-22 20:56:46   INFO  epoch: 20/30, acc_iter=134640, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:03:02, time_cost(all): 1 day, 12:32:15/18:11:34, loss=0.377852937393137, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=3.885936888099397, lr=0.000263087558293438
2023-11-22 20:57:35   INFO  epoch: 20/30, acc_iter=134690, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:59:08, time_cost(all): 1 day, 12:33:04/16:43:09, loss=0.377769823406828, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=2.5712343738147148, lr=0.000262766824111513
2023-11-22 20:58:25   INFO  epoch: 20/30, acc_iter=134740, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/1:00:26, time_cost(all): 1 day, 12:33:54/17:47:08, loss=0.377686709420519, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=4.900554537407685, lr=0.000262446089929588
2023-11-22 20:59:14   INFO  epoch: 20/30, acc_iter=134790, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:56:20, time_cost(all): 1 day, 12:34:43/16:58:47, loss=0.37760359543421, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.0605331247181249, lr=0.000262125355747664
2023-11-22 21:00:03   INFO  epoch: 20/30, acc_iter=134840, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:24, time_cost(all): 1 day, 12:35:32/16:52:13, loss=0.377520481447901, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=4.955154765732956, lr=0.000261804621565739
2023-11-22 21:00:52   INFO  epoch: 20/30, acc_iter=134890, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:58:18, time_cost(all): 1 day, 12:36:21/17:24:55, loss=0.377437367461591, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=4.16990450322857, lr=0.000261483887383814
2023-11-22 21:01:41   INFO  epoch: 20/30, acc_iter=134940, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:52:55, time_cost(all): 1 day, 12:37:10/17:48:31, loss=0.377354253475282, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=1.9712271484564208, lr=0.000261163153201889
2023-11-22 21:02:30   INFO  epoch: 20/30, acc_iter=134990, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:52:10, time_cost(all): 1 day, 12:37:59/17:05:24, loss=0.377271139488973, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=3.495805187190151, lr=0.000260842419019965
2023-11-22 21:03:19   INFO  epoch: 20/30, acc_iter=135040, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:18, time_cost(all): 1 day, 12:38:48/17:09:04, loss=0.377188025502664, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=1.793306611316853, lr=0.00026052168483804
2023-11-22 21:04:08   INFO  epoch: 20/30, acc_iter=135090, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:50:29, time_cost(all): 1 day, 12:39:37/17:30:49, loss=0.377104911516355, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=3.767506867476669, lr=0.000260200950656115
2023-11-22 21:04:58   INFO  epoch: 20/30, acc_iter=135140, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:49:38, time_cost(all): 1 day, 12:40:27/17:46:00, loss=0.377021797530046, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=2.1699857095079356, lr=0.000259880216474191
2023-11-22 21:05:47   INFO  epoch: 20/30, acc_iter=135190, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:51:49, time_cost(all): 1 day, 12:41:16/16:46:19, loss=0.376938683543737, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=4.102782188540577, lr=0.000259559482292266
2023-11-22 21:06:36   INFO  epoch: 20/30, acc_iter=135240, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:51:21, time_cost(all): 1 day, 12:42:05/17:45:08, loss=0.376855569557428, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=2.1217892491939656, lr=0.000259238748110341
2023-11-22 21:07:25   INFO  epoch: 20/30, acc_iter=135290, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:47:26, time_cost(all): 1 day, 12:42:54/17:42:09, loss=0.376772455571119, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=1.8976344858490317, lr=0.000258918013928416
2023-11-22 21:08:14   INFO  epoch: 20/30, acc_iter=135340, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:50:21, time_cost(all): 1 day, 12:43:43/16:59:03, loss=0.37668934158481, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=3.350218888587594, lr=0.000258597279746492
2023-11-22 21:09:03   INFO  epoch: 20/30, acc_iter=135390, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:00, time_cost(all): 1 day, 12:44:32/17:55:13, loss=0.376606227598501, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=1.5224955999510492, lr=0.000258276545564567
2023-11-22 21:09:52   INFO  epoch: 20/30, acc_iter=135440, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:49:06, time_cost(all): 1 day, 12:45:21/17:19:48, loss=0.376523113612192, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.87(1.03), norm=0.6199777289586527, lr=0.000257955811382642
2023-11-22 21:10:41   INFO  epoch: 20/30, acc_iter=135490, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:48:12, time_cost(all): 1 day, 12:46:10/17:08:30, loss=0.376439999625883, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=1.5927636879286315, lr=0.000257635077200718
2023-11-22 21:11:30   INFO  epoch: 20/30, acc_iter=135540, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:59, time_cost(all): 1 day, 12:46:59/17:33:37, loss=0.376356885639574, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.2405250533892835, lr=0.000257314343018793
2023-11-22 21:12:20   INFO  epoch: 20/30, acc_iter=135590, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:46:54, time_cost(all): 1 day, 12:47:49/16:44:49, loss=0.376273771653265, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=4.451821933863476, lr=0.000256993608836868
2023-11-22 21:13:09   INFO  epoch: 20/30, acc_iter=135640, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:43:29, time_cost(all): 1 day, 12:48:38/16:58:17, loss=0.376190657666955, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=3.589924713696213, lr=0.000256672874654944
2023-11-22 21:13:58   INFO  epoch: 20/30, acc_iter=135690, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:01, time_cost(all): 1 day, 12:49:27/16:58:11, loss=0.376107543680646, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=3.01882942691688, lr=0.000256352140473019
2023-11-22 21:14:47   INFO  epoch: 20/30, acc_iter=135740, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:43:42, time_cost(all): 1 day, 12:50:16/17:46:46, loss=0.376024429694337, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=1.946222121867788, lr=0.000256031406291094
2023-11-22 21:15:36   INFO  epoch: 20/30, acc_iter=135790, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:39:29, time_cost(all): 1 day, 12:51:05/17:37:11, loss=0.375941315708028, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=4.069047409502105, lr=0.000255710672109169
2023-11-22 21:16:25   INFO  epoch: 20/30, acc_iter=135840, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:39:09, time_cost(all): 1 day, 12:51:54/16:23:48, loss=0.375858201721719, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=3.1797262005196996, lr=0.000255389937927245
2023-11-22 21:17:14   INFO  epoch: 20/30, acc_iter=135890, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:41:26, time_cost(all): 1 day, 12:52:43/17:27:18, loss=0.37577508773541, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=4.571858313816326, lr=0.00025506920374532
2023-11-22 21:18:03   INFO  epoch: 20/30, acc_iter=135940, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:38:23, time_cost(all): 1 day, 12:53:32/17:13:18, loss=0.375691973749101, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=3.78440599821529, lr=0.000254748469563395
2023-11-22 21:18:53   INFO  epoch: 20/30, acc_iter=135990, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:22, time_cost(all): 1 day, 12:54:22/17:41:24, loss=0.375608859762792, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=4.230363923373096, lr=0.000254427735381471
2023-11-22 21:19:42   INFO  epoch: 20/30, acc_iter=136040, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:07, time_cost(all): 1 day, 12:55:11/16:57:39, loss=0.375525745776483, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=2.6544647139089683, lr=0.000254107001199546
2023-11-22 21:20:31   INFO  epoch: 20/30, acc_iter=136090, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:49, time_cost(all): 1 day, 12:56:00/16:38:07, loss=0.375442631790174, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=3.088522808480715, lr=0.000253786267017621
2023-11-22 21:21:20   INFO  epoch: 20/30, acc_iter=136140, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:20, time_cost(all): 1 day, 12:56:49/16:13:42, loss=0.375359517803865, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=3.7153082990342408, lr=0.000253465532835696
2023-11-22 21:22:09   INFO  epoch: 20/30, acc_iter=136190, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:13, time_cost(all): 1 day, 12:57:38/16:23:12, loss=0.375276403817556, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=1.3086645625416269, lr=0.000253144798653772
2023-11-22 21:22:58   INFO  epoch: 20/30, acc_iter=136240, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:30, time_cost(all): 1 day, 12:58:27/16:55:27, loss=0.375193289831247, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=2.0720672795278894, lr=0.000252824064471847
2023-11-22 21:23:47   INFO  epoch: 20/30, acc_iter=136290, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:34:36, time_cost(all): 1 day, 12:59:16/16:20:00, loss=0.375110175844938, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=2.070866226763667, lr=0.000252503330289922
2023-11-22 21:24:36   INFO  epoch: 20/30, acc_iter=136340, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:08, time_cost(all): 1 day, 13:00:05/16:41:37, loss=0.375027061858629, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=0.685336212235012, lr=0.000252182596107998
2023-11-22 21:25:25   INFO  epoch: 20/30, acc_iter=136390, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:33:08, time_cost(all): 1 day, 13:00:54/17:24:03, loss=0.37494394787232, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=2.5615402874287843, lr=0.000251861861926073
2023-11-22 21:26:15   INFO  epoch: 20/30, acc_iter=136440, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:29:48, time_cost(all): 1 day, 13:01:44/16:19:51, loss=0.37486083388601, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=1.259562675090713, lr=0.000251541127744148
2023-11-22 21:27:04   INFO  epoch: 20/30, acc_iter=136490, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:57, time_cost(all): 1 day, 13:02:33/17:14:19, loss=0.374777719899701, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=2.643421062799722, lr=0.000251220393562224
2023-11-22 21:27:53   INFO  epoch: 20/30, acc_iter=136540, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:49, time_cost(all): 1 day, 13:03:22/16:33:56, loss=0.374694605913392, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=1.8138886347472352, lr=0.000250899659380299
2023-11-22 21:28:42   INFO  epoch: 20/30, acc_iter=136590, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:27:17, time_cost(all): 1 day, 13:04:11/17:28:42, loss=0.374611491927083, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=3.8133948437139975, lr=0.000250578925198374
2023-11-22 21:29:31   INFO  epoch: 20/30, acc_iter=136640, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:56, time_cost(all): 1 day, 13:05:00/16:55:40, loss=0.374528377940774, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=1.3528769461538974, lr=0.000250258191016449
2023-11-22 21:30:20   INFO  epoch: 20/30, acc_iter=136690, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:58, time_cost(all): 1 day, 13:05:49/17:31:28, loss=0.374445263954465, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=1.9368225509235268, lr=0.000249937456834525
2023-11-22 21:31:09   INFO  epoch: 20/30, acc_iter=136740, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:37, time_cost(all): 1 day, 13:06:38/16:51:09, loss=0.374362149968156, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=3.0031929852242794, lr=0.0002496167226526
2023-11-22 21:31:58   INFO  epoch: 20/30, acc_iter=136790, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:26:13, time_cost(all): 1 day, 13:07:27/16:24:25, loss=0.374279035981847, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=4.753456676878883, lr=0.000249295988470675
2023-11-22 21:32:48   INFO  epoch: 20/30, acc_iter=136840, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:59, time_cost(all): 1 day, 13:08:17/16:46:58, loss=0.374195921995538, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=3.186755831328526, lr=0.000248975254288751
2023-11-22 21:33:37   INFO  epoch: 20/30, acc_iter=136890, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:36, time_cost(all): 1 day, 13:09:06/17:27:17, loss=0.374112808009229, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=4.336099044100754, lr=0.000248654520106826
2023-11-22 21:34:26   INFO  epoch: 20/30, acc_iter=136940, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:30, time_cost(all): 1 day, 13:09:55/17:31:56, loss=0.37402969402292, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=1.6497973864063067, lr=0.000248333785924901
2023-11-22 21:35:15   INFO  epoch: 20/30, acc_iter=136990, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:26, time_cost(all): 1 day, 13:10:44/16:30:57, loss=0.373946580036611, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=3.601225014537344, lr=0.000248013051742976
2023-11-22 21:36:04   INFO  epoch: 20/30, acc_iter=137040, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:16, time_cost(all): 1 day, 13:11:33/16:32:55, loss=0.373863466050302, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=3.1773301511848633, lr=0.000247692317561052
2023-11-22 21:36:53   INFO  epoch: 20/30, acc_iter=137090, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:21:11, time_cost(all): 1 day, 13:12:22/16:53:54, loss=0.373780352063993, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=4.485874495044386, lr=0.000247371583379127
2023-11-22 21:37:42   INFO  epoch: 20/30, acc_iter=137140, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:50, time_cost(all): 1 day, 13:13:11/16:05:27, loss=0.373697238077684, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=4.63579389482893, lr=0.000247050849197202
2023-11-22 21:38:31   INFO  epoch: 20/30, acc_iter=137190, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:20, time_cost(all): 1 day, 13:14:00/15:59:39, loss=0.373614124091375, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=1.145535220207067, lr=0.000246730115015278
2023-11-22 21:39:20   INFO  epoch: 20/30, acc_iter=137240, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:32, time_cost(all): 1 day, 13:14:49/16:09:30, loss=0.373531010105066, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=2.5677797868651884, lr=0.000246409380833353
2023-11-22 21:40:10   INFO  epoch: 20/30, acc_iter=137290, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:17, time_cost(all): 1 day, 13:15:39/17:20:17, loss=0.373447896118756, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=3.619941037342661, lr=0.000246088646651428
2023-11-22 21:40:59   INFO  epoch: 20/30, acc_iter=137340, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:49, time_cost(all): 1 day, 13:16:28/15:50:54, loss=0.373364782132447, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=4.071172742455953, lr=0.000245767912469504
2023-11-22 21:41:48   INFO  epoch: 20/30, acc_iter=137390, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:16:02, time_cost(all): 1 day, 13:17:17/16:51:10, loss=0.373281668146138, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=4.709837153602069, lr=0.000245447178287579
2023-11-22 21:42:37   INFO  epoch: 20/30, acc_iter=137440, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:04, time_cost(all): 1 day, 13:18:06/16:43:15, loss=0.373198554159829, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=3.485872856144592, lr=0.000245126444105654
2023-11-22 21:43:26   INFO  epoch: 20/30, acc_iter=137490, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:14:06, time_cost(all): 1 day, 13:18:55/17:17:28, loss=0.37311544017352, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=4.481127426472111, lr=0.000244805709923729
2023-11-22 21:44:15   INFO  epoch: 20/30, acc_iter=137540, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:27, time_cost(all): 1 day, 13:19:44/15:52:19, loss=0.373032326187211, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=4.769658727949412, lr=0.000244484975741805
2023-11-22 21:45:04   INFO  epoch: 20/30, acc_iter=137590, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:11, time_cost(all): 1 day, 13:20:33/16:28:42, loss=0.372949212200902, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=3.6976882736314285, lr=0.00024416424155988
2023-11-22 21:45:53   INFO  epoch: 20/30, acc_iter=137640, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:17, time_cost(all): 1 day, 13:21:22/16:52:55, loss=0.372866098214593, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=2.0931206225461816, lr=0.000243843507377955
2023-11-22 21:46:43   INFO  epoch: 20/30, acc_iter=137690, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:17, time_cost(all): 1 day, 13:22:12/16:48:26, loss=0.372782984228284, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=4.059681245770285, lr=0.000243522773196031
2023-11-22 21:47:32   INFO  epoch: 20/30, acc_iter=137740, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:58, time_cost(all): 1 day, 13:23:01/15:50:02, loss=0.372699870241975, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=1.7086107211829327, lr=0.000243202039014106
2023-11-22 21:48:21   INFO  epoch: 20/30, acc_iter=137790, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:09:12, time_cost(all): 1 day, 13:23:50/16:17:18, loss=0.372616756255666, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=4.037022783322718, lr=0.000242881304832181
2023-11-22 21:49:10   INFO  epoch: 20/30, acc_iter=137840, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:48, time_cost(all): 1 day, 13:24:39/15:58:05, loss=0.372533642269357, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.83(1.03), norm=4.0849474727875705, lr=0.000242560570650257
2023-11-22 21:49:59   INFO  epoch: 20/30, acc_iter=137890, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:06:56, time_cost(all): 1 day, 13:25:28/17:01:27, loss=0.372450528283048, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=0.9169784709117729, lr=0.000242239836468332
2023-11-22 21:50:48   INFO  epoch: 20/30, acc_iter=137940, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:38, time_cost(all): 1 day, 13:26:17/16:20:30, loss=0.372367414296739, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=2.9790797863025773, lr=0.000241919102286407
2023-11-22 21:51:37   INFO  epoch: 20/30, acc_iter=137990, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:23, time_cost(all): 1 day, 13:27:06/16:11:11, loss=0.37228430031043, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=2.3223449569507504, lr=0.000241598368104482
2023-11-22 21:52:26   INFO  epoch: 20/30, acc_iter=138040, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:44, time_cost(all): 1 day, 13:27:55/15:41:09, loss=0.37220118632412, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=4.892808737120976, lr=0.000241277633922558
2023-11-22 21:53:15   INFO  epoch: 20/30, acc_iter=138090, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:50, time_cost(all): 1 day, 13:28:44/17:00:33, loss=0.372118072337811, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=2.072769038357708, lr=0.000240956899740633
2023-11-22 21:54:05   INFO  epoch: 20/30, acc_iter=138140, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:56, time_cost(all): 1 day, 13:29:34/16:25:53, loss=0.372034958351502, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=0.5129354213754229, lr=0.000240636165558708
2023-11-22 21:54:54   INFO  epoch: 20/30, acc_iter=138190, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:16, time_cost(all): 1 day, 13:30:23/16:21:47, loss=0.371951844365193, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=0.9874540673284969, lr=0.000240315431376784
2023-11-22 21:55:43   INFO  epoch: 20/30, acc_iter=138240, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:27, time_cost(all): 1 day, 13:31:12/16:38:35, loss=0.371868730378884, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=2.3430645684166684, lr=0.000239994697194859
2023-11-22 21:56:32   INFO  epoch: 20/30, acc_iter=138290, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:36, time_cost(all): 1 day, 13:32:01/16:36:39, loss=0.371785616392575, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=1.6667010771700386, lr=0.000239673963012934
2023-11-22 21:57:21   INFO  epoch: 21/30, acc_iter=138377, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:49:09, time_cost(all): 1 day, 13:32:50/17:04:44, loss=0.371640998056397, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=2.7981226747759544, lr=0.000239115885536385
2023-11-22 21:58:10   INFO  epoch: 21/30, acc_iter=138427, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:50:07, time_cost(all): 1 day, 13:33:39/16:39:45, loss=0.371557884070088, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=2.1607287494551617, lr=0.000238795151354461
2023-11-22 21:58:59   INFO  epoch: 21/30, acc_iter=138477, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:49:45, time_cost(all): 1 day, 13:34:28/16:27:58, loss=0.371474770083779, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=1.9705472224105027, lr=0.000238474417172536
2023-11-22 21:59:48   INFO  epoch: 21/30, acc_iter=138527, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:41:51, time_cost(all): 1 day, 13:35:17/16:55:51, loss=0.37139165609747, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=3.899227466205598, lr=0.000238153682990611
2023-11-22 22:00:38   INFO  epoch: 21/30, acc_iter=138577, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:45:54, time_cost(all): 1 day, 13:36:07/16:07:50, loss=0.371308542111161, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=2.4064094994853154, lr=0.000237832948808686
2023-11-22 22:01:27   INFO  epoch: 21/30, acc_iter=138627, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:43:40, time_cost(all): 1 day, 13:36:56/16:45:16, loss=0.371225428124852, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=2.811865195171555, lr=0.000237512214626762
2023-11-22 22:02:16   INFO  epoch: 21/30, acc_iter=138677, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:43:46, time_cost(all): 1 day, 13:37:45/16:59:46, loss=0.371142314138543, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=2.5906743479841636, lr=0.000237191480444837
2023-11-22 22:03:05   INFO  epoch: 21/30, acc_iter=138727, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:40:13, time_cost(all): 1 day, 13:38:34/16:29:11, loss=0.371059200152234, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=4.0587352643520855, lr=0.000236870746262912
2023-11-22 22:03:54   INFO  epoch: 21/30, acc_iter=138777, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:37:36, time_cost(all): 1 day, 13:39:23/17:04:16, loss=0.370976086165925, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.8672271208227116, lr=0.000236550012080988
2023-11-22 22:04:43   INFO  epoch: 21/30, acc_iter=138827, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:43:11, time_cost(all): 1 day, 13:40:12/16:35:00, loss=0.370892972179616, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=2.1627134450746754, lr=0.000236229277899063
2023-11-22 22:05:32   INFO  epoch: 21/30, acc_iter=138877, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:40:08, time_cost(all): 1 day, 13:41:01/16:43:11, loss=0.370809858193307, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=4.950251297100921, lr=0.000235908543717138
2023-11-22 22:06:21   INFO  epoch: 21/30, acc_iter=138927, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:37:38, time_cost(all): 1 day, 13:41:50/16:33:01, loss=0.370726744206998, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=3.5731475731277365, lr=0.000235587809535213
2023-11-22 22:07:10   INFO  epoch: 21/30, acc_iter=138977, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:35:38, time_cost(all): 1 day, 13:42:39/15:53:18, loss=0.370643630220689, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=4.5944542095566945, lr=0.000235267075353289
2023-11-22 22:08:00   INFO  epoch: 21/30, acc_iter=139027, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:36:12, time_cost(all): 1 day, 13:43:29/15:23:57, loss=0.37056051623438, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=2.3129113827924597, lr=0.000234946341171364
2023-11-22 22:08:49   INFO  epoch: 21/30, acc_iter=139077, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:36:34, time_cost(all): 1 day, 13:44:18/16:36:33, loss=0.37047740224807, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=2.8123599025279615, lr=0.000234625606989439
2023-11-22 22:09:38   INFO  epoch: 21/30, acc_iter=139127, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:32:26, time_cost(all): 1 day, 13:45:07/16:22:48, loss=0.370394288261761, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=2.3229783739167997, lr=0.000234304872807515
2023-11-22 22:10:27   INFO  epoch: 21/30, acc_iter=139177, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:32:49, time_cost(all): 1 day, 13:45:56/16:23:33, loss=0.370311174275452, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=3.0027017925610413, lr=0.00023398413862559
2023-11-22 22:11:16   INFO  epoch: 21/30, acc_iter=139227, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:31:46, time_cost(all): 1 day, 13:46:45/16:11:21, loss=0.370228060289143, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=1.902742182049551, lr=0.000233663404443665
2023-11-22 22:12:05   INFO  epoch: 21/30, acc_iter=139277, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:34:53, time_cost(all): 1 day, 13:47:34/16:17:09, loss=0.370144946302834, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=3.249748549974878, lr=0.000233342670261741
2023-11-22 22:12:54   INFO  epoch: 21/30, acc_iter=139327, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:28:40, time_cost(all): 1 day, 13:48:23/15:51:50, loss=0.370061832316525, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=3.055883868779574, lr=0.000233021936079816
2023-11-22 22:13:43   INFO  epoch: 21/30, acc_iter=139377, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:29:45, time_cost(all): 1 day, 13:49:12/16:34:24, loss=0.369978718330216, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=0.8979733807934764, lr=0.000232701201897891
2023-11-22 22:14:32   INFO  epoch: 21/30, acc_iter=139427, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:31:07, time_cost(all): 1 day, 13:50:01/16:53:07, loss=0.369895604343907, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=4.721736271826913, lr=0.000232380467715966
2023-11-22 22:15:22   INFO  epoch: 21/30, acc_iter=139477, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:31:00, time_cost(all): 1 day, 13:50:51/16:12:34, loss=0.369812490357598, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=4.405489448566128, lr=0.000232059733534042
2023-11-22 22:16:11   INFO  epoch: 21/30, acc_iter=139527, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:27:08, time_cost(all): 1 day, 13:51:40/15:58:12, loss=0.369729376371289, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=2.0079784520319954, lr=0.000231738999352117
2023-11-22 22:17:00   INFO  epoch: 21/30, acc_iter=139577, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:27:47, time_cost(all): 1 day, 13:52:29/15:42:15, loss=0.36964626238498, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=4.03653824699405, lr=0.000231418265170192
2023-11-22 22:17:49   INFO  epoch: 21/30, acc_iter=139627, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:24:01, time_cost(all): 1 day, 13:53:18/15:53:42, loss=0.369563148398671, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=3.097854678234723, lr=0.000231097530988268
2023-11-22 22:18:38   INFO  epoch: 21/30, acc_iter=139677, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:25:22, time_cost(all): 1 day, 13:54:07/15:22:44, loss=0.369480034412362, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.7069884467898746, lr=0.000230776796806343
2023-11-22 22:19:27   INFO  epoch: 21/30, acc_iter=139727, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:27:09, time_cost(all): 1 day, 13:54:56/16:38:08, loss=0.369396920426053, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=4.229542801019521, lr=0.000230456062624418
2023-11-22 22:20:16   INFO  epoch: 21/30, acc_iter=139777, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:26:44, time_cost(all): 1 day, 13:55:45/16:47:11, loss=0.369313806439744, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=4.035348884190454, lr=0.000230135328442493
2023-11-22 22:21:05   INFO  epoch: 21/30, acc_iter=139827, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:20:51, time_cost(all): 1 day, 13:56:34/16:46:11, loss=0.369230692453435, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=1.3145259762394126, lr=0.000229814594260569
2023-11-22 22:21:55   INFO  epoch: 21/30, acc_iter=139877, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:18:32, time_cost(all): 1 day, 13:57:24/16:11:06, loss=0.369147578467126, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=1.4039058699425904, lr=0.000229493860078644
2023-11-22 22:22:44   INFO  epoch: 21/30, acc_iter=139927, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:24:03, time_cost(all): 1 day, 13:58:13/15:17:52, loss=0.369064464480816, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=0.6460847856571499, lr=0.000229173125896719
2023-11-22 22:23:33   INFO  epoch: 21/30, acc_iter=139977, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:22:43, time_cost(all): 1 day, 13:59:02/16:22:11, loss=0.368981350494507, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=2.22132392186111, lr=0.000228852391714795
2023-11-22 22:24:22   INFO  epoch: 21/30, acc_iter=140027, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:16:42, time_cost(all): 1 day, 13:59:51/15:44:15, loss=0.368898236508198, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=0.5527237502038951, lr=0.00022853165753287
2023-11-22 22:25:11   INFO  epoch: 21/30, acc_iter=140077, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:21:10, time_cost(all): 1 day, 14:00:40/16:06:13, loss=0.368815122521889, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.8989955987328866, lr=0.000228210923350945
2023-11-22 22:26:00   INFO  epoch: 21/30, acc_iter=140127, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:18:49, time_cost(all): 1 day, 14:01:29/15:16:25, loss=0.36873200853558, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=3.7777086904748693, lr=0.000227890189169021
2023-11-22 22:26:49   INFO  epoch: 21/30, acc_iter=140177, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:19:50, time_cost(all): 1 day, 14:02:18/16:27:37, loss=0.368648894549271, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=3.2600155226020613, lr=0.000227569454987096
2023-11-22 22:27:38   INFO  epoch: 21/30, acc_iter=140227, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:13:31, time_cost(all): 1 day, 14:03:07/15:30:33, loss=0.368565780562962, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=2.233913852612144, lr=0.000227248720805171
2023-11-22 22:28:27   INFO  epoch: 21/30, acc_iter=140277, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:21, time_cost(all): 1 day, 14:03:56/15:32:05, loss=0.368482666576653, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=1.4450796744551953, lr=0.000226927986623246
2023-11-22 22:29:17   INFO  epoch: 21/30, acc_iter=140327, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:13:18, time_cost(all): 1 day, 14:04:46/15:07:36, loss=0.368399552590344, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=2.3478204364245228, lr=0.000226607252441322
2023-11-22 22:30:06   INFO  epoch: 21/30, acc_iter=140377, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:12:49, time_cost(all): 1 day, 14:05:35/15:58:22, loss=0.368316438604035, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.83(1.03), norm=3.4412633832707975, lr=0.000226286518259397
2023-11-22 22:30:55   INFO  epoch: 21/30, acc_iter=140427, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:14:33, time_cost(all): 1 day, 14:06:24/15:43:39, loss=0.368233324617726, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=2.7464433235353822, lr=0.000225965784077472
2023-11-22 22:31:44   INFO  epoch: 21/30, acc_iter=140477, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:12:50, time_cost(all): 1 day, 14:07:13/16:09:37, loss=0.368150210631417, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=1.497661963177871, lr=0.000225645049895548
2023-11-22 22:32:33   INFO  epoch: 21/30, acc_iter=140527, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:14:11, time_cost(all): 1 day, 14:08:02/15:37:28, loss=0.368067096645108, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=1.0468560483457936, lr=0.000225324315713623
2023-11-22 22:33:22   INFO  epoch: 21/30, acc_iter=140577, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:12:59, time_cost(all): 1 day, 14:08:51/15:40:26, loss=0.367983982658799, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=1.5262342051398512, lr=0.000225003581531698
2023-11-22 22:34:11   INFO  epoch: 21/30, acc_iter=140627, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:08:48, time_cost(all): 1 day, 14:09:40/15:17:09, loss=0.36790086867249, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=1.4693595133418929, lr=0.000224682847349774
2023-11-22 22:35:00   INFO  epoch: 21/30, acc_iter=140677, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:05:58, time_cost(all): 1 day, 14:10:29/16:14:22, loss=0.36781775468618, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=0.8600576604612484, lr=0.000224362113167849
2023-11-22 22:35:50   INFO  epoch: 21/30, acc_iter=140727, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:06:53, time_cost(all): 1 day, 14:11:19/15:15:17, loss=0.367734640699871, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.23(1.03), norm=4.042719390299142, lr=0.000224041378985924
2023-11-22 22:36:39   INFO  epoch: 21/30, acc_iter=140777, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:09:36, time_cost(all): 1 day, 14:12:08/15:11:56, loss=0.367651526713562, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=3.2654755920236886, lr=0.000223720644803999
2023-11-22 22:37:28   INFO  epoch: 21/30, acc_iter=140827, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:05:35, time_cost(all): 1 day, 14:12:57/15:58:24, loss=0.367568412727253, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=1.375374550006316, lr=0.000223399910622075
2023-11-22 22:38:17   INFO  epoch: 21/30, acc_iter=140877, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:03:22, time_cost(all): 1 day, 14:13:46/15:39:38, loss=0.367485298740944, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=4.495243805039545, lr=0.00022307917644015
2023-11-22 22:39:06   INFO  epoch: 21/30, acc_iter=140927, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:08:29, time_cost(all): 1 day, 14:14:35/15:48:35, loss=0.367402184754635, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=1.1667846582935586, lr=0.000222758442258225
2023-11-22 22:39:55   INFO  epoch: 21/30, acc_iter=140977, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:06:37, time_cost(all): 1 day, 14:15:24/15:54:48, loss=0.367319070768326, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=1.9430179929811675, lr=0.000222437708076301
2023-11-22 22:40:44   INFO  epoch: 21/30, acc_iter=141027, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:06:10, time_cost(all): 1 day, 14:16:13/15:06:37, loss=0.367235956782017, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=4.217188330073961, lr=0.000222116973894376
2023-11-22 22:41:33   INFO  epoch: 21/30, acc_iter=141077, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:47, time_cost(all): 1 day, 14:17:02/14:56:50, loss=0.367152842795708, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=3.9872323515496424, lr=0.000221796239712451
2023-11-22 22:42:22   INFO  epoch: 21/30, acc_iter=141127, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:02:46, time_cost(all): 1 day, 14:17:51/15:16:29, loss=0.367069728809399, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=2.9036000096413783, lr=0.000221475505530527
2023-11-22 22:43:12   INFO  epoch: 21/30, acc_iter=141177, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:01:18, time_cost(all): 1 day, 14:18:41/15:13:07, loss=0.36698661482309, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=3.0290992245533985, lr=0.000221154771348602
2023-11-22 22:44:01   INFO  epoch: 21/30, acc_iter=141227, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:01:05, time_cost(all): 1 day, 14:19:30/16:20:09, loss=0.366903500836781, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=2.8912867980609613, lr=0.000220834037166677
2023-11-22 22:44:50   INFO  epoch: 21/30, acc_iter=141277, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:01:38, time_cost(all): 1 day, 14:20:19/15:15:13, loss=0.366820386850472, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=1.8493830514265062, lr=0.000220513302984752
2023-11-22 22:45:39   INFO  epoch: 21/30, acc_iter=141327, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:57:17, time_cost(all): 1 day, 14:21:08/16:18:27, loss=0.366737272864163, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=3.215875970160933, lr=0.000220192568802828
2023-11-22 22:46:28   INFO  epoch: 21/30, acc_iter=141377, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/1:00:21, time_cost(all): 1 day, 14:21:57/14:53:53, loss=0.366654158877854, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=3.4541463684008153, lr=0.000219871834620903
2023-11-22 22:47:17   INFO  epoch: 21/30, acc_iter=141427, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:54:34, time_cost(all): 1 day, 14:22:46/15:46:34, loss=0.366571044891545, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=2.340464239559879, lr=0.000219551100438978
2023-11-22 22:48:06   INFO  epoch: 21/30, acc_iter=141477, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:55:42, time_cost(all): 1 day, 14:23:35/15:03:32, loss=0.366487930905235, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=1.7508919718633125, lr=0.000219230366257054
2023-11-22 22:48:55   INFO  epoch: 21/30, acc_iter=141527, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:55:11, time_cost(all): 1 day, 14:24:24/15:11:23, loss=0.366404816918926, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=3.3868561274262463, lr=0.000218909632075129
2023-11-22 22:49:45   INFO  epoch: 21/30, acc_iter=141577, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:54:44, time_cost(all): 1 day, 14:25:14/15:17:23, loss=0.366321702932617, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=3.276150740248278, lr=0.000218588897893204
2023-11-22 22:50:34   INFO  epoch: 21/30, acc_iter=141627, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:54:48, time_cost(all): 1 day, 14:26:03/16:15:37, loss=0.366238588946308, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=0.9614716556804659, lr=0.000218268163711279
2023-11-22 22:51:23   INFO  epoch: 21/30, acc_iter=141677, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:52:12, time_cost(all): 1 day, 14:26:52/15:58:01, loss=0.366155474959999, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=2.176537353825089, lr=0.000217947429529355
2023-11-22 22:52:12   INFO  epoch: 21/30, acc_iter=141727, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:51:14, time_cost(all): 1 day, 14:27:41/15:41:08, loss=0.36607236097369, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=2.4464745346119083, lr=0.00021762669534743
2023-11-22 22:53:01   INFO  epoch: 21/30, acc_iter=141777, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:03, time_cost(all): 1 day, 14:28:30/16:05:33, loss=0.365989246987381, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=3.3012283076758475, lr=0.000217305961165505
2023-11-22 22:53:50   INFO  epoch: 21/30, acc_iter=141827, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:51:24, time_cost(all): 1 day, 14:29:19/14:41:23, loss=0.365906133001072, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=4.006232561306663, lr=0.000216985226983581
2023-11-22 22:54:39   INFO  epoch: 21/30, acc_iter=141877, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:51:45, time_cost(all): 1 day, 14:30:08/15:58:40, loss=0.365823019014763, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=0.5037934568413855, lr=0.000216664492801656
2023-11-22 22:55:28   INFO  epoch: 21/30, acc_iter=141927, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:50:53, time_cost(all): 1 day, 14:30:57/14:59:27, loss=0.365739905028454, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=1.7150889689080293, lr=0.000216343758619731
2023-11-22 22:56:17   INFO  epoch: 21/30, acc_iter=141977, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:49:36, time_cost(all): 1 day, 14:31:46/14:43:05, loss=0.365656791042145, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=2.6966382907131017, lr=0.000216023024437807
2023-11-22 22:57:07   INFO  epoch: 21/30, acc_iter=142027, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:49:23, time_cost(all): 1 day, 14:32:36/15:54:51, loss=0.365573677055836, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=0.5316667309751124, lr=0.000215702290255882
2023-11-22 22:57:56   INFO  epoch: 21/30, acc_iter=142077, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:45:44, time_cost(all): 1 day, 14:33:25/15:59:36, loss=0.365490563069527, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=3.955250188207832, lr=0.000215381556073957
2023-11-22 22:58:45   INFO  epoch: 21/30, acc_iter=142127, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:47:52, time_cost(all): 1 day, 14:34:14/14:52:23, loss=0.365407449083218, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=2.7401120709984066, lr=0.000215060821892032
2023-11-22 22:59:34   INFO  epoch: 21/30, acc_iter=142177, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:42:57, time_cost(all): 1 day, 14:35:03/15:34:53, loss=0.365324335096909, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=4.490858519145167, lr=0.000214740087710108
2023-11-22 23:00:23   INFO  epoch: 21/30, acc_iter=142227, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:51, time_cost(all): 1 day, 14:35:52/14:47:39, loss=0.3652412211106, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=3.134300721985507, lr=0.000214419353528183
2023-11-22 23:01:12   INFO  epoch: 21/30, acc_iter=142277, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:42:59, time_cost(all): 1 day, 14:36:41/14:46:33, loss=0.365158107124291, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=2.929146716375932, lr=0.000214098619346258
2023-11-22 23:02:01   INFO  epoch: 21/30, acc_iter=142327, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:44:15, time_cost(all): 1 day, 14:37:30/14:47:47, loss=0.365074993137981, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=3.1032336497188115, lr=0.000213777885164333
2023-11-22 23:02:50   INFO  epoch: 21/30, acc_iter=142377, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:21, time_cost(all): 1 day, 14:38:19/15:22:19, loss=0.364991879151672, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=1.668545560976871, lr=0.000213457150982409
2023-11-22 23:03:40   INFO  epoch: 21/30, acc_iter=142427, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:40:31, time_cost(all): 1 day, 14:39:09/14:59:17, loss=0.364908765165363, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=1.2129814178263953, lr=0.000213136416800484
2023-11-22 23:04:29   INFO  epoch: 21/30, acc_iter=142477, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:33, time_cost(all): 1 day, 14:39:58/15:52:19, loss=0.364825651179054, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=1.7332167964701726, lr=0.00021281568261856
2023-11-22 23:05:18   INFO  epoch: 21/30, acc_iter=142527, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:38:36, time_cost(all): 1 day, 14:40:47/15:19:34, loss=0.364742537192745, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=0.8161938805838249, lr=0.000212494948436635
2023-11-22 23:06:07   INFO  epoch: 21/30, acc_iter=142577, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:39, time_cost(all): 1 day, 14:41:36/15:03:31, loss=0.364659423206436, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=0.7615478649611416, lr=0.00021217421425471
2023-11-22 23:06:56   INFO  epoch: 21/30, acc_iter=142627, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:36:01, time_cost(all): 1 day, 14:42:25/15:08:55, loss=0.364576309220127, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=2.723902186640846, lr=0.000211853480072785
2023-11-22 23:07:45   INFO  epoch: 21/30, acc_iter=142677, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:34:51, time_cost(all): 1 day, 14:43:14/15:04:30, loss=0.364493195233818, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=3.456253595407168, lr=0.000211532745890861
2023-11-22 23:08:34   INFO  epoch: 21/30, acc_iter=142727, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:36:15, time_cost(all): 1 day, 14:44:03/15:35:37, loss=0.364410081247509, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=1.011014360905519, lr=0.000211212011708936
2023-11-22 23:09:23   INFO  epoch: 21/30, acc_iter=142777, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:34:56, time_cost(all): 1 day, 14:44:52/15:55:55, loss=0.3643269672612, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=3.7622756183565156, lr=0.000210891277527011
2023-11-22 23:10:12   INFO  epoch: 21/30, acc_iter=142827, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:37, time_cost(all): 1 day, 14:45:41/15:31:40, loss=0.364243853274891, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=1.610273917605453, lr=0.000210570543345086
2023-11-22 23:11:02   INFO  epoch: 21/30, acc_iter=142877, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:37, time_cost(all): 1 day, 14:46:31/15:27:38, loss=0.364160739288582, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.9720536881305346, lr=0.000210249809163162
2023-11-22 23:11:51   INFO  epoch: 21/30, acc_iter=142927, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:59, time_cost(all): 1 day, 14:47:20/14:44:26, loss=0.364077625302273, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=1.008803192835984, lr=0.000209929074981237
2023-11-22 23:12:40   INFO  epoch: 21/30, acc_iter=142977, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:33:13, time_cost(all): 1 day, 14:48:09/15:45:45, loss=0.363994511315964, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=4.510047081987138, lr=0.000209608340799312
2023-11-22 23:13:29   INFO  epoch: 21/30, acc_iter=143027, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:32:08, time_cost(all): 1 day, 14:48:58/15:40:59, loss=0.363911397329654, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=4.61682048048217, lr=0.000209287606617388
2023-11-22 23:14:18   INFO  epoch: 21/30, acc_iter=143077, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:29, time_cost(all): 1 day, 14:49:47/14:38:38, loss=0.363828283343345, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=2.787182384741485, lr=0.000208966872435463
2023-11-22 23:15:07   INFO  epoch: 21/30, acc_iter=143127, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:21, time_cost(all): 1 day, 14:50:36/15:38:19, loss=0.363745169357036, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=1.1284665587699947, lr=0.000208646138253538
2023-11-22 23:15:56   INFO  epoch: 21/30, acc_iter=143177, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:16, time_cost(all): 1 day, 14:51:25/14:45:19, loss=0.363662055370727, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=3.435322168031597, lr=0.000208325404071614
2023-11-22 23:16:45   INFO  epoch: 21/30, acc_iter=143227, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:26:16, time_cost(all): 1 day, 14:52:14/14:51:12, loss=0.363578941384418, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=3.039833411780845, lr=0.000208004669889689
2023-11-22 23:17:35   INFO  epoch: 21/30, acc_iter=143277, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:26, time_cost(all): 1 day, 14:53:04/14:20:29, loss=0.363495827398109, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=2.323836668238785, lr=0.000207683935707764
2023-11-22 23:18:24   INFO  epoch: 21/30, acc_iter=143327, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:24:53, time_cost(all): 1 day, 14:53:53/14:31:41, loss=0.3634127134118, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=4.322858300123257, lr=0.000207363201525839
2023-11-22 23:19:13   INFO  epoch: 21/30, acc_iter=143377, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:37, time_cost(all): 1 day, 14:54:42/14:29:51, loss=0.363329599425491, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=2.4853743069090246, lr=0.000207042467343915
2023-11-22 23:20:02   INFO  epoch: 21/30, acc_iter=143427, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:43, time_cost(all): 1 day, 14:55:31/15:20:41, loss=0.363246485439182, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=1.2134313562268852, lr=0.00020672173316199
2023-11-22 23:20:51   INFO  epoch: 21/30, acc_iter=143477, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:17, time_cost(all): 1 day, 14:56:20/14:21:19, loss=0.363163371452873, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=1.7621185073537649, lr=0.000206400998980065
2023-11-22 23:21:40   INFO  epoch: 21/30, acc_iter=143527, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:34, time_cost(all): 1 day, 14:57:09/15:13:40, loss=0.363080257466564, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=0.5873990539174697, lr=0.000206080264798141
2023-11-22 23:22:29   INFO  epoch: 21/30, acc_iter=143577, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:55, time_cost(all): 1 day, 14:57:58/15:29:57, loss=0.362997143480255, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=4.263713133928297, lr=0.000205759530616216
2023-11-22 23:23:18   INFO  epoch: 21/30, acc_iter=143627, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:43, time_cost(all): 1 day, 14:58:47/15:10:20, loss=0.362914029493946, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=0.9979432256411673, lr=0.000205438796434291
2023-11-22 23:24:07   INFO  epoch: 21/30, acc_iter=143677, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:54, time_cost(all): 1 day, 14:59:36/14:15:42, loss=0.362830915507637, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=4.438770655816702, lr=0.000205118062252367
2023-11-22 23:24:57   INFO  epoch: 21/30, acc_iter=143727, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:44, time_cost(all): 1 day, 15:00:26/15:23:03, loss=0.362747801521328, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=1.912822498415569, lr=0.000204797328070442
2023-11-22 23:25:46   INFO  epoch: 21/30, acc_iter=143777, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:05, time_cost(all): 1 day, 15:01:15/15:07:53, loss=0.362664687535019, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=4.137817256695409, lr=0.000204476593888517
2023-11-22 23:26:35   INFO  epoch: 21/30, acc_iter=143827, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:10, time_cost(all): 1 day, 15:02:04/15:27:30, loss=0.36258157354871, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=3.2672877778922227, lr=0.000204155859706592
2023-11-22 23:27:24   INFO  epoch: 21/30, acc_iter=143877, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:48, time_cost(all): 1 day, 15:02:53/14:14:37, loss=0.3624984595624, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=4.7839256694908245, lr=0.000203835125524668
2023-11-22 23:28:13   INFO  epoch: 21/30, acc_iter=143927, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:42, time_cost(all): 1 day, 15:03:42/14:16:46, loss=0.362415345576091, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=3.8448940777854617, lr=0.000203514391342743
2023-11-22 23:29:02   INFO  epoch: 21/30, acc_iter=143977, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:09, time_cost(all): 1 day, 15:04:31/14:10:48, loss=0.362332231589782, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=1.8222639590761442, lr=0.000203193657160818
2023-11-22 23:29:51   INFO  epoch: 21/30, acc_iter=144027, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:13:47, time_cost(all): 1 day, 15:05:20/15:19:51, loss=0.362249117603473, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=1.5275190434101071, lr=0.000202872922978894
2023-11-22 23:30:40   INFO  epoch: 21/30, acc_iter=144077, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:14:11, time_cost(all): 1 day, 15:06:09/15:27:46, loss=0.362166003617164, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=0.7076288184830166, lr=0.000202552188796969
2023-11-22 23:31:30   INFO  epoch: 21/30, acc_iter=144127, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:59, time_cost(all): 1 day, 15:06:59/14:53:29, loss=0.362082889630855, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=0.5599211626022849, lr=0.000202231454615044
2023-11-22 23:32:19   INFO  epoch: 21/30, acc_iter=144177, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:48, time_cost(all): 1 day, 15:07:48/14:04:05, loss=0.361999775644546, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=4.9035216997544655, lr=0.000201910720433119
2023-11-22 23:33:08   INFO  epoch: 21/30, acc_iter=144227, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:25, time_cost(all): 1 day, 15:08:37/15:06:38, loss=0.361916661658237, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=1.7720071262399915, lr=0.000201589986251195
2023-11-22 23:33:57   INFO  epoch: 21/30, acc_iter=144277, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:31, time_cost(all): 1 day, 15:09:26/14:15:45, loss=0.361833547671928, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=3.4612272294518527, lr=0.00020126925206927
2023-11-22 23:34:46   INFO  epoch: 21/30, acc_iter=144327, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:09, time_cost(all): 1 day, 15:10:15/14:12:44, loss=0.361750433685619, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=3.0721570417407977, lr=0.000200948517887345
2023-11-22 23:35:35   INFO  epoch: 21/30, acc_iter=144377, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:38, time_cost(all): 1 day, 15:11:04/14:56:27, loss=0.36166731969931, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=1.8855150635724183, lr=0.000200627783705421
2023-11-22 23:36:24   INFO  epoch: 21/30, acc_iter=144427, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:07, time_cost(all): 1 day, 15:11:53/15:01:26, loss=0.361584205713001, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=3.8712663511947296, lr=0.000200307049523496
2023-11-22 23:37:13   INFO  epoch: 21/30, acc_iter=144477, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:28, time_cost(all): 1 day, 15:12:42/14:44:07, loss=0.361501091726692, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=4.261022527114721, lr=0.000199986315341571
2023-11-22 23:38:02   INFO  epoch: 21/30, acc_iter=144527, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:03, time_cost(all): 1 day, 15:13:31/14:27:30, loss=0.361417977740383, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=4.824507363237616, lr=0.000199665581159647
2023-11-22 23:38:52   INFO  epoch: 21/30, acc_iter=144577, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:29, time_cost(all): 1 day, 15:14:21/14:44:07, loss=0.361334863754074, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=4.058356581291289, lr=0.000199344846977722
2023-11-22 23:39:41   INFO  epoch: 21/30, acc_iter=144627, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:52, time_cost(all): 1 day, 15:15:10/13:58:38, loss=0.361251749767765, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=3.604634738614384, lr=0.000199024112795797
2023-11-22 23:40:30   INFO  epoch: 21/30, acc_iter=144677, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:04:01, time_cost(all): 1 day, 15:15:59/14:31:29, loss=0.361168635781456, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=2.8208081623879995, lr=0.000198703378613872
2023-11-22 23:41:19   INFO  epoch: 21/30, acc_iter=144727, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:58, time_cost(all): 1 day, 15:16:48/14:00:17, loss=0.361085521795146, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=3.045837682345974, lr=0.000198382644431948
2023-11-22 23:42:08   INFO  epoch: 21/30, acc_iter=144777, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:20, time_cost(all): 1 day, 15:17:37/14:43:13, loss=0.361002407808837, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=4.275588206340874, lr=0.000198061910250023
2023-11-22 23:42:57   INFO  epoch: 21/30, acc_iter=144827, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:23, time_cost(all): 1 day, 15:18:26/14:04:37, loss=0.360919293822528, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=1.2504654505027815, lr=0.000197741176068098
2023-11-22 23:43:46   INFO  epoch: 21/30, acc_iter=144877, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:34, time_cost(all): 1 day, 15:19:15/14:11:16, loss=0.360836179836219, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=1.1661343537733118, lr=0.000197420441886174
2023-11-22 23:44:35   INFO  epoch: 22/30, acc_iter=144964, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:43:55, time_cost(all): 1 day, 15:20:04/14:46:31, loss=0.360691561500041, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=1.222522543420202, lr=0.000196862364409625
2023-11-22 23:45:25   INFO  epoch: 22/30, acc_iter=145014, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:49:07, time_cost(all): 1 day, 15:20:54/14:29:46, loss=0.360608447513732, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=3.8828995307165997, lr=0.0001965416302277
2023-11-22 23:46:14   INFO  epoch: 22/30, acc_iter=145064, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:43:11, time_cost(all): 1 day, 15:21:43/14:31:26, loss=0.360525333527423, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=3.0266432616895846, lr=0.000196220896045775
2023-11-22 23:47:03   INFO  epoch: 22/30, acc_iter=145114, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:41:39, time_cost(all): 1 day, 15:22:32/14:50:20, loss=0.360442219541114, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=1.7264876176368835, lr=0.00019590016186385
2023-11-22 23:47:52   INFO  epoch: 22/30, acc_iter=145164, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:44:56, time_cost(all): 1 day, 15:23:21/14:47:12, loss=0.360359105554805, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=4.861486410959548, lr=0.000195579427681926
2023-11-22 23:48:41   INFO  epoch: 22/30, acc_iter=145214, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:46:58, time_cost(all): 1 day, 15:24:10/14:03:46, loss=0.360275991568496, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=0.9195170334926728, lr=0.000195258693500001
2023-11-22 23:49:30   INFO  epoch: 22/30, acc_iter=145264, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:41:15, time_cost(all): 1 day, 15:24:59/14:10:14, loss=0.360192877582187, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=3.844931049422906, lr=0.000194937959318076
2023-11-22 23:50:19   INFO  epoch: 22/30, acc_iter=145314, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:41:10, time_cost(all): 1 day, 15:25:48/14:11:52, loss=0.360109763595878, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=2.1066665704027683, lr=0.000194617225136152
2023-11-22 23:51:08   INFO  epoch: 22/30, acc_iter=145364, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:42:07, time_cost(all): 1 day, 15:26:37/14:55:51, loss=0.360026649609569, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=4.680711845092157, lr=0.000194296490954227
2023-11-22 23:51:57   INFO  epoch: 22/30, acc_iter=145414, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:38:46, time_cost(all): 1 day, 15:27:26/14:34:41, loss=0.35994353562326, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=3.4751732165162457, lr=0.000193975756772302
2023-11-22 23:52:47   INFO  epoch: 22/30, acc_iter=145464, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:37:43, time_cost(all): 1 day, 15:28:16/15:10:20, loss=0.359860421636951, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=1.0158713628734395, lr=0.000193655022590378
2023-11-22 23:53:36   INFO  epoch: 22/30, acc_iter=145514, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:41:52, time_cost(all): 1 day, 15:29:05/14:24:26, loss=0.359777307650642, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=4.207563020098315, lr=0.000193334288408453
2023-11-22 23:54:25   INFO  epoch: 22/30, acc_iter=145564, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:33:15, time_cost(all): 1 day, 15:29:54/13:49:41, loss=0.359694193664333, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=3.002890870222309, lr=0.000193013554226528
2023-11-22 23:55:14   INFO  epoch: 22/30, acc_iter=145614, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:33:24, time_cost(all): 1 day, 15:30:43/14:42:25, loss=0.359611079678024, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=1.1797586873266472, lr=0.000192692820044603
2023-11-22 23:56:03   INFO  epoch: 22/30, acc_iter=145664, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:39:06, time_cost(all): 1 day, 15:31:32/13:57:43, loss=0.359527965691715, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=2.3272925084371936, lr=0.000192372085862679
2023-11-22 23:56:52   INFO  epoch: 22/30, acc_iter=145714, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:37:33, time_cost(all): 1 day, 15:32:21/13:49:59, loss=0.359444851705406, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=2.3593584308870135, lr=0.000192051351680754
2023-11-22 23:57:41   INFO  epoch: 22/30, acc_iter=145764, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:33:38, time_cost(all): 1 day, 15:33:10/14:47:28, loss=0.359361737719096, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.03(1.03), norm=3.4371299064371117, lr=0.000191730617498829
2023-11-22 23:58:30   INFO  epoch: 22/30, acc_iter=145814, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:30:06, time_cost(all): 1 day, 15:33:59/14:07:37, loss=0.359278623732787, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=3.513574212588573, lr=0.000191409883316905
2023-11-22 23:59:20   INFO  epoch: 22/30, acc_iter=145864, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:34:28, time_cost(all): 1 day, 15:34:49/15:02:55, loss=0.359195509746478, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=4.90226963970774, lr=0.00019108914913498
2023-11-23 00:00:09   INFO  epoch: 22/30, acc_iter=145914, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:27:28, time_cost(all): 1 day, 15:35:38/14:29:09, loss=0.359112395760169, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=3.04009844610435, lr=0.000190768414953055
2023-11-23 00:00:58   INFO  epoch: 22/30, acc_iter=145964, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:31:12, time_cost(all): 1 day, 15:36:27/13:44:59, loss=0.35902928177386, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=2.1096395337644442, lr=0.000190447680771131
2023-11-23 00:01:47   INFO  epoch: 22/30, acc_iter=146014, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:30:53, time_cost(all): 1 day, 15:37:16/14:59:03, loss=0.358946167787551, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=0.8685186645909259, lr=0.000190126946589206
2023-11-23 00:02:36   INFO  epoch: 22/30, acc_iter=146064, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:27:46, time_cost(all): 1 day, 15:38:05/13:40:09, loss=0.358863053801242, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.92(1.03), norm=2.3975538170566, lr=0.000189806212407281
2023-11-23 00:03:25   INFO  epoch: 22/30, acc_iter=146114, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:29:26, time_cost(all): 1 day, 15:38:54/14:58:07, loss=0.358779939814933, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=2.6052619554852456, lr=0.000189485478225356
2023-11-23 00:04:14   INFO  epoch: 22/30, acc_iter=146164, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:26:18, time_cost(all): 1 day, 15:39:43/13:55:16, loss=0.358696825828624, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=4.698113094646963, lr=0.000189164744043432
2023-11-23 00:05:03   INFO  epoch: 22/30, acc_iter=146214, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:26:31, time_cost(all): 1 day, 15:40:32/14:53:50, loss=0.358613711842315, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.0837852197513258, lr=0.000188844009861507
2023-11-23 00:05:52   INFO  epoch: 22/30, acc_iter=146264, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:22:59, time_cost(all): 1 day, 15:41:21/14:38:24, loss=0.358530597856006, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=4.455691733825615, lr=0.000188523275679582
2023-11-23 00:06:42   INFO  epoch: 22/30, acc_iter=146314, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:26:43, time_cost(all): 1 day, 15:42:11/13:50:22, loss=0.358447483869697, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=2.958223263289836, lr=0.000188202541497658
2023-11-23 00:07:31   INFO  epoch: 22/30, acc_iter=146364, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:23:07, time_cost(all): 1 day, 15:43:00/13:31:42, loss=0.358364369883388, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=2.352547650014788, lr=0.000187881807315733
2023-11-23 00:08:20   INFO  epoch: 22/30, acc_iter=146414, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:19:44, time_cost(all): 1 day, 15:43:49/13:58:25, loss=0.358281255897079, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=0.6582981020131312, lr=0.000187561073133808
2023-11-23 00:09:09   INFO  epoch: 22/30, acc_iter=146464, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:21:49, time_cost(all): 1 day, 15:44:38/14:47:31, loss=0.35819814191077, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=3.783879918233549, lr=0.000187240338951883
2023-11-23 00:09:58   INFO  epoch: 22/30, acc_iter=146514, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:21:34, time_cost(all): 1 day, 15:45:27/14:44:28, loss=0.35811502792446, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=3.9915062447971477, lr=0.000186919604769959
2023-11-23 00:10:47   INFO  epoch: 22/30, acc_iter=146564, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:19:46, time_cost(all): 1 day, 15:46:16/14:25:41, loss=0.358031913938151, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=4.0680620390820215, lr=0.000186598870588034
2023-11-23 00:11:36   INFO  epoch: 22/30, acc_iter=146614, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:22:08, time_cost(all): 1 day, 15:47:05/14:05:54, loss=0.357948799951842, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=1.894567699523075, lr=0.000186278136406109
2023-11-23 00:12:25   INFO  epoch: 22/30, acc_iter=146664, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:16:37, time_cost(all): 1 day, 15:47:54/13:34:26, loss=0.357865685965533, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=1.264558312596245, lr=0.000185957402224185
2023-11-23 00:13:14   INFO  epoch: 22/30, acc_iter=146714, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:18:41, time_cost(all): 1 day, 15:48:43/14:34:44, loss=0.357782571979224, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=2.9662023749318, lr=0.00018563666804226
2023-11-23 00:14:04   INFO  epoch: 22/30, acc_iter=146764, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:16:20, time_cost(all): 1 day, 15:49:33/14:37:14, loss=0.357699457992915, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=4.580870814769867, lr=0.000185315933860335
2023-11-23 00:14:53   INFO  epoch: 22/30, acc_iter=146814, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:19:25, time_cost(all): 1 day, 15:50:22/14:27:27, loss=0.357616344006606, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=1.7753804672465359, lr=0.000184995199678411
2023-11-23 00:15:42   INFO  epoch: 22/30, acc_iter=146864, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:18:27, time_cost(all): 1 day, 15:51:11/14:43:14, loss=0.357533230020297, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=4.697834692927095, lr=0.000184674465496486
2023-11-23 00:16:31   INFO  epoch: 22/30, acc_iter=146914, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:11:27, time_cost(all): 1 day, 15:52:00/14:18:05, loss=0.357450116033988, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=3.7654315188548866, lr=0.000184353731314561
2023-11-23 00:17:20   INFO  epoch: 22/30, acc_iter=146964, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:12:37, time_cost(all): 1 day, 15:52:49/14:29:06, loss=0.357367002047679, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=3.9111865051726555, lr=0.000184032997132636
2023-11-23 00:18:09   INFO  epoch: 22/30, acc_iter=147014, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:17:04, time_cost(all): 1 day, 15:53:38/14:07:06, loss=0.35728388806137, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=3.6545476447555463, lr=0.000183712262950712
2023-11-23 00:18:58   INFO  epoch: 22/30, acc_iter=147064, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:15:18, time_cost(all): 1 day, 15:54:27/14:09:56, loss=0.357200774075061, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=4.749568156193633, lr=0.000183391528768787
2023-11-23 00:19:47   INFO  epoch: 22/30, acc_iter=147114, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:14:31, time_cost(all): 1 day, 15:55:16/14:00:26, loss=0.357117660088752, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=2.542433515041436, lr=0.000183070794586862
2023-11-23 00:20:37   INFO  epoch: 22/30, acc_iter=147164, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:10:35, time_cost(all): 1 day, 15:56:06/13:28:33, loss=0.357034546102443, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=2.951939193536873, lr=0.000182750060404938
2023-11-23 00:21:26   INFO  epoch: 22/30, acc_iter=147214, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:12:27, time_cost(all): 1 day, 15:56:55/14:39:39, loss=0.356951432116134, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=4.825430165909039, lr=0.000182429326223013
2023-11-23 00:22:15   INFO  epoch: 22/30, acc_iter=147264, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:06:41, time_cost(all): 1 day, 15:57:44/13:16:32, loss=0.356868318129825, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=3.0233260769054633, lr=0.000182108592041088
2023-11-23 00:23:04   INFO  epoch: 22/30, acc_iter=147314, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:56, time_cost(all): 1 day, 15:58:33/14:09:35, loss=0.356785204143515, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=3.1833007616063314, lr=0.000181787857859164
2023-11-23 00:23:53   INFO  epoch: 22/30, acc_iter=147364, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:07, time_cost(all): 1 day, 15:59:22/13:28:15, loss=0.356702090157206, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=1.5005958686542793, lr=0.000181467123677239
2023-11-23 00:24:42   INFO  epoch: 22/30, acc_iter=147414, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:08:26, time_cost(all): 1 day, 16:00:11/14:25:36, loss=0.356618976170897, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=2.2685734532833424, lr=0.000181146389495314
2023-11-23 00:25:31   INFO  epoch: 22/30, acc_iter=147464, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:08:49, time_cost(all): 1 day, 16:01:00/13:54:09, loss=0.356535862184588, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=3.423651276492203, lr=0.000180825655313389
2023-11-23 00:26:20   INFO  epoch: 22/30, acc_iter=147514, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:05:47, time_cost(all): 1 day, 16:01:49/14:24:32, loss=0.356452748198279, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=1.2163829848666792, lr=0.000180504921131465
2023-11-23 00:27:09   INFO  epoch: 22/30, acc_iter=147564, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:14, time_cost(all): 1 day, 16:02:38/14:07:14, loss=0.35636963421197, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=1.9531174753464402, lr=0.00018018418694954
2023-11-23 00:27:59   INFO  epoch: 22/30, acc_iter=147614, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:00:46, time_cost(all): 1 day, 16:03:28/13:19:36, loss=0.356286520225661, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=3.754464063859681, lr=0.000179863452767615
2023-11-23 00:28:48   INFO  epoch: 22/30, acc_iter=147664, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:01:36, time_cost(all): 1 day, 16:04:17/14:17:02, loss=0.356203406239352, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=1.5195577805969889, lr=0.000179542718585691
2023-11-23 00:29:37   INFO  epoch: 22/30, acc_iter=147714, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:03:43, time_cost(all): 1 day, 16:05:06/13:10:12, loss=0.356120292253043, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=1.3249888588661767, lr=0.000179221984403766
2023-11-23 00:30:26   INFO  epoch: 22/30, acc_iter=147764, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:02:52, time_cost(all): 1 day, 16:05:55/13:55:23, loss=0.356037178266734, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=4.381758920190997, lr=0.000178901250221841
2023-11-23 00:31:15   INFO  epoch: 22/30, acc_iter=147814, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:02:26, time_cost(all): 1 day, 16:06:44/13:55:04, loss=0.355954064280425, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=4.281719869096778, lr=0.000178580516039917
2023-11-23 00:32:04   INFO  epoch: 22/30, acc_iter=147864, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:01:39, time_cost(all): 1 day, 16:07:33/14:04:11, loss=0.355870950294116, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.194936834984253, lr=0.000178259781857992
2023-11-23 00:32:53   INFO  epoch: 22/30, acc_iter=147914, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/1:00:35, time_cost(all): 1 day, 16:08:22/14:16:57, loss=0.355787836307807, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=3.877828908600164, lr=0.000177939047676067
2023-11-23 00:33:42   INFO  epoch: 22/30, acc_iter=147964, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:56:56, time_cost(all): 1 day, 16:09:11/13:48:02, loss=0.355704722321498, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=1.4959973153587085, lr=0.000177618313494142
2023-11-23 00:34:32   INFO  epoch: 22/30, acc_iter=148014, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:40, time_cost(all): 1 day, 16:10:01/13:23:11, loss=0.355621608335189, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=1.705502611148988, lr=0.000177297579312218
2023-11-23 00:35:21   INFO  epoch: 22/30, acc_iter=148064, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:54:13, time_cost(all): 1 day, 16:10:50/13:12:44, loss=0.35553849434888, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=0.6553809008160115, lr=0.000176976845130293
2023-11-23 00:36:10   INFO  epoch: 22/30, acc_iter=148114, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:55:18, time_cost(all): 1 day, 16:11:39/14:02:48, loss=0.355455380362571, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=4.162619065363248, lr=0.000176656110948368
2023-11-23 00:36:59   INFO  epoch: 22/30, acc_iter=148164, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:53:43, time_cost(all): 1 day, 16:12:28/13:03:26, loss=0.355372266376261, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=2.233538251589481, lr=0.000176335376766444
2023-11-23 00:37:48   INFO  epoch: 22/30, acc_iter=148214, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:51:23, time_cost(all): 1 day, 16:13:17/13:07:24, loss=0.355289152389952, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=3.8249319003877993, lr=0.000176014642584519
2023-11-23 00:38:37   INFO  epoch: 22/30, acc_iter=148264, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:50:50, time_cost(all): 1 day, 16:14:06/13:52:57, loss=0.355206038403643, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=1.3568634641178954, lr=0.000175693908402594
2023-11-23 00:39:26   INFO  epoch: 22/30, acc_iter=148314, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:50:59, time_cost(all): 1 day, 16:14:55/13:09:47, loss=0.355122924417334, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.23(1.03), norm=1.4740432678560424, lr=0.00017537317422067
2023-11-23 00:40:15   INFO  epoch: 22/30, acc_iter=148364, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:48:55, time_cost(all): 1 day, 16:15:44/13:21:49, loss=0.355039810431025, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=2.2280697066565063, lr=0.000175052440038745
2023-11-23 00:41:04   INFO  epoch: 22/30, acc_iter=148414, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:01, time_cost(all): 1 day, 16:16:33/13:24:21, loss=0.354956696444716, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=4.598735312786837, lr=0.00017473170585682
2023-11-23 00:41:54   INFO  epoch: 22/30, acc_iter=148464, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:57, time_cost(all): 1 day, 16:17:23/13:18:55, loss=0.354873582458407, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=1.2638216572141543, lr=0.000174410971674895
2023-11-23 00:42:43   INFO  epoch: 22/30, acc_iter=148514, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:47:43, time_cost(all): 1 day, 16:18:12/14:10:36, loss=0.354790468472098, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=3.066036211956003, lr=0.000174090237492971
2023-11-23 00:43:32   INFO  epoch: 22/30, acc_iter=148564, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:49:40, time_cost(all): 1 day, 16:19:01/13:06:39, loss=0.354707354485789, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=1.5395181121461143, lr=0.000173769503311046
2023-11-23 00:44:21   INFO  epoch: 22/30, acc_iter=148614, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:48:46, time_cost(all): 1 day, 16:19:50/13:09:07, loss=0.35462424049948, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=3.2447702763169355, lr=0.000173448769129121
2023-11-23 00:45:10   INFO  epoch: 22/30, acc_iter=148664, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:48:43, time_cost(all): 1 day, 16:20:39/13:01:27, loss=0.354541126513171, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.758978763145522, lr=0.000173128034947197
2023-11-23 00:45:59   INFO  epoch: 22/30, acc_iter=148714, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:28, time_cost(all): 1 day, 16:21:28/13:39:43, loss=0.354458012526862, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=3.099527117966401, lr=0.000172807300765272
2023-11-23 00:46:48   INFO  epoch: 22/30, acc_iter=148764, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:44:49, time_cost(all): 1 day, 16:22:17/14:06:56, loss=0.354374898540553, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=3.9626441317057313, lr=0.000172486566583347
2023-11-23 00:47:37   INFO  epoch: 22/30, acc_iter=148814, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:38, time_cost(all): 1 day, 16:23:06/13:39:03, loss=0.354291784554244, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=3.662716990937012, lr=0.000172165832401422
2023-11-23 00:48:27   INFO  epoch: 22/30, acc_iter=148864, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:43:33, time_cost(all): 1 day, 16:23:56/13:18:29, loss=0.354208670567935, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=2.894752607847326, lr=0.000171845098219498
2023-11-23 00:49:16   INFO  epoch: 22/30, acc_iter=148914, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:42:22, time_cost(all): 1 day, 16:24:45/13:38:08, loss=0.354125556581625, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=4.130417590343919, lr=0.000171524364037573
2023-11-23 00:50:05   INFO  epoch: 22/30, acc_iter=148964, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:43:09, time_cost(all): 1 day, 16:25:34/13:03:47, loss=0.354042442595316, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=3.684821805040806, lr=0.000171203629855648
2023-11-23 00:50:54   INFO  epoch: 22/30, acc_iter=149014, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:41:18, time_cost(all): 1 day, 16:26:23/14:01:54, loss=0.353959328609007, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=3.6940363708477335, lr=0.000170882895673724
2023-11-23 00:51:43   INFO  epoch: 22/30, acc_iter=149064, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:30, time_cost(all): 1 day, 16:27:12/14:03:25, loss=0.353876214622698, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=4.374322745127135, lr=0.000170562161491799
2023-11-23 00:52:32   INFO  epoch: 22/30, acc_iter=149114, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:44, time_cost(all): 1 day, 16:28:01/13:56:54, loss=0.353793100636389, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=4.405674245806062, lr=0.000170241427309874
2023-11-23 00:53:21   INFO  epoch: 22/30, acc_iter=149164, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:30, time_cost(all): 1 day, 16:28:50/14:02:41, loss=0.35370998665008, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=2.112365990916262, lr=0.00016992069312795
2023-11-23 00:54:10   INFO  epoch: 22/30, acc_iter=149214, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:36:00, time_cost(all): 1 day, 16:29:39/13:43:37, loss=0.353626872663771, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=3.5610383185597585, lr=0.000169599958946025
2023-11-23 00:54:59   INFO  epoch: 22/30, acc_iter=149264, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:38:19, time_cost(all): 1 day, 16:30:28/14:02:57, loss=0.353543758677462, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=4.873340358020966, lr=0.0001692792247641
2023-11-23 00:55:49   INFO  epoch: 22/30, acc_iter=149314, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:25, time_cost(all): 1 day, 16:31:18/13:35:41, loss=0.353460644691153, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=2.5938297756672313, lr=0.000168958490582175
2023-11-23 00:56:38   INFO  epoch: 22/30, acc_iter=149364, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:07, time_cost(all): 1 day, 16:32:07/12:49:00, loss=0.353377530704844, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=2.426823440976921, lr=0.000168637756400251
2023-11-23 00:57:27   INFO  epoch: 22/30, acc_iter=149414, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:45, time_cost(all): 1 day, 16:32:56/13:52:50, loss=0.353294416718535, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=1.74357056582974, lr=0.000168317022218326
2023-11-23 00:58:16   INFO  epoch: 22/30, acc_iter=149464, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:35:00, time_cost(all): 1 day, 16:33:45/13:57:58, loss=0.353211302732226, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=4.017507819207568, lr=0.000167996288036401
2023-11-23 00:59:05   INFO  epoch: 22/30, acc_iter=149514, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:36, time_cost(all): 1 day, 16:34:34/13:58:20, loss=0.353128188745917, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=1.3503442597807778, lr=0.000167675553854477
2023-11-23 00:59:54   INFO  epoch: 22/30, acc_iter=149564, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:06, time_cost(all): 1 day, 16:35:23/12:58:14, loss=0.353045074759608, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=1.5415442806113657, lr=0.000167354819672552
2023-11-23 01:00:43   INFO  epoch: 22/30, acc_iter=149614, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:30, time_cost(all): 1 day, 16:36:12/13:33:24, loss=0.352961960773299, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=0.890655274441895, lr=0.000167034085490627
2023-11-23 01:01:32   INFO  epoch: 22/30, acc_iter=149664, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:30:22, time_cost(all): 1 day, 16:37:01/13:47:55, loss=0.35287884678699, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=4.375721866628446, lr=0.000166713351308702
2023-11-23 01:02:22   INFO  epoch: 22/30, acc_iter=149714, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:27:48, time_cost(all): 1 day, 16:37:51/13:05:15, loss=0.35279573280068, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=3.872440117360749, lr=0.000166392617126778
2023-11-23 01:03:11   INFO  epoch: 22/30, acc_iter=149764, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:08, time_cost(all): 1 day, 16:38:40/13:38:04, loss=0.352712618814371, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=2.753772754048361, lr=0.000166071882944853
2023-11-23 01:04:00   INFO  epoch: 22/30, acc_iter=149814, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:51, time_cost(all): 1 day, 16:39:29/12:36:08, loss=0.352629504828062, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=4.564255284243486, lr=0.000165751148762928
2023-11-23 01:04:49   INFO  epoch: 22/30, acc_iter=149864, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:20, time_cost(all): 1 day, 16:40:18/13:26:18, loss=0.352546390841753, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=3.001221408452538, lr=0.000165430414581004
2023-11-23 01:05:38   INFO  epoch: 22/30, acc_iter=149914, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:09, time_cost(all): 1 day, 16:41:07/13:32:54, loss=0.352463276855444, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=4.8848772682691814, lr=0.000165109680399079
2023-11-23 01:06:27   INFO  epoch: 22/30, acc_iter=149964, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:43, time_cost(all): 1 day, 16:41:56/13:50:53, loss=0.352380162869135, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=4.230039369128116, lr=0.000164788946217154
2023-11-23 01:07:16   INFO  epoch: 22/30, acc_iter=150014, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:58, time_cost(all): 1 day, 16:42:45/13:31:24, loss=0.352297048882826, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=1.6891275988943717, lr=0.00016446821203523
2023-11-23 01:08:05   INFO  epoch: 22/30, acc_iter=150064, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:25, time_cost(all): 1 day, 16:43:34/13:23:50, loss=0.352213934896517, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=0.8247298684958599, lr=0.000164147477853305
2023-11-23 01:08:54   INFO  epoch: 22/30, acc_iter=150114, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:21:47, time_cost(all): 1 day, 16:44:23/13:42:42, loss=0.352130820910208, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=0.7809044050464193, lr=0.00016382674367138
2023-11-23 01:09:44   INFO  epoch: 22/30, acc_iter=150164, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:17, time_cost(all): 1 day, 16:45:13/12:51:08, loss=0.352047706923899, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=3.8664250451011446, lr=0.000163506009489455
2023-11-23 01:10:33   INFO  epoch: 22/30, acc_iter=150214, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:10, time_cost(all): 1 day, 16:46:02/12:56:49, loss=0.35196459293759, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=0.6066824736125667, lr=0.000163185275307531
2023-11-23 01:11:22   INFO  epoch: 22/30, acc_iter=150264, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:55, time_cost(all): 1 day, 16:46:51/13:22:37, loss=0.351881478951281, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.673060935743395, lr=0.000162864541125606
2023-11-23 01:12:11   INFO  epoch: 22/30, acc_iter=150314, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:11, time_cost(all): 1 day, 16:47:40/13:16:58, loss=0.351798364964972, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=4.439735593896499, lr=0.000162543806943681
2023-11-23 01:13:00   INFO  epoch: 22/30, acc_iter=150364, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:32, time_cost(all): 1 day, 16:48:29/12:59:08, loss=0.351715250978663, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=4.816637673521383, lr=0.000162223072761757
2023-11-23 01:13:49   INFO  epoch: 22/30, acc_iter=150414, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:34, time_cost(all): 1 day, 16:49:18/13:37:18, loss=0.351632136992354, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.23(1.03), norm=2.858058285791743, lr=0.000161902338579832
2023-11-23 01:14:38   INFO  epoch: 22/30, acc_iter=150464, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:43, time_cost(all): 1 day, 16:50:07/13:19:29, loss=0.351549023006045, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=3.2506628171824703, lr=0.000161581604397907
2023-11-23 01:15:27   INFO  epoch: 22/30, acc_iter=150514, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:51, time_cost(all): 1 day, 16:50:56/13:09:46, loss=0.351465909019736, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=3.412816301657285, lr=0.000161260870215982
2023-11-23 01:16:17   INFO  epoch: 22/30, acc_iter=150564, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:03, time_cost(all): 1 day, 16:51:46/12:25:16, loss=0.351382795033426, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.09(1.03), norm=4.441209719710406, lr=0.000160940136034058
2023-11-23 01:17:06   INFO  epoch: 22/30, acc_iter=150614, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:44, time_cost(all): 1 day, 16:52:35/12:55:31, loss=0.351299681047117, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=1.945113226112221, lr=0.000160619401852133
2023-11-23 01:17:55   INFO  epoch: 22/30, acc_iter=150664, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:44, time_cost(all): 1 day, 16:53:24/13:03:05, loss=0.351216567060808, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=1.7755691434314935, lr=0.000160298667670208
2023-11-23 01:18:44   INFO  epoch: 22/30, acc_iter=150714, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:36, time_cost(all): 1 day, 16:54:13/13:33:46, loss=0.351133453074499, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=2.704591793869443, lr=0.000159977933488284
2023-11-23 01:19:33   INFO  epoch: 22/30, acc_iter=150764, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:46, time_cost(all): 1 day, 16:55:02/13:22:50, loss=0.35105033908819, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=4.848506612087437, lr=0.000159657199306359
2023-11-23 01:20:22   INFO  epoch: 22/30, acc_iter=150814, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:22, time_cost(all): 1 day, 16:55:51/12:43:34, loss=0.350967225101881, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=3.5479670067608655, lr=0.000159336465124434
2023-11-23 01:21:11   INFO  epoch: 22/30, acc_iter=150864, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:33, time_cost(all): 1 day, 16:56:40/13:37:34, loss=0.350884111115572, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=4.51874589561808, lr=0.00015901573094251
2023-11-23 01:22:00   INFO  epoch: 22/30, acc_iter=150914, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:36, time_cost(all): 1 day, 16:57:29/13:29:17, loss=0.350800997129263, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=2.8574801585508225, lr=0.000158694996760585
2023-11-23 01:22:49   INFO  epoch: 22/30, acc_iter=150964, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:50, time_cost(all): 1 day, 16:58:18/12:40:10, loss=0.350717883142954, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=2.2333097600803704, lr=0.00015837426257866
2023-11-23 01:23:39   INFO  epoch: 22/30, acc_iter=151014, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:56, time_cost(all): 1 day, 16:59:08/12:24:50, loss=0.350634769156645, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=1.3859668579229578, lr=0.000158053528396735
2023-11-23 01:24:28   INFO  epoch: 22/30, acc_iter=151064, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:06:54, time_cost(all): 1 day, 16:59:57/13:30:08, loss=0.350551655170336, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=2.7975634964756773, lr=0.000157732794214811
2023-11-23 01:25:17   INFO  epoch: 22/30, acc_iter=151114, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:02, time_cost(all): 1 day, 17:00:46/13:21:34, loss=0.350468541184027, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=2.5508438862723666, lr=0.000157412060032886
2023-11-23 01:26:06   INFO  epoch: 22/30, acc_iter=151164, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:39, time_cost(all): 1 day, 17:01:35/13:31:34, loss=0.350385427197718, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=3.115264542989162, lr=0.000157091325850961
2023-11-23 01:26:55   INFO  epoch: 22/30, acc_iter=151214, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:32, time_cost(all): 1 day, 17:02:24/12:52:09, loss=0.350302313211409, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=1.505739484744295, lr=0.000156770591669037
2023-11-23 01:27:44   INFO  epoch: 22/30, acc_iter=151264, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:43, time_cost(all): 1 day, 17:03:13/13:14:00, loss=0.3502191992251, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=1.4708398409758339, lr=0.000156449857487112
2023-11-23 01:28:33   INFO  epoch: 22/30, acc_iter=151314, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:00, time_cost(all): 1 day, 17:04:02/12:17:14, loss=0.35013608523879, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=2.0526464730565737, lr=0.000156129123305187
2023-11-23 01:29:22   INFO  epoch: 22/30, acc_iter=151364, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:17, time_cost(all): 1 day, 17:04:51/13:06:14, loss=0.350052971252481, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=3.119368455415634, lr=0.000155808389123262
2023-11-23 01:30:12   INFO  epoch: 22/30, acc_iter=151414, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:28, time_cost(all): 1 day, 17:05:41/12:52:11, loss=0.349969857266172, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=0.6493094008163542, lr=0.000155487654941338
2023-11-23 01:31:01   INFO  epoch: 22/30, acc_iter=151464, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:34, time_cost(all): 1 day, 17:06:30/13:23:37, loss=0.349886743279863, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=2.6357236599596208, lr=0.000155166920759413
2023-11-23 01:31:50   INFO  epoch: 23/30, acc_iter=151551, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:48:50, time_cost(all): 1 day, 17:07:19/12:26:46, loss=0.349742124943686, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=2.9523796989178273, lr=0.000154608843282864
2023-11-23 01:32:39   INFO  epoch: 23/30, acc_iter=151601, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:41:25, time_cost(all): 1 day, 17:08:08/13:07:34, loss=0.349659010957376, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=4.651176885273222, lr=0.000154288109100939
2023-11-23 01:33:28   INFO  epoch: 23/30, acc_iter=151651, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:44:52, time_cost(all): 1 day, 17:08:57/12:24:36, loss=0.349575896971067, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=3.6810321322116355, lr=0.000153967374919015
2023-11-23 01:34:17   INFO  epoch: 23/30, acc_iter=151701, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:41:57, time_cost(all): 1 day, 17:09:46/12:47:02, loss=0.349492782984758, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=2.403739360730434, lr=0.00015364664073709
2023-11-23 01:35:06   INFO  epoch: 23/30, acc_iter=151751, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:46:15, time_cost(all): 1 day, 17:10:35/12:24:58, loss=0.349409668998449, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=0.625689627519075, lr=0.000153325906555165
2023-11-23 01:35:55   INFO  epoch: 23/30, acc_iter=151801, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:38:34, time_cost(all): 1 day, 17:11:24/13:02:08, loss=0.34932655501214, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=0.5406578059871212, lr=0.000153005172373241
2023-11-23 01:36:44   INFO  epoch: 23/30, acc_iter=151851, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:39:21, time_cost(all): 1 day, 17:12:13/12:19:51, loss=0.349243441025831, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=2.168093875247888, lr=0.000152684438191316
2023-11-23 01:37:34   INFO  epoch: 23/30, acc_iter=151901, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:43:31, time_cost(all): 1 day, 17:13:03/12:32:33, loss=0.349160327039522, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=4.8261156931513876, lr=0.000152363704009391
2023-11-23 01:38:23   INFO  epoch: 23/30, acc_iter=151951, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:39:45, time_cost(all): 1 day, 17:13:52/12:42:49, loss=0.349077213053213, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=2.885583668498935, lr=0.000152042969827467
2023-11-23 01:39:12   INFO  epoch: 23/30, acc_iter=152001, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:40:32, time_cost(all): 1 day, 17:14:41/12:54:15, loss=0.348994099066904, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=1.4533570532882985, lr=0.000151722235645542
2023-11-23 01:40:01   INFO  epoch: 23/30, acc_iter=152051, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:41:56, time_cost(all): 1 day, 17:15:30/13:01:52, loss=0.348910985080595, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=4.153007403821909, lr=0.000151401501463617
2023-11-23 01:40:50   INFO  epoch: 23/30, acc_iter=152101, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:41:57, time_cost(all): 1 day, 17:16:19/12:23:10, loss=0.348827871094286, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=2.3763777727711655, lr=0.000151080767281692
2023-11-23 01:41:39   INFO  epoch: 23/30, acc_iter=152151, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:34:48, time_cost(all): 1 day, 17:17:08/12:59:35, loss=0.348744757107977, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=3.570472274622267, lr=0.000150760033099768
2023-11-23 01:42:28   INFO  epoch: 23/30, acc_iter=152201, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:38:20, time_cost(all): 1 day, 17:17:57/13:12:27, loss=0.348661643121668, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.0816479176085227, lr=0.000150439298917843
2023-11-23 01:43:17   INFO  epoch: 23/30, acc_iter=152251, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:33:32, time_cost(all): 1 day, 17:18:46/12:40:38, loss=0.348578529135359, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=0.7633472803173016, lr=0.000150118564735918
2023-11-23 01:44:07   INFO  epoch: 23/30, acc_iter=152301, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:32:46, time_cost(all): 1 day, 17:19:36/13:13:29, loss=0.34849541514905, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=2.206808861056973, lr=0.000149797830553994
2023-11-23 01:44:56   INFO  epoch: 23/30, acc_iter=152351, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:37:04, time_cost(all): 1 day, 17:20:25/13:00:09, loss=0.348412301162741, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=2.1675671688574005, lr=0.000149477096372069
2023-11-23 01:45:45   INFO  epoch: 23/30, acc_iter=152401, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:34:39, time_cost(all): 1 day, 17:21:14/12:52:36, loss=0.348329187176431, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=2.8367539894090577, lr=0.000149156362190144
2023-11-23 01:46:34   INFO  epoch: 23/30, acc_iter=152451, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:32:58, time_cost(all): 1 day, 17:22:03/12:28:05, loss=0.348246073190122, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.23(1.03), norm=4.204055844602132, lr=0.000148835628008219
2023-11-23 01:47:23   INFO  epoch: 23/30, acc_iter=152501, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:29:47, time_cost(all): 1 day, 17:22:52/13:06:16, loss=0.348162959203813, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=3.1227960576777316, lr=0.000148514893826295
2023-11-23 01:48:12   INFO  epoch: 23/30, acc_iter=152551, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:32:34, time_cost(all): 1 day, 17:23:41/12:00:55, loss=0.348079845217504, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=0.9587263230160953, lr=0.00014819415964437
2023-11-23 01:49:01   INFO  epoch: 23/30, acc_iter=152601, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:31:13, time_cost(all): 1 day, 17:24:30/11:54:38, loss=0.347996731231195, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=3.502795418076787, lr=0.000147873425462445
2023-11-23 01:49:50   INFO  epoch: 23/30, acc_iter=152651, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:27:34, time_cost(all): 1 day, 17:25:19/12:49:41, loss=0.347913617244886, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=3.953474711737376, lr=0.000147552691280521
2023-11-23 01:50:39   INFO  epoch: 23/30, acc_iter=152701, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:32:02, time_cost(all): 1 day, 17:26:08/12:32:11, loss=0.347830503258577, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=4.656468030677126, lr=0.000147231957098596
2023-11-23 01:51:29   INFO  epoch: 23/30, acc_iter=152751, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:23:54, time_cost(all): 1 day, 17:26:58/11:58:33, loss=0.347747389272268, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=3.7664662155791717, lr=0.000146911222916671
2023-11-23 01:52:18   INFO  epoch: 23/30, acc_iter=152801, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:23:00, time_cost(all): 1 day, 17:27:47/12:59:17, loss=0.347664275285959, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=4.170569394202204, lr=0.000146590488734747
2023-11-23 01:53:07   INFO  epoch: 23/30, acc_iter=152851, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:26:51, time_cost(all): 1 day, 17:28:36/12:20:24, loss=0.34758116129965, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=1.6805198987222043, lr=0.000146269754552822
2023-11-23 01:53:56   INFO  epoch: 23/30, acc_iter=152901, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:21:49, time_cost(all): 1 day, 17:29:25/12:45:57, loss=0.347498047313341, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=2.0177829460680337, lr=0.000145949020370897
2023-11-23 01:54:45   INFO  epoch: 23/30, acc_iter=152951, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:20:38, time_cost(all): 1 day, 17:30:14/12:38:10, loss=0.347414933327032, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=2.016179763390204, lr=0.000145628286188972
2023-11-23 01:55:34   INFO  epoch: 23/30, acc_iter=153001, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:26:29, time_cost(all): 1 day, 17:31:03/11:58:25, loss=0.347331819340723, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=2.2680797242964927, lr=0.000145307552007048
2023-11-23 01:56:23   INFO  epoch: 23/30, acc_iter=153051, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:18:30, time_cost(all): 1 day, 17:31:52/12:54:54, loss=0.347248705354414, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=1.2663762385173498, lr=0.000144986817825123
2023-11-23 01:57:12   INFO  epoch: 23/30, acc_iter=153101, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:18:56, time_cost(all): 1 day, 17:32:41/12:24:07, loss=0.347165591368105, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=1.7790388172026195, lr=0.000144666083643198
2023-11-23 01:58:01   INFO  epoch: 23/30, acc_iter=153151, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:19:23, time_cost(all): 1 day, 17:33:30/12:56:22, loss=0.347082477381796, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=3.6828011041493074, lr=0.000144345349461274
2023-11-23 01:58:51   INFO  epoch: 23/30, acc_iter=153201, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:17:25, time_cost(all): 1 day, 17:34:20/11:59:52, loss=0.346999363395486, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=2.3267915378343482, lr=0.000144024615279349
2023-11-23 01:59:40   INFO  epoch: 23/30, acc_iter=153251, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:22:08, time_cost(all): 1 day, 17:35:09/12:21:08, loss=0.346916249409177, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=2.3931277845502055, lr=0.000143703881097424
2023-11-23 02:00:29   INFO  epoch: 23/30, acc_iter=153301, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:18:07, time_cost(all): 1 day, 17:35:58/12:24:44, loss=0.346833135422868, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=0.6478814098211784, lr=0.000143383146915499
2023-11-23 02:01:18   INFO  epoch: 23/30, acc_iter=153351, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:18:35, time_cost(all): 1 day, 17:36:47/12:04:14, loss=0.346750021436559, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=2.137949829292948, lr=0.000143062412733575
2023-11-23 02:02:07   INFO  epoch: 23/30, acc_iter=153401, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:18:33, time_cost(all): 1 day, 17:37:36/12:32:07, loss=0.34666690745025, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=4.393433338265272, lr=0.00014274167855165
2023-11-23 02:02:56   INFO  epoch: 23/30, acc_iter=153451, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:16:25, time_cost(all): 1 day, 17:38:25/11:44:50, loss=0.346583793463941, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=4.954416646084085, lr=0.000142420944369725
2023-11-23 02:03:45   INFO  epoch: 23/30, acc_iter=153501, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:12:21, time_cost(all): 1 day, 17:39:14/12:30:27, loss=0.346500679477632, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=4.539018860088439, lr=0.000142100210187801
2023-11-23 02:04:34   INFO  epoch: 23/30, acc_iter=153551, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:11:56, time_cost(all): 1 day, 17:40:03/12:18:50, loss=0.346417565491323, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=3.6712036744495973, lr=0.000141779476005876
2023-11-23 02:05:24   INFO  epoch: 23/30, acc_iter=153601, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:15:48, time_cost(all): 1 day, 17:40:53/12:37:05, loss=0.346334451505014, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=3.5795153296701434, lr=0.000141458741823951
2023-11-23 02:06:13   INFO  epoch: 23/30, acc_iter=153651, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:11:44, time_cost(all): 1 day, 17:41:42/11:47:45, loss=0.346251337518705, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=4.706106452989457, lr=0.000141138007642027
2023-11-23 02:07:02   INFO  epoch: 23/30, acc_iter=153701, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:14:45, time_cost(all): 1 day, 17:42:31/12:28:27, loss=0.346168223532396, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=0.8912240743332658, lr=0.000140817273460102
2023-11-23 02:07:51   INFO  epoch: 23/30, acc_iter=153751, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:12:41, time_cost(all): 1 day, 17:43:20/12:19:37, loss=0.346085109546087, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=4.604620196219118, lr=0.000140496539278177
2023-11-23 02:08:40   INFO  epoch: 23/30, acc_iter=153801, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:07:41, time_cost(all): 1 day, 17:44:09/11:47:58, loss=0.346001995559778, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=4.929753597912762, lr=0.000140175805096252
2023-11-23 02:09:29   INFO  epoch: 23/30, acc_iter=153851, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:09:57, time_cost(all): 1 day, 17:44:58/12:33:25, loss=0.345918881573469, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=4.075231850515557, lr=0.000139855070914328
2023-11-23 02:10:18   INFO  epoch: 23/30, acc_iter=153901, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:12, time_cost(all): 1 day, 17:45:47/11:56:34, loss=0.34583576758716, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=3.212699503775447, lr=0.000139534336732403
2023-11-23 02:11:07   INFO  epoch: 23/30, acc_iter=153951, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:09:08, time_cost(all): 1 day, 17:46:36/12:34:17, loss=0.345752653600851, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=4.365461727171493, lr=0.000139213602550478
2023-11-23 02:11:56   INFO  epoch: 23/30, acc_iter=154001, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:05:46, time_cost(all): 1 day, 17:47:25/12:00:35, loss=0.345669539614541, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=1.1111554507996337, lr=0.000138892868368554
2023-11-23 02:12:46   INFO  epoch: 23/30, acc_iter=154051, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:05:27, time_cost(all): 1 day, 17:48:15/12:28:19, loss=0.345586425628232, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=0.5095680054560098, lr=0.000138572134186629
2023-11-23 02:13:35   INFO  epoch: 23/30, acc_iter=154101, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:03:48, time_cost(all): 1 day, 17:49:04/12:09:28, loss=0.345503311641923, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=2.926574208372944, lr=0.000138251400004704
2023-11-23 02:14:24   INFO  epoch: 23/30, acc_iter=154151, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:03:05, time_cost(all): 1 day, 17:49:53/12:19:03, loss=0.345420197655614, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=2.516164863658335, lr=0.000137930665822779
2023-11-23 02:15:13   INFO  epoch: 23/30, acc_iter=154201, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:01:05, time_cost(all): 1 day, 17:50:42/11:32:39, loss=0.345337083669305, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=2.7902856996576677, lr=0.000137609931640855
2023-11-23 02:16:02   INFO  epoch: 23/30, acc_iter=154251, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:01:27, time_cost(all): 1 day, 17:51:31/11:28:29, loss=0.345253969682996, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=1.933440516410733, lr=0.00013728919745893
2023-11-23 02:16:51   INFO  epoch: 23/30, acc_iter=154301, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:04:51, time_cost(all): 1 day, 17:52:20/12:10:57, loss=0.345170855696687, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.7490849671243334, lr=0.000136968463277005
2023-11-23 02:17:40   INFO  epoch: 23/30, acc_iter=154351, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:01:30, time_cost(all): 1 day, 17:53:09/12:21:09, loss=0.345087741710378, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=0.567680923446545, lr=0.000136647729095081
2023-11-23 02:18:29   INFO  epoch: 23/30, acc_iter=154401, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:03:00, time_cost(all): 1 day, 17:53:58/11:37:42, loss=0.345004627724069, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=4.965861540486692, lr=0.000136326994913156
2023-11-23 02:19:19   INFO  epoch: 23/30, acc_iter=154451, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:56:38, time_cost(all): 1 day, 17:54:48/11:42:09, loss=0.34492151373776, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=2.821293352458828, lr=0.000136006260731231
2023-11-23 02:20:08   INFO  epoch: 23/30, acc_iter=154501, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:57:49, time_cost(all): 1 day, 17:55:37/11:36:13, loss=0.344838399751451, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=4.686091498878929, lr=0.000135685526549307
2023-11-23 02:20:57   INFO  epoch: 23/30, acc_iter=154551, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:55:52, time_cost(all): 1 day, 17:56:26/11:30:30, loss=0.344755285765142, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=2.6771203486993196, lr=0.000135364792367382
2023-11-23 02:21:46   INFO  epoch: 23/30, acc_iter=154601, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:58:57, time_cost(all): 1 day, 17:57:15/12:29:47, loss=0.344672171778833, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=2.8608274069681463, lr=0.000135044058185457
2023-11-23 02:22:35   INFO  epoch: 23/30, acc_iter=154651, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:55:48, time_cost(all): 1 day, 17:58:04/12:06:47, loss=0.344589057792524, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=4.6204575085005954, lr=0.000134723324003532
2023-11-23 02:23:24   INFO  epoch: 23/30, acc_iter=154701, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:54:38, time_cost(all): 1 day, 17:58:53/11:37:47, loss=0.344505943806215, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=1.067302532488257, lr=0.000134402589821608
2023-11-23 02:24:13   INFO  epoch: 23/30, acc_iter=154751, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:56:01, time_cost(all): 1 day, 17:59:42/11:43:41, loss=0.344422829819905, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=1.307276739504375, lr=0.000134081855639683
2023-11-23 02:25:02   INFO  epoch: 23/30, acc_iter=154801, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:51:15, time_cost(all): 1 day, 18:00:31/11:41:46, loss=0.344339715833596, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=1.1801456328980544, lr=0.000133761121457758
2023-11-23 02:25:51   INFO  epoch: 23/30, acc_iter=154851, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:53:44, time_cost(all): 1 day, 18:01:20/12:27:36, loss=0.344256601847287, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=2.856403296781221, lr=0.000133440387275834
2023-11-23 02:26:41   INFO  epoch: 23/30, acc_iter=154901, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:54:24, time_cost(all): 1 day, 18:02:10/12:01:47, loss=0.344173487860978, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=4.470076491236046, lr=0.000133119653093909
2023-11-23 02:27:30   INFO  epoch: 23/30, acc_iter=154951, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:50:47, time_cost(all): 1 day, 18:02:59/11:52:46, loss=0.344090373874669, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=4.582091940719277, lr=0.000132798918911984
2023-11-23 02:28:19   INFO  epoch: 23/30, acc_iter=155001, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:52:40, time_cost(all): 1 day, 18:03:48/12:16:42, loss=0.34400725988836, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=4.1351951920149945, lr=0.000132478184730059
2023-11-23 02:29:08   INFO  epoch: 23/30, acc_iter=155051, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:47:58, time_cost(all): 1 day, 18:04:37/11:16:52, loss=0.343924145902051, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=3.0704504351979276, lr=0.000132157450548135
2023-11-23 02:29:57   INFO  epoch: 23/30, acc_iter=155101, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:51:10, time_cost(all): 1 day, 18:05:26/12:00:52, loss=0.343841031915742, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=1.244816159050531, lr=0.00013183671636621
2023-11-23 02:30:46   INFO  epoch: 23/30, acc_iter=155151, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:46:49, time_cost(all): 1 day, 18:06:15/12:05:15, loss=0.343757917929433, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=2.508502188006221, lr=0.000131515982184285
2023-11-23 02:31:35   INFO  epoch: 23/30, acc_iter=155201, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:48:08, time_cost(all): 1 day, 18:07:04/11:43:04, loss=0.343674803943124, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.83(1.03), norm=2.232706285569822, lr=0.000131195248002361
2023-11-23 02:32:24   INFO  epoch: 23/30, acc_iter=155251, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:44:47, time_cost(all): 1 day, 18:07:53/11:54:35, loss=0.343591689956815, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=4.218787916484956, lr=0.000130874513820436
2023-11-23 02:33:14   INFO  epoch: 23/30, acc_iter=155301, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:46:19, time_cost(all): 1 day, 18:08:43/12:21:25, loss=0.343508575970506, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=1.054637959925024, lr=0.000130553779638511
2023-11-23 02:34:03   INFO  epoch: 23/30, acc_iter=155351, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:42:52, time_cost(all): 1 day, 18:09:32/11:16:05, loss=0.343425461984197, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=4.774840886371954, lr=0.000130233045456587
2023-11-23 02:34:52   INFO  epoch: 23/30, acc_iter=155401, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:44:03, time_cost(all): 1 day, 18:10:21/11:47:30, loss=0.343342347997888, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=1.8890099896778183, lr=0.000129912311274662
2023-11-23 02:35:41   INFO  epoch: 23/30, acc_iter=155451, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:44:59, time_cost(all): 1 day, 18:11:10/12:17:26, loss=0.343259234011579, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=0.7809489530386584, lr=0.000129591577092737
2023-11-23 02:36:30   INFO  epoch: 23/30, acc_iter=155501, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:41:21, time_cost(all): 1 day, 18:11:59/11:16:30, loss=0.34317612002527, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=3.8041601037961685, lr=0.000129270842910812
2023-11-23 02:37:19   INFO  epoch: 23/30, acc_iter=155551, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:43:19, time_cost(all): 1 day, 18:12:48/11:10:07, loss=0.34309300603896, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=4.9680006748502095, lr=0.000128950108728888
2023-11-23 02:38:08   INFO  epoch: 23/30, acc_iter=155601, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:42:02, time_cost(all): 1 day, 18:13:37/11:31:47, loss=0.343009892052651, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.1010914849862865, lr=0.000128629374546963
2023-11-23 02:38:57   INFO  epoch: 23/30, acc_iter=155651, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:02, time_cost(all): 1 day, 18:14:26/11:26:22, loss=0.342926778066342, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=4.508822115815366, lr=0.000128308640365038
2023-11-23 02:39:46   INFO  epoch: 23/30, acc_iter=155701, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:39:06, time_cost(all): 1 day, 18:15:15/11:52:22, loss=0.342843664080033, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=2.1547869429058375, lr=0.000127987906183114
2023-11-23 02:40:36   INFO  epoch: 23/30, acc_iter=155751, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:50, time_cost(all): 1 day, 18:16:05/11:45:54, loss=0.342760550093724, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=1.4085135027852176, lr=0.000127667172001189
2023-11-23 02:41:25   INFO  epoch: 23/30, acc_iter=155801, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:49, time_cost(all): 1 day, 18:16:54/11:35:43, loss=0.342677436107415, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=2.771688776224247, lr=0.000127346437819264
2023-11-23 02:42:14   INFO  epoch: 23/30, acc_iter=155851, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:35:14, time_cost(all): 1 day, 18:17:43/11:42:53, loss=0.342594322121106, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=2.6111599125748235, lr=0.000127025703637339
2023-11-23 02:43:03   INFO  epoch: 23/30, acc_iter=155901, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:14, time_cost(all): 1 day, 18:18:32/11:39:46, loss=0.342511208134797, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=3.645538597485065, lr=0.000126704969455415
2023-11-23 02:43:52   INFO  epoch: 23/30, acc_iter=155951, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:34:34, time_cost(all): 1 day, 18:19:21/11:03:44, loss=0.342428094148488, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=2.3010470023458174, lr=0.00012638423527349
2023-11-23 02:44:41   INFO  epoch: 23/30, acc_iter=156001, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:35:19, time_cost(all): 1 day, 18:20:10/11:08:13, loss=0.342344980162179, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=3.6594251769926482, lr=0.000126063501091565
2023-11-23 02:45:30   INFO  epoch: 23/30, acc_iter=156051, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:38, time_cost(all): 1 day, 18:20:59/11:32:44, loss=0.34226186617587, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=3.9179391678483655, lr=0.000125742766909641
2023-11-23 02:46:19   INFO  epoch: 23/30, acc_iter=156101, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:20, time_cost(all): 1 day, 18:21:48/11:02:10, loss=0.342178752189561, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=1.9066963167850715, lr=0.000125422032727716
2023-11-23 02:47:09   INFO  epoch: 23/30, acc_iter=156151, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:58, time_cost(all): 1 day, 18:22:38/11:16:23, loss=0.342095638203252, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=2.958146809856307, lr=0.000125101298545791
2023-11-23 02:47:58   INFO  epoch: 23/30, acc_iter=156201, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:30:49, time_cost(all): 1 day, 18:23:27/11:14:02, loss=0.342012524216943, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=3.507948028492333, lr=0.000124780564363867
2023-11-23 02:48:47   INFO  epoch: 23/30, acc_iter=156251, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:28:41, time_cost(all): 1 day, 18:24:16/11:27:24, loss=0.341929410230634, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=4.950957500207459, lr=0.000124459830181942
2023-11-23 02:49:36   INFO  epoch: 23/30, acc_iter=156301, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:53, time_cost(all): 1 day, 18:25:05/11:59:38, loss=0.341846296244324, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=2.7427194105779544, lr=0.000124139096000017
2023-11-23 02:50:25   INFO  epoch: 23/30, acc_iter=156351, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:27:09, time_cost(all): 1 day, 18:25:54/11:48:03, loss=0.341763182258015, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=2.0719061453127257, lr=0.000123818361818092
2023-11-23 02:51:14   INFO  epoch: 23/30, acc_iter=156401, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:49, time_cost(all): 1 day, 18:26:43/11:57:33, loss=0.341680068271706, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=1.6849047446806051, lr=0.000123497627636168
2023-11-23 02:52:03   INFO  epoch: 23/30, acc_iter=156451, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:07, time_cost(all): 1 day, 18:27:32/11:24:13, loss=0.341596954285397, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=4.436752125042792, lr=0.000123176893454243
2023-11-23 02:52:52   INFO  epoch: 23/30, acc_iter=156501, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:27:02, time_cost(all): 1 day, 18:28:21/11:37:49, loss=0.341513840299088, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=1.8224388434776895, lr=0.000122856159272318
2023-11-23 02:53:41   INFO  epoch: 23/30, acc_iter=156551, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:13, time_cost(all): 1 day, 18:29:10/11:17:10, loss=0.341430726312779, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=1.664669103033336, lr=0.000122535425090394
2023-11-23 02:54:31   INFO  epoch: 23/30, acc_iter=156601, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:25:16, time_cost(all): 1 day, 18:30:00/10:53:01, loss=0.34134761232647, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.255846658305804, lr=0.000122214690908469
2023-11-23 02:55:20   INFO  epoch: 23/30, acc_iter=156651, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:23:28, time_cost(all): 1 day, 18:30:49/11:38:26, loss=0.341264498340161, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=2.6459279531223476, lr=0.000121893956726544
2023-11-23 02:56:09   INFO  epoch: 23/30, acc_iter=156701, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:13, time_cost(all): 1 day, 18:31:38/11:45:12, loss=0.341181384353852, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=1.0104648842235284, lr=0.000121573222544619
2023-11-23 02:56:58   INFO  epoch: 23/30, acc_iter=156751, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:34, time_cost(all): 1 day, 18:32:27/11:16:19, loss=0.341098270367543, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=3.8162695141543352, lr=0.000121252488362695
2023-11-23 02:57:47   INFO  epoch: 23/30, acc_iter=156801, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:22:02, time_cost(all): 1 day, 18:33:16/11:43:45, loss=0.341015156381234, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=3.9288640207470804, lr=0.00012093175418077
2023-11-23 02:58:36   INFO  epoch: 23/30, acc_iter=156851, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:53, time_cost(all): 1 day, 18:34:05/11:36:19, loss=0.340932042394925, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=4.56830162822896, lr=0.000120611019998845
2023-11-23 02:59:25   INFO  epoch: 23/30, acc_iter=156901, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:34, time_cost(all): 1 day, 18:34:54/11:17:06, loss=0.340848928408616, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=4.929565865489005, lr=0.000120290285816921
2023-11-23 03:00:14   INFO  epoch: 23/30, acc_iter=156951, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:38, time_cost(all): 1 day, 18:35:43/10:56:24, loss=0.340765814422307, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=1.4426067163921585, lr=0.000119969551634996
2023-11-23 03:01:04   INFO  epoch: 23/30, acc_iter=157001, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:55, time_cost(all): 1 day, 18:36:33/11:43:49, loss=0.340682700435998, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.8861575945442777, lr=0.000119648817453071
2023-11-23 03:01:53   INFO  epoch: 23/30, acc_iter=157051, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:49, time_cost(all): 1 day, 18:37:22/11:44:59, loss=0.340599586449689, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=4.384614564282346, lr=0.000119328083271147
2023-11-23 03:02:42   INFO  epoch: 23/30, acc_iter=157101, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:38, time_cost(all): 1 day, 18:38:11/11:36:13, loss=0.34051647246338, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=4.968063923710224, lr=0.000119007349089222
2023-11-23 03:03:31   INFO  epoch: 23/30, acc_iter=157151, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:18, time_cost(all): 1 day, 18:39:00/10:44:49, loss=0.34043335847707, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=4.642451768544167, lr=0.000118686614907297
2023-11-23 03:04:20   INFO  epoch: 23/30, acc_iter=157201, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:47, time_cost(all): 1 day, 18:39:49/11:09:05, loss=0.340350244490761, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=1.8749527331313296, lr=0.000118365880725372
2023-11-23 03:05:09   INFO  epoch: 23/30, acc_iter=157251, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:44, time_cost(all): 1 day, 18:40:38/11:05:53, loss=0.340267130504452, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=1.874246642745148, lr=0.000118045146543448
2023-11-23 03:05:58   INFO  epoch: 23/30, acc_iter=157301, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:22, time_cost(all): 1 day, 18:41:27/11:47:09, loss=0.340184016518143, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=0.6812786389792295, lr=0.000117724412361523
2023-11-23 03:06:47   INFO  epoch: 23/30, acc_iter=157351, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:35, time_cost(all): 1 day, 18:42:16/10:44:59, loss=0.340100902531834, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=2.16693853976891, lr=0.000117403678179598
2023-11-23 03:07:36   INFO  epoch: 23/30, acc_iter=157401, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:19, time_cost(all): 1 day, 18:43:05/11:30:51, loss=0.340017788545525, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=0.5502819569376196, lr=0.000117082943997674
2023-11-23 03:08:26   INFO  epoch: 23/30, acc_iter=157451, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:52, time_cost(all): 1 day, 18:43:55/11:17:07, loss=0.339934674559216, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=2.571693897754639, lr=0.000116762209815749
2023-11-23 03:09:15   INFO  epoch: 23/30, acc_iter=157501, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:34, time_cost(all): 1 day, 18:44:44/11:13:44, loss=0.339851560572907, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=1.676273904908727, lr=0.000116441475633824
2023-11-23 03:10:04   INFO  epoch: 23/30, acc_iter=157551, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:22, time_cost(all): 1 day, 18:45:33/11:25:04, loss=0.339768446586598, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=0.9252661779432012, lr=0.0001161207414519
2023-11-23 03:10:53   INFO  epoch: 23/30, acc_iter=157601, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:15, time_cost(all): 1 day, 18:46:22/11:12:29, loss=0.339685332600289, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=4.73252240513125, lr=0.000115800007269975
2023-11-23 03:11:42   INFO  epoch: 23/30, acc_iter=157651, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:01, time_cost(all): 1 day, 18:47:11/11:03:09, loss=0.33960221861398, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=1.7625312232907584, lr=0.00011547927308805
2023-11-23 03:12:31   INFO  epoch: 23/30, acc_iter=157701, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:30, time_cost(all): 1 day, 18:48:00/10:44:43, loss=0.339519104627671, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=1.5994755631289332, lr=0.000115158538906125
2023-11-23 03:13:20   INFO  epoch: 23/30, acc_iter=157751, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:19, time_cost(all): 1 day, 18:48:49/11:24:09, loss=0.339435990641362, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=3.746348818417847, lr=0.000114837804724201
2023-11-23 03:14:09   INFO  epoch: 23/30, acc_iter=157801, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:40, time_cost(all): 1 day, 18:49:38/10:51:09, loss=0.339352876655053, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=3.9312899723102728, lr=0.000114517070542276
2023-11-23 03:14:59   INFO  epoch: 23/30, acc_iter=157851, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:56, time_cost(all): 1 day, 18:50:28/10:41:09, loss=0.339269762668744, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=4.14953023379964, lr=0.000114196336360351
2023-11-23 03:15:48   INFO  epoch: 23/30, acc_iter=157901, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:02, time_cost(all): 1 day, 18:51:17/11:12:06, loss=0.339186648682435, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.941978519857357, lr=0.000113875602178427
2023-11-23 03:16:37   INFO  epoch: 23/30, acc_iter=157951, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:09, time_cost(all): 1 day, 18:52:06/11:02:59, loss=0.339103534696125, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=2.119104675957508, lr=0.000113554867996502
2023-11-23 03:17:26   INFO  epoch: 23/30, acc_iter=158001, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:27, time_cost(all): 1 day, 18:52:55/10:42:58, loss=0.339020420709816, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=3.3843520953323365, lr=0.000113234133814577
2023-11-23 03:18:15   INFO  epoch: 23/30, acc_iter=158051, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:38, time_cost(all): 1 day, 18:53:44/11:20:23, loss=0.338937306723507, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=3.027424309184003, lr=0.000112913399632653
2023-11-23 03:19:04   INFO  epoch: 24/30, acc_iter=158138, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:42:36, time_cost(all): 1 day, 18:54:33/10:30:01, loss=0.33879268838733, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=1.9383634837374994, lr=0.000112355322156104
2023-11-23 03:19:53   INFO  epoch: 24/30, acc_iter=158188, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:44:01, time_cost(all): 1 day, 18:55:22/11:23:12, loss=0.338709574401021, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=3.548029318025566, lr=0.000112034587974179
2023-11-23 03:20:42   INFO  epoch: 24/30, acc_iter=158238, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:40:38, time_cost(all): 1 day, 18:56:11/10:47:45, loss=0.338626460414711, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=3.7871956307414214, lr=0.000111713853792254
2023-11-23 03:21:31   INFO  epoch: 24/30, acc_iter=158288, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:48:08, time_cost(all): 1 day, 18:57:00/10:49:18, loss=0.338543346428402, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=3.9933478878028827, lr=0.000111393119610329
2023-11-23 03:22:21   INFO  epoch: 24/30, acc_iter=158338, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:40:18, time_cost(all): 1 day, 18:57:50/10:56:04, loss=0.338460232442093, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=4.209102588858746, lr=0.000111072385428405
2023-11-23 03:23:10   INFO  epoch: 24/30, acc_iter=158388, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:39:11, time_cost(all): 1 day, 18:58:39/11:17:16, loss=0.338377118455784, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=2.2559473523226377, lr=0.00011075165124648
2023-11-23 03:23:59   INFO  epoch: 24/30, acc_iter=158438, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:42:30, time_cost(all): 1 day, 18:59:28/10:34:55, loss=0.338294004469475, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=3.971696228766248, lr=0.000110430917064555
2023-11-23 03:24:48   INFO  epoch: 24/30, acc_iter=158488, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:40:12, time_cost(all): 1 day, 19:00:17/10:41:24, loss=0.338210890483166, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=3.6741429664494163, lr=0.000110110182882631
2023-11-23 03:25:37   INFO  epoch: 24/30, acc_iter=158538, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:44:13, time_cost(all): 1 day, 19:01:06/10:31:30, loss=0.338127776496857, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=3.516525066639334, lr=0.000109789448700706
2023-11-23 03:26:26   INFO  epoch: 24/30, acc_iter=158588, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:39:25, time_cost(all): 1 day, 19:01:55/11:02:17, loss=0.338044662510548, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=3.514368526878351, lr=0.000109468714518781
2023-11-23 03:27:15   INFO  epoch: 24/30, acc_iter=158638, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:41:13, time_cost(all): 1 day, 19:02:44/10:21:24, loss=0.337961548524239, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=4.865388904593946, lr=0.000109147980336856
2023-11-23 03:28:04   INFO  epoch: 24/30, acc_iter=158688, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:36:06, time_cost(all): 1 day, 19:03:33/10:56:11, loss=0.33787843453793, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=3.15000261071673, lr=0.000108827246154932
2023-11-23 03:28:54   INFO  epoch: 24/30, acc_iter=158738, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:41:41, time_cost(all): 1 day, 19:04:23/11:12:34, loss=0.337795320551621, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=4.997425946991739, lr=0.000108506511973007
2023-11-23 03:29:43   INFO  epoch: 24/30, acc_iter=158788, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:34:40, time_cost(all): 1 day, 19:05:12/11:17:27, loss=0.337712206565312, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=1.9284315358416673, lr=0.000108185777791082
2023-11-23 03:30:32   INFO  epoch: 24/30, acc_iter=158838, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:38:47, time_cost(all): 1 day, 19:06:01/10:22:07, loss=0.337629092579003, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=4.303716015414453, lr=0.000107865043609158
2023-11-23 03:31:21   INFO  epoch: 24/30, acc_iter=158888, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:31:28, time_cost(all): 1 day, 19:06:50/11:14:34, loss=0.337545978592694, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=4.810168030521162, lr=0.000107544309427233
2023-11-23 03:32:10   INFO  epoch: 24/30, acc_iter=158938, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:33:41, time_cost(all): 1 day, 19:07:39/10:58:44, loss=0.337462864606385, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=1.8315279214184823, lr=0.000107223575245308
2023-11-23 03:32:59   INFO  epoch: 24/30, acc_iter=158988, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:31:10, time_cost(all): 1 day, 19:08:28/10:29:50, loss=0.337379750620076, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=1.948101026861188, lr=0.000106902841063384
2023-11-23 03:33:48   INFO  epoch: 24/30, acc_iter=159038, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:34:07, time_cost(all): 1 day, 19:09:17/10:24:52, loss=0.337296636633766, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=4.7862404000738925, lr=0.000106582106881459
2023-11-23 03:34:37   INFO  epoch: 24/30, acc_iter=159088, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:29:39, time_cost(all): 1 day, 19:10:06/11:17:28, loss=0.337213522647457, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=2.0254807160940493, lr=0.000106261372699534
2023-11-23 03:35:26   INFO  epoch: 24/30, acc_iter=159138, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:28:46, time_cost(all): 1 day, 19:10:55/11:07:08, loss=0.337130408661148, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=3.3379325947497964, lr=0.000105940638517609
2023-11-23 03:36:16   INFO  epoch: 24/30, acc_iter=159188, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:27:01, time_cost(all): 1 day, 19:11:45/10:20:43, loss=0.337047294674839, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=1.1889854727008928, lr=0.000105619904335685
2023-11-23 03:37:05   INFO  epoch: 24/30, acc_iter=159238, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:26:43, time_cost(all): 1 day, 19:12:34/11:05:37, loss=0.33696418068853, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=1.10304618771463, lr=0.00010529917015376
2023-11-23 03:37:54   INFO  epoch: 24/30, acc_iter=159288, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:26:54, time_cost(all): 1 day, 19:13:23/10:36:39, loss=0.336881066702221, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=4.702775642673851, lr=0.000104978435971835
2023-11-23 03:38:43   INFO  epoch: 24/30, acc_iter=159338, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:26:36, time_cost(all): 1 day, 19:14:12/10:32:24, loss=0.336797952715912, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=1.0722474498138042, lr=0.000104657701789911
2023-11-23 03:39:32   INFO  epoch: 24/30, acc_iter=159388, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:23:28, time_cost(all): 1 day, 19:15:01/10:45:29, loss=0.336714838729603, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=0.5000189986304748, lr=0.000104336967607986
2023-11-23 03:40:21   INFO  epoch: 24/30, acc_iter=159438, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:24:12, time_cost(all): 1 day, 19:15:50/10:44:40, loss=0.336631724743294, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=1.7189281788981314, lr=0.000104016233426061
2023-11-23 03:41:10   INFO  epoch: 24/30, acc_iter=159488, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:21:50, time_cost(all): 1 day, 19:16:39/10:15:20, loss=0.336548610756985, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=1.8691244438022907, lr=0.000103695499244136
2023-11-23 03:41:59   INFO  epoch: 24/30, acc_iter=159538, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:26:17, time_cost(all): 1 day, 19:17:28/10:28:25, loss=0.336465496770676, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=2.7679140994742077, lr=0.000103374765062212
2023-11-23 03:42:48   INFO  epoch: 24/30, acc_iter=159588, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:19:12, time_cost(all): 1 day, 19:18:17/10:38:39, loss=0.336382382784367, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=0.5541498734183135, lr=0.000103054030880287
2023-11-23 03:43:38   INFO  epoch: 24/30, acc_iter=159638, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:20:46, time_cost(all): 1 day, 19:19:07/10:18:29, loss=0.336299268798058, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=4.179199764371249, lr=0.000102733296698362
2023-11-23 03:44:27   INFO  epoch: 24/30, acc_iter=159688, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:17:49, time_cost(all): 1 day, 19:19:56/10:30:50, loss=0.336216154811749, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=1.7619129446999096, lr=0.000102412562516438
2023-11-23 03:45:16   INFO  epoch: 24/30, acc_iter=159738, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:17:44, time_cost(all): 1 day, 19:20:45/10:43:46, loss=0.33613304082544, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=1.982445996715664, lr=0.000102091828334513
2023-11-23 03:46:05   INFO  epoch: 24/30, acc_iter=159788, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:23:40, time_cost(all): 1 day, 19:21:34/10:52:31, loss=0.33604992683913, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.0(1.03), norm=3.223588960569846, lr=0.000101771094152588
2023-11-23 03:46:54   INFO  epoch: 24/30, acc_iter=159838, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:18:42, time_cost(all): 1 day, 19:22:23/10:32:32, loss=0.335966812852821, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=4.812280958009707, lr=0.000101450359970664
2023-11-23 03:47:43   INFO  epoch: 24/30, acc_iter=159888, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:20:20, time_cost(all): 1 day, 19:23:12/10:19:06, loss=0.335883698866512, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=1.5607189262186316, lr=0.000101129625788739
2023-11-23 03:48:32   INFO  epoch: 24/30, acc_iter=159938, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:20:48, time_cost(all): 1 day, 19:24:01/10:30:25, loss=0.335800584880203, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.18(1.03), norm=4.343284492066567, lr=0.000100808891606814
2023-11-23 03:49:21   INFO  epoch: 24/30, acc_iter=159988, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:20:03, time_cost(all): 1 day, 19:24:50/10:30:41, loss=0.335717470893894, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=1.4799074963728092, lr=0.000100488157424889
2023-11-23 03:50:11   INFO  epoch: 24/30, acc_iter=160038, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:19:26, time_cost(all): 1 day, 19:25:40/10:27:40, loss=0.335634356907585, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=1.1743705962762214, lr=0.000100167423242965
2023-11-23 03:51:00   INFO  epoch: 24/30, acc_iter=160088, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:15:14, time_cost(all): 1 day, 19:26:29/10:45:35, loss=0.335551242921276, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=4.915086772684015, lr=9.9919962671572e-05
2023-11-23 03:51:49   INFO  epoch: 24/30, acc_iter=160138, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:16:16, time_cost(all): 1 day, 19:27:18/10:34:50, loss=0.335468128934967, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=3.600882846436413, lr=9.9752520561891e-05
2023-11-23 03:52:38   INFO  epoch: 24/30, acc_iter=160188, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:15:47, time_cost(all): 1 day, 19:28:07/10:09:28, loss=0.335385014948658, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=1.940950349972002, lr=9.958507845221e-05
2023-11-23 03:53:27   INFO  epoch: 24/30, acc_iter=160238, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:12:37, time_cost(all): 1 day, 19:28:56/10:09:58, loss=0.335301900962349, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=3.837643118506547, lr=9.9417636342529e-05
2023-11-23 03:54:16   INFO  epoch: 24/30, acc_iter=160288, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:14:41, time_cost(all): 1 day, 19:29:45/10:19:03, loss=0.33521878697604, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=2.2738004289630336, lr=9.9250194232847e-05
2023-11-23 03:55:05   INFO  epoch: 24/30, acc_iter=160338, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:09:46, time_cost(all): 1 day, 19:30:34/10:33:20, loss=0.335135672989731, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=2.2268034838656785, lr=9.9082752123166e-05
2023-11-23 03:55:54   INFO  epoch: 24/30, acc_iter=160388, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:09:33, time_cost(all): 1 day, 19:31:23/10:02:58, loss=0.335052559003422, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=4.979616295707872, lr=9.8915310013485e-05
2023-11-23 03:56:43   INFO  epoch: 24/30, acc_iter=160438, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:07:53, time_cost(all): 1 day, 19:32:12/10:13:57, loss=0.334969445017113, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=2.4666408850099795, lr=9.8747867903803e-05
2023-11-23 03:57:33   INFO  epoch: 24/30, acc_iter=160488, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:38, time_cost(all): 1 day, 19:33:02/10:48:42, loss=0.334886331030804, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=0.9332083911059599, lr=9.8580425794122e-05
2023-11-23 03:58:22   INFO  epoch: 24/30, acc_iter=160538, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:36, time_cost(all): 1 day, 19:33:51/10:43:30, loss=0.334803217044495, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=4.786349268525034, lr=9.8412983684441e-05
2023-11-23 03:59:11   INFO  epoch: 24/30, acc_iter=160588, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:07:49, time_cost(all): 1 day, 19:34:40/10:29:08, loss=0.334720103058186, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=1.2829937892486185, lr=9.824554157476e-05
2023-11-23 04:00:00   INFO  epoch: 24/30, acc_iter=160638, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:03:27, time_cost(all): 1 day, 19:35:29/9:52:51, loss=0.334636989071876, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=2.5042775378558044, lr=9.8078099465078e-05
2023-11-23 04:00:49   INFO  epoch: 24/30, acc_iter=160688, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:04:34, time_cost(all): 1 day, 19:36:18/9:53:30, loss=0.334553875085567, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=4.234998590295391, lr=9.7910657355397e-05
2023-11-23 04:01:38   INFO  epoch: 24/30, acc_iter=160738, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:01:40, time_cost(all): 1 day, 19:37:07/10:39:52, loss=0.334470761099258, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=2.989844689669973, lr=9.7743215245716e-05
2023-11-23 04:02:27   INFO  epoch: 24/30, acc_iter=160788, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:03:53, time_cost(all): 1 day, 19:37:56/9:53:04, loss=0.334387647112949, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=3.8513934471961657, lr=9.7575773136034e-05
2023-11-23 04:03:16   INFO  epoch: 24/30, acc_iter=160838, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:00:19, time_cost(all): 1 day, 19:38:45/10:08:46, loss=0.33430453312664, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=4.880734525360687, lr=9.7408331026353e-05
2023-11-23 04:04:06   INFO  epoch: 24/30, acc_iter=160888, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/0:59:01, time_cost(all): 1 day, 19:39:35/10:12:10, loss=0.334221419140331, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=2.4016612473550105, lr=9.7240888916672e-05
2023-11-23 04:04:55   INFO  epoch: 24/30, acc_iter=160938, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:03:14, time_cost(all): 1 day, 19:40:24/10:03:19, loss=0.334138305154022, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=3.7178214157054588, lr=9.7073446806991e-05
2023-11-23 04:05:44   INFO  epoch: 24/30, acc_iter=160988, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:59:44, time_cost(all): 1 day, 19:41:13/9:44:33, loss=0.334055191167713, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=1.724122290469273, lr=9.6906004697309e-05
2023-11-23 04:06:33   INFO  epoch: 24/30, acc_iter=161038, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:00:05, time_cost(all): 1 day, 19:42:02/10:36:27, loss=0.333972077181404, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=3.3894212674264574, lr=9.6738562587628e-05
2023-11-23 04:07:22   INFO  epoch: 24/30, acc_iter=161088, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:58:05, time_cost(all): 1 day, 19:42:51/10:03:19, loss=0.333888963195095, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=1.6707876502916714, lr=9.6571120477947e-05
2023-11-23 04:08:11   INFO  epoch: 24/30, acc_iter=161138, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/1:00:23, time_cost(all): 1 day, 19:43:40/10:35:51, loss=0.333805849208786, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=1.167484247805637, lr=9.6403678368265e-05
2023-11-23 04:09:00   INFO  epoch: 24/30, acc_iter=161188, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:57:13, time_cost(all): 1 day, 19:44:29/10:05:23, loss=0.333722735222477, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=3.4495351978585553, lr=9.6236236258584e-05
2023-11-23 04:09:49   INFO  epoch: 24/30, acc_iter=161238, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:58:03, time_cost(all): 1 day, 19:45:18/9:48:49, loss=0.333639621236168, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=3.6312691855899706, lr=9.6068794148903e-05
2023-11-23 04:10:38   INFO  epoch: 24/30, acc_iter=161288, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:56:10, time_cost(all): 1 day, 19:46:07/10:27:03, loss=0.333556507249859, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=1.2033132963870161, lr=9.5901352039222e-05
2023-11-23 04:11:28   INFO  epoch: 24/30, acc_iter=161338, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:56:19, time_cost(all): 1 day, 19:46:57/10:33:00, loss=0.33347339326355, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=1.5072393670995923, lr=9.573390992954e-05
2023-11-23 04:12:17   INFO  epoch: 24/30, acc_iter=161388, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:52:57, time_cost(all): 1 day, 19:47:46/9:50:48, loss=0.333390279277241, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=0.8022432692167569, lr=9.5566467819859e-05
2023-11-23 04:13:06   INFO  epoch: 24/30, acc_iter=161438, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:55:24, time_cost(all): 1 day, 19:48:35/9:40:32, loss=0.333307165290931, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=2.0926248049749896, lr=9.5399025710178e-05
2023-11-23 04:13:55   INFO  epoch: 24/30, acc_iter=161488, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:52:35, time_cost(all): 1 day, 19:49:24/10:23:53, loss=0.333224051304622, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=4.687147830047107, lr=9.5231583600497e-05
2023-11-23 04:14:44   INFO  epoch: 24/30, acc_iter=161538, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:49:00, time_cost(all): 1 day, 19:50:13/10:03:09, loss=0.333140937318313, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.02(1.03), norm=1.8588032863539539, lr=9.5064141490815e-05
2023-11-23 04:15:33   INFO  epoch: 24/30, acc_iter=161588, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:52:01, time_cost(all): 1 day, 19:51:02/10:17:48, loss=0.333057823332004, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=4.014911950878824, lr=9.4896699381134e-05
2023-11-23 04:16:22   INFO  epoch: 24/30, acc_iter=161638, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:51, time_cost(all): 1 day, 19:51:51/10:30:04, loss=0.332974709345695, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=1.955934798014537, lr=9.4729257271453e-05
2023-11-23 04:17:11   INFO  epoch: 24/30, acc_iter=161688, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:49:34, time_cost(all): 1 day, 19:52:40/10:13:24, loss=0.332891595359386, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=3.019576971703639, lr=9.4561815161771e-05
2023-11-23 04:18:01   INFO  epoch: 24/30, acc_iter=161738, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:53, time_cost(all): 1 day, 19:53:30/9:40:50, loss=0.332808481373077, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=0.8497613617320907, lr=9.439437305209e-05
2023-11-23 04:18:50   INFO  epoch: 24/30, acc_iter=161788, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:59, time_cost(all): 1 day, 19:54:19/9:52:06, loss=0.332725367386768, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=4.135203632996317, lr=9.4226930942409e-05
2023-11-23 04:19:39   INFO  epoch: 24/30, acc_iter=161838, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:45:19, time_cost(all): 1 day, 19:55:08/9:45:28, loss=0.332642253400459, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=3.1069654113578817, lr=9.4059488832728e-05
2023-11-23 04:20:28   INFO  epoch: 24/30, acc_iter=161888, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:44:40, time_cost(all): 1 day, 19:55:57/9:40:41, loss=0.33255913941415, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=3.8565542413674834, lr=9.3892046723046e-05
2023-11-23 04:21:17   INFO  epoch: 24/30, acc_iter=161938, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:46:37, time_cost(all): 1 day, 19:56:46/10:08:49, loss=0.332476025427841, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=1.2056703530329895, lr=9.3724604613365e-05
2023-11-23 04:22:06   INFO  epoch: 24/30, acc_iter=161988, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:23, time_cost(all): 1 day, 19:57:35/9:40:19, loss=0.332392911441532, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=1.3331663856161529, lr=9.3557162503684e-05
2023-11-23 04:22:55   INFO  epoch: 24/30, acc_iter=162038, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:45:08, time_cost(all): 1 day, 19:58:24/10:08:51, loss=0.332309797455223, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=2.900541316455708, lr=9.3389720394002e-05
2023-11-23 04:23:44   INFO  epoch: 24/30, acc_iter=162088, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:44:19, time_cost(all): 1 day, 19:59:13/9:39:02, loss=0.332226683468914, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=2.1950412350217148, lr=9.3222278284321e-05
2023-11-23 04:24:33   INFO  epoch: 24/30, acc_iter=162138, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:41:14, time_cost(all): 1 day, 20:00:02/9:33:32, loss=0.332143569482605, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=2.177196565106047, lr=9.305483617464e-05
2023-11-23 04:25:23   INFO  epoch: 24/30, acc_iter=162188, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:38:57, time_cost(all): 1 day, 20:00:52/9:29:41, loss=0.332060455496295, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=2.7265937210349547, lr=9.2887394064959e-05
2023-11-23 04:26:12   INFO  epoch: 24/30, acc_iter=162238, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:57, time_cost(all): 1 day, 20:01:41/10:21:17, loss=0.331977341509986, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=2.3763441520853013, lr=9.2719951955277e-05
2023-11-23 04:27:01   INFO  epoch: 24/30, acc_iter=162288, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:37:55, time_cost(all): 1 day, 20:02:30/9:43:46, loss=0.331894227523677, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=4.847193112078862, lr=9.2552509845596e-05
2023-11-23 04:27:50   INFO  epoch: 24/30, acc_iter=162338, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:35, time_cost(all): 1 day, 20:03:19/9:43:58, loss=0.331811113537368, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=0.9653264003459061, lr=9.2385067735915e-05
2023-11-23 04:28:39   INFO  epoch: 24/30, acc_iter=162388, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:36:08, time_cost(all): 1 day, 20:04:08/9:51:19, loss=0.331727999551059, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=3.7269156321499204, lr=9.2217625626234e-05
2023-11-23 04:29:28   INFO  epoch: 24/30, acc_iter=162438, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:47, time_cost(all): 1 day, 20:04:57/10:02:57, loss=0.33164488556475, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=2.8616184162358946, lr=9.2050183516552e-05
2023-11-23 04:30:17   INFO  epoch: 24/30, acc_iter=162488, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:35:03, time_cost(all): 1 day, 20:05:46/9:34:58, loss=0.331561771578441, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=0.7041173761270458, lr=9.1882741406871e-05
2023-11-23 04:31:06   INFO  epoch: 24/30, acc_iter=162538, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:34:53, time_cost(all): 1 day, 20:06:35/10:06:25, loss=0.331478657592132, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=0.6141870947165384, lr=9.171529929719e-05
2023-11-23 04:31:56   INFO  epoch: 24/30, acc_iter=162588, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:31, time_cost(all): 1 day, 20:07:25/9:52:11, loss=0.331395543605823, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=4.6821878155234256, lr=9.1547857187508e-05
2023-11-23 04:32:45   INFO  epoch: 24/30, acc_iter=162638, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:34:04, time_cost(all): 1 day, 20:08:14/9:29:51, loss=0.331312429619514, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=3.3624760782328673, lr=9.1380415077827e-05
2023-11-23 04:33:34   INFO  epoch: 24/30, acc_iter=162688, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:28, time_cost(all): 1 day, 20:09:03/10:15:00, loss=0.331229315633205, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=4.92807856773024, lr=9.1212972968146e-05
2023-11-23 04:34:23   INFO  epoch: 24/30, acc_iter=162738, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:19, time_cost(all): 1 day, 20:09:52/9:38:22, loss=0.331146201646896, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=4.803911589120398, lr=9.1045530858465e-05
2023-11-23 04:35:12   INFO  epoch: 24/30, acc_iter=162788, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:30:48, time_cost(all): 1 day, 20:10:41/9:41:46, loss=0.331063087660587, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=0.775074990160906, lr=9.0878088748783e-05
2023-11-23 04:36:01   INFO  epoch: 24/30, acc_iter=162838, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:28:48, time_cost(all): 1 day, 20:11:30/9:53:04, loss=0.330979973674278, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=2.6227220679515115, lr=9.0710646639102e-05
2023-11-23 04:36:50   INFO  epoch: 24/30, acc_iter=162888, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:28:20, time_cost(all): 1 day, 20:12:19/10:09:51, loss=0.330896859687969, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=3.4035001965053233, lr=9.0543204529421e-05
2023-11-23 04:37:39   INFO  epoch: 24/30, acc_iter=162938, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:35, time_cost(all): 1 day, 20:13:08/9:37:52, loss=0.33081374570166, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=3.888275225292777, lr=9.0375762419739e-05
2023-11-23 04:38:28   INFO  epoch: 24/30, acc_iter=162988, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:26:42, time_cost(all): 1 day, 20:13:57/9:17:59, loss=0.33073063171535, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=2.9734293293198464, lr=9.0208320310058e-05
2023-11-23 04:39:18   INFO  epoch: 24/30, acc_iter=163038, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:34, time_cost(all): 1 day, 20:14:47/10:08:30, loss=0.330647517729041, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=2.9365499664738204, lr=9.0040878200377e-05
2023-11-23 04:40:07   INFO  epoch: 24/30, acc_iter=163088, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:25:23, time_cost(all): 1 day, 20:15:36/9:34:52, loss=0.330564403742732, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=4.3061306801135215, lr=8.9873436090696e-05
2023-11-23 04:40:56   INFO  epoch: 24/30, acc_iter=163138, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:58, time_cost(all): 1 day, 20:16:25/9:13:01, loss=0.330481289756423, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=3.267270495256983, lr=8.9705993981014e-05
2023-11-23 04:41:45   INFO  epoch: 24/30, acc_iter=163188, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:07, time_cost(all): 1 day, 20:17:14/9:21:05, loss=0.330398175770114, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=1.2334474705424934, lr=8.9538551871333e-05
2023-11-23 04:42:34   INFO  epoch: 24/30, acc_iter=163238, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:40, time_cost(all): 1 day, 20:18:03/9:25:49, loss=0.330315061783805, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=4.6082374685808185, lr=8.9371109761652e-05
2023-11-23 04:43:23   INFO  epoch: 24/30, acc_iter=163288, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:21:59, time_cost(all): 1 day, 20:18:52/9:18:39, loss=0.330231947797496, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=1.5718193157705918, lr=8.920366765197e-05
2023-11-23 04:44:12   INFO  epoch: 24/30, acc_iter=163338, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:49, time_cost(all): 1 day, 20:19:41/9:24:38, loss=0.330148833811187, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=1.4581274431053044, lr=8.9036225542289e-05
2023-11-23 04:45:01   INFO  epoch: 24/30, acc_iter=163388, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:11, time_cost(all): 1 day, 20:20:30/9:49:05, loss=0.330065719824878, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=1.4636402614619757, lr=8.8868783432608e-05
2023-11-23 04:45:51   INFO  epoch: 24/30, acc_iter=163438, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:21, time_cost(all): 1 day, 20:21:20/9:46:51, loss=0.329982605838569, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=1.1859888474166105, lr=8.8701341322927e-05
2023-11-23 04:46:40   INFO  epoch: 24/30, acc_iter=163488, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:19:08, time_cost(all): 1 day, 20:22:09/9:36:09, loss=0.32989949185226, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=0.8342596554773082, lr=8.8533899213245e-05
2023-11-23 04:47:29   INFO  epoch: 24/30, acc_iter=163538, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:14, time_cost(all): 1 day, 20:22:58/9:21:25, loss=0.329816377865951, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=2.3398482884746032, lr=8.8366457103564e-05
2023-11-23 04:48:18   INFO  epoch: 24/30, acc_iter=163588, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:12, time_cost(all): 1 day, 20:23:47/9:08:44, loss=0.329733263879642, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=0.6723722482307207, lr=8.8199014993883e-05
2023-11-23 04:49:07   INFO  epoch: 24/30, acc_iter=163638, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:21, time_cost(all): 1 day, 20:24:36/9:37:34, loss=0.329650149893333, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=0.8512213103919013, lr=8.8031572884202e-05
2023-11-23 04:49:56   INFO  epoch: 24/30, acc_iter=163688, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:26, time_cost(all): 1 day, 20:25:25/9:22:50, loss=0.329567035907024, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=4.383613238544565, lr=8.786413077452e-05
2023-11-23 04:50:45   INFO  epoch: 24/30, acc_iter=163738, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:28, time_cost(all): 1 day, 20:26:14/9:57:21, loss=0.329483921920715, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=4.879542616757375, lr=8.7696688664839e-05
2023-11-23 04:51:34   INFO  epoch: 24/30, acc_iter=163788, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:15:06, time_cost(all): 1 day, 20:27:03/9:38:39, loss=0.329400807934405, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=0.5945675084455517, lr=8.7529246555158e-05
2023-11-23 04:52:23   INFO  epoch: 24/30, acc_iter=163838, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:16, time_cost(all): 1 day, 20:27:52/9:55:25, loss=0.329317693948096, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=2.208065417241632, lr=8.7361804445476e-05
2023-11-23 04:53:13   INFO  epoch: 24/30, acc_iter=163888, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:40, time_cost(all): 1 day, 20:28:42/9:10:51, loss=0.329234579961787, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=4.106991838900461, lr=8.7194362335795e-05
2023-11-23 04:54:02   INFO  epoch: 24/30, acc_iter=163938, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:00, time_cost(all): 1 day, 20:29:31/9:52:48, loss=0.329151465975478, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=0.7762185274096349, lr=8.7026920226114e-05
2023-11-23 04:54:51   INFO  epoch: 24/30, acc_iter=163988, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:52, time_cost(all): 1 day, 20:30:20/9:16:52, loss=0.329068351989169, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=3.880103227028154, lr=8.6859478116433e-05
2023-11-23 04:55:40   INFO  epoch: 24/30, acc_iter=164038, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:26, time_cost(all): 1 day, 20:31:09/9:33:26, loss=0.32898523800286, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=0.8752506676448957, lr=8.6692036006751e-05
2023-11-23 04:56:29   INFO  epoch: 24/30, acc_iter=164088, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:43, time_cost(all): 1 day, 20:31:58/9:32:20, loss=0.328902124016551, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=3.488293768897346, lr=8.652459389707e-05
2023-11-23 04:57:18   INFO  epoch: 24/30, acc_iter=164138, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:09:07, time_cost(all): 1 day, 20:32:47/9:06:47, loss=0.328819010030242, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=1.9817786681891412, lr=8.6357151787389e-05
2023-11-23 04:58:07   INFO  epoch: 24/30, acc_iter=164188, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:15, time_cost(all): 1 day, 20:33:36/9:27:04, loss=0.328735896043933, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=2.8112402855348275, lr=8.6189709677707e-05
2023-11-23 04:58:56   INFO  epoch: 24/30, acc_iter=164238, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:06:51, time_cost(all): 1 day, 20:34:25/8:53:24, loss=0.328652782057624, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=3.2718639418081397, lr=8.6022267568026e-05
2023-11-23 04:59:46   INFO  epoch: 24/30, acc_iter=164288, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:31, time_cost(all): 1 day, 20:35:15/9:03:55, loss=0.328569668071315, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.327950339189522, lr=8.5854825458345e-05
2023-11-23 05:00:35   INFO  epoch: 24/30, acc_iter=164338, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:27, time_cost(all): 1 day, 20:36:04/9:17:55, loss=0.328486554085006, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=2.0333871548368982, lr=8.5687383348664e-05
2023-11-23 05:01:24   INFO  epoch: 24/30, acc_iter=164388, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:35, time_cost(all): 1 day, 20:36:53/9:10:12, loss=0.328403440098697, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=1.4758415098546118, lr=8.5519941238982e-05
2023-11-23 05:02:13   INFO  epoch: 24/30, acc_iter=164438, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:47, time_cost(all): 1 day, 20:37:42/9:02:21, loss=0.328320326112388, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=3.010414739750318, lr=8.5352499129301e-05
2023-11-23 05:03:02   INFO  epoch: 24/30, acc_iter=164488, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:07, time_cost(all): 1 day, 20:38:31/9:01:15, loss=0.328237212126079, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=2.4924582910980737, lr=8.518505701962e-05
2023-11-23 05:03:51   INFO  epoch: 24/30, acc_iter=164538, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:15, time_cost(all): 1 day, 20:39:20/9:41:04, loss=0.32815409813977, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.673903062004965, lr=8.5017614909938e-05
2023-11-23 05:04:40   INFO  epoch: 24/30, acc_iter=164588, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:25, time_cost(all): 1 day, 20:40:09/9:08:55, loss=0.32807098415346, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=2.00422561612227, lr=8.4850172800257e-05
2023-11-23 05:05:29   INFO  epoch: 24/30, acc_iter=164638, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:37, time_cost(all): 1 day, 20:40:58/8:48:41, loss=0.327987870167151, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.802556350623147, lr=8.4682730690576e-05
2023-11-23 05:06:18   INFO  epoch: 25/30, acc_iter=164725, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:45:17, time_cost(all): 1 day, 20:41:47/8:58:25, loss=0.327843251830974, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=1.972699165347403, lr=8.4391381419731e-05
2023-11-23 05:07:08   INFO  epoch: 25/30, acc_iter=164775, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:46:43, time_cost(all): 1 day, 20:42:37/9:20:19, loss=0.327760137844665, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=3.7916168200448648, lr=8.4223939310049e-05
2023-11-23 05:07:57   INFO  epoch: 25/30, acc_iter=164825, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:49:52, time_cost(all): 1 day, 20:43:26/9:14:19, loss=0.327677023858356, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=0.5573709550405658, lr=8.4056497200368e-05
2023-11-23 05:08:46   INFO  epoch: 25/30, acc_iter=164875, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:49:24, time_cost(all): 1 day, 20:44:15/9:16:14, loss=0.327593909872046, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=4.771495923035174, lr=8.3889055090687e-05
2023-11-23 05:09:35   INFO  epoch: 25/30, acc_iter=164925, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:47:17, time_cost(all): 1 day, 20:45:04/9:11:36, loss=0.327510795885737, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=0.8811111792850393, lr=8.3721612981005e-05
2023-11-23 05:10:24   INFO  epoch: 25/30, acc_iter=164975, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:41:25, time_cost(all): 1 day, 20:45:53/8:51:15, loss=0.327427681899428, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=2.194696471684828, lr=8.3554170871324e-05
2023-11-23 05:11:13   INFO  epoch: 25/30, acc_iter=165025, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:43:25, time_cost(all): 1 day, 20:46:42/9:32:14, loss=0.327344567913119, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=0.5906264817666991, lr=8.3386728761643e-05
2023-11-23 05:12:02   INFO  epoch: 25/30, acc_iter=165075, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:45:15, time_cost(all): 1 day, 20:47:31/9:21:21, loss=0.32726145392681, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=4.167587541196199, lr=8.3219286651962e-05
2023-11-23 05:12:51   INFO  epoch: 25/30, acc_iter=165125, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:41:45, time_cost(all): 1 day, 20:48:20/9:02:33, loss=0.327178339940501, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=1.3549403610270696, lr=8.305184454228e-05
2023-11-23 05:13:41   INFO  epoch: 25/30, acc_iter=165175, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:34:44, time_cost(all): 1 day, 20:49:10/9:14:30, loss=0.327095225954192, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=4.387724014797831, lr=8.2884402432599e-05
2023-11-23 05:14:30   INFO  epoch: 25/30, acc_iter=165225, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:40:20, time_cost(all): 1 day, 20:49:59/8:39:46, loss=0.327012111967883, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=1.980526203339999, lr=8.2716960322918e-05
2023-11-23 05:15:19   INFO  epoch: 25/30, acc_iter=165275, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:33:14, time_cost(all): 1 day, 20:50:48/9:17:29, loss=0.326928997981574, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=2.7092581956782515, lr=8.2549518213236e-05
2023-11-23 05:16:08   INFO  epoch: 25/30, acc_iter=165325, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:41:52, time_cost(all): 1 day, 20:51:37/9:12:49, loss=0.326845883995265, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=1.5738948389143106, lr=8.2382076103555e-05
2023-11-23 05:16:57   INFO  epoch: 25/30, acc_iter=165375, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:34:50, time_cost(all): 1 day, 20:52:26/9:11:10, loss=0.326762770008956, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=0.5765562562747901, lr=8.2214633993874e-05
2023-11-23 05:17:46   INFO  epoch: 25/30, acc_iter=165425, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:33:40, time_cost(all): 1 day, 20:53:15/8:58:07, loss=0.326679656022647, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=1.3453731908531024, lr=8.2047191884193e-05
2023-11-23 05:18:35   INFO  epoch: 25/30, acc_iter=165475, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:30:28, time_cost(all): 1 day, 20:54:04/8:36:14, loss=0.326596542036338, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=4.482387364504778, lr=8.1879749774511e-05
2023-11-23 05:19:24   INFO  epoch: 25/30, acc_iter=165525, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:36:12, time_cost(all): 1 day, 20:54:53/8:42:36, loss=0.326513428050029, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=2.5712433416504443, lr=8.171230766483e-05
2023-11-23 05:20:13   INFO  epoch: 25/30, acc_iter=165575, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:36:28, time_cost(all): 1 day, 20:55:42/9:07:36, loss=0.32643031406372, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.23(1.03), norm=3.2445033378802366, lr=8.1544865555149e-05
2023-11-23 05:21:03   INFO  epoch: 25/30, acc_iter=165625, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:31:56, time_cost(all): 1 day, 20:56:32/9:15:16, loss=0.32634720007741, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=2.4083335750852104, lr=8.1377423445467e-05
2023-11-23 05:21:52   INFO  epoch: 25/30, acc_iter=165675, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:30:33, time_cost(all): 1 day, 20:57:21/9:07:41, loss=0.326264086091101, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=0.8504402409975326, lr=8.1209981335786e-05
2023-11-23 05:22:41   INFO  epoch: 25/30, acc_iter=165725, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:29:50, time_cost(all): 1 day, 20:58:10/8:36:32, loss=0.326180972104792, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=1.595590796326954, lr=8.1042539226105e-05
2023-11-23 05:23:30   INFO  epoch: 25/30, acc_iter=165775, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:30:31, time_cost(all): 1 day, 20:58:59/8:37:55, loss=0.326097858118483, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=2.008845857659839, lr=8.0875097116424e-05
2023-11-23 05:24:19   INFO  epoch: 25/30, acc_iter=165825, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:28:03, time_cost(all): 1 day, 20:59:48/9:16:19, loss=0.326014744132174, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=3.948352269891351, lr=8.0707655006742e-05
2023-11-23 05:25:08   INFO  epoch: 25/30, acc_iter=165875, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:29:24, time_cost(all): 1 day, 21:00:37/8:50:26, loss=0.325931630145865, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=1.5157247889966283, lr=8.0540212897061e-05
2023-11-23 05:25:57   INFO  epoch: 25/30, acc_iter=165925, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:28:03, time_cost(all): 1 day, 21:01:26/9:02:10, loss=0.325848516159556, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=2.089278697665862, lr=8.037277078738e-05
2023-11-23 05:26:46   INFO  epoch: 25/30, acc_iter=165975, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:26:57, time_cost(all): 1 day, 21:02:15/8:29:24, loss=0.325765402173247, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=3.3582514286372813, lr=8.0205328677699e-05
2023-11-23 05:27:35   INFO  epoch: 25/30, acc_iter=166025, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:28:57, time_cost(all): 1 day, 21:03:04/9:01:43, loss=0.325682288186938, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=3.0493007664435945, lr=8.0037886568017e-05
2023-11-23 05:28:25   INFO  epoch: 25/30, acc_iter=166075, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:22:11, time_cost(all): 1 day, 21:03:54/8:53:34, loss=0.325599174200629, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=3.29490966346926, lr=7.9870444458336e-05
2023-11-23 05:29:14   INFO  epoch: 25/30, acc_iter=166125, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:22:04, time_cost(all): 1 day, 21:04:43/9:11:22, loss=0.32551606021432, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=0.535511096857674, lr=7.9703002348655e-05
2023-11-23 05:30:03   INFO  epoch: 25/30, acc_iter=166175, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:21:55, time_cost(all): 1 day, 21:05:32/8:28:38, loss=0.325432946228011, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=1.5762846306803477, lr=7.9535560238973e-05
2023-11-23 05:30:52   INFO  epoch: 25/30, acc_iter=166225, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:20:23, time_cost(all): 1 day, 21:06:21/8:27:03, loss=0.325349832241702, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=4.374465713612187, lr=7.9368118129292e-05
2023-11-23 05:31:41   INFO  epoch: 25/30, acc_iter=166275, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:20:36, time_cost(all): 1 day, 21:07:10/9:06:15, loss=0.325266718255393, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=4.203464171048079, lr=7.9200676019611e-05
2023-11-23 05:32:30   INFO  epoch: 25/30, acc_iter=166325, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:17:55, time_cost(all): 1 day, 21:07:59/9:11:56, loss=0.325183604269084, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=2.862808634876373, lr=7.903323390993e-05
2023-11-23 05:33:19   INFO  epoch: 25/30, acc_iter=166375, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:17:44, time_cost(all): 1 day, 21:08:48/8:23:36, loss=0.325100490282775, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=2.054982843368397, lr=7.8865791800248e-05
2023-11-23 05:34:08   INFO  epoch: 25/30, acc_iter=166425, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:22:33, time_cost(all): 1 day, 21:09:37/9:09:06, loss=0.325017376296465, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=4.445958033852852, lr=7.8698349690567e-05
2023-11-23 05:34:58   INFO  epoch: 25/30, acc_iter=166475, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:15:22, time_cost(all): 1 day, 21:10:27/9:01:50, loss=0.324934262310156, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=4.597023033451081, lr=7.8530907580886e-05
2023-11-23 05:35:47   INFO  epoch: 25/30, acc_iter=166525, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:20:08, time_cost(all): 1 day, 21:11:16/8:59:41, loss=0.324851148323847, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=2.2661738707628456, lr=7.8363465471204e-05
2023-11-23 05:36:36   INFO  epoch: 25/30, acc_iter=166575, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:18:43, time_cost(all): 1 day, 21:12:05/8:22:38, loss=0.324768034337538, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=3.0466432649914994, lr=7.8196023361523e-05
2023-11-23 05:37:25   INFO  epoch: 25/30, acc_iter=166625, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:15:56, time_cost(all): 1 day, 21:12:54/8:19:46, loss=0.324684920351229, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=4.88207324361476, lr=7.8028581251842e-05
2023-11-23 05:38:14   INFO  epoch: 25/30, acc_iter=166675, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:16:34, time_cost(all): 1 day, 21:13:43/8:33:47, loss=0.32460180636492, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=0.8982218492697902, lr=7.7861139142161e-05
2023-11-23 05:39:03   INFO  epoch: 25/30, acc_iter=166725, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:15:29, time_cost(all): 1 day, 21:14:32/8:19:40, loss=0.324518692378611, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=1.9917448383402594, lr=7.7693697032479e-05
2023-11-23 05:39:52   INFO  epoch: 25/30, acc_iter=166775, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:12:03, time_cost(all): 1 day, 21:15:21/8:28:42, loss=0.324435578392302, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=2.009498105554881, lr=7.7526254922798e-05
2023-11-23 05:40:41   INFO  epoch: 25/30, acc_iter=166825, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:14:24, time_cost(all): 1 day, 21:16:10/8:49:52, loss=0.324352464405993, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.1652777588417713, lr=7.7358812813117e-05
2023-11-23 05:41:30   INFO  epoch: 25/30, acc_iter=166875, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:12:44, time_cost(all): 1 day, 21:16:59/8:16:45, loss=0.324269350419684, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=4.784520066097424, lr=7.7191370703435e-05
2023-11-23 05:42:20   INFO  epoch: 25/30, acc_iter=166925, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:10:11, time_cost(all): 1 day, 21:17:49/8:19:55, loss=0.324186236433375, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=4.507465256940219, lr=7.7023928593754e-05
2023-11-23 05:43:09   INFO  epoch: 25/30, acc_iter=166975, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:09:01, time_cost(all): 1 day, 21:18:38/8:25:36, loss=0.324103122447066, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=2.2522272527464313, lr=7.6856486484073e-05
2023-11-23 05:43:58   INFO  epoch: 25/30, acc_iter=167025, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:08:38, time_cost(all): 1 day, 21:19:27/8:59:29, loss=0.324020008460757, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=0.7792420973659537, lr=7.6689044374392e-05
2023-11-23 05:44:47   INFO  epoch: 25/30, acc_iter=167075, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:07:04, time_cost(all): 1 day, 21:20:16/8:29:27, loss=0.323936894474448, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=0.7061011519027784, lr=7.652160226471e-05
2023-11-23 05:45:36   INFO  epoch: 25/30, acc_iter=167125, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:07, time_cost(all): 1 day, 21:21:05/8:28:08, loss=0.323853780488139, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=3.814623408854171, lr=7.6354160155029e-05
2023-11-23 05:46:25   INFO  epoch: 25/30, acc_iter=167175, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:06:11, time_cost(all): 1 day, 21:21:54/8:36:08, loss=0.32377066650183, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=2.3693954624944364, lr=7.6186718045348e-05
2023-11-23 05:47:14   INFO  epoch: 25/30, acc_iter=167225, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:05:23, time_cost(all): 1 day, 21:22:43/8:30:18, loss=0.323687552515521, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=2.8230448677477966, lr=7.6019275935667e-05
2023-11-23 05:48:03   INFO  epoch: 25/30, acc_iter=167275, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:04:36, time_cost(all): 1 day, 21:23:32/8:14:43, loss=0.323604438529211, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.0527666895206638, lr=7.5851833825985e-05
2023-11-23 05:48:53   INFO  epoch: 25/30, acc_iter=167325, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:04:12, time_cost(all): 1 day, 21:24:22/8:54:21, loss=0.323521324542902, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=2.596245488901824, lr=7.5684391716304e-05
2023-11-23 05:49:42   INFO  epoch: 25/30, acc_iter=167375, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:05:22, time_cost(all): 1 day, 21:25:11/8:16:48, loss=0.323438210556593, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=3.9818092596474397, lr=7.5516949606623e-05
2023-11-23 05:50:31   INFO  epoch: 25/30, acc_iter=167425, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:04:34, time_cost(all): 1 day, 21:26:00/8:21:01, loss=0.323355096570284, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=1.4552727869545952, lr=7.5349507496941e-05
2023-11-23 05:51:20   INFO  epoch: 25/30, acc_iter=167475, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:03:13, time_cost(all): 1 day, 21:26:49/8:35:36, loss=0.323271982583975, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=4.944539045842187, lr=7.518206538726e-05
2023-11-23 05:52:09   INFO  epoch: 25/30, acc_iter=167525, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:02:22, time_cost(all): 1 day, 21:27:38/8:03:53, loss=0.323188868597666, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=2.12796189188053, lr=7.5014623277579e-05
2023-11-23 05:52:58   INFO  epoch: 25/30, acc_iter=167575, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:02:26, time_cost(all): 1 day, 21:28:27/8:31:19, loss=0.323105754611357, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=1.8794334383859193, lr=7.4847181167898e-05
2023-11-23 05:53:47   INFO  epoch: 25/30, acc_iter=167625, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/1:00:45, time_cost(all): 1 day, 21:29:16/8:34:31, loss=0.323022640625048, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=3.1520312942338866, lr=7.4679739058216e-05
2023-11-23 05:54:36   INFO  epoch: 25/30, acc_iter=167675, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:59:30, time_cost(all): 1 day, 21:30:05/8:11:35, loss=0.322939526638739, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=4.661480326767594, lr=7.4512296948535e-05
2023-11-23 05:55:25   INFO  epoch: 25/30, acc_iter=167725, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:59:20, time_cost(all): 1 day, 21:30:54/8:37:26, loss=0.32285641265243, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=4.0660340741084156, lr=7.4344854838854e-05
2023-11-23 05:56:15   INFO  epoch: 25/30, acc_iter=167775, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:57:05, time_cost(all): 1 day, 21:31:44/8:24:30, loss=0.322773298666121, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=4.301491800018983, lr=7.4177412729172e-05
2023-11-23 05:57:04   INFO  epoch: 25/30, acc_iter=167825, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:54:46, time_cost(all): 1 day, 21:32:33/8:20:56, loss=0.322690184679812, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=1.3392887653469565, lr=7.4009970619491e-05
2023-11-23 05:57:53   INFO  epoch: 25/30, acc_iter=167875, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:56:18, time_cost(all): 1 day, 21:33:22/8:08:20, loss=0.322607070693503, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=4.30772299451057, lr=7.384252850981e-05
2023-11-23 05:58:42   INFO  epoch: 25/30, acc_iter=167925, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:53:41, time_cost(all): 1 day, 21:34:11/8:12:55, loss=0.322523956707194, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=1.8570426916108906, lr=7.3675086400129e-05
2023-11-23 05:59:31   INFO  epoch: 25/30, acc_iter=167975, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:54:31, time_cost(all): 1 day, 21:35:00/8:31:54, loss=0.322440842720885, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=1.964593088190324, lr=7.3507644290447e-05
2023-11-23 06:00:20   INFO  epoch: 25/30, acc_iter=168025, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:53:42, time_cost(all): 1 day, 21:35:49/8:05:31, loss=0.322357728734575, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.92(1.03), norm=4.535003532436202, lr=7.3340202180766e-05
2023-11-23 06:01:09   INFO  epoch: 25/30, acc_iter=168075, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:51:59, time_cost(all): 1 day, 21:36:38/8:11:50, loss=0.322274614748266, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=1.2017509346363098, lr=7.3172760071085e-05
2023-11-23 06:01:58   INFO  epoch: 25/30, acc_iter=168125, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:48:53, time_cost(all): 1 day, 21:37:27/7:54:20, loss=0.322191500761957, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=3.9098586078907682, lr=7.3005317961404e-05
2023-11-23 06:02:48   INFO  epoch: 25/30, acc_iter=168175, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:50:40, time_cost(all): 1 day, 21:38:17/8:40:58, loss=0.322108386775648, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.99(1.03), norm=4.701453048366554, lr=7.2837875851722e-05
2023-11-23 06:03:37   INFO  epoch: 25/30, acc_iter=168225, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:00, time_cost(all): 1 day, 21:39:06/8:37:32, loss=0.322025272789339, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=3.2942735596027344, lr=7.2670433742041e-05
2023-11-23 06:04:26   INFO  epoch: 25/30, acc_iter=168275, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:49:43, time_cost(all): 1 day, 21:39:55/8:03:38, loss=0.32194215880303, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=4.811344788489637, lr=7.250299163236e-05
2023-11-23 06:05:15   INFO  epoch: 25/30, acc_iter=168325, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:48:12, time_cost(all): 1 day, 21:40:44/8:09:30, loss=0.321859044816721, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=4.645311027864083, lr=7.2335549522678e-05
2023-11-23 06:06:04   INFO  epoch: 25/30, acc_iter=168375, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:45:25, time_cost(all): 1 day, 21:41:33/8:10:18, loss=0.321775930830412, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=3.0938894829982644, lr=7.2168107412997e-05
2023-11-23 06:06:53   INFO  epoch: 25/30, acc_iter=168425, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:06, time_cost(all): 1 day, 21:42:22/8:01:10, loss=0.321692816844103, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=3.4961676343586015, lr=7.2000665303316e-05
2023-11-23 06:07:42   INFO  epoch: 25/30, acc_iter=168475, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:44:13, time_cost(all): 1 day, 21:43:11/8:34:56, loss=0.321609702857794, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=4.1555431270133525, lr=7.1833223193635e-05
2023-11-23 06:08:31   INFO  epoch: 25/30, acc_iter=168525, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:46:43, time_cost(all): 1 day, 21:44:00/8:24:16, loss=0.321526588871485, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=4.276907294879437, lr=7.1665781083953e-05
2023-11-23 06:09:20   INFO  epoch: 25/30, acc_iter=168575, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:43:36, time_cost(all): 1 day, 21:44:49/8:09:05, loss=0.321443474885176, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=1.3329868155939846, lr=7.1498338974272e-05
2023-11-23 06:10:10   INFO  epoch: 25/30, acc_iter=168625, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:42:22, time_cost(all): 1 day, 21:45:39/7:52:01, loss=0.321360360898867, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=4.979668170133533, lr=7.1330896864591e-05
2023-11-23 06:10:59   INFO  epoch: 25/30, acc_iter=168675, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:41:22, time_cost(all): 1 day, 21:46:28/7:52:40, loss=0.321277246912558, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.7627021707034813, lr=7.1163454754909e-05
2023-11-23 06:11:48   INFO  epoch: 25/30, acc_iter=168725, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:43:34, time_cost(all): 1 day, 21:47:17/8:00:38, loss=0.321194132926249, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=3.7409125584189566, lr=7.0996012645228e-05
2023-11-23 06:12:37   INFO  epoch: 25/30, acc_iter=168775, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:42:20, time_cost(all): 1 day, 21:48:06/8:25:08, loss=0.32111101893994, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=2.3774086487588484, lr=7.0828570535547e-05
2023-11-23 06:13:26   INFO  epoch: 25/30, acc_iter=168825, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:59, time_cost(all): 1 day, 21:48:55/7:45:49, loss=0.32102790495363, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=1.2587145728686397, lr=7.0661128425866e-05
2023-11-23 06:14:15   INFO  epoch: 25/30, acc_iter=168875, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:38:47, time_cost(all): 1 day, 21:49:44/8:09:35, loss=0.320944790967321, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=4.4677977555133745, lr=7.0493686316184e-05
2023-11-23 06:15:04   INFO  epoch: 25/30, acc_iter=168925, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:39:13, time_cost(all): 1 day, 21:50:33/7:46:25, loss=0.320861676981012, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=3.212071593884602, lr=7.0326244206503e-05
2023-11-23 06:15:53   INFO  epoch: 25/30, acc_iter=168975, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:23, time_cost(all): 1 day, 21:51:22/8:03:37, loss=0.320778562994703, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=4.5077125605693, lr=7.0158802096822e-05
2023-11-23 06:16:43   INFO  epoch: 25/30, acc_iter=169025, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:36:48, time_cost(all): 1 day, 21:52:12/8:10:11, loss=0.320695449008394, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=0.6295376742637262, lr=6.999135998714e-05
2023-11-23 06:17:32   INFO  epoch: 25/30, acc_iter=169075, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:30, time_cost(all): 1 day, 21:53:01/8:23:43, loss=0.320612335022085, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=2.6980628156662796, lr=6.9823917877459e-05
2023-11-23 06:18:21   INFO  epoch: 25/30, acc_iter=169125, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:36:04, time_cost(all): 1 day, 21:53:50/7:40:33, loss=0.320529221035776, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=1.2641250260359318, lr=6.9656475767778e-05
2023-11-23 06:19:10   INFO  epoch: 25/30, acc_iter=169175, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:39, time_cost(all): 1 day, 21:54:39/7:48:36, loss=0.320446107049467, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=3.34875586584693, lr=6.9489033658097e-05
2023-11-23 06:19:59   INFO  epoch: 25/30, acc_iter=169225, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:34:37, time_cost(all): 1 day, 21:55:28/8:11:47, loss=0.320362993063158, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=4.065904571222132, lr=6.9321591548415e-05
2023-11-23 06:20:48   INFO  epoch: 25/30, acc_iter=169275, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:41, time_cost(all): 1 day, 21:56:17/7:54:43, loss=0.320279879076849, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=4.164264553133547, lr=6.9154149438734e-05
2023-11-23 06:21:37   INFO  epoch: 25/30, acc_iter=169325, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:45, time_cost(all): 1 day, 21:57:06/8:10:17, loss=0.32019676509054, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=0.6334581357713805, lr=6.8986707329053e-05
2023-11-23 06:22:26   INFO  epoch: 25/30, acc_iter=169375, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:20, time_cost(all): 1 day, 21:57:55/8:15:58, loss=0.320113651104231, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=3.7119130099152216, lr=6.8819265219371e-05
2023-11-23 06:23:15   INFO  epoch: 25/30, acc_iter=169425, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:30:06, time_cost(all): 1 day, 21:58:44/7:58:49, loss=0.320030537117922, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=2.6075486246956863, lr=6.865182310969e-05
2023-11-23 06:24:05   INFO  epoch: 25/30, acc_iter=169475, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:11, time_cost(all): 1 day, 21:59:34/7:45:40, loss=0.319947423131613, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=0.7147597518590892, lr=6.8484381000009e-05
2023-11-23 06:24:54   INFO  epoch: 25/30, acc_iter=169525, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:27, time_cost(all): 1 day, 22:00:23/8:04:26, loss=0.319864309145304, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=2.045438974140642, lr=6.8316938890328e-05
2023-11-23 06:25:43   INFO  epoch: 25/30, acc_iter=169575, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:54, time_cost(all): 1 day, 22:01:12/8:17:25, loss=0.319781195158995, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=2.2514056297326026, lr=6.8149496780646e-05
2023-11-23 06:26:32   INFO  epoch: 25/30, acc_iter=169625, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:27:33, time_cost(all): 1 day, 22:02:01/7:59:07, loss=0.319698081172685, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=4.110191195145184, lr=6.7982054670965e-05
2023-11-23 06:27:21   INFO  epoch: 25/30, acc_iter=169675, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:10, time_cost(all): 1 day, 22:02:50/7:33:54, loss=0.319614967186376, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=1.5897497772690412, lr=6.7814612561284e-05
2023-11-23 06:28:10   INFO  epoch: 25/30, acc_iter=169725, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:05, time_cost(all): 1 day, 22:03:39/8:01:44, loss=0.319531853200067, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.5356068205933944, lr=6.7647170451603e-05
2023-11-23 06:28:59   INFO  epoch: 25/30, acc_iter=169775, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:59, time_cost(all): 1 day, 22:04:28/8:12:55, loss=0.319448739213758, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=1.083199165786389, lr=6.7479728341921e-05
2023-11-23 06:29:48   INFO  epoch: 25/30, acc_iter=169825, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:29, time_cost(all): 1 day, 22:05:17/7:52:01, loss=0.319365625227449, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=4.830068186040413, lr=6.731228623224e-05
2023-11-23 06:30:38   INFO  epoch: 25/30, acc_iter=169875, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:36, time_cost(all): 1 day, 22:06:07/7:49:49, loss=0.31928251124114, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=1.9949758477306478, lr=6.7144844122559e-05
2023-11-23 06:31:27   INFO  epoch: 25/30, acc_iter=169925, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:07, time_cost(all): 1 day, 22:06:56/8:04:43, loss=0.319199397254831, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=4.500452901229367, lr=6.6977402012877e-05
2023-11-23 06:32:16   INFO  epoch: 25/30, acc_iter=169975, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:59, time_cost(all): 1 day, 22:07:45/7:40:51, loss=0.319116283268522, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=1.4200506445567502, lr=6.6809959903196e-05
2023-11-23 06:33:05   INFO  epoch: 25/30, acc_iter=170025, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:49, time_cost(all): 1 day, 22:08:34/7:34:50, loss=0.319033169282213, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=1.0804611098988641, lr=6.6642517793515e-05
2023-11-23 06:33:54   INFO  epoch: 25/30, acc_iter=170075, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:18:41, time_cost(all): 1 day, 22:09:23/7:27:11, loss=0.318950055295904, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=4.447914722207292, lr=6.6475075683834e-05
2023-11-23 06:34:43   INFO  epoch: 25/30, acc_iter=170125, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:55, time_cost(all): 1 day, 22:10:12/7:34:01, loss=0.318866941309595, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=1.6208224127751478, lr=6.6307633574152e-05
2023-11-23 06:35:32   INFO  epoch: 25/30, acc_iter=170175, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:41, time_cost(all): 1 day, 22:11:01/8:02:41, loss=0.318783827323286, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=3.2093559701625787, lr=6.6140191464471e-05
2023-11-23 06:36:21   INFO  epoch: 25/30, acc_iter=170225, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:16:33, time_cost(all): 1 day, 22:11:50/7:26:26, loss=0.318700713336977, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=0.7810035309830428, lr=6.597274935479e-05
2023-11-23 06:37:10   INFO  epoch: 25/30, acc_iter=170275, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:24, time_cost(all): 1 day, 22:12:39/7:46:33, loss=0.318617599350668, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=2.156439192421787, lr=6.5805307245108e-05
2023-11-23 06:38:00   INFO  epoch: 25/30, acc_iter=170325, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:23, time_cost(all): 1 day, 22:13:29/7:36:45, loss=0.318534485364359, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=1.8779737339500413, lr=6.5637865135427e-05
2023-11-23 06:38:49   INFO  epoch: 25/30, acc_iter=170375, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:15:02, time_cost(all): 1 day, 22:14:18/7:49:59, loss=0.31845137137805, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.387276481304305, lr=6.5470423025746e-05
2023-11-23 06:39:38   INFO  epoch: 25/30, acc_iter=170425, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:03, time_cost(all): 1 day, 22:15:07/7:20:54, loss=0.31836825739174, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.9(1.03), norm=4.82378928879322, lr=6.5302980916065e-05
2023-11-23 06:40:27   INFO  epoch: 25/30, acc_iter=170475, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:09, time_cost(all): 1 day, 22:15:56/7:49:24, loss=0.318285143405431, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=4.145966761574096, lr=6.5135538806383e-05
2023-11-23 06:41:16   INFO  epoch: 25/30, acc_iter=170525, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:14, time_cost(all): 1 day, 22:16:45/7:55:10, loss=0.318202029419122, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=0.6016111260908212, lr=6.4968096696702e-05
2023-11-23 06:42:05   INFO  epoch: 25/30, acc_iter=170575, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:08, time_cost(all): 1 day, 22:17:34/7:58:00, loss=0.318118915432813, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.9138342612836867, lr=6.4800654587021e-05
2023-11-23 06:42:54   INFO  epoch: 25/30, acc_iter=170625, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:35, time_cost(all): 1 day, 22:18:23/7:45:15, loss=0.318035801446504, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=2.245819676810032, lr=6.463321247734e-05
2023-11-23 06:43:43   INFO  epoch: 25/30, acc_iter=170675, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:18, time_cost(all): 1 day, 22:19:12/7:33:17, loss=0.317952687460195, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=3.2913422216567625, lr=6.4465770367658e-05
2023-11-23 06:44:33   INFO  epoch: 25/30, acc_iter=170725, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:22, time_cost(all): 1 day, 22:20:02/7:22:49, loss=0.317869573473886, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.7203724879110123, lr=6.4298328257977e-05
2023-11-23 06:45:22   INFO  epoch: 25/30, acc_iter=170775, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:56, time_cost(all): 1 day, 22:20:51/7:37:27, loss=0.317786459487577, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=1.8443186692932108, lr=6.4130886148296e-05
2023-11-23 06:46:11   INFO  epoch: 25/30, acc_iter=170825, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:25, time_cost(all): 1 day, 22:21:40/7:46:00, loss=0.317703345501268, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=0.6883997216072224, lr=6.3963444038614e-05
2023-11-23 06:47:00   INFO  epoch: 25/30, acc_iter=170875, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:31, time_cost(all): 1 day, 22:22:29/7:52:07, loss=0.317620231514959, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=2.1612759153435963, lr=6.3796001928933e-05
2023-11-23 06:47:49   INFO  epoch: 25/30, acc_iter=170925, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:35, time_cost(all): 1 day, 22:23:18/7:52:01, loss=0.31753711752865, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=4.493151544105599, lr=6.3628559819252e-05
2023-11-23 06:48:38   INFO  epoch: 25/30, acc_iter=170975, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:40, time_cost(all): 1 day, 22:24:07/7:14:04, loss=0.317454003542341, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=1.8097183712299927, lr=6.3461117709571e-05
2023-11-23 06:49:27   INFO  epoch: 25/30, acc_iter=171025, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:50, time_cost(all): 1 day, 22:24:56/7:41:22, loss=0.317370889556032, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=4.308883234438573, lr=6.3293675599889e-05
2023-11-23 06:50:16   INFO  epoch: 25/30, acc_iter=171075, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:02:58, time_cost(all): 1 day, 22:25:45/7:10:21, loss=0.317287775569723, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=4.175661625993897, lr=6.3126233490208e-05
2023-11-23 06:51:05   INFO  epoch: 25/30, acc_iter=171125, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:11, time_cost(all): 1 day, 22:26:34/7:23:17, loss=0.317204661583414, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=4.852941045039609, lr=6.2958791380527e-05
2023-11-23 06:51:55   INFO  epoch: 25/30, acc_iter=171175, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:24, time_cost(all): 1 day, 22:27:24/7:14:19, loss=0.317121547597105, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=4.835314406464383, lr=6.2791349270845e-05
2023-11-23 06:52:44   INFO  epoch: 25/30, acc_iter=171225, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 1 day, 22:28:13/7:45:48, loss=0.317038433610795, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=2.788541803681684, lr=6.2623907161164e-05
2023-11-23 06:53:33   INFO  epoch: 26/30, acc_iter=171312, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:44:59, time_cost(all): 1 day, 22:29:02/7:05:27, loss=0.316893815274618, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=0.6462663019953075, lr=6.2332557890319e-05
2023-11-23 06:54:22   INFO  epoch: 26/30, acc_iter=171362, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:45:59, time_cost(all): 1 day, 22:29:51/7:30:46, loss=0.316810701288309, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=4.073460111227775, lr=6.2165115780637e-05
2023-11-23 06:55:11   INFO  epoch: 26/30, acc_iter=171412, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:49:42, time_cost(all): 1 day, 22:30:40/7:38:14, loss=0.316727587302, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=4.976169527506765, lr=6.1997673670956e-05
2023-11-23 06:56:00   INFO  epoch: 26/30, acc_iter=171462, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:39:57, time_cost(all): 1 day, 22:31:29/7:06:12, loss=0.316644473315691, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=0.8710486080552258, lr=6.1830231561275e-05
2023-11-23 06:56:49   INFO  epoch: 26/30, acc_iter=171512, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:41:02, time_cost(all): 1 day, 22:32:18/7:28:04, loss=0.316561359329381, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=4.4553086767300805, lr=6.1662789451594e-05
2023-11-23 06:57:38   INFO  epoch: 26/30, acc_iter=171562, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:45:38, time_cost(all): 1 day, 22:33:07/7:01:22, loss=0.316478245343072, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=4.718099970899888, lr=6.1495347341912e-05
2023-11-23 06:58:28   INFO  epoch: 26/30, acc_iter=171612, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:38:32, time_cost(all): 1 day, 22:33:57/7:21:33, loss=0.316395131356763, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=4.754869639163227, lr=6.1327905232231e-05
2023-11-23 06:59:17   INFO  epoch: 26/30, acc_iter=171662, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:42:05, time_cost(all): 1 day, 22:34:46/7:11:14, loss=0.316312017370454, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=2.9717618020295378, lr=6.116046312255e-05
2023-11-23 07:00:06   INFO  epoch: 26/30, acc_iter=171712, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:44:54, time_cost(all): 1 day, 22:35:35/7:27:01, loss=0.316228903384145, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=4.194539315285102, lr=6.0993021012868e-05
2023-11-23 07:00:55   INFO  epoch: 26/30, acc_iter=171762, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:44:23, time_cost(all): 1 day, 22:36:24/7:15:18, loss=0.316145789397836, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=1.616209060679537, lr=6.0825578903187e-05
2023-11-23 07:01:44   INFO  epoch: 26/30, acc_iter=171812, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:38:31, time_cost(all): 1 day, 22:37:13/7:19:26, loss=0.316062675411527, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=4.880387679728569, lr=6.0658136793506e-05
2023-11-23 07:02:33   INFO  epoch: 26/30, acc_iter=171862, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:35:58, time_cost(all): 1 day, 22:38:02/7:17:17, loss=0.315979561425218, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=4.745124299594791, lr=6.0490694683825e-05
2023-11-23 07:03:22   INFO  epoch: 26/30, acc_iter=171912, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:37:25, time_cost(all): 1 day, 22:38:51/7:24:17, loss=0.315896447438909, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=3.9205360470461725, lr=6.0323252574143e-05
2023-11-23 07:04:11   INFO  epoch: 26/30, acc_iter=171962, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:34:05, time_cost(all): 1 day, 22:39:40/7:34:31, loss=0.3158133334526, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=3.3810074780888453, lr=6.0155810464462e-05
2023-11-23 07:05:00   INFO  epoch: 26/30, acc_iter=172012, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:33:25, time_cost(all): 1 day, 22:40:29/7:16:10, loss=0.315730219466291, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.046019096382072, lr=5.9988368354781e-05
2023-11-23 07:05:50   INFO  epoch: 26/30, acc_iter=172062, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:35:54, time_cost(all): 1 day, 22:41:19/7:00:12, loss=0.315647105479982, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=0.7002616993874108, lr=5.98209262451e-05
2023-11-23 07:06:39   INFO  epoch: 26/30, acc_iter=172112, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:33:45, time_cost(all): 1 day, 22:42:08/7:09:05, loss=0.315563991493673, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=2.3994967303430124, lr=5.9653484135418e-05
2023-11-23 07:07:28   INFO  epoch: 26/30, acc_iter=172162, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:33:18, time_cost(all): 1 day, 22:42:57/7:30:23, loss=0.315480877507364, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=4.728277277810065, lr=5.9486042025737e-05
2023-11-23 07:08:17   INFO  epoch: 26/30, acc_iter=172212, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:34:26, time_cost(all): 1 day, 22:43:46/6:53:36, loss=0.315397763521055, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=4.802629433834639, lr=5.9318599916056e-05
2023-11-23 07:09:06   INFO  epoch: 26/30, acc_iter=172262, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:35:16, time_cost(all): 1 day, 22:44:35/7:24:15, loss=0.315314649534746, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=3.876887320826185, lr=5.9151157806374e-05
2023-11-23 07:09:55   INFO  epoch: 26/30, acc_iter=172312, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:34:22, time_cost(all): 1 day, 22:45:24/7:25:54, loss=0.315231535548436, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=3.894289459689639, lr=5.8983715696693e-05
2023-11-23 07:10:44   INFO  epoch: 26/30, acc_iter=172362, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:25:48, time_cost(all): 1 day, 22:46:13/7:27:04, loss=0.315148421562127, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=1.0115988825010873, lr=5.8816273587012e-05
2023-11-23 07:11:33   INFO  epoch: 26/30, acc_iter=172412, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:29:45, time_cost(all): 1 day, 22:47:02/6:47:36, loss=0.315065307575818, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.396594340462843, lr=5.8648831477331e-05
2023-11-23 07:12:22   INFO  epoch: 26/30, acc_iter=172462, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:29:13, time_cost(all): 1 day, 22:47:51/7:25:35, loss=0.314982193589509, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=1.2691775413675574, lr=5.8481389367649e-05
2023-11-23 07:13:12   INFO  epoch: 26/30, acc_iter=172512, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:27:01, time_cost(all): 1 day, 22:48:41/7:12:20, loss=0.3148990796032, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=0.8497899943387084, lr=5.8313947257968e-05
2023-11-23 07:14:01   INFO  epoch: 26/30, acc_iter=172562, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:30:21, time_cost(all): 1 day, 22:49:30/7:12:45, loss=0.314815965616891, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=4.097885993401976, lr=5.8146505148287e-05
2023-11-23 07:14:50   INFO  epoch: 26/30, acc_iter=172612, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:28:38, time_cost(all): 1 day, 22:50:19/7:12:43, loss=0.314732851630582, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=0.731247129754915, lr=5.7979063038605e-05
2023-11-23 07:15:39   INFO  epoch: 26/30, acc_iter=172662, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:21:36, time_cost(all): 1 day, 22:51:08/7:00:00, loss=0.314649737644273, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.5226643239922195, lr=5.7811620928924e-05
2023-11-23 07:16:28   INFO  epoch: 26/30, acc_iter=172712, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:27:33, time_cost(all): 1 day, 22:51:57/6:51:52, loss=0.314566623657964, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=1.567511874879643, lr=5.7644178819243e-05
2023-11-23 07:17:17   INFO  epoch: 26/30, acc_iter=172762, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:20:19, time_cost(all): 1 day, 22:52:46/6:46:18, loss=0.314483509671655, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=4.081312396949931, lr=5.7476736709562e-05
2023-11-23 07:18:06   INFO  epoch: 26/30, acc_iter=172812, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:23:05, time_cost(all): 1 day, 22:53:35/6:50:36, loss=0.314400395685346, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=3.299739009905191, lr=5.730929459988e-05
2023-11-23 07:18:55   INFO  epoch: 26/30, acc_iter=172862, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:18:14, time_cost(all): 1 day, 22:54:24/7:09:21, loss=0.314317281699037, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=1.4851185193884247, lr=5.7141852490199e-05
2023-11-23 07:19:45   INFO  epoch: 26/30, acc_iter=172912, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:21:40, time_cost(all): 1 day, 22:55:14/7:16:27, loss=0.314234167712728, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=1.807278678198271, lr=5.6974410380518e-05
2023-11-23 07:20:34   INFO  epoch: 26/30, acc_iter=172962, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:16:50, time_cost(all): 1 day, 22:56:03/6:43:53, loss=0.314151053726419, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=1.6520270099374932, lr=5.6806968270837e-05
2023-11-23 07:21:23   INFO  epoch: 26/30, acc_iter=173012, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:22:30, time_cost(all): 1 day, 22:56:52/7:07:29, loss=0.31406793974011, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=1.2042511794959163, lr=5.6639526161155e-05
2023-11-23 07:22:12   INFO  epoch: 26/30, acc_iter=173062, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:16:05, time_cost(all): 1 day, 22:57:41/7:08:32, loss=0.3139848257538, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.3584438252913928, lr=5.6472084051474e-05
2023-11-23 07:23:01   INFO  epoch: 26/30, acc_iter=173112, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:16:40, time_cost(all): 1 day, 22:58:30/6:44:21, loss=0.313901711767491, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=2.142249614515722, lr=5.6304641941793e-05
2023-11-23 07:23:50   INFO  epoch: 26/30, acc_iter=173162, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:18:28, time_cost(all): 1 day, 22:59:19/6:52:37, loss=0.313818597781182, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=3.745952076768005, lr=5.6137199832111e-05
2023-11-23 07:24:39   INFO  epoch: 26/30, acc_iter=173212, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:17:45, time_cost(all): 1 day, 23:00:08/7:13:20, loss=0.313735483794873, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=2.0731775528403875, lr=5.596975772243e-05
2023-11-23 07:25:28   INFO  epoch: 26/30, acc_iter=173262, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:13:30, time_cost(all): 1 day, 23:00:57/7:10:14, loss=0.313652369808564, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=4.48506939339102, lr=5.5802315612749e-05
2023-11-23 07:26:17   INFO  epoch: 26/30, acc_iter=173312, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:15:36, time_cost(all): 1 day, 23:01:46/7:03:54, loss=0.313569255822255, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=4.269419018825392, lr=5.5634873503068e-05
2023-11-23 07:27:07   INFO  epoch: 26/30, acc_iter=173362, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:12:58, time_cost(all): 1 day, 23:02:36/7:04:29, loss=0.313486141835946, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=1.9449396880444687, lr=5.5467431393386e-05
2023-11-23 07:27:56   INFO  epoch: 26/30, acc_iter=173412, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:11:32, time_cost(all): 1 day, 23:03:25/7:09:05, loss=0.313403027849637, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=0.7329383857091438, lr=5.5299989283705e-05
2023-11-23 07:28:45   INFO  epoch: 26/30, acc_iter=173462, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:14:01, time_cost(all): 1 day, 23:04:14/6:43:38, loss=0.313319913863328, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=1.5999474611005855, lr=5.5132547174024e-05
2023-11-23 07:29:34   INFO  epoch: 26/30, acc_iter=173512, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:08:52, time_cost(all): 1 day, 23:05:03/6:52:59, loss=0.313236799877019, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=2.8143847448851154, lr=5.4965105064342e-05
2023-11-23 07:30:23   INFO  epoch: 26/30, acc_iter=173562, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:13:10, time_cost(all): 1 day, 23:05:52/6:56:05, loss=0.31315368589071, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=4.116431369436068, lr=5.4797662954661e-05
2023-11-23 07:31:12   INFO  epoch: 26/30, acc_iter=173612, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:09:05, time_cost(all): 1 day, 23:06:41/6:31:26, loss=0.313070571904401, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=3.9369558204400525, lr=5.463022084498e-05
2023-11-23 07:32:01   INFO  epoch: 26/30, acc_iter=173662, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:06:02, time_cost(all): 1 day, 23:07:30/6:49:18, loss=0.312987457918092, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=4.1629497917253, lr=5.4462778735299e-05
2023-11-23 07:32:50   INFO  epoch: 26/30, acc_iter=173712, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:07:06, time_cost(all): 1 day, 23:08:19/6:34:27, loss=0.312904343931783, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=3.4219143186795073, lr=5.4295336625617e-05
2023-11-23 07:33:40   INFO  epoch: 26/30, acc_iter=173762, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:09:49, time_cost(all): 1 day, 23:09:09/7:02:43, loss=0.312821229945474, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=3.693585636086225, lr=5.4127894515936e-05
2023-11-23 07:34:29   INFO  epoch: 26/30, acc_iter=173812, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:09:07, time_cost(all): 1 day, 23:09:58/6:43:01, loss=0.312738115959165, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=1.4124008747886427, lr=5.3960452406255e-05
2023-11-23 07:35:18   INFO  epoch: 26/30, acc_iter=173862, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:06:42, time_cost(all): 1 day, 23:10:47/6:59:27, loss=0.312655001972855, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=3.4643401879181437, lr=5.3793010296573e-05
2023-11-23 07:36:07   INFO  epoch: 26/30, acc_iter=173912, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:01:57, time_cost(all): 1 day, 23:11:36/6:43:11, loss=0.312571887986546, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=1.3392320309548402, lr=5.3625568186892e-05
2023-11-23 07:36:56   INFO  epoch: 26/30, acc_iter=173962, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:01:30, time_cost(all): 1 day, 23:12:25/6:53:50, loss=0.312488774000237, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=2.74577487663922, lr=5.3458126077211e-05
2023-11-23 07:37:45   INFO  epoch: 26/30, acc_iter=174012, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:05:51, time_cost(all): 1 day, 23:13:14/7:01:13, loss=0.312405660013928, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.9519009986855276, lr=5.329068396753e-05
2023-11-23 07:38:34   INFO  epoch: 26/30, acc_iter=174062, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:00:09, time_cost(all): 1 day, 23:14:03/6:33:29, loss=0.312322546027619, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=1.8941591131658762, lr=5.3123241857848e-05
2023-11-23 07:39:23   INFO  epoch: 26/30, acc_iter=174112, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/0:59:28, time_cost(all): 1 day, 23:14:52/6:39:29, loss=0.31223943204131, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=3.6733346723137363, lr=5.2955799748167e-05
2023-11-23 07:40:12   INFO  epoch: 26/30, acc_iter=174162, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:01:15, time_cost(all): 1 day, 23:15:41/6:38:37, loss=0.312156318055001, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=3.955795318835031, lr=5.2788357638486e-05
2023-11-23 07:41:02   INFO  epoch: 26/30, acc_iter=174212, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:58:20, time_cost(all): 1 day, 23:16:31/6:41:29, loss=0.312073204068692, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=1.4014483915522562, lr=5.2620915528805e-05
2023-11-23 07:41:51   INFO  epoch: 26/30, acc_iter=174262, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:59:04, time_cost(all): 1 day, 23:17:20/6:31:04, loss=0.311990090082383, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=3.047410267305571, lr=5.2453473419123e-05
2023-11-23 07:42:40   INFO  epoch: 26/30, acc_iter=174312, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:57:04, time_cost(all): 1 day, 23:18:09/6:48:37, loss=0.311906976096074, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=1.4380870675467563, lr=5.2286031309442e-05
2023-11-23 07:43:29   INFO  epoch: 26/30, acc_iter=174362, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:56:39, time_cost(all): 1 day, 23:18:58/6:46:19, loss=0.311823862109765, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=2.36692360403556, lr=5.2118589199761e-05
2023-11-23 07:44:18   INFO  epoch: 26/30, acc_iter=174412, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:55:39, time_cost(all): 1 day, 23:19:47/6:48:41, loss=0.311740748123456, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=3.127919527118066, lr=5.1951147090079e-05
2023-11-23 07:45:07   INFO  epoch: 26/30, acc_iter=174462, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:57:50, time_cost(all): 1 day, 23:20:36/6:31:00, loss=0.311657634137147, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=0.504671043078003, lr=5.1783704980398e-05
2023-11-23 07:45:56   INFO  epoch: 26/30, acc_iter=174512, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:57:17, time_cost(all): 1 day, 23:21:25/6:32:33, loss=0.311574520150838, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=3.883089081522141, lr=5.1616262870717e-05
2023-11-23 07:46:45   INFO  epoch: 26/30, acc_iter=174562, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:33, time_cost(all): 1 day, 23:22:14/6:37:07, loss=0.311491406164529, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=0.7822725618231703, lr=5.1448820761036e-05
2023-11-23 07:47:35   INFO  epoch: 26/30, acc_iter=174612, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:53:06, time_cost(all): 1 day, 23:23:04/6:18:11, loss=0.31140829217822, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=4.852662598608516, lr=5.1281378651354e-05
2023-11-23 07:48:24   INFO  epoch: 26/30, acc_iter=174662, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:50:17, time_cost(all): 1 day, 23:23:53/6:23:09, loss=0.31132517819191, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=0.9378517920597649, lr=5.1113936541673e-05
2023-11-23 07:49:13   INFO  epoch: 26/30, acc_iter=174712, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:46, time_cost(all): 1 day, 23:24:42/6:43:23, loss=0.311242064205601, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=3.1227020877011, lr=5.0946494431992e-05
2023-11-23 07:50:02   INFO  epoch: 26/30, acc_iter=174762, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:51:50, time_cost(all): 1 day, 23:25:31/6:23:49, loss=0.311158950219292, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.2682142083412855, lr=5.077905232231e-05
2023-11-23 07:50:51   INFO  epoch: 26/30, acc_iter=174812, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:48:01, time_cost(all): 1 day, 23:26:20/6:34:00, loss=0.311075836232983, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=3.5534880657374748, lr=5.0611610212629e-05
2023-11-23 07:51:40   INFO  epoch: 26/30, acc_iter=174862, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:46:42, time_cost(all): 1 day, 23:27:09/6:14:08, loss=0.310992722246674, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=3.0329152495354954, lr=5.0444168102948e-05
2023-11-23 07:52:29   INFO  epoch: 26/30, acc_iter=174912, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:49:23, time_cost(all): 1 day, 23:27:58/6:08:34, loss=0.310909608260365, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=2.3364926461435154, lr=5.0276725993267e-05
2023-11-23 07:53:18   INFO  epoch: 26/30, acc_iter=174962, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:46:52, time_cost(all): 1 day, 23:28:47/6:37:37, loss=0.310826494274056, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=1.0954636922863148, lr=5.0109283883585e-05
2023-11-23 07:54:07   INFO  epoch: 26/30, acc_iter=175012, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:44:59, time_cost(all): 1 day, 23:29:36/6:10:44, loss=0.310743380287747, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=1.0844887685280313, lr=4.9941841773904e-05
2023-11-23 07:54:57   INFO  epoch: 26/30, acc_iter=175062, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:43:43, time_cost(all): 1 day, 23:30:26/6:30:53, loss=0.310660266301438, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=1.443119507314201, lr=4.9774399664223e-05
2023-11-23 07:55:46   INFO  epoch: 26/30, acc_iter=175112, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:43:03, time_cost(all): 1 day, 23:31:15/6:06:03, loss=0.310577152315129, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=3.5593532896597027, lr=4.9606957554541e-05
2023-11-23 07:56:35   INFO  epoch: 26/30, acc_iter=175162, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:44:46, time_cost(all): 1 day, 23:32:04/6:33:50, loss=0.31049403832882, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.033102645241155, lr=4.943951544486e-05
2023-11-23 07:57:24   INFO  epoch: 26/30, acc_iter=175212, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:42:52, time_cost(all): 1 day, 23:32:53/6:30:44, loss=0.310410924342511, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=3.72998650310178, lr=4.9272073335179e-05
2023-11-23 07:58:13   INFO  epoch: 26/30, acc_iter=175262, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:41:37, time_cost(all): 1 day, 23:33:42/6:04:08, loss=0.310327810356202, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=1.5669017030828662, lr=4.9104631225498e-05
2023-11-23 07:59:02   INFO  epoch: 26/30, acc_iter=175312, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:43, time_cost(all): 1 day, 23:34:31/6:26:49, loss=0.310244696369893, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=1.5284841210616427, lr=4.8937189115816e-05
2023-11-23 07:59:51   INFO  epoch: 26/30, acc_iter=175362, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:40:01, time_cost(all): 1 day, 23:35:20/6:23:05, loss=0.310161582383584, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=2.3709003894809415, lr=4.8769747006135e-05
2023-11-23 08:00:40   INFO  epoch: 26/30, acc_iter=175412, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:41:15, time_cost(all): 1 day, 23:36:09/6:13:44, loss=0.310078468397274, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=3.0803291866021048, lr=4.8602304896454e-05
2023-11-23 08:01:30   INFO  epoch: 26/30, acc_iter=175462, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:25, time_cost(all): 1 day, 23:36:59/6:00:44, loss=0.309995354410965, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.03(1.03), norm=0.6277221606141974, lr=4.8434862786773e-05
2023-11-23 08:02:19   INFO  epoch: 26/30, acc_iter=175512, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:30, time_cost(all): 1 day, 23:37:48/6:00:17, loss=0.309912240424656, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=1.4194931876520125, lr=4.8267420677091e-05
2023-11-23 08:03:08   INFO  epoch: 26/30, acc_iter=175562, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:57, time_cost(all): 1 day, 23:38:37/6:25:46, loss=0.309829126438347, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=1.201600079663307, lr=4.809997856741e-05
2023-11-23 08:03:57   INFO  epoch: 26/30, acc_iter=175612, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:35:23, time_cost(all): 1 day, 23:39:26/6:08:12, loss=0.309746012452038, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=4.869465694009863, lr=4.7932536457729e-05
2023-11-23 08:04:46   INFO  epoch: 26/30, acc_iter=175662, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:54, time_cost(all): 1 day, 23:40:15/6:15:46, loss=0.309662898465729, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=1.1669548851664007, lr=4.7765094348047e-05
2023-11-23 08:05:35   INFO  epoch: 26/30, acc_iter=175712, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:49, time_cost(all): 1 day, 23:41:04/6:02:07, loss=0.30957978447942, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=1.8057208823790307, lr=4.7597652238366e-05
2023-11-23 08:06:24   INFO  epoch: 26/30, acc_iter=175762, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:35:46, time_cost(all): 1 day, 23:41:53/6:18:28, loss=0.309496670493111, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=4.784789791742178, lr=4.7430210128685e-05
2023-11-23 08:07:13   INFO  epoch: 26/30, acc_iter=175812, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:32:13, time_cost(all): 1 day, 23:42:42/6:06:20, loss=0.309413556506802, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=1.730983849906953, lr=4.7262768019004e-05
2023-11-23 08:08:02   INFO  epoch: 26/30, acc_iter=175862, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:33:58, time_cost(all): 1 day, 23:43:31/6:20:03, loss=0.309330442520493, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=3.8736955817939185, lr=4.7095325909322e-05
2023-11-23 08:08:52   INFO  epoch: 26/30, acc_iter=175912, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:41, time_cost(all): 1 day, 23:44:21/5:52:56, loss=0.309247328534184, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=4.975506860278531, lr=4.6927883799641e-05
2023-11-23 08:09:41   INFO  epoch: 26/30, acc_iter=175962, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:52, time_cost(all): 1 day, 23:45:10/5:58:33, loss=0.309164214547875, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=2.413945641772325, lr=4.676044168996e-05
2023-11-23 08:10:30   INFO  epoch: 26/30, acc_iter=176012, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:33, time_cost(all): 1 day, 23:45:59/6:06:08, loss=0.309081100561566, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=1.7538647828767895, lr=4.6592999580278e-05
2023-11-23 08:11:19   INFO  epoch: 26/30, acc_iter=176062, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:30:33, time_cost(all): 1 day, 23:46:48/6:06:39, loss=0.308997986575257, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=3.7693209055449075, lr=4.6425557470597e-05
2023-11-23 08:12:08   INFO  epoch: 26/30, acc_iter=176112, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:27:32, time_cost(all): 1 day, 23:47:37/5:58:56, loss=0.308914872588948, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=3.1618121787712163, lr=4.6258115360916e-05
2023-11-23 08:12:57   INFO  epoch: 26/30, acc_iter=176162, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:54, time_cost(all): 1 day, 23:48:26/6:02:51, loss=0.308831758602639, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=1.784309586751764, lr=4.6090673251235e-05
2023-11-23 08:13:46   INFO  epoch: 26/30, acc_iter=176212, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:25:42, time_cost(all): 1 day, 23:49:15/6:12:45, loss=0.30874864461633, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=1.7550399107884165, lr=4.5923231141553e-05
2023-11-23 08:14:35   INFO  epoch: 26/30, acc_iter=176262, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:58, time_cost(all): 1 day, 23:50:04/6:22:49, loss=0.30866553063002, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.2(1.03), norm=4.925564610236557, lr=4.5755789031872e-05
2023-11-23 08:15:25   INFO  epoch: 26/30, acc_iter=176312, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:36, time_cost(all): 1 day, 23:50:54/5:55:35, loss=0.308582416643711, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=1.54190974967763, lr=4.5588346922191e-05
2023-11-23 08:16:14   INFO  epoch: 26/30, acc_iter=176362, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:29, time_cost(all): 1 day, 23:51:43/5:49:18, loss=0.308499302657402, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=0.7807200954405664, lr=4.5420904812509e-05
2023-11-23 08:17:03   INFO  epoch: 26/30, acc_iter=176412, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:24:24, time_cost(all): 1 day, 23:52:32/6:10:50, loss=0.308416188671093, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=2.043645873158056, lr=4.5253462702828e-05
2023-11-23 08:17:52   INFO  epoch: 26/30, acc_iter=176462, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:23, time_cost(all): 1 day, 23:53:21/6:17:50, loss=0.308333074684784, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=1.225834106941345, lr=4.5086020593147e-05
2023-11-23 08:18:41   INFO  epoch: 26/30, acc_iter=176512, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:22:58, time_cost(all): 1 day, 23:54:10/5:49:06, loss=0.308249960698475, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=2.193874104903546, lr=4.4918578483466e-05
2023-11-23 08:19:30   INFO  epoch: 26/30, acc_iter=176562, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:06, time_cost(all): 1 day, 23:54:59/6:06:13, loss=0.308166846712166, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=1.8165452884717848, lr=4.4751136373784e-05
2023-11-23 08:20:19   INFO  epoch: 26/30, acc_iter=176612, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:36, time_cost(all): 1 day, 23:55:48/6:10:15, loss=0.308083732725857, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=2.530685781548395, lr=4.4583694264103e-05
2023-11-23 08:21:08   INFO  epoch: 26/30, acc_iter=176662, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:10, time_cost(all): 1 day, 23:56:37/5:44:20, loss=0.308000618739548, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=4.743333040414576, lr=4.4416252154422e-05
2023-11-23 08:21:57   INFO  epoch: 26/30, acc_iter=176712, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:16, time_cost(all): 1 day, 23:57:26/6:13:06, loss=0.307917504753239, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=0.8682784730104973, lr=4.4248810044741e-05
2023-11-23 08:22:47   INFO  epoch: 26/30, acc_iter=176762, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:35, time_cost(all): 1 day, 23:58:16/6:14:47, loss=0.30783439076693, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=3.942825740588094, lr=4.4081367935059e-05
2023-11-23 08:23:36   INFO  epoch: 26/30, acc_iter=176812, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:15, time_cost(all): 1 day, 23:59:05/5:57:28, loss=0.307751276780621, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=3.694429339273882, lr=4.3913925825378e-05
2023-11-23 08:24:25   INFO  epoch: 26/30, acc_iter=176862, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:23, time_cost(all): 1 day, 23:59:54/6:02:16, loss=0.307668162794312, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.3922643544304245, lr=4.3746483715697e-05
2023-11-23 08:25:14   INFO  epoch: 26/30, acc_iter=176912, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:25, time_cost(all): 2 days, 0:00:43/6:12:12, loss=0.307585048808003, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=4.90840290580206, lr=4.3579041606015e-05
2023-11-23 08:26:03   INFO  epoch: 26/30, acc_iter=176962, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:48, time_cost(all): 2 days, 0:01:32/6:10:03, loss=0.307501934821694, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=4.962509435824609, lr=4.3411599496334e-05
2023-11-23 08:26:52   INFO  epoch: 26/30, acc_iter=177012, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:14:07, time_cost(all): 2 days, 0:02:21/6:08:44, loss=0.307418820835385, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=1.703863321170712, lr=4.3244157386653e-05
2023-11-23 08:27:41   INFO  epoch: 26/30, acc_iter=177062, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:54, time_cost(all): 2 days, 0:03:10/5:44:11, loss=0.307335706849075, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=1.741418802106365, lr=4.3076715276972e-05
2023-11-23 08:28:30   INFO  epoch: 26/30, acc_iter=177112, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:34, time_cost(all): 2 days, 0:03:59/5:57:17, loss=0.307252592862766, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=1.5064401295320493, lr=4.290927316729e-05
2023-11-23 08:29:20   INFO  epoch: 26/30, acc_iter=177162, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:57, time_cost(all): 2 days, 0:04:49/5:40:16, loss=0.307169478876457, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=4.357832309129384, lr=4.2741831057609e-05
2023-11-23 08:30:09   INFO  epoch: 26/30, acc_iter=177212, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:18, time_cost(all): 2 days, 0:05:38/6:04:30, loss=0.307086364890148, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=4.461490720220484, lr=4.2574388947928e-05
2023-11-23 08:30:58   INFO  epoch: 26/30, acc_iter=177262, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:34, time_cost(all): 2 days, 0:06:27/5:35:55, loss=0.307003250903839, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=1.4227051460686577, lr=4.2406946838246e-05
2023-11-23 08:31:47   INFO  epoch: 26/30, acc_iter=177312, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:09:02, time_cost(all): 2 days, 0:07:16/5:53:36, loss=0.30692013691753, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=2.1170520750388446, lr=4.2239504728565e-05
2023-11-23 08:32:36   INFO  epoch: 26/30, acc_iter=177362, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:43, time_cost(all): 2 days, 0:08:05/5:46:23, loss=0.306837022931221, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=4.388515282397843, lr=4.2072062618884e-05
2023-11-23 08:33:25   INFO  epoch: 26/30, acc_iter=177412, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:18, time_cost(all): 2 days, 0:08:54/5:53:25, loss=0.306753908944912, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=2.5208000629087244, lr=4.1904620509203e-05
2023-11-23 08:34:14   INFO  epoch: 26/30, acc_iter=177462, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:28, time_cost(all): 2 days, 0:09:43/5:44:29, loss=0.306670794958603, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=2.726283589789836, lr=4.1737178399521e-05
2023-11-23 08:35:03   INFO  epoch: 26/30, acc_iter=177512, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:29, time_cost(all): 2 days, 0:10:32/5:43:29, loss=0.306587680972294, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=4.212955750062591, lr=4.156973628984e-05
2023-11-23 08:35:52   INFO  epoch: 26/30, acc_iter=177562, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:52, time_cost(all): 2 days, 0:11:21/5:56:41, loss=0.306504566985985, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=3.527838489887656, lr=4.1402294180159e-05
2023-11-23 08:36:42   INFO  epoch: 26/30, acc_iter=177612, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:43, time_cost(all): 2 days, 0:12:11/5:42:51, loss=0.306421452999676, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=2.646731565296093, lr=4.1234852070478e-05
2023-11-23 08:37:31   INFO  epoch: 26/30, acc_iter=177662, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:11, time_cost(all): 2 days, 0:13:00/5:35:06, loss=0.306338339013367, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=4.419091777872213, lr=4.1067409960796e-05
2023-11-23 08:38:20   INFO  epoch: 26/30, acc_iter=177712, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:12, time_cost(all): 2 days, 0:13:49/5:41:28, loss=0.306255225027058, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=3.796124466054148, lr=4.0899967851115e-05
2023-11-23 08:39:09   INFO  epoch: 26/30, acc_iter=177762, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:26, time_cost(all): 2 days, 0:14:38/5:49:25, loss=0.306172111040749, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=4.386162244152249, lr=4.0732525741434e-05
2023-11-23 08:39:58   INFO  epoch: 26/30, acc_iter=177812, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 2 days, 0:15:27/5:52:32, loss=0.306088997054439, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=1.1364674434756892, lr=4.0565083631752e-05
2023-11-23 08:40:47   INFO  epoch: 27/30, acc_iter=177899, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:47:57, time_cost(all): 2 days, 0:16:16/5:41:32, loss=0.305944378718262, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.83(1.03), norm=3.4922282525856665, lr=4.0273734360907e-05
2023-11-23 08:41:36   INFO  epoch: 27/30, acc_iter=177949, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:42:37, time_cost(all): 2 days, 0:17:05/5:23:53, loss=0.305861264731953, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=2.4090129323563456, lr=4.0106292251226e-05
2023-11-23 08:42:25   INFO  epoch: 27/30, acc_iter=177999, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:43:53, time_cost(all): 2 days, 0:17:54/5:40:20, loss=0.305778150745644, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=3.0286738360227736, lr=3.9938850141544e-05
2023-11-23 08:43:15   INFO  epoch: 27/30, acc_iter=178049, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:39:44, time_cost(all): 2 days, 0:18:44/5:38:32, loss=0.305695036759335, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=1.5665715168011065, lr=3.9771408031863e-05
2023-11-23 08:44:04   INFO  epoch: 27/30, acc_iter=178099, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:48:25, time_cost(all): 2 days, 0:19:33/5:27:28, loss=0.305611922773026, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=2.743723124416927, lr=3.9603965922182e-05
2023-11-23 08:44:53   INFO  epoch: 27/30, acc_iter=178149, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:38:33, time_cost(all): 2 days, 0:20:22/5:50:39, loss=0.305528808786716, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=1.3203441019330053, lr=3.9436523812501e-05
2023-11-23 08:45:42   INFO  epoch: 27/30, acc_iter=178199, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:39:08, time_cost(all): 2 days, 0:21:11/5:32:19, loss=0.305445694800407, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=1.2329851017270776, lr=3.9269081702819e-05
2023-11-23 08:46:31   INFO  epoch: 27/30, acc_iter=178249, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:45:57, time_cost(all): 2 days, 0:22:00/5:46:59, loss=0.305362580814098, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=1.3581409658495907, lr=3.9101639593138e-05
2023-11-23 08:47:20   INFO  epoch: 27/30, acc_iter=178299, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:35:59, time_cost(all): 2 days, 0:22:49/5:31:25, loss=0.305279466827789, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=1.0699445894315052, lr=3.8934197483457e-05
2023-11-23 08:48:09   INFO  epoch: 27/30, acc_iter=178349, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:35:13, time_cost(all): 2 days, 0:23:38/5:40:22, loss=0.30519635284148, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=1.5068308059902797, lr=3.8766755373775e-05
2023-11-23 08:48:58   INFO  epoch: 27/30, acc_iter=178399, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:41:10, time_cost(all): 2 days, 0:24:27/5:15:25, loss=0.305113238855171, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=4.980471233087982, lr=3.8599313264094e-05
2023-11-23 08:49:47   INFO  epoch: 27/30, acc_iter=178449, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:36:48, time_cost(all): 2 days, 0:25:16/5:19:15, loss=0.305030124868862, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=4.189861528999947, lr=3.8431871154413e-05
2023-11-23 08:50:37   INFO  epoch: 27/30, acc_iter=178499, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:40:49, time_cost(all): 2 days, 0:26:06/5:25:03, loss=0.304947010882553, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.417967988502591, lr=3.8264429044732e-05
2023-11-23 08:51:26   INFO  epoch: 27/30, acc_iter=178549, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:34:03, time_cost(all): 2 days, 0:26:55/5:28:49, loss=0.304863896896244, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=2.9302998594989473, lr=3.809698693505e-05
2023-11-23 08:52:15   INFO  epoch: 27/30, acc_iter=178599, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:40:02, time_cost(all): 2 days, 0:27:44/5:38:13, loss=0.304780782909935, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=3.8620965377560026, lr=3.7929544825369e-05
2023-11-23 08:53:04   INFO  epoch: 27/30, acc_iter=178649, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:35:11, time_cost(all): 2 days, 0:28:33/5:12:41, loss=0.304697668923626, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=4.105521849650424, lr=3.7762102715688e-05
2023-11-23 08:53:53   INFO  epoch: 27/30, acc_iter=178699, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:37:35, time_cost(all): 2 days, 0:29:22/5:19:21, loss=0.304614554937317, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=1.7919575721718752, lr=3.7594660606006e-05
2023-11-23 08:54:42   INFO  epoch: 27/30, acc_iter=178749, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:32:01, time_cost(all): 2 days, 0:30:11/5:29:24, loss=0.304531440951008, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=1.476895350883707, lr=3.7427218496325e-05
2023-11-23 08:55:31   INFO  epoch: 27/30, acc_iter=178799, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:32:06, time_cost(all): 2 days, 0:31:00/5:37:48, loss=0.304448326964699, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=4.516405219191738, lr=3.7259776386644e-05
2023-11-23 08:56:20   INFO  epoch: 27/30, acc_iter=178849, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:33:33, time_cost(all): 2 days, 0:31:49/5:20:58, loss=0.30436521297839, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=1.6404031553544967, lr=3.7092334276963e-05
2023-11-23 08:57:10   INFO  epoch: 27/30, acc_iter=178899, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:33:19, time_cost(all): 2 days, 0:32:39/5:09:49, loss=0.30428209899208, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=2.1280031599947637, lr=3.6924892167281e-05
2023-11-23 08:57:59   INFO  epoch: 27/30, acc_iter=178949, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:28:20, time_cost(all): 2 days, 0:33:28/5:37:02, loss=0.304198985005771, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=0.8200220673243919, lr=3.67574500576e-05
2023-11-23 08:58:48   INFO  epoch: 27/30, acc_iter=178999, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:25:37, time_cost(all): 2 days, 0:34:17/5:22:46, loss=0.304115871019462, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.13(1.03), norm=1.3596607810373291, lr=3.6590007947919e-05
2023-11-23 08:59:37   INFO  epoch: 27/30, acc_iter=179049, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:30:36, time_cost(all): 2 days, 0:35:06/5:16:49, loss=0.304032757033153, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=4.657023459363631, lr=3.6422565838238e-05
2023-11-23 09:00:26   INFO  epoch: 27/30, acc_iter=179099, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:27:20, time_cost(all): 2 days, 0:35:55/5:22:50, loss=0.303949643046844, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=2.7589411995106703, lr=3.6255123728556e-05
2023-11-23 09:01:15   INFO  epoch: 27/30, acc_iter=179149, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:26:08, time_cost(all): 2 days, 0:36:44/5:31:56, loss=0.303866529060535, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=0.6234437171178064, lr=3.6087681618875e-05
2023-11-23 09:02:04   INFO  epoch: 27/30, acc_iter=179199, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:22:19, time_cost(all): 2 days, 0:37:33/5:26:10, loss=0.303783415074226, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=3.176572894846076, lr=3.5920239509194e-05
2023-11-23 09:02:53   INFO  epoch: 27/30, acc_iter=179249, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:21:49, time_cost(all): 2 days, 0:38:22/5:23:41, loss=0.303700301087917, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=0.7364380456070907, lr=3.5752797399512e-05
2023-11-23 09:03:42   INFO  epoch: 27/30, acc_iter=179299, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:24:19, time_cost(all): 2 days, 0:39:11/5:05:52, loss=0.303617187101608, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.05(1.03), norm=0.9592206309961492, lr=3.5585355289831e-05
2023-11-23 09:04:32   INFO  epoch: 27/30, acc_iter=179349, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:27:12, time_cost(all): 2 days, 0:40:01/5:03:32, loss=0.303534073115299, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=2.412997780251086, lr=3.541791318015e-05
2023-11-23 09:05:21   INFO  epoch: 27/30, acc_iter=179399, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:19:31, time_cost(all): 2 days, 0:40:50/5:01:01, loss=0.30345095912899, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=0.9589397947353362, lr=3.5250471070469e-05
2023-11-23 09:06:10   INFO  epoch: 27/30, acc_iter=179449, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:19:59, time_cost(all): 2 days, 0:41:39/5:12:34, loss=0.303367845142681, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=4.012335006264946, lr=3.5083028960787e-05
2023-11-23 09:06:59   INFO  epoch: 27/30, acc_iter=179499, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:19:34, time_cost(all): 2 days, 0:42:28/4:59:23, loss=0.303284731156372, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=3.9235097301343, lr=3.4915586851106e-05
2023-11-23 09:07:48   INFO  epoch: 27/30, acc_iter=179549, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:20:47, time_cost(all): 2 days, 0:43:17/5:22:06, loss=0.303201617170063, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=3.432588177663078, lr=3.4748144741425e-05
2023-11-23 09:08:37   INFO  epoch: 27/30, acc_iter=179599, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:22:30, time_cost(all): 2 days, 0:44:06/5:12:34, loss=0.303118503183754, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=3.955971472491169, lr=3.4580702631743e-05
2023-11-23 09:09:26   INFO  epoch: 27/30, acc_iter=179649, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:18:36, time_cost(all): 2 days, 0:44:55/5:22:25, loss=0.303035389197445, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=3.3269048307255047, lr=3.4413260522062e-05
2023-11-23 09:10:15   INFO  epoch: 27/30, acc_iter=179699, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:14:06, time_cost(all): 2 days, 0:45:44/5:18:10, loss=0.302952275211135, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=2.136641641584666, lr=3.4245818412381e-05
2023-11-23 09:11:04   INFO  epoch: 27/30, acc_iter=179749, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:15:15, time_cost(all): 2 days, 0:46:33/5:17:28, loss=0.302869161224826, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=1.6082246722026987, lr=3.40783763027e-05
2023-11-23 09:11:54   INFO  epoch: 27/30, acc_iter=179799, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:16:10, time_cost(all): 2 days, 0:47:23/5:12:01, loss=0.302786047238517, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=4.66700111241103, lr=3.3910934193018e-05
2023-11-23 09:12:43   INFO  epoch: 27/30, acc_iter=179849, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:18:45, time_cost(all): 2 days, 0:48:12/5:20:55, loss=0.302702933252208, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=4.063058919692429, lr=3.3743492083337e-05
2023-11-23 09:13:32   INFO  epoch: 27/30, acc_iter=179899, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:14:54, time_cost(all): 2 days, 0:49:01/5:07:42, loss=0.302619819265899, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=2.326374881450092, lr=3.3576049973656e-05
2023-11-23 09:14:21   INFO  epoch: 27/30, acc_iter=179949, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:12:07, time_cost(all): 2 days, 0:49:50/4:55:51, loss=0.30253670527959, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=2.0317349613482, lr=3.3408607863975e-05
2023-11-23 09:15:10   INFO  epoch: 27/30, acc_iter=179999, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:09:55, time_cost(all): 2 days, 0:50:39/4:53:10, loss=0.302453591293281, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=1.728211742973005, lr=3.3241165754293e-05
2023-11-23 09:15:59   INFO  epoch: 27/30, acc_iter=180049, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:14:19, time_cost(all): 2 days, 0:51:28/4:54:17, loss=0.302370477306972, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=0.9406380991773233, lr=3.3073723644612e-05
2023-11-23 09:16:48   INFO  epoch: 27/30, acc_iter=180099, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:08:55, time_cost(all): 2 days, 0:52:17/5:05:05, loss=0.302287363320663, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=2.6655179248667182, lr=3.2906281534931e-05
2023-11-23 09:17:37   INFO  epoch: 27/30, acc_iter=180149, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:07:20, time_cost(all): 2 days, 0:53:06/5:05:10, loss=0.302204249334354, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=4.396084742777459, lr=3.2738839425249e-05
2023-11-23 09:18:27   INFO  epoch: 27/30, acc_iter=180199, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:07:25, time_cost(all): 2 days, 0:53:56/5:09:30, loss=0.302121135348045, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=3.6844227970576426, lr=3.2571397315568e-05
2023-11-23 09:19:16   INFO  epoch: 27/30, acc_iter=180249, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:37, time_cost(all): 2 days, 0:54:45/5:03:55, loss=0.302038021361736, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=4.6907522536815245, lr=3.2403955205887e-05
2023-11-23 09:20:05   INFO  epoch: 27/30, acc_iter=180299, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:09:40, time_cost(all): 2 days, 0:55:34/5:05:23, loss=0.301954907375427, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=4.461415894659002, lr=3.2236513096206e-05
2023-11-23 09:20:54   INFO  epoch: 27/30, acc_iter=180349, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:04:53, time_cost(all): 2 days, 0:56:23/5:12:58, loss=0.301871793389118, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=4.440320343826258, lr=3.2069070986524e-05
2023-11-23 09:21:43   INFO  epoch: 27/30, acc_iter=180399, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:03:33, time_cost(all): 2 days, 0:57:12/5:00:49, loss=0.301788679402809, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=1.111067764363528, lr=3.1901628876843e-05
2023-11-23 09:22:32   INFO  epoch: 27/30, acc_iter=180449, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:08:19, time_cost(all): 2 days, 0:58:01/4:50:18, loss=0.3017055654165, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=1.9098692357907687, lr=3.1734186767162e-05
2023-11-23 09:23:21   INFO  epoch: 27/30, acc_iter=180499, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:03:02, time_cost(all): 2 days, 0:58:50/4:43:30, loss=0.301622451430191, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=4.050622026115801, lr=3.156674465748e-05
2023-11-23 09:24:10   INFO  epoch: 27/30, acc_iter=180549, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:02:09, time_cost(all): 2 days, 0:59:39/4:46:23, loss=0.301539337443881, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=1.0459118409152, lr=3.1399302547799e-05
2023-11-23 09:24:59   INFO  epoch: 27/30, acc_iter=180599, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:04:27, time_cost(all): 2 days, 1:00:28/5:09:09, loss=0.301456223457572, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=3.421131151445828, lr=3.1231860438118e-05
2023-11-23 09:25:49   INFO  epoch: 27/30, acc_iter=180649, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:00:01, time_cost(all): 2 days, 1:01:18/4:48:14, loss=0.301373109471263, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=4.990494717703419, lr=3.1064418328437e-05
2023-11-23 09:26:38   INFO  epoch: 27/30, acc_iter=180699, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/0:58:21, time_cost(all): 2 days, 1:02:07/4:54:57, loss=0.301289995484954, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=3.726901403039395, lr=3.0896976218755e-05
2023-11-23 09:27:27   INFO  epoch: 27/30, acc_iter=180749, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:02:04, time_cost(all): 2 days, 1:02:56/4:59:53, loss=0.301206881498645, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=2.9954969606670008, lr=3.0729534109074e-05
2023-11-23 09:28:16   INFO  epoch: 27/30, acc_iter=180799, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:58:15, time_cost(all): 2 days, 1:03:45/4:40:14, loss=0.301123767512336, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=1.710241777187482, lr=3.0562091999393e-05
2023-11-23 09:29:05   INFO  epoch: 27/30, acc_iter=180849, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/1:00:11, time_cost(all): 2 days, 1:04:34/4:50:30, loss=0.301040653526027, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=3.2402872110317236, lr=3.0394649889711e-05
2023-11-23 09:29:54   INFO  epoch: 27/30, acc_iter=180899, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:55:52, time_cost(all): 2 days, 1:05:23/5:02:11, loss=0.300957539539718, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=1.5935369034309665, lr=3.022720778003e-05
2023-11-23 09:30:43   INFO  epoch: 27/30, acc_iter=180949, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:57:46, time_cost(all): 2 days, 1:06:12/4:45:58, loss=0.300874425553409, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=3.0425117173011507, lr=3.0059765670349e-05
2023-11-23 09:31:32   INFO  epoch: 27/30, acc_iter=180999, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:56:45, time_cost(all): 2 days, 1:07:01/4:59:51, loss=0.3007913115671, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=4.957853235674214, lr=2.9892323560668e-05
2023-11-23 09:32:22   INFO  epoch: 27/30, acc_iter=181049, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:56:41, time_cost(all): 2 days, 1:07:51/4:50:05, loss=0.300708197580791, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=1.5242247372270543, lr=2.9724881450986e-05
2023-11-23 09:33:11   INFO  epoch: 27/30, acc_iter=181099, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:55:57, time_cost(all): 2 days, 1:08:40/4:41:36, loss=0.300625083594482, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=4.86791639621061, lr=2.9557439341305e-05
2023-11-23 09:34:00   INFO  epoch: 27/30, acc_iter=181149, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:35, time_cost(all): 2 days, 1:09:29/4:58:16, loss=0.300541969608173, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=2.323520103615831, lr=2.9389997231624e-05
2023-11-23 09:34:49   INFO  epoch: 27/30, acc_iter=181199, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:55:15, time_cost(all): 2 days, 1:10:18/4:57:14, loss=0.300458855621864, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=0.8269241134978238, lr=2.9222555121942e-05
2023-11-23 09:35:38   INFO  epoch: 27/30, acc_iter=181249, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:53:06, time_cost(all): 2 days, 1:11:07/4:49:40, loss=0.300375741635555, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=3.260777697041237, lr=2.9055113012261e-05
2023-11-23 09:36:27   INFO  epoch: 27/30, acc_iter=181299, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:53:44, time_cost(all): 2 days, 1:11:56/4:52:43, loss=0.300292627649245, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=2.027347478931274, lr=2.888767090258e-05
2023-11-23 09:37:16   INFO  epoch: 27/30, acc_iter=181349, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:40, time_cost(all): 2 days, 1:12:45/4:48:04, loss=0.300209513662936, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=4.858068593697593, lr=2.8720228792899e-05
2023-11-23 09:38:05   INFO  epoch: 27/30, acc_iter=181399, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:49:08, time_cost(all): 2 days, 1:13:34/4:47:37, loss=0.300126399676627, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=2.6331136501378465, lr=2.8552786683217e-05
2023-11-23 09:38:54   INFO  epoch: 27/30, acc_iter=181449, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:51:20, time_cost(all): 2 days, 1:14:23/4:48:23, loss=0.300043285690318, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=1.6676804771672595, lr=2.8385344573536e-05
2023-11-23 09:39:44   INFO  epoch: 27/30, acc_iter=181499, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:46:16, time_cost(all): 2 days, 1:15:13/4:42:25, loss=0.299960171704009, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=2.568707971474053, lr=2.8217902463855e-05
2023-11-23 09:40:33   INFO  epoch: 27/30, acc_iter=181549, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:45:41, time_cost(all): 2 days, 1:16:02/4:33:08, loss=0.2998770577177, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=3.1053367969924177, lr=2.8050460354174e-05
2023-11-23 09:41:22   INFO  epoch: 27/30, acc_iter=181599, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:47:14, time_cost(all): 2 days, 1:16:51/4:37:16, loss=0.299793943731391, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=1.9029063003926912, lr=2.7883018244492e-05
2023-11-23 09:42:11   INFO  epoch: 27/30, acc_iter=181649, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:45:16, time_cost(all): 2 days, 1:17:40/4:39:56, loss=0.299710829745082, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=4.70769024872452, lr=2.7715576134811e-05
2023-11-23 09:43:00   INFO  epoch: 27/30, acc_iter=181699, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:44:06, time_cost(all): 2 days, 1:18:29/4:44:51, loss=0.299627715758773, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=3.1519796939301856, lr=2.754813402513e-05
2023-11-23 09:43:49   INFO  epoch: 27/30, acc_iter=181749, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:37, time_cost(all): 2 days, 1:19:18/4:28:15, loss=0.299544601772464, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=2.940379681716544, lr=2.7380691915448e-05
2023-11-23 09:44:38   INFO  epoch: 27/30, acc_iter=181799, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:41:03, time_cost(all): 2 days, 1:20:07/4:26:24, loss=0.299461487786155, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=1.4816164042468936, lr=2.7213249805767e-05
2023-11-23 09:45:27   INFO  epoch: 27/30, acc_iter=181849, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:42:21, time_cost(all): 2 days, 1:20:56/4:35:51, loss=0.299378373799846, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=3.271930961055716, lr=2.7045807696086e-05
2023-11-23 09:46:17   INFO  epoch: 27/30, acc_iter=181899, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:28, time_cost(all): 2 days, 1:21:46/4:36:48, loss=0.299295259813537, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.23(1.03), norm=4.0292717066152, lr=2.6878365586405e-05
2023-11-23 09:47:06   INFO  epoch: 27/30, acc_iter=181949, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:38:47, time_cost(all): 2 days, 1:22:35/4:22:48, loss=0.299212145827228, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=1.1737096537562455, lr=2.6710923476723e-05
2023-11-23 09:47:55   INFO  epoch: 27/30, acc_iter=181999, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:49, time_cost(all): 2 days, 1:23:24/4:25:50, loss=0.299129031840919, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=2.908284630400129, lr=2.6543481367042e-05
2023-11-23 09:48:44   INFO  epoch: 27/30, acc_iter=182049, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:39:53, time_cost(all): 2 days, 1:24:13/4:44:11, loss=0.29904591785461, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=0.5994307097772349, lr=2.6376039257361e-05
2023-11-23 09:49:33   INFO  epoch: 27/30, acc_iter=182099, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:36:30, time_cost(all): 2 days, 1:25:02/4:34:09, loss=0.2989628038683, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=1.15918391269026, lr=2.6208597147679e-05
2023-11-23 09:50:22   INFO  epoch: 27/30, acc_iter=182149, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:36:58, time_cost(all): 2 days, 1:25:51/4:18:23, loss=0.298879689881991, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=3.6791723204526887, lr=2.6041155037998e-05
2023-11-23 09:51:11   INFO  epoch: 27/30, acc_iter=182199, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:17, time_cost(all): 2 days, 1:26:40/4:35:53, loss=0.298796575895682, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=2.8982644686460146, lr=2.5873712928317e-05
2023-11-23 09:52:00   INFO  epoch: 27/30, acc_iter=182249, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:35:56, time_cost(all): 2 days, 1:27:29/4:40:07, loss=0.298713461909373, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=0.6954043836085003, lr=2.5706270818636e-05
2023-11-23 09:52:49   INFO  epoch: 27/30, acc_iter=182299, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:34:52, time_cost(all): 2 days, 1:28:18/4:25:55, loss=0.298630347923064, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=4.759573353482509, lr=2.5538828708954e-05
2023-11-23 09:53:39   INFO  epoch: 27/30, acc_iter=182349, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:21, time_cost(all): 2 days, 1:29:08/4:13:16, loss=0.298547233936755, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=1.8908047575028495, lr=2.5371386599273e-05
2023-11-23 09:54:28   INFO  epoch: 27/30, acc_iter=182399, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:34:25, time_cost(all): 2 days, 1:29:57/4:35:01, loss=0.298464119950446, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=1.9688002978886898, lr=2.5203944489592e-05
2023-11-23 09:55:17   INFO  epoch: 27/30, acc_iter=182449, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:33, time_cost(all): 2 days, 1:30:46/4:18:57, loss=0.298381005964137, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=0.6047108501908187, lr=2.5036502379911e-05
2023-11-23 09:56:06   INFO  epoch: 27/30, acc_iter=182499, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:30:12, time_cost(all): 2 days, 1:31:35/4:14:55, loss=0.298297891977828, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=2.122738140320081, lr=2.4869060270229e-05
2023-11-23 09:56:55   INFO  epoch: 27/30, acc_iter=182549, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:59, time_cost(all): 2 days, 1:32:24/4:30:26, loss=0.298214777991519, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=2.6047771858318147, lr=2.4701618160548e-05
2023-11-23 09:57:44   INFO  epoch: 27/30, acc_iter=182599, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:48, time_cost(all): 2 days, 1:33:13/4:15:52, loss=0.29813166400521, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=3.229474022391071, lr=2.4534176050867e-05
2023-11-23 09:58:33   INFO  epoch: 27/30, acc_iter=182649, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:28:15, time_cost(all): 2 days, 1:34:02/4:15:50, loss=0.298048550018901, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=4.557047992465375, lr=2.4366733941185e-05
2023-11-23 09:59:22   INFO  epoch: 27/30, acc_iter=182699, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:20, time_cost(all): 2 days, 1:34:51/4:30:58, loss=0.297965436032592, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=4.919175652738083, lr=2.4199291831504e-05
2023-11-23 10:00:12   INFO  epoch: 27/30, acc_iter=182749, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:27:14, time_cost(all): 2 days, 1:35:41/4:09:25, loss=0.297882322046283, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=4.4146121295808705, lr=2.4031849721823e-05
2023-11-23 10:01:01   INFO  epoch: 27/30, acc_iter=182799, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:25:36, time_cost(all): 2 days, 1:36:30/4:14:57, loss=0.297799208059974, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=4.86211909572215, lr=2.3864407612142e-05
2023-11-23 10:01:50   INFO  epoch: 27/30, acc_iter=182849, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:09, time_cost(all): 2 days, 1:37:19/4:19:25, loss=0.297716094073665, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.7762137254073553, lr=2.369696550246e-05
2023-11-23 10:02:39   INFO  epoch: 27/30, acc_iter=182899, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:24, time_cost(all): 2 days, 1:38:08/4:26:07, loss=0.297632980087355, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=0.7171718626519401, lr=2.3529523392779e-05
2023-11-23 10:03:28   INFO  epoch: 27/30, acc_iter=182949, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:54, time_cost(all): 2 days, 1:38:57/4:21:00, loss=0.297549866101046, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=3.7376165896649423, lr=2.3362081283098e-05
2023-11-23 10:04:17   INFO  epoch: 27/30, acc_iter=182999, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:24:02, time_cost(all): 2 days, 1:39:46/4:11:18, loss=0.297466752114737, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.571931762837708, lr=2.3194639173416e-05
2023-11-23 10:05:06   INFO  epoch: 27/30, acc_iter=183049, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:36, time_cost(all): 2 days, 1:40:35/4:17:51, loss=0.297383638128428, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=3.71719816919752, lr=2.3027197063735e-05
2023-11-23 10:05:55   INFO  epoch: 27/30, acc_iter=183099, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:01, time_cost(all): 2 days, 1:41:24/4:17:16, loss=0.297300524142119, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=0.5186730528254868, lr=2.2859754954054e-05
2023-11-23 10:06:44   INFO  epoch: 27/30, acc_iter=183149, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:34, time_cost(all): 2 days, 1:42:13/4:18:13, loss=0.29721741015581, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=2.585741872230516, lr=2.2692312844373e-05
2023-11-23 10:07:34   INFO  epoch: 27/30, acc_iter=183199, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:19:23, time_cost(all): 2 days, 1:43:03/4:01:56, loss=0.297134296169501, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=3.842933259568916, lr=2.2524870734691e-05
2023-11-23 10:08:23   INFO  epoch: 27/30, acc_iter=183249, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:23, time_cost(all): 2 days, 1:43:52/4:08:43, loss=0.297051182183192, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=0.7918820444104122, lr=2.235742862501e-05
2023-11-23 10:09:12   INFO  epoch: 27/30, acc_iter=183299, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:18:15, time_cost(all): 2 days, 1:44:41/4:15:51, loss=0.296968068196883, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=3.0330756666962424, lr=2.2189986515329e-05
2023-11-23 10:10:01   INFO  epoch: 27/30, acc_iter=183349, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:17:21, time_cost(all): 2 days, 1:45:30/4:13:59, loss=0.296884954210574, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=4.233524211292872, lr=2.2022544405647e-05
2023-11-23 10:10:50   INFO  epoch: 27/30, acc_iter=183399, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:31, time_cost(all): 2 days, 1:46:19/3:58:56, loss=0.296801840224265, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=2.0727630310923373, lr=2.1855102295966e-05
2023-11-23 10:11:39   INFO  epoch: 27/30, acc_iter=183449, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:04, time_cost(all): 2 days, 1:47:08/4:05:29, loss=0.296718726237956, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.377913403111064, lr=2.1687660186285e-05
2023-11-23 10:12:28   INFO  epoch: 27/30, acc_iter=183499, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:41, time_cost(all): 2 days, 1:47:57/4:05:48, loss=0.296635612251647, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.92(1.03), norm=4.061149426494712, lr=2.1520218076604e-05
2023-11-23 10:13:17   INFO  epoch: 27/30, acc_iter=183549, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:54, time_cost(all): 2 days, 1:48:46/4:02:21, loss=0.296552498265338, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=4.259960210071575, lr=2.1352775966922e-05
2023-11-23 10:14:07   INFO  epoch: 27/30, acc_iter=183599, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:39, time_cost(all): 2 days, 1:49:36/3:59:05, loss=0.296469384279029, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=2.07693832916201, lr=2.1185333857241e-05
2023-11-23 10:14:56   INFO  epoch: 27/30, acc_iter=183649, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:00, time_cost(all): 2 days, 1:50:25/3:55:45, loss=0.29638627029272, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.09(1.03), norm=0.8488183219528838, lr=2.101789174756e-05
2023-11-23 10:15:45   INFO  epoch: 27/30, acc_iter=183699, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:30, time_cost(all): 2 days, 1:51:14/4:12:53, loss=0.29630315630641, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=0.5782771523119004, lr=2.0850449637879e-05
2023-11-23 10:16:34   INFO  epoch: 27/30, acc_iter=183749, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:08, time_cost(all): 2 days, 1:52:03/4:00:01, loss=0.296220042320101, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=2.9453884220579805, lr=2.0683007528197e-05
2023-11-23 10:17:23   INFO  epoch: 27/30, acc_iter=183799, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:49, time_cost(all): 2 days, 1:52:52/4:00:58, loss=0.296136928333792, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=3.4229749561001412, lr=2.0515565418516e-05
2023-11-23 10:18:12   INFO  epoch: 27/30, acc_iter=183849, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:31, time_cost(all): 2 days, 1:53:41/3:53:55, loss=0.296053814347483, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=4.848079739400194, lr=2.0348123308835e-05
2023-11-23 10:19:01   INFO  epoch: 27/30, acc_iter=183899, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:38, time_cost(all): 2 days, 1:54:30/4:09:17, loss=0.295970700361174, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=4.617175766015283, lr=2.0180681199153e-05
2023-11-23 10:19:50   INFO  epoch: 27/30, acc_iter=183949, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:08:13, time_cost(all): 2 days, 1:55:19/4:11:54, loss=0.295887586374865, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.3650450862506482, lr=2.0013239089472e-05
2023-11-23 10:20:39   INFO  epoch: 27/30, acc_iter=183999, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:17, time_cost(all): 2 days, 1:56:08/3:49:22, loss=0.295804472388556, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=4.560439141961704, lr=1.9845796979791e-05
2023-11-23 10:21:29   INFO  epoch: 27/30, acc_iter=184049, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:09, time_cost(all): 2 days, 1:56:58/3:47:16, loss=0.295721358402247, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=1.2448658819336567, lr=1.967835487011e-05
2023-11-23 10:22:18   INFO  epoch: 27/30, acc_iter=184099, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:36, time_cost(all): 2 days, 1:57:47/3:49:42, loss=0.295638244415938, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=3.672284028599025, lr=1.9510912760428e-05
2023-11-23 10:23:07   INFO  epoch: 27/30, acc_iter=184149, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:28, time_cost(all): 2 days, 1:58:36/4:05:45, loss=0.295555130429629, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.4188551713305881, lr=1.9343470650747e-05
2023-11-23 10:23:56   INFO  epoch: 27/30, acc_iter=184199, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:52, time_cost(all): 2 days, 1:59:25/4:07:30, loss=0.29547201644332, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=1.1443346232282217, lr=1.9176028541066e-05
2023-11-23 10:24:45   INFO  epoch: 27/30, acc_iter=184249, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:00, time_cost(all): 2 days, 2:00:14/3:46:58, loss=0.295388902457011, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=4.7326202538119775, lr=1.9008586431384e-05
2023-11-23 10:25:34   INFO  epoch: 27/30, acc_iter=184299, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:16, time_cost(all): 2 days, 2:01:03/3:44:05, loss=0.295305788470702, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=3.241132409486219, lr=1.8841144321703e-05
2023-11-23 10:26:23   INFO  epoch: 27/30, acc_iter=184349, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:22, time_cost(all): 2 days, 2:01:52/4:00:02, loss=0.295222674484393, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=3.7873064554042095, lr=1.8673702212022e-05
2023-11-23 10:27:12   INFO  epoch: 27/30, acc_iter=184399, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 2 days, 2:02:41/3:44:04, loss=0.295139560498084, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=1.4472341734202394, lr=1.8506260102341e-05
2023-11-23 10:28:02   INFO  epoch: 28/30, acc_iter=184486, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:44:02, time_cost(all): 2 days, 2:03:31/3:54:36, loss=0.294994942161906, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=0.6499542404472121, lr=1.8214910831495e-05
2023-11-23 10:28:51   INFO  epoch: 28/30, acc_iter=184536, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:44:01, time_cost(all): 2 days, 2:04:20/3:58:49, loss=0.294911828175597, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=3.3727645081497, lr=1.8047468721814e-05
2023-11-23 10:29:40   INFO  epoch: 28/30, acc_iter=184586, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:42:55, time_cost(all): 2 days, 2:05:09/4:00:49, loss=0.294828714189288, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=3.777258163984411, lr=1.7880026612133e-05
2023-11-23 10:30:29   INFO  epoch: 28/30, acc_iter=184636, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:42:17, time_cost(all): 2 days, 2:05:58/3:40:45, loss=0.294745600202979, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=2.8309807736806105, lr=1.7712584502451e-05
2023-11-23 10:31:18   INFO  epoch: 28/30, acc_iter=184686, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:41:57, time_cost(all): 2 days, 2:06:47/3:46:11, loss=0.29466248621667, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=4.019488607583973, lr=1.754514239277e-05
2023-11-23 10:32:07   INFO  epoch: 28/30, acc_iter=184736, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:44:39, time_cost(all): 2 days, 2:07:36/3:55:57, loss=0.29457937223036, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=2.1956970914388303, lr=1.7377700283089e-05
2023-11-23 10:32:56   INFO  epoch: 28/30, acc_iter=184786, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:43:58, time_cost(all): 2 days, 2:08:25/3:49:09, loss=0.294496258244051, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=4.858479694848634, lr=1.7210258173408e-05
2023-11-23 10:33:45   INFO  epoch: 28/30, acc_iter=184836, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:41:30, time_cost(all): 2 days, 2:09:14/3:46:45, loss=0.294413144257742, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=3.1333045444524252, lr=1.7042816063726e-05
2023-11-23 10:34:34   INFO  epoch: 28/30, acc_iter=184886, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:44:44, time_cost(all): 2 days, 2:10:03/3:48:31, loss=0.294330030271433, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=1.387817773148011, lr=1.6875373954045e-05
2023-11-23 10:35:24   INFO  epoch: 28/30, acc_iter=184936, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:41:42, time_cost(all): 2 days, 2:10:53/3:46:06, loss=0.294246916285124, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=3.5914576402900327, lr=1.6707931844364e-05
2023-11-23 10:36:13   INFO  epoch: 28/30, acc_iter=184986, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:39:13, time_cost(all): 2 days, 2:11:42/3:39:04, loss=0.294163802298815, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=1.582327697205171, lr=1.6540489734682e-05
2023-11-23 10:37:02   INFO  epoch: 28/30, acc_iter=185036, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:36:24, time_cost(all): 2 days, 2:12:31/3:51:24, loss=0.294080688312506, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=3.8897173418693085, lr=1.6373047625001e-05
2023-11-23 10:37:51   INFO  epoch: 28/30, acc_iter=185086, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:37:26, time_cost(all): 2 days, 2:13:20/3:45:56, loss=0.293997574326197, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=4.322002383571807, lr=1.620560551532e-05
2023-11-23 10:38:40   INFO  epoch: 28/30, acc_iter=185136, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:35:55, time_cost(all): 2 days, 2:14:09/3:45:20, loss=0.293914460339888, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=3.8275460948987297, lr=1.6038163405639e-05
2023-11-23 10:39:29   INFO  epoch: 28/30, acc_iter=185186, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:31:48, time_cost(all): 2 days, 2:14:58/3:40:59, loss=0.293831346353579, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=4.730126312185208, lr=1.5870721295957e-05
2023-11-23 10:40:18   INFO  epoch: 28/30, acc_iter=185236, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:38:47, time_cost(all): 2 days, 2:15:47/3:41:13, loss=0.29374823236727, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=1.2083302912799638, lr=1.5703279186276e-05
2023-11-23 10:41:07   INFO  epoch: 28/30, acc_iter=185286, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:34:43, time_cost(all): 2 days, 2:16:36/3:29:45, loss=0.293665118380961, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=3.9977760414193613, lr=1.5535837076595e-05
2023-11-23 10:41:57   INFO  epoch: 28/30, acc_iter=185336, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:33:40, time_cost(all): 2 days, 2:17:26/3:46:52, loss=0.293582004394652, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=2.1590518578279485, lr=1.5368394966913e-05
2023-11-23 10:42:46   INFO  epoch: 28/30, acc_iter=185386, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:30:11, time_cost(all): 2 days, 2:18:15/3:29:08, loss=0.293498890408343, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=3.3885507814404936, lr=1.5200952857232e-05
2023-11-23 10:43:35   INFO  epoch: 28/30, acc_iter=185436, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:28:01, time_cost(all): 2 days, 2:19:04/3:33:54, loss=0.293415776422034, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=0.6262149856589587, lr=1.5033510747551e-05
2023-11-23 10:44:24   INFO  epoch: 28/30, acc_iter=185486, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:29:08, time_cost(all): 2 days, 2:19:53/3:35:25, loss=0.293332662435725, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=0.6622395212358214, lr=1.486606863787e-05
2023-11-23 10:45:13   INFO  epoch: 28/30, acc_iter=185536, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:31:13, time_cost(all): 2 days, 2:20:42/3:42:05, loss=0.293249548449415, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=1.5504454395229428, lr=1.4698626528188e-05
2023-11-23 10:46:02   INFO  epoch: 28/30, acc_iter=185586, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:25:00, time_cost(all): 2 days, 2:21:31/3:25:29, loss=0.293166434463106, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=2.4579968084936485, lr=1.4531184418507e-05
2023-11-23 10:46:51   INFO  epoch: 28/30, acc_iter=185636, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:28:54, time_cost(all): 2 days, 2:22:20/3:33:41, loss=0.293083320476797, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.9657807569523227, lr=1.4363742308826e-05
2023-11-23 10:47:40   INFO  epoch: 28/30, acc_iter=185686, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:31:03, time_cost(all): 2 days, 2:23:09/3:34:41, loss=0.293000206490488, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=1.0992757018844832, lr=1.4196300199144e-05
2023-11-23 10:48:29   INFO  epoch: 28/30, acc_iter=185736, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:24:40, time_cost(all): 2 days, 2:23:58/3:39:32, loss=0.292917092504179, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=1.9900629325483692, lr=1.4028858089463e-05
2023-11-23 10:49:19   INFO  epoch: 28/30, acc_iter=185786, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:29:52, time_cost(all): 2 days, 2:24:48/3:34:17, loss=0.29283397851787, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=2.4891647615820274, lr=1.3861415979782e-05
2023-11-23 10:50:08   INFO  epoch: 28/30, acc_iter=185836, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:22:49, time_cost(all): 2 days, 2:25:37/3:23:23, loss=0.292750864531561, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=1.7762024145006552, lr=1.3693973870101e-05
2023-11-23 10:50:57   INFO  epoch: 28/30, acc_iter=185886, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:25:31, time_cost(all): 2 days, 2:26:26/3:38:05, loss=0.292667750545252, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=1.2212072764255826, lr=1.3526531760419e-05
2023-11-23 10:51:46   INFO  epoch: 28/30, acc_iter=185936, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:25:05, time_cost(all): 2 days, 2:27:15/3:30:07, loss=0.292584636558943, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=4.895865637378538, lr=1.3359089650738e-05
2023-11-23 10:52:35   INFO  epoch: 28/30, acc_iter=185986, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:25:08, time_cost(all): 2 days, 2:28:04/3:36:36, loss=0.292501522572634, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=4.2702594845246935, lr=1.3191647541057e-05
2023-11-23 10:53:24   INFO  epoch: 28/30, acc_iter=186036, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:22:30, time_cost(all): 2 days, 2:28:53/3:21:57, loss=0.292418408586325, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=0.9120469409034485, lr=1.3024205431376e-05
2023-11-23 10:54:13   INFO  epoch: 28/30, acc_iter=186086, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:17:09, time_cost(all): 2 days, 2:29:42/3:17:53, loss=0.292335294600016, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=1.032991414562102, lr=1.2856763321694e-05
2023-11-23 10:55:02   INFO  epoch: 28/30, acc_iter=186136, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:21:53, time_cost(all): 2 days, 2:30:31/3:22:25, loss=0.292252180613707, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=4.805080641255115, lr=1.2689321212013e-05
2023-11-23 10:55:51   INFO  epoch: 28/30, acc_iter=186186, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:21:26, time_cost(all): 2 days, 2:31:20/3:29:15, loss=0.292169066627398, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=1.0723901126192072, lr=1.2521879102332e-05
2023-11-23 10:56:41   INFO  epoch: 28/30, acc_iter=186236, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:15:50, time_cost(all): 2 days, 2:32:10/3:13:21, loss=0.292085952641089, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=1.1961656713605167, lr=1.235443699265e-05
2023-11-23 10:57:30   INFO  epoch: 28/30, acc_iter=186286, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:14:33, time_cost(all): 2 days, 2:32:59/3:27:20, loss=0.29200283865478, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=4.519280024515163, lr=1.2186994882969e-05
2023-11-23 10:58:19   INFO  epoch: 28/30, acc_iter=186336, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:13:29, time_cost(all): 2 days, 2:33:48/3:23:04, loss=0.29191972466847, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=0.9112826932663185, lr=1.2019552773288e-05
2023-11-23 10:59:08   INFO  epoch: 28/30, acc_iter=186386, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:12:54, time_cost(all): 2 days, 2:34:37/3:24:53, loss=0.291836610682161, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=3.2093591046763925, lr=1.1852110663607e-05
2023-11-23 10:59:57   INFO  epoch: 28/30, acc_iter=186436, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:16:38, time_cost(all): 2 days, 2:35:26/3:23:53, loss=0.291753496695852, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=4.908083034574118, lr=1.1684668553925e-05
2023-11-23 11:00:46   INFO  epoch: 28/30, acc_iter=186486, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:15:51, time_cost(all): 2 days, 2:36:15/3:24:04, loss=0.291670382709543, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=2.813912623023575, lr=1.1517226444244e-05
2023-11-23 11:01:35   INFO  epoch: 28/30, acc_iter=186536, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:16:14, time_cost(all): 2 days, 2:37:04/3:12:44, loss=0.291587268723234, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=4.710485646008435, lr=1.1349784334563e-05
2023-11-23 11:02:24   INFO  epoch: 28/30, acc_iter=186586, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:13:01, time_cost(all): 2 days, 2:37:53/3:22:24, loss=0.291504154736925, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=3.8352007620399777, lr=1.1182342224881e-05
2023-11-23 11:03:14   INFO  epoch: 28/30, acc_iter=186636, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:08:23, time_cost(all): 2 days, 2:38:43/3:24:50, loss=0.291421040750616, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=0.8070455946291855, lr=1.10149001152e-05
2023-11-23 11:04:03   INFO  epoch: 28/30, acc_iter=186686, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:07:34, time_cost(all): 2 days, 2:39:32/3:07:40, loss=0.291337926764307, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=3.1324622057210414, lr=1.0847458005519e-05
2023-11-23 11:04:52   INFO  epoch: 28/30, acc_iter=186736, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:10:29, time_cost(all): 2 days, 2:40:21/3:15:15, loss=0.291254812777998, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=2.5481206788969546, lr=1.0680015895838e-05
2023-11-23 11:05:41   INFO  epoch: 28/30, acc_iter=186786, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:08:08, time_cost(all): 2 days, 2:41:10/3:15:26, loss=0.291171698791689, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=0.6187783440562082, lr=1.0512573786156e-05
2023-11-23 11:06:30   INFO  epoch: 28/30, acc_iter=186836, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:10:07, time_cost(all): 2 days, 2:41:59/3:08:31, loss=0.29108858480538, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=4.763269938011039, lr=1.0345131676475e-05
2023-11-23 11:07:19   INFO  epoch: 28/30, acc_iter=186886, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:21, time_cost(all): 2 days, 2:42:48/3:09:01, loss=0.291005470819071, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=3.7151437651699624, lr=1.0177689566794e-05
2023-11-23 11:08:08   INFO  epoch: 28/30, acc_iter=186936, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:06:53, time_cost(all): 2 days, 2:43:37/3:20:23, loss=0.290922356832762, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=2.099112813631546, lr=1.0010247457112e-05
2023-11-23 11:08:57   INFO  epoch: 28/30, acc_iter=186986, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:04:53, time_cost(all): 2 days, 2:44:26/3:09:33, loss=0.290839242846453, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=4.781424118601246, lr=9.956015771806e-06
2023-11-23 11:09:46   INFO  epoch: 28/30, acc_iter=187036, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:06:50, time_cost(all): 2 days, 2:45:15/3:05:34, loss=0.290756128860144, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=2.513444483175439, lr=9.909164229206e-06
2023-11-23 11:10:36   INFO  epoch: 28/30, acc_iter=187086, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:05:12, time_cost(all): 2 days, 2:46:05/3:08:52, loss=0.290673014873835, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=3.9630671228956547, lr=9.862312686605e-06
2023-11-23 11:11:25   INFO  epoch: 28/30, acc_iter=187136, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:02:46, time_cost(all): 2 days, 2:46:54/3:11:41, loss=0.290589900887525, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=2.444148758426695, lr=9.815461144004e-06
2023-11-23 11:12:14   INFO  epoch: 28/30, acc_iter=187186, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:02:14, time_cost(all): 2 days, 2:47:43/3:14:52, loss=0.290506786901216, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=1.7999535616983238, lr=9.768609601403e-06
2023-11-23 11:13:03   INFO  epoch: 28/30, acc_iter=187236, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/1:02:26, time_cost(all): 2 days, 2:48:32/3:14:51, loss=0.290423672914907, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=4.958976463773617, lr=9.721758058803e-06
2023-11-23 11:13:52   INFO  epoch: 28/30, acc_iter=187286, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:03:54, time_cost(all): 2 days, 2:49:21/2:58:02, loss=0.290340558928598, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=1.9028892986509447, lr=9.674906516202e-06
2023-11-23 11:14:41   INFO  epoch: 28/30, acc_iter=187336, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/0:58:44, time_cost(all): 2 days, 2:50:10/3:01:18, loss=0.290257444942289, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.181355909560884, lr=9.628054973601e-06
2023-11-23 11:15:30   INFO  epoch: 28/30, acc_iter=187386, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:59:57, time_cost(all): 2 days, 2:50:59/3:12:45, loss=0.29017433095598, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=2.9053229755444416, lr=9.581203431e-06
2023-11-23 11:16:19   INFO  epoch: 28/30, acc_iter=187436, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:59:55, time_cost(all): 2 days, 2:51:48/3:12:26, loss=0.290091216969671, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=0.6445644514381579, lr=9.5343518884e-06
2023-11-23 11:17:09   INFO  epoch: 28/30, acc_iter=187486, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:55:53, time_cost(all): 2 days, 2:52:38/2:54:00, loss=0.290008102983362, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=1.365082148871547, lr=9.487500345799e-06
2023-11-23 11:17:58   INFO  epoch: 28/30, acc_iter=187536, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:59:45, time_cost(all): 2 days, 2:53:27/3:02:19, loss=0.289924988997053, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=2.076494336048275, lr=9.440648803198e-06
2023-11-23 11:18:47   INFO  epoch: 28/30, acc_iter=187586, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:53:48, time_cost(all): 2 days, 2:54:16/2:56:06, loss=0.289841875010744, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=4.813602350270275, lr=9.393797260597e-06
2023-11-23 11:19:36   INFO  epoch: 28/30, acc_iter=187636, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:56:29, time_cost(all): 2 days, 2:55:05/3:05:15, loss=0.289758761024435, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.3922253812002547, lr=9.346945717997e-06
2023-11-23 11:20:25   INFO  epoch: 28/30, acc_iter=187686, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:55:12, time_cost(all): 2 days, 2:55:54/2:54:15, loss=0.289675647038126, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=4.032202729686114, lr=9.300094175396e-06
2023-11-23 11:21:14   INFO  epoch: 28/30, acc_iter=187736, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:37, time_cost(all): 2 days, 2:56:43/2:51:28, loss=0.289592533051817, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.95(1.03), norm=1.0137577793473826, lr=9.253242632795e-06
2023-11-23 11:22:03   INFO  epoch: 28/30, acc_iter=187786, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:53:06, time_cost(all): 2 days, 2:57:32/2:49:36, loss=0.289509419065508, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=0.615219660523624, lr=9.206391090194e-06
2023-11-23 11:22:52   INFO  epoch: 28/30, acc_iter=187836, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:50:42, time_cost(all): 2 days, 2:58:21/3:03:08, loss=0.289426305079199, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=4.174875651961941, lr=9.159539547594e-06
2023-11-23 11:23:41   INFO  epoch: 28/30, acc_iter=187886, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:52:09, time_cost(all): 2 days, 2:59:10/2:50:49, loss=0.28934319109289, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=4.6597017944937456, lr=9.112688004993e-06
2023-11-23 11:24:31   INFO  epoch: 28/30, acc_iter=187936, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:48:12, time_cost(all): 2 days, 3:00:00/2:56:34, loss=0.28926007710658, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=3.0330093185262252, lr=9.065836462392e-06
2023-11-23 11:25:20   INFO  epoch: 28/30, acc_iter=187986, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:50:58, time_cost(all): 2 days, 3:00:49/3:01:23, loss=0.289176963120271, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=0.8002228553717984, lr=9.018984919792e-06
2023-11-23 11:26:09   INFO  epoch: 28/30, acc_iter=188036, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:48:21, time_cost(all): 2 days, 3:01:38/2:53:50, loss=0.289093849133962, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=4.616734993807651, lr=8.972133377191e-06
2023-11-23 11:26:58   INFO  epoch: 28/30, acc_iter=188086, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:46:39, time_cost(all): 2 days, 3:02:27/3:01:01, loss=0.289010735147653, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=3.081672625157051, lr=8.92528183459e-06
2023-11-23 11:27:47   INFO  epoch: 28/30, acc_iter=188136, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:48:24, time_cost(all): 2 days, 3:03:16/2:48:37, loss=0.288927621161344, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=1.07978724227516, lr=8.878430291989e-06
2023-11-23 11:28:36   INFO  epoch: 28/30, acc_iter=188186, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:45:49, time_cost(all): 2 days, 3:04:05/2:53:37, loss=0.288844507175035, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=2.669111431681423, lr=8.831578749389e-06
2023-11-23 11:29:25   INFO  epoch: 28/30, acc_iter=188236, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:44:56, time_cost(all): 2 days, 3:04:54/2:49:17, loss=0.288761393188726, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=2.6731742373355147, lr=8.784727206788e-06
2023-11-23 11:30:14   INFO  epoch: 28/30, acc_iter=188286, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:45:09, time_cost(all): 2 days, 3:05:43/2:46:18, loss=0.288678279202417, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=4.306661188739854, lr=8.737875664187e-06
2023-11-23 11:31:04   INFO  epoch: 28/30, acc_iter=188336, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:42:08, time_cost(all): 2 days, 3:06:33/2:50:04, loss=0.288595165216108, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=0.8779473776190196, lr=8.691024121586e-06
2023-11-23 11:31:53   INFO  epoch: 28/30, acc_iter=188386, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:44:06, time_cost(all): 2 days, 3:07:22/2:55:36, loss=0.288512051229799, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=0.6157917061223995, lr=8.644172578986e-06
2023-11-23 11:32:42   INFO  epoch: 28/30, acc_iter=188436, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:44:15, time_cost(all): 2 days, 3:08:11/2:55:07, loss=0.28842893724349, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=4.970695379047678, lr=8.597321036385e-06
2023-11-23 11:33:31   INFO  epoch: 28/30, acc_iter=188486, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:40:08, time_cost(all): 2 days, 3:09:00/2:50:00, loss=0.288345823257181, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=4.936542248195347, lr=8.550469493784e-06
2023-11-23 11:34:20   INFO  epoch: 28/30, acc_iter=188536, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:40:27, time_cost(all): 2 days, 3:09:49/2:53:17, loss=0.288262709270872, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=4.097813706283906, lr=8.503617951183e-06
2023-11-23 11:35:09   INFO  epoch: 28/30, acc_iter=188586, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:38:03, time_cost(all): 2 days, 3:10:38/2:44:43, loss=0.288179595284563, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=3.789085354483767, lr=8.456766408583e-06
2023-11-23 11:35:58   INFO  epoch: 28/30, acc_iter=188636, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:07, time_cost(all): 2 days, 3:11:27/2:36:48, loss=0.288096481298254, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=1.8552237553616167, lr=8.409914865982e-06
2023-11-23 11:36:47   INFO  epoch: 28/30, acc_iter=188686, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:43, time_cost(all): 2 days, 3:12:16/2:49:36, loss=0.288013367311944, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=3.842526322032267, lr=8.363063323381e-06
2023-11-23 11:37:36   INFO  epoch: 28/30, acc_iter=188736, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:37:20, time_cost(all): 2 days, 3:13:05/2:36:59, loss=0.287930253325635, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=2.7271130285209297, lr=8.31621178078e-06
2023-11-23 11:38:26   INFO  epoch: 28/30, acc_iter=188786, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:07, time_cost(all): 2 days, 3:13:55/2:44:38, loss=0.287847139339326, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=3.3194410170748028, lr=8.26936023818e-06
2023-11-23 11:39:15   INFO  epoch: 28/30, acc_iter=188836, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:37:32, time_cost(all): 2 days, 3:14:44/2:38:02, loss=0.287764025353017, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=3.0355050264776624, lr=8.222508695579e-06
2023-11-23 11:40:04   INFO  epoch: 28/30, acc_iter=188886, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:35:47, time_cost(all): 2 days, 3:15:33/2:39:16, loss=0.287680911366708, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=3.446197630282925, lr=8.175657152978e-06
2023-11-23 11:40:53   INFO  epoch: 28/30, acc_iter=188936, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:34:24, time_cost(all): 2 days, 3:16:22/2:37:21, loss=0.287597797380399, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=4.071539063907057, lr=8.128805610377e-06
2023-11-23 11:41:42   INFO  epoch: 28/30, acc_iter=188986, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:34:20, time_cost(all): 2 days, 3:17:11/2:42:10, loss=0.28751468339409, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.0526847577645455, lr=8.081954067777e-06
2023-11-23 11:42:31   INFO  epoch: 28/30, acc_iter=189036, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:32:25, time_cost(all): 2 days, 3:18:00/2:42:51, loss=0.287431569407781, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=1.2604147962935728, lr=8.035102525176e-06
2023-11-23 11:43:20   INFO  epoch: 28/30, acc_iter=189086, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:26, time_cost(all): 2 days, 3:18:49/2:35:03, loss=0.287348455421472, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.11(1.03), norm=2.0945213051321976, lr=7.988250982575e-06
2023-11-23 11:44:09   INFO  epoch: 28/30, acc_iter=189136, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:29:26, time_cost(all): 2 days, 3:19:38/2:28:10, loss=0.287265341435163, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=1.5557650349085366, lr=7.941399439974e-06
2023-11-23 11:44:59   INFO  epoch: 28/30, acc_iter=189186, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:48, time_cost(all): 2 days, 3:20:28/2:37:06, loss=0.287182227448854, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=0.9230366335635936, lr=7.894547897374e-06
2023-11-23 11:45:48   INFO  epoch: 28/30, acc_iter=189236, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:29:55, time_cost(all): 2 days, 3:21:17/2:27:58, loss=0.287099113462545, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=2.273497625152962, lr=7.847696354773e-06
2023-11-23 11:46:37   INFO  epoch: 28/30, acc_iter=189286, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:28:35, time_cost(all): 2 days, 3:22:06/2:28:37, loss=0.287015999476236, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=1.980906755340837, lr=7.800844812172e-06
2023-11-23 11:47:26   INFO  epoch: 28/30, acc_iter=189336, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:28:31, time_cost(all): 2 days, 3:22:55/2:38:36, loss=0.286932885489927, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=1.4968263215104796, lr=7.753993269571e-06
2023-11-23 11:48:15   INFO  epoch: 28/30, acc_iter=189386, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:04, time_cost(all): 2 days, 3:23:44/2:32:19, loss=0.286849771503618, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=0.7207340564487537, lr=7.707141726971e-06
2023-11-23 11:49:04   INFO  epoch: 28/30, acc_iter=189436, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:26:23, time_cost(all): 2 days, 3:24:33/2:36:48, loss=0.286766657517309, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=1.6320296651614443, lr=7.66029018437e-06
2023-11-23 11:49:53   INFO  epoch: 28/30, acc_iter=189486, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:25:11, time_cost(all): 2 days, 3:25:22/2:24:04, loss=0.286683543531, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=0.8418028636678994, lr=7.613438641769e-06
2023-11-23 11:50:42   INFO  epoch: 28/30, acc_iter=189536, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:24:55, time_cost(all): 2 days, 3:26:11/2:23:13, loss=0.28660042954469, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=4.989388925750194, lr=7.566587099168e-06
2023-11-23 11:51:31   INFO  epoch: 28/30, acc_iter=189586, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:54, time_cost(all): 2 days, 3:27:00/2:22:00, loss=0.286517315558381, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=1.2523903458347543, lr=7.519735556568e-06
2023-11-23 11:52:21   INFO  epoch: 28/30, acc_iter=189636, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:22:39, time_cost(all): 2 days, 3:27:50/2:28:14, loss=0.286434201572072, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=1.3361726635099305, lr=7.472884013967e-06
2023-11-23 11:53:10   INFO  epoch: 28/30, acc_iter=189686, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:21:26, time_cost(all): 2 days, 3:28:39/2:32:48, loss=0.286351087585763, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=3.7439548661729014, lr=7.426032471366e-06
2023-11-23 11:53:59   INFO  epoch: 28/30, acc_iter=189736, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:21:33, time_cost(all): 2 days, 3:29:28/2:28:03, loss=0.286267973599454, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=3.646996130927897, lr=7.379180928765e-06
2023-11-23 11:54:48   INFO  epoch: 28/30, acc_iter=189786, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:33, time_cost(all): 2 days, 3:30:17/2:23:17, loss=0.286184859613145, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=2.4850976115625976, lr=7.332329386165e-06
2023-11-23 11:55:37   INFO  epoch: 28/30, acc_iter=189836, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:15, time_cost(all): 2 days, 3:31:06/2:18:35, loss=0.286101745626836, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=3.3124473186380787, lr=7.285477843564e-06
2023-11-23 11:56:26   INFO  epoch: 28/30, acc_iter=189886, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:10, time_cost(all): 2 days, 3:31:55/2:23:37, loss=0.286018631640527, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=4.475844645802239, lr=7.238626300963e-06
2023-11-23 11:57:15   INFO  epoch: 28/30, acc_iter=189936, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:03, time_cost(all): 2 days, 3:32:44/2:15:53, loss=0.285935517654218, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=2.623930331008789, lr=7.191774758362e-06
2023-11-23 11:58:04   INFO  epoch: 28/30, acc_iter=189986, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:16, time_cost(all): 2 days, 3:33:33/2:24:41, loss=0.285852403667909, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=2.901962260895971, lr=7.144923215762e-06
2023-11-23 11:58:54   INFO  epoch: 28/30, acc_iter=190036, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:15:28, time_cost(all): 2 days, 3:34:23/2:14:37, loss=0.2857692896816, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=0.6062888500217493, lr=7.098071673161e-06
2023-11-23 11:59:43   INFO  epoch: 28/30, acc_iter=190086, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:14:46, time_cost(all): 2 days, 3:35:12/2:18:40, loss=0.285686175695291, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=1.7225885056616492, lr=7.05122013056e-06
2023-11-23 12:00:32   INFO  epoch: 28/30, acc_iter=190136, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:14:20, time_cost(all): 2 days, 3:36:01/2:18:47, loss=0.285603061708982, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=1.394990722423455, lr=7.004368587959e-06
2023-11-23 12:01:21   INFO  epoch: 28/30, acc_iter=190186, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:14:21, time_cost(all): 2 days, 3:36:50/2:17:45, loss=0.285519947722673, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=1.3704285720976221, lr=6.957517045359e-06
2023-11-23 12:02:10   INFO  epoch: 28/30, acc_iter=190236, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:12:49, time_cost(all): 2 days, 3:37:39/2:19:36, loss=0.285436833736364, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=4.0304167121394645, lr=6.910665502758e-06
2023-11-23 12:02:59   INFO  epoch: 28/30, acc_iter=190286, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:12:20, time_cost(all): 2 days, 3:38:28/2:14:02, loss=0.285353719750055, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=2.746901929275192, lr=6.863813960157e-06
2023-11-23 12:03:48   INFO  epoch: 28/30, acc_iter=190336, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:10:53, time_cost(all): 2 days, 3:39:17/2:09:30, loss=0.285270605763745, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=2.8421774683399548, lr=6.816962417556e-06
2023-11-23 12:04:37   INFO  epoch: 28/30, acc_iter=190386, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:13, time_cost(all): 2 days, 3:40:06/2:21:33, loss=0.285187491777436, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=0.649894425860348, lr=6.770110874956e-06
2023-11-23 12:05:26   INFO  epoch: 28/30, acc_iter=190436, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:54, time_cost(all): 2 days, 3:40:55/2:15:23, loss=0.285104377791127, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=2.0818431804557402, lr=6.723259332355e-06
2023-11-23 12:06:16   INFO  epoch: 28/30, acc_iter=190486, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:29, time_cost(all): 2 days, 3:41:45/2:19:51, loss=0.285021263804818, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=1.5456933241854354, lr=6.676407789754e-06
2023-11-23 12:07:05   INFO  epoch: 28/30, acc_iter=190536, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:45, time_cost(all): 2 days, 3:42:34/2:16:56, loss=0.284938149818509, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=1.0628690820694564, lr=6.629556247153e-06
2023-11-23 12:07:54   INFO  epoch: 28/30, acc_iter=190586, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:06:49, time_cost(all): 2 days, 3:43:23/2:12:55, loss=0.2848550358322, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=2.587338084435147, lr=6.582704704553e-06
2023-11-23 12:08:43   INFO  epoch: 28/30, acc_iter=190636, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:02, time_cost(all): 2 days, 3:44:12/2:14:09, loss=0.284771921845891, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=0.6408926180131196, lr=6.535853161952e-06
2023-11-23 12:09:32   INFO  epoch: 28/30, acc_iter=190686, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:21, time_cost(all): 2 days, 3:45:01/2:03:57, loss=0.284688807859582, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=2.584878920830429, lr=6.489001619351e-06
2023-11-23 12:10:21   INFO  epoch: 28/30, acc_iter=190736, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:31, time_cost(all): 2 days, 3:45:50/2:10:23, loss=0.284605693873273, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=1.0823622338635484, lr=6.442150076751e-06
2023-11-23 12:11:10   INFO  epoch: 28/30, acc_iter=190786, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:59, time_cost(all): 2 days, 3:46:39/2:07:32, loss=0.284522579886964, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=0.7898022751312159, lr=6.39529853415e-06
2023-11-23 12:11:59   INFO  epoch: 28/30, acc_iter=190836, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:08, time_cost(all): 2 days, 3:47:28/2:04:16, loss=0.284439465900655, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=1.211640497245784, lr=6.348446991549e-06
2023-11-23 12:12:49   INFO  epoch: 28/30, acc_iter=190886, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:16, time_cost(all): 2 days, 3:48:18/2:02:48, loss=0.284356351914346, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=1.2237590951220803, lr=6.301595448948e-06
2023-11-23 12:13:38   INFO  epoch: 28/30, acc_iter=190936, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:24, time_cost(all): 2 days, 3:49:07/2:05:18, loss=0.284273237928037, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=2.3758463127486023, lr=6.254743906348e-06
2023-11-23 12:14:27   INFO  epoch: 28/30, acc_iter=190986, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 2 days, 3:49:56/2:05:30, loss=0.284190123941728, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=1.8022171715127326, lr=6.207892363747e-06
2023-11-23 12:15:16   INFO  epoch: 29/30, acc_iter=191073, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:49/1:45:37, time_cost(all): 2 days, 3:50:45/1:59:48, loss=0.28404550560555, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.262379454729708, lr=6.126370679621e-06
2023-11-23 12:16:05   INFO  epoch: 29/30, acc_iter=191123, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:38/1:45:31, time_cost(all): 2 days, 3:51:34/2:07:41, loss=0.283962391619241, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=3.3729770453415817, lr=6.079519137021e-06
2023-11-23 12:16:54   INFO  epoch: 29/30, acc_iter=191173, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:27/1:49:01, time_cost(all): 2 days, 3:52:23/2:00:40, loss=0.283879277632932, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=1.704963208626105, lr=6.03266759442e-06
2023-11-23 12:17:43   INFO  epoch: 29/30, acc_iter=191223, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:16/1:48:46, time_cost(all): 2 days, 3:53:12/2:04:38, loss=0.283796163646623, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=0.5085622433556638, lr=5.985816051819e-06
2023-11-23 12:18:32   INFO  epoch: 29/30, acc_iter=191273, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:05/1:41:27, time_cost(all): 2 days, 3:54:01/1:56:21, loss=0.283713049660314, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=2.249780032225838, lr=5.938964509218e-06
2023-11-23 12:19:21   INFO  epoch: 29/30, acc_iter=191323, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:04:54/1:46:51, time_cost(all): 2 days, 3:54:50/1:59:21, loss=0.283629935674005, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=2.1539023000204573, lr=5.892112966618e-06
2023-11-23 12:20:11   INFO  epoch: 29/30, acc_iter=191373, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:05:43/1:44:04, time_cost(all): 2 days, 3:55:40/1:54:57, loss=0.283546821687696, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=0.7271507890172684, lr=5.845261424017e-06
2023-11-23 12:21:00   INFO  epoch: 29/30, acc_iter=191423, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:06:32/1:37:14, time_cost(all): 2 days, 3:56:29/2:01:52, loss=0.283463707701386, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=2.9878012866593697, lr=5.798409881416e-06
2023-11-23 12:21:49   INFO  epoch: 29/30, acc_iter=191473, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:07:22/1:42:17, time_cost(all): 2 days, 3:57:18/1:52:32, loss=0.283380593715077, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=3.167903428605492, lr=5.751558338816e-06
2023-11-23 12:22:38   INFO  epoch: 29/30, acc_iter=191523, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:08:11/1:42:15, time_cost(all): 2 days, 3:58:07/1:59:21, loss=0.283297479728768, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=0.9512141866024281, lr=5.704706796215e-06
2023-11-23 12:23:27   INFO  epoch: 29/30, acc_iter=191573, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:09:00/1:35:20, time_cost(all): 2 days, 3:58:56/1:56:40, loss=0.283214365742459, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=0.7877399366880469, lr=5.657855253614e-06
2023-11-23 12:24:16   INFO  epoch: 29/30, acc_iter=191623, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:09:49/1:34:24, time_cost(all): 2 days, 3:59:45/1:52:29, loss=0.28313125175615, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=4.159446017813288, lr=5.611003711013e-06
2023-11-23 12:25:05   INFO  epoch: 29/30, acc_iter=191673, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:10:38/1:41:43, time_cost(all): 2 days, 4:00:34/1:57:05, loss=0.283048137769841, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=2.8903469784189437, lr=5.564152168413e-06
2023-11-23 12:25:54   INFO  epoch: 29/30, acc_iter=191723, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:11:27/1:31:41, time_cost(all): 2 days, 4:01:23/1:48:19, loss=0.282965023783532, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=3.8660910315245625, lr=5.517300625812e-06
2023-11-23 12:26:44   INFO  epoch: 29/30, acc_iter=191773, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:12:16/1:38:44, time_cost(all): 2 days, 4:02:13/1:54:47, loss=0.282881909797223, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=4.836640487264927, lr=5.470449083211e-06
2023-11-23 12:27:33   INFO  epoch: 29/30, acc_iter=191823, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:13:05/1:34:11, time_cost(all): 2 days, 4:03:02/1:56:59, loss=0.282798795810914, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=2.0471649826650444, lr=5.42359754061e-06
2023-11-23 12:28:22   INFO  epoch: 29/30, acc_iter=191873, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:13:54/1:37:39, time_cost(all): 2 days, 4:03:51/1:50:59, loss=0.282715681824605, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=1.686000029024394, lr=5.37674599801e-06
2023-11-23 12:29:11   INFO  epoch: 29/30, acc_iter=191923, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:14:44/1:34:38, time_cost(all): 2 days, 4:04:40/1:46:53, loss=0.282632567838296, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=4.4281717493944255, lr=5.329894455409e-06
2023-11-23 12:30:00   INFO  epoch: 29/30, acc_iter=191973, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:15:33/1:29:59, time_cost(all): 2 days, 4:05:29/1:51:33, loss=0.282549453851987, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=0.5264371361465774, lr=5.283042912808e-06
2023-11-23 12:30:49   INFO  epoch: 29/30, acc_iter=192023, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:16:22/1:30:56, time_cost(all): 2 days, 4:06:18/1:46:51, loss=0.282466339865678, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=4.338157286744249, lr=5.236191370207e-06
2023-11-23 12:31:38   INFO  epoch: 29/30, acc_iter=192073, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:17:11/1:26:42, time_cost(all): 2 days, 4:07:07/1:51:53, loss=0.282383225879369, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=2.120744382910281, lr=5.189339827607e-06
2023-11-23 12:32:27   INFO  epoch: 29/30, acc_iter=192123, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:18:00/1:28:47, time_cost(all): 2 days, 4:07:56/1:51:26, loss=0.28230011189306, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=2.1039654981073093, lr=5.142488285006e-06
2023-11-23 12:33:16   INFO  epoch: 29/30, acc_iter=192173, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:18:49/1:27:35, time_cost(all): 2 days, 4:08:45/1:45:05, loss=0.28221699790675, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=4.867984364030577, lr=5.095636742405e-06
2023-11-23 12:34:06   INFO  epoch: 29/30, acc_iter=192223, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:19:38/1:32:29, time_cost(all): 2 days, 4:09:35/1:46:02, loss=0.282133883920441, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=3.315526557837164, lr=5.048785199804e-06
2023-11-23 12:34:55   INFO  epoch: 29/30, acc_iter=192273, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:20:27/1:26:33, time_cost(all): 2 days, 4:10:24/1:49:15, loss=0.282050769934132, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=2.7164693599134835, lr=5.001933657204e-06
2023-11-23 12:35:44   INFO  epoch: 29/30, acc_iter=192323, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:21:17/1:22:29, time_cost(all): 2 days, 4:11:13/1:39:02, loss=0.281967655947823, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=1.9233143600094353, lr=4.955082114603e-06
2023-11-23 12:36:33   INFO  epoch: 29/30, acc_iter=192373, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:22:06/1:24:23, time_cost(all): 2 days, 4:12:02/1:45:27, loss=0.281884541961514, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=1.1733282719413265, lr=4.908230572002e-06
2023-11-23 12:37:22   INFO  epoch: 29/30, acc_iter=192423, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:22:55/1:24:08, time_cost(all): 2 days, 4:12:51/1:44:26, loss=0.281801427975205, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=4.000129117291046, lr=4.861379029401e-06
2023-11-23 12:38:11   INFO  epoch: 29/30, acc_iter=192473, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:23:44/1:25:56, time_cost(all): 2 days, 4:13:40/1:40:59, loss=0.281718313988896, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=3.633517471922606, lr=4.814527486801e-06
2023-11-23 12:39:00   INFO  epoch: 29/30, acc_iter=192523, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:24:33/1:27:15, time_cost(all): 2 days, 4:14:29/1:38:30, loss=0.281635200002587, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=4.357510632956853, lr=4.7676759442e-06
2023-11-23 12:39:49   INFO  epoch: 29/30, acc_iter=192573, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:25:22/1:19:46, time_cost(all): 2 days, 4:15:18/1:37:47, loss=0.281552086016278, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=1.1626332574915765, lr=4.720824401599e-06
2023-11-23 12:40:38   INFO  epoch: 29/30, acc_iter=192623, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:26:11/1:22:48, time_cost(all): 2 days, 4:16:07/1:39:19, loss=0.281468972029969, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=4.064658834236423, lr=4.673972858998e-06
2023-11-23 12:41:28   INFO  epoch: 29/30, acc_iter=192673, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:27:00/1:24:00, time_cost(all): 2 days, 4:16:57/1:42:05, loss=0.28138585804366, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=0.9740587418016842, lr=4.627121316398e-06
2023-11-23 12:42:17   INFO  epoch: 29/30, acc_iter=192723, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:27:49/1:16:32, time_cost(all): 2 days, 4:17:46/1:39:33, loss=0.281302744057351, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=4.969582384045413, lr=4.580269773797e-06
2023-11-23 12:43:06   INFO  epoch: 29/30, acc_iter=192773, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:28:39/1:22:05, time_cost(all): 2 days, 4:18:35/1:38:17, loss=0.281219630071042, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=1.5063236333355747, lr=4.533418231196e-06
2023-11-23 12:43:55   INFO  epoch: 29/30, acc_iter=192823, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:29:28/1:21:06, time_cost(all): 2 days, 4:19:24/1:36:30, loss=0.281136516084733, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=4.24855047476939, lr=4.486566688595e-06
2023-11-23 12:44:44   INFO  epoch: 29/30, acc_iter=192873, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:30:17/1:20:57, time_cost(all): 2 days, 4:20:13/1:34:36, loss=0.281053402098424, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=2.202264319902042, lr=4.439715145995e-06
2023-11-23 12:45:33   INFO  epoch: 29/30, acc_iter=192923, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:31:06/1:19:33, time_cost(all): 2 days, 4:21:02/1:35:33, loss=0.280970288112115, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=4.1563291139297265, lr=4.392863603394e-06
2023-11-23 12:46:22   INFO  epoch: 29/30, acc_iter=192973, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:31:55/1:16:45, time_cost(all): 2 days, 4:21:51/1:33:38, loss=0.280887174125805, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=4.541528671587608, lr=4.346012060793e-06
2023-11-23 12:47:11   INFO  epoch: 29/30, acc_iter=193023, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:32:44/1:14:10, time_cost(all): 2 days, 4:22:40/1:31:30, loss=0.280804060139496, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=3.6948549614985824, lr=4.299160518192e-06
2023-11-23 12:48:01   INFO  epoch: 29/30, acc_iter=193073, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:33:33/1:11:27, time_cost(all): 2 days, 4:23:30/1:27:33, loss=0.280720946153187, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=3.5942979268623865, lr=4.252308975592e-06
2023-11-23 12:48:50   INFO  epoch: 29/30, acc_iter=193123, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:34:22/1:16:12, time_cost(all): 2 days, 4:24:19/1:32:41, loss=0.280637832166878, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=4.729235523021359, lr=4.205457432991e-06
2023-11-23 12:49:39   INFO  epoch: 29/30, acc_iter=193173, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:35:12/1:13:02, time_cost(all): 2 days, 4:25:08/1:31:11, loss=0.280554718180569, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=2.0161444359416065, lr=4.15860589039e-06
2023-11-23 12:50:28   INFO  epoch: 29/30, acc_iter=193223, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:36:01/1:14:50, time_cost(all): 2 days, 4:25:57/1:27:51, loss=0.28047160419426, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=4.135282681602709, lr=4.111754347789e-06
2023-11-23 12:51:17   INFO  epoch: 29/30, acc_iter=193273, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:36:50/1:13:19, time_cost(all): 2 days, 4:26:46/1:29:51, loss=0.280388490207951, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=2.343926932671165, lr=4.064902805189e-06
2023-11-23 12:52:06   INFO  epoch: 29/30, acc_iter=193323, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:37:39/1:08:35, time_cost(all): 2 days, 4:27:35/1:23:49, loss=0.280305376221642, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=0.7663467370704777, lr=4.018051262588e-06
2023-11-23 12:52:55   INFO  epoch: 29/30, acc_iter=193373, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:38:28/1:07:41, time_cost(all): 2 days, 4:28:24/1:23:43, loss=0.280222262235333, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=2.705075622227776, lr=3.971199719987e-06
2023-11-23 12:53:44   INFO  epoch: 29/30, acc_iter=193423, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:39:17/1:09:28, time_cost(all): 2 days, 4:29:13/1:22:50, loss=0.280139148249024, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=2.9524528702913613, lr=3.924348177386e-06
2023-11-23 12:54:33   INFO  epoch: 29/30, acc_iter=193473, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:40:06/1:06:15, time_cost(all): 2 days, 4:30:02/1:26:06, loss=0.280056034262715, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.435349725628314, lr=3.877496634786e-06
2023-11-23 12:55:23   INFO  epoch: 29/30, acc_iter=193523, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:40:55/1:06:06, time_cost(all): 2 days, 4:30:52/1:24:48, loss=0.279972920276406, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=4.204809661747456, lr=3.830645092185e-06
2023-11-23 12:56:12   INFO  epoch: 29/30, acc_iter=193573, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:41:44/1:04:52, time_cost(all): 2 days, 4:31:41/1:27:00, loss=0.279889806290097, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=2.176031815944987, lr=3.783793549584e-06
2023-11-23 12:57:01   INFO  epoch: 29/30, acc_iter=193623, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:42:34/1:07:16, time_cost(all): 2 days, 4:32:30/1:23:09, loss=0.279806692303788, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=2.861679426096559, lr=3.736942006983e-06
2023-11-23 12:57:50   INFO  epoch: 29/30, acc_iter=193673, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:43:23/1:04:05, time_cost(all): 2 days, 4:33:19/1:25:35, loss=0.279723578317479, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=1.1099750344007684, lr=3.690090464383e-06
2023-11-23 12:58:39   INFO  epoch: 29/30, acc_iter=193723, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:44:12/1:04:25, time_cost(all): 2 days, 4:34:08/1:21:04, loss=0.27964046433117, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=0.800512488085351, lr=3.643238921782e-06
2023-11-23 12:59:28   INFO  epoch: 29/30, acc_iter=193773, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:45:01/1:01:40, time_cost(all): 2 days, 4:34:57/1:18:05, loss=0.27955735034486, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=0.6521711697009933, lr=3.596387379181e-06
2023-11-23 13:00:17   INFO  epoch: 29/30, acc_iter=193823, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:45:50/0:59:18, time_cost(all): 2 days, 4:35:46/1:19:08, loss=0.279474236358551, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.4655776448032132, lr=3.54953583658e-06
2023-11-23 13:01:06   INFO  epoch: 29/30, acc_iter=193873, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:46:39/1:00:27, time_cost(all): 2 days, 4:36:35/1:19:31, loss=0.279391122372242, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=4.227793908634663, lr=3.50268429398e-06
2023-11-23 13:01:56   INFO  epoch: 29/30, acc_iter=193923, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:47:28/1:01:48, time_cost(all): 2 days, 4:37:25/1:15:21, loss=0.279308008385933, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=1.352697756029207, lr=3.455832751379e-06
2023-11-23 13:02:45   INFO  epoch: 29/30, acc_iter=193973, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:48:17/0:59:07, time_cost(all): 2 days, 4:38:14/1:20:35, loss=0.279224894399624, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=3.816404677308032, lr=3.408981208778e-06
2023-11-23 13:03:34   INFO  epoch: 29/30, acc_iter=194023, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:49:07/0:58:03, time_cost(all): 2 days, 4:39:03/1:14:13, loss=0.279141780413315, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=1.9896782673711386, lr=3.362129666177e-06
2023-11-23 13:04:23   INFO  epoch: 29/30, acc_iter=194073, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:49:56/0:56:37, time_cost(all): 2 days, 4:39:52/1:15:33, loss=0.279058666427006, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=1.405254448470219, lr=3.315278123577e-06
2023-11-23 13:05:12   INFO  epoch: 29/30, acc_iter=194123, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:50:45/0:55:09, time_cost(all): 2 days, 4:40:41/1:13:00, loss=0.278975552440697, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=4.460053702225663, lr=3.268426580976e-06
2023-11-23 13:06:01   INFO  epoch: 29/30, acc_iter=194173, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 0:51:34/0:54:46, time_cost(all): 2 days, 4:41:30/1:14:35, loss=0.278892438454388, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=3.2793104538678146, lr=3.221575038375e-06
2023-11-23 13:06:50   INFO  epoch: 29/30, acc_iter=194223, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 0:52:23/0:57:58, time_cost(all): 2 days, 4:42:19/1:14:21, loss=0.278809324468079, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=2.239656141075154, lr=3.174723495775e-06
2023-11-23 13:07:39   INFO  epoch: 29/30, acc_iter=194273, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 0:53:12/0:55:59, time_cost(all): 2 days, 4:43:08/1:15:01, loss=0.27872621048177, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=4.5253104745905794, lr=3.127871953174e-06
2023-11-23 13:08:28   INFO  epoch: 29/30, acc_iter=194323, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 0:54:01/0:53:02, time_cost(all): 2 days, 4:43:57/1:11:43, loss=0.278643096495461, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=1.5212190748758647, lr=3.081020410573e-06
2023-11-23 13:09:18   INFO  epoch: 29/30, acc_iter=194373, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 0:54:50/0:52:41, time_cost(all): 2 days, 4:44:47/1:13:08, loss=0.278559982509152, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=4.633692657789766, lr=3.034168867972e-06
2023-11-23 13:10:07   INFO  epoch: 29/30, acc_iter=194423, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 0:55:39/0:50:25, time_cost(all): 2 days, 4:45:36/1:10:27, loss=0.278476868522843, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.5179931222545777, lr=2.987317325372e-06
2023-11-23 13:10:56   INFO  epoch: 29/30, acc_iter=194473, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 0:56:29/0:53:43, time_cost(all): 2 days, 4:46:25/1:12:06, loss=0.278393754536534, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=0.789265193818768, lr=2.940465782771e-06
2023-11-23 13:11:45   INFO  epoch: 29/30, acc_iter=194523, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 0:57:18/0:49:14, time_cost(all): 2 days, 4:47:14/1:05:22, loss=0.278310640550225, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=3.865232866698332, lr=2.89361424017e-06
2023-11-23 13:12:34   INFO  epoch: 29/30, acc_iter=194573, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 0:58:07/0:47:47, time_cost(all): 2 days, 4:48:03/1:07:48, loss=0.278227526563915, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=1.4412618518661202, lr=2.846762697569e-06
2023-11-23 13:13:23   INFO  epoch: 29/30, acc_iter=194623, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 0:58:56/0:48:26, time_cost(all): 2 days, 4:48:52/1:06:22, loss=0.278144412577606, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=0.681838560336919, lr=2.799911154969e-06
2023-11-23 13:14:12   INFO  epoch: 29/30, acc_iter=194673, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 0:59:45/0:49:00, time_cost(all): 2 days, 4:49:41/1:03:39, loss=0.278061298591297, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.920523967764081, lr=2.753059612368e-06
2023-11-23 13:15:01   INFO  epoch: 29/30, acc_iter=194723, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:00:34/0:47:34, time_cost(all): 2 days, 4:50:30/1:03:53, loss=0.277978184604988, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=3.8558848007799527, lr=2.706208069767e-06
2023-11-23 13:15:51   INFO  epoch: 29/30, acc_iter=194773, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:01:23/0:48:26, time_cost(all): 2 days, 4:51:20/1:02:22, loss=0.277895070618679, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=0.9060180171505059, lr=2.659356527166e-06
2023-11-23 13:16:40   INFO  epoch: 29/30, acc_iter=194823, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:02:12/0:43:47, time_cost(all): 2 days, 4:52:09/1:04:42, loss=0.27781195663237, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=0.5266267427313908, lr=2.612504984566e-06
2023-11-23 13:17:29   INFO  epoch: 29/30, acc_iter=194873, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:03:02/0:47:01, time_cost(all): 2 days, 4:52:58/1:01:16, loss=0.277728842646061, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=1.0872662718801336, lr=2.565653441965e-06
2023-11-23 13:18:18   INFO  epoch: 29/30, acc_iter=194923, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:03:51/0:45:42, time_cost(all): 2 days, 4:53:47/1:01:55, loss=0.277645728659752, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=4.592670091499336, lr=2.518801899364e-06
2023-11-23 13:19:07   INFO  epoch: 29/30, acc_iter=194973, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:04:40/0:43:58, time_cost(all): 2 days, 4:54:36/0:57:44, loss=0.277562614673443, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.83(1.03), norm=0.6439331860425699, lr=2.471950356763e-06
2023-11-23 13:19:56   INFO  epoch: 29/30, acc_iter=195023, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:05:29/0:40:53, time_cost(all): 2 days, 4:55:25/0:57:08, loss=0.277479500687134, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=1.344234413236301, lr=2.425098814163e-06
2023-11-23 13:20:45   INFO  epoch: 29/30, acc_iter=195073, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:06:18/0:42:35, time_cost(all): 2 days, 4:56:14/1:01:34, loss=0.277396386700825, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=1.301781486624904, lr=2.378247271562e-06
2023-11-23 13:21:34   INFO  epoch: 29/30, acc_iter=195123, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:07:07/0:39:15, time_cost(all): 2 days, 4:57:03/0:59:32, loss=0.277313272714516, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=4.626303440854833, lr=2.331395728961e-06
2023-11-23 13:22:23   INFO  epoch: 29/30, acc_iter=195173, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:07:56/0:39:46, time_cost(all): 2 days, 4:57:52/0:54:36, loss=0.277230158728207, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=1.8279847812424053, lr=2.28454418636e-06
2023-11-23 13:23:13   INFO  epoch: 29/30, acc_iter=195223, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:08:45/0:40:54, time_cost(all): 2 days, 4:58:42/0:58:13, loss=0.277147044741898, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=2.4844546391383813, lr=2.23769264376e-06
2023-11-23 13:24:02   INFO  epoch: 29/30, acc_iter=195273, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:09:34/0:38:48, time_cost(all): 2 days, 4:59:31/0:57:43, loss=0.277063930755589, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=2.392395285621596, lr=2.190841101159e-06
2023-11-23 13:24:51   INFO  epoch: 29/30, acc_iter=195323, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:10:24/0:39:05, time_cost(all): 2 days, 5:00:20/0:56:20, loss=0.27698081676928, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=3.7452130264178067, lr=2.143989558558e-06
2023-11-23 13:25:40   INFO  epoch: 29/30, acc_iter=195373, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:11:13/0:37:49, time_cost(all): 2 days, 5:01:09/0:53:22, loss=0.27689770278297, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=3.2170694576099748, lr=2.097138015957e-06
2023-11-23 13:26:29   INFO  epoch: 29/30, acc_iter=195423, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:12:02/0:34:05, time_cost(all): 2 days, 5:01:58/0:51:14, loss=0.276814588796661, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=1.728063659479031, lr=2.050286473357e-06
2023-11-23 13:27:18   INFO  epoch: 29/30, acc_iter=195473, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:12:51/0:34:00, time_cost(all): 2 days, 5:02:47/0:53:40, loss=0.276731474810352, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=3.155383140743072, lr=2.003434930756e-06
2023-11-23 13:28:07   INFO  epoch: 29/30, acc_iter=195523, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:13:40/0:33:55, time_cost(all): 2 days, 5:03:36/0:52:57, loss=0.276648360824043, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.13(1.03), norm=2.3156614211857036, lr=1.956583388155e-06
2023-11-23 13:28:56   INFO  epoch: 29/30, acc_iter=195573, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:14:29/0:33:40, time_cost(all): 2 days, 5:04:25/0:53:15, loss=0.276565246837734, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=0.9968463156183933, lr=1.909731845554e-06
2023-11-23 13:29:46   INFO  epoch: 29/30, acc_iter=195623, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:15:18/0:31:03, time_cost(all): 2 days, 5:05:15/0:50:31, loss=0.276482132851425, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=4.504374132320899, lr=1.862880302954e-06
2023-11-23 13:30:35   INFO  epoch: 29/30, acc_iter=195673, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:16:07/0:31:52, time_cost(all): 2 days, 5:06:04/0:48:46, loss=0.276399018865116, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=0.8681284648658683, lr=1.816028760353e-06
2023-11-23 13:31:24   INFO  epoch: 29/30, acc_iter=195723, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:16:57/0:31:29, time_cost(all): 2 days, 5:06:53/0:46:22, loss=0.276315904878807, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=3.4139024024438607, lr=1.769177217752e-06
2023-11-23 13:32:13   INFO  epoch: 29/30, acc_iter=195773, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:17:46/0:29:44, time_cost(all): 2 days, 5:07:42/0:48:07, loss=0.276232790892498, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=2.5522485753834627, lr=1.722325675151e-06
2023-11-23 13:33:02   INFO  epoch: 29/30, acc_iter=195823, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:18:35/0:28:51, time_cost(all): 2 days, 5:08:31/0:46:34, loss=0.276149676906189, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=1.7314435867837215, lr=1.675474132551e-06
2023-11-23 13:33:51   INFO  epoch: 29/30, acc_iter=195873, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:19:24/0:29:08, time_cost(all): 2 days, 5:09:20/0:44:38, loss=0.27606656291988, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=1.9474291337484635, lr=1.62862258995e-06
2023-11-23 13:34:40   INFO  epoch: 29/30, acc_iter=195923, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:20:13/0:26:49, time_cost(all): 2 days, 5:10:09/0:47:04, loss=0.275983448933571, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=4.253736915617442, lr=1.581771047349e-06
2023-11-23 13:35:29   INFO  epoch: 29/30, acc_iter=195973, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:21:02/0:26:11, time_cost(all): 2 days, 5:10:58/0:42:25, loss=0.275900334947262, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=0.7821910505598743, lr=1.534919504748e-06
2023-11-23 13:36:18   INFO  epoch: 29/30, acc_iter=196023, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:21:51/0:27:02, time_cost(all): 2 days, 5:11:47/0:43:31, loss=0.275817220960953, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=2.26737602787968, lr=1.488067962148e-06
2023-11-23 13:37:08   INFO  epoch: 29/30, acc_iter=196073, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:22:40/0:24:13, time_cost(all): 2 days, 5:12:37/0:41:35, loss=0.275734106974644, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=0.711691376003633, lr=1.441216419547e-06
2023-11-23 13:37:57   INFO  epoch: 29/30, acc_iter=196123, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:23:29/0:23:55, time_cost(all): 2 days, 5:13:26/0:43:48, loss=0.275650992988335, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=1.3011645220586172, lr=1.394364876946e-06
2023-11-23 13:38:46   INFO  epoch: 29/30, acc_iter=196173, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:24:19/0:22:56, time_cost(all): 2 days, 5:14:15/0:39:24, loss=0.275567879002025, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=4.515813783244318, lr=1.347513334345e-06
2023-11-23 13:39:35   INFO  epoch: 29/30, acc_iter=196223, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:25:08/0:23:28, time_cost(all): 2 days, 5:15:04/0:39:40, loss=0.275484765015716, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=2.7678558836575062, lr=1.300661791745e-06
2023-11-23 13:40:24   INFO  epoch: 29/30, acc_iter=196273, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:25:57/0:20:49, time_cost(all): 2 days, 5:15:53/0:39:10, loss=0.275401651029407, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=4.474632907539534, lr=1.253810249144e-06
2023-11-23 13:41:13   INFO  epoch: 29/30, acc_iter=196323, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:26:46/0:20:45, time_cost(all): 2 days, 5:16:42/0:38:50, loss=0.275318537043098, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=3.242094959089917, lr=1.206958706543e-06
2023-11-23 13:42:02   INFO  epoch: 29/30, acc_iter=196373, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:27:35/0:20:59, time_cost(all): 2 days, 5:17:31/0:38:11, loss=0.275235423056789, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=1.6129135568974793, lr=1.160107163942e-06
2023-11-23 13:42:51   INFO  epoch: 29/30, acc_iter=196423, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:28:24/0:20:16, time_cost(all): 2 days, 5:18:20/0:35:11, loss=0.27515230907048, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=4.201353629454545, lr=1.113255621342e-06
2023-11-23 13:43:41   INFO  epoch: 29/30, acc_iter=196473, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:29:13/0:19:20, time_cost(all): 2 days, 5:19:10/0:34:55, loss=0.275069195084171, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=2.9411422877743787, lr=1.066404078741e-06
2023-11-23 13:44:30   INFO  epoch: 29/30, acc_iter=196523, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:30:02/0:18:38, time_cost(all): 2 days, 5:19:59/0:33:44, loss=0.274986081097862, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=4.35254555651864, lr=1.01955253614e-06
2023-11-23 13:45:19   INFO  epoch: 29/30, acc_iter=196573, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:30:52/0:17:10, time_cost(all): 2 days, 5:20:48/0:33:09, loss=0.274902967111553, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=1.659693099975909, lr=9.72700993539e-07
2023-11-23 13:46:08   INFO  epoch: 29/30, acc_iter=196623, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:31:41/0:16:06, time_cost(all): 2 days, 5:21:37/0:33:02, loss=0.274819853125244, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=4.277897632741676, lr=9.25849450939e-07
2023-11-23 13:46:57   INFO  epoch: 29/30, acc_iter=196673, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:32:30/0:15:08, time_cost(all): 2 days, 5:22:26/0:32:44, loss=0.274736739138935, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=1.9385171051347256, lr=8.78997908338e-07
2023-11-23 13:47:46   INFO  epoch: 29/30, acc_iter=196723, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:33:19/0:15:10, time_cost(all): 2 days, 5:23:15/0:33:15, loss=0.274653625152626, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=3.4575318814275118, lr=8.32146365737e-07
2023-11-23 13:48:35   INFO  epoch: 29/30, acc_iter=196773, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:34:08/0:13:49, time_cost(all): 2 days, 5:24:04/0:30:03, loss=0.274570511166317, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=3.856687123251888, lr=7.85294823136e-07
2023-11-23 13:49:24   INFO  epoch: 29/30, acc_iter=196823, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:34:57/0:13:09, time_cost(all): 2 days, 5:24:53/0:29:10, loss=0.274487397180008, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=1.3181026709592396, lr=7.38443280536e-07
2023-11-23 13:50:13   INFO  epoch: 29/30, acc_iter=196873, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:35:46/0:11:41, time_cost(all): 2 days, 5:25:42/0:28:49, loss=0.274404283193699, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=0.5479363524802391, lr=6.91591737935e-07
2023-11-23 13:51:03   INFO  epoch: 29/30, acc_iter=196923, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:36:35/0:11:34, time_cost(all): 2 days, 5:26:32/0:27:55, loss=0.27432116920739, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=2.9547925876116046, lr=6.44740195334e-07
2023-11-23 13:51:52   INFO  epoch: 29/30, acc_iter=196973, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:37:24/0:10:25, time_cost(all): 2 days, 5:27:21/0:29:00, loss=0.27423805522108, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=1.4712989454598346, lr=5.97888652733e-07
2023-11-23 13:52:41   INFO  epoch: 29/30, acc_iter=197023, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:38:14/0:09:12, time_cost(all): 2 days, 5:28:10/0:26:26, loss=0.274154941234771, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=2.7822295247400284, lr=5.51037110133e-07
2023-11-23 13:53:30   INFO  epoch: 29/30, acc_iter=197073, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:39:03/0:08:21, time_cost(all): 2 days, 5:28:59/0:26:46, loss=0.274071827248462, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=4.224257755921845, lr=5.04185567532e-07
2023-11-23 13:54:19   INFO  epoch: 29/30, acc_iter=197123, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:39:52/0:07:37, time_cost(all): 2 days, 5:29:48/0:25:33, loss=0.273988713262153, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=2.955244656916846, lr=4.57334024931e-07
2023-11-23 13:55:08   INFO  epoch: 29/30, acc_iter=197173, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:40:41/0:07:11, time_cost(all): 2 days, 5:30:37/0:25:47, loss=0.273905599275844, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=3.5072408992518924, lr=4.10482482331e-07
2023-11-23 13:55:57   INFO  epoch: 29/30, acc_iter=197223, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:41:30/0:06:04, time_cost(all): 2 days, 5:31:26/0:24:16, loss=0.273822485289535, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=2.839906635305307, lr=3.6363093973e-07
2023-11-23 13:56:46   INFO  epoch: 29/30, acc_iter=197273, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 1:42:19/0:05:20, time_cost(all): 2 days, 5:32:15/0:22:56, loss=0.273739371303226, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=2.8932974973629317, lr=3.16779397129e-07
2023-11-23 13:57:36   INFO  epoch: 29/30, acc_iter=197323, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 1:43:08/0:04:51, time_cost(all): 2 days, 5:33:05/0:23:20, loss=0.273656257316917, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=2.2049339897831266, lr=2.69927854528e-07
2023-11-23 13:58:25   INFO  epoch: 29/30, acc_iter=197373, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 1:43:57/0:03:51, time_cost(all): 2 days, 5:33:54/0:20:40, loss=0.273573143330608, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=1.6033258868370357, lr=2.23076311928e-07
2023-11-23 13:59:14   INFO  epoch: 29/30, acc_iter=197423, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 1:44:47/0:03:09, time_cost(all): 2 days, 5:34:43/0:21:08, loss=0.273490029344299, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=4.886850185043812, lr=1.76224769327e-07
2023-11-23 14:00:03   INFO  epoch: 29/30, acc_iter=197473, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 1:45:36/0:02:13, time_cost(all): 2 days, 5:35:32/0:19:47, loss=0.27340691535799, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=0.6302443314297743, lr=1.29373226726e-07
2023-11-23 14:00:52   INFO  epoch: 29/30, acc_iter=197523, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 1:46:25/0:01:27, time_cost(all): 2 days, 5:36:21/0:18:13, loss=0.273323801371681, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=3.7033400202812894, lr=8.2521684125e-08
2023-11-23 14:01:41   INFO  epoch: 29/30, acc_iter=197573, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 1:47:14/0:00:35, time_cost(all): 2 days, 5:37:10/0:17:44, loss=0.273240687385372, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=3.1721636299547966, lr=3.5670141525e-08
2023-11-23 14:01:41   INFO  **********************End training picture_models/picture_waymo_ssl_seal_decoder_mask(offline_30e)**********************