2023-12-05 10:53:08   INFO  **********************Start logging**********************14:11:03,271
2023-12-05 10:53:08   INFO  CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
2023-12-05 10:53:08   INFO  cfg_file         cfgs/picture_models/picture_nuscenes_ssl_seal_decoder_mask.yaml
2023-12-05 10:53:08   INFO  batch_size       4
2023-12-05 10:53:08   INFO  epochs           72
2023-12-05 10:53:08   INFO  workers          4
2023-12-05 10:53:08   INFO  extra_tag        offline_30e
2023-12-05 10:53:08   INFO  ckpt             None
2023-12-05 10:53:08   INFO  pretrained_model None
2023-12-05 10:53:08   INFO  launcher         none
2023-12-05 10:53:08   INFO  tcp_port         18888
2023-12-05 10:53:08   INFO  sync_bn          False
2023-12-05 10:53:08   INFO  fix_random_seed  False
2023-12-05 10:53:08   INFO  ckpt_save_interval 1
2023-12-05 10:53:08   INFO  local_rank       0
2023-12-05 10:53:08   INFO  max_ckpt_save_num 1
2023-12-05 10:53:08   INFO  merge_all_iters_to_one_epoch False
2023-12-05 10:53:08   INFO  set_cfgs         None
2023-12-05 10:53:08   INFO  max_waiting_mins 0
2023-12-05 10:53:08   INFO  start_epoch      0
2023-12-05 10:53:08   INFO  num_epochs_to_eval 0
2023-12-05 10:53:08   INFO  save_to_file     False
2023-12-05 10:53:08   INFO  use_tqdm_to_record False
2023-12-05 10:53:08   INFO  logger_iter_interval 50
2023-12-05 10:53:08   INFO  ckpt_save_time_interval 300
2023-12-05 10:53:08   INFO  wo_gpu_stat      False
2023-12-05 10:53:08   INFO  fp16             False
2023-12-05 10:53:08   INFO  cfg.LOCAL_RANK: 0
2023-12-05 10:53:08   INFO  
cfg.DATA_CONFIG = edict()
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.DATASET: NuScenesDataset
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.DATA_PATH: ../data/nuscenes
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.VERSION: v1.0-trainval
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.PRED_VELOCITY: True
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.SET_NAN_VELOCITY_TO_ZEROS: True
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.FILTER_MIN_POINTS_IN_GT: 1
2023-12-05 10:53:08   INFO  
cfg.DATA_CONFIG.DATA_SPLIT = edict()
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.DATA_SPLIT.train: train
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.DATA_SPLIT.test: val
2023-12-05 10:53:08   INFO  
cfg.DATA_CONFIG.INFO_PATH = edict()
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.INFO_PATH.train: ['nuscenes_infos_10sweeps_train.pkl']
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.INFO_PATH.test: ['nuscenes_infos_10sweeps_val.pkl']
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.POINT_CLOUD_RANGE: [-51.2, -51.2, -5.0, 51.2, 51.2, 3.0]
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.BALANCED_RESAMPLING: True
2023-12-05 10:53:08   INFO  
cfg.DATA_CONFIG.DATA_AUGMENTOR = edict()
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.DISABLE_AUG_LIST: ['placeholder']
2023-12-05 10:53:08   INFO  
cfg.DATA_CONFIG.POINT_FEATURE_ENCODING = edict()
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.encoding_type: absolute_coordinates_encoding
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.used_feature_list: ['x', 'y', 'z', 'intensity', 'timestamp']
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.src_feature_list: ['x', 'y', 'z', 'intensity', 'timestamp']
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG.DATA_PROCESSOR: [{'NAME': 'mask_points_and_boxes_outside_range', 'REMOVE_OUTSIDE_BOXES': True, 'MASK_Z': True}, {'NAME': 'shuffle_points', 'SHUFFLE_ENABLED': {'train': True, 'test': True}}, {'NAME': 'transform_points_to_voxels_placeholder', 'VOXEL_SIZE': [0.3, 0.3, 8.0]}]
2023-12-05 10:53:08   INFO  cfg.DATA_CONFIG._BASE_CONFIG_: cfgs/dataset_configs/nuscenes_dataset.yaml
2023-12-05 10:53:08   INFO  
cfg.MODEL = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.NAME: PICTURE
2023-12-05 10:53:08   INFO  
cfg.MODEL.VFE = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.VFE.NAME: DynPillarVFE3D
2023-12-05 10:53:08   INFO  cfg.MODEL.VFE.WITH_DISTANCE: False
2023-12-05 10:53:08   INFO  cfg.MODEL.VFE.USE_ABSLOTE_XYZ: True
2023-12-05 10:53:08   INFO  cfg.MODEL.VFE.USE_NORM: True
2023-12-05 10:53:08   INFO  cfg.MODEL.VFE.NUM_FILTERS: [256, 256]
2023-12-05 10:53:08   INFO  
cfg.MODEL.BACKBONE_3D = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.NAME: DSVTBackboneMAE
2023-12-05 10:53:08   INFO  
cfg.MODEL.BACKBONE_3D.INPUT_LAYER = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.sparse_shape: [ 512, 512, 40 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.downsample_stride: [ [ 1, 1, 4 ], [ 1, 1, 4 ], [ 1, 1, 2 ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.d_model: [ 256, 256, 256, 256 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.window_shape: [ [ 12, 12, 32 ], [ 12, 12, 8 ], [ 12, 12, 2 ], [ 12, 12, 1 ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.hybrid_factor: [2, 2, 1]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.shifts_list: [ [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.normalize_pos: False
2023-12-05 10:53:08   INFO  
cfg.MODEL.BACKBONE_3D.MASK_CONFIG = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.n_clusters: 8
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.n_partition: [3, 3, 2]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.lambda_threshold: 0.6
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.base_mask_ratio: [0.9, 0.45, 0]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.NUM_SEAL_FEATURES: 64
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.MASK_CONFIG.GENERATE_MODE: offline
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.block_name: [ 'DSVTBlock','DSVTBlock','DSVTBlock','DSVTBlock' ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.d_model: [ 256, 256, 256, 256 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.nhead: [ 8, 8, 8, 8 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [ 384, 384, 384, 384 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [ 512, 512 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 256
2023-12-05 10:53:08   INFO  
cfg.MODEL.BACKBONE_2D = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.NAME: LightDecoder
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.sparse_shape: [ 512, 512, 40 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.downsample_stride: [ [ 1, 1, 4 ], [ 1, 1, 4 ], [ 1, 1, 2 ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.d_model: [ 256, 256 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.set_info: [ [ 48, 1 ], [ 48, 1 ]]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.window_shape: [ [ 12, 12, 32 ], [ 12, 12, 1 ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.hybrid_factor: [ 2, 2, 1 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.shifts_list: [ [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_LAYER.shifts_list: normalize_pos: False
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.INPUT_SHAPE: [ 512, 512, 40 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_2D.NUM_BEV_FEATURES: 256
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.block_name: [ 'DSVTBlock','DSVTBlock' ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.set_info: [ [ 48, 1 ], [ 48, 1 ] ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.d_model: [ 256, 256 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.nhead: [ 8, 8 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [ 384, 384 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [ 512, 512 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 256
2023-12-05 10:53:08   INFO  
cfg.MODEL.DENSE_HEAD = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.DENSE_HEAD.NAME: PretrainHead3D
2023-12-05 10:53:08   INFO  cfg.MODEL.DENSE_HEAD.CLASS_AGNOSTIC: False
2023-12-05 10:53:08   INFO  
cfg.MODEL.DENSE_HEAD.MASK_CONFIG = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.DENSE_HEAD.MASK_CONFIG.NUM_PRD_POINTS: 16
2023-12-05 10:53:08   INFO  cfg.MODEL.DENSE_HEAD.MASK_CONFIG.NUM_GT_POINTS: 64
2023-12-05 10:53:08   INFO  cfg.MODEL.DENSE_HEAD.INPUT_SHAPE: [ 512, 512, 1 ]
2023-12-05 10:53:08   INFO  cfg.MODEL.DENSE_HEAD.NUM_MINK_FEATURES: 64
2023-12-05 10:53:08   INFO  cfg.MODEL.DENSE_HEAD.LOSS_WEIGHT: [1.0, 3.0]
2023-12-05 10:53:08   INFO  cfg.MODEL.DENSE_HEAD.GENERATE_MODE: offline
2023-12-05 10:53:08   INFO  
cfg.MODEL.POST_PROCESSING = edict()
2023-12-05 10:53:08   INFO  cfg.MODEL.POST_PROCESSING: None
2023-12-05 10:53:08   INFO  
cfg.OPTIMIZATION = edict()
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.BATCH_SIZE_PER_GPU: 4
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.NUM_EPOCHS: 72
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.OPTIMIZER: adamw
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.LR: 0.005
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.WEIGHT_DECAY: 0.05
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.MOMENTUM: 0.9
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.MOMS: [0.95, 0.85]
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.PCT_START: 0.4
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.DIV_FACTOR: 10
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.DECAY_STEP_LIST: [35, 45]
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.LR_DECAY: 0.1
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.LR_CLIP: 1e-07
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.LR_WARMUP: False
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.WARMUP_EPOCH: 1
2023-12-05 10:53:08   INFO  cfg.OPTIMIZATION.GRAD_NORM_CLIP: 35
2023-12-05 10:53:08   INFO  cfg.TAG: picture_nuscenes_ssl_seal_decoder_mask
2023-12-05 10:53:08   INFO  cfg.EXP_GROUP_PATH: picture_models
2023-12-05 10:53:08   INFO  Loading NuScenes dataset
2023-12-05 10:53:12   INFO  Total skipped info 0
2023-12-05 10:53:12   INFO  Total samples for NuScenes dataset: 28130
2023-12-05 10:53:12   INFO  Total samples after balanced resampling: 123580
2023-12-05 10:53:15   INFO  PICTURE(
  (vfe): DynamicPillarVFE_3d(
    (pfn_layers): ModuleList(
      (0): PFNLayerV2(
        (linear): Linear(in_features=11, out_features=64, bias=False)
        (norm): BatchNorm1d(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (relu): ReLU()
      )
      (1): PFNLayerV2(
        (linear): Linear(in_features=256, out_features=256, bias=False)
        (norm): BatchNorm1d(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (relu): ReLU()
      )
    )
  )
  (backbone_3d): DSVTBackboneMAE(
    (input_layer): DSVTInputLayer(
      (posembed_layers): ModuleList(
        (0): ModuleList(
          (0): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
          )
          (1): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
          )
          (2): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
          )
          (3): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
          )
        )
      )
    )
    (stage_0): ModuleList(
      (0): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (1): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (2): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (3): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
    )
    (residual_norm_stage_0): ModuleList(
      (0): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
      (1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
      (2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
      (3): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
    )
  )
  (map_to_bev_module): None
  (pfe): None
  (backbone_2d): LightDecoder(
    (input_layer): DSVTInputLayer(
      (posembed_layers): ModuleList(
        (0): ModuleList(
          (0): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
          )
          (1): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=256, bias=True)
                (1): BatchNorm1d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=256, out_features=256, bias=True)
              )
            )
          )
        )
      )
    )
    (stage_0): ModuleList(
      (0): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (1): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
              )
              (linear1): Linear(in_features=256, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=256, bias=True)
              (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
    )
    (residual_norm_stage_0): ModuleList(
      (0): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
      (1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
    )
  )
  (dense_head): PretrainHead3D(
    (decoder_pred): Linear(in_features=256, out_features=48, bias=True)
    (decoder_seal): Linear(in_features=256, out_features=64, bias=True)
    (seal_loss): SmoothL1Loss()
  )
  (point_head): None
  (roi_head): None
)
2023-12-05 10:54:12   INFO  Total number of parameters: 6238451
2023-12-05 10:54:12   INFO  **********************Start training picture_models/picture_nuscenes_ssl_seal_decoder_mask(offline_30e)**********************
2023-12-05 10:55:56   INFO  epoch: 0/72, acc_iter=50, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:08, time_cost(all): 0:00:41/2 days, 18:17:17, loss=3.301661112993501, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=3.3698413850724056, lr=0.005404583117555671
2023-12-05 10:56:38   INFO  epoch: 0/72, acc_iter=100, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:58, time_cost(all): 0:01:23/2 days, 14:47:05, loss=3.159468422835948, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=1.9179717844338247, lr=0.005809166235111341
2023-12-05 10:57:20   INFO  epoch: 0/72, acc_iter=150, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:56, time_cost(all): 0:02:05/2 days, 16:52:51, loss=3.017275732678395, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=3.290096444902586, lr=0.006213749352667012
2023-12-05 10:58:02   INFO  epoch: 0/72, acc_iter=200, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:52:13, time_cost(all): 0:02:47/2 days, 16:49:34, loss=2.875083042520843, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.89(1.03), norm=0.5188903405614039, lr=0.006618332470222683
2023-12-05 10:58:43   INFO  epoch: 0/72, acc_iter=250, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:40, time_cost(all): 0:03:28/2 days, 14:33:14, loss=2.73289035236329, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=4.442607751524147, lr=0.007022915587778353
2023-12-05 10:59:25   INFO  epoch: 0/72, acc_iter=300, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:11, time_cost(all): 0:04:10/2 days, 18:11:58, loss=2.590697662205737, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=3.283997346186123, lr=0.007427498705334024
2023-12-05 11:00:07   INFO  epoch: 0/72, acc_iter=350, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:48, time_cost(all): 0:04:52/2 days, 19:26:23, loss=2.448504972048185, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=0.7651789208998219, lr=0.007832081822889695
2023-12-05 11:00:49   INFO  epoch: 0/72, acc_iter=400, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:35, time_cost(all): 0:05:34/2 days, 15:13:47, loss=2.306312281890632, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=3.5778370793211067, lr=0.008236664940445365
2023-12-05 11:01:31   INFO  epoch: 0/72, acc_iter=450, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:42, time_cost(all): 0:06:16/2 days, 16:17:42, loss=2.164119591733079, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=1.5348600008966762, lr=0.008641248058001037
2023-12-05 11:02:12   INFO  epoch: 0/72, acc_iter=500, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:09, time_cost(all): 0:06:57/2 days, 14:31:40, loss=2.021926901575527, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=1.1243695887881087, lr=0.009045831175556707
2023-12-05 11:02:54   INFO  epoch: 0/72, acc_iter=550, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:42, time_cost(all): 0:07:39/2 days, 17:47:31, loss=1.879734211417974, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=1.0055468826284917, lr=0.009450414293112379
2023-12-05 11:03:36   INFO  epoch: 0/72, acc_iter=600, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:43, time_cost(all): 0:08:21/2 days, 17:17:02, loss=1.737541521260421, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=0.8612477041993459, lr=0.009854997410668049
2023-12-05 11:04:18   INFO  epoch: 0/72, acc_iter=650, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:38, time_cost(all): 0:09:03/2 days, 16:55:03, loss=1.595348831102869, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=4.6154221634871515, lr=0.010259580528223719
2023-12-05 11:04:59   INFO  epoch: 0/72, acc_iter=700, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:07, time_cost(all): 0:09:44/2 days, 15:52:51, loss=1.453156140945316, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=4.8400190749343555, lr=0.010664163645779389
2023-12-05 11:05:41   INFO  epoch: 0/72, acc_iter=750, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:57, time_cost(all): 0:10:26/2 days, 18:10:43, loss=1.310963450787763, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=1.6072381782121086, lr=0.01106874676333506
2023-12-05 11:06:23   INFO  epoch: 0/72, acc_iter=800, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:37, time_cost(all): 0:11:08/2 days, 19:34:00, loss=1.168770760630211, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=4.065190038115655, lr=0.011473329880890733
2023-12-05 11:07:05   INFO  epoch: 0/72, acc_iter=850, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:42, time_cost(all): 0:11:50/2 days, 19:26:31, loss=1.026578070472658, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.99(1.03), norm=4.689877520507417, lr=0.011877912998446403
2023-12-05 11:07:47   INFO  epoch: 0/72, acc_iter=900, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:15, time_cost(all): 0:12:32/2 days, 15:07:41, loss=0.884385380315105, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=0.5470615011537928, lr=0.012282496116002073
2023-12-05 11:08:28   INFO  epoch: 0/72, acc_iter=950, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:38:54, time_cost(all): 0:13:13/2 days, 19:17:34, loss=0.742192690157553, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=2.6036379106832315, lr=0.012687079233557743
2023-12-05 11:09:10   INFO  epoch: 0/72, acc_iter=1000, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:34, time_cost(all): 0:13:55/2 days, 17:48:58, loss=0.666425200272253, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=2.79003865910453, lr=0.013091662351113413
2023-12-05 11:09:52   INFO  epoch: 0/72, acc_iter=1050, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:12, time_cost(all): 0:14:37/2 days, 13:22:47, loss=0.599940802574059, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=4.270645287566369, lr=0.013496245468669083
2023-12-05 11:10:34   INFO  epoch: 0/72, acc_iter=1100, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:59, time_cost(all): 0:15:19/2 days, 18:42:24, loss=0.599881605148118, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=4.63998176345559, lr=0.013900828586224757
2023-12-05 11:11:15   INFO  epoch: 0/72, acc_iter=1150, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:48, time_cost(all): 0:16:00/2 days, 16:09:24, loss=0.599822407722177, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.6372680673408055, lr=0.014305411703780427
2023-12-05 11:11:57   INFO  epoch: 0/72, acc_iter=1200, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:54, time_cost(all): 0:16:42/2 days, 14:49:47, loss=0.599763210296236, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=3.4607606709987278, lr=0.014709994821336097
2023-12-05 11:12:39   INFO  epoch: 0/72, acc_iter=1250, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:33, time_cost(all): 0:17:24/2 days, 18:00:06, loss=0.599704012870295, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=4.558143814577122, lr=0.015114577938891767
2023-12-05 11:13:21   INFO  epoch: 0/72, acc_iter=1300, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:06, time_cost(all): 0:18:06/2 days, 17:28:35, loss=0.599644815444354, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=4.7876543980548725, lr=0.015519161056447437
2023-12-05 11:14:03   INFO  epoch: 0/72, acc_iter=1350, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:21, time_cost(all): 0:18:48/2 days, 13:01:51, loss=0.599585618018413, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=2.886008704983521, lr=0.015923744174003107
2023-12-05 11:14:44   INFO  epoch: 0/72, acc_iter=1400, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:54, time_cost(all): 0:19:29/2 days, 16:12:09, loss=0.599526420592472, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=1.365636699822282, lr=0.01632832729155878
2023-12-05 11:15:26   INFO  epoch: 0/72, acc_iter=1450, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:35:03, time_cost(all): 0:20:11/2 days, 17:39:36, loss=0.599467223166532, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=4.629585335524428, lr=0.01673291040911445
2023-12-05 11:16:08   INFO  epoch: 0/72, acc_iter=1500, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:09, time_cost(all): 0:20:53/2 days, 13:56:20, loss=0.599408025740591, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=2.7952960486988156, lr=0.01713749352667012
2023-12-05 11:16:50   INFO  epoch: 0/72, acc_iter=1550, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:43, time_cost(all): 0:21:35/2 days, 16:12:04, loss=0.59934882831465, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=1.853310239912834, lr=0.01754207664422579
2023-12-05 11:17:31   INFO  epoch: 0/72, acc_iter=1600, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:52, time_cost(all): 0:22:16/2 days, 14:51:27, loss=0.599289630888709, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=2.576563905874663, lr=0.017946659761781465
2023-12-05 11:18:13   INFO  epoch: 0/72, acc_iter=1650, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:25, time_cost(all): 0:22:58/2 days, 14:53:40, loss=0.599230433462768, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=2.035595578604833, lr=0.018351242879337135
2023-12-05 11:18:55   INFO  epoch: 0/72, acc_iter=1700, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:56, time_cost(all): 0:23:40/2 days, 18:14:35, loss=0.599171236036827, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=0.8460176467926367, lr=0.018755825996892805
2023-12-05 11:19:37   INFO  epoch: 0/72, acc_iter=1750, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:02, time_cost(all): 0:24:22/2 days, 17:31:32, loss=0.599112038610886, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=1.2128684354662789, lr=0.019160409114448475
2023-12-05 11:20:19   INFO  epoch: 0/72, acc_iter=1800, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:45, time_cost(all): 0:25:04/2 days, 17:40:10, loss=0.599052841184945, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=1.6626691354772811, lr=0.019564992232004145
2023-12-05 11:21:00   INFO  epoch: 0/72, acc_iter=1850, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:43, time_cost(all): 0:25:45/2 days, 17:21:50, loss=0.598993643759004, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=2.706611013842147, lr=0.019969575349559815
2023-12-05 11:21:42   INFO  epoch: 0/72, acc_iter=1900, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:01, time_cost(all): 0:26:27/2 days, 14:29:33, loss=0.598934446333063, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=2.9908811995038036, lr=0.02037415846711549
2023-12-05 11:22:24   INFO  epoch: 0/72, acc_iter=1950, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:55, time_cost(all): 0:27:09/2 days, 18:54:37, loss=0.598875248907122, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=1.6055503101328514, lr=0.02077874158467116
2023-12-05 11:23:06   INFO  epoch: 0/72, acc_iter=2000, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:17, time_cost(all): 0:27:51/2 days, 16:09:59, loss=0.598816051481181, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=0.5443793009549975, lr=0.02118332470222683
2023-12-05 11:23:47   INFO  epoch: 0/72, acc_iter=2050, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:26, time_cost(all): 0:28:32/2 days, 18:19:51, loss=0.59875685405524, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=3.497734411638674, lr=0.0215879078197825
2023-12-05 11:24:29   INFO  epoch: 0/72, acc_iter=2100, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:26, time_cost(all): 0:29:14/2 days, 17:02:40, loss=0.598697656629299, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=2.799679910381366, lr=0.02199249093733817
2023-12-05 11:25:11   INFO  epoch: 0/72, acc_iter=2150, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:54, time_cost(all): 0:29:56/2 days, 14:23:55, loss=0.598638459203358, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=3.3892879796150632, lr=0.02239707405489384
2023-12-05 11:25:53   INFO  epoch: 0/72, acc_iter=2200, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:17, time_cost(all): 0:30:38/2 days, 17:19:14, loss=0.598579261777417, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=1.2611999260338456, lr=0.022801657172449512
2023-12-05 11:26:35   INFO  epoch: 0/72, acc_iter=2250, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:00, time_cost(all): 0:31:20/2 days, 14:01:10, loss=0.598520064351476, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=2.6260063663365023, lr=0.023206240290005183
2023-12-05 11:27:16   INFO  epoch: 0/72, acc_iter=2300, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:08, time_cost(all): 0:32:01/2 days, 19:09:12, loss=0.598460866925535, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=2.3983672291634144, lr=0.023610823407560853
2023-12-05 11:27:58   INFO  epoch: 0/72, acc_iter=2350, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:39, time_cost(all): 0:32:43/2 days, 18:24:30, loss=0.598401669499595, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=3.0174900989796916, lr=0.024015406525116523
2023-12-05 11:28:40   INFO  epoch: 0/72, acc_iter=2400, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:49, time_cost(all): 0:33:25/2 days, 15:21:52, loss=0.598342472073654, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=2.5630420432869925, lr=0.024419989642672193
2023-12-05 11:29:22   INFO  epoch: 0/72, acc_iter=2450, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:19, time_cost(all): 0:34:07/2 days, 13:07:38, loss=0.598283274647713, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.636398759413715, lr=0.024824572760227863
2023-12-05 11:30:03   INFO  epoch: 0/72, acc_iter=2500, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:25, time_cost(all): 0:34:48/2 days, 13:43:55, loss=0.598224077221772, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=1.7377750359269704, lr=0.025229155877783536
2023-12-05 11:30:45   INFO  epoch: 0/72, acc_iter=2550, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:31, time_cost(all): 0:35:30/2 days, 18:05:22, loss=0.598164879795831, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=3.129693961379214, lr=0.025633738995339207
2023-12-05 11:31:27   INFO  epoch: 0/72, acc_iter=2600, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:00, time_cost(all): 0:36:12/2 days, 18:57:30, loss=0.59810568236989, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=1.9019547589615462, lr=0.026038322112894877
2023-12-05 11:32:09   INFO  epoch: 0/72, acc_iter=2650, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:16, time_cost(all): 0:36:54/2 days, 14:01:16, loss=0.598046484943949, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=1.355654628776907, lr=0.026442905230450547
2023-12-05 11:32:51   INFO  epoch: 0/72, acc_iter=2700, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:45, time_cost(all): 0:37:36/2 days, 15:17:46, loss=0.597987287518008, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=4.83475186132955, lr=0.026847488348006217
2023-12-05 11:33:32   INFO  epoch: 0/72, acc_iter=2750, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:05, time_cost(all): 0:38:17/2 days, 16:42:58, loss=0.597928090092067, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=0.873962880265472, lr=0.027252071465561887
2023-12-05 11:34:14   INFO  epoch: 0/72, acc_iter=2800, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:53, time_cost(all): 0:38:59/2 days, 12:59:04, loss=0.597868892666126, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=1.0780005916516249, lr=0.02765665458311756
2023-12-05 11:34:56   INFO  epoch: 0/72, acc_iter=2850, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:44, time_cost(all): 0:39:41/2 days, 15:53:08, loss=0.597809695240185, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=0.5224194473956603, lr=0.02806123770067323
2023-12-05 11:35:38   INFO  epoch: 0/72, acc_iter=2900, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:25, time_cost(all): 0:40:23/2 days, 18:58:01, loss=0.597750497814244, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=3.194001092697138, lr=0.0284658208182289
2023-12-05 11:36:20   INFO  epoch: 0/72, acc_iter=2950, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:13, time_cost(all): 0:41:05/2 days, 13:14:29, loss=0.597691300388303, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.2(1.03), norm=3.697506411887667, lr=0.02887040393578457
2023-12-05 11:37:01   INFO  epoch: 0/72, acc_iter=3000, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:44, time_cost(all): 0:41:46/2 days, 18:28:57, loss=0.597632102962362, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=0.9555516684655475, lr=0.02927498705334024
2023-12-05 11:37:43   INFO  epoch: 0/72, acc_iter=3050, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:13, time_cost(all): 0:42:28/2 days, 18:18:11, loss=0.597572905536421, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=3.271595288464467, lr=0.02967957017089591
2023-12-05 11:38:25   INFO  epoch: 0/72, acc_iter=3100, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:27, time_cost(all): 0:43:10/2 days, 13:25:05, loss=0.59751370811048, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=2.9731167822726663, lr=0.030084153288451584
2023-12-05 11:39:07   INFO  epoch: 0/72, acc_iter=3150, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:12, time_cost(all): 0:43:52/2 days, 16:16:30, loss=0.597454510684539, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=3.2906780610894253, lr=0.030488736406007255
2023-12-05 11:39:48   INFO  epoch: 0/72, acc_iter=3200, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:22, time_cost(all): 0:44:33/2 days, 18:05:15, loss=0.597395313258599, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=2.096414108720481, lr=0.030893319523562928
2023-12-05 11:40:30   INFO  epoch: 0/72, acc_iter=3250, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:48, time_cost(all): 0:45:15/2 days, 17:27:59, loss=0.597336115832658, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=3.575500754227935, lr=0.0312979026411186
2023-12-05 11:41:12   INFO  epoch: 0/72, acc_iter=3300, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:37, time_cost(all): 0:45:57/2 days, 13:54:45, loss=0.597276918406717, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=3.675829668590117, lr=0.031702485758674265
2023-12-05 11:41:54   INFO  epoch: 0/72, acc_iter=3350, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:49, time_cost(all): 0:46:39/2 days, 18:42:04, loss=0.597217720980776, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=0.6737191390500261, lr=0.03210706887622994
2023-12-05 11:42:36   INFO  epoch: 0/72, acc_iter=3400, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:41, time_cost(all): 0:47:21/2 days, 14:21:42, loss=0.597158523554835, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=3.883860099450497, lr=0.032511651993785605
2023-12-05 11:43:17   INFO  epoch: 0/72, acc_iter=3450, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:34, time_cost(all): 0:48:02/2 days, 12:51:29, loss=0.597099326128894, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=4.579877296635486, lr=0.03291623511134128
2023-12-05 11:43:59   INFO  epoch: 0/72, acc_iter=3500, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:03, time_cost(all): 0:48:44/2 days, 17:02:49, loss=0.597040128702953, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=3.777499292461326, lr=0.033320818228896945
2023-12-05 11:44:41   INFO  epoch: 0/72, acc_iter=3550, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:11, time_cost(all): 0:49:26/2 days, 14:32:24, loss=0.596980931277012, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=0.916300388592009, lr=0.03372540134645262
2023-12-05 11:45:23   INFO  epoch: 0/72, acc_iter=3600, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:35, time_cost(all): 0:50:08/2 days, 14:25:48, loss=0.596921733851071, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=0.9663959288785375, lr=0.03412998446400829
2023-12-05 11:46:04   INFO  epoch: 0/72, acc_iter=3650, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:05, time_cost(all): 0:50:49/2 days, 16:32:16, loss=0.59686253642513, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=2.892536561618346, lr=0.03453456758156396
2023-12-05 11:46:46   INFO  epoch: 0/72, acc_iter=3700, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:17, time_cost(all): 0:51:31/2 days, 14:45:39, loss=0.596803338999189, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=4.563504458833678, lr=0.03493915069911963
2023-12-05 11:47:28   INFO  epoch: 0/72, acc_iter=3750, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 0:52:13/2 days, 15:11:44, loss=0.596744141573248, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=4.006081519415706, lr=0.0353437338166753
2023-12-05 11:48:10   INFO  epoch: 0/72, acc_iter=3800, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 0:52:55/2 days, 17:42:26, loss=0.596684944147307, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=1.019998894209773, lr=0.03574831693423097
2023-12-05 11:48:52   INFO  epoch: 0/72, acc_iter=3850, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 0:53:37/2 days, 18:14:02, loss=0.596625746721366, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=1.5765559748900522, lr=0.036152900051786646
2023-12-05 11:49:33   INFO  epoch: 1/72, acc_iter=3912, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:46, time_cost(all): 0:54:18/2 days, 18:15:40, loss=0.5965523419132, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=1.7410139353649166, lr=0.03665458311755568
2023-12-05 11:50:15   INFO  epoch: 1/72, acc_iter=3962, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:27, time_cost(all): 0:55:00/2 days, 18:43:52, loss=0.596493144487259, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=2.5188931137664845, lr=0.037059166235111345
2023-12-05 11:50:57   INFO  epoch: 1/72, acc_iter=4012, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:51, time_cost(all): 0:55:42/2 days, 18:23:11, loss=0.596433947061318, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=1.8845842618620443, lr=0.03746374935266702
2023-12-05 11:51:39   INFO  epoch: 1/72, acc_iter=4062, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:00, time_cost(all): 0:56:24/2 days, 17:26:31, loss=0.596374749635377, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=4.023407355744647, lr=0.037868332470222685
2023-12-05 11:52:20   INFO  epoch: 1/72, acc_iter=4112, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:54, time_cost(all): 0:57:05/2 days, 15:43:46, loss=0.596315552209436, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=3.872661644756553, lr=0.03827291558777836
2023-12-05 11:53:02   INFO  epoch: 1/72, acc_iter=4162, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:10, time_cost(all): 0:57:47/2 days, 13:33:11, loss=0.596256354783495, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=4.392893919995152, lr=0.038677498705334025
2023-12-05 11:53:44   INFO  epoch: 1/72, acc_iter=4212, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:51:04, time_cost(all): 0:58:29/2 days, 13:37:04, loss=0.596197157357554, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=0.54553630010152, lr=0.0390820818228897
2023-12-05 11:54:26   INFO  epoch: 1/72, acc_iter=4262, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:39, time_cost(all): 0:59:11/2 days, 16:38:32, loss=0.596137959931613, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=1.9841965391441008, lr=0.039486664940445365
2023-12-05 11:55:08   INFO  epoch: 1/72, acc_iter=4312, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:30, time_cost(all): 0:59:53/2 days, 12:43:44, loss=0.596078762505672, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=4.2384254216377855, lr=0.03989124805800104
2023-12-05 11:55:49   INFO  epoch: 1/72, acc_iter=4362, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:55, time_cost(all): 1:00:34/2 days, 14:06:33, loss=0.596019565079731, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=2.8402329252879657, lr=0.040295831175556705
2023-12-05 11:56:31   INFO  epoch: 1/72, acc_iter=4412, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:48:20, time_cost(all): 1:01:16/2 days, 15:29:17, loss=0.59596036765379, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=4.382199326254968, lr=0.04070041429311238
2023-12-05 11:57:13   INFO  epoch: 1/72, acc_iter=4462, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:40, time_cost(all): 1:01:58/2 days, 16:17:56, loss=0.595901170227849, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=0.8927071767460961, lr=0.04110499741066805
2023-12-05 11:57:55   INFO  epoch: 1/72, acc_iter=4512, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:47, time_cost(all): 1:02:40/2 days, 18:09:04, loss=0.595841972801908, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.909058905063472, lr=0.041509580528223726
2023-12-05 11:58:36   INFO  epoch: 1/72, acc_iter=4562, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:21, time_cost(all): 1:03:21/2 days, 13:33:02, loss=0.595782775375967, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.4982519036953104, lr=0.04191416364577939
2023-12-05 11:59:18   INFO  epoch: 1/72, acc_iter=4612, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:43, time_cost(all): 1:04:03/2 days, 15:10:05, loss=0.595723577950026, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=3.5178015844233186, lr=0.042318746763335066
2023-12-05 12:00:00   INFO  epoch: 1/72, acc_iter=4662, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:17, time_cost(all): 1:04:45/2 days, 13:40:12, loss=0.595664380524085, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=1.1050241526644502, lr=0.04272332988089073
2023-12-05 12:00:42   INFO  epoch: 1/72, acc_iter=4712, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:11, time_cost(all): 1:05:27/2 days, 16:43:38, loss=0.595605183098144, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=1.9386481368221613, lr=0.043127912998446406
2023-12-05 12:01:24   INFO  epoch: 1/72, acc_iter=4762, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:17, time_cost(all): 1:06:09/2 days, 16:04:49, loss=0.595545985672204, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=0.7751492601380473, lr=0.04353249611600207
2023-12-05 12:02:05   INFO  epoch: 1/72, acc_iter=4812, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:28, time_cost(all): 1:06:50/2 days, 12:20:56, loss=0.595486788246263, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=2.517990166110034, lr=0.043937079233557746
2023-12-05 12:02:47   INFO  epoch: 1/72, acc_iter=4862, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:46, time_cost(all): 1:07:32/2 days, 15:49:55, loss=0.595427590820322, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=1.8413504467393464, lr=0.04434166235111341
2023-12-05 12:03:29   INFO  epoch: 1/72, acc_iter=4912, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:47, time_cost(all): 1:08:14/2 days, 12:35:29, loss=0.595368393394381, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=2.4510454489992783, lr=0.04474624546866909
2023-12-05 12:04:11   INFO  epoch: 1/72, acc_iter=4962, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:32, time_cost(all): 1:08:56/2 days, 17:37:44, loss=0.59530919596844, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.92(1.03), norm=1.7048124398779159, lr=0.04515082858622475
2023-12-05 12:04:52   INFO  epoch: 1/72, acc_iter=5012, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:28, time_cost(all): 1:09:37/2 days, 16:03:18, loss=0.595249998542499, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=4.909182275348913, lr=0.04555541170378043
2023-12-05 12:05:34   INFO  epoch: 1/72, acc_iter=5062, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:48, time_cost(all): 1:10:19/2 days, 17:52:01, loss=0.595190801116558, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=3.2196903012599254, lr=0.0459599948213361
2023-12-05 12:06:16   INFO  epoch: 1/72, acc_iter=5112, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:24, time_cost(all): 1:11:01/2 days, 14:12:22, loss=0.595131603690617, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=4.980837289707348, lr=0.046364577938891774
2023-12-05 12:06:58   INFO  epoch: 1/72, acc_iter=5162, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:40, time_cost(all): 1:11:43/2 days, 14:05:19, loss=0.595072406264676, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=4.027787219823724, lr=0.04676916105644744
2023-12-05 12:07:40   INFO  epoch: 1/72, acc_iter=5212, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:36, time_cost(all): 1:12:25/2 days, 13:42:24, loss=0.595013208838735, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=3.1115387390016003, lr=0.047173744174003114
2023-12-05 12:08:21   INFO  epoch: 1/72, acc_iter=5262, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:32:46, time_cost(all): 1:13:06/2 days, 13:21:03, loss=0.594954011412794, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=0.5771707937651136, lr=0.04757832729155878
2023-12-05 12:09:03   INFO  epoch: 1/72, acc_iter=5312, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:35:15, time_cost(all): 1:13:48/2 days, 16:40:35, loss=0.594894813986853, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=4.436334176860813, lr=0.047982910409114454
2023-12-05 12:09:45   INFO  epoch: 1/72, acc_iter=5362, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:55, time_cost(all): 1:14:30/2 days, 13:35:08, loss=0.594835616560912, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=0.9170657047939161, lr=0.04838749352667012
2023-12-05 12:10:27   INFO  epoch: 1/72, acc_iter=5412, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:42, time_cost(all): 1:15:12/2 days, 17:30:10, loss=0.594776419134971, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=1.28004952417427, lr=0.048792076644225794
2023-12-05 12:11:09   INFO  epoch: 1/72, acc_iter=5462, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:30, time_cost(all): 1:15:54/2 days, 15:01:57, loss=0.59471722170903, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=4.753295004937446, lr=0.04919665976178146
2023-12-05 12:11:50   INFO  epoch: 1/72, acc_iter=5512, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:05, time_cost(all): 1:16:35/2 days, 16:29:29, loss=0.594658024283089, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=3.9063665523478064, lr=0.049601242879337135
2023-12-05 12:12:32   INFO  epoch: 1/72, acc_iter=5562, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:55, time_cost(all): 1:17:17/2 days, 13:37:43, loss=0.594598826857148, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=2.2511690497967125, lr=0.05001456499223201
2023-12-05 12:13:14   INFO  epoch: 1/72, acc_iter=5612, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:19, time_cost(all): 1:17:59/2 days, 12:25:35, loss=0.594539629431208, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=2.748123001763562, lr=0.051026022786121186
2023-12-05 12:13:56   INFO  epoch: 1/72, acc_iter=5662, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:24, time_cost(all): 1:18:41/2 days, 12:23:34, loss=0.594480432005267, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=3.8581785922528895, lr=0.05203748058001036
2023-12-05 12:14:37   INFO  epoch: 1/72, acc_iter=5712, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:36, time_cost(all): 1:19:22/2 days, 15:48:28, loss=0.594421234579326, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=2.4186413827880857, lr=0.05304893837389954
2023-12-05 12:15:19   INFO  epoch: 1/72, acc_iter=5762, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:54, time_cost(all): 1:20:04/2 days, 14:49:59, loss=0.594362037153385, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=3.5161726430576445, lr=0.05406039616778872
2023-12-05 12:16:01   INFO  epoch: 1/72, acc_iter=5812, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:36, time_cost(all): 1:20:46/2 days, 18:15:18, loss=0.594302839727444, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.807724063448876, lr=0.055071853961677894
2023-12-05 12:16:43   INFO  epoch: 1/72, acc_iter=5862, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:36, time_cost(all): 1:21:28/2 days, 14:59:41, loss=0.594243642301503, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.01(1.03), norm=3.1110560375476264, lr=0.05608331175556707
2023-12-05 12:17:25   INFO  epoch: 1/72, acc_iter=5912, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:37, time_cost(all): 1:22:10/2 days, 15:11:39, loss=0.594184444875562, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=2.2993306014611132, lr=0.05709476954945625
2023-12-05 12:18:06   INFO  epoch: 1/72, acc_iter=5962, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:12, time_cost(all): 1:22:51/2 days, 14:09:44, loss=0.594125247449621, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=1.9011854670434702, lr=0.058106227343345425
2023-12-05 12:18:48   INFO  epoch: 1/72, acc_iter=6012, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:19, time_cost(all): 1:23:33/2 days, 12:22:35, loss=0.59406605002368, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=3.8428042425599624, lr=0.0591176851372346
2023-12-05 12:19:30   INFO  epoch: 1/72, acc_iter=6062, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:13, time_cost(all): 1:24:15/2 days, 13:12:04, loss=0.594006852597739, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=1.6193898922006118, lr=0.06012914293112378
2023-12-05 12:20:12   INFO  epoch: 1/72, acc_iter=6112, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:06, time_cost(all): 1:24:57/2 days, 12:49:00, loss=0.593947655171798, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=1.5442406922990606, lr=0.061140600725012956
2023-12-05 12:20:53   INFO  epoch: 1/72, acc_iter=6162, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:21, time_cost(all): 1:25:38/2 days, 17:25:17, loss=0.593888457745857, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=3.806088210273693, lr=0.06215205851890213
2023-12-05 12:21:35   INFO  epoch: 1/72, acc_iter=6212, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:51, time_cost(all): 1:26:20/2 days, 12:24:37, loss=0.593829260319916, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=2.8988048095510646, lr=0.06316351631279131
2023-12-05 12:22:17   INFO  epoch: 1/72, acc_iter=6262, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:10, time_cost(all): 1:27:02/2 days, 14:55:27, loss=0.593770062893975, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=2.947521070115911, lr=0.06417497410668048
2023-12-05 12:22:59   INFO  epoch: 1/72, acc_iter=6312, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:23, time_cost(all): 1:27:44/2 days, 16:46:02, loss=0.593710865468034, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=2.3339962049252145, lr=0.06518643190056966
2023-12-05 12:23:41   INFO  epoch: 1/72, acc_iter=6362, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:03, time_cost(all): 1:28:26/2 days, 16:26:22, loss=0.593651668042093, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=0.7948699671818966, lr=0.06619788969445883
2023-12-05 12:24:22   INFO  epoch: 1/72, acc_iter=6412, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:42, time_cost(all): 1:29:07/2 days, 15:11:55, loss=0.593592470616153, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=3.2642931012587617, lr=0.06720934748834802
2023-12-05 12:25:04   INFO  epoch: 1/72, acc_iter=6462, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:37, time_cost(all): 1:29:49/2 days, 16:00:54, loss=0.593533273190212, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=0.5008997549399437, lr=0.06822080528223719
2023-12-05 12:25:46   INFO  epoch: 1/72, acc_iter=6512, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:33, time_cost(all): 1:30:31/2 days, 12:11:50, loss=0.593474075764271, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=3.423947020009092, lr=0.06923226307612637
2023-12-05 12:26:28   INFO  epoch: 1/72, acc_iter=6562, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:33, time_cost(all): 1:31:13/2 days, 16:09:55, loss=0.59341487833833, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=0.9975756596884997, lr=0.07024372087001554
2023-12-05 12:27:09   INFO  epoch: 1/72, acc_iter=6612, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:53, time_cost(all): 1:31:54/2 days, 13:57:36, loss=0.593355680912389, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=3.1122039152497547, lr=0.07125517866390471
2023-12-05 12:27:51   INFO  epoch: 1/72, acc_iter=6662, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:21, time_cost(all): 1:32:36/2 days, 11:55:52, loss=0.593296483486448, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=0.7063014085863377, lr=0.0722666364577939
2023-12-05 12:28:33   INFO  epoch: 1/72, acc_iter=6712, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:02, time_cost(all): 1:33:18/2 days, 15:23:24, loss=0.593237286060507, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=2.7626454842444566, lr=0.07327809425168308
2023-12-05 12:29:15   INFO  epoch: 1/72, acc_iter=6762, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:16, time_cost(all): 1:34:00/2 days, 13:14:08, loss=0.593178088634566, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=2.118248848518226, lr=0.07428955204557225
2023-12-05 12:29:57   INFO  epoch: 1/72, acc_iter=6812, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:01, time_cost(all): 1:34:42/2 days, 16:29:57, loss=0.593118891208625, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=2.3435066625692764, lr=0.07530100983946142
2023-12-05 12:30:38   INFO  epoch: 1/72, acc_iter=6862, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:14, time_cost(all): 1:35:23/2 days, 18:01:41, loss=0.593059693782684, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=4.959719898090638, lr=0.0763124676333506
2023-12-05 12:31:20   INFO  epoch: 1/72, acc_iter=6912, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:33, time_cost(all): 1:36:05/2 days, 15:33:53, loss=0.593000496356743, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=1.0659723654004059, lr=0.07732392542723979
2023-12-05 12:32:02   INFO  epoch: 1/72, acc_iter=6962, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:17, time_cost(all): 1:36:47/2 days, 15:15:45, loss=0.592941298930802, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=1.3740798255122122, lr=0.07833538322112896
2023-12-05 12:32:44   INFO  epoch: 1/72, acc_iter=7012, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:19, time_cost(all): 1:37:29/2 days, 15:05:52, loss=0.592882101504861, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=0.6955912260476568, lr=0.07934684101501813
2023-12-05 12:33:25   INFO  epoch: 1/72, acc_iter=7062, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:27, time_cost(all): 1:38:10/2 days, 13:53:39, loss=0.59282290407892, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=2.752352049684605, lr=0.0803582988089073
2023-12-05 12:34:07   INFO  epoch: 1/72, acc_iter=7112, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:13, time_cost(all): 1:38:52/2 days, 12:05:27, loss=0.592763706652979, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=2.839880543101676, lr=0.08136975660279648
2023-12-05 12:34:49   INFO  epoch: 1/72, acc_iter=7162, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:41, time_cost(all): 1:39:34/2 days, 12:43:14, loss=0.592704509227038, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=4.162383851207368, lr=0.08238121439668566
2023-12-05 12:35:31   INFO  epoch: 1/72, acc_iter=7212, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:18, time_cost(all): 1:40:16/2 days, 14:39:07, loss=0.592645311801097, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=3.8076619945082193, lr=0.08339267219057483
2023-12-05 12:36:13   INFO  epoch: 1/72, acc_iter=7262, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:24, time_cost(all): 1:40:58/2 days, 15:52:51, loss=0.592586114375157, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=0.5695654656534799, lr=0.084404129984464
2023-12-05 12:36:54   INFO  epoch: 1/72, acc_iter=7312, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:29, time_cost(all): 1:41:39/2 days, 12:41:27, loss=0.592526916949216, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=2.997130856110702, lr=0.08541558777835319
2023-12-05 12:37:36   INFO  epoch: 1/72, acc_iter=7362, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:08, time_cost(all): 1:42:21/2 days, 16:39:20, loss=0.592467719523275, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=4.467957007345419, lr=0.08642704557224237
2023-12-05 12:38:18   INFO  epoch: 1/72, acc_iter=7412, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:28, time_cost(all): 1:43:03/2 days, 15:01:33, loss=0.592408522097334, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=3.1950892433272537, lr=0.08743850336613154
2023-12-05 12:39:00   INFO  epoch: 1/72, acc_iter=7462, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:36, time_cost(all): 1:43:45/2 days, 13:25:26, loss=0.592349324671393, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=1.2806737801045962, lr=0.08844996116002071
2023-12-05 12:39:41   INFO  epoch: 1/72, acc_iter=7512, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:53, time_cost(all): 1:44:26/2 days, 13:52:23, loss=0.592290127245452, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=1.9303037892627033, lr=0.0894614189539099
2023-12-05 12:40:23   INFO  epoch: 1/72, acc_iter=7562, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:21, time_cost(all): 1:45:08/2 days, 12:42:55, loss=0.592230929819511, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.484533922817015, lr=0.09047287674779908
2023-12-05 12:41:05   INFO  epoch: 1/72, acc_iter=7612, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:36, time_cost(all): 1:45:50/2 days, 12:29:47, loss=0.59217173239357, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=3.4181585573569135, lr=0.09148433454168825
2023-12-05 12:41:47   INFO  epoch: 1/72, acc_iter=7662, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 1:46:32/2 days, 16:29:56, loss=0.592112534967629, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=2.199566932158883, lr=0.09249579233557742
2023-12-05 12:42:29   INFO  epoch: 1/72, acc_iter=7712, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1:47:14/2 days, 16:28:12, loss=0.592053337541688, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=4.200966078119744, lr=0.09350725012946659
2023-12-05 12:43:10   INFO  epoch: 2/72, acc_iter=7774, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:51, time_cost(all): 1:47:55/2 days, 17:38:13, loss=0.591979932733521, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=4.208187581434532, lr=0.09476145779388917
2023-12-05 12:43:52   INFO  epoch: 2/72, acc_iter=7824, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:48, time_cost(all): 1:48:37/2 days, 11:37:52, loss=0.59192073530758, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=0.7174557600387067, lr=0.09577291558777835
2023-12-05 12:44:34   INFO  epoch: 2/72, acc_iter=7874, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:10, time_cost(all): 1:49:19/2 days, 16:22:29, loss=0.591861537881639, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=1.6678258660293341, lr=0.09678437338166754
2023-12-05 12:45:16   INFO  epoch: 2/72, acc_iter=7924, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:10, time_cost(all): 1:50:01/2 days, 11:55:22, loss=0.591802340455698, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=4.388612720732829, lr=0.09779583117555671
2023-12-05 12:45:58   INFO  epoch: 2/72, acc_iter=7974, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:31, time_cost(all): 1:50:43/2 days, 16:33:37, loss=0.591743143029758, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=3.364489274134983, lr=0.09880728896944589
2023-12-05 12:46:39   INFO  epoch: 2/72, acc_iter=8024, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:27, time_cost(all): 1:51:24/2 days, 13:06:46, loss=0.591683945603817, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=2.756196811172157, lr=0.09981874676333506
2023-12-05 12:47:21   INFO  epoch: 2/72, acc_iter=8074, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:51:16, time_cost(all): 1:52:06/2 days, 16:35:07, loss=0.591624748177876, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=3.3278193788730803, lr=0.10083020455722425
2023-12-05 12:48:03   INFO  epoch: 2/72, acc_iter=8124, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:21, time_cost(all): 1:52:48/2 days, 15:41:49, loss=0.591565550751935, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=2.8055165694695203, lr=0.10184166235111342
2023-12-05 12:48:45   INFO  epoch: 2/72, acc_iter=8174, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:55, time_cost(all): 1:53:30/2 days, 17:25:43, loss=0.591506353325994, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=3.3386404191645136, lr=0.10285312014500259
2023-12-05 12:49:26   INFO  epoch: 2/72, acc_iter=8224, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:44:43, time_cost(all): 1:54:11/2 days, 17:13:22, loss=0.591447155900053, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.23(1.03), norm=1.0450866784354935, lr=0.10386457793889177
2023-12-05 12:50:08   INFO  epoch: 2/72, acc_iter=8274, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:25, time_cost(all): 1:54:53/2 days, 13:46:11, loss=0.591387958474112, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=3.6348263508403793, lr=0.10487603573278094
2023-12-05 12:50:50   INFO  epoch: 2/72, acc_iter=8324, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:07, time_cost(all): 1:55:35/2 days, 17:32:43, loss=0.591328761048171, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=1.3782388737426352, lr=0.10588749352667012
2023-12-05 12:51:32   INFO  epoch: 2/72, acc_iter=8374, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:44, time_cost(all): 1:56:17/2 days, 14:53:49, loss=0.59126956362223, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=1.5099125148240953, lr=0.1068989513205593
2023-12-05 12:52:14   INFO  epoch: 2/72, acc_iter=8424, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:16, time_cost(all): 1:56:59/2 days, 17:23:43, loss=0.591210366196289, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=2.3900405382210463, lr=0.10791040911444846
2023-12-05 12:52:55   INFO  epoch: 2/72, acc_iter=8474, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:04, time_cost(all): 1:57:40/2 days, 16:34:13, loss=0.591151168770348, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=1.5362233314521792, lr=0.10892186690833766
2023-12-05 12:53:37   INFO  epoch: 2/72, acc_iter=8524, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:13, time_cost(all): 1:58:22/2 days, 15:17:18, loss=0.591091971344407, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=1.810858378256404, lr=0.10993332470222683
2023-12-05 12:54:19   INFO  epoch: 2/72, acc_iter=8574, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:22, time_cost(all): 1:59:04/2 days, 13:20:39, loss=0.591032773918466, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=1.0651700130771768, lr=0.110944782496116
2023-12-05 12:55:01   INFO  epoch: 2/72, acc_iter=8624, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:43, time_cost(all): 1:59:46/2 days, 13:08:39, loss=0.590973576492525, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=3.467985464344295, lr=0.11195624029000519
2023-12-05 12:55:42   INFO  epoch: 2/72, acc_iter=8674, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:18, time_cost(all): 2:00:27/2 days, 15:45:30, loss=0.590914379066584, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=4.815518529279659, lr=0.11296769808389436
2023-12-05 12:56:24   INFO  epoch: 2/72, acc_iter=8724, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:30, time_cost(all): 2:01:09/2 days, 16:39:46, loss=0.590855181640643, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=2.4772027744579335, lr=0.11397915587778354
2023-12-05 12:57:06   INFO  epoch: 2/72, acc_iter=8774, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:19, time_cost(all): 2:01:51/2 days, 14:26:16, loss=0.590795984214702, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=3.0566561024691428, lr=0.11499061367167271
2023-12-05 12:57:48   INFO  epoch: 2/72, acc_iter=8824, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:40:09, time_cost(all): 2:02:33/2 days, 12:05:47, loss=0.590736786788762, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=2.523744342750555, lr=0.11600207146556189
2023-12-05 12:58:30   INFO  epoch: 2/72, acc_iter=8874, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:25, time_cost(all): 2:03:15/2 days, 11:26:06, loss=0.590677589362821, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=4.533035156647998, lr=0.11701352925945106
2023-12-05 12:59:11   INFO  epoch: 2/72, acc_iter=8924, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:06, time_cost(all): 2:03:56/2 days, 13:59:16, loss=0.59061839193688, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=2.779871435868739, lr=0.11802498705334023
2023-12-05 12:59:53   INFO  epoch: 2/72, acc_iter=8974, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:39, time_cost(all): 2:04:38/2 days, 16:39:03, loss=0.590559194510939, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.15(1.03), norm=0.605765621081602, lr=0.11903644484722942
2023-12-05 13:00:35   INFO  epoch: 2/72, acc_iter=9024, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:28, time_cost(all): 2:05:20/2 days, 11:47:56, loss=0.590499997084998, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=2.8855113066844424, lr=0.12004790264111859
2023-12-05 13:01:17   INFO  epoch: 2/72, acc_iter=9074, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:32, time_cost(all): 2:06:02/2 days, 11:20:15, loss=0.590440799659057, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=2.924957599072137, lr=0.12105936043500777
2023-12-05 13:01:58   INFO  epoch: 2/72, acc_iter=9124, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:53, time_cost(all): 2:06:43/2 days, 11:53:53, loss=0.590381602233116, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=3.1366597973461863, lr=0.12207081822889694
2023-12-05 13:02:40   INFO  epoch: 2/72, acc_iter=9174, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:27, time_cost(all): 2:07:25/2 days, 12:43:33, loss=0.590322404807175, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=0.659287686803588, lr=0.12308227602278612
2023-12-05 13:03:22   INFO  epoch: 2/72, acc_iter=9224, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:36, time_cost(all): 2:08:07/2 days, 14:07:05, loss=0.590263207381234, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=1.350021560749725, lr=0.12409373381667531
2023-12-05 13:04:04   INFO  epoch: 2/72, acc_iter=9274, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:21, time_cost(all): 2:08:49/2 days, 12:09:18, loss=0.590204009955293, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=2.8886624389193307, lr=0.1251051916105645
2023-12-05 13:04:46   INFO  epoch: 2/72, acc_iter=9324, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:26, time_cost(all): 2:09:31/2 days, 11:56:26, loss=0.590144812529352, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=4.299886798959414, lr=0.12611664940445366
2023-12-05 13:05:27   INFO  epoch: 2/72, acc_iter=9374, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:37, time_cost(all): 2:10:12/2 days, 16:06:09, loss=0.590085615103411, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=4.057212549963966, lr=0.12712810719834283
2023-12-05 13:06:09   INFO  epoch: 2/72, acc_iter=9424, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:51, time_cost(all): 2:10:54/2 days, 13:43:04, loss=0.59002641767747, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=1.565012466080536, lr=0.128139564992232
2023-12-05 13:06:51   INFO  epoch: 2/72, acc_iter=9474, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:44, time_cost(all): 2:11:36/2 days, 15:53:42, loss=0.589967220251529, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=1.4960417917161268, lr=0.12915102278612117
2023-12-05 13:07:33   INFO  epoch: 2/72, acc_iter=9524, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:25, time_cost(all): 2:12:18/2 days, 16:21:35, loss=0.589908022825588, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=1.1920892559275056, lr=0.13016248058001034
2023-12-05 13:08:14   INFO  epoch: 2/72, acc_iter=9574, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:54, time_cost(all): 2:12:59/2 days, 16:50:17, loss=0.589848825399647, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=1.8561375595582312, lr=0.1311739383738995
2023-12-05 13:08:56   INFO  epoch: 2/72, acc_iter=9624, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:39, time_cost(all): 2:13:41/2 days, 15:39:06, loss=0.589789627973706, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=0.7662667948395857, lr=0.1321853961677887
2023-12-05 13:09:38   INFO  epoch: 2/72, acc_iter=9674, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:22, time_cost(all): 2:14:23/2 days, 12:30:12, loss=0.589730430547766, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=2.4902258116505953, lr=0.13319685396167788
2023-12-05 13:10:20   INFO  epoch: 2/72, acc_iter=9724, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:44, time_cost(all): 2:15:05/2 days, 12:37:21, loss=0.589671233121825, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=3.8384194369080644, lr=0.13420831175556708
2023-12-05 13:11:02   INFO  epoch: 2/72, acc_iter=9774, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:43, time_cost(all): 2:15:47/2 days, 12:28:00, loss=0.589612035695884, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=0.5337186529143325, lr=0.13521976954945625
2023-12-05 13:11:43   INFO  epoch: 2/72, acc_iter=9824, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:14, time_cost(all): 2:16:28/2 days, 16:47:11, loss=0.589552838269943, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=1.708947051225501, lr=0.13623122734334542
2023-12-05 13:12:25   INFO  epoch: 2/72, acc_iter=9874, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:10, time_cost(all): 2:17:10/2 days, 16:01:16, loss=0.589493640844002, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.188834213381534, lr=0.1372426851372346
2023-12-05 13:13:07   INFO  epoch: 2/72, acc_iter=9924, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:40, time_cost(all): 2:17:52/2 days, 16:54:34, loss=0.589434443418061, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=4.050593488615806, lr=0.13825414293112376
2023-12-05 13:13:49   INFO  epoch: 2/72, acc_iter=9974, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:21, time_cost(all): 2:18:34/2 days, 14:43:16, loss=0.58937524599212, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=3.8331368576350697, lr=0.13926560072501296
2023-12-05 13:14:30   INFO  epoch: 2/72, acc_iter=10024, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:16, time_cost(all): 2:19:15/2 days, 12:38:57, loss=0.589316048566179, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=3.5381675974627678, lr=0.14027705851890213
2023-12-05 13:15:12   INFO  epoch: 2/72, acc_iter=10074, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:22:02, time_cost(all): 2:19:57/2 days, 15:47:22, loss=0.589256851140238, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=1.8879430486262265, lr=0.1412885163127913
2023-12-05 13:15:54   INFO  epoch: 2/72, acc_iter=10124, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:42, time_cost(all): 2:20:39/2 days, 14:14:41, loss=0.589197653714297, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=3.735030293560044, lr=0.1422999741066805
2023-12-05 13:16:36   INFO  epoch: 2/72, acc_iter=10174, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:19, time_cost(all): 2:21:21/2 days, 14:28:54, loss=0.589138456288356, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.83(1.03), norm=3.398748526511627, lr=0.14331143190056966
2023-12-05 13:17:18   INFO  epoch: 2/72, acc_iter=10224, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:51, time_cost(all): 2:22:03/2 days, 13:49:55, loss=0.589079258862415, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.05(1.03), norm=0.6748123943238743, lr=0.14432288969445883
2023-12-05 13:17:59   INFO  epoch: 2/72, acc_iter=10274, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:53, time_cost(all): 2:22:44/2 days, 13:32:48, loss=0.589020061436474, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=0.6896076273629224, lr=0.145334347488348
2023-12-05 13:18:41   INFO  epoch: 2/72, acc_iter=10324, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:18, time_cost(all): 2:23:26/2 days, 17:05:45, loss=0.588960864010533, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=4.795754047087009, lr=0.14634580528223717
2023-12-05 13:19:23   INFO  epoch: 2/72, acc_iter=10374, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:38, time_cost(all): 2:24:08/2 days, 16:19:38, loss=0.588901666584592, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.83(1.03), norm=3.4067455986307307, lr=0.14735726307612634
2023-12-05 13:20:05   INFO  epoch: 2/72, acc_iter=10424, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:38, time_cost(all): 2:24:50/2 days, 12:41:27, loss=0.588842469158651, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=3.2534328905986225, lr=0.1483687208700155
2023-12-05 13:20:47   INFO  epoch: 2/72, acc_iter=10474, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:00, time_cost(all): 2:25:32/2 days, 13:49:03, loss=0.58878327173271, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=2.204457700553877, lr=0.1493801786639047
2023-12-05 13:21:28   INFO  epoch: 2/72, acc_iter=10524, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:46, time_cost(all): 2:26:13/2 days, 14:37:42, loss=0.58872407430677, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=1.4958190325133258, lr=0.1503916364577939
2023-12-05 13:22:10   INFO  epoch: 2/72, acc_iter=10574, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:07, time_cost(all): 2:26:55/2 days, 12:07:26, loss=0.588664876880829, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=2.5493181382146046, lr=0.15140309425168308
2023-12-05 13:22:52   INFO  epoch: 2/72, acc_iter=10624, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:30, time_cost(all): 2:27:37/2 days, 12:17:49, loss=0.588605679454888, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=0.9702352582697111, lr=0.15241455204557225
2023-12-05 13:23:34   INFO  epoch: 2/72, acc_iter=10674, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:25, time_cost(all): 2:28:19/2 days, 17:09:23, loss=0.588546482028947, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=4.039290540136723, lr=0.15342600983946142
2023-12-05 13:24:15   INFO  epoch: 2/72, acc_iter=10724, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:53, time_cost(all): 2:29:00/2 days, 16:33:48, loss=0.588487284603006, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=3.3426906112275763, lr=0.1544374676333506
2023-12-05 13:24:57   INFO  epoch: 2/72, acc_iter=10774, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:38, time_cost(all): 2:29:42/2 days, 11:22:34, loss=0.588428087177065, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=3.2685257770364484, lr=0.15544892542723976
2023-12-05 13:25:39   INFO  epoch: 2/72, acc_iter=10824, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:18, time_cost(all): 2:30:24/2 days, 11:46:32, loss=0.588368889751124, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=1.1598114485373436, lr=0.15646038322112893
2023-12-05 13:26:21   INFO  epoch: 2/72, acc_iter=10874, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:19, time_cost(all): 2:31:06/2 days, 16:56:51, loss=0.588309692325183, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.0061247792774575, lr=0.15747184101501813
2023-12-05 13:27:03   INFO  epoch: 2/72, acc_iter=10924, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:54, time_cost(all): 2:31:48/2 days, 14:18:56, loss=0.588250494899242, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=1.7055932002014698, lr=0.1584832988089073
2023-12-05 13:27:44   INFO  epoch: 2/72, acc_iter=10974, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:39, time_cost(all): 2:32:29/2 days, 14:04:50, loss=0.588191297473301, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=3.1011017613857375, lr=0.15949475660279647
2023-12-05 13:28:26   INFO  epoch: 2/72, acc_iter=11024, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:46, time_cost(all): 2:33:11/2 days, 13:48:07, loss=0.58813210004736, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=3.5802443928560503, lr=0.16050621439668566
2023-12-05 13:29:08   INFO  epoch: 2/72, acc_iter=11074, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:05, time_cost(all): 2:33:53/2 days, 15:39:40, loss=0.588072902621419, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.3543614021305177, lr=0.16151767219057483
2023-12-05 13:29:50   INFO  epoch: 2/72, acc_iter=11124, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:17, time_cost(all): 2:34:35/2 days, 13:41:27, loss=0.588013705195478, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=4.638909057276348, lr=0.162529129984464
2023-12-05 13:30:31   INFO  epoch: 2/72, acc_iter=11174, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:33, time_cost(all): 2:35:16/2 days, 11:36:51, loss=0.587954507769537, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=0.6263762549892731, lr=0.16354058777835317
2023-12-05 13:31:13   INFO  epoch: 2/72, acc_iter=11224, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:08, time_cost(all): 2:35:58/2 days, 16:29:53, loss=0.587895310343596, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=3.178660129995517, lr=0.16455204557224234
2023-12-05 13:31:55   INFO  epoch: 2/72, acc_iter=11274, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:25, time_cost(all): 2:36:40/2 days, 15:34:23, loss=0.587836112917655, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=4.0265529261777875, lr=0.16556350336613154
2023-12-05 13:32:37   INFO  epoch: 2/72, acc_iter=11324, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:34, time_cost(all): 2:37:22/2 days, 14:26:14, loss=0.587776915491714, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.1923025622907497, lr=0.1665749611600207
2023-12-05 13:33:19   INFO  epoch: 2/72, acc_iter=11374, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:53, time_cost(all): 2:38:04/2 days, 13:41:31, loss=0.587717718065774, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=2.8554060364679636, lr=0.1675864189539099
2023-12-05 13:34:00   INFO  epoch: 2/72, acc_iter=11424, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:12, time_cost(all): 2:38:45/2 days, 13:05:15, loss=0.587658520639833, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=3.3366799611373823, lr=0.16859787674779908
2023-12-05 13:34:42   INFO  epoch: 2/72, acc_iter=11474, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:31, time_cost(all): 2:39:27/2 days, 15:04:13, loss=0.587599323213892, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=4.456481946780677, lr=0.16960933454168825
2023-12-05 13:35:24   INFO  epoch: 2/72, acc_iter=11524, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 2:40:09/2 days, 13:09:21, loss=0.587540125787951, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=0.5359069282169411, lr=0.17062079233557742
2023-12-05 13:36:06   INFO  epoch: 2/72, acc_iter=11574, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2:40:51/2 days, 11:42:32, loss=0.58748092836201, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=4.888092928970752, lr=0.1716322501294666
2023-12-05 13:36:47   INFO  epoch: 3/72, acc_iter=11636, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:12, time_cost(all): 2:41:32/2 days, 10:58:21, loss=0.587407523553843, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=4.958586359953023, lr=0.17288645779388917
2023-12-05 13:37:29   INFO  epoch: 3/72, acc_iter=11686, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:14, time_cost(all): 2:42:14/2 days, 12:39:41, loss=0.587348326127902, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=1.176943838792099, lr=0.17389791558777834
2023-12-05 13:38:11   INFO  epoch: 3/72, acc_iter=11736, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:58, time_cost(all): 2:42:56/2 days, 14:12:12, loss=0.587289128701961, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=4.602735646888363, lr=0.1749093733816675
2023-12-05 13:38:53   INFO  epoch: 3/72, acc_iter=11786, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:11, time_cost(all): 2:43:38/2 days, 11:46:47, loss=0.58722993127602, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=1.4468590972307935, lr=0.17592083117555668
2023-12-05 13:39:35   INFO  epoch: 3/72, acc_iter=11836, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:44, time_cost(all): 2:44:20/2 days, 15:44:48, loss=0.587170733850079, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=3.2179743139059327, lr=0.17693228896944585
2023-12-05 13:40:16   INFO  epoch: 3/72, acc_iter=11886, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:28, time_cost(all): 2:45:01/2 days, 13:11:21, loss=0.587111536424138, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=3.768897338840603, lr=0.17794374676333508
2023-12-05 13:40:58   INFO  epoch: 3/72, acc_iter=11936, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:50, time_cost(all): 2:45:43/2 days, 14:08:10, loss=0.587052338998197, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=4.860707077888877, lr=0.17895520455722425
2023-12-05 13:41:40   INFO  epoch: 3/72, acc_iter=11986, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:50:16, time_cost(all): 2:46:25/2 days, 11:08:37, loss=0.586993141572256, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=1.9782931762790312, lr=0.17996666235111342
2023-12-05 13:42:22   INFO  epoch: 3/72, acc_iter=12036, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:15, time_cost(all): 2:47:07/2 days, 12:07:18, loss=0.586933944146316, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=1.4035960142376211, lr=0.18097812014500259
2023-12-05 13:43:03   INFO  epoch: 3/72, acc_iter=12086, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:48, time_cost(all): 2:47:48/2 days, 12:07:12, loss=0.586874746720375, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=4.225260980085013, lr=0.18198957793889176
2023-12-05 13:43:45   INFO  epoch: 3/72, acc_iter=12136, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:06, time_cost(all): 2:48:30/2 days, 12:39:20, loss=0.586815549294434, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=4.841708263764416, lr=0.18300103573278093
2023-12-05 13:44:27   INFO  epoch: 3/72, acc_iter=12186, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:17, time_cost(all): 2:49:12/2 days, 11:14:39, loss=0.586756351868493, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=4.386585605872802, lr=0.1840124935266701
2023-12-05 13:45:09   INFO  epoch: 3/72, acc_iter=12236, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:46, time_cost(all): 2:49:54/2 days, 13:50:11, loss=0.586697154442552, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=4.3198729298601, lr=0.18502395132055927
2023-12-05 13:45:51   INFO  epoch: 3/72, acc_iter=12286, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:30, time_cost(all): 2:50:36/2 days, 13:15:41, loss=0.586637957016611, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=0.6985928793290722, lr=0.18603540911444844
2023-12-05 13:46:32   INFO  epoch: 3/72, acc_iter=12336, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:29, time_cost(all): 2:51:17/2 days, 12:52:56, loss=0.58657875959067, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=2.326904587613759, lr=0.18704686690833766
2023-12-05 13:47:14   INFO  epoch: 3/72, acc_iter=12386, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:59, time_cost(all): 2:51:59/2 days, 11:07:03, loss=0.586519562164729, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=2.468903698597857, lr=0.18805832470222683
2023-12-05 13:47:56   INFO  epoch: 3/72, acc_iter=12436, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:58, time_cost(all): 2:52:41/2 days, 12:07:21, loss=0.586460364738788, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.2824304677724898, lr=0.189069782496116
2023-12-05 13:48:38   INFO  epoch: 3/72, acc_iter=12486, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:22, time_cost(all): 2:53:23/2 days, 14:48:33, loss=0.586401167312847, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=4.398683058792645, lr=0.19008124029000517
2023-12-05 13:49:19   INFO  epoch: 3/72, acc_iter=12536, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:31, time_cost(all): 2:54:04/2 days, 12:57:05, loss=0.586341969886906, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=3.1640431873342805, lr=0.19109269808389434
2023-12-05 13:50:01   INFO  epoch: 3/72, acc_iter=12586, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:44, time_cost(all): 2:54:46/2 days, 15:47:26, loss=0.586282772460965, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=1.7536864032904345, lr=0.1921041558777835
2023-12-05 13:50:43   INFO  epoch: 3/72, acc_iter=12636, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:15, time_cost(all): 2:55:28/2 days, 13:43:45, loss=0.586223575035024, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=1.2055671420165526, lr=0.19311561367167274
2023-12-05 13:51:25   INFO  epoch: 3/72, acc_iter=12686, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:40:19, time_cost(all): 2:56:10/2 days, 11:07:49, loss=0.586164377609083, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=1.569931099503676, lr=0.1941270714655619
2023-12-05 13:52:07   INFO  epoch: 3/72, acc_iter=12736, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:40, time_cost(all): 2:56:52/2 days, 11:05:28, loss=0.586105180183142, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=4.6028382619700565, lr=0.19513852925945108
2023-12-05 13:52:48   INFO  epoch: 3/72, acc_iter=12786, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:06, time_cost(all): 2:57:33/2 days, 14:03:52, loss=0.586045982757201, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=3.768276932422138, lr=0.19614998705334025
2023-12-05 13:53:30   INFO  epoch: 3/72, acc_iter=12836, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:19, time_cost(all): 2:58:15/2 days, 13:03:00, loss=0.58598678533126, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=4.1450034461607155, lr=0.19716144484722942
2023-12-05 13:54:12   INFO  epoch: 3/72, acc_iter=12886, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:42, time_cost(all): 2:58:57/2 days, 14:17:13, loss=0.585927587905319, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=2.943895796808961, lr=0.1981729026411186
2023-12-05 13:54:54   INFO  epoch: 3/72, acc_iter=12936, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:17, time_cost(all): 2:59:39/2 days, 11:09:39, loss=0.585868390479379, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=1.0075275256574256, lr=0.19918436043500776
2023-12-05 13:55:36   INFO  epoch: 3/72, acc_iter=12986, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:44, time_cost(all): 3:00:21/2 days, 15:16:15, loss=0.585809193053438, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.2408066301323943, lr=0.20019581822889693
2023-12-05 13:56:17   INFO  epoch: 3/72, acc_iter=13036, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:35:15, time_cost(all): 3:01:02/2 days, 14:15:34, loss=0.585749995627497, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=1.2905951895093573, lr=0.2012072760227861
2023-12-05 13:56:59   INFO  epoch: 3/72, acc_iter=13086, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:03, time_cost(all): 3:01:44/2 days, 11:40:32, loss=0.585690798201556, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=3.31758835635221, lr=0.20221873381667527
2023-12-05 13:57:41   INFO  epoch: 3/72, acc_iter=13136, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:29, time_cost(all): 3:02:26/2 days, 14:22:12, loss=0.585631600775615, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=1.423721478319648, lr=0.2032301916105645
2023-12-05 13:58:23   INFO  epoch: 3/72, acc_iter=13186, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:32, time_cost(all): 3:03:08/2 days, 12:37:11, loss=0.585572403349674, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=4.746667985052278, lr=0.20424164940445366
2023-12-05 13:59:04   INFO  epoch: 3/72, acc_iter=13236, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:08, time_cost(all): 3:03:49/2 days, 12:09:00, loss=0.585513205923733, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=0.985583874128755, lr=0.20525310719834283
2023-12-05 13:59:46   INFO  epoch: 3/72, acc_iter=13286, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:06, time_cost(all): 3:04:31/2 days, 12:19:08, loss=0.585454008497792, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=3.5073577426933538, lr=0.206264564992232
2023-12-05 14:00:28   INFO  epoch: 3/72, acc_iter=13336, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:54, time_cost(all): 3:05:13/2 days, 15:01:30, loss=0.585394811071851, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=1.882566476489971, lr=0.20727602278612117
2023-12-05 14:01:10   INFO  epoch: 3/72, acc_iter=13386, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:00, time_cost(all): 3:05:55/2 days, 10:58:55, loss=0.58533561364591, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=2.9987103943998914, lr=0.20828748058001034
2023-12-05 14:01:52   INFO  epoch: 3/72, acc_iter=13436, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:53, time_cost(all): 3:06:37/2 days, 13:19:12, loss=0.585276416219969, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=3.1543229537407735, lr=0.2092989383738995
2023-12-05 14:02:33   INFO  epoch: 3/72, acc_iter=13486, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:32, time_cost(all): 3:07:18/2 days, 10:22:28, loss=0.585217218794028, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=1.1414638195914617, lr=0.21031039616778868
2023-12-05 14:03:15   INFO  epoch: 3/72, acc_iter=13536, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:25, time_cost(all): 3:08:00/2 days, 10:52:57, loss=0.585158021368087, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=1.3124702851164478, lr=0.21132185396167785
2023-12-05 14:03:57   INFO  epoch: 3/72, acc_iter=13586, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:53, time_cost(all): 3:08:42/2 days, 10:55:34, loss=0.585098823942146, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=4.849673737914788, lr=0.21233331175556702
2023-12-05 14:04:39   INFO  epoch: 3/72, acc_iter=13636, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:18, time_cost(all): 3:09:24/2 days, 12:18:00, loss=0.585039626516205, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=4.18384354192923, lr=0.21334476954945625
2023-12-05 14:05:20   INFO  epoch: 3/72, acc_iter=13686, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:08, time_cost(all): 3:10:05/2 days, 16:16:32, loss=0.584980429090264, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=2.368767384705299, lr=0.21435622734334542
2023-12-05 14:06:02   INFO  epoch: 3/72, acc_iter=13736, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:25, time_cost(all): 3:10:47/2 days, 14:41:05, loss=0.584921231664323, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=3.9684475747766954, lr=0.2153676851372346
2023-12-05 14:06:44   INFO  epoch: 3/72, acc_iter=13786, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:13, time_cost(all): 3:11:29/2 days, 13:32:38, loss=0.584862034238382, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=3.3449593605284056, lr=0.21637914293112376
2023-12-05 14:07:26   INFO  epoch: 3/72, acc_iter=13836, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:28, time_cost(all): 3:12:11/2 days, 12:35:36, loss=0.584802836812442, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=1.166819158732077, lr=0.21739060072501293
2023-12-05 14:08:08   INFO  epoch: 3/72, acc_iter=13886, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:47, time_cost(all): 3:12:53/2 days, 14:30:59, loss=0.584743639386501, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=1.6284855215020129, lr=0.2184020585189021
2023-12-05 14:08:49   INFO  epoch: 3/72, acc_iter=13936, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:29, time_cost(all): 3:13:34/2 days, 13:09:46, loss=0.58468444196056, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=3.7725313573506636, lr=0.21941351631279132
2023-12-05 14:09:31   INFO  epoch: 3/72, acc_iter=13986, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:13, time_cost(all): 3:14:16/2 days, 12:26:05, loss=0.584625244534619, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=2.9122837848013154, lr=0.2204249741066805
2023-12-05 14:10:13   INFO  epoch: 3/72, acc_iter=14036, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:08, time_cost(all): 3:14:58/2 days, 11:37:23, loss=0.584566047108678, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=4.95880554892857, lr=0.22143643190056966
2023-12-05 14:10:55   INFO  epoch: 3/72, acc_iter=14086, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:47, time_cost(all): 3:15:40/2 days, 11:21:01, loss=0.584506849682737, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=3.8197672201073027, lr=0.22244788969445883
2023-12-05 14:11:36   INFO  epoch: 3/72, acc_iter=14136, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:35, time_cost(all): 3:16:21/2 days, 11:57:44, loss=0.584447652256796, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=3.6982488207933915, lr=0.223459347488348
2023-12-05 14:12:18   INFO  epoch: 3/72, acc_iter=14186, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:34, time_cost(all): 3:17:03/2 days, 11:31:34, loss=0.584388454830855, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=3.5556826334543676, lr=0.22447080528223723
2023-12-05 14:13:00   INFO  epoch: 3/72, acc_iter=14236, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:33, time_cost(all): 3:17:45/2 days, 15:01:58, loss=0.584329257404914, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=2.1049999648996933, lr=0.2254822630761264
2023-12-05 14:13:42   INFO  epoch: 3/72, acc_iter=14286, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:37, time_cost(all): 3:18:27/2 days, 14:58:21, loss=0.584270059978973, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=3.427519042561997, lr=0.22649372087001557
2023-12-05 14:14:24   INFO  epoch: 3/72, acc_iter=14336, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:18, time_cost(all): 3:19:09/2 days, 12:09:05, loss=0.584210862553032, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=2.465228668628755, lr=0.22750517866390474
2023-12-05 14:15:05   INFO  epoch: 3/72, acc_iter=14386, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:11, time_cost(all): 3:19:50/2 days, 12:58:43, loss=0.584151665127091, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=3.9566925693452246, lr=0.2285166364577939
2023-12-05 14:15:47   INFO  epoch: 3/72, acc_iter=14436, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:48, time_cost(all): 3:20:32/2 days, 15:55:30, loss=0.58409246770115, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.070918745156383, lr=0.22952809425168308
2023-12-05 14:16:29   INFO  epoch: 3/72, acc_iter=14486, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:54, time_cost(all): 3:21:14/2 days, 11:34:16, loss=0.584033270275209, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=1.2311869833851905, lr=0.23053955204557225
2023-12-05 14:17:11   INFO  epoch: 3/72, acc_iter=14536, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:20, time_cost(all): 3:21:56/2 days, 13:08:58, loss=0.583974072849268, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=2.308967025501418, lr=0.23155100983946142
2023-12-05 14:17:52   INFO  epoch: 3/72, acc_iter=14586, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:08, time_cost(all): 3:22:37/2 days, 12:14:39, loss=0.583914875423327, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.11(1.03), norm=3.002004390997659, lr=0.2325624676333506
2023-12-05 14:18:34   INFO  epoch: 3/72, acc_iter=14636, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:45, time_cost(all): 3:23:19/2 days, 14:57:49, loss=0.583855677997386, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=0.7339213741857125, lr=0.23357392542723976
2023-12-05 14:19:16   INFO  epoch: 3/72, acc_iter=14686, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:22, time_cost(all): 3:24:01/2 days, 10:34:52, loss=0.583796480571446, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=0.6228046704658682, lr=0.23458538322112898
2023-12-05 14:19:58   INFO  epoch: 3/72, acc_iter=14736, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:04, time_cost(all): 3:24:43/2 days, 15:19:33, loss=0.583737283145505, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=2.2964402251210125, lr=0.23559684101501815
2023-12-05 14:20:40   INFO  epoch: 3/72, acc_iter=14786, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:13, time_cost(all): 3:25:25/2 days, 14:17:36, loss=0.583678085719564, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=1.312319968763204, lr=0.23660829880890732
2023-12-05 14:21:21   INFO  epoch: 3/72, acc_iter=14836, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:34, time_cost(all): 3:26:06/2 days, 14:28:33, loss=0.583618888293623, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=3.867683000458739, lr=0.2376197566027965
2023-12-05 14:22:03   INFO  epoch: 3/72, acc_iter=14886, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:53, time_cost(all): 3:26:48/2 days, 11:24:02, loss=0.583559690867682, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=2.6883375690503017, lr=0.23863121439668566
2023-12-05 14:22:45   INFO  epoch: 3/72, acc_iter=14936, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:22, time_cost(all): 3:27:30/2 days, 14:11:32, loss=0.583500493441741, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=3.3439918504219475, lr=0.23964267219057483
2023-12-05 14:23:27   INFO  epoch: 3/72, acc_iter=14986, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:38, time_cost(all): 3:28:12/2 days, 15:33:50, loss=0.5834412960158, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=0.916231827710419, lr=0.240654129984464
2023-12-05 14:24:08   INFO  epoch: 3/72, acc_iter=15036, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:39, time_cost(all): 3:28:53/2 days, 11:20:13, loss=0.583382098589859, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=4.454923286025538, lr=0.24166558777835317
2023-12-05 14:24:50   INFO  epoch: 3/72, acc_iter=15086, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:55, time_cost(all): 3:29:35/2 days, 12:01:42, loss=0.583322901163918, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=1.642708410506256, lr=0.24267704557224234
2023-12-05 14:25:32   INFO  epoch: 3/72, acc_iter=15136, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:28, time_cost(all): 3:30:17/2 days, 11:12:30, loss=0.583263703737977, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=1.8393040360240895, lr=0.24368850336613151
2023-12-05 14:26:14   INFO  epoch: 3/72, acc_iter=15186, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:38, time_cost(all): 3:30:59/2 days, 11:23:18, loss=0.583204506312036, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=3.6583156422816843, lr=0.24469996116002074
2023-12-05 14:26:56   INFO  epoch: 3/72, acc_iter=15236, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:54, time_cost(all): 3:31:41/2 days, 12:32:06, loss=0.583145308886095, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=3.3517699836724857, lr=0.2457114189539099
2023-12-05 14:27:37   INFO  epoch: 3/72, acc_iter=15286, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:16, time_cost(all): 3:32:22/2 days, 13:08:21, loss=0.583086111460154, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=1.0900091757904442, lr=0.24672287674779908
2023-12-05 14:28:19   INFO  epoch: 3/72, acc_iter=15336, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:33, time_cost(all): 3:33:04/2 days, 14:01:19, loss=0.583026914034213, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=1.474519050860175, lr=0.24773433454168825
2023-12-05 14:29:01   INFO  epoch: 3/72, acc_iter=15386, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 3:33:46/2 days, 9:58:23, loss=0.582967716608272, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=2.5831644416544823, lr=0.24874579233557742
2023-12-05 14:29:43   INFO  epoch: 3/72, acc_iter=15436, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 3:34:28/2 days, 13:57:22, loss=0.582908519182331, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.0378977593906091, lr=0.2497572501294666
2023-12-05 14:30:25   INFO  epoch: 4/72, acc_iter=15498, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:50:50, time_cost(all): 3:35:10/2 days, 11:33:01, loss=0.582835114374165, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=1.0131735056681284, lr=0.25101145779388917
2023-12-05 14:31:06   INFO  epoch: 4/72, acc_iter=15548, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:45, time_cost(all): 3:35:51/2 days, 11:20:45, loss=0.582775916948224, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=2.059343672395827, lr=0.2520229155877784
2023-12-05 14:31:48   INFO  epoch: 4/72, acc_iter=15598, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:07, time_cost(all): 3:36:33/2 days, 12:02:15, loss=0.582716719522283, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=4.215078301783404, lr=0.25303437338166757
2023-12-05 14:32:30   INFO  epoch: 4/72, acc_iter=15648, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:13, time_cost(all): 3:37:15/2 days, 15:37:11, loss=0.582657522096342, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.08(1.03), norm=4.873091994436061, lr=0.25404583117555674
2023-12-05 14:33:12   INFO  epoch: 4/72, acc_iter=15698, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:41, time_cost(all): 3:37:57/2 days, 13:48:03, loss=0.582598324670401, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=2.7284595487903407, lr=0.2550572889694459
2023-12-05 14:33:53   INFO  epoch: 4/72, acc_iter=15748, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:51, time_cost(all): 3:38:38/2 days, 11:24:33, loss=0.58253912724446, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.66132599719642, lr=0.2560687467633351
2023-12-05 14:34:35   INFO  epoch: 4/72, acc_iter=15798, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:40, time_cost(all): 3:39:20/2 days, 12:44:30, loss=0.582479929818519, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=1.2529439488425276, lr=0.25708020455722425
2023-12-05 14:35:17   INFO  epoch: 4/72, acc_iter=15848, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:45:49, time_cost(all): 3:40:02/2 days, 10:26:05, loss=0.582420732392578, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=1.2430873717319653, lr=0.2580916623511134
2023-12-05 14:35:59   INFO  epoch: 4/72, acc_iter=15898, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:59, time_cost(all): 3:40:44/2 days, 13:32:58, loss=0.582361534966637, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=1.1885933594528266, lr=0.2591031201450026
2023-12-05 14:36:41   INFO  epoch: 4/72, acc_iter=15948, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:22, time_cost(all): 3:41:26/2 days, 13:42:41, loss=0.582302337540696, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=4.093394532125271, lr=0.26011457793889176
2023-12-05 14:37:22   INFO  epoch: 4/72, acc_iter=15998, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:55, time_cost(all): 3:42:07/2 days, 10:30:38, loss=0.582243140114755, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=3.26817033501515, lr=0.2611260357327809
2023-12-05 14:38:04   INFO  epoch: 4/72, acc_iter=16048, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:13, time_cost(all): 3:42:49/2 days, 11:39:05, loss=0.582183942688814, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=4.497680042212032, lr=0.26213749352667015
2023-12-05 14:38:46   INFO  epoch: 4/72, acc_iter=16098, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:44, time_cost(all): 3:43:31/2 days, 14:51:28, loss=0.582124745262873, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=1.391251305595231, lr=0.2631489513205593
2023-12-05 14:39:28   INFO  epoch: 4/72, acc_iter=16148, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:59, time_cost(all): 3:44:13/2 days, 12:01:47, loss=0.582065547836932, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=1.2083214075306872, lr=0.2641604091144485
2023-12-05 14:40:09   INFO  epoch: 4/72, acc_iter=16198, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:00, time_cost(all): 3:44:54/2 days, 14:06:13, loss=0.582006350410991, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=3.6334816765389726, lr=0.26517186690833766
2023-12-05 14:40:51   INFO  epoch: 4/72, acc_iter=16248, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:16, time_cost(all): 3:45:36/2 days, 11:15:57, loss=0.581947152985051, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=2.818909582355522, lr=0.26618332470222683
2023-12-05 14:41:33   INFO  epoch: 4/72, acc_iter=16298, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:46, time_cost(all): 3:46:18/2 days, 9:45:40, loss=0.58188795555911, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=0.9195206440409196, lr=0.267194782496116
2023-12-05 14:42:15   INFO  epoch: 4/72, acc_iter=16348, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:56, time_cost(all): 3:47:00/2 days, 15:40:12, loss=0.581828758133169, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=1.6427804271605684, lr=0.26820624029000517
2023-12-05 14:42:57   INFO  epoch: 4/72, acc_iter=16398, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:38:47, time_cost(all): 3:47:42/2 days, 11:20:14, loss=0.581769560707228, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=1.1508723578714375, lr=0.26921769808389434
2023-12-05 14:43:38   INFO  epoch: 4/72, acc_iter=16448, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:05, time_cost(all): 3:48:23/2 days, 13:02:13, loss=0.581710363281287, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=1.6540847648776955, lr=0.2702291558777835
2023-12-05 14:44:20   INFO  epoch: 4/72, acc_iter=16498, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:42, time_cost(all): 3:49:05/2 days, 14:19:38, loss=0.581651165855346, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=2.80939402112862, lr=0.2712406136716727
2023-12-05 14:45:02   INFO  epoch: 4/72, acc_iter=16548, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:40, time_cost(all): 3:49:47/2 days, 9:53:28, loss=0.581591968429405, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=3.9585753093377605, lr=0.2722520714655619
2023-12-05 14:45:44   INFO  epoch: 4/72, acc_iter=16598, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:23, time_cost(all): 3:50:29/2 days, 14:39:56, loss=0.581532771003464, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=3.0147963490405574, lr=0.2732635292594511
2023-12-05 14:46:25   INFO  epoch: 4/72, acc_iter=16648, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:51, time_cost(all): 3:51:10/2 days, 14:21:38, loss=0.581473573577523, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=1.807107608944989, lr=0.27427498705334025
2023-12-05 14:47:07   INFO  epoch: 4/72, acc_iter=16698, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:50, time_cost(all): 3:51:52/2 days, 10:26:02, loss=0.581414376151582, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=3.1971558832265465, lr=0.2752864448472294
2023-12-05 14:47:49   INFO  epoch: 4/72, acc_iter=16748, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:37:05, time_cost(all): 3:52:34/2 days, 15:14:23, loss=0.581355178725641, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=1.3747516183550956, lr=0.2762979026411186
2023-12-05 14:48:31   INFO  epoch: 4/72, acc_iter=16798, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:09, time_cost(all): 3:53:16/2 days, 13:45:38, loss=0.5812959812997, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=2.880924887079052, lr=0.2773093604350078
2023-12-05 14:49:13   INFO  epoch: 4/72, acc_iter=16848, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:35, time_cost(all): 3:53:58/2 days, 11:54:12, loss=0.581236783873759, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=2.8140571529144482, lr=0.278320818228897
2023-12-05 14:49:54   INFO  epoch: 4/72, acc_iter=16898, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:07, time_cost(all): 3:54:39/2 days, 13:07:32, loss=0.581177586447818, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=0.6093630849875029, lr=0.27933227602278615
2023-12-05 14:50:36   INFO  epoch: 4/72, acc_iter=16948, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:22, time_cost(all): 3:55:21/2 days, 13:38:57, loss=0.581118389021877, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=1.9981995965307988, lr=0.2803437338166753
2023-12-05 14:51:18   INFO  epoch: 4/72, acc_iter=16998, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:52, time_cost(all): 3:56:03/2 days, 13:32:10, loss=0.581059191595936, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=2.3149819963939655, lr=0.2813551916105645
2023-12-05 14:52:00   INFO  epoch: 4/72, acc_iter=17048, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:42, time_cost(all): 3:56:45/2 days, 9:35:08, loss=0.580999994169995, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=1.8477738671013482, lr=0.28236664940445366
2023-12-05 14:52:41   INFO  epoch: 4/72, acc_iter=17098, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:56, time_cost(all): 3:57:26/2 days, 10:32:55, loss=0.580940796744055, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=0.5403175074783417, lr=0.28337810719834283
2023-12-05 14:53:23   INFO  epoch: 4/72, acc_iter=17148, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:22, time_cost(all): 3:58:08/2 days, 9:34:44, loss=0.580881599318114, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=2.7522569690589744, lr=0.284389564992232
2023-12-05 14:54:05   INFO  epoch: 4/72, acc_iter=17198, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:49, time_cost(all): 3:58:50/2 days, 14:01:57, loss=0.580822401892173, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=1.2982428520779044, lr=0.2854010227861212
2023-12-05 14:54:47   INFO  epoch: 4/72, acc_iter=17248, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:24, time_cost(all): 3:59:32/2 days, 13:33:34, loss=0.580763204466232, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=2.9709025965657583, lr=0.28641248058001034
2023-12-05 14:55:29   INFO  epoch: 4/72, acc_iter=17298, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:12, time_cost(all): 4:00:14/2 days, 15:05:26, loss=0.580704007040291, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=3.53881747155905, lr=0.28742393837389957
2023-12-05 14:56:10   INFO  epoch: 4/72, acc_iter=17348, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:39, time_cost(all): 4:00:55/2 days, 14:47:57, loss=0.58064480961435, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=1.9394714207821986, lr=0.28843539616778874
2023-12-05 14:56:52   INFO  epoch: 4/72, acc_iter=17398, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:21, time_cost(all): 4:01:37/2 days, 15:30:28, loss=0.580585612188409, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=4.660827448035779, lr=0.2894468539616779
2023-12-05 14:57:34   INFO  epoch: 4/72, acc_iter=17448, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:18, time_cost(all): 4:02:19/2 days, 14:42:26, loss=0.580526414762468, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=1.593345631192942, lr=0.2904583117555671
2023-12-05 14:58:16   INFO  epoch: 4/72, acc_iter=17498, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:14, time_cost(all): 4:03:01/2 days, 10:25:05, loss=0.580467217336527, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=1.670944446649158, lr=0.29146976954945625
2023-12-05 14:58:57   INFO  epoch: 4/72, acc_iter=17548, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:15, time_cost(all): 4:03:42/2 days, 12:06:51, loss=0.580408019910586, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=1.7788086376363526, lr=0.2924812273433454
2023-12-05 14:59:39   INFO  epoch: 4/72, acc_iter=17598, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:23, time_cost(all): 4:04:24/2 days, 10:25:37, loss=0.580348822484645, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=4.1191484989066645, lr=0.2934926851372346
2023-12-05 15:00:21   INFO  epoch: 4/72, acc_iter=17648, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:17, time_cost(all): 4:05:06/2 days, 9:36:45, loss=0.580289625058704, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=0.6765572929571353, lr=0.29450414293112376
2023-12-05 15:01:03   INFO  epoch: 4/72, acc_iter=17698, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:27, time_cost(all): 4:05:48/2 days, 9:47:51, loss=0.580230427632763, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=2.980905461224607, lr=0.29551560072501293
2023-12-05 15:01:45   INFO  epoch: 4/72, acc_iter=17748, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:29, time_cost(all): 4:06:30/2 days, 12:05:26, loss=0.580171230206822, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=3.7514027957128957, lr=0.2965270585189021
2023-12-05 15:02:26   INFO  epoch: 4/72, acc_iter=17798, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:57, time_cost(all): 4:07:11/2 days, 11:12:48, loss=0.580112032780881, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=0.7459723369430747, lr=0.2975385163127913
2023-12-05 15:03:08   INFO  epoch: 4/72, acc_iter=17848, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:11, time_cost(all): 4:07:53/2 days, 14:39:57, loss=0.58005283535494, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=1.1289032739458318, lr=0.2985499741066805
2023-12-05 15:03:50   INFO  epoch: 4/72, acc_iter=17898, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:48, time_cost(all): 4:08:35/2 days, 12:21:29, loss=0.579993637928999, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=4.460156894130124, lr=0.29956143190056966
2023-12-05 15:04:32   INFO  epoch: 4/72, acc_iter=17948, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:04, time_cost(all): 4:09:17/2 days, 9:48:59, loss=0.579934440503059, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=3.693599089744233, lr=0.30057288969445883
2023-12-05 15:05:14   INFO  epoch: 4/72, acc_iter=17998, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:59, time_cost(all): 4:09:59/2 days, 12:20:46, loss=0.579875243077118, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=1.3035767640436633, lr=0.301584347488348
2023-12-05 15:05:55   INFO  epoch: 4/72, acc_iter=18048, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:07, time_cost(all): 4:10:40/2 days, 11:44:46, loss=0.579816045651177, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=0.916716559991148, lr=0.3025958052822372
2023-12-05 15:06:37   INFO  epoch: 4/72, acc_iter=18098, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:56, time_cost(all): 4:11:22/2 days, 11:42:11, loss=0.579756848225236, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=3.3104663591119845, lr=0.30360726307612634
2023-12-05 15:07:19   INFO  epoch: 4/72, acc_iter=18148, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:49, time_cost(all): 4:12:04/2 days, 13:15:01, loss=0.579697650799295, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=1.914596266034492, lr=0.3046187208700155
2023-12-05 15:08:01   INFO  epoch: 4/72, acc_iter=18198, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:53, time_cost(all): 4:12:46/2 days, 12:27:22, loss=0.579638453373354, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=1.3846846357555893, lr=0.3056301786639047
2023-12-05 15:08:42   INFO  epoch: 4/72, acc_iter=18248, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:54, time_cost(all): 4:13:27/2 days, 9:52:43, loss=0.579579255947413, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=4.302957155042569, lr=0.30664163645779385
2023-12-05 15:09:24   INFO  epoch: 4/72, acc_iter=18298, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:31, time_cost(all): 4:14:09/2 days, 11:13:27, loss=0.579520058521472, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=4.191836147910052, lr=0.307653094251683
2023-12-05 15:10:06   INFO  epoch: 4/72, acc_iter=18348, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:37, time_cost(all): 4:14:51/2 days, 11:44:57, loss=0.579460861095531, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=3.6955694184678904, lr=0.3086645520455722
2023-12-05 15:10:48   INFO  epoch: 4/72, acc_iter=18398, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:25, time_cost(all): 4:15:33/2 days, 10:55:51, loss=0.57940166366959, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.057932367673186, lr=0.3096760098394614
2023-12-05 15:11:30   INFO  epoch: 4/72, acc_iter=18448, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:13, time_cost(all): 4:16:15/2 days, 12:18:53, loss=0.579342466243649, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=1.0054763124362447, lr=0.3106874676333506
2023-12-05 15:12:11   INFO  epoch: 4/72, acc_iter=18498, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:23, time_cost(all): 4:16:56/2 days, 11:54:21, loss=0.579283268817708, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=1.415021641760747, lr=0.31169892542723976
2023-12-05 15:12:53   INFO  epoch: 4/72, acc_iter=18548, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:57, time_cost(all): 4:17:38/2 days, 14:21:47, loss=0.579224071391767, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=2.7894966453560897, lr=0.31271038322112893
2023-12-05 15:13:35   INFO  epoch: 4/72, acc_iter=18598, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:44, time_cost(all): 4:18:20/2 days, 13:46:22, loss=0.579164873965826, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=2.561164080018286, lr=0.3137218410150181
2023-12-05 15:14:17   INFO  epoch: 4/72, acc_iter=18648, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:10, time_cost(all): 4:19:02/2 days, 14:13:14, loss=0.579105676539885, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=3.2994799444106016, lr=0.31473329880890727
2023-12-05 15:14:58   INFO  epoch: 4/72, acc_iter=18698, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:17, time_cost(all): 4:19:43/2 days, 10:14:06, loss=0.579046479113944, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=3.4254795652902996, lr=0.31574475660279644
2023-12-05 15:15:40   INFO  epoch: 4/72, acc_iter=18748, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:06, time_cost(all): 4:20:25/2 days, 12:54:47, loss=0.578987281688004, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=1.5847220169874314, lr=0.3167562143966856
2023-12-05 15:16:22   INFO  epoch: 4/72, acc_iter=18798, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:16, time_cost(all): 4:21:07/2 days, 10:26:37, loss=0.578928084262063, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=3.1286633277639875, lr=0.3177676721905748
2023-12-05 15:17:04   INFO  epoch: 4/72, acc_iter=18848, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:26, time_cost(all): 4:21:49/2 days, 12:58:08, loss=0.578868886836122, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=4.855938327826669, lr=0.31877912998446395
2023-12-05 15:17:46   INFO  epoch: 4/72, acc_iter=18898, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:43, time_cost(all): 4:22:31/2 days, 11:43:43, loss=0.578809689410181, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=4.6320038358568585, lr=0.3197905877783532
2023-12-05 15:18:27   INFO  epoch: 4/72, acc_iter=18948, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:02, time_cost(all): 4:23:12/2 days, 14:47:48, loss=0.57875049198424, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=3.786379901473804, lr=0.3208020455722424
2023-12-05 15:19:09   INFO  epoch: 4/72, acc_iter=18998, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:09, time_cost(all): 4:23:54/2 days, 12:39:59, loss=0.578691294558299, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=4.812275636084766, lr=0.32181350336613157
2023-12-05 15:19:51   INFO  epoch: 4/72, acc_iter=19048, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:28, time_cost(all): 4:24:36/2 days, 15:00:54, loss=0.578632097132358, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=1.0228934500749325, lr=0.32282496116002074
2023-12-05 15:20:33   INFO  epoch: 4/72, acc_iter=19098, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:03, time_cost(all): 4:25:18/2 days, 9:52:37, loss=0.578572899706417, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=2.4295297079640763, lr=0.3238364189539099
2023-12-05 15:21:14   INFO  epoch: 4/72, acc_iter=19148, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:09, time_cost(all): 4:25:59/2 days, 12:36:32, loss=0.578513702280476, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=4.00865499549869, lr=0.3248478767477991
2023-12-05 15:21:56   INFO  epoch: 4/72, acc_iter=19198, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:37, time_cost(all): 4:26:41/2 days, 11:34:38, loss=0.578454504854535, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=4.844062079416103, lr=0.32585933454168825
2023-12-05 15:22:38   INFO  epoch: 4/72, acc_iter=19248, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 4:27:23/2 days, 13:57:34, loss=0.578395307428594, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=0.8045015788489955, lr=0.3268707923355774
2023-12-05 15:23:20   INFO  epoch: 4/72, acc_iter=19298, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 4:28:05/2 days, 11:55:26, loss=0.578336110002653, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=1.465454099469421, lr=0.3278822501294666
2023-12-05 15:24:02   INFO  epoch: 5/72, acc_iter=19360, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:50:55, time_cost(all): 4:28:47/2 days, 13:47:05, loss=0.578262705194486, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=2.133336336272637, lr=0.32913645779388917
2023-12-05 15:24:43   INFO  epoch: 5/72, acc_iter=19410, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:22, time_cost(all): 4:29:28/2 days, 9:13:15, loss=0.578203507768545, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=1.8702478058981178, lr=0.33014791558777834
2023-12-05 15:25:25   INFO  epoch: 5/72, acc_iter=19460, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:23, time_cost(all): 4:30:10/2 days, 11:08:20, loss=0.578144310342604, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=4.927277630106343, lr=0.3311593733816675
2023-12-05 15:26:07   INFO  epoch: 5/72, acc_iter=19510, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:03, time_cost(all): 4:30:52/2 days, 14:39:09, loss=0.578085112916664, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=3.855568127211585, lr=0.33217083117555674
2023-12-05 15:26:49   INFO  epoch: 5/72, acc_iter=19560, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:50, time_cost(all): 4:31:34/2 days, 13:55:34, loss=0.578025915490723, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=1.6031367308740267, lr=0.3331822889694459
2023-12-05 15:27:30   INFO  epoch: 5/72, acc_iter=19610, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:42, time_cost(all): 4:32:15/2 days, 9:41:05, loss=0.577966718064782, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=3.147267900780559, lr=0.3341937467633351
2023-12-05 15:28:12   INFO  epoch: 5/72, acc_iter=19660, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:05, time_cost(all): 4:32:57/2 days, 13:49:19, loss=0.577907520638841, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=0.8053644333294652, lr=0.33520520455722425
2023-12-05 15:28:54   INFO  epoch: 5/72, acc_iter=19710, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:37, time_cost(all): 4:33:39/2 days, 13:04:55, loss=0.5778483232129, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=0.8904131328935978, lr=0.3362166623511134
2023-12-05 15:29:36   INFO  epoch: 5/72, acc_iter=19760, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:18, time_cost(all): 4:34:21/2 days, 9:39:02, loss=0.577789125786959, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=2.0630298692961317, lr=0.3372281201450026
2023-12-05 15:30:18   INFO  epoch: 5/72, acc_iter=19810, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:58, time_cost(all): 4:35:03/2 days, 12:16:06, loss=0.577729928361018, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=1.3291124125402891, lr=0.33823957793889176
2023-12-05 15:30:59   INFO  epoch: 5/72, acc_iter=19860, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:00, time_cost(all): 4:35:44/2 days, 10:17:15, loss=0.577670730935077, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=3.980169031777364, lr=0.3392510357327809
2023-12-05 15:31:41   INFO  epoch: 5/72, acc_iter=19910, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:38, time_cost(all): 4:36:26/2 days, 12:19:41, loss=0.577611533509136, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=1.7195848948752062, lr=0.3402624935266701
2023-12-05 15:32:23   INFO  epoch: 5/72, acc_iter=19960, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:59, time_cost(all): 4:37:08/2 days, 10:41:09, loss=0.577552336083195, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=2.208804854948305, lr=0.34127395132055927
2023-12-05 15:33:05   INFO  epoch: 5/72, acc_iter=20010, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:38, time_cost(all): 4:37:50/2 days, 9:23:15, loss=0.577493138657254, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=3.199671957412333, lr=0.3422854091144485
2023-12-05 15:33:46   INFO  epoch: 5/72, acc_iter=20060, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:35, time_cost(all): 4:38:31/2 days, 9:26:30, loss=0.577433941231313, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=3.7933233876616113, lr=0.34329686690833766
2023-12-05 15:34:28   INFO  epoch: 5/72, acc_iter=20110, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:57, time_cost(all): 4:39:13/2 days, 13:57:31, loss=0.577374743805372, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=4.778301305938968, lr=0.34430832470222683
2023-12-05 15:35:10   INFO  epoch: 5/72, acc_iter=20160, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:21, time_cost(all): 4:39:55/2 days, 10:12:51, loss=0.577315546379431, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=4.2177946704540075, lr=0.345319782496116
2023-12-05 15:35:52   INFO  epoch: 5/72, acc_iter=20210, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:17, time_cost(all): 4:40:37/2 days, 14:06:53, loss=0.57725634895349, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=1.6936177437935767, lr=0.34633124029000517
2023-12-05 15:36:34   INFO  epoch: 5/72, acc_iter=20260, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:11, time_cost(all): 4:41:19/2 days, 11:02:33, loss=0.577197151527549, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=2.181087457431795, lr=0.34734269808389434
2023-12-05 15:37:15   INFO  epoch: 5/72, acc_iter=20310, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:27, time_cost(all): 4:42:00/2 days, 12:15:36, loss=0.577137954101609, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=1.8580062338462044, lr=0.3483541558777835
2023-12-05 15:37:57   INFO  epoch: 5/72, acc_iter=20360, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:56, time_cost(all): 4:42:42/2 days, 10:27:09, loss=0.577078756675668, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=4.9805393874267025, lr=0.3493656136716727
2023-12-05 15:38:39   INFO  epoch: 5/72, acc_iter=20410, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:23, time_cost(all): 4:43:24/2 days, 11:53:57, loss=0.577019559249727, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=3.5762545089114885, lr=0.35037707146556185
2023-12-05 15:39:21   INFO  epoch: 5/72, acc_iter=20460, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:42, time_cost(all): 4:44:06/2 days, 11:24:46, loss=0.576960361823786, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=4.417391488647219, lr=0.351388529259451
2023-12-05 15:40:02   INFO  epoch: 5/72, acc_iter=20510, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:15, time_cost(all): 4:44:47/2 days, 13:43:51, loss=0.576901164397845, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=2.783951401107631, lr=0.35239998705334025
2023-12-05 15:40:44   INFO  epoch: 5/72, acc_iter=20560, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:55, time_cost(all): 4:45:29/2 days, 9:40:50, loss=0.576841966971904, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=1.9897288309116796, lr=0.3534114448472294
2023-12-05 15:41:26   INFO  epoch: 5/72, acc_iter=20610, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:36, time_cost(all): 4:46:11/2 days, 14:37:39, loss=0.576782769545963, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=4.788677184062932, lr=0.3544229026411186
2023-12-05 15:42:08   INFO  epoch: 5/72, acc_iter=20660, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:20, time_cost(all): 4:46:53/2 days, 10:45:21, loss=0.576723572120022, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=2.115118494629187, lr=0.35543436043500776
2023-12-05 15:42:50   INFO  epoch: 5/72, acc_iter=20710, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:19, time_cost(all): 4:47:35/2 days, 8:50:00, loss=0.576664374694081, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=3.868467443775633, lr=0.3564458182288969
2023-12-05 15:43:31   INFO  epoch: 5/72, acc_iter=20760, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:14, time_cost(all): 4:48:16/2 days, 10:37:42, loss=0.57660517726814, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=3.85678179698214, lr=0.3574572760227861
2023-12-05 15:44:13   INFO  epoch: 5/72, acc_iter=20810, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:41, time_cost(all): 4:48:58/2 days, 12:17:03, loss=0.576545979842199, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=2.697883107332555, lr=0.35846873381667527
2023-12-05 15:44:55   INFO  epoch: 5/72, acc_iter=20860, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:00, time_cost(all): 4:49:40/2 days, 11:14:22, loss=0.576486782416258, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=0.6999641970218081, lr=0.35948019161056444
2023-12-05 15:45:37   INFO  epoch: 5/72, acc_iter=20910, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:55, time_cost(all): 4:50:22/2 days, 11:35:09, loss=0.576427584990317, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=1.3025640841255217, lr=0.3604916494044536
2023-12-05 15:46:19   INFO  epoch: 5/72, acc_iter=20960, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:23, time_cost(all): 4:51:04/2 days, 10:54:59, loss=0.576368387564376, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=2.508443643097576, lr=0.3615031071983428
2023-12-05 15:47:00   INFO  epoch: 5/72, acc_iter=21010, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:06, time_cost(all): 4:51:45/2 days, 10:00:52, loss=0.576309190138435, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=4.754524417284755, lr=0.362514564992232
2023-12-05 15:47:42   INFO  epoch: 5/72, acc_iter=21060, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:31, time_cost(all): 4:52:27/2 days, 10:34:02, loss=0.576249992712494, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=1.7171603304023937, lr=0.3635260227861212
2023-12-05 15:48:24   INFO  epoch: 5/72, acc_iter=21110, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:58, time_cost(all): 4:53:09/2 days, 11:51:43, loss=0.576190795286553, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.325134183024225, lr=0.36453748058001034
2023-12-05 15:49:06   INFO  epoch: 5/72, acc_iter=21160, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:57, time_cost(all): 4:53:51/2 days, 10:12:52, loss=0.576131597860613, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=0.7598475068376846, lr=0.3655489383738995
2023-12-05 15:49:47   INFO  epoch: 5/72, acc_iter=21210, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:31, time_cost(all): 4:54:32/2 days, 14:08:10, loss=0.576072400434672, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=0.5554535305219773, lr=0.3665603961677887
2023-12-05 15:50:29   INFO  epoch: 5/72, acc_iter=21260, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:52, time_cost(all): 4:55:14/2 days, 10:35:56, loss=0.576013203008731, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=2.2992032887539677, lr=0.36757185396167785
2023-12-05 15:51:11   INFO  epoch: 5/72, acc_iter=21310, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:48, time_cost(all): 4:55:56/2 days, 13:30:46, loss=0.57595400558279, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=4.365724212011108, lr=0.368583311755567
2023-12-05 15:51:53   INFO  epoch: 5/72, acc_iter=21360, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:13, time_cost(all): 4:56:38/2 days, 11:01:11, loss=0.575894808156849, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=1.5950096712297652, lr=0.3695947695494562
2023-12-05 15:52:35   INFO  epoch: 5/72, acc_iter=21410, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:32, time_cost(all): 4:57:20/2 days, 8:57:22, loss=0.575835610730908, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=0.913024538359738, lr=0.37060622734334536
2023-12-05 15:53:16   INFO  epoch: 5/72, acc_iter=21460, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:45, time_cost(all): 4:58:01/2 days, 8:40:51, loss=0.575776413304967, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.009347421944987, lr=0.37161768513723453
2023-12-05 15:53:58   INFO  epoch: 5/72, acc_iter=21510, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:53, time_cost(all): 4:58:43/2 days, 9:45:28, loss=0.575717215879026, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=3.851569404563317, lr=0.37262914293112376
2023-12-05 15:54:40   INFO  epoch: 5/72, acc_iter=21560, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:02, time_cost(all): 4:59:25/2 days, 9:32:50, loss=0.575658018453085, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=1.7717317745710621, lr=0.37364060072501293
2023-12-05 15:55:22   INFO  epoch: 5/72, acc_iter=21610, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:40, time_cost(all): 5:00:07/2 days, 10:25:54, loss=0.575598821027144, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=1.1759852767839918, lr=0.3746520585189021
2023-12-05 15:56:03   INFO  epoch: 5/72, acc_iter=21660, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:54, time_cost(all): 5:00:48/2 days, 9:25:38, loss=0.575539623601203, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.6693557711244984, lr=0.37566351631279127
2023-12-05 15:56:45   INFO  epoch: 5/72, acc_iter=21710, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:09, time_cost(all): 5:01:30/2 days, 10:44:04, loss=0.575480426175262, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=2.6064085393651215, lr=0.37667497410668044
2023-12-05 15:57:27   INFO  epoch: 5/72, acc_iter=21760, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:14, time_cost(all): 5:02:12/2 days, 13:25:29, loss=0.575421228749321, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=2.3903070141867104, lr=0.3776864319005696
2023-12-05 15:58:09   INFO  epoch: 5/72, acc_iter=21810, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:18, time_cost(all): 5:02:54/2 days, 13:46:26, loss=0.57536203132338, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=2.8649787075829116, lr=0.37869788969445883
2023-12-05 15:58:51   INFO  epoch: 5/72, acc_iter=21860, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:04, time_cost(all): 5:03:36/2 days, 13:21:51, loss=0.575302833897439, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.3887712991999015, lr=0.379709347488348
2023-12-05 15:59:32   INFO  epoch: 5/72, acc_iter=21910, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:08, time_cost(all): 5:04:17/2 days, 10:57:09, loss=0.575243636471498, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=1.3192064925513236, lr=0.3807208052822372
2023-12-05 16:00:14   INFO  epoch: 5/72, acc_iter=21960, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:04, time_cost(all): 5:04:59/2 days, 10:42:08, loss=0.575184439045557, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=1.9794330819676624, lr=0.38173226307612634
2023-12-05 16:00:56   INFO  epoch: 5/72, acc_iter=22010, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:41, time_cost(all): 5:05:41/2 days, 12:15:24, loss=0.575125241619617, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=1.1155979992939289, lr=0.3827437208700155
2023-12-05 16:01:38   INFO  epoch: 5/72, acc_iter=22060, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:39, time_cost(all): 5:06:23/2 days, 9:25:21, loss=0.575066044193676, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=4.383820472161794, lr=0.38375517866390474
2023-12-05 16:02:19   INFO  epoch: 5/72, acc_iter=22110, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:49, time_cost(all): 5:07:04/2 days, 12:57:39, loss=0.575006846767735, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=4.015399217394646, lr=0.3847666364577939
2023-12-05 16:03:01   INFO  epoch: 5/72, acc_iter=22160, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:37, time_cost(all): 5:07:46/2 days, 12:41:04, loss=0.574947649341794, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=1.0191241907551085, lr=0.3857780942516831
2023-12-05 16:03:43   INFO  epoch: 5/72, acc_iter=22210, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:06, time_cost(all): 5:08:28/2 days, 9:35:59, loss=0.574888451915853, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.0661435045901433, lr=0.38678955204557225
2023-12-05 16:04:25   INFO  epoch: 5/72, acc_iter=22260, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:52, time_cost(all): 5:09:10/2 days, 10:48:38, loss=0.574829254489912, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.05(1.03), norm=2.7632404985528947, lr=0.3878010098394614
2023-12-05 16:05:07   INFO  epoch: 5/72, acc_iter=22310, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:53, time_cost(all): 5:09:52/2 days, 9:22:36, loss=0.574770057063971, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=1.043144306660754, lr=0.3888124676333506
2023-12-05 16:05:48   INFO  epoch: 5/72, acc_iter=22360, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:30, time_cost(all): 5:10:33/2 days, 8:57:44, loss=0.57471085963803, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=0.6912247248174426, lr=0.38982392542723976
2023-12-05 16:06:30   INFO  epoch: 5/72, acc_iter=22410, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:11:03, time_cost(all): 5:11:15/2 days, 8:27:18, loss=0.574651662212089, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.83(1.03), norm=2.224145626752689, lr=0.39083538322112893
2023-12-05 16:07:12   INFO  epoch: 5/72, acc_iter=22460, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:34, time_cost(all): 5:11:57/2 days, 13:22:31, loss=0.574592464786148, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=2.3444776457220193, lr=0.3918468410150181
2023-12-05 16:07:54   INFO  epoch: 5/72, acc_iter=22510, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:45, time_cost(all): 5:12:39/2 days, 11:18:00, loss=0.574533267360207, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=0.5355483642581287, lr=0.39285829880890727
2023-12-05 16:08:35   INFO  epoch: 5/72, acc_iter=22560, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:50, time_cost(all): 5:13:20/2 days, 9:41:11, loss=0.574474069934266, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.83(1.03), norm=4.310298841354571, lr=0.3938697566027965
2023-12-05 16:09:17   INFO  epoch: 5/72, acc_iter=22610, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:58, time_cost(all): 5:14:02/2 days, 13:30:46, loss=0.574414872508325, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=1.9876605974539077, lr=0.39488121439668566
2023-12-05 16:09:59   INFO  epoch: 5/72, acc_iter=22660, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:01, time_cost(all): 5:14:44/2 days, 13:39:16, loss=0.574355675082384, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=2.010874801050132, lr=0.39589267219057483
2023-12-05 16:10:41   INFO  epoch: 5/72, acc_iter=22710, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:11, time_cost(all): 5:15:26/2 days, 12:02:09, loss=0.574296477656443, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=0.8379089294809106, lr=0.396904129984464
2023-12-05 16:11:23   INFO  epoch: 5/72, acc_iter=22760, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:59, time_cost(all): 5:16:08/2 days, 9:11:09, loss=0.574237280230502, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=4.550046533421407, lr=0.3979155877783532
2023-12-05 16:12:04   INFO  epoch: 5/72, acc_iter=22810, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:01, time_cost(all): 5:16:49/2 days, 10:10:32, loss=0.574178082804561, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=2.2620526166756276, lr=0.39892704557224234
2023-12-05 16:12:46   INFO  epoch: 5/72, acc_iter=22860, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:09, time_cost(all): 5:17:31/2 days, 11:44:04, loss=0.574118885378621, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=4.757275292059469, lr=0.3999385033661315
2023-12-05 16:13:28   INFO  epoch: 5/72, acc_iter=22910, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:48, time_cost(all): 5:18:13/2 days, 9:58:40, loss=0.57405968795268, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=1.6453923702961955, lr=0.4009499611600207
2023-12-05 16:14:10   INFO  epoch: 5/72, acc_iter=22960, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:48, time_cost(all): 5:18:55/2 days, 10:44:32, loss=0.574000490526739, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=3.739959873767878, lr=0.40196141895390985
2023-12-05 16:14:51   INFO  epoch: 5/72, acc_iter=23010, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:13, time_cost(all): 5:19:36/2 days, 13:03:44, loss=0.573941293100798, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=1.2989094178403007, lr=0.402972876747799
2023-12-05 16:15:33   INFO  epoch: 5/72, acc_iter=23060, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 5:20:18/2 days, 13:40:52, loss=0.573882095674857, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=3.026495807505899, lr=0.40398433454168825
2023-12-05 16:16:15   INFO  epoch: 5/72, acc_iter=23110, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 5:21:00/2 days, 11:53:44, loss=0.573822898248916, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.1(1.03), norm=2.6353696528859794, lr=0.4049957923355774
2023-12-05 16:16:57   INFO  epoch: 5/72, acc_iter=23160, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 5:21:42/2 days, 10:02:53, loss=0.573763700822975, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=1.9799694641970813, lr=0.4060072501294666
2023-12-05 16:17:39   INFO  epoch: 6/72, acc_iter=23222, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:50:52, time_cost(all): 5:22:24/2 days, 11:05:19, loss=0.573690296014808, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=3.7107721225596353, lr=0.40726145779388917
2023-12-05 16:18:20   INFO  epoch: 6/72, acc_iter=23272, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:37, time_cost(all): 5:23:05/2 days, 12:13:02, loss=0.573631098588867, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=0.7654972479589237, lr=0.40827291558777834
2023-12-05 16:19:02   INFO  epoch: 6/72, acc_iter=23322, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:36, time_cost(all): 5:23:47/2 days, 11:58:44, loss=0.573571901162926, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=2.3822737557694316, lr=0.4092843733816675
2023-12-05 16:19:44   INFO  epoch: 6/72, acc_iter=23372, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:17, time_cost(all): 5:24:29/2 days, 13:30:17, loss=0.573512703736985, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=4.944694257702883, lr=0.4102958311755567
2023-12-05 16:20:26   INFO  epoch: 6/72, acc_iter=23422, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:26, time_cost(all): 5:25:11/2 days, 12:27:06, loss=0.573453506311044, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=4.339130824472882, lr=0.41130728896944585
2023-12-05 16:21:08   INFO  epoch: 6/72, acc_iter=23472, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:36, time_cost(all): 5:25:53/2 days, 12:07:52, loss=0.573394308885103, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=1.3520436036214176, lr=0.412318746763335
2023-12-05 16:21:49   INFO  epoch: 6/72, acc_iter=23522, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:16, time_cost(all): 5:26:34/2 days, 8:21:01, loss=0.573335111459162, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=1.949868336195161, lr=0.4133302045572242
2023-12-05 16:22:31   INFO  epoch: 6/72, acc_iter=23572, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:00, time_cost(all): 5:27:16/2 days, 13:23:20, loss=0.573275914033222, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.914938213005097, lr=0.41434166235111336
2023-12-05 16:23:13   INFO  epoch: 6/72, acc_iter=23622, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:15, time_cost(all): 5:27:58/2 days, 10:31:35, loss=0.573216716607281, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=4.510644468394162, lr=0.4153531201450026
2023-12-05 16:23:55   INFO  epoch: 6/72, acc_iter=23672, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:46, time_cost(all): 5:28:40/2 days, 11:42:33, loss=0.57315751918134, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=2.698890037519472, lr=0.41636457793889176
2023-12-05 16:24:36   INFO  epoch: 6/72, acc_iter=23722, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:13, time_cost(all): 5:29:21/2 days, 10:47:45, loss=0.573098321755399, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=1.5690468891976235, lr=0.4173760357327809
2023-12-05 16:25:18   INFO  epoch: 6/72, acc_iter=23772, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:04, time_cost(all): 5:30:03/2 days, 8:30:31, loss=0.573039124329458, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=2.2508402695062606, lr=0.4183874935266701
2023-12-05 16:26:00   INFO  epoch: 6/72, acc_iter=23822, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:56, time_cost(all): 5:30:45/2 days, 12:19:08, loss=0.572979926903517, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=0.9295032103420552, lr=0.41939895132055927
2023-12-05 16:26:42   INFO  epoch: 6/72, acc_iter=23872, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:30, time_cost(all): 5:31:27/2 days, 13:10:13, loss=0.572920729477576, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=0.5202259507817044, lr=0.42041040911444844
2023-12-05 16:27:24   INFO  epoch: 6/72, acc_iter=23922, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:52, time_cost(all): 5:32:09/2 days, 10:04:54, loss=0.572861532051635, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=1.0113746587797352, lr=0.4214218669083376
2023-12-05 16:28:05   INFO  epoch: 6/72, acc_iter=23972, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:43, time_cost(all): 5:32:50/2 days, 10:46:27, loss=0.572802334625694, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=4.360717587508994, lr=0.4224333247022268
2023-12-05 16:28:47   INFO  epoch: 6/72, acc_iter=24022, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:08, time_cost(all): 5:33:32/2 days, 12:22:03, loss=0.572743137199753, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=0.9772832560390101, lr=0.42344478249611595
2023-12-05 16:29:29   INFO  epoch: 6/72, acc_iter=24072, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:48, time_cost(all): 5:34:14/2 days, 9:59:40, loss=0.572683939773812, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=3.648989380912837, lr=0.4244562402900051
2023-12-05 16:30:11   INFO  epoch: 6/72, acc_iter=24122, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:11, time_cost(all): 5:34:56/2 days, 12:58:37, loss=0.572624742347871, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=3.757254394055787, lr=0.42546769808389434
2023-12-05 16:30:52   INFO  epoch: 6/72, acc_iter=24172, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:21, time_cost(all): 5:35:37/2 days, 13:29:10, loss=0.57256554492193, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=4.875029528612661, lr=0.4264791558777835
2023-12-05 16:31:34   INFO  epoch: 6/72, acc_iter=24222, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:20, time_cost(all): 5:36:19/2 days, 12:34:04, loss=0.572506347495989, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=1.5414108691248616, lr=0.4274906136716727
2023-12-05 16:32:16   INFO  epoch: 6/72, acc_iter=24272, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:48, time_cost(all): 5:37:01/2 days, 10:44:14, loss=0.572447150070048, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=0.9417404939771479, lr=0.42850207146556185
2023-12-05 16:32:58   INFO  epoch: 6/72, acc_iter=24322, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:13, time_cost(all): 5:37:43/2 days, 11:19:50, loss=0.572387952644107, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=1.8508774826094214, lr=0.429513529259451
2023-12-05 16:33:40   INFO  epoch: 6/72, acc_iter=24372, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:57, time_cost(all): 5:38:25/2 days, 10:36:36, loss=0.572328755218166, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=4.481922993374375, lr=0.4305249870533402
2023-12-05 16:34:21   INFO  epoch: 6/72, acc_iter=24422, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:58, time_cost(all): 5:39:06/2 days, 7:59:04, loss=0.572269557792226, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=2.7287395430373205, lr=0.43153644484722936
2023-12-05 16:35:03   INFO  epoch: 6/72, acc_iter=24472, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:58, time_cost(all): 5:39:48/2 days, 13:46:12, loss=0.572210360366285, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=2.478896319991796, lr=0.43254790264111853
2023-12-05 16:35:45   INFO  epoch: 6/72, acc_iter=24522, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:09, time_cost(all): 5:40:30/2 days, 11:46:06, loss=0.572151162940344, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=1.5528693674840603, lr=0.4335593604350077
2023-12-05 16:36:27   INFO  epoch: 6/72, acc_iter=24572, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:07, time_cost(all): 5:41:12/2 days, 11:15:01, loss=0.572091965514403, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=1.2880566609595683, lr=0.43457081822889687
2023-12-05 16:37:08   INFO  epoch: 6/72, acc_iter=24622, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:47, time_cost(all): 5:41:53/2 days, 13:34:06, loss=0.572032768088462, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=1.015360820507817, lr=0.4355822760227861
2023-12-05 16:37:50   INFO  epoch: 6/72, acc_iter=24672, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:27, time_cost(all): 5:42:35/2 days, 8:20:54, loss=0.571973570662521, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=4.023845127682022, lr=0.4365937338166753
2023-12-05 16:38:32   INFO  epoch: 6/72, acc_iter=24722, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:55, time_cost(all): 5:43:17/2 days, 9:16:38, loss=0.57191437323658, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=4.3103515877182605, lr=0.4376051916105645
2023-12-05 16:39:14   INFO  epoch: 6/72, acc_iter=24772, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:31, time_cost(all): 5:43:59/2 days, 10:13:22, loss=0.571855175810639, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=3.402986378286374, lr=0.43861664940445366
2023-12-05 16:39:56   INFO  epoch: 6/72, acc_iter=24822, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:02, time_cost(all): 5:44:41/2 days, 9:15:30, loss=0.571795978384698, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=3.022507499667486, lr=0.43962810719834283
2023-12-05 16:40:37   INFO  epoch: 6/72, acc_iter=24872, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:59, time_cost(all): 5:45:22/2 days, 8:18:03, loss=0.571736780958757, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=1.8941252599209817, lr=0.440639564992232
2023-12-05 16:41:19   INFO  epoch: 6/72, acc_iter=24922, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:49, time_cost(all): 5:46:04/2 days, 12:23:11, loss=0.571677583532816, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=4.5816311998422385, lr=0.4416510227861212
2023-12-05 16:42:01   INFO  epoch: 6/72, acc_iter=24972, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:28, time_cost(all): 5:46:46/2 days, 11:52:58, loss=0.571618386106875, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=1.6216946873307443, lr=0.44266248058001034
2023-12-05 16:42:43   INFO  epoch: 6/72, acc_iter=25022, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:29:00, time_cost(all): 5:47:28/2 days, 12:58:01, loss=0.571559188680934, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=1.583111832134414, lr=0.4436739383738995
2023-12-05 16:43:24   INFO  epoch: 6/72, acc_iter=25072, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:27, time_cost(all): 5:48:09/2 days, 11:20:29, loss=0.571499991254993, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=3.875760812949565, lr=0.4446853961677887
2023-12-05 16:44:06   INFO  epoch: 6/72, acc_iter=25122, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:58, time_cost(all): 5:48:51/2 days, 11:57:12, loss=0.571440793829052, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=4.019868928209307, lr=0.44569685396167785
2023-12-05 16:44:48   INFO  epoch: 6/72, acc_iter=25172, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:54, time_cost(all): 5:49:33/2 days, 8:36:58, loss=0.571381596403111, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=2.6281672092499164, lr=0.4467083117555671
2023-12-05 16:45:30   INFO  epoch: 6/72, acc_iter=25222, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:51, time_cost(all): 5:50:15/2 days, 9:55:09, loss=0.57132239897717, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=0.7776650929277664, lr=0.44771976954945625
2023-12-05 16:46:12   INFO  epoch: 6/72, acc_iter=25272, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:42, time_cost(all): 5:50:57/2 days, 8:01:18, loss=0.571263201551229, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=3.3069256185300606, lr=0.4487312273433454
2023-12-05 16:46:53   INFO  epoch: 6/72, acc_iter=25322, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:08, time_cost(all): 5:51:38/2 days, 11:55:24, loss=0.571204004125289, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=2.59754385644999, lr=0.4497426851372346
2023-12-05 16:47:35   INFO  epoch: 6/72, acc_iter=25372, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:45, time_cost(all): 5:52:20/2 days, 10:38:20, loss=0.571144806699348, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=1.4210210763566369, lr=0.45075414293112376
2023-12-05 16:48:17   INFO  epoch: 6/72, acc_iter=25422, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:35, time_cost(all): 5:53:02/2 days, 10:10:08, loss=0.571085609273407, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=1.752002363897215, lr=0.45176560072501293
2023-12-05 16:48:59   INFO  epoch: 6/72, acc_iter=25472, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:17, time_cost(all): 5:53:44/2 days, 8:39:53, loss=0.571026411847466, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=2.2972032093067107, lr=0.4527770585189021
2023-12-05 16:49:40   INFO  epoch: 6/72, acc_iter=25522, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:09, time_cost(all): 5:54:25/2 days, 10:06:29, loss=0.570967214421525, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=1.0023560664903453, lr=0.45378851631279127
2023-12-05 16:50:22   INFO  epoch: 6/72, acc_iter=25572, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:22, time_cost(all): 5:55:07/2 days, 11:53:54, loss=0.570908016995584, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=4.907907196654397, lr=0.45479997410668044
2023-12-05 16:51:04   INFO  epoch: 6/72, acc_iter=25622, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:57, time_cost(all): 5:55:49/2 days, 10:26:04, loss=0.570848819569643, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=4.754733412132001, lr=0.4558114319005696
2023-12-05 16:51:46   INFO  epoch: 6/72, acc_iter=25672, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:42, time_cost(all): 5:56:31/2 days, 8:46:20, loss=0.570789622143702, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=2.7209227911508846, lr=0.45682288969445883
2023-12-05 16:52:28   INFO  epoch: 6/72, acc_iter=25722, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:29, time_cost(all): 5:57:13/2 days, 9:20:36, loss=0.570730424717761, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=0.5882840199260768, lr=0.457834347488348
2023-12-05 16:53:09   INFO  epoch: 6/72, acc_iter=25772, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:11, time_cost(all): 5:57:54/2 days, 10:34:56, loss=0.57067122729182, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=2.558402025113629, lr=0.4588458052822372
2023-12-05 16:53:51   INFO  epoch: 6/72, acc_iter=25822, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:25, time_cost(all): 5:58:36/2 days, 12:25:57, loss=0.570612029865879, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=1.0386914721048348, lr=0.45985726307612634
2023-12-05 16:54:33   INFO  epoch: 6/72, acc_iter=25872, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:24, time_cost(all): 5:59:18/2 days, 9:48:56, loss=0.570552832439938, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=1.0149084731626683, lr=0.4608687208700155
2023-12-05 16:55:15   INFO  epoch: 6/72, acc_iter=25922, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:52, time_cost(all): 6:00:00/2 days, 12:12:49, loss=0.570493635013997, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=2.9282810445372, lr=0.4618801786639047
2023-12-05 16:55:57   INFO  epoch: 6/72, acc_iter=25972, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:17, time_cost(all): 6:00:42/2 days, 11:50:59, loss=0.570434437588056, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=1.5147730767643734, lr=0.46289163645779385
2023-12-05 16:56:38   INFO  epoch: 6/72, acc_iter=26022, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:39, time_cost(all): 6:01:23/2 days, 11:36:29, loss=0.570375240162115, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=4.2884873716999765, lr=0.463903094251683
2023-12-05 16:57:20   INFO  epoch: 6/72, acc_iter=26072, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:31, time_cost(all): 6:02:05/2 days, 7:35:09, loss=0.570316042736174, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=1.2740584421306476, lr=0.4649145520455722
2023-12-05 16:58:02   INFO  epoch: 6/72, acc_iter=26122, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:15, time_cost(all): 6:02:47/2 days, 12:54:51, loss=0.570256845310233, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=4.641089288686295, lr=0.46592600983946136
2023-12-05 16:58:44   INFO  epoch: 6/72, acc_iter=26172, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:21, time_cost(all): 6:03:29/2 days, 12:05:37, loss=0.570197647884293, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=2.6945100605065613, lr=0.4669374676333506
2023-12-05 16:59:25   INFO  epoch: 6/72, acc_iter=26222, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:32, time_cost(all): 6:04:10/2 days, 9:39:02, loss=0.570138450458352, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=4.261876551536281, lr=0.46794892542723976
2023-12-05 17:00:07   INFO  epoch: 6/72, acc_iter=26272, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:17, time_cost(all): 6:04:52/2 days, 10:36:44, loss=0.570079253032411, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=2.278019012005327, lr=0.46896038322112893
2023-12-05 17:00:49   INFO  epoch: 6/72, acc_iter=26322, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:55, time_cost(all): 6:05:34/2 days, 12:08:57, loss=0.57002005560647, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=4.881286390153638, lr=0.4699718410150181
2023-12-05 17:01:31   INFO  epoch: 6/72, acc_iter=26372, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:00, time_cost(all): 6:06:16/2 days, 8:20:02, loss=0.569960858180529, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=2.1008966510274183, lr=0.47098329880890727
2023-12-05 17:02:13   INFO  epoch: 6/72, acc_iter=26422, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:07, time_cost(all): 6:06:58/2 days, 12:10:21, loss=0.569901660754588, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=1.7795801245150902, lr=0.47199475660279644
2023-12-05 17:02:54   INFO  epoch: 6/72, acc_iter=26472, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:04, time_cost(all): 6:07:39/2 days, 12:15:21, loss=0.569842463328647, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=2.0001954146411647, lr=0.4730062143966856
2023-12-05 17:03:36   INFO  epoch: 6/72, acc_iter=26522, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:03, time_cost(all): 6:08:21/2 days, 10:20:50, loss=0.569783265902706, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=1.6149367964344548, lr=0.4740176721905748
2023-12-05 17:04:18   INFO  epoch: 6/72, acc_iter=26572, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:17, time_cost(all): 6:09:03/2 days, 12:11:40, loss=0.569724068476765, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=3.598742420106978, lr=0.47502912998446395
2023-12-05 17:05:00   INFO  epoch: 6/72, acc_iter=26622, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:06:00, time_cost(all): 6:09:45/2 days, 8:35:47, loss=0.569664871050824, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=2.167112907010127, lr=0.4760405877783531
2023-12-05 17:05:41   INFO  epoch: 6/72, acc_iter=26672, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:03, time_cost(all): 6:10:26/2 days, 7:37:42, loss=0.569605673624883, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=3.93676371926748, lr=0.47705204557224234
2023-12-05 17:06:23   INFO  epoch: 6/72, acc_iter=26722, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:24, time_cost(all): 6:11:08/2 days, 9:59:44, loss=0.569546476198942, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=4.6341883042055985, lr=0.4780635033661315
2023-12-05 17:07:05   INFO  epoch: 6/72, acc_iter=26772, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:37, time_cost(all): 6:11:50/2 days, 8:16:46, loss=0.569487278773001, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=1.860963801252364, lr=0.4790749611600207
2023-12-05 17:07:47   INFO  epoch: 6/72, acc_iter=26822, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:56, time_cost(all): 6:12:32/2 days, 9:31:02, loss=0.56942808134706, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=4.755076750820534, lr=0.48008641895390985
2023-12-05 17:08:29   INFO  epoch: 6/72, acc_iter=26872, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:12, time_cost(all): 6:13:14/2 days, 11:08:31, loss=0.569368883921119, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=2.399088600139595, lr=0.481097876747799
2023-12-05 17:09:10   INFO  epoch: 6/72, acc_iter=26922, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:38, time_cost(all): 6:13:55/2 days, 10:07:55, loss=0.569309686495178, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=4.9923973688168655, lr=0.4821093345416882
2023-12-05 17:09:52   INFO  epoch: 6/72, acc_iter=26972, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 6:14:37/2 days, 9:13:24, loss=0.569250489069237, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=1.8253048455306509, lr=0.48312079233557736
2023-12-05 17:10:34   INFO  epoch: 6/72, acc_iter=27022, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 6:15:19/2 days, 9:51:54, loss=0.569191291643297, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=1.2258217967863154, lr=0.48413225012946653
2023-12-05 17:11:16   INFO  epoch: 7/72, acc_iter=27084, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:50, time_cost(all): 6:16:01/2 days, 11:35:15, loss=0.56911788683513, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=4.259597344245005, lr=0.4853864577938891
2023-12-05 17:11:57   INFO  epoch: 7/72, acc_iter=27134, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:47, time_cost(all): 6:16:42/2 days, 10:36:59, loss=0.569058689409189, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=1.722137929971216, lr=0.4863979155877783
2023-12-05 17:12:39   INFO  epoch: 7/72, acc_iter=27184, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:01, time_cost(all): 6:17:24/2 days, 8:55:57, loss=0.568999491983248, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=3.1656666306894947, lr=0.48740937338166745
2023-12-05 17:13:21   INFO  epoch: 7/72, acc_iter=27234, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:31, time_cost(all): 6:18:06/2 days, 9:24:03, loss=0.568940294557307, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=0.8875111348969795, lr=0.4884208311755567
2023-12-05 17:14:03   INFO  epoch: 7/72, acc_iter=27284, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:30, time_cost(all): 6:18:48/2 days, 9:07:36, loss=0.568881097131366, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=0.9892953302402128, lr=0.48943228896944585
2023-12-05 17:14:45   INFO  epoch: 7/72, acc_iter=27334, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:42, time_cost(all): 6:19:30/2 days, 7:54:49, loss=0.568821899705425, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=4.895949821955325, lr=0.490443746763335
2023-12-05 17:15:26   INFO  epoch: 7/72, acc_iter=27384, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:11, time_cost(all): 6:20:11/2 days, 9:59:08, loss=0.568762702279484, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=3.7782263784608183, lr=0.4914552045572242
2023-12-05 17:16:08   INFO  epoch: 7/72, acc_iter=27434, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:14, time_cost(all): 6:20:53/2 days, 12:02:21, loss=0.568703504853543, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=3.3912596233347547, lr=0.49246666235111336
2023-12-05 17:16:50   INFO  epoch: 7/72, acc_iter=27484, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:28, time_cost(all): 6:21:35/2 days, 9:33:47, loss=0.568644307427602, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=1.982588538896406, lr=0.4934781201450026
2023-12-05 17:17:32   INFO  epoch: 7/72, acc_iter=27534, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:34, time_cost(all): 6:22:17/2 days, 11:45:17, loss=0.568585110001661, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=3.5746765360425963, lr=0.49448957793889176
2023-12-05 17:18:13   INFO  epoch: 7/72, acc_iter=27584, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:28, time_cost(all): 6:22:58/2 days, 9:16:24, loss=0.56852591257572, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=1.701273455320814, lr=0.4955010357327809
2023-12-05 17:18:55   INFO  epoch: 7/72, acc_iter=27634, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:32, time_cost(all): 6:23:40/2 days, 9:09:38, loss=0.568466715149779, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=0.587091327803031, lr=0.4965124935266701
2023-12-05 17:19:37   INFO  epoch: 7/72, acc_iter=27684, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:56, time_cost(all): 6:24:22/2 days, 10:24:06, loss=0.568407517723838, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.870354158033101, lr=0.49752395132055927
2023-12-05 17:20:19   INFO  epoch: 7/72, acc_iter=27734, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:19, time_cost(all): 6:25:04/2 days, 12:03:46, loss=0.568348320297898, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=1.9255811964777456, lr=0.49853540911444844
2023-12-05 17:21:01   INFO  epoch: 7/72, acc_iter=27784, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:46, time_cost(all): 6:25:46/2 days, 11:58:42, loss=0.568289122871957, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=2.2081338571807536, lr=0.49954686690833766
2023-12-05 17:21:42   INFO  epoch: 7/72, acc_iter=27834, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:43, time_cost(all): 6:26:27/2 days, 9:09:08, loss=0.568229925446016, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=2.852025918240679, lr=0.49993709017439697
2023-12-05 17:22:24   INFO  epoch: 7/72, acc_iter=27884, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:31, time_cost(all): 6:27:09/2 days, 12:09:06, loss=0.568170728020075, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=3.637435871493769, lr=0.4998231230990292
2023-12-05 17:23:06   INFO  epoch: 7/72, acc_iter=27934, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:46, time_cost(all): 6:27:51/2 days, 7:15:21, loss=0.568111530594134, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=4.29006623202685, lr=0.4997091560236614
2023-12-05 17:23:48   INFO  epoch: 7/72, acc_iter=27984, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:34, time_cost(all): 6:28:33/2 days, 7:14:10, loss=0.568052333168193, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=4.1506004700731385, lr=0.4995951889482936
2023-12-05 17:24:29   INFO  epoch: 7/72, acc_iter=28034, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:37:53, time_cost(all): 6:29:14/2 days, 9:18:58, loss=0.567993135742252, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=2.488099261653259, lr=0.4994812218729258
2023-12-05 17:25:11   INFO  epoch: 7/72, acc_iter=28084, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:37, time_cost(all): 6:29:56/2 days, 9:10:35, loss=0.567933938316311, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=1.3401846044445898, lr=0.499367254797558
2023-12-05 17:25:53   INFO  epoch: 7/72, acc_iter=28134, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:38, time_cost(all): 6:30:38/2 days, 12:10:20, loss=0.56787474089037, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=1.4667597908632461, lr=0.4992532877221902
2023-12-05 17:26:35   INFO  epoch: 7/72, acc_iter=28184, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:18, time_cost(all): 6:31:20/2 days, 12:46:23, loss=0.567815543464429, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=4.014136942220764, lr=0.4991393206468224
2023-12-05 17:27:17   INFO  epoch: 7/72, acc_iter=28234, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:17, time_cost(all): 6:32:02/2 days, 8:53:40, loss=0.567756346038488, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=3.0434255123518605, lr=0.4990253535714546
2023-12-05 17:27:58   INFO  epoch: 7/72, acc_iter=28284, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:22, time_cost(all): 6:32:43/2 days, 10:12:59, loss=0.567697148612547, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=2.616541640423879, lr=0.49891138649608685
2023-12-05 17:28:40   INFO  epoch: 7/72, acc_iter=28334, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:32, time_cost(all): 6:33:25/2 days, 7:41:42, loss=0.567637951186606, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=4.344322587427795, lr=0.49879741942071903
2023-12-05 17:29:22   INFO  epoch: 7/72, acc_iter=28384, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:04, time_cost(all): 6:34:07/2 days, 10:11:09, loss=0.567578753760665, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=3.709286343824121, lr=0.4986834523453512
2023-12-05 17:30:04   INFO  epoch: 7/72, acc_iter=28434, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:57, time_cost(all): 6:34:49/2 days, 7:41:40, loss=0.567519556334724, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=3.177938946391111, lr=0.49856948526998346
2023-12-05 17:30:46   INFO  epoch: 7/72, acc_iter=28484, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:35, time_cost(all): 6:35:31/2 days, 8:34:03, loss=0.567460358908783, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=4.482531720287927, lr=0.49845551819461564
2023-12-05 17:31:27   INFO  epoch: 7/72, acc_iter=28534, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:14, time_cost(all): 6:36:12/2 days, 11:23:00, loss=0.567401161482842, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=2.824027839159525, lr=0.4983415511192479
2023-12-05 17:32:09   INFO  epoch: 7/72, acc_iter=28584, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:41, time_cost(all): 6:36:54/2 days, 9:07:46, loss=0.567341964056902, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=3.98372853174602, lr=0.49822758404388007
2023-12-05 17:32:51   INFO  epoch: 7/72, acc_iter=28634, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:05, time_cost(all): 6:37:36/2 days, 12:18:01, loss=0.567282766630961, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=2.000719477057086, lr=0.49811361696851225
2023-12-05 17:33:33   INFO  epoch: 7/72, acc_iter=28684, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:31, time_cost(all): 6:38:18/2 days, 12:04:34, loss=0.56722356920502, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=0.9870184247424392, lr=0.4979996498931445
2023-12-05 17:34:14   INFO  epoch: 7/72, acc_iter=28734, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:25, time_cost(all): 6:38:59/2 days, 12:14:50, loss=0.567164371779079, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=3.2513746209224594, lr=0.4978856828177767
2023-12-05 17:34:56   INFO  epoch: 7/72, acc_iter=28784, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:16, time_cost(all): 6:39:41/2 days, 12:17:21, loss=0.567105174353138, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=1.6734369807293306, lr=0.49777171574240886
2023-12-05 17:35:38   INFO  epoch: 7/72, acc_iter=28834, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:30:06, time_cost(all): 6:40:23/2 days, 7:19:40, loss=0.567045976927197, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=3.585559848027717, lr=0.4976577486670411
2023-12-05 17:36:20   INFO  epoch: 7/72, acc_iter=28884, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:19, time_cost(all): 6:41:05/2 days, 9:27:37, loss=0.566986779501256, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=4.99850615652144, lr=0.4975437815916733
2023-12-05 17:37:02   INFO  epoch: 7/72, acc_iter=28934, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:33, time_cost(all): 6:41:47/2 days, 10:40:32, loss=0.566927582075315, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=1.8703612402393555, lr=0.4974298145163055
2023-12-05 17:37:43   INFO  epoch: 7/72, acc_iter=28984, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:39, time_cost(all): 6:42:28/2 days, 10:50:07, loss=0.566868384649374, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=4.234444770190942, lr=0.4973158474409377
2023-12-05 17:38:25   INFO  epoch: 7/72, acc_iter=29034, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:24:41, time_cost(all): 6:43:10/2 days, 7:25:30, loss=0.566809187223433, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=0.6425556125209222, lr=0.4972018803655699
2023-12-05 17:39:07   INFO  epoch: 7/72, acc_iter=29084, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:31, time_cost(all): 6:43:52/2 days, 10:47:35, loss=0.566749989797492, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=1.5727712811657062, lr=0.49708791329020213
2023-12-05 17:39:49   INFO  epoch: 7/72, acc_iter=29134, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:38, time_cost(all): 6:44:34/2 days, 8:31:18, loss=0.566690792371551, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=3.5518930293777573, lr=0.4969739462148343
2023-12-05 17:40:30   INFO  epoch: 7/72, acc_iter=29184, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:06, time_cost(all): 6:45:15/2 days, 7:51:05, loss=0.56663159494561, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=4.6800205161651345, lr=0.4968599791394665
2023-12-05 17:41:12   INFO  epoch: 7/72, acc_iter=29234, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:18, time_cost(all): 6:45:57/2 days, 8:50:14, loss=0.566572397519669, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=4.211117848681452, lr=0.49674601206409874
2023-12-05 17:41:54   INFO  epoch: 7/72, acc_iter=29284, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:51, time_cost(all): 6:46:39/2 days, 8:38:22, loss=0.566513200093728, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=0.5350664121057173, lr=0.4966320449887309
2023-12-05 17:42:36   INFO  epoch: 7/72, acc_iter=29334, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:59, time_cost(all): 6:47:21/2 days, 11:57:38, loss=0.566454002667787, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=2.147259646038946, lr=0.49651807791336316
2023-12-05 17:43:18   INFO  epoch: 7/72, acc_iter=29384, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:37, time_cost(all): 6:48:03/2 days, 6:59:19, loss=0.566394805241846, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=1.9946655607905641, lr=0.49640411083799535
2023-12-05 17:43:59   INFO  epoch: 7/72, acc_iter=29434, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:52, time_cost(all): 6:48:44/2 days, 8:44:09, loss=0.566335607815906, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=4.797709856371395, lr=0.49629014376262753
2023-12-05 17:44:41   INFO  epoch: 7/72, acc_iter=29484, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:53, time_cost(all): 6:49:26/2 days, 11:43:37, loss=0.566276410389965, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=4.600522520623686, lr=0.4961761766872598
2023-12-05 17:45:23   INFO  epoch: 7/72, acc_iter=29534, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:37, time_cost(all): 6:50:08/2 days, 8:30:18, loss=0.566217212964024, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=3.8624288309981845, lr=0.49606220961189196
2023-12-05 17:46:05   INFO  epoch: 7/72, acc_iter=29584, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:42, time_cost(all): 6:50:50/2 days, 12:31:33, loss=0.566158015538083, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=4.261790623609549, lr=0.4959482425365242
2023-12-05 17:46:46   INFO  epoch: 7/72, acc_iter=29634, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:52, time_cost(all): 6:51:31/2 days, 10:44:51, loss=0.566098818112142, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=4.836859346765437, lr=0.4958342754611564
2023-12-05 17:47:28   INFO  epoch: 7/72, acc_iter=29684, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:39, time_cost(all): 6:52:13/2 days, 12:25:53, loss=0.566039620686201, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=1.6037089535461988, lr=0.49572030838578857
2023-12-05 17:48:10   INFO  epoch: 7/72, acc_iter=29734, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:38, time_cost(all): 6:52:55/2 days, 8:03:44, loss=0.56598042326026, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.88(1.03), norm=4.907421914805153, lr=0.4956063413104208
2023-12-05 17:48:52   INFO  epoch: 7/72, acc_iter=29784, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:56, time_cost(all): 6:53:37/2 days, 10:12:30, loss=0.565921225834319, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=4.900287029612895, lr=0.495492374235053
2023-12-05 17:49:34   INFO  epoch: 7/72, acc_iter=29834, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:32, time_cost(all): 6:54:19/2 days, 9:16:08, loss=0.565862028408378, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=3.8544891518690747, lr=0.4953784071596852
2023-12-05 17:50:15   INFO  epoch: 7/72, acc_iter=29884, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:40, time_cost(all): 6:55:00/2 days, 11:12:11, loss=0.565802830982437, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=1.1999625847387987, lr=0.4952644400843174
2023-12-05 17:50:57   INFO  epoch: 7/72, acc_iter=29934, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:16, time_cost(all): 6:55:42/2 days, 11:21:57, loss=0.565743633556496, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=1.2545355044876512, lr=0.4951504730089496
2023-12-05 17:51:39   INFO  epoch: 7/72, acc_iter=29984, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:51, time_cost(all): 6:56:24/2 days, 9:22:31, loss=0.565684436130555, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=4.018874691688476, lr=0.49503650593358184
2023-12-05 17:52:21   INFO  epoch: 7/72, acc_iter=30034, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:01, time_cost(all): 6:57:06/2 days, 12:16:09, loss=0.565625238704614, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=0.7634009400343478, lr=0.494922538858214
2023-12-05 17:53:02   INFO  epoch: 7/72, acc_iter=30084, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:56, time_cost(all): 6:57:47/2 days, 10:20:14, loss=0.565566041278673, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=3.469659032881502, lr=0.4948085717828462
2023-12-05 17:53:44   INFO  epoch: 7/72, acc_iter=30134, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:23, time_cost(all): 6:58:29/2 days, 8:30:44, loss=0.565506843852732, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=3.7620458029347086, lr=0.49469460470747845
2023-12-05 17:54:26   INFO  epoch: 7/72, acc_iter=30184, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:35, time_cost(all): 6:59:11/2 days, 8:46:20, loss=0.565447646426791, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=4.964481608701793, lr=0.49458063763211063
2023-12-05 17:55:08   INFO  epoch: 7/72, acc_iter=30234, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:35, time_cost(all): 6:59:53/2 days, 7:59:01, loss=0.565388449000851, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=4.349783997810317, lr=0.49446667055674287
2023-12-05 17:55:50   INFO  epoch: 7/72, acc_iter=30284, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:27, time_cost(all): 7:00:35/2 days, 8:25:34, loss=0.56532925157491, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=3.286733700133971, lr=0.49435270348137506
2023-12-05 17:56:31   INFO  epoch: 7/72, acc_iter=30334, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:02, time_cost(all): 7:01:16/2 days, 12:14:53, loss=0.565270054148969, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=1.1855410031378375, lr=0.49423873640600724
2023-12-05 17:57:13   INFO  epoch: 7/72, acc_iter=30384, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:13, time_cost(all): 7:01:58/2 days, 10:01:34, loss=0.565210856723028, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=3.3019068479681284, lr=0.4941247693306395
2023-12-05 17:57:55   INFO  epoch: 7/72, acc_iter=30434, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:41, time_cost(all): 7:02:40/2 days, 8:18:11, loss=0.565151659297087, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=2.5780253581650947, lr=0.49401080225527166
2023-12-05 17:58:37   INFO  epoch: 7/72, acc_iter=30484, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:58, time_cost(all): 7:03:22/2 days, 9:54:46, loss=0.565092461871146, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=1.152370460660481, lr=0.49389683517990385
2023-12-05 17:59:18   INFO  epoch: 7/72, acc_iter=30534, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:15, time_cost(all): 7:04:03/2 days, 11:52:42, loss=0.565033264445205, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=1.4888344002816343, lr=0.4937828681045361
2023-12-05 18:00:00   INFO  epoch: 7/72, acc_iter=30584, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:09, time_cost(all): 7:04:45/2 days, 6:56:49, loss=0.564974067019264, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.385960150512206, lr=0.49366890102916827
2023-12-05 18:00:42   INFO  epoch: 7/72, acc_iter=30634, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:38, time_cost(all): 7:05:27/2 days, 12:12:10, loss=0.564914869593323, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=2.9688950428108356, lr=0.4935549339538005
2023-12-05 18:01:24   INFO  epoch: 7/72, acc_iter=30684, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:01, time_cost(all): 7:06:09/2 days, 8:34:50, loss=0.564855672167382, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=2.5629005741614836, lr=0.4934409668784327
2023-12-05 18:02:06   INFO  epoch: 7/72, acc_iter=30734, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:09, time_cost(all): 7:06:51/2 days, 10:22:24, loss=0.564796474741441, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=1.3299532120831934, lr=0.4933269998030649
2023-12-05 18:02:47   INFO  epoch: 7/72, acc_iter=30784, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:36, time_cost(all): 7:07:32/2 days, 10:48:22, loss=0.5647372773155, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=3.173452549802646, lr=0.4932130327276971
2023-12-05 18:03:29   INFO  epoch: 7/72, acc_iter=30834, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 7:08:14/2 days, 11:16:35, loss=0.564678079889559, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=0.9370746574213172, lr=0.4930990656523293
2023-12-05 18:04:11   INFO  epoch: 7/72, acc_iter=30884, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 7:08:56/2 days, 7:32:37, loss=0.564618882463618, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=2.160641523562142, lr=0.4929850985769615
2023-12-05 18:04:53   INFO  epoch: 8/72, acc_iter=30946, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:45, time_cost(all): 7:09:38/2 days, 12:03:07, loss=0.564545477655451, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=2.974072736889898, lr=0.49284377940350543
2023-12-05 18:05:35   INFO  epoch: 8/72, acc_iter=30996, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:17, time_cost(all): 7:10:20/2 days, 8:28:37, loss=0.564486280229511, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=4.506937580909945, lr=0.4927298123281377
2023-12-05 18:06:16   INFO  epoch: 8/72, acc_iter=31046, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:23, time_cost(all): 7:11:01/2 days, 7:37:34, loss=0.56442708280357, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=4.780592176822713, lr=0.49261584525276986
2023-12-05 18:06:58   INFO  epoch: 8/72, acc_iter=31096, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:08, time_cost(all): 7:11:43/2 days, 10:57:55, loss=0.564367885377629, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=2.0980219990169022, lr=0.49250187817740204
2023-12-05 18:07:40   INFO  epoch: 8/72, acc_iter=31146, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:36, time_cost(all): 7:12:25/2 days, 11:26:35, loss=0.564308687951688, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=0.7214842387927225, lr=0.4923879111020343
2023-12-05 18:08:22   INFO  epoch: 8/72, acc_iter=31196, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:22, time_cost(all): 7:13:07/2 days, 9:10:39, loss=0.564249490525747, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=2.477609227508573, lr=0.49227394402666647
2023-12-05 18:09:03   INFO  epoch: 8/72, acc_iter=31246, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:31, time_cost(all): 7:13:48/2 days, 8:54:44, loss=0.564190293099806, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=2.170141907832234, lr=0.4921599769512987
2023-12-05 18:09:45   INFO  epoch: 8/72, acc_iter=31296, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:53, time_cost(all): 7:14:30/2 days, 11:29:43, loss=0.564131095673865, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=3.8713508413165973, lr=0.4920460098759309
2023-12-05 18:10:27   INFO  epoch: 8/72, acc_iter=31346, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:53, time_cost(all): 7:15:12/2 days, 11:23:55, loss=0.564071898247924, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=0.954498240639565, lr=0.4919320428005631
2023-12-05 18:11:09   INFO  epoch: 8/72, acc_iter=31396, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:39, time_cost(all): 7:15:54/2 days, 7:49:28, loss=0.564012700821983, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=1.0134565973269611, lr=0.4918180757251953
2023-12-05 18:11:51   INFO  epoch: 8/72, acc_iter=31446, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:48, time_cost(all): 7:16:36/2 days, 9:55:41, loss=0.563953503396042, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=3.800602419451375, lr=0.4917041086498275
2023-12-05 18:12:32   INFO  epoch: 8/72, acc_iter=31496, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:34, time_cost(all): 7:17:17/2 days, 6:25:51, loss=0.563894305970101, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=3.176289453699704, lr=0.4915901415744597
2023-12-05 18:13:14   INFO  epoch: 8/72, acc_iter=31546, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:53, time_cost(all): 7:17:59/2 days, 7:59:48, loss=0.56383510854416, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=2.9866049759431554, lr=0.4914761744990919
2023-12-05 18:13:56   INFO  epoch: 8/72, acc_iter=31596, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:25, time_cost(all): 7:18:41/2 days, 11:02:24, loss=0.563775911118219, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=4.67149545662356, lr=0.4913622074237241
2023-12-05 18:14:38   INFO  epoch: 8/72, acc_iter=31646, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:47, time_cost(all): 7:19:23/2 days, 10:34:17, loss=0.563716713692278, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=2.354758479618914, lr=0.49124824034835635
2023-12-05 18:15:19   INFO  epoch: 8/72, acc_iter=31696, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:43, time_cost(all): 7:20:04/2 days, 7:30:19, loss=0.563657516266337, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=1.4457903579122724, lr=0.49113427327298853
2023-12-05 18:16:01   INFO  epoch: 8/72, acc_iter=31746, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:29, time_cost(all): 7:20:46/2 days, 6:37:30, loss=0.563598318840396, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=4.769891474370678, lr=0.4910203061976207
2023-12-05 18:16:43   INFO  epoch: 8/72, acc_iter=31796, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:00, time_cost(all): 7:21:28/2 days, 6:55:13, loss=0.563539121414456, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=1.9341453477156372, lr=0.49090633912225295
2023-12-05 18:17:25   INFO  epoch: 8/72, acc_iter=31846, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:43, time_cost(all): 7:22:10/2 days, 7:56:20, loss=0.563479923988515, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=2.9519226453891205, lr=0.49079237204688514
2023-12-05 18:18:07   INFO  epoch: 8/72, acc_iter=31896, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:55, time_cost(all): 7:22:52/2 days, 7:04:13, loss=0.563420726562574, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=4.034882274032712, lr=0.4906784049715174
2023-12-05 18:18:48   INFO  epoch: 8/72, acc_iter=31946, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:55, time_cost(all): 7:23:33/2 days, 9:02:34, loss=0.563361529136633, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=2.76102748613915, lr=0.49056443789614956
2023-12-05 18:19:30   INFO  epoch: 8/72, acc_iter=31996, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:37, time_cost(all): 7:24:15/2 days, 10:46:03, loss=0.563302331710692, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=1.183766090942426, lr=0.49045047082078175
2023-12-05 18:20:12   INFO  epoch: 8/72, acc_iter=32046, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:56, time_cost(all): 7:24:57/2 days, 6:19:12, loss=0.563243134284751, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=2.521319244589615, lr=0.490336503745414
2023-12-05 18:20:54   INFO  epoch: 8/72, acc_iter=32096, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:41, time_cost(all): 7:25:39/2 days, 8:20:51, loss=0.56318393685881, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=4.2290589449147085, lr=0.49022253667004617
2023-12-05 18:21:35   INFO  epoch: 8/72, acc_iter=32146, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:56, time_cost(all): 7:26:20/2 days, 6:49:44, loss=0.563124739432869, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=3.996400048009538, lr=0.49010856959467836
2023-12-05 18:22:17   INFO  epoch: 8/72, acc_iter=32196, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:04, time_cost(all): 7:27:02/2 days, 7:45:39, loss=0.563065542006928, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=0.5095934621259858, lr=0.4899946025193106
2023-12-05 18:22:59   INFO  epoch: 8/72, acc_iter=32246, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:53, time_cost(all): 7:27:44/2 days, 7:20:23, loss=0.563006344580987, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=0.5285973530245611, lr=0.4898806354439428
2023-12-05 18:23:41   INFO  epoch: 8/72, acc_iter=32296, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:35, time_cost(all): 7:28:26/2 days, 10:18:24, loss=0.562947147155046, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=2.1648374578821437, lr=0.489766668368575
2023-12-05 18:24:23   INFO  epoch: 8/72, acc_iter=32346, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:35, time_cost(all): 7:29:08/2 days, 6:58:49, loss=0.562887949729105, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=2.205646480933497, lr=0.4896527012932072
2023-12-05 18:25:04   INFO  epoch: 8/72, acc_iter=32396, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:52, time_cost(all): 7:29:49/2 days, 7:54:36, loss=0.562828752303164, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=1.7577564775749512, lr=0.4895387342178394
2023-12-05 18:25:46   INFO  epoch: 8/72, acc_iter=32446, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:15, time_cost(all): 7:30:31/2 days, 8:11:40, loss=0.562769554877223, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.3351385312831576, lr=0.48942476714247163
2023-12-05 18:26:28   INFO  epoch: 8/72, acc_iter=32496, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:37, time_cost(all): 7:31:13/2 days, 11:08:55, loss=0.562710357451282, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=1.243350752995813, lr=0.4893108000671038
2023-12-05 18:27:10   INFO  epoch: 8/72, acc_iter=32546, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:50, time_cost(all): 7:31:55/2 days, 8:35:26, loss=0.562651160025341, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=1.764612540553935, lr=0.489196832991736
2023-12-05 18:27:51   INFO  epoch: 8/72, acc_iter=32596, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:37, time_cost(all): 7:32:36/2 days, 7:12:27, loss=0.5625919625994, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=0.5014953322696416, lr=0.48908286591636824
2023-12-05 18:28:33   INFO  epoch: 8/72, acc_iter=32646, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:26, time_cost(all): 7:33:18/2 days, 10:12:39, loss=0.56253276517346, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=3.8041283553444574, lr=0.4889688988410004
2023-12-05 18:29:15   INFO  epoch: 8/72, acc_iter=32696, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:08, time_cost(all): 7:34:00/2 days, 10:07:40, loss=0.562473567747519, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.91(1.03), norm=4.610293791745063, lr=0.48885493176563266
2023-12-05 18:29:57   INFO  epoch: 8/72, acc_iter=32746, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:56, time_cost(all): 7:34:42/2 days, 9:52:58, loss=0.562414370321578, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=1.2527053310899516, lr=0.48874096469026485
2023-12-05 18:30:39   INFO  epoch: 8/72, acc_iter=32796, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:49, time_cost(all): 7:35:24/2 days, 9:12:37, loss=0.562355172895637, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=3.1567445688547844, lr=0.48862699761489703
2023-12-05 18:31:20   INFO  epoch: 8/72, acc_iter=32846, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:30, time_cost(all): 7:36:05/2 days, 8:17:51, loss=0.562295975469696, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=1.2675408410452078, lr=0.48851303053952927
2023-12-05 18:32:02   INFO  epoch: 8/72, acc_iter=32896, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:47, time_cost(all): 7:36:47/2 days, 7:15:29, loss=0.562236778043755, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=0.9110942055886682, lr=0.48839906346416145
2023-12-05 18:32:44   INFO  epoch: 8/72, acc_iter=32946, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:43, time_cost(all): 7:37:29/2 days, 10:33:47, loss=0.562177580617814, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=2.752254911962009, lr=0.4882850963887937
2023-12-05 18:33:26   INFO  epoch: 8/72, acc_iter=32996, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:05, time_cost(all): 7:38:11/2 days, 10:53:41, loss=0.562118383191873, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=4.113200992161222, lr=0.4881711293134259
2023-12-05 18:34:07   INFO  epoch: 8/72, acc_iter=33046, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:08, time_cost(all): 7:38:52/2 days, 6:21:11, loss=0.562059185765932, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=4.6223867116152135, lr=0.48805716223805806
2023-12-05 18:34:49   INFO  epoch: 8/72, acc_iter=33096, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:30, time_cost(all): 7:39:34/2 days, 7:49:10, loss=0.561999988339991, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=2.6998974114904155, lr=0.4879431951626903
2023-12-05 18:35:31   INFO  epoch: 8/72, acc_iter=33146, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:37, time_cost(all): 7:40:16/2 days, 10:11:45, loss=0.56194079091405, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=4.149044698694064, lr=0.4878292280873225
2023-12-05 18:36:13   INFO  epoch: 8/72, acc_iter=33196, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:31, time_cost(all): 7:40:58/2 days, 9:44:31, loss=0.561881593488109, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=2.3534344715475637, lr=0.48771526101195467
2023-12-05 18:36:55   INFO  epoch: 8/72, acc_iter=33246, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:47, time_cost(all): 7:41:40/2 days, 6:30:58, loss=0.561822396062168, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=0.5718847058474206, lr=0.4876012939365869
2023-12-05 18:37:36   INFO  epoch: 8/72, acc_iter=33296, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:22, time_cost(all): 7:42:21/2 days, 11:12:54, loss=0.561763198636227, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=1.5333027178571401, lr=0.4874873268612191
2023-12-05 18:38:18   INFO  epoch: 8/72, acc_iter=33346, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:44, time_cost(all): 7:43:03/2 days, 8:32:12, loss=0.561704001210286, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=3.51257346500739, lr=0.48737335978585133
2023-12-05 18:39:00   INFO  epoch: 8/72, acc_iter=33396, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:40, time_cost(all): 7:43:45/2 days, 8:55:17, loss=0.561644803784345, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.0987543887796356, lr=0.4872593927104835
2023-12-05 18:39:42   INFO  epoch: 8/72, acc_iter=33446, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:36, time_cost(all): 7:44:27/2 days, 7:27:14, loss=0.561585606358404, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=1.9325258265392504, lr=0.4871454256351157
2023-12-05 18:40:24   INFO  epoch: 8/72, acc_iter=33496, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:54, time_cost(all): 7:45:09/2 days, 10:15:16, loss=0.561526408932464, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=4.1113828946838025, lr=0.48703145855974794
2023-12-05 18:41:05   INFO  epoch: 8/72, acc_iter=33546, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:23, time_cost(all): 7:45:50/2 days, 7:35:34, loss=0.561467211506523, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=2.808080482391026, lr=0.4869174914843801
2023-12-05 18:41:47   INFO  epoch: 8/72, acc_iter=33596, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:40, time_cost(all): 7:46:32/2 days, 7:30:20, loss=0.561408014080582, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=2.875338070363738, lr=0.4868035244090123
2023-12-05 18:42:29   INFO  epoch: 8/72, acc_iter=33646, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:25, time_cost(all): 7:47:14/2 days, 9:06:14, loss=0.561348816654641, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=2.8588226170630993, lr=0.48668955733364455
2023-12-05 18:43:11   INFO  epoch: 8/72, acc_iter=33696, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:33, time_cost(all): 7:47:56/2 days, 7:28:39, loss=0.5612896192287, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=4.929606542797434, lr=0.48657559025827674
2023-12-05 18:43:52   INFO  epoch: 8/72, acc_iter=33746, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:03, time_cost(all): 7:48:37/2 days, 5:58:22, loss=0.561230421802759, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=4.361583448473262, lr=0.486461623182909
2023-12-05 18:44:34   INFO  epoch: 8/72, acc_iter=33796, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:14:03, time_cost(all): 7:49:19/2 days, 11:07:44, loss=0.561171224376818, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=3.9457170942215707, lr=0.48634765610754116
2023-12-05 18:45:16   INFO  epoch: 8/72, acc_iter=33846, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:13, time_cost(all): 7:50:01/2 days, 9:25:43, loss=0.561112026950877, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=4.085229002372865, lr=0.48623368903217334
2023-12-05 18:45:58   INFO  epoch: 8/72, acc_iter=33896, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:24, time_cost(all): 7:50:43/2 days, 10:28:53, loss=0.561052829524936, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=2.724274473995805, lr=0.4861197219568056
2023-12-05 18:46:40   INFO  epoch: 8/72, acc_iter=33946, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:58, time_cost(all): 7:51:25/2 days, 9:04:20, loss=0.560993632098995, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=2.265429200127311, lr=0.48600575488143777
2023-12-05 18:47:21   INFO  epoch: 8/72, acc_iter=33996, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:50, time_cost(all): 7:52:06/2 days, 8:19:09, loss=0.560934434673054, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=4.641928153977015, lr=0.48589178780607
2023-12-05 18:48:03   INFO  epoch: 8/72, acc_iter=34046, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:00, time_cost(all): 7:52:48/2 days, 8:05:26, loss=0.560875237247113, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=2.7805739691991525, lr=0.4857778207307022
2023-12-05 18:48:45   INFO  epoch: 8/72, acc_iter=34096, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:49, time_cost(all): 7:53:30/2 days, 6:01:00, loss=0.560816039821172, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=2.621908232213228, lr=0.4856638536553344
2023-12-05 18:49:27   INFO  epoch: 8/72, acc_iter=34146, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:29, time_cost(all): 7:54:12/2 days, 6:49:30, loss=0.560756842395231, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=4.279346467221229, lr=0.4855498865799666
2023-12-05 18:50:08   INFO  epoch: 8/72, acc_iter=34196, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:46, time_cost(all): 7:54:53/2 days, 7:12:45, loss=0.56069764496929, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=1.421649336561089, lr=0.4854359195045988
2023-12-05 18:50:50   INFO  epoch: 8/72, acc_iter=34246, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:12, time_cost(all): 7:55:35/2 days, 6:07:45, loss=0.560638447543349, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=4.476984977298601, lr=0.485321952429231
2023-12-05 18:51:32   INFO  epoch: 8/72, acc_iter=34296, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:32, time_cost(all): 7:56:17/2 days, 8:33:45, loss=0.560579250117408, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=3.893644382227571, lr=0.4852079853538632
2023-12-05 18:52:14   INFO  epoch: 8/72, acc_iter=34346, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:39, time_cost(all): 7:56:59/2 days, 7:03:30, loss=0.560520052691468, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=3.014260339123729, lr=0.4850940182784954
2023-12-05 18:52:56   INFO  epoch: 8/72, acc_iter=34396, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:54, time_cost(all): 7:57:41/2 days, 6:30:04, loss=0.560460855265527, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=2.3325974671266, lr=0.48498005120312765
2023-12-05 18:53:37   INFO  epoch: 8/72, acc_iter=34446, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:22, time_cost(all): 7:58:22/2 days, 9:33:19, loss=0.560401657839586, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=3.7514617929247516, lr=0.48486608412775983
2023-12-05 18:54:19   INFO  epoch: 8/72, acc_iter=34496, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:37, time_cost(all): 7:59:04/2 days, 7:28:54, loss=0.560342460413645, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.211035439328976, lr=0.484752117052392
2023-12-05 18:55:01   INFO  epoch: 8/72, acc_iter=34546, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:55, time_cost(all): 7:59:46/2 days, 10:52:20, loss=0.560283262987704, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=2.424064358797829, lr=0.48463814997702426
2023-12-05 18:55:43   INFO  epoch: 8/72, acc_iter=34596, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:21, time_cost(all): 8:00:28/2 days, 6:36:34, loss=0.560224065561763, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=1.4071151377799818, lr=0.48452418290165644
2023-12-05 18:56:24   INFO  epoch: 8/72, acc_iter=34646, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:37, time_cost(all): 8:01:09/2 days, 9:05:56, loss=0.560164868135822, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=3.875753357405711, lr=0.4844102158262886
2023-12-05 18:57:06   INFO  epoch: 8/72, acc_iter=34696, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 8:01:51/2 days, 7:23:00, loss=0.560105670709881, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=3.028258997884143, lr=0.48429624875092087
2023-12-05 18:57:48   INFO  epoch: 8/72, acc_iter=34746, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 8:02:33/2 days, 7:29:14, loss=0.56004647328394, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=3.047389827086125, lr=0.48418228167555305
2023-12-05 18:58:30   INFO  epoch: 9/72, acc_iter=34808, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:49, time_cost(all): 8:03:15/2 days, 10:03:16, loss=0.559973068475773, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=2.442390939188311, lr=0.484040962502097
2023-12-05 18:59:12   INFO  epoch: 9/72, acc_iter=34858, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:12, time_cost(all): 8:03:57/2 days, 6:20:13, loss=0.559913871049832, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=2.634104035197963, lr=0.48392699542672923
2023-12-05 18:59:53   INFO  epoch: 9/72, acc_iter=34908, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:35, time_cost(all): 8:04:38/2 days, 6:35:50, loss=0.559854673623891, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=0.7871713242474965, lr=0.4838130283513614
2023-12-05 19:00:35   INFO  epoch: 9/72, acc_iter=34958, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:39, time_cost(all): 8:05:20/2 days, 7:14:12, loss=0.55979547619795, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=2.589750496653443, lr=0.4836990612759936
2023-12-05 19:01:17   INFO  epoch: 9/72, acc_iter=35008, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:02, time_cost(all): 8:06:02/2 days, 6:53:50, loss=0.559736278772009, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=2.0410472443174927, lr=0.48358509420062584
2023-12-05 19:01:59   INFO  epoch: 9/72, acc_iter=35058, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:35, time_cost(all): 8:06:44/2 days, 10:20:18, loss=0.559677081346069, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=4.080928407603734, lr=0.483471127125258
2023-12-05 19:02:40   INFO  epoch: 9/72, acc_iter=35108, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:10, time_cost(all): 8:07:25/2 days, 7:36:42, loss=0.559617883920128, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=4.258382864228134, lr=0.4833571600498902
2023-12-05 19:03:22   INFO  epoch: 9/72, acc_iter=35158, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:32, time_cost(all): 8:08:07/2 days, 8:31:23, loss=0.559558686494187, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=1.3884591179262, lr=0.48324319297452245
2023-12-05 19:04:04   INFO  epoch: 9/72, acc_iter=35208, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:20, time_cost(all): 8:08:49/2 days, 7:50:01, loss=0.559499489068246, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=0.657737839854137, lr=0.48312922589915464
2023-12-05 19:04:46   INFO  epoch: 9/72, acc_iter=35258, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:40, time_cost(all): 8:09:31/2 days, 9:16:20, loss=0.559440291642305, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=3.1250521469382995, lr=0.4830152588237868
2023-12-05 19:05:28   INFO  epoch: 9/72, acc_iter=35308, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:03, time_cost(all): 8:10:13/2 days, 9:12:21, loss=0.559381094216364, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=1.649408297126349, lr=0.48290129174841906
2023-12-05 19:06:09   INFO  epoch: 9/72, acc_iter=35358, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:49, time_cost(all): 8:10:54/2 days, 7:59:26, loss=0.559321896790423, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=4.503163876300964, lr=0.48278732467305124
2023-12-05 19:06:51   INFO  epoch: 9/72, acc_iter=35408, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:19, time_cost(all): 8:11:36/2 days, 7:12:24, loss=0.559262699364482, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=3.7604832081373014, lr=0.4826733575976835
2023-12-05 19:07:33   INFO  epoch: 9/72, acc_iter=35458, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:49, time_cost(all): 8:12:18/2 days, 7:44:05, loss=0.559203501938541, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=0.99655462970243, lr=0.48255939052231567
2023-12-05 19:08:15   INFO  epoch: 9/72, acc_iter=35508, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:24, time_cost(all): 8:13:00/2 days, 6:41:45, loss=0.5591443045126, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=1.3823378804622277, lr=0.48244542344694785
2023-12-05 19:08:56   INFO  epoch: 9/72, acc_iter=35558, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:15, time_cost(all): 8:13:41/2 days, 8:49:56, loss=0.559085107086659, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=4.972165810416019, lr=0.4823314563715801
2023-12-05 19:09:38   INFO  epoch: 9/72, acc_iter=35608, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:37, time_cost(all): 8:14:23/2 days, 9:49:09, loss=0.559025909660718, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=2.5170504874223463, lr=0.4822174892962123
2023-12-05 19:10:20   INFO  epoch: 9/72, acc_iter=35658, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:04, time_cost(all): 8:15:05/2 days, 5:53:07, loss=0.558966712234777, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=1.2315301286100149, lr=0.4821035222208445
2023-12-05 19:11:02   INFO  epoch: 9/72, acc_iter=35708, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:58, time_cost(all): 8:15:47/2 days, 6:36:35, loss=0.558907514808836, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=0.7845856356408654, lr=0.4819895551454767
2023-12-05 19:11:44   INFO  epoch: 9/72, acc_iter=35758, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:26, time_cost(all): 8:16:29/2 days, 5:55:31, loss=0.558848317382895, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=3.812595285057161, lr=0.4818755880701089
2023-12-05 19:12:25   INFO  epoch: 9/72, acc_iter=35808, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:48, time_cost(all): 8:17:10/2 days, 10:41:14, loss=0.558789119956954, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=2.2256837526433886, lr=0.4817616209947411
2023-12-05 19:13:07   INFO  epoch: 9/72, acc_iter=35858, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:44, time_cost(all): 8:17:52/2 days, 8:55:00, loss=0.558729922531013, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=3.6845812884106404, lr=0.4816476539193733
2023-12-05 19:13:49   INFO  epoch: 9/72, acc_iter=35908, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:15, time_cost(all): 8:18:34/2 days, 8:56:09, loss=0.558670725105072, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=1.2750609316844213, lr=0.48153368684400555
2023-12-05 19:14:31   INFO  epoch: 9/72, acc_iter=35958, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:59, time_cost(all): 8:19:16/2 days, 8:46:06, loss=0.558611527679132, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=2.8774297934700663, lr=0.48141971976863773
2023-12-05 19:15:13   INFO  epoch: 9/72, acc_iter=36008, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:48, time_cost(all): 8:19:58/2 days, 10:05:52, loss=0.558552330253191, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=1.6743839106966327, lr=0.4813057526932699
2023-12-05 19:15:54   INFO  epoch: 9/72, acc_iter=36058, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:55, time_cost(all): 8:20:39/2 days, 7:55:14, loss=0.55849313282725, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=4.690875346628963, lr=0.48119178561790216
2023-12-05 19:16:36   INFO  epoch: 9/72, acc_iter=36108, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:31, time_cost(all): 8:21:21/2 days, 9:10:55, loss=0.558433935401309, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=4.258063660051175, lr=0.48107781854253434
2023-12-05 19:17:18   INFO  epoch: 9/72, acc_iter=36158, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:21, time_cost(all): 8:22:03/2 days, 6:02:33, loss=0.558374737975368, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.93(1.03), norm=3.4924577155507666, lr=0.4809638514671665
2023-12-05 19:18:00   INFO  epoch: 9/72, acc_iter=36208, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:31:56, time_cost(all): 8:22:45/2 days, 9:36:18, loss=0.558315540549427, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=1.7101492800930287, lr=0.48084988439179877
2023-12-05 19:18:41   INFO  epoch: 9/72, acc_iter=36258, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:36, time_cost(all): 8:23:26/2 days, 6:22:38, loss=0.558256343123486, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=4.245088744073278, lr=0.48073591731643095
2023-12-05 19:19:23   INFO  epoch: 9/72, acc_iter=36308, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:35, time_cost(all): 8:24:08/2 days, 7:25:49, loss=0.558197145697545, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=1.4213332863122154, lr=0.48062195024106313
2023-12-05 19:20:05   INFO  epoch: 9/72, acc_iter=36358, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:33:03, time_cost(all): 8:24:50/2 days, 9:14:18, loss=0.558137948271604, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=0.8744483814345884, lr=0.4805079831656954
2023-12-05 19:20:47   INFO  epoch: 9/72, acc_iter=36408, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:59, time_cost(all): 8:25:32/2 days, 9:35:14, loss=0.558078750845663, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=0.8202790607355921, lr=0.48039401609032756
2023-12-05 19:21:29   INFO  epoch: 9/72, acc_iter=36458, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:01, time_cost(all): 8:26:14/2 days, 10:38:54, loss=0.558019553419722, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=3.001463793291013, lr=0.4802800490149598
2023-12-05 19:22:10   INFO  epoch: 9/72, acc_iter=36508, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:35, time_cost(all): 8:26:55/2 days, 6:24:21, loss=0.557960355993781, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=4.790154490529316, lr=0.480166081939592
2023-12-05 19:22:52   INFO  epoch: 9/72, acc_iter=36558, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:42, time_cost(all): 8:27:37/2 days, 10:43:23, loss=0.55790115856784, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=3.9604105455363894, lr=0.48005211486422417
2023-12-05 19:23:34   INFO  epoch: 9/72, acc_iter=36608, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:15, time_cost(all): 8:28:19/2 days, 9:43:18, loss=0.557841961141899, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=2.6003812364518386, lr=0.4799381477888564
2023-12-05 19:24:16   INFO  epoch: 9/72, acc_iter=36658, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:55, time_cost(all): 8:29:01/2 days, 9:52:37, loss=0.557782763715958, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=2.341931155984006, lr=0.4798241807134886
2023-12-05 19:24:57   INFO  epoch: 9/72, acc_iter=36708, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:49, time_cost(all): 8:29:42/2 days, 9:28:04, loss=0.557723566290017, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=2.117308988449936, lr=0.47971021363812083
2023-12-05 19:25:39   INFO  epoch: 9/72, acc_iter=36758, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:07, time_cost(all): 8:30:24/2 days, 9:55:18, loss=0.557664368864077, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=1.5883729584171011, lr=0.479596246562753
2023-12-05 19:26:21   INFO  epoch: 9/72, acc_iter=36808, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:33, time_cost(all): 8:31:06/2 days, 5:52:39, loss=0.557605171438136, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=0.5766994244529944, lr=0.4794822794873852
2023-12-05 19:27:03   INFO  epoch: 9/72, acc_iter=36858, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:27, time_cost(all): 8:31:48/2 days, 6:34:41, loss=0.557545974012195, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=1.1994171822071866, lr=0.47936831241201744
2023-12-05 19:27:45   INFO  epoch: 9/72, acc_iter=36908, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:49, time_cost(all): 8:32:30/2 days, 9:59:13, loss=0.557486776586254, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=0.5129327458311033, lr=0.4792543453366496
2023-12-05 19:28:26   INFO  epoch: 9/72, acc_iter=36958, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:07, time_cost(all): 8:33:11/2 days, 5:36:09, loss=0.557427579160313, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=1.7408295559779479, lr=0.47914037826128186
2023-12-05 19:29:08   INFO  epoch: 9/72, acc_iter=37008, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:36, time_cost(all): 8:33:53/2 days, 10:41:48, loss=0.557368381734372, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=3.2795751837352958, lr=0.47902641118591405
2023-12-05 19:29:50   INFO  epoch: 9/72, acc_iter=37058, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:04, time_cost(all): 8:34:35/2 days, 7:58:21, loss=0.557309184308431, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=2.088720548854323, lr=0.47891244411054623
2023-12-05 19:30:32   INFO  epoch: 9/72, acc_iter=37108, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:34, time_cost(all): 8:35:17/2 days, 8:19:53, loss=0.55724998688249, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=0.6443068961476215, lr=0.47879847703517847
2023-12-05 19:31:13   INFO  epoch: 9/72, acc_iter=37158, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:00, time_cost(all): 8:35:58/2 days, 10:02:01, loss=0.557190789456549, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=2.9256107221219363, lr=0.47868450995981066
2023-12-05 19:31:55   INFO  epoch: 9/72, acc_iter=37208, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:01, time_cost(all): 8:36:40/2 days, 7:35:58, loss=0.557131592030608, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=2.656599258968351, lr=0.47857054288444284
2023-12-05 19:32:37   INFO  epoch: 9/72, acc_iter=37258, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:08, time_cost(all): 8:37:22/2 days, 7:18:26, loss=0.557072394604667, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=3.1874695152686057, lr=0.4784565758090751
2023-12-05 19:33:19   INFO  epoch: 9/72, acc_iter=37308, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:16, time_cost(all): 8:38:04/2 days, 10:15:54, loss=0.557013197178726, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.6532815962149052, lr=0.47834260873370726
2023-12-05 19:34:01   INFO  epoch: 9/72, acc_iter=37358, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:20, time_cost(all): 8:38:46/2 days, 8:10:46, loss=0.556953999752785, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=4.191224504498775, lr=0.4782286416583395
2023-12-05 19:34:42   INFO  epoch: 9/72, acc_iter=37408, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:58, time_cost(all): 8:39:27/2 days, 9:13:45, loss=0.556894802326844, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=4.651079216172728, lr=0.4781146745829717
2023-12-05 19:35:24   INFO  epoch: 9/72, acc_iter=37458, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:23, time_cost(all): 8:40:09/2 days, 5:15:06, loss=0.556835604900903, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=4.76582249758572, lr=0.4780007075076039
2023-12-05 19:36:06   INFO  epoch: 9/72, acc_iter=37508, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:33, time_cost(all): 8:40:51/2 days, 10:07:20, loss=0.556776407474962, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=2.72163038633985, lr=0.4778867404322361
2023-12-05 19:36:48   INFO  epoch: 9/72, acc_iter=37558, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:05, time_cost(all): 8:41:33/2 days, 7:20:32, loss=0.556717210049021, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=3.901529850107896, lr=0.4777727733568683
2023-12-05 19:37:29   INFO  epoch: 9/72, acc_iter=37608, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:49, time_cost(all): 8:42:14/2 days, 9:06:38, loss=0.55665801262308, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=0.641066515837347, lr=0.4776588062815005
2023-12-05 19:38:11   INFO  epoch: 9/72, acc_iter=37658, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:50, time_cost(all): 8:42:56/2 days, 10:15:04, loss=0.55659881519714, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=3.011324905568319, lr=0.4775448392061327
2023-12-05 19:38:53   INFO  epoch: 9/72, acc_iter=37708, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:13, time_cost(all): 8:43:38/2 days, 5:29:03, loss=0.556539617771199, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=3.0106971936394626, lr=0.4774308721307649
2023-12-05 19:39:35   INFO  epoch: 9/72, acc_iter=37758, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:07, time_cost(all): 8:44:20/2 days, 6:07:57, loss=0.556480420345258, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=1.5914916317492, lr=0.47731690505539714
2023-12-05 19:40:17   INFO  epoch: 9/72, acc_iter=37808, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:48, time_cost(all): 8:45:02/2 days, 7:50:25, loss=0.556421222919317, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=3.193276929437544, lr=0.47720293798002933
2023-12-05 19:40:58   INFO  epoch: 9/72, acc_iter=37858, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:38, time_cost(all): 8:45:43/2 days, 7:10:39, loss=0.556362025493376, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=4.4023314002085705, lr=0.4770889709046615
2023-12-05 19:41:40   INFO  epoch: 9/72, acc_iter=37908, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:40, time_cost(all): 8:46:25/2 days, 6:31:47, loss=0.556302828067435, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=2.124106985501083, lr=0.47697500382929375
2023-12-05 19:42:22   INFO  epoch: 9/72, acc_iter=37958, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:59, time_cost(all): 8:47:07/2 days, 8:34:16, loss=0.556243630641494, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=2.430366631326963, lr=0.47686103675392594
2023-12-05 19:43:04   INFO  epoch: 9/72, acc_iter=38008, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:20, time_cost(all): 8:47:49/2 days, 7:21:52, loss=0.556184433215553, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=4.5914395180183565, lr=0.4767470696785582
2023-12-05 19:43:45   INFO  epoch: 9/72, acc_iter=38058, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:00, time_cost(all): 8:48:30/2 days, 8:03:39, loss=0.556125235789612, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=3.465890380468799, lr=0.47663310260319036
2023-12-05 19:44:27   INFO  epoch: 9/72, acc_iter=38108, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:47, time_cost(all): 8:49:12/2 days, 7:28:02, loss=0.556066038363671, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.01(1.03), norm=2.5277715056974444, lr=0.47651913552782255
2023-12-05 19:45:09   INFO  epoch: 9/72, acc_iter=38158, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:40, time_cost(all): 8:49:54/2 days, 5:58:24, loss=0.55600684093773, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=3.467045077097497, lr=0.4764051684524548
2023-12-05 19:45:51   INFO  epoch: 9/72, acc_iter=38208, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:27, time_cost(all): 8:50:36/2 days, 7:44:50, loss=0.555947643511789, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=0.9571477598816449, lr=0.47629120137708697
2023-12-05 19:46:33   INFO  epoch: 9/72, acc_iter=38258, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:11, time_cost(all): 8:51:18/2 days, 8:41:23, loss=0.555888446085848, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=2.9744880352847463, lr=0.47617723430171915
2023-12-05 19:47:14   INFO  epoch: 9/72, acc_iter=38308, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:27, time_cost(all): 8:51:59/2 days, 6:28:35, loss=0.555829248659907, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=2.62465978439892, lr=0.4760632672263514
2023-12-05 19:47:56   INFO  epoch: 9/72, acc_iter=38358, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:37, time_cost(all): 8:52:41/2 days, 7:55:27, loss=0.555770051233966, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=3.2053638927221613, lr=0.4759493001509836
2023-12-05 19:48:38   INFO  epoch: 9/72, acc_iter=38408, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:03, time_cost(all): 8:53:23/2 days, 5:42:47, loss=0.555710853808025, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=1.7391457549112435, lr=0.4758353330756158
2023-12-05 19:49:20   INFO  epoch: 9/72, acc_iter=38458, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:11, time_cost(all): 8:54:05/2 days, 6:55:38, loss=0.555651656382084, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=3.6128635562538416, lr=0.475721366000248
2023-12-05 19:50:02   INFO  epoch: 9/72, acc_iter=38508, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:33, time_cost(all): 8:54:47/2 days, 7:35:54, loss=0.555592458956144, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=1.3855260364280937, lr=0.4756073989248802
2023-12-05 19:50:43   INFO  epoch: 9/72, acc_iter=38558, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:49, time_cost(all): 8:55:28/2 days, 5:17:32, loss=0.555533261530203, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=1.7880749329553318, lr=0.4754934318495124
2023-12-05 19:51:25   INFO  epoch: 9/72, acc_iter=38608, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 8:56:10/2 days, 9:06:03, loss=0.555474064104262, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=1.6910513053164136, lr=0.4753794647741446
2023-12-05 19:52:07   INFO  epoch: 10/72, acc_iter=38670, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:06, time_cost(all): 8:56:52/2 days, 8:31:22, loss=0.555400659296095, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=3.223707675620825, lr=0.47523814560068856
2023-12-05 19:52:49   INFO  epoch: 10/72, acc_iter=38720, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:44, time_cost(all): 8:57:34/2 days, 5:04:34, loss=0.555341461870154, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=4.1938041559529715, lr=0.47512417852532074
2023-12-05 19:53:30   INFO  epoch: 10/72, acc_iter=38770, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:54:08, time_cost(all): 8:58:15/2 days, 9:55:35, loss=0.555282264444213, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=2.719481039278281, lr=0.475010211449953
2023-12-05 19:54:12   INFO  epoch: 10/72, acc_iter=38820, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:47, time_cost(all): 8:58:57/2 days, 9:22:43, loss=0.555223067018272, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=3.583822865714115, lr=0.47489624437458516
2023-12-05 19:54:54   INFO  epoch: 10/72, acc_iter=38870, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:54, time_cost(all): 8:59:39/2 days, 5:03:59, loss=0.555163869592331, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=1.4320354568700036, lr=0.47478227729921735
2023-12-05 19:55:36   INFO  epoch: 10/72, acc_iter=38920, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:48, time_cost(all): 9:00:21/2 days, 9:01:48, loss=0.55510467216639, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=1.0875585457443893, lr=0.4746683102238496
2023-12-05 19:56:18   INFO  epoch: 10/72, acc_iter=38970, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:08, time_cost(all): 9:01:03/2 days, 9:45:50, loss=0.555045474740449, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=3.3517132935407963, lr=0.47455434314848177
2023-12-05 19:56:59   INFO  epoch: 10/72, acc_iter=39020, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:41, time_cost(all): 9:01:44/2 days, 8:22:56, loss=0.554986277314508, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.83(1.03), norm=1.5033635345310767, lr=0.474440376073114
2023-12-05 19:57:41   INFO  epoch: 10/72, acc_iter=39070, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:25, time_cost(all): 9:02:26/2 days, 5:19:44, loss=0.554927079888567, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=3.723841109015929, lr=0.4743264089977462
2023-12-05 19:58:23   INFO  epoch: 10/72, acc_iter=39120, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:49:00, time_cost(all): 9:03:08/2 days, 7:40:17, loss=0.554867882462626, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=2.8427176344730016, lr=0.4742124419223784
2023-12-05 19:59:05   INFO  epoch: 10/72, acc_iter=39170, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:29, time_cost(all): 9:03:50/2 days, 10:00:35, loss=0.554808685036685, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=2.3175993432630597, lr=0.4740984748470106
2023-12-05 19:59:46   INFO  epoch: 10/72, acc_iter=39220, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:39, time_cost(all): 9:04:31/2 days, 5:53:02, loss=0.554749487610745, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=4.793045743857231, lr=0.4739845077716428
2023-12-05 20:00:28   INFO  epoch: 10/72, acc_iter=39270, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:33, time_cost(all): 9:05:13/2 days, 10:11:42, loss=0.554690290184804, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=4.943640445732723, lr=0.473870540696275
2023-12-05 20:01:10   INFO  epoch: 10/72, acc_iter=39320, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:41:51, time_cost(all): 9:05:55/2 days, 5:23:45, loss=0.554631092758863, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.9529394193958263, lr=0.47375657362090723
2023-12-05 20:01:52   INFO  epoch: 10/72, acc_iter=39370, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:17, time_cost(all): 9:06:37/2 days, 7:21:35, loss=0.554571895332922, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=2.5375233095389733, lr=0.4736426065455394
2023-12-05 20:02:34   INFO  epoch: 10/72, acc_iter=39420, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:59, time_cost(all): 9:07:19/2 days, 6:35:08, loss=0.554512697906981, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=4.935758532810315, lr=0.47352863947017165
2023-12-05 20:03:15   INFO  epoch: 10/72, acc_iter=39470, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:43, time_cost(all): 9:08:00/2 days, 9:30:32, loss=0.55445350048104, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=4.243969893308286, lr=0.47341467239480384
2023-12-05 20:03:57   INFO  epoch: 10/72, acc_iter=39520, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:40, time_cost(all): 9:08:42/2 days, 5:04:04, loss=0.554394303055099, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=0.6006279152552789, lr=0.473300705319436
2023-12-05 20:04:39   INFO  epoch: 10/72, acc_iter=39570, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:29, time_cost(all): 9:09:24/2 days, 7:22:54, loss=0.554335105629158, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=1.2069922685803989, lr=0.47318673824406826
2023-12-05 20:05:21   INFO  epoch: 10/72, acc_iter=39620, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:31, time_cost(all): 9:10:06/2 days, 9:37:38, loss=0.554275908203217, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=4.87665543905823, lr=0.47307277116870045
2023-12-05 20:06:02   INFO  epoch: 10/72, acc_iter=39670, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:24, time_cost(all): 9:10:47/2 days, 7:40:22, loss=0.554216710777276, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=3.4372376671268423, lr=0.4729588040933327
2023-12-05 20:06:44   INFO  epoch: 10/72, acc_iter=39720, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:36, time_cost(all): 9:11:29/2 days, 9:04:09, loss=0.554157513351335, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=4.7747640808779455, lr=0.47284483701796487
2023-12-05 20:07:26   INFO  epoch: 10/72, acc_iter=39770, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:59, time_cost(all): 9:12:11/2 days, 8:23:10, loss=0.554098315925394, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=1.366646352478317, lr=0.47273086994259705
2023-12-05 20:08:08   INFO  epoch: 10/72, acc_iter=39820, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:35, time_cost(all): 9:12:53/2 days, 5:10:10, loss=0.554039118499453, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=4.8751278475079465, lr=0.4726169028672293
2023-12-05 20:08:50   INFO  epoch: 10/72, acc_iter=39870, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:53, time_cost(all): 9:13:35/2 days, 7:21:36, loss=0.553979921073512, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=4.368632050854018, lr=0.4725029357918615
2023-12-05 20:09:31   INFO  epoch: 10/72, acc_iter=39920, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:15, time_cost(all): 9:14:16/2 days, 6:29:43, loss=0.553920723647571, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=4.276249997543863, lr=0.47238896871649366
2023-12-05 20:10:13   INFO  epoch: 10/72, acc_iter=39970, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:36, time_cost(all): 9:14:58/2 days, 8:15:38, loss=0.55386152622163, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.07(1.03), norm=4.962617111046651, lr=0.4722750016411259
2023-12-05 20:10:55   INFO  epoch: 10/72, acc_iter=40020, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:19, time_cost(all): 9:15:40/2 days, 4:45:09, loss=0.553802328795689, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=2.4364866189118572, lr=0.4721610345657581
2023-12-05 20:11:37   INFO  epoch: 10/72, acc_iter=40070, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:40, time_cost(all): 9:16:22/2 days, 8:52:35, loss=0.553743131369749, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=1.040598853506093, lr=0.4720470674903903
2023-12-05 20:12:18   INFO  epoch: 10/72, acc_iter=40120, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:13, time_cost(all): 9:17:03/2 days, 4:55:57, loss=0.553683933943808, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=3.7321138047308335, lr=0.4719331004150225
2023-12-05 20:13:00   INFO  epoch: 10/72, acc_iter=40170, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:36, time_cost(all): 9:17:45/2 days, 8:13:46, loss=0.553624736517867, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=4.762713635945311, lr=0.4718191333396547
2023-12-05 20:13:42   INFO  epoch: 10/72, acc_iter=40220, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:30, time_cost(all): 9:18:27/2 days, 9:45:04, loss=0.553565539091926, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=2.97309985335787, lr=0.47170516626428693
2023-12-05 20:14:24   INFO  epoch: 10/72, acc_iter=40270, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:10, time_cost(all): 9:19:09/2 days, 7:14:36, loss=0.553506341665985, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=0.7783957487091444, lr=0.4715911991889191
2023-12-05 20:15:06   INFO  epoch: 10/72, acc_iter=40320, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:25, time_cost(all): 9:19:51/2 days, 6:37:10, loss=0.553447144240044, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=0.7332773988233134, lr=0.4714772321135513
2023-12-05 20:15:47   INFO  epoch: 10/72, acc_iter=40370, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:15, time_cost(all): 9:20:32/2 days, 4:56:31, loss=0.553387946814103, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=2.3204263650020884, lr=0.47136326503818354
2023-12-05 20:16:29   INFO  epoch: 10/72, acc_iter=40420, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:31, time_cost(all): 9:21:14/2 days, 7:43:39, loss=0.553328749388162, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=0.7631586126021178, lr=0.4712492979628157
2023-12-05 20:17:11   INFO  epoch: 10/72, acc_iter=40470, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:11, time_cost(all): 9:21:56/2 days, 8:49:07, loss=0.553269551962221, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=3.162704903634009, lr=0.47113533088744797
2023-12-05 20:17:53   INFO  epoch: 10/72, acc_iter=40520, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:05, time_cost(all): 9:22:38/2 days, 8:32:52, loss=0.55321035453628, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=4.271022741352806, lr=0.47102136381208015
2023-12-05 20:18:34   INFO  epoch: 10/72, acc_iter=40570, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:37, time_cost(all): 9:23:19/2 days, 7:48:55, loss=0.553151157110339, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=4.255945517497455, lr=0.47090739673671234
2023-12-05 20:19:16   INFO  epoch: 10/72, acc_iter=40620, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:53, time_cost(all): 9:24:01/2 days, 7:26:35, loss=0.553091959684398, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=2.0167332352865843, lr=0.4707934296613446
2023-12-05 20:19:58   INFO  epoch: 10/72, acc_iter=40670, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:12, time_cost(all): 9:24:43/2 days, 8:12:57, loss=0.553032762258457, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=1.7138503706079249, lr=0.47067946258597676
2023-12-05 20:20:40   INFO  epoch: 10/72, acc_iter=40720, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:36, time_cost(all): 9:25:25/2 days, 8:07:34, loss=0.552973564832516, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=4.641824380007367, lr=0.470565495510609
2023-12-05 20:21:22   INFO  epoch: 10/72, acc_iter=40770, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:49, time_cost(all): 9:26:07/2 days, 9:12:30, loss=0.552914367406575, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=2.5866868777492225, lr=0.4704515284352412
2023-12-05 20:22:03   INFO  epoch: 10/72, acc_iter=40820, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:46, time_cost(all): 9:26:48/2 days, 9:03:08, loss=0.552855169980634, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=2.65839186010205, lr=0.47033756135987337
2023-12-05 20:22:45   INFO  epoch: 10/72, acc_iter=40870, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:48, time_cost(all): 9:27:30/2 days, 5:33:27, loss=0.552795972554693, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=3.7049371913149347, lr=0.4702235942845056
2023-12-05 20:23:27   INFO  epoch: 10/72, acc_iter=40920, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:06, time_cost(all): 9:28:12/2 days, 4:49:47, loss=0.552736775128753, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=3.497861217565703, lr=0.4701096272091378
2023-12-05 20:24:09   INFO  epoch: 10/72, acc_iter=40970, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:03, time_cost(all): 9:28:54/2 days, 9:23:29, loss=0.552677577702812, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=1.098988870057287, lr=0.46999566013377
2023-12-05 20:24:50   INFO  epoch: 10/72, acc_iter=41020, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:16, time_cost(all): 9:29:35/2 days, 8:58:36, loss=0.552618380276871, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=4.213257848750781, lr=0.4698816930584022
2023-12-05 20:25:32   INFO  epoch: 10/72, acc_iter=41070, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:31, time_cost(all): 9:30:17/2 days, 5:33:38, loss=0.55255918285093, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=3.4043787691433227, lr=0.4697677259830344
2023-12-05 20:26:14   INFO  epoch: 10/72, acc_iter=41120, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:50, time_cost(all): 9:30:59/2 days, 4:54:35, loss=0.552499985424989, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=3.967287724183098, lr=0.46965375890766664
2023-12-05 20:26:56   INFO  epoch: 10/72, acc_iter=41170, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:19:08, time_cost(all): 9:31:41/2 days, 9:23:58, loss=0.552440787999048, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=0.5839567463217397, lr=0.4695397918322988
2023-12-05 20:27:38   INFO  epoch: 10/72, acc_iter=41220, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:45, time_cost(all): 9:32:23/2 days, 5:44:51, loss=0.552381590573107, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=4.043840938364931, lr=0.469425824756931
2023-12-05 20:28:19   INFO  epoch: 10/72, acc_iter=41270, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:14, time_cost(all): 9:33:04/2 days, 9:19:20, loss=0.552322393147166, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=1.972142039551638, lr=0.46931185768156325
2023-12-05 20:29:01   INFO  epoch: 10/72, acc_iter=41320, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:26, time_cost(all): 9:33:46/2 days, 4:18:54, loss=0.552263195721225, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=2.6361150820255097, lr=0.46919789060619543
2023-12-05 20:29:43   INFO  epoch: 10/72, acc_iter=41370, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:47, time_cost(all): 9:34:28/2 days, 9:34:16, loss=0.552203998295284, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.83(1.03), norm=1.199778063095265, lr=0.4690839235308276
2023-12-05 20:30:25   INFO  epoch: 10/72, acc_iter=41420, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:44, time_cost(all): 9:35:10/2 days, 6:54:40, loss=0.552144800869343, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=3.2113555370886684, lr=0.46896995645545986
2023-12-05 20:31:07   INFO  epoch: 10/72, acc_iter=41470, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:43, time_cost(all): 9:35:52/2 days, 7:19:48, loss=0.552085603443402, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=3.32360243695551, lr=0.46885598938009204
2023-12-05 20:31:48   INFO  epoch: 10/72, acc_iter=41520, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:42, time_cost(all): 9:36:33/2 days, 6:49:21, loss=0.552026406017461, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=2.212064622145082, lr=0.4687420223047243
2023-12-05 20:32:30   INFO  epoch: 10/72, acc_iter=41570, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:41, time_cost(all): 9:37:15/2 days, 5:54:54, loss=0.55196720859152, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=4.391415208871155, lr=0.46862805522935647
2023-12-05 20:33:12   INFO  epoch: 10/72, acc_iter=41620, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:01, time_cost(all): 9:37:57/2 days, 7:47:02, loss=0.551908011165579, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=3.3150777167131165, lr=0.46851408815398865
2023-12-05 20:33:54   INFO  epoch: 10/72, acc_iter=41670, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:09, time_cost(all): 9:38:39/2 days, 8:14:01, loss=0.551848813739638, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=2.238539826440848, lr=0.4684001210786209
2023-12-05 20:34:35   INFO  epoch: 10/72, acc_iter=41720, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:11:03, time_cost(all): 9:39:20/2 days, 7:52:50, loss=0.551789616313697, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=3.763019117044097, lr=0.4682861540032531
2023-12-05 20:35:17   INFO  epoch: 10/72, acc_iter=41770, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:06, time_cost(all): 9:40:02/2 days, 5:13:32, loss=0.551730418887757, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=4.1041461668181105, lr=0.4681721869278853
2023-12-05 20:35:59   INFO  epoch: 10/72, acc_iter=41820, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:11, time_cost(all): 9:40:44/2 days, 9:24:24, loss=0.551671221461816, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=1.0294525348621484, lr=0.4680582198525175
2023-12-05 20:36:41   INFO  epoch: 10/72, acc_iter=41870, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:10, time_cost(all): 9:41:26/2 days, 7:37:50, loss=0.551612024035875, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=4.645149488382109, lr=0.4679442527771497
2023-12-05 20:37:23   INFO  epoch: 10/72, acc_iter=41920, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:08, time_cost(all): 9:42:08/2 days, 9:34:06, loss=0.551552826609934, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=2.5455059532372997, lr=0.4678302857017819
2023-12-05 20:38:04   INFO  epoch: 10/72, acc_iter=41970, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:58, time_cost(all): 9:42:49/2 days, 4:47:40, loss=0.551493629183993, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=1.2251114184970309, lr=0.4677163186264141
2023-12-05 20:38:46   INFO  epoch: 10/72, acc_iter=42020, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:08, time_cost(all): 9:43:31/2 days, 4:59:53, loss=0.551434431758052, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=3.2330769701251754, lr=0.46760235155104635
2023-12-05 20:39:28   INFO  epoch: 10/72, acc_iter=42070, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:33, time_cost(all): 9:44:13/2 days, 6:33:19, loss=0.551375234332111, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=4.275110207108957, lr=0.46748838447567853
2023-12-05 20:40:10   INFO  epoch: 10/72, acc_iter=42120, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:53, time_cost(all): 9:44:55/2 days, 5:00:42, loss=0.55131603690617, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=0.5357686160895203, lr=0.4673744174003107
2023-12-05 20:40:51   INFO  epoch: 10/72, acc_iter=42170, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:09, time_cost(all): 9:45:36/2 days, 8:39:16, loss=0.551256839480229, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=2.055948795871809, lr=0.46726045032494296
2023-12-05 20:41:33   INFO  epoch: 10/72, acc_iter=42220, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:41, time_cost(all): 9:46:18/2 days, 8:03:47, loss=0.551197642054288, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=4.322392875841894, lr=0.46714648324957514
2023-12-05 20:42:15   INFO  epoch: 10/72, acc_iter=42270, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:48, time_cost(all): 9:47:00/2 days, 8:34:18, loss=0.551138444628347, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=2.2255459894506253, lr=0.4670325161742073
2023-12-05 20:42:57   INFO  epoch: 10/72, acc_iter=42320, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:14, time_cost(all): 9:47:42/2 days, 7:36:04, loss=0.551079247202406, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=2.0547372128803314, lr=0.46691854909883956
2023-12-05 20:43:39   INFO  epoch: 10/72, acc_iter=42370, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 9:48:24/2 days, 5:57:07, loss=0.551020049776465, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=4.560759418384359, lr=0.46680458202347175
2023-12-05 20:44:20   INFO  epoch: 10/72, acc_iter=42420, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 9:49:05/2 days, 4:47:29, loss=0.550960852350524, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=0.666679852388836, lr=0.46669061494810393
2023-12-05 20:45:02   INFO  epoch: 10/72, acc_iter=42470, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 9:49:47/2 days, 9:11:07, loss=0.550901654924583, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=3.834920077281769, lr=0.46657664787273617
2023-12-05 20:45:44   INFO  epoch: 11/72, acc_iter=42532, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:57, time_cost(all): 9:50:29/2 days, 6:24:50, loss=0.550828250116417, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.23(1.03), norm=2.855775346730586, lr=0.4664353286992801
2023-12-05 20:46:26   INFO  epoch: 11/72, acc_iter=42582, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:51, time_cost(all): 9:51:11/2 days, 4:16:45, loss=0.550769052690476, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=0.6110242163237065, lr=0.4663213616239123
2023-12-05 20:47:07   INFO  epoch: 11/72, acc_iter=42632, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:17, time_cost(all): 9:51:52/2 days, 4:09:17, loss=0.550709855264535, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=3.7967196319806416, lr=0.46620739454854454
2023-12-05 20:47:49   INFO  epoch: 11/72, acc_iter=42682, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:21, time_cost(all): 9:52:34/2 days, 7:47:34, loss=0.550650657838594, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=4.873803725051002, lr=0.4660934274731767
2023-12-05 20:48:31   INFO  epoch: 11/72, acc_iter=42732, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:36, time_cost(all): 9:53:16/2 days, 4:33:45, loss=0.550591460412653, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=2.5291587418074006, lr=0.4659794603978089
2023-12-05 20:49:13   INFO  epoch: 11/72, acc_iter=42782, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:18, time_cost(all): 9:53:58/2 days, 8:19:57, loss=0.550532262986712, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=3.607244487460488, lr=0.46586549332244115
2023-12-05 20:49:55   INFO  epoch: 11/72, acc_iter=42832, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:31, time_cost(all): 9:54:40/2 days, 5:03:47, loss=0.550473065560771, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=0.5854707448651115, lr=0.46575152624707333
2023-12-05 20:50:36   INFO  epoch: 11/72, acc_iter=42882, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:45:58, time_cost(all): 9:55:21/2 days, 4:49:17, loss=0.55041386813483, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=3.068181762635853, lr=0.4656375591717055
2023-12-05 20:51:18   INFO  epoch: 11/72, acc_iter=42932, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:09, time_cost(all): 9:56:03/2 days, 8:53:04, loss=0.550354670708889, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=0.5401474830712134, lr=0.46552359209633776
2023-12-05 20:52:00   INFO  epoch: 11/72, acc_iter=42982, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:19, time_cost(all): 9:56:45/2 days, 5:59:01, loss=0.550295473282948, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=3.223281007031736, lr=0.46540962502096994
2023-12-05 20:52:42   INFO  epoch: 11/72, acc_iter=43032, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:15, time_cost(all): 9:57:27/2 days, 5:30:20, loss=0.550236275857007, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=3.129744308428655, lr=0.4652956579456022
2023-12-05 20:53:23   INFO  epoch: 11/72, acc_iter=43082, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:21, time_cost(all): 9:58:08/2 days, 5:23:57, loss=0.550177078431066, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=1.945146914413976, lr=0.46518169087023437
2023-12-05 20:54:05   INFO  epoch: 11/72, acc_iter=43132, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:56, time_cost(all): 9:58:50/2 days, 6:22:29, loss=0.550117881005125, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=2.403324083079643, lr=0.46506772379486655
2023-12-05 20:54:47   INFO  epoch: 11/72, acc_iter=43182, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:46:11, time_cost(all): 9:59:32/2 days, 7:52:20, loss=0.550058683579184, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=0.5474421658296451, lr=0.4649537567194988
2023-12-05 20:55:29   INFO  epoch: 11/72, acc_iter=43232, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:24, time_cost(all): 10:00:14/2 days, 8:51:44, loss=0.549999486153243, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=0.8596884220316413, lr=0.464839789644131
2023-12-05 20:56:11   INFO  epoch: 11/72, acc_iter=43282, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:15, time_cost(all): 10:00:56/2 days, 6:30:19, loss=0.549940288727302, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=3.014615466422346, lr=0.46472582256876316
2023-12-05 20:56:52   INFO  epoch: 11/72, acc_iter=43332, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:23, time_cost(all): 10:01:37/2 days, 4:54:51, loss=0.549881091301362, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=4.1921791166767015, lr=0.4646118554933954
2023-12-05 20:57:34   INFO  epoch: 11/72, acc_iter=43382, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:22, time_cost(all): 10:02:19/2 days, 3:53:40, loss=0.549821893875421, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=2.1638806267501876, lr=0.4644978884180276
2023-12-05 20:58:16   INFO  epoch: 11/72, acc_iter=43432, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:11, time_cost(all): 10:03:01/2 days, 5:24:07, loss=0.54976269644948, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=4.594512467998686, lr=0.4643839213426598
2023-12-05 20:58:58   INFO  epoch: 11/72, acc_iter=43482, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:09, time_cost(all): 10:03:43/2 days, 6:44:11, loss=0.549703499023539, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=4.865903292505997, lr=0.464269954267292
2023-12-05 20:59:39   INFO  epoch: 11/72, acc_iter=43532, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:20, time_cost(all): 10:04:24/2 days, 7:00:42, loss=0.549644301597598, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=2.8767729427223245, lr=0.4641559871919242
2023-12-05 21:00:21   INFO  epoch: 11/72, acc_iter=43582, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:18, time_cost(all): 10:05:06/2 days, 7:57:43, loss=0.549585104171657, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=2.9038227456261083, lr=0.46404202011655643
2023-12-05 21:01:03   INFO  epoch: 11/72, acc_iter=43632, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:24, time_cost(all): 10:05:48/2 days, 8:19:41, loss=0.549525906745716, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=1.2211412865262465, lr=0.4639280530411886
2023-12-05 21:01:45   INFO  epoch: 11/72, acc_iter=43682, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:08, time_cost(all): 10:06:30/2 days, 6:19:57, loss=0.549466709319775, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=0.9455755580374087, lr=0.46381408596582085
2023-12-05 21:02:27   INFO  epoch: 11/72, acc_iter=43732, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:35, time_cost(all): 10:07:12/2 days, 5:09:49, loss=0.549407511893834, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=2.468476978461418, lr=0.46370011889045304
2023-12-05 21:03:08   INFO  epoch: 11/72, acc_iter=43782, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:28, time_cost(all): 10:07:53/2 days, 7:42:24, loss=0.549348314467893, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=2.226436659855496, lr=0.4635861518150852
2023-12-05 21:03:50   INFO  epoch: 11/72, acc_iter=43832, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:27, time_cost(all): 10:08:35/2 days, 6:08:12, loss=0.549289117041952, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=4.9429995375572515, lr=0.46347218473971746
2023-12-05 21:04:32   INFO  epoch: 11/72, acc_iter=43882, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:29, time_cost(all): 10:09:17/2 days, 8:28:18, loss=0.549229919616011, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=2.0917111408678792, lr=0.46335821766434965
2023-12-05 21:05:14   INFO  epoch: 11/72, acc_iter=43932, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:33, time_cost(all): 10:09:59/2 days, 8:44:53, loss=0.54917072219007, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=2.3900338253489073, lr=0.4632442505889819
2023-12-05 21:05:56   INFO  epoch: 11/72, acc_iter=43982, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:51, time_cost(all): 10:10:41/2 days, 4:01:50, loss=0.549111524764129, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=4.844397773801898, lr=0.46313028351361407
2023-12-05 21:06:37   INFO  epoch: 11/72, acc_iter=44032, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:28, time_cost(all): 10:11:22/2 days, 6:04:28, loss=0.549052327338188, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=1.3246900803689914, lr=0.46301631643824626
2023-12-05 21:07:19   INFO  epoch: 11/72, acc_iter=44082, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:10, time_cost(all): 10:12:04/2 days, 5:37:08, loss=0.548993129912247, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=4.691299027597861, lr=0.46290234936287844
2023-12-05 21:08:01   INFO  epoch: 11/72, acc_iter=44132, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:24, time_cost(all): 10:12:46/2 days, 3:46:14, loss=0.548933932486307, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=4.237548738862937, lr=0.4627883822875107
2023-12-05 21:08:43   INFO  epoch: 11/72, acc_iter=44182, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:43, time_cost(all): 10:13:28/2 days, 8:00:06, loss=0.548874735060366, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=0.7664184231703395, lr=0.46267441521214286
2023-12-05 21:09:24   INFO  epoch: 11/72, acc_iter=44232, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:30, time_cost(all): 10:14:09/2 days, 7:11:37, loss=0.548815537634425, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=1.370213885662901, lr=0.4625604481367751
2023-12-05 21:10:06   INFO  epoch: 11/72, acc_iter=44282, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:23, time_cost(all): 10:14:51/2 days, 7:24:13, loss=0.548756340208484, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=1.2056556430631744, lr=0.4624464810614073
2023-12-05 21:10:48   INFO  epoch: 11/72, acc_iter=44332, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:10, time_cost(all): 10:15:33/2 days, 5:19:37, loss=0.548697142782543, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=3.8129793794312676, lr=0.4623325139860395
2023-12-05 21:11:30   INFO  epoch: 11/72, acc_iter=44382, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:34, time_cost(all): 10:16:15/2 days, 8:38:06, loss=0.548637945356602, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=3.821525699172386, lr=0.4622185469106717
2023-12-05 21:12:12   INFO  epoch: 11/72, acc_iter=44432, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:25, time_cost(all): 10:16:57/2 days, 4:02:12, loss=0.548578747930661, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=2.5897077067526055, lr=0.4621045798353039
2023-12-05 21:12:53   INFO  epoch: 11/72, acc_iter=44482, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:42, time_cost(all): 10:17:38/2 days, 3:49:29, loss=0.54851955050472, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=0.8603402687314852, lr=0.46199061275993614
2023-12-05 21:13:35   INFO  epoch: 11/72, acc_iter=44532, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:09, time_cost(all): 10:18:20/2 days, 7:09:22, loss=0.548460353078779, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=4.241888375900803, lr=0.4618766456845683
2023-12-05 21:14:17   INFO  epoch: 11/72, acc_iter=44582, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:52, time_cost(all): 10:19:02/2 days, 4:45:05, loss=0.548401155652838, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.2(1.03), norm=2.1952547228174475, lr=0.4617626786092005
2023-12-05 21:14:59   INFO  epoch: 11/72, acc_iter=44632, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:49, time_cost(all): 10:19:44/2 days, 5:47:43, loss=0.548341958226897, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=1.9615633512742272, lr=0.46164871153383275
2023-12-05 21:15:40   INFO  epoch: 11/72, acc_iter=44682, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:01, time_cost(all): 10:20:25/2 days, 3:38:23, loss=0.548282760800956, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=3.701309339455069, lr=0.46153474445846493
2023-12-05 21:16:22   INFO  epoch: 11/72, acc_iter=44732, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:58, time_cost(all): 10:21:07/2 days, 5:57:19, loss=0.548223563375015, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=1.169382991321835, lr=0.46142077738309717
2023-12-05 21:17:04   INFO  epoch: 11/72, acc_iter=44782, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:43, time_cost(all): 10:21:49/2 days, 8:30:12, loss=0.548164365949074, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=2.612832219610528, lr=0.46130681030772935
2023-12-05 21:17:46   INFO  epoch: 11/72, acc_iter=44832, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:54, time_cost(all): 10:22:31/2 days, 6:16:28, loss=0.548105168523133, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=0.6694696219334106, lr=0.46119284323236154
2023-12-05 21:18:28   INFO  epoch: 11/72, acc_iter=44882, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:50, time_cost(all): 10:23:13/2 days, 6:47:23, loss=0.548045971097192, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=4.245146309335633, lr=0.4610788761569938
2023-12-05 21:19:09   INFO  epoch: 11/72, acc_iter=44932, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:45, time_cost(all): 10:23:54/2 days, 6:32:22, loss=0.547986773671251, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.9361976131597525, lr=0.46096490908162596
2023-12-05 21:19:51   INFO  epoch: 11/72, acc_iter=44982, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:03, time_cost(all): 10:24:36/2 days, 7:23:26, loss=0.547927576245311, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=1.6411978211186729, lr=0.46085094200625815
2023-12-05 21:20:33   INFO  epoch: 11/72, acc_iter=45032, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:19:08, time_cost(all): 10:25:18/2 days, 4:27:12, loss=0.54786837881937, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=3.2357601861276586, lr=0.4607369749308904
2023-12-05 21:21:15   INFO  epoch: 11/72, acc_iter=45082, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:48, time_cost(all): 10:26:00/2 days, 8:15:03, loss=0.547809181393429, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=3.2284206147541163, lr=0.46062300785552257
2023-12-05 21:21:56   INFO  epoch: 11/72, acc_iter=45132, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:18, time_cost(all): 10:26:41/2 days, 5:46:47, loss=0.547749983967488, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=1.0524556494964532, lr=0.4605090407801548
2023-12-05 21:22:38   INFO  epoch: 11/72, acc_iter=45182, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:53, time_cost(all): 10:27:23/2 days, 4:33:06, loss=0.547690786541547, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=0.8383131461044186, lr=0.460395073704787
2023-12-05 21:23:20   INFO  epoch: 11/72, acc_iter=45232, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:04, time_cost(all): 10:28:05/2 days, 5:11:42, loss=0.547631589115606, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=4.346535586137433, lr=0.4602811066294192
2023-12-05 21:24:02   INFO  epoch: 11/72, acc_iter=45282, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:50, time_cost(all): 10:28:47/2 days, 7:02:09, loss=0.547572391689665, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.0653787214352533, lr=0.4601671395540514
2023-12-05 21:24:44   INFO  epoch: 11/72, acc_iter=45332, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:19, time_cost(all): 10:29:29/2 days, 6:46:55, loss=0.547513194263724, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=0.9924085285432684, lr=0.4600531724786836
2023-12-05 21:25:25   INFO  epoch: 11/72, acc_iter=45382, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:37, time_cost(all): 10:30:10/2 days, 5:13:37, loss=0.547453996837783, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=4.632473370309881, lr=0.4599392054033158
2023-12-05 21:26:07   INFO  epoch: 11/72, acc_iter=45432, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:27, time_cost(all): 10:30:52/2 days, 6:59:59, loss=0.547394799411842, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=0.5932041720790613, lr=0.459825238327948
2023-12-05 21:26:49   INFO  epoch: 11/72, acc_iter=45482, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:30, time_cost(all): 10:31:34/2 days, 6:48:46, loss=0.547335601985901, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=4.220751496437012, lr=0.4597112712525802
2023-12-05 21:27:31   INFO  epoch: 11/72, acc_iter=45532, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:50, time_cost(all): 10:32:16/2 days, 7:50:24, loss=0.54727640455996, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=2.5098326800574684, lr=0.45959730417721245
2023-12-05 21:28:12   INFO  epoch: 11/72, acc_iter=45582, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:25, time_cost(all): 10:32:57/2 days, 5:52:29, loss=0.547217207134019, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=1.5507537528221982, lr=0.45948333710184464
2023-12-05 21:28:54   INFO  epoch: 11/72, acc_iter=45632, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:13, time_cost(all): 10:33:39/2 days, 7:27:57, loss=0.547158009708078, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=3.9657009864979025, lr=0.4593693700264768
2023-12-05 21:29:36   INFO  epoch: 11/72, acc_iter=45682, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:11, time_cost(all): 10:34:21/2 days, 6:02:10, loss=0.547098812282137, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=1.824806508794455, lr=0.45925540295110906
2023-12-05 21:30:18   INFO  epoch: 11/72, acc_iter=45732, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:51, time_cost(all): 10:35:03/2 days, 3:43:02, loss=0.547039614856196, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=2.871471360072097, lr=0.45914143587574124
2023-12-05 21:31:00   INFO  epoch: 11/72, acc_iter=45782, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:27, time_cost(all): 10:35:45/2 days, 3:17:34, loss=0.546980417430255, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=3.289264090737816, lr=0.4590274688003735
2023-12-05 21:31:41   INFO  epoch: 11/72, acc_iter=45832, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:13, time_cost(all): 10:36:26/2 days, 7:24:58, loss=0.546921220004315, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=3.6896592964504933, lr=0.45891350172500567
2023-12-05 21:32:23   INFO  epoch: 11/72, acc_iter=45882, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:37, time_cost(all): 10:37:08/2 days, 4:01:57, loss=0.546862022578374, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=3.778291458082791, lr=0.45879953464963785
2023-12-05 21:33:05   INFO  epoch: 11/72, acc_iter=45932, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:54, time_cost(all): 10:37:50/2 days, 7:36:02, loss=0.546802825152433, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=4.996555468969296, lr=0.4586855675742701
2023-12-05 21:33:47   INFO  epoch: 11/72, acc_iter=45982, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:02, time_cost(all): 10:38:32/2 days, 7:17:11, loss=0.546743627726492, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.8864859009467727, lr=0.4585716004989023
2023-12-05 21:34:28   INFO  epoch: 11/72, acc_iter=46032, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:15, time_cost(all): 10:39:13/2 days, 7:19:12, loss=0.546684430300551, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=2.0034351476305914, lr=0.4584576334235345
2023-12-05 21:35:10   INFO  epoch: 11/72, acc_iter=46082, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:38, time_cost(all): 10:39:55/2 days, 7:51:40, loss=0.54662523287461, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=2.328394411189837, lr=0.4583436663481667
2023-12-05 21:35:52   INFO  epoch: 11/72, acc_iter=46132, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:50, time_cost(all): 10:40:37/2 days, 3:54:58, loss=0.546566035448669, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=1.698394552472088, lr=0.4582296992727989
2023-12-05 21:36:34   INFO  epoch: 11/72, acc_iter=46182, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:13, time_cost(all): 10:41:19/2 days, 5:32:16, loss=0.546506838022728, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=1.2560888534010195, lr=0.45811573219743107
2023-12-05 21:37:16   INFO  epoch: 11/72, acc_iter=46232, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:37, time_cost(all): 10:42:01/2 days, 4:51:21, loss=0.546447640596787, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.006593922527059, lr=0.4580017651220633
2023-12-05 21:37:57   INFO  epoch: 11/72, acc_iter=46282, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 10:42:42/2 days, 4:01:17, loss=0.546388443170846, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.654259853069731, lr=0.4578877980466955
2023-12-05 21:38:39   INFO  epoch: 11/72, acc_iter=46332, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 10:43:24/2 days, 4:01:12, loss=0.546329245744905, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=4.117958166878623, lr=0.45777383097132773
2023-12-05 21:39:21   INFO  epoch: 12/72, acc_iter=46394, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:45, time_cost(all): 10:44:06/2 days, 8:20:39, loss=0.546255840936738, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=4.9212194508585165, lr=0.4576325117978717
2023-12-05 21:40:03   INFO  epoch: 12/72, acc_iter=46444, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:25, time_cost(all): 10:44:48/2 days, 7:20:06, loss=0.546196643510797, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=1.0679064446398243, lr=0.45751854472250386
2023-12-05 21:40:45   INFO  epoch: 12/72, acc_iter=46494, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:30, time_cost(all): 10:45:30/2 days, 5:17:44, loss=0.546137446084856, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=4.570735773366017, lr=0.45740457764713605
2023-12-05 21:41:26   INFO  epoch: 12/72, acc_iter=46544, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:48:48, time_cost(all): 10:46:11/2 days, 5:53:08, loss=0.546078248658916, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=3.8137574751197136, lr=0.4572906105717683
2023-12-05 21:42:08   INFO  epoch: 12/72, acc_iter=46594, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:33, time_cost(all): 10:46:53/2 days, 4:40:57, loss=0.546019051232975, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=2.6672766651303164, lr=0.45717664349640047
2023-12-05 21:42:50   INFO  epoch: 12/72, acc_iter=46644, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:32, time_cost(all): 10:47:35/2 days, 6:42:51, loss=0.545959853807034, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=0.6515175130960822, lr=0.45706267642103265
2023-12-05 21:43:32   INFO  epoch: 12/72, acc_iter=46694, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:57, time_cost(all): 10:48:17/2 days, 7:42:24, loss=0.545900656381093, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.23(1.03), norm=0.6904947187303339, lr=0.4569487093456649
2023-12-05 21:44:13   INFO  epoch: 12/72, acc_iter=46744, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:52, time_cost(all): 10:48:58/2 days, 5:07:14, loss=0.545841458955152, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=3.869503608777712, lr=0.4568347422702971
2023-12-05 21:44:55   INFO  epoch: 12/72, acc_iter=46794, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:28, time_cost(all): 10:49:40/2 days, 4:36:25, loss=0.545782261529211, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=3.558908074967981, lr=0.4567207751949293
2023-12-05 21:45:37   INFO  epoch: 12/72, acc_iter=46844, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:43, time_cost(all): 10:50:22/2 days, 4:40:05, loss=0.54572306410327, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=0.9067679620911703, lr=0.4566068081195615
2023-12-05 21:46:19   INFO  epoch: 12/72, acc_iter=46894, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:48:06, time_cost(all): 10:51:04/2 days, 4:53:21, loss=0.545663866677329, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.874108287465755, lr=0.4564928410441937
2023-12-05 21:47:01   INFO  epoch: 12/72, acc_iter=46944, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:49, time_cost(all): 10:51:46/2 days, 6:20:18, loss=0.545604669251388, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=4.159395274389381, lr=0.4563788739688259
2023-12-05 21:47:42   INFO  epoch: 12/72, acc_iter=46994, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:35, time_cost(all): 10:52:27/2 days, 7:59:05, loss=0.545545471825447, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.73922755068253, lr=0.4562649068934581
2023-12-05 21:48:24   INFO  epoch: 12/72, acc_iter=47044, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:26, time_cost(all): 10:53:09/2 days, 5:53:10, loss=0.545486274399506, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.95(1.03), norm=3.5316524278065438, lr=0.4561509398180903
2023-12-05 21:49:06   INFO  epoch: 12/72, acc_iter=47094, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:47, time_cost(all): 10:53:51/2 days, 4:22:26, loss=0.545427076973565, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=0.5731565639479268, lr=0.45603697274272254
2023-12-05 21:49:48   INFO  epoch: 12/72, acc_iter=47144, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:21, time_cost(all): 10:54:33/2 days, 3:25:35, loss=0.545367879547624, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=1.931927590564281, lr=0.4559230056673547
2023-12-05 21:50:29   INFO  epoch: 12/72, acc_iter=47194, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:25, time_cost(all): 10:55:14/2 days, 3:10:03, loss=0.545308682121683, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=4.0101188415490014, lr=0.45580903859198696
2023-12-05 21:51:11   INFO  epoch: 12/72, acc_iter=47244, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:09, time_cost(all): 10:55:56/2 days, 7:00:50, loss=0.545249484695742, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=3.9599541840092756, lr=0.45569507151661914
2023-12-05 21:51:53   INFO  epoch: 12/72, acc_iter=47294, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:36, time_cost(all): 10:56:38/2 days, 7:01:06, loss=0.545190287269801, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=1.9991837244938666, lr=0.45558110444125133
2023-12-05 21:52:35   INFO  epoch: 12/72, acc_iter=47344, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:39, time_cost(all): 10:57:20/2 days, 6:03:07, loss=0.54513108984386, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=4.5136860139273205, lr=0.45546713736588357
2023-12-05 21:53:17   INFO  epoch: 12/72, acc_iter=47394, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:39, time_cost(all): 10:58:02/2 days, 6:32:37, loss=0.54507189241792, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=3.5043632352949077, lr=0.45535317029051575
2023-12-05 21:53:58   INFO  epoch: 12/72, acc_iter=47444, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:53, time_cost(all): 10:58:43/2 days, 5:37:16, loss=0.545012694991979, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=1.5842427872811031, lr=0.455239203215148
2023-12-05 21:54:40   INFO  epoch: 12/72, acc_iter=47494, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:15, time_cost(all): 10:59:25/2 days, 4:18:05, loss=0.544953497566038, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=3.1304903463515314, lr=0.4551252361397802
2023-12-05 21:55:22   INFO  epoch: 12/72, acc_iter=47544, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:50, time_cost(all): 11:00:07/2 days, 5:38:19, loss=0.544894300140097, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=1.0675419633465084, lr=0.45501126906441236
2023-12-05 21:56:04   INFO  epoch: 12/72, acc_iter=47594, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:44, time_cost(all): 11:00:49/2 days, 7:45:40, loss=0.544835102714156, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=1.65469951218229, lr=0.4548973019890446
2023-12-05 21:56:45   INFO  epoch: 12/72, acc_iter=47644, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:51, time_cost(all): 11:01:30/2 days, 5:36:19, loss=0.544775905288215, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=1.5734897226938278, lr=0.4547833349136768
2023-12-05 21:57:27   INFO  epoch: 12/72, acc_iter=47694, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:25, time_cost(all): 11:02:12/2 days, 5:53:10, loss=0.544716707862274, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=4.121616113819451, lr=0.454669367838309
2023-12-05 21:58:09   INFO  epoch: 12/72, acc_iter=47744, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:05, time_cost(all): 11:02:54/2 days, 6:19:13, loss=0.544657510436333, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=0.5818386389780319, lr=0.4545554007629412
2023-12-05 21:58:51   INFO  epoch: 12/72, acc_iter=47794, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:12, time_cost(all): 11:03:36/2 days, 6:52:25, loss=0.544598313010392, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=3.4744480403983253, lr=0.4544414336875734
2023-12-05 21:59:33   INFO  epoch: 12/72, acc_iter=47844, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:40, time_cost(all): 11:04:18/2 days, 6:04:36, loss=0.544539115584451, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=1.1950784338308924, lr=0.4543274666122056
2023-12-05 22:00:14   INFO  epoch: 12/72, acc_iter=47894, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:02, time_cost(all): 11:04:59/2 days, 2:52:13, loss=0.54447991815851, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=2.2540868025277288, lr=0.4542134995368378
2023-12-05 22:00:56   INFO  epoch: 12/72, acc_iter=47944, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:23, time_cost(all): 11:05:41/2 days, 4:41:54, loss=0.544420720732569, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=4.217045864340332, lr=0.45409953246147
2023-12-05 22:01:38   INFO  epoch: 12/72, acc_iter=47994, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:30, time_cost(all): 11:06:23/2 days, 3:15:08, loss=0.544361523306628, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=4.720064229863324, lr=0.45398556538610224
2023-12-05 22:02:20   INFO  epoch: 12/72, acc_iter=48044, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:11, time_cost(all): 11:07:05/2 days, 7:32:34, loss=0.544302325880687, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=0.9962483532258526, lr=0.4538715983107344
2023-12-05 22:03:01   INFO  epoch: 12/72, acc_iter=48094, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:26, time_cost(all): 11:07:46/2 days, 7:21:08, loss=0.544243128454746, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=3.258024859754841, lr=0.4537576312353666
2023-12-05 22:03:43   INFO  epoch: 12/72, acc_iter=48144, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:09, time_cost(all): 11:08:28/2 days, 4:56:07, loss=0.544183931028805, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=2.4393449826995406, lr=0.45364366415999885
2023-12-05 22:04:25   INFO  epoch: 12/72, acc_iter=48194, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:38, time_cost(all): 11:09:10/2 days, 4:24:17, loss=0.544124733602864, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=2.679292555225588, lr=0.45352969708463103
2023-12-05 22:05:07   INFO  epoch: 12/72, acc_iter=48244, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:05, time_cost(all): 11:09:52/2 days, 5:22:43, loss=0.544065536176923, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=1.960288789983813, lr=0.4534157300092633
2023-12-05 22:05:49   INFO  epoch: 12/72, acc_iter=48294, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:52, time_cost(all): 11:10:34/2 days, 2:53:46, loss=0.544006338750983, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=4.77561944039008, lr=0.45330176293389546
2023-12-05 22:06:30   INFO  epoch: 12/72, acc_iter=48344, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:30, time_cost(all): 11:11:15/2 days, 2:56:53, loss=0.543947141325042, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=1.7597236903813114, lr=0.45318779585852764
2023-12-05 22:07:12   INFO  epoch: 12/72, acc_iter=48394, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:00, time_cost(all): 11:11:57/2 days, 2:54:50, loss=0.543887943899101, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=3.000491431068322, lr=0.4530738287831599
2023-12-05 22:07:54   INFO  epoch: 12/72, acc_iter=48444, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:28, time_cost(all): 11:12:39/2 days, 3:07:50, loss=0.54382874647316, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=3.8523648848951737, lr=0.45295986170779207
2023-12-05 22:08:36   INFO  epoch: 12/72, acc_iter=48494, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:44, time_cost(all): 11:13:21/2 days, 6:49:26, loss=0.543769549047219, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=1.5330941992327054, lr=0.4528458946324243
2023-12-05 22:09:17   INFO  epoch: 12/72, acc_iter=48544, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:24:10, time_cost(all): 11:14:02/2 days, 7:48:03, loss=0.543710351621278, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=4.788804733155084, lr=0.4527319275570565
2023-12-05 22:09:59   INFO  epoch: 12/72, acc_iter=48594, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:58, time_cost(all): 11:14:44/2 days, 6:08:47, loss=0.543651154195337, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=1.2736097100906771, lr=0.4526179604816887
2023-12-05 22:10:41   INFO  epoch: 12/72, acc_iter=48644, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:04, time_cost(all): 11:15:26/2 days, 3:20:50, loss=0.543591956769396, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=2.513486257808305, lr=0.4525039934063209
2023-12-05 22:11:23   INFO  epoch: 12/72, acc_iter=48694, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:03, time_cost(all): 11:16:08/2 days, 5:36:23, loss=0.543532759343455, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=1.6863722054025854, lr=0.4523900263309531
2023-12-05 22:12:05   INFO  epoch: 12/72, acc_iter=48744, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:48, time_cost(all): 11:16:50/2 days, 3:06:11, loss=0.543473561917514, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=3.139197030894575, lr=0.4522760592555853
2023-12-05 22:12:46   INFO  epoch: 12/72, acc_iter=48794, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:08, time_cost(all): 11:17:31/2 days, 7:28:54, loss=0.543414364491573, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=0.7723898313515143, lr=0.4521620921802175
2023-12-05 22:13:28   INFO  epoch: 12/72, acc_iter=48844, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:04, time_cost(all): 11:18:13/2 days, 3:45:57, loss=0.543355167065632, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=1.8143257805503876, lr=0.4520481251048497
2023-12-05 22:14:10   INFO  epoch: 12/72, acc_iter=48894, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:10, time_cost(all): 11:18:55/2 days, 6:22:34, loss=0.543295969639691, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=1.0390707763060623, lr=0.45193415802948195
2023-12-05 22:14:52   INFO  epoch: 12/72, acc_iter=48944, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:55, time_cost(all): 11:19:37/2 days, 7:48:58, loss=0.54323677221375, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=4.500604561810539, lr=0.45182019095411413
2023-12-05 22:15:34   INFO  epoch: 12/72, acc_iter=48994, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:25, time_cost(all): 11:20:19/2 days, 3:46:14, loss=0.543177574787809, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=3.9131443190914776, lr=0.4517062238787463
2023-12-05 22:16:15   INFO  epoch: 12/72, acc_iter=49044, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:34, time_cost(all): 11:21:00/2 days, 7:48:13, loss=0.543118377361868, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=1.0356632653397138, lr=0.45159225680337856
2023-12-05 22:16:57   INFO  epoch: 12/72, acc_iter=49094, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:05, time_cost(all): 11:21:42/2 days, 5:01:43, loss=0.543059179935927, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=4.634708508273916, lr=0.45147828972801074
2023-12-05 22:17:39   INFO  epoch: 12/72, acc_iter=49144, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:07, time_cost(all): 11:22:24/2 days, 5:32:16, loss=0.542999982509986, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=4.721500983273145, lr=0.4513643226526429
2023-12-05 22:18:21   INFO  epoch: 12/72, acc_iter=49194, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:13, time_cost(all): 11:23:06/2 days, 4:02:46, loss=0.542940785084046, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=3.7555765747277468, lr=0.45125035557727516
2023-12-05 22:19:02   INFO  epoch: 12/72, acc_iter=49244, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:54, time_cost(all): 11:23:47/2 days, 5:17:02, loss=0.542881587658105, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=0.6544446878312873, lr=0.45113638850190735
2023-12-05 22:19:44   INFO  epoch: 12/72, acc_iter=49294, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:04, time_cost(all): 11:24:29/2 days, 6:12:40, loss=0.542822390232164, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=2.465157949295044, lr=0.4510224214265396
2023-12-05 22:20:26   INFO  epoch: 12/72, acc_iter=49344, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:35, time_cost(all): 11:25:11/2 days, 4:19:23, loss=0.542763192806223, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.831774532576011, lr=0.4509084543511718
2023-12-05 22:21:08   INFO  epoch: 12/72, acc_iter=49394, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:30, time_cost(all): 11:25:53/2 days, 7:00:15, loss=0.542703995380282, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=3.5228518991119238, lr=0.45079448727580396
2023-12-05 22:21:50   INFO  epoch: 12/72, acc_iter=49444, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:52, time_cost(all): 11:26:35/2 days, 5:45:46, loss=0.542644797954341, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=4.3964750237960635, lr=0.4506805202004362
2023-12-05 22:22:31   INFO  epoch: 12/72, acc_iter=49494, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:24, time_cost(all): 11:27:16/2 days, 6:06:36, loss=0.5425856005284, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=3.2761434460833723, lr=0.4505665531250684
2023-12-05 22:23:13   INFO  epoch: 12/72, acc_iter=49544, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:50, time_cost(all): 11:27:58/2 days, 4:52:17, loss=0.542526403102459, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=3.7491304053569285, lr=0.4504525860497006
2023-12-05 22:23:55   INFO  epoch: 12/72, acc_iter=49594, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:44, time_cost(all): 11:28:40/2 days, 3:38:11, loss=0.542467205676518, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=1.6397375692807517, lr=0.4503386189743328
2023-12-05 22:24:37   INFO  epoch: 12/72, acc_iter=49644, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:50, time_cost(all): 11:29:22/2 days, 5:51:25, loss=0.542408008250577, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=2.619465909649837, lr=0.450224651898965
2023-12-05 22:25:18   INFO  epoch: 12/72, acc_iter=49694, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:27, time_cost(all): 11:30:03/2 days, 2:26:26, loss=0.542348810824636, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=4.668199688543165, lr=0.45011068482359723
2023-12-05 22:26:00   INFO  epoch: 12/72, acc_iter=49744, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:33, time_cost(all): 11:30:45/2 days, 2:47:35, loss=0.542289613398695, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=3.8300095757085293, lr=0.4499967177482294
2023-12-05 22:26:42   INFO  epoch: 12/72, acc_iter=49794, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:31, time_cost(all): 11:31:27/2 days, 6:17:55, loss=0.542230415972754, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=2.005796214502565, lr=0.44988275067286165
2023-12-05 22:27:24   INFO  epoch: 12/72, acc_iter=49844, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:11, time_cost(all): 11:32:09/2 days, 5:57:44, loss=0.542171218546813, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=2.4077714401548933, lr=0.44976878359749384
2023-12-05 22:28:06   INFO  epoch: 12/72, acc_iter=49894, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:08, time_cost(all): 11:32:51/2 days, 7:03:11, loss=0.542112021120872, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=4.676216286076953, lr=0.449654816522126
2023-12-05 22:28:47   INFO  epoch: 12/72, acc_iter=49944, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:44, time_cost(all): 11:33:32/2 days, 2:51:41, loss=0.542052823694931, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=3.9118626223204997, lr=0.44954084944675826
2023-12-05 22:29:29   INFO  epoch: 12/72, acc_iter=49994, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:01, time_cost(all): 11:34:14/2 days, 7:33:16, loss=0.541993626268991, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=1.003073049824977, lr=0.44942688237139045
2023-12-05 22:30:11   INFO  epoch: 12/72, acc_iter=50044, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:13, time_cost(all): 11:34:56/2 days, 6:06:13, loss=0.54193442884305, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=3.5010873686210213, lr=0.44931291529602263
2023-12-05 22:30:53   INFO  epoch: 12/72, acc_iter=50094, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 11:35:38/2 days, 7:26:33, loss=0.541875231417109, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.4904844580432424, lr=0.44919894822065487
2023-12-05 22:31:34   INFO  epoch: 12/72, acc_iter=50144, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 11:36:19/2 days, 5:41:11, loss=0.541816033991168, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=1.9052378049698093, lr=0.44908498114528705
2023-12-05 22:32:16   INFO  epoch: 12/72, acc_iter=50194, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 11:37:01/2 days, 2:44:30, loss=0.541756836565227, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=0.5153693462473612, lr=0.44897101406991924
2023-12-05 22:32:58   INFO  epoch: 13/72, acc_iter=50256, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:13, time_cost(all): 11:37:43/2 days, 3:17:10, loss=0.54168343175706, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=2.140842199793366, lr=0.4488296948964632
2023-12-05 22:33:40   INFO  epoch: 13/72, acc_iter=50306, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:08, time_cost(all): 11:38:25/2 days, 4:29:07, loss=0.541624234331119, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=3.1442138150617462, lr=0.4487157278210954
2023-12-05 22:34:22   INFO  epoch: 13/72, acc_iter=50356, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:18, time_cost(all): 11:39:07/2 days, 2:20:46, loss=0.541565036905178, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=2.6946182987028098, lr=0.4486017607457276
2023-12-05 22:35:03   INFO  epoch: 13/72, acc_iter=50406, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:06, time_cost(all): 11:39:48/2 days, 3:56:50, loss=0.541505839479237, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=4.69175254119771, lr=0.4484877936703598
2023-12-05 22:35:45   INFO  epoch: 13/72, acc_iter=50456, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:14, time_cost(all): 11:40:30/2 days, 5:53:41, loss=0.541446642053296, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=3.0073806692652925, lr=0.44837382659499203
2023-12-05 22:36:27   INFO  epoch: 13/72, acc_iter=50506, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:39, time_cost(all): 11:41:12/2 days, 2:59:55, loss=0.541387444627355, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=1.478595182280807, lr=0.4482598595196242
2023-12-05 22:37:09   INFO  epoch: 13/72, acc_iter=50556, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:57, time_cost(all): 11:41:54/2 days, 4:31:48, loss=0.541328247201414, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=1.4790806892976802, lr=0.44814589244425646
2023-12-05 22:37:50   INFO  epoch: 13/72, acc_iter=50606, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:44, time_cost(all): 11:42:35/2 days, 3:35:28, loss=0.541269049775473, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=2.5256712933821523, lr=0.44803192536888864
2023-12-05 22:38:32   INFO  epoch: 13/72, acc_iter=50656, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:59, time_cost(all): 11:43:17/2 days, 2:12:35, loss=0.541209852349532, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=1.8325379501585792, lr=0.4479179582935208
2023-12-05 22:39:14   INFO  epoch: 13/72, acc_iter=50706, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:01, time_cost(all): 11:43:59/2 days, 2:37:03, loss=0.541150654923592, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=0.9633447419435093, lr=0.44780399121815306
2023-12-05 22:39:56   INFO  epoch: 13/72, acc_iter=50756, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:00, time_cost(all): 11:44:41/2 days, 2:52:48, loss=0.541091457497651, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=4.415603994691906, lr=0.44769002414278525
2023-12-05 22:40:38   INFO  epoch: 13/72, acc_iter=50806, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:41, time_cost(all): 11:45:23/2 days, 6:35:44, loss=0.54103226007171, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=0.5404999969727361, lr=0.44757605706741743
2023-12-05 22:41:19   INFO  epoch: 13/72, acc_iter=50856, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:36, time_cost(all): 11:46:04/2 days, 5:03:53, loss=0.540973062645769, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=1.8451225725585143, lr=0.44746208999204967
2023-12-05 22:42:01   INFO  epoch: 13/72, acc_iter=50906, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:09, time_cost(all): 11:46:46/2 days, 3:03:46, loss=0.540913865219828, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=2.1626398420381885, lr=0.44734812291668186
2023-12-05 22:42:43   INFO  epoch: 13/72, acc_iter=50956, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:31, time_cost(all): 11:47:28/2 days, 7:14:50, loss=0.540854667793887, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=0.6192811666055202, lr=0.4472341558413141
2023-12-05 22:43:25   INFO  epoch: 13/72, acc_iter=51006, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:08, time_cost(all): 11:48:10/2 days, 4:25:07, loss=0.540795470367946, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=3.3196574739895324, lr=0.4471201887659463
2023-12-05 22:44:06   INFO  epoch: 13/72, acc_iter=51056, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:01, time_cost(all): 11:48:51/2 days, 7:15:09, loss=0.540736272942005, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=0.7111968777767478, lr=0.44700622169057846
2023-12-05 22:44:48   INFO  epoch: 13/72, acc_iter=51106, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:38, time_cost(all): 11:49:33/2 days, 3:29:50, loss=0.540677075516064, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=3.2749987595662198, lr=0.4468922546152107
2023-12-05 22:45:30   INFO  epoch: 13/72, acc_iter=51156, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:26, time_cost(all): 11:50:15/2 days, 4:28:53, loss=0.540617878090123, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=1.0895850342213553, lr=0.4467782875398429
2023-12-05 22:46:12   INFO  epoch: 13/72, acc_iter=51206, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:11, time_cost(all): 11:50:57/2 days, 3:30:44, loss=0.540558680664182, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.0555141222831947, lr=0.44666432046447513
2023-12-05 22:46:54   INFO  epoch: 13/72, acc_iter=51256, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:22, time_cost(all): 11:51:39/2 days, 7:18:45, loss=0.540499483238241, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=3.3081200287749692, lr=0.4465503533891073
2023-12-05 22:47:35   INFO  epoch: 13/72, acc_iter=51306, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:08, time_cost(all): 11:52:20/2 days, 2:43:51, loss=0.5404402858123, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=3.943165884455834, lr=0.4464363863137395
2023-12-05 22:48:17   INFO  epoch: 13/72, acc_iter=51356, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:35:53, time_cost(all): 11:53:02/2 days, 2:38:22, loss=0.540381088386359, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=4.337623104592202, lr=0.44632241923837174
2023-12-05 22:48:59   INFO  epoch: 13/72, acc_iter=51406, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:42, time_cost(all): 11:53:44/2 days, 2:15:52, loss=0.540321890960418, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=1.6854427106118608, lr=0.4462084521630039
2023-12-05 22:49:41   INFO  epoch: 13/72, acc_iter=51456, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:16, time_cost(all): 11:54:26/2 days, 2:21:36, loss=0.540262693534477, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=3.2943606784376915, lr=0.44609448508763616
2023-12-05 22:50:23   INFO  epoch: 13/72, acc_iter=51506, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:47, time_cost(all): 11:55:08/2 days, 2:43:11, loss=0.540203496108536, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=3.16037459850784, lr=0.44598051801226835
2023-12-05 22:51:04   INFO  epoch: 13/72, acc_iter=51556, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:33, time_cost(all): 11:55:49/2 days, 2:33:05, loss=0.540144298682596, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=2.857744377670685, lr=0.44586655093690053
2023-12-05 22:51:46   INFO  epoch: 13/72, acc_iter=51606, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:13, time_cost(all): 11:56:31/2 days, 6:16:44, loss=0.540085101256655, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=2.524500803877708, lr=0.44575258386153277
2023-12-05 22:52:28   INFO  epoch: 13/72, acc_iter=51656, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:38, time_cost(all): 11:57:13/2 days, 3:35:04, loss=0.540025903830714, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=1.3617884795716195, lr=0.44563861678616495
2023-12-05 22:53:10   INFO  epoch: 13/72, acc_iter=51706, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:05, time_cost(all): 11:57:55/2 days, 2:48:17, loss=0.539966706404773, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=4.74460325105627, lr=0.44552464971079714
2023-12-05 22:53:51   INFO  epoch: 13/72, acc_iter=51756, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:32, time_cost(all): 11:58:36/2 days, 6:01:39, loss=0.539907508978832, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=4.198647893165737, lr=0.4454106826354294
2023-12-05 22:54:33   INFO  epoch: 13/72, acc_iter=51806, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:14, time_cost(all): 11:59:18/2 days, 4:42:17, loss=0.539848311552891, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=3.2233192846374, lr=0.44529671556006156
2023-12-05 22:55:15   INFO  epoch: 13/72, acc_iter=51856, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:40, time_cost(all): 12:00:00/2 days, 6:27:18, loss=0.53978911412695, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=2.7266229126612087, lr=0.44518274848469375
2023-12-05 22:55:57   INFO  epoch: 13/72, acc_iter=51906, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:23, time_cost(all): 12:00:42/2 days, 5:17:48, loss=0.539729916701009, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=3.5573738206877694, lr=0.445068781409326
2023-12-05 22:56:39   INFO  epoch: 13/72, acc_iter=51956, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:16, time_cost(all): 12:01:24/2 days, 2:35:25, loss=0.539670719275068, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=1.5038428513080053, lr=0.44495481433395817
2023-12-05 22:57:20   INFO  epoch: 13/72, acc_iter=52006, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:23, time_cost(all): 12:02:05/2 days, 6:14:32, loss=0.539611521849127, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=2.6148489444322602, lr=0.4448408472585904
2023-12-05 22:58:02   INFO  epoch: 13/72, acc_iter=52056, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:35, time_cost(all): 12:02:47/2 days, 5:57:52, loss=0.539552324423186, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.83(1.03), norm=3.9054637249499566, lr=0.4447268801832226
2023-12-05 22:58:44   INFO  epoch: 13/72, acc_iter=52106, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:18, time_cost(all): 12:03:29/2 days, 3:48:46, loss=0.539493126997245, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=2.7359130451587834, lr=0.4446129131078548
2023-12-05 22:59:26   INFO  epoch: 13/72, acc_iter=52156, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:01, time_cost(all): 12:04:11/2 days, 2:37:24, loss=0.539433929571304, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=2.2436002147916705, lr=0.444498946032487
2023-12-05 23:00:07   INFO  epoch: 13/72, acc_iter=52206, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:24, time_cost(all): 12:04:52/2 days, 6:39:50, loss=0.539374732145363, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=4.019038859216038, lr=0.4443849789571192
2023-12-05 23:00:49   INFO  epoch: 13/72, acc_iter=52256, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:08, time_cost(all): 12:05:34/2 days, 4:13:01, loss=0.539315534719422, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=2.8952200416417266, lr=0.44427101188175144
2023-12-05 23:01:31   INFO  epoch: 13/72, acc_iter=52306, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:28, time_cost(all): 12:06:16/2 days, 3:21:57, loss=0.539256337293481, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=3.826632462811954, lr=0.4441570448063836
2023-12-05 23:02:13   INFO  epoch: 13/72, acc_iter=52356, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:14, time_cost(all): 12:06:58/2 days, 6:52:32, loss=0.53919713986754, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=1.6956561490626239, lr=0.4440430777310158
2023-12-05 23:02:55   INFO  epoch: 13/72, acc_iter=52406, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:58, time_cost(all): 12:07:40/2 days, 6:32:52, loss=0.5391379424416, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.364946345018874, lr=0.44392911065564805
2023-12-05 23:03:36   INFO  epoch: 13/72, acc_iter=52456, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:54, time_cost(all): 12:08:21/2 days, 6:23:49, loss=0.539078745015659, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=1.2620978764682402, lr=0.44381514358028024
2023-12-05 23:04:18   INFO  epoch: 13/72, acc_iter=52506, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:38, time_cost(all): 12:09:03/2 days, 5:30:32, loss=0.539019547589718, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=2.34698267012585, lr=0.4437011765049125
2023-12-05 23:05:00   INFO  epoch: 13/72, acc_iter=52556, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:14, time_cost(all): 12:09:45/2 days, 6:56:44, loss=0.538960350163777, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=2.913859007925589, lr=0.44358720942954466
2023-12-05 23:05:42   INFO  epoch: 13/72, acc_iter=52606, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:45, time_cost(all): 12:10:27/2 days, 3:29:51, loss=0.538901152737836, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=2.264016311435389, lr=0.44347324235417684
2023-12-05 23:06:23   INFO  epoch: 13/72, acc_iter=52656, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:51, time_cost(all): 12:11:08/2 days, 3:25:34, loss=0.538841955311895, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=1.063101876477769, lr=0.4433592752788091
2023-12-05 23:07:05   INFO  epoch: 13/72, acc_iter=52706, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:33, time_cost(all): 12:11:50/2 days, 6:47:16, loss=0.538782757885954, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=2.9749810638551515, lr=0.44324530820344127
2023-12-05 23:07:47   INFO  epoch: 13/72, acc_iter=52756, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:44, time_cost(all): 12:12:32/2 days, 5:55:16, loss=0.538723560460013, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=0.9377459261957571, lr=0.4431313411280735
2023-12-05 23:08:29   INFO  epoch: 13/72, acc_iter=52806, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:06, time_cost(all): 12:13:14/2 days, 5:08:51, loss=0.538664363034072, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=0.6414911114474333, lr=0.4430173740527057
2023-12-05 23:09:11   INFO  epoch: 13/72, acc_iter=52856, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:03, time_cost(all): 12:13:56/2 days, 5:38:56, loss=0.538605165608131, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=4.964251370430336, lr=0.4429034069773379
2023-12-05 23:09:52   INFO  epoch: 13/72, acc_iter=52906, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:52, time_cost(all): 12:14:37/2 days, 2:20:43, loss=0.53854596818219, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=1.216659151810051, lr=0.44278943990197006
2023-12-05 23:10:34   INFO  epoch: 13/72, acc_iter=52956, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:04, time_cost(all): 12:15:19/2 days, 4:13:13, loss=0.538486770756249, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=3.027259797056477, lr=0.4426754728266023
2023-12-05 23:11:16   INFO  epoch: 13/72, acc_iter=53006, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:09, time_cost(all): 12:16:01/2 days, 2:06:33, loss=0.538427573330308, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=2.37593173937387, lr=0.4425615057512345
2023-12-05 23:11:58   INFO  epoch: 13/72, acc_iter=53056, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:25, time_cost(all): 12:16:43/2 days, 4:25:34, loss=0.538368375904367, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=1.0401571419047322, lr=0.4424475386758667
2023-12-05 23:12:39   INFO  epoch: 13/72, acc_iter=53106, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:12, time_cost(all): 12:17:24/2 days, 4:43:13, loss=0.538309178478426, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=0.7914712669738333, lr=0.4423335716004989
2023-12-05 23:13:21   INFO  epoch: 13/72, acc_iter=53156, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:20, time_cost(all): 12:18:06/2 days, 5:41:25, loss=0.538249981052485, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=2.9379521681098786, lr=0.4422196045251311
2023-12-05 23:14:03   INFO  epoch: 13/72, acc_iter=53206, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:25, time_cost(all): 12:18:48/2 days, 3:51:36, loss=0.538190783626544, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=4.354225804282948, lr=0.44210563744976333
2023-12-05 23:14:45   INFO  epoch: 13/72, acc_iter=53256, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:08, time_cost(all): 12:19:30/2 days, 1:50:26, loss=0.538131586200604, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=4.556395971114175, lr=0.4419916703743955
2023-12-05 23:15:27   INFO  epoch: 13/72, acc_iter=53306, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:11:02, time_cost(all): 12:20:12/2 days, 4:10:32, loss=0.538072388774663, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.882718398050002, lr=0.44187770329902776
2023-12-05 23:16:08   INFO  epoch: 13/72, acc_iter=53356, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:09, time_cost(all): 12:20:53/2 days, 3:23:47, loss=0.538013191348722, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.2124770484295198, lr=0.44176373622365994
2023-12-05 23:16:50   INFO  epoch: 13/72, acc_iter=53406, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:51, time_cost(all): 12:21:35/2 days, 4:44:41, loss=0.537953993922781, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=4.9695692956948125, lr=0.4416497691482921
2023-12-05 23:17:32   INFO  epoch: 13/72, acc_iter=53456, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:30, time_cost(all): 12:22:17/2 days, 2:12:10, loss=0.53789479649684, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=2.266428169342356, lr=0.44153580207292437
2023-12-05 23:18:14   INFO  epoch: 13/72, acc_iter=53506, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:11, time_cost(all): 12:22:59/2 days, 5:06:50, loss=0.537835599070899, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=2.255653231566049, lr=0.44142183499755655
2023-12-05 23:18:55   INFO  epoch: 13/72, acc_iter=53556, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:53, time_cost(all): 12:23:40/2 days, 2:49:18, loss=0.537776401644958, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=4.777835520519176, lr=0.4413078679221888
2023-12-05 23:19:37   INFO  epoch: 13/72, acc_iter=53606, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:44, time_cost(all): 12:24:22/2 days, 2:20:23, loss=0.537717204219017, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=3.089412288980106, lr=0.441193900846821
2023-12-05 23:20:19   INFO  epoch: 13/72, acc_iter=53656, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:34, time_cost(all): 12:25:04/2 days, 6:11:32, loss=0.537658006793076, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=3.9642341868978206, lr=0.44107993377145316
2023-12-05 23:21:01   INFO  epoch: 13/72, acc_iter=53706, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:48, time_cost(all): 12:25:46/2 days, 6:36:22, loss=0.537598809367135, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=4.672889284132145, lr=0.4409659666960854
2023-12-05 23:21:43   INFO  epoch: 13/72, acc_iter=53756, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:24, time_cost(all): 12:26:28/2 days, 1:51:13, loss=0.537539611941194, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=3.972213403068926, lr=0.4408519996207176
2023-12-05 23:22:24   INFO  epoch: 13/72, acc_iter=53806, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:34, time_cost(all): 12:27:09/2 days, 3:38:22, loss=0.537480414515253, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=2.76434415158556, lr=0.44073803254534977
2023-12-05 23:23:06   INFO  epoch: 13/72, acc_iter=53856, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:59, time_cost(all): 12:27:51/2 days, 4:35:35, loss=0.537421217089312, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=2.2290857942455737, lr=0.440624065469982
2023-12-05 23:23:48   INFO  epoch: 13/72, acc_iter=53906, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:16, time_cost(all): 12:28:33/2 days, 5:09:33, loss=0.537362019663371, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=2.6100818995107686, lr=0.4405100983946142
2023-12-05 23:24:30   INFO  epoch: 13/72, acc_iter=53956, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:31, time_cost(all): 12:29:15/2 days, 3:38:04, loss=0.53730282223743, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=2.304514043855895, lr=0.44039613131924643
2023-12-05 23:25:12   INFO  epoch: 13/72, acc_iter=54006, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 12:29:57/2 days, 2:39:50, loss=0.537243624811489, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=1.100669615795291, lr=0.4402821642438786
2023-12-05 23:25:53   INFO  epoch: 13/72, acc_iter=54056, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 12:30:38/2 days, 2:58:50, loss=0.537184427385549, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=2.667963835883337, lr=0.4401681971685108
2023-12-05 23:26:35   INFO  epoch: 14/72, acc_iter=54118, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:42, time_cost(all): 12:31:20/2 days, 2:15:03, loss=0.537111022577382, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=4.12915166354256, lr=0.44002687799505474
2023-12-05 23:27:17   INFO  epoch: 14/72, acc_iter=54168, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:18, time_cost(all): 12:32:02/2 days, 2:15:22, loss=0.537051825151441, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=4.912373663494575, lr=0.439912910919687
2023-12-05 23:27:59   INFO  epoch: 14/72, acc_iter=54218, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:48, time_cost(all): 12:32:44/2 days, 1:41:57, loss=0.5369926277255, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=1.15396406567223, lr=0.43979894384431917
2023-12-05 23:28:40   INFO  epoch: 14/72, acc_iter=54268, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:33, time_cost(all): 12:33:25/2 days, 2:42:18, loss=0.536933430299559, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=4.630708001129537, lr=0.43968497676895135
2023-12-05 23:29:22   INFO  epoch: 14/72, acc_iter=54318, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:21, time_cost(all): 12:34:07/2 days, 4:28:48, loss=0.536874232873618, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=3.7281662926292602, lr=0.4395710096935836
2023-12-05 23:30:04   INFO  epoch: 14/72, acc_iter=54368, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:24, time_cost(all): 12:34:49/2 days, 4:19:14, loss=0.536815035447677, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=4.2919919982586325, lr=0.4394570426182158
2023-12-05 23:30:46   INFO  epoch: 14/72, acc_iter=54418, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:36, time_cost(all): 12:35:31/2 days, 5:04:58, loss=0.536755838021736, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=4.937984554289681, lr=0.43934307554284796
2023-12-05 23:31:28   INFO  epoch: 14/72, acc_iter=54468, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:23, time_cost(all): 12:36:13/2 days, 4:20:51, loss=0.536696640595795, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=4.768558502192219, lr=0.4392291084674802
2023-12-05 23:32:09   INFO  epoch: 14/72, acc_iter=54518, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:56, time_cost(all): 12:36:54/2 days, 4:32:47, loss=0.536637443169854, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=3.8745550332155654, lr=0.4391151413921124
2023-12-05 23:32:51   INFO  epoch: 14/72, acc_iter=54568, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:06, time_cost(all): 12:37:36/2 days, 4:33:13, loss=0.536578245743913, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=3.388309195299692, lr=0.4390011743167446
2023-12-05 23:33:33   INFO  epoch: 14/72, acc_iter=54618, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:59, time_cost(all): 12:38:18/2 days, 4:30:35, loss=0.536519048317972, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=2.234219960923293, lr=0.4388872072413768
2023-12-05 23:34:15   INFO  epoch: 14/72, acc_iter=54668, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:30, time_cost(all): 12:39:00/2 days, 1:31:43, loss=0.536459850892031, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=1.0451647588614161, lr=0.438773240166009
2023-12-05 23:34:56   INFO  epoch: 14/72, acc_iter=54718, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:31, time_cost(all): 12:39:41/2 days, 2:54:08, loss=0.53640065346609, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=0.6989982413578513, lr=0.43865927309064123
2023-12-05 23:35:38   INFO  epoch: 14/72, acc_iter=54768, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:46:03, time_cost(all): 12:40:23/2 days, 3:42:33, loss=0.536341456040149, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=4.526882946069929, lr=0.4385453060152734
2023-12-05 23:36:20   INFO  epoch: 14/72, acc_iter=54818, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:26, time_cost(all): 12:41:05/2 days, 1:45:29, loss=0.536282258614209, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=2.55847091048164, lr=0.4384313389399056
2023-12-05 23:37:02   INFO  epoch: 14/72, acc_iter=54868, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:46, time_cost(all): 12:41:47/2 days, 6:23:19, loss=0.536223061188268, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=2.664924554594954, lr=0.43831737186453784
2023-12-05 23:37:44   INFO  epoch: 14/72, acc_iter=54918, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:22, time_cost(all): 12:42:29/2 days, 6:08:03, loss=0.536163863762327, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=4.809431243579939, lr=0.43820340478917
2023-12-05 23:38:25   INFO  epoch: 14/72, acc_iter=54968, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:26, time_cost(all): 12:43:10/2 days, 4:20:42, loss=0.536104666336386, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=4.41087000972173, lr=0.43808943771380227
2023-12-05 23:39:07   INFO  epoch: 14/72, acc_iter=55018, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:50, time_cost(all): 12:43:52/2 days, 1:28:28, loss=0.536045468910445, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.15(1.03), norm=0.5138253685163077, lr=0.43797547063843445
2023-12-05 23:39:49   INFO  epoch: 14/72, acc_iter=55068, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:09, time_cost(all): 12:44:34/2 days, 2:04:05, loss=0.535986271484504, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=4.17246487946376, lr=0.43786150356306663
2023-12-05 23:40:31   INFO  epoch: 14/72, acc_iter=55118, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:56, time_cost(all): 12:45:16/2 days, 1:55:23, loss=0.535927074058563, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.23(1.03), norm=2.7809476512055267, lr=0.4377475364876989
2023-12-05 23:41:12   INFO  epoch: 14/72, acc_iter=55168, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:58, time_cost(all): 12:45:57/2 days, 4:20:16, loss=0.535867876632622, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=1.0786620192573602, lr=0.43763356941233106
2023-12-05 23:41:54   INFO  epoch: 14/72, acc_iter=55218, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:07, time_cost(all): 12:46:39/2 days, 2:49:51, loss=0.535808679206681, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=4.545565728137151, lr=0.4375196023369633
2023-12-05 23:42:36   INFO  epoch: 14/72, acc_iter=55268, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:33, time_cost(all): 12:47:21/2 days, 1:27:06, loss=0.53574948178074, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=2.9655201182112974, lr=0.4374056352615955
2023-12-05 23:43:18   INFO  epoch: 14/72, acc_iter=55318, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:40, time_cost(all): 12:48:03/2 days, 4:59:47, loss=0.535690284354799, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=1.775366281404968, lr=0.43729166818622767
2023-12-05 23:44:00   INFO  epoch: 14/72, acc_iter=55368, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:12, time_cost(all): 12:48:45/2 days, 2:53:46, loss=0.535631086928858, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=0.8820078895142125, lr=0.43717770111085985
2023-12-05 23:44:41   INFO  epoch: 14/72, acc_iter=55418, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:31, time_cost(all): 12:49:26/2 days, 4:12:08, loss=0.535571889502917, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=4.019645507775961, lr=0.4370637340354921
2023-12-05 23:45:23   INFO  epoch: 14/72, acc_iter=55468, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:27, time_cost(all): 12:50:08/2 days, 3:46:49, loss=0.535512692076976, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=3.910663194466165, lr=0.43694976696012433
2023-12-05 23:46:05   INFO  epoch: 14/72, acc_iter=55518, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:03, time_cost(all): 12:50:50/2 days, 2:14:19, loss=0.535453494651035, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=4.93909617688708, lr=0.4368357998847565
2023-12-05 23:46:47   INFO  epoch: 14/72, acc_iter=55568, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:55, time_cost(all): 12:51:32/2 days, 2:11:51, loss=0.535394297225094, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=1.144212125678556, lr=0.4367218328093887
2023-12-05 23:47:28   INFO  epoch: 14/72, acc_iter=55618, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:09, time_cost(all): 12:52:13/2 days, 3:23:40, loss=0.535335099799154, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=2.921631851091246, lr=0.4366078657340209
2023-12-05 23:48:10   INFO  epoch: 14/72, acc_iter=55668, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:34, time_cost(all): 12:52:55/2 days, 2:23:03, loss=0.535275902373213, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=3.7260205479167747, lr=0.4364938986586531
2023-12-05 23:48:52   INFO  epoch: 14/72, acc_iter=55718, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:56, time_cost(all): 12:53:37/2 days, 1:51:55, loss=0.535216704947272, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=1.7366103215390143, lr=0.4363799315832853
2023-12-05 23:49:34   INFO  epoch: 14/72, acc_iter=55768, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:23, time_cost(all): 12:54:19/2 days, 4:49:12, loss=0.535157507521331, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=4.396927161973659, lr=0.43626596450791755
2023-12-05 23:50:16   INFO  epoch: 14/72, acc_iter=55818, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:58, time_cost(all): 12:55:01/2 days, 2:31:19, loss=0.53509831009539, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=2.911533411566996, lr=0.43615199743254973
2023-12-05 23:50:57   INFO  epoch: 14/72, acc_iter=55868, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:19, time_cost(all): 12:55:42/2 days, 2:15:06, loss=0.535039112669449, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=1.884539880496575, lr=0.4360380303571819
2023-12-05 23:51:39   INFO  epoch: 14/72, acc_iter=55918, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:49, time_cost(all): 12:56:24/2 days, 3:14:34, loss=0.534979915243508, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=4.201111246011575, lr=0.43592406328181416
2023-12-05 23:52:21   INFO  epoch: 14/72, acc_iter=55968, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:18, time_cost(all): 12:57:06/2 days, 2:39:06, loss=0.534920717817567, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=2.6963157111888787, lr=0.43581009620644634
2023-12-05 23:53:03   INFO  epoch: 14/72, acc_iter=56018, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:42, time_cost(all): 12:57:48/2 days, 5:20:35, loss=0.534861520391626, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=0.6200627171068172, lr=0.4356961291310786
2023-12-05 23:53:44   INFO  epoch: 14/72, acc_iter=56068, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:12, time_cost(all): 12:58:29/2 days, 3:25:06, loss=0.534802322965685, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=1.8201720505472376, lr=0.43558216205571076
2023-12-05 23:54:26   INFO  epoch: 14/72, acc_iter=56118, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:30, time_cost(all): 12:59:11/2 days, 5:44:14, loss=0.534743125539744, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=2.2991337702809473, lr=0.43546819498034295
2023-12-05 23:55:08   INFO  epoch: 14/72, acc_iter=56168, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:37, time_cost(all): 12:59:53/2 days, 1:16:03, loss=0.534683928113803, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=2.007650854238129, lr=0.4353542279049752
2023-12-05 23:55:50   INFO  epoch: 14/72, acc_iter=56218, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:34, time_cost(all): 13:00:35/2 days, 2:51:19, loss=0.534624730687862, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=3.2722718594517826, lr=0.4352402608296074
2023-12-05 23:56:32   INFO  epoch: 14/72, acc_iter=56268, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:24, time_cost(all): 13:01:17/2 days, 5:24:42, loss=0.534565533261921, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=3.0648969318029744, lr=0.4351262937542396
2023-12-05 23:57:13   INFO  epoch: 14/72, acc_iter=56318, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:28, time_cost(all): 13:01:58/2 days, 2:00:30, loss=0.53450633583598, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=2.922342228149807, lr=0.4350123266788718
2023-12-05 23:57:55   INFO  epoch: 14/72, acc_iter=56368, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:30, time_cost(all): 13:02:40/2 days, 5:49:50, loss=0.534447138410039, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=1.5276199537090154, lr=0.434898359603504
2023-12-05 23:58:37   INFO  epoch: 14/72, acc_iter=56418, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:59, time_cost(all): 13:03:22/2 days, 5:12:33, loss=0.534387940984098, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=3.862209431423826, lr=0.4347843925281362
2023-12-05 23:59:19   INFO  epoch: 14/72, acc_iter=56468, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:03, time_cost(all): 13:04:04/2 days, 5:48:42, loss=0.534328743558158, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=3.623580481262086, lr=0.4346704254527684
2023-12-06 00:00:01   INFO  epoch: 14/72, acc_iter=56518, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:10, time_cost(all): 13:04:46/2 days, 3:52:30, loss=0.534269546132217, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=1.2804438765071193, lr=0.43455645837740065
2023-12-06 00:00:42   INFO  epoch: 14/72, acc_iter=56568, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:19, time_cost(all): 13:05:27/2 days, 3:13:10, loss=0.534210348706276, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=4.696561227658723, lr=0.43444249130203283
2023-12-06 00:01:24   INFO  epoch: 14/72, acc_iter=56618, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:49, time_cost(all): 13:06:09/2 days, 3:31:08, loss=0.534151151280335, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=1.2242142953696873, lr=0.434328524226665
2023-12-06 00:02:06   INFO  epoch: 14/72, acc_iter=56668, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:16, time_cost(all): 13:06:51/2 days, 3:37:16, loss=0.534091953854394, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=3.829804454082014, lr=0.43421455715129725
2023-12-06 00:02:48   INFO  epoch: 14/72, acc_iter=56718, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:33, time_cost(all): 13:07:33/2 days, 5:04:51, loss=0.534032756428453, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=1.233331146581736, lr=0.43410059007592944
2023-12-06 00:03:29   INFO  epoch: 14/72, acc_iter=56768, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:59, time_cost(all): 13:08:14/2 days, 1:07:15, loss=0.533973559002512, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=1.113409477925804, lr=0.4339866230005617
2023-12-06 00:04:11   INFO  epoch: 14/72, acc_iter=56818, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:04, time_cost(all): 13:08:56/2 days, 5:54:24, loss=0.533914361576571, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=3.1075001214523947, lr=0.43387265592519386
2023-12-06 00:04:53   INFO  epoch: 14/72, acc_iter=56868, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:17, time_cost(all): 13:09:38/2 days, 5:18:59, loss=0.53385516415063, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=4.195510216464642, lr=0.43375868884982605
2023-12-06 00:05:35   INFO  epoch: 14/72, acc_iter=56918, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:23, time_cost(all): 13:10:20/2 days, 2:03:24, loss=0.533795966724689, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=4.957872414144519, lr=0.43364472177445823
2023-12-06 00:06:17   INFO  epoch: 14/72, acc_iter=56968, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:35, time_cost(all): 13:11:02/2 days, 2:15:43, loss=0.533736769298748, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=0.6613595641164951, lr=0.43353075469909047
2023-12-06 00:06:58   INFO  epoch: 14/72, acc_iter=57018, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:26, time_cost(all): 13:11:43/2 days, 5:26:28, loss=0.533677571872807, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=1.5039029013903684, lr=0.43341678762372265
2023-12-06 00:07:40   INFO  epoch: 14/72, acc_iter=57068, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:56, time_cost(all): 13:12:25/2 days, 2:48:42, loss=0.533618374446866, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.9407894470131315, lr=0.4333028205483549
2023-12-06 00:08:22   INFO  epoch: 14/72, acc_iter=57118, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:07, time_cost(all): 13:13:07/2 days, 0:55:16, loss=0.533559177020925, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=2.767212668975703, lr=0.4331888534729871
2023-12-06 00:09:04   INFO  epoch: 14/72, acc_iter=57168, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:33, time_cost(all): 13:13:49/2 days, 5:11:04, loss=0.533499979594984, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=2.7463484055787326, lr=0.43307488639761926
2023-12-06 00:09:45   INFO  epoch: 14/72, acc_iter=57218, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:58, time_cost(all): 13:14:30/2 days, 2:39:55, loss=0.533440782169043, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=3.4414176179546363, lr=0.4329609193222515
2023-12-06 00:10:27   INFO  epoch: 14/72, acc_iter=57268, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:21, time_cost(all): 13:15:12/2 days, 4:08:56, loss=0.533381584743102, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=4.726365596991683, lr=0.4328469522468837
2023-12-06 00:11:09   INFO  epoch: 14/72, acc_iter=57318, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:52, time_cost(all): 13:15:54/2 days, 3:01:08, loss=0.533322387317162, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=2.5525956659234725, lr=0.4327329851715159
2023-12-06 00:11:51   INFO  epoch: 14/72, acc_iter=57368, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:00, time_cost(all): 13:16:36/2 days, 3:57:11, loss=0.533263189891221, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=3.5588777744285895, lr=0.4326190180961481
2023-12-06 00:12:33   INFO  epoch: 14/72, acc_iter=57418, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:59, time_cost(all): 13:17:18/2 days, 1:16:54, loss=0.53320399246528, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=3.173871411782981, lr=0.4325050510207803
2023-12-06 00:13:14   INFO  epoch: 14/72, acc_iter=57468, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:17, time_cost(all): 13:17:59/2 days, 3:14:19, loss=0.533144795039339, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=4.458469332613143, lr=0.43239108394541254
2023-12-06 00:13:56   INFO  epoch: 14/72, acc_iter=57518, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:32, time_cost(all): 13:18:41/2 days, 4:29:00, loss=0.533085597613398, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=1.403728353754883, lr=0.4322771168700447
2023-12-06 00:14:38   INFO  epoch: 14/72, acc_iter=57568, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:04, time_cost(all): 13:19:23/2 days, 1:46:44, loss=0.533026400187457, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=1.7631633341892536, lr=0.43216314979467696
2023-12-06 00:15:20   INFO  epoch: 14/72, acc_iter=57618, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:21, time_cost(all): 13:20:05/2 days, 1:06:50, loss=0.532967202761516, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=2.003571310487432, lr=0.43204918271930914
2023-12-06 00:16:01   INFO  epoch: 14/72, acc_iter=57668, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:46, time_cost(all): 13:20:46/2 days, 4:37:26, loss=0.532908005335575, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=4.245889773198909, lr=0.43193521564394133
2023-12-06 00:16:43   INFO  epoch: 14/72, acc_iter=57718, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:54, time_cost(all): 13:21:28/2 days, 3:00:42, loss=0.532848807909634, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=2.180023774175758, lr=0.43182124856857357
2023-12-06 00:17:25   INFO  epoch: 14/72, acc_iter=57768, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:18, time_cost(all): 13:22:10/2 days, 5:06:37, loss=0.532789610483693, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=2.9386854147639827, lr=0.43170728149320575
2023-12-06 00:18:07   INFO  epoch: 14/72, acc_iter=57818, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:33, time_cost(all): 13:22:52/2 days, 5:32:58, loss=0.532730413057752, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=1.8229183258345654, lr=0.43159331441783794
2023-12-06 00:18:49   INFO  epoch: 14/72, acc_iter=57868, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 13:23:34/2 days, 3:06:46, loss=0.532671215631811, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=2.8764244566141373, lr=0.4314793473424702
2023-12-06 00:19:30   INFO  epoch: 14/72, acc_iter=57918, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 13:24:15/2 days, 3:13:29, loss=0.53261201820587, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=3.127683366365077, lr=0.43136538026710236
2023-12-06 00:20:12   INFO  epoch: 15/72, acc_iter=57980, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:29, time_cost(all): 13:24:57/2 days, 1:05:25, loss=0.532538613397703, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=3.3251079698650505, lr=0.4312240610936463
2023-12-06 00:20:54   INFO  epoch: 15/72, acc_iter=58030, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:04, time_cost(all): 13:25:39/2 days, 4:50:00, loss=0.532479415971763, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=1.883476737612648, lr=0.4311100940182785
2023-12-06 00:21:36   INFO  epoch: 15/72, acc_iter=58080, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:18, time_cost(all): 13:26:21/2 days, 4:47:19, loss=0.532420218545822, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=3.790723781515611, lr=0.43099612694291073
2023-12-06 00:22:17   INFO  epoch: 15/72, acc_iter=58130, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:56, time_cost(all): 13:27:02/2 days, 2:07:49, loss=0.532361021119881, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.08(1.03), norm=4.618603661986263, lr=0.4308821598675429
2023-12-06 00:22:59   INFO  epoch: 15/72, acc_iter=58180, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:19, time_cost(all): 13:27:44/2 days, 2:05:23, loss=0.53230182369394, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=3.4872922776723194, lr=0.43076819279217515
2023-12-06 00:23:41   INFO  epoch: 15/72, acc_iter=58230, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:54, time_cost(all): 13:28:26/2 days, 3:06:07, loss=0.532242626267999, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=3.6792392911048477, lr=0.43065422571680734
2023-12-06 00:24:23   INFO  epoch: 15/72, acc_iter=58280, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:36, time_cost(all): 13:29:08/2 days, 3:58:48, loss=0.532183428842058, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=4.157889729963295, lr=0.4305402586414395
2023-12-06 00:25:05   INFO  epoch: 15/72, acc_iter=58330, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:18, time_cost(all): 13:29:50/2 days, 4:03:14, loss=0.532124231416117, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=3.6680783000268176, lr=0.4304262915660717
2023-12-06 00:25:46   INFO  epoch: 15/72, acc_iter=58380, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:11, time_cost(all): 13:30:31/2 days, 2:51:22, loss=0.532065033990176, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=2.1443268347568925, lr=0.43031232449070395
2023-12-06 00:26:28   INFO  epoch: 15/72, acc_iter=58430, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:51, time_cost(all): 13:31:13/2 days, 4:17:14, loss=0.532005836564235, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=2.615864867001745, lr=0.43019835741533613
2023-12-06 00:27:10   INFO  epoch: 15/72, acc_iter=58480, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:24, time_cost(all): 13:31:55/2 days, 3:43:04, loss=0.531946639138294, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=4.293592872356296, lr=0.43008439033996837
2023-12-06 00:27:52   INFO  epoch: 15/72, acc_iter=58530, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:10, time_cost(all): 13:32:37/2 days, 0:34:23, loss=0.531887441712353, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=2.6257757766145837, lr=0.42997042326460055
2023-12-06 00:28:33   INFO  epoch: 15/72, acc_iter=58580, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:54, time_cost(all): 13:33:18/2 days, 1:01:37, loss=0.531828244286412, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=3.4853309955737335, lr=0.42985645618923274
2023-12-06 00:29:15   INFO  epoch: 15/72, acc_iter=58630, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:10, time_cost(all): 13:34:00/2 days, 3:09:56, loss=0.531769046860471, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=1.2070451725416507, lr=0.429742489113865
2023-12-06 00:29:57   INFO  epoch: 15/72, acc_iter=58680, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:47, time_cost(all): 13:34:42/2 days, 1:10:07, loss=0.53170984943453, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=2.4942317838992416, lr=0.42962852203849716
2023-12-06 00:30:39   INFO  epoch: 15/72, acc_iter=58730, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:47, time_cost(all): 13:35:24/2 days, 1:37:26, loss=0.531650652008589, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=4.6542908954773425, lr=0.4295145549631294
2023-12-06 00:31:21   INFO  epoch: 15/72, acc_iter=58780, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:14, time_cost(all): 13:36:06/2 days, 1:33:24, loss=0.531591454582648, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=1.848457475373193, lr=0.4294005878877616
2023-12-06 00:32:02   INFO  epoch: 15/72, acc_iter=58830, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:12, time_cost(all): 13:36:47/2 days, 3:23:46, loss=0.531532257156707, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.85(1.03), norm=1.4575848875585953, lr=0.42928662081239377
2023-12-06 00:32:44   INFO  epoch: 15/72, acc_iter=58880, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:19, time_cost(all): 13:37:29/2 days, 5:26:43, loss=0.531473059730767, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.83(1.03), norm=4.760608339553202, lr=0.429172653737026
2023-12-06 00:33:26   INFO  epoch: 15/72, acc_iter=58930, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:09, time_cost(all): 13:38:11/2 days, 3:20:57, loss=0.531413862304826, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=3.833342150975332, lr=0.4290586866616582
2023-12-06 00:34:08   INFO  epoch: 15/72, acc_iter=58980, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:13, time_cost(all): 13:38:53/2 days, 0:55:19, loss=0.531354664878885, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=2.3669947315194673, lr=0.42894471958629043
2023-12-06 00:34:50   INFO  epoch: 15/72, acc_iter=59030, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:14, time_cost(all): 13:39:35/2 days, 4:14:00, loss=0.531295467452944, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=4.652395189823508, lr=0.4288307525109226
2023-12-06 00:35:31   INFO  epoch: 15/72, acc_iter=59080, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:31, time_cost(all): 13:40:16/2 days, 2:41:58, loss=0.531236270027003, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=2.8865623268832143, lr=0.4287167854355548
2023-12-06 00:36:13   INFO  epoch: 15/72, acc_iter=59130, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:19, time_cost(all): 13:40:58/2 days, 1:59:38, loss=0.531177072601062, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=1.4505761890732227, lr=0.428602818360187
2023-12-06 00:36:55   INFO  epoch: 15/72, acc_iter=59180, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:38:06, time_cost(all): 13:41:40/2 days, 0:52:32, loss=0.531117875175121, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=4.272549794890149, lr=0.4284888512848192
2023-12-06 00:37:37   INFO  epoch: 15/72, acc_iter=59230, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:10, time_cost(all): 13:42:22/2 days, 1:31:04, loss=0.53105867774918, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=3.4267280759472194, lr=0.42837488420945147
2023-12-06 00:38:18   INFO  epoch: 15/72, acc_iter=59280, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:36, time_cost(all): 13:43:03/2 days, 3:21:40, loss=0.530999480323239, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=3.9438149930154283, lr=0.42826091713408365
2023-12-06 00:39:00   INFO  epoch: 15/72, acc_iter=59330, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:46, time_cost(all): 13:43:45/2 days, 4:31:53, loss=0.530940282897298, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=1.6337433895361566, lr=0.42814695005871584
2023-12-06 00:39:42   INFO  epoch: 15/72, acc_iter=59380, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:02, time_cost(all): 13:44:27/2 days, 4:12:49, loss=0.530881085471357, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.0406370136259935, lr=0.428032982983348
2023-12-06 00:40:24   INFO  epoch: 15/72, acc_iter=59430, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:23, time_cost(all): 13:45:09/2 days, 1:56:50, loss=0.530821888045416, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=1.9166960637413206, lr=0.42791901590798026
2023-12-06 00:41:06   INFO  epoch: 15/72, acc_iter=59480, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:45, time_cost(all): 13:45:51/2 days, 1:57:40, loss=0.530762690619475, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=2.5937955994633404, lr=0.4278050488326125
2023-12-06 00:41:47   INFO  epoch: 15/72, acc_iter=59530, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:07, time_cost(all): 13:46:32/2 days, 3:23:33, loss=0.530703493193534, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=1.6706372583414326, lr=0.4276910817572447
2023-12-06 00:42:29   INFO  epoch: 15/72, acc_iter=59580, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:01, time_cost(all): 13:47:14/2 days, 4:20:52, loss=0.530644295767593, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=1.1558329435486838, lr=0.42757711468187687
2023-12-06 00:43:11   INFO  epoch: 15/72, acc_iter=59630, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:20, time_cost(all): 13:47:56/2 days, 0:40:27, loss=0.530585098341652, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=1.1258527014774666, lr=0.42746314760650905
2023-12-06 00:43:53   INFO  epoch: 15/72, acc_iter=59680, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:51, time_cost(all): 13:48:38/2 days, 4:59:14, loss=0.530525900915711, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=3.504092356110501, lr=0.4273491805311413
2023-12-06 00:44:34   INFO  epoch: 15/72, acc_iter=59730, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:12, time_cost(all): 13:49:19/2 days, 2:00:39, loss=0.530466703489771, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=0.9905126786247356, lr=0.4272352134557735
2023-12-06 00:45:16   INFO  epoch: 15/72, acc_iter=59780, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:40, time_cost(all): 13:50:01/2 days, 0:29:27, loss=0.53040750606383, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=4.617127162826667, lr=0.4271212463804057
2023-12-06 00:45:58   INFO  epoch: 15/72, acc_iter=59830, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:52, time_cost(all): 13:50:43/2 days, 0:56:38, loss=0.530348308637889, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=1.870047590358796, lr=0.4270072793050379
2023-12-06 00:46:40   INFO  epoch: 15/72, acc_iter=59880, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:40, time_cost(all): 13:51:25/2 days, 4:22:46, loss=0.530289111211948, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=0.8173120297333858, lr=0.4268933122296701
2023-12-06 00:47:22   INFO  epoch: 15/72, acc_iter=59930, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:17, time_cost(all): 13:52:07/2 days, 0:42:16, loss=0.530229913786007, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=1.3804974207590817, lr=0.4267793451543023
2023-12-06 00:48:03   INFO  epoch: 15/72, acc_iter=59980, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:01, time_cost(all): 13:52:48/2 days, 0:42:51, loss=0.530170716360066, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=0.5517777111586215, lr=0.4266653780789345
2023-12-06 00:48:45   INFO  epoch: 15/72, acc_iter=60030, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:43, time_cost(all): 13:53:30/2 days, 0:47:57, loss=0.530111518934125, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=1.7323042714539956, lr=0.42655141100356675
2023-12-06 00:49:27   INFO  epoch: 15/72, acc_iter=60080, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:29, time_cost(all): 13:54:12/2 days, 4:30:23, loss=0.530052321508184, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=0.9492918096472214, lr=0.42643744392819893
2023-12-06 00:50:09   INFO  epoch: 15/72, acc_iter=60130, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:11, time_cost(all): 13:54:54/2 days, 3:16:20, loss=0.529993124082243, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=2.290018770578832, lr=0.4263234768528311
2023-12-06 00:50:50   INFO  epoch: 15/72, acc_iter=60180, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:52, time_cost(all): 13:55:35/2 days, 4:31:57, loss=0.529933926656302, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=4.87299207231649, lr=0.42620950977746336
2023-12-06 00:51:32   INFO  epoch: 15/72, acc_iter=60230, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:53, time_cost(all): 13:56:17/2 days, 3:26:42, loss=0.529874729230361, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=3.487832748957619, lr=0.42609554270209554
2023-12-06 00:52:14   INFO  epoch: 15/72, acc_iter=60280, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:40, time_cost(all): 13:56:59/2 days, 0:28:20, loss=0.52981553180442, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=4.6063478050854645, lr=0.4259815756267278
2023-12-06 00:52:56   INFO  epoch: 15/72, acc_iter=60330, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:55, time_cost(all): 13:57:41/2 days, 1:58:43, loss=0.529756334378479, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=4.899277379535177, lr=0.42586760855135997
2023-12-06 00:53:38   INFO  epoch: 15/72, acc_iter=60380, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:04, time_cost(all): 13:58:23/2 days, 3:41:18, loss=0.529697136952538, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=3.128324191029425, lr=0.42575364147599215
2023-12-06 00:54:19   INFO  epoch: 15/72, acc_iter=60430, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:09, time_cost(all): 13:59:04/2 days, 2:08:55, loss=0.529637939526597, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=0.5438537277120236, lr=0.4256396744006244
2023-12-06 00:55:01   INFO  epoch: 15/72, acc_iter=60480, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:47, time_cost(all): 13:59:46/2 days, 0:25:50, loss=0.529578742100656, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=4.157105194740592, lr=0.4255257073252566
2023-12-06 00:55:43   INFO  epoch: 15/72, acc_iter=60530, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:17, time_cost(all): 14:00:28/2 days, 0:01:09, loss=0.529519544674715, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=2.9321853775037985, lr=0.4254117402498888
2023-12-06 00:56:25   INFO  epoch: 15/72, acc_iter=60580, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:54, time_cost(all): 14:01:10/2 days, 4:30:19, loss=0.529460347248774, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=2.7743835316753733, lr=0.425297773174521
2023-12-06 00:57:06   INFO  epoch: 15/72, acc_iter=60630, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:47, time_cost(all): 14:01:51/2 days, 1:45:46, loss=0.529401149822833, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=2.611712421400358, lr=0.4251838060991532
2023-12-06 00:57:48   INFO  epoch: 15/72, acc_iter=60680, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:21, time_cost(all): 14:02:33/2 days, 2:01:46, loss=0.529341952396893, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=0.9139473419433912, lr=0.4250698390237854
2023-12-06 00:58:30   INFO  epoch: 15/72, acc_iter=60730, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:11, time_cost(all): 14:03:15/2 days, 1:30:59, loss=0.529282754970952, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=2.7535346182531417, lr=0.4249558719484176
2023-12-06 00:59:12   INFO  epoch: 15/72, acc_iter=60780, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:11, time_cost(all): 14:03:57/2 days, 1:18:11, loss=0.529223557545011, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=1.461346844628686, lr=0.4248419048730498
2023-12-06 00:59:54   INFO  epoch: 15/72, acc_iter=60830, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:17, time_cost(all): 14:04:39/2 days, 1:11:58, loss=0.52916436011907, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=1.7761724090340438, lr=0.42472793779768203
2023-12-06 01:00:35   INFO  epoch: 15/72, acc_iter=60880, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:42, time_cost(all): 14:05:20/2 days, 4:46:07, loss=0.529105162693129, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=3.776272223855244, lr=0.4246139707223142
2023-12-06 01:01:17   INFO  epoch: 15/72, acc_iter=60930, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:18, time_cost(all): 14:06:02/2 days, 2:47:29, loss=0.529045965267188, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=2.539313548400779, lr=0.4245000036469464
2023-12-06 01:01:59   INFO  epoch: 15/72, acc_iter=60980, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:44, time_cost(all): 14:06:44/2 days, 0:39:12, loss=0.528986767841247, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=1.8104617363857927, lr=0.42438603657157864
2023-12-06 01:02:41   INFO  epoch: 15/72, acc_iter=61030, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:33, time_cost(all): 14:07:26/2 days, 1:37:53, loss=0.528927570415306, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=2.0990889676702786, lr=0.4242720694962108
2023-12-06 01:03:22   INFO  epoch: 15/72, acc_iter=61080, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:54, time_cost(all): 14:08:07/2 days, 3:58:08, loss=0.528868372989365, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=4.288998062122296, lr=0.42415810242084306
2023-12-06 01:04:04   INFO  epoch: 15/72, acc_iter=61130, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:33, time_cost(all): 14:08:49/2 days, 0:08:40, loss=0.528809175563424, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=0.9967932605688101, lr=0.42404413534547525
2023-12-06 01:04:46   INFO  epoch: 15/72, acc_iter=61180, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:37, time_cost(all): 14:09:31/2 days, 1:46:51, loss=0.528749978137483, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=0.600671715261949, lr=0.42393016827010743
2023-12-06 01:05:28   INFO  epoch: 15/72, acc_iter=61230, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:58, time_cost(all): 14:10:13/2 days, 3:04:10, loss=0.528690780711542, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=1.7715322572745436, lr=0.4238162011947397
2023-12-06 01:06:10   INFO  epoch: 15/72, acc_iter=61280, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:09, time_cost(all): 14:10:55/2 days, 0:26:55, loss=0.528631583285601, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.08(1.03), norm=3.15022955125241, lr=0.42370223411937186
2023-12-06 01:06:51   INFO  epoch: 15/72, acc_iter=61330, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:07, time_cost(all): 14:11:36/2 days, 0:29:38, loss=0.52857238585966, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=4.415789586131684, lr=0.4235882670440041
2023-12-06 01:07:33   INFO  epoch: 15/72, acc_iter=61380, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:30, time_cost(all): 14:12:18/2 days, 1:59:05, loss=0.528513188433719, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=4.700046751584454, lr=0.4234742999686363
2023-12-06 01:08:15   INFO  epoch: 15/72, acc_iter=61430, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:11, time_cost(all): 14:13:00/2 days, 2:53:00, loss=0.528453991007778, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=4.3489524301153075, lr=0.42336033289326847
2023-12-06 01:08:57   INFO  epoch: 15/72, acc_iter=61480, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:29, time_cost(all): 14:13:42/2 days, 1:00:43, loss=0.528394793581837, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=3.947027342819672, lr=0.4232463658179007
2023-12-06 01:09:38   INFO  epoch: 15/72, acc_iter=61530, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:39, time_cost(all): 14:14:23/2 days, 1:28:55, loss=0.528335596155897, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=3.2178869517622237, lr=0.4231323987425329
2023-12-06 01:10:20   INFO  epoch: 15/72, acc_iter=61580, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:04, time_cost(all): 14:15:05/2 days, 4:45:11, loss=0.528276398729956, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=4.445820555677203, lr=0.42301843166716513
2023-12-06 01:11:02   INFO  epoch: 15/72, acc_iter=61630, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:20, time_cost(all): 14:15:47/2 days, 3:02:17, loss=0.528217201304015, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=3.7881534489977144, lr=0.4229044645917973
2023-12-06 01:11:44   INFO  epoch: 15/72, acc_iter=61680, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:33, time_cost(all): 14:16:29/2 days, 0:59:00, loss=0.528158003878074, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=4.39379655214714, lr=0.4227904975164295
2023-12-06 01:12:26   INFO  epoch: 15/72, acc_iter=61730, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 14:17:11/2 days, 3:25:33, loss=0.528098806452133, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=4.973577497081481, lr=0.4226765304410617
2023-12-06 01:13:07   INFO  epoch: 15/72, acc_iter=61780, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 14:17:52/2 days, 1:29:58, loss=0.528039609026192, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=3.2387420223935384, lr=0.4225625633656939
2023-12-06 01:13:49   INFO  epoch: 16/72, acc_iter=61842, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:02, time_cost(all): 14:18:34/2 days, 2:43:36, loss=0.527966204218025, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=4.442231432976952, lr=0.42242124419223787
2023-12-06 01:14:31   INFO  epoch: 16/72, acc_iter=61892, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:32, time_cost(all): 14:19:16/2 days, 3:42:47, loss=0.527907006792084, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=1.2585701443652078, lr=0.42230727711687005
2023-12-06 01:15:13   INFO  epoch: 16/72, acc_iter=61942, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:59, time_cost(all): 14:19:58/2 days, 4:41:40, loss=0.527847809366143, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.5490276615653977, lr=0.4221933100415023
2023-12-06 01:15:55   INFO  epoch: 16/72, acc_iter=61992, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:23, time_cost(all): 14:20:40/2 days, 2:03:59, loss=0.527788611940202, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=4.850710761163726, lr=0.4220793429661345
2023-12-06 01:16:36   INFO  epoch: 16/72, acc_iter=62042, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:41, time_cost(all): 14:21:21/2 days, 3:11:30, loss=0.527729414514261, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=4.025858681512844, lr=0.42196537589076666
2023-12-06 01:17:18   INFO  epoch: 16/72, acc_iter=62092, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:06, time_cost(all): 14:22:03/2 days, 4:01:02, loss=0.52767021708832, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=3.1308295639787618, lr=0.42185140881539884
2023-12-06 01:18:00   INFO  epoch: 16/72, acc_iter=62142, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:47, time_cost(all): 14:22:45/2 days, 0:40:12, loss=0.527611019662379, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=4.6506402577593455, lr=0.4217374417400311
2023-12-06 01:18:42   INFO  epoch: 16/72, acc_iter=62192, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:15, time_cost(all): 14:23:27/2 days, 3:20:08, loss=0.527551822236439, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=4.3518229504415595, lr=0.42162347466466327
2023-12-06 01:19:23   INFO  epoch: 16/72, acc_iter=62242, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:19, time_cost(all): 14:24:08/2 days, 1:53:55, loss=0.527492624810498, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=1.0439654271652372, lr=0.4215095075892955
2023-12-06 01:20:05   INFO  epoch: 16/72, acc_iter=62292, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:44:38, time_cost(all): 14:24:50/2 days, 1:30:57, loss=0.527433427384557, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=1.5434822899834204, lr=0.4213955405139277
2023-12-06 01:20:47   INFO  epoch: 16/72, acc_iter=62342, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:59, time_cost(all): 14:25:32/2 days, 4:11:33, loss=0.527374229958616, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=2.463182786934778, lr=0.4212815734385599
2023-12-06 01:21:29   INFO  epoch: 16/72, acc_iter=62392, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:41, time_cost(all): 14:26:14/2 days, 4:25:54, loss=0.527315032532675, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=0.6425282250511786, lr=0.4211676063631921
2023-12-06 01:22:11   INFO  epoch: 16/72, acc_iter=62442, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:28, time_cost(all): 14:26:56/2 days, 3:01:41, loss=0.527255835106734, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=2.7388538797397834, lr=0.4210536392878243
2023-12-06 01:22:52   INFO  epoch: 16/72, acc_iter=62492, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:46:10, time_cost(all): 14:27:37/2 days, 2:43:56, loss=0.527196637680793, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=3.145863820403858, lr=0.42093967221245654
2023-12-06 01:23:34   INFO  epoch: 16/72, acc_iter=62542, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:25, time_cost(all): 14:28:19/2 days, 4:13:23, loss=0.527137440254852, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=4.832397591614348, lr=0.4208257051370887
2023-12-06 01:24:16   INFO  epoch: 16/72, acc_iter=62592, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:21, time_cost(all): 14:29:01/1 day, 23:48:54, loss=0.527078242828911, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=3.861810875690558, lr=0.4207117380617209
2023-12-06 01:24:58   INFO  epoch: 16/72, acc_iter=62642, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:55, time_cost(all): 14:29:43/2 days, 1:47:39, loss=0.52701904540297, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=4.266342737281516, lr=0.42059777098635315
2023-12-06 01:25:39   INFO  epoch: 16/72, acc_iter=62692, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:10, time_cost(all): 14:30:24/2 days, 1:30:53, loss=0.526959847977029, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=3.175761244449951, lr=0.42048380391098533
2023-12-06 01:26:21   INFO  epoch: 16/72, acc_iter=62742, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:44, time_cost(all): 14:31:06/2 days, 0:38:41, loss=0.526900650551088, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=3.562418043734347, lr=0.42036983683561757
2023-12-06 01:27:03   INFO  epoch: 16/72, acc_iter=62792, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:55, time_cost(all): 14:31:48/2 days, 4:10:10, loss=0.526841453125147, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=3.329827865915157, lr=0.42025586976024976
2023-12-06 01:27:45   INFO  epoch: 16/72, acc_iter=62842, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:46, time_cost(all): 14:32:30/2 days, 0:48:24, loss=0.526782255699206, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=1.1981625139258745, lr=0.42014190268488194
2023-12-06 01:28:27   INFO  epoch: 16/72, acc_iter=62892, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:01, time_cost(all): 14:33:12/2 days, 3:00:22, loss=0.526723058273265, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=1.3940734914217487, lr=0.4200279356095142
2023-12-06 01:29:08   INFO  epoch: 16/72, acc_iter=62942, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:25, time_cost(all): 14:33:53/1 day, 23:44:06, loss=0.526663860847324, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=4.337550120505426, lr=0.41991396853414636
2023-12-06 01:29:50   INFO  epoch: 16/72, acc_iter=62992, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:42, time_cost(all): 14:34:35/2 days, 3:21:25, loss=0.526604663421383, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=4.307514199375559, lr=0.4198000014587786
2023-12-06 01:30:32   INFO  epoch: 16/72, acc_iter=63042, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:05, time_cost(all): 14:35:17/2 days, 3:07:44, loss=0.526545465995442, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=2.109343755915319, lr=0.4196860343834108
2023-12-06 01:31:14   INFO  epoch: 16/72, acc_iter=63092, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:30, time_cost(all): 14:35:59/2 days, 3:50:27, loss=0.526486268569502, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=3.584322516610129, lr=0.419572067308043
2023-12-06 01:31:55   INFO  epoch: 16/72, acc_iter=63142, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:20, time_cost(all): 14:36:40/2 days, 1:05:12, loss=0.526427071143561, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=4.321337952075632, lr=0.4194581002326752
2023-12-06 01:32:37   INFO  epoch: 16/72, acc_iter=63192, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:09, time_cost(all): 14:37:22/2 days, 3:54:16, loss=0.52636787371762, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=1.0103629166969925, lr=0.4193441331573074
2023-12-06 01:33:19   INFO  epoch: 16/72, acc_iter=63242, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:06, time_cost(all): 14:38:04/1 day, 23:31:09, loss=0.526308676291679, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=2.258812013729413, lr=0.41923016608193964
2023-12-06 01:34:01   INFO  epoch: 16/72, acc_iter=63292, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:31, time_cost(all): 14:38:46/2 days, 3:06:18, loss=0.526249478865738, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=1.2657782107540734, lr=0.4191161990065718
2023-12-06 01:34:43   INFO  epoch: 16/72, acc_iter=63342, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:43, time_cost(all): 14:39:28/2 days, 2:39:48, loss=0.526190281439797, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=1.5589814855351611, lr=0.419002231931204
2023-12-06 01:35:24   INFO  epoch: 16/72, acc_iter=63392, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:30, time_cost(all): 14:40:09/2 days, 2:07:03, loss=0.526131084013856, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=3.3919593716659264, lr=0.4188882648558362
2023-12-06 01:36:06   INFO  epoch: 16/72, acc_iter=63442, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:02, time_cost(all): 14:40:51/2 days, 3:40:09, loss=0.526071886587915, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=3.6890334707698136, lr=0.41877429778046843
2023-12-06 01:36:48   INFO  epoch: 16/72, acc_iter=63492, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:36, time_cost(all): 14:41:33/2 days, 1:58:42, loss=0.526012689161974, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.087089008511062, lr=0.41866033070510067
2023-12-06 01:37:30   INFO  epoch: 16/72, acc_iter=63542, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:54, time_cost(all): 14:42:15/1 day, 23:27:16, loss=0.525953491736033, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=2.9089233534680377, lr=0.41854636362973285
2023-12-06 01:38:11   INFO  epoch: 16/72, acc_iter=63592, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:28, time_cost(all): 14:42:56/2 days, 3:16:30, loss=0.525894294310092, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=1.451978805004007, lr=0.41843239655436504
2023-12-06 01:38:53   INFO  epoch: 16/72, acc_iter=63642, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:35, time_cost(all): 14:43:38/1 day, 23:28:36, loss=0.525835096884151, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=3.2969485725637657, lr=0.4183184294789972
2023-12-06 01:39:35   INFO  epoch: 16/72, acc_iter=63692, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:36, time_cost(all): 14:44:20/2 days, 1:43:07, loss=0.52577589945821, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=3.33737200071511, lr=0.41820446240362946
2023-12-06 01:40:17   INFO  epoch: 16/72, acc_iter=63742, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:40, time_cost(all): 14:45:02/1 day, 23:30:25, loss=0.525716702032269, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=1.63033137400439, lr=0.41809049532826165
2023-12-06 01:40:59   INFO  epoch: 16/72, acc_iter=63792, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:36, time_cost(all): 14:45:44/1 day, 23:41:46, loss=0.525657504606328, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=1.3567989032716794, lr=0.4179765282528939
2023-12-06 01:41:40   INFO  epoch: 16/72, acc_iter=63842, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:20, time_cost(all): 14:46:25/2 days, 2:27:56, loss=0.525598307180387, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=2.062217426059676, lr=0.41786256117752607
2023-12-06 01:42:22   INFO  epoch: 16/72, acc_iter=63892, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:20, time_cost(all): 14:47:07/2 days, 2:56:27, loss=0.525539109754447, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=3.047705202319506, lr=0.41774859410215825
2023-12-06 01:43:04   INFO  epoch: 16/72, acc_iter=63942, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:43, time_cost(all): 14:47:49/2 days, 2:02:17, loss=0.525479912328506, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=4.111579980005321, lr=0.4176346270267905
2023-12-06 01:43:46   INFO  epoch: 16/72, acc_iter=63992, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:46, time_cost(all): 14:48:31/2 days, 2:25:44, loss=0.525420714902565, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=2.6736976781979442, lr=0.4175206599514227
2023-12-06 01:44:27   INFO  epoch: 16/72, acc_iter=64042, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:40, time_cost(all): 14:49:12/2 days, 1:26:49, loss=0.525361517476624, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=2.7188722851650424, lr=0.4174066928760549
2023-12-06 01:45:09   INFO  epoch: 16/72, acc_iter=64092, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:23, time_cost(all): 14:49:54/2 days, 3:25:52, loss=0.525302320050683, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=1.517912955316485, lr=0.4172927258006871
2023-12-06 01:45:51   INFO  epoch: 16/72, acc_iter=64142, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:01, time_cost(all): 14:50:36/2 days, 0:37:09, loss=0.525243122624742, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=0.5152174969941656, lr=0.4171787587253193
2023-12-06 01:46:33   INFO  epoch: 16/72, acc_iter=64192, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:20, time_cost(all): 14:51:18/2 days, 3:42:22, loss=0.525183925198801, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=1.7006553426953062, lr=0.4170647916499515
2023-12-06 01:47:15   INFO  epoch: 16/72, acc_iter=64242, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:29, time_cost(all): 14:52:00/2 days, 1:14:54, loss=0.52512472777286, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=3.6481654686091693, lr=0.4169508245745837
2023-12-06 01:47:56   INFO  epoch: 16/72, acc_iter=64292, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:18, time_cost(all): 14:52:41/1 day, 23:11:12, loss=0.525065530346919, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=2.7504604343667203, lr=0.41683685749921595
2023-12-06 01:48:38   INFO  epoch: 16/72, acc_iter=64342, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:41, time_cost(all): 14:53:23/1 day, 23:29:24, loss=0.525006332920978, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.862973934426482, lr=0.41672289042384814
2023-12-06 01:49:20   INFO  epoch: 16/72, acc_iter=64392, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:59, time_cost(all): 14:54:05/2 days, 0:46:40, loss=0.524947135495037, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=1.2147256886901618, lr=0.4166089233484803
2023-12-06 01:50:02   INFO  epoch: 16/72, acc_iter=64442, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:18, time_cost(all): 14:54:47/2 days, 0:53:48, loss=0.524887938069096, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=1.048406310476277, lr=0.41649495627311256
2023-12-06 01:50:44   INFO  epoch: 16/72, acc_iter=64492, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:09, time_cost(all): 14:55:29/2 days, 1:12:49, loss=0.524828740643155, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=1.4440260149108304, lr=0.41638098919774474
2023-12-06 01:51:25   INFO  epoch: 16/72, acc_iter=64542, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:36, time_cost(all): 14:56:10/2 days, 3:56:44, loss=0.524769543217214, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=1.9029307916801212, lr=0.416267022122377
2023-12-06 01:52:07   INFO  epoch: 16/72, acc_iter=64592, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:47, time_cost(all): 14:56:52/2 days, 2:23:32, loss=0.524710345791273, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=3.3748075292165582, lr=0.41615305504700917
2023-12-06 01:52:49   INFO  epoch: 16/72, acc_iter=64642, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:34, time_cost(all): 14:57:34/2 days, 0:20:52, loss=0.524651148365332, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=2.930768531113674, lr=0.41603908797164135
2023-12-06 01:53:31   INFO  epoch: 16/72, acc_iter=64692, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:14:01, time_cost(all): 14:58:16/1 day, 23:12:30, loss=0.524591950939391, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=4.793493091826212, lr=0.41592512089627354
2023-12-06 01:54:12   INFO  epoch: 16/72, acc_iter=64742, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:54, time_cost(all): 14:58:57/1 day, 23:42:28, loss=0.524532753513451, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=0.6617463578271388, lr=0.4158111538209058
2023-12-06 01:54:54   INFO  epoch: 16/72, acc_iter=64792, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:34, time_cost(all): 14:59:39/2 days, 0:57:31, loss=0.52447355608751, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=1.850429871571114, lr=0.41569718674553796
2023-12-06 01:55:36   INFO  epoch: 16/72, acc_iter=64842, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:15, time_cost(all): 15:00:21/2 days, 0:32:20, loss=0.524414358661569, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=2.0920642565571628, lr=0.4155832196701702
2023-12-06 01:56:18   INFO  epoch: 16/72, acc_iter=64892, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:11:06, time_cost(all): 15:01:03/2 days, 0:12:52, loss=0.524355161235628, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=2.0307376509610267, lr=0.4154692525948024
2023-12-06 01:57:00   INFO  epoch: 16/72, acc_iter=64942, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:11, time_cost(all): 15:01:45/2 days, 1:29:36, loss=0.524295963809687, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=4.104271764131591, lr=0.41535528551943457
2023-12-06 01:57:41   INFO  epoch: 16/72, acc_iter=64992, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:15, time_cost(all): 15:02:26/2 days, 1:11:03, loss=0.524236766383746, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=3.9344318707646027, lr=0.4152413184440668
2023-12-06 01:58:23   INFO  epoch: 16/72, acc_iter=65042, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:36, time_cost(all): 15:03:08/2 days, 2:30:39, loss=0.524177568957805, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=1.1785089701866156, lr=0.415127351368699
2023-12-06 01:59:05   INFO  epoch: 16/72, acc_iter=65092, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:03, time_cost(all): 15:03:50/1 day, 23:25:24, loss=0.524118371531864, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=3.8517388957083507, lr=0.41501338429333123
2023-12-06 01:59:47   INFO  epoch: 16/72, acc_iter=65142, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:00, time_cost(all): 15:04:32/2 days, 2:35:07, loss=0.524059174105923, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=1.103816062971957, lr=0.4148994172179634
2023-12-06 02:00:28   INFO  epoch: 16/72, acc_iter=65192, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:16, time_cost(all): 15:05:13/2 days, 1:36:52, loss=0.523999976679982, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=4.205123651844632, lr=0.4147854501425956
2023-12-06 02:01:10   INFO  epoch: 16/72, acc_iter=65242, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:45, time_cost(all): 15:05:55/1 day, 23:06:17, loss=0.523940779254041, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=4.952039264479809, lr=0.41467148306722784
2023-12-06 02:01:52   INFO  epoch: 16/72, acc_iter=65292, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:00, time_cost(all): 15:06:37/2 days, 3:45:29, loss=0.5238815818281, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=1.3126099618309008, lr=0.41455751599186
2023-12-06 02:02:34   INFO  epoch: 16/72, acc_iter=65342, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:26, time_cost(all): 15:07:19/2 days, 3:27:06, loss=0.523822384402159, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=3.183730021528094, lr=0.41444354891649227
2023-12-06 02:03:16   INFO  epoch: 16/72, acc_iter=65392, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:35, time_cost(all): 15:08:01/1 day, 23:16:01, loss=0.523763186976218, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=3.1159223667085065, lr=0.41432958184112445
2023-12-06 02:03:57   INFO  epoch: 16/72, acc_iter=65442, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:00, time_cost(all): 15:08:42/2 days, 2:01:29, loss=0.523703989550277, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=2.8096337418536783, lr=0.41421561476575663
2023-12-06 02:04:39   INFO  epoch: 16/72, acc_iter=65492, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:18, time_cost(all): 15:09:24/1 day, 23:28:19, loss=0.523644792124336, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=2.0410845497719703, lr=0.4141016476903888
2023-12-06 02:05:21   INFO  epoch: 16/72, acc_iter=65542, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:36, time_cost(all): 15:10:06/2 days, 1:44:24, loss=0.523585594698395, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=2.2222202262681323, lr=0.41398768061502106
2023-12-06 02:06:03   INFO  epoch: 16/72, acc_iter=65592, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:49, time_cost(all): 15:10:48/2 days, 1:43:08, loss=0.523526397272455, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.866814981616187, lr=0.41387371353965324
2023-12-06 02:06:44   INFO  epoch: 16/72, acc_iter=65642, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 15:11:29/2 days, 0:24:37, loss=0.523467199846514, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=3.542887353116349, lr=0.4137597464642855
2023-12-06 02:07:26   INFO  epoch: 17/72, acc_iter=65704, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:22, time_cost(all): 15:12:11/2 days, 0:14:26, loss=0.523393795038347, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=2.918054073656887, lr=0.4136184272908294
2023-12-06 02:08:08   INFO  epoch: 17/72, acc_iter=65754, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:13, time_cost(all): 15:12:53/2 days, 0:19:47, loss=0.523334597612406, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=2.7368071435598247, lr=0.4135044602154616
2023-12-06 02:08:50   INFO  epoch: 17/72, acc_iter=65804, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:11, time_cost(all): 15:13:35/2 days, 2:18:37, loss=0.523275400186465, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=1.3469904159397774, lr=0.4133904931400938
2023-12-06 02:09:32   INFO  epoch: 17/72, acc_iter=65854, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:31, time_cost(all): 15:14:17/2 days, 3:14:12, loss=0.523216202760524, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=2.684580821276526, lr=0.41327652606472604
2023-12-06 02:10:13   INFO  epoch: 17/72, acc_iter=65904, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:58, time_cost(all): 15:14:58/1 day, 23:27:38, loss=0.523157005334583, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=4.181929581559325, lr=0.4131625589893582
2023-12-06 02:10:55   INFO  epoch: 17/72, acc_iter=65954, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:34, time_cost(all): 15:15:40/1 day, 23:12:19, loss=0.523097807908642, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=2.557552063641414, lr=0.41304859191399046
2023-12-06 02:11:37   INFO  epoch: 17/72, acc_iter=66004, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:43, time_cost(all): 15:16:22/2 days, 1:18:02, loss=0.523038610482701, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=0.7937160794484793, lr=0.41293462483862264
2023-12-06 02:12:19   INFO  epoch: 17/72, acc_iter=66054, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:50:20, time_cost(all): 15:17:04/2 days, 0:28:50, loss=0.52297941305676, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=3.6584087406316073, lr=0.41282065776325483
2023-12-06 02:13:00   INFO  epoch: 17/72, acc_iter=66104, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:39, time_cost(all): 15:17:45/2 days, 0:26:42, loss=0.522920215630819, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=1.4166456640201393, lr=0.412706690687887
2023-12-06 02:13:42   INFO  epoch: 17/72, acc_iter=66154, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:44, time_cost(all): 15:18:27/2 days, 1:16:49, loss=0.522861018204878, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=3.3520332586337367, lr=0.41259272361251925
2023-12-06 02:14:24   INFO  epoch: 17/72, acc_iter=66204, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:05, time_cost(all): 15:19:09/2 days, 2:26:23, loss=0.522801820778937, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=2.4279852460119775, lr=0.41247875653715144
2023-12-06 02:15:06   INFO  epoch: 17/72, acc_iter=66254, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:30, time_cost(all): 15:19:51/2 days, 2:57:58, loss=0.522742623352996, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=1.7263375890906807, lr=0.4123647894617837
2023-12-06 02:15:48   INFO  epoch: 17/72, acc_iter=66304, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:21, time_cost(all): 15:20:33/2 days, 2:59:15, loss=0.522683425927056, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=2.193374566182, lr=0.41225082238641586
2023-12-06 02:16:29   INFO  epoch: 17/72, acc_iter=66354, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:49, time_cost(all): 15:21:14/2 days, 2:28:49, loss=0.522624228501115, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=4.12753522449416, lr=0.41213685531104804
2023-12-06 02:17:11   INFO  epoch: 17/72, acc_iter=66404, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:53, time_cost(all): 15:21:56/1 day, 23:20:37, loss=0.522565031075174, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=0.7359580624660688, lr=0.4120228882356803
2023-12-06 02:17:53   INFO  epoch: 17/72, acc_iter=66454, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:19, time_cost(all): 15:22:38/2 days, 0:30:49, loss=0.522505833649233, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=0.8365753361249666, lr=0.41190892116031247
2023-12-06 02:18:35   INFO  epoch: 17/72, acc_iter=66504, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:34, time_cost(all): 15:23:20/2 days, 0:11:51, loss=0.522446636223292, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=2.4402545697836384, lr=0.4117949540849447
2023-12-06 02:19:16   INFO  epoch: 17/72, acc_iter=66554, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:07, time_cost(all): 15:24:01/1 day, 23:20:21, loss=0.522387438797351, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=0.9303856704956683, lr=0.4116809870095769
2023-12-06 02:19:58   INFO  epoch: 17/72, acc_iter=66604, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:41, time_cost(all): 15:24:43/2 days, 3:20:08, loss=0.52232824137141, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=3.5766873211843833, lr=0.4115670199342091
2023-12-06 02:20:40   INFO  epoch: 17/72, acc_iter=66654, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:22, time_cost(all): 15:25:25/2 days, 0:11:30, loss=0.522269043945469, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=3.752583811899067, lr=0.4114530528588413
2023-12-06 02:21:22   INFO  epoch: 17/72, acc_iter=66704, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:43, time_cost(all): 15:26:07/1 day, 23:38:15, loss=0.522209846519528, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=0.7810484772386592, lr=0.4113390857834735
2023-12-06 02:22:04   INFO  epoch: 17/72, acc_iter=66754, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:44, time_cost(all): 15:26:49/2 days, 2:16:12, loss=0.522150649093587, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=4.062446939554184, lr=0.41122511870810574
2023-12-06 02:22:45   INFO  epoch: 17/72, acc_iter=66804, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:35, time_cost(all): 15:27:30/2 days, 3:00:15, loss=0.522091451667646, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=1.646931595853904, lr=0.4111111516327379
2023-12-06 02:23:27   INFO  epoch: 17/72, acc_iter=66854, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:42, time_cost(all): 15:28:12/1 day, 23:40:47, loss=0.522032254241705, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=2.366631969104785, lr=0.4109971845573701
2023-12-06 02:24:09   INFO  epoch: 17/72, acc_iter=66904, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:26, time_cost(all): 15:28:54/1 day, 23:13:19, loss=0.521973056815764, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=2.1006150464825604, lr=0.41088321748200235
2023-12-06 02:24:51   INFO  epoch: 17/72, acc_iter=66954, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:04, time_cost(all): 15:29:36/2 days, 2:46:48, loss=0.521913859389823, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=3.7761714017118537, lr=0.41076925040663453
2023-12-06 02:25:33   INFO  epoch: 17/72, acc_iter=67004, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:04, time_cost(all): 15:30:18/1 day, 23:20:53, loss=0.521854661963882, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=1.9815471748611104, lr=0.4106552833312668
2023-12-06 02:26:14   INFO  epoch: 17/72, acc_iter=67054, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:59, time_cost(all): 15:30:59/2 days, 2:11:09, loss=0.521795464537941, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=3.960292176951923, lr=0.41054131625589896
2023-12-06 02:26:56   INFO  epoch: 17/72, acc_iter=67104, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:23, time_cost(all): 15:31:41/1 day, 23:48:18, loss=0.521736267112, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=3.15769461974135, lr=0.41042734918053114
2023-12-06 02:27:38   INFO  epoch: 17/72, acc_iter=67154, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:20, time_cost(all): 15:32:23/2 days, 2:34:03, loss=0.52167706968606, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=2.166018164737798, lr=0.4103133821051634
2023-12-06 02:28:20   INFO  epoch: 17/72, acc_iter=67204, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:13, time_cost(all): 15:33:05/2 days, 1:52:08, loss=0.521617872260119, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=4.81876917648224, lr=0.41019941502979557
2023-12-06 02:29:01   INFO  epoch: 17/72, acc_iter=67254, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:57, time_cost(all): 15:33:46/2 days, 2:29:42, loss=0.521558674834178, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=2.0758953321416165, lr=0.4100854479544278
2023-12-06 02:29:43   INFO  epoch: 17/72, acc_iter=67304, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:18, time_cost(all): 15:34:28/2 days, 1:47:00, loss=0.521499477408237, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=2.787739608524517, lr=0.40997148087906
2023-12-06 02:30:25   INFO  epoch: 17/72, acc_iter=67354, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:37, time_cost(all): 15:35:10/2 days, 2:28:47, loss=0.521440279982296, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=4.490123409057567, lr=0.4098575138036922
2023-12-06 02:31:07   INFO  epoch: 17/72, acc_iter=67404, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:43, time_cost(all): 15:35:52/2 days, 0:29:49, loss=0.521381082556355, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=4.603195008468919, lr=0.40974354672832436
2023-12-06 02:31:49   INFO  epoch: 17/72, acc_iter=67454, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:19, time_cost(all): 15:36:34/2 days, 3:07:47, loss=0.521321885130414, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.679926497844936, lr=0.4096295796529566
2023-12-06 02:32:30   INFO  epoch: 17/72, acc_iter=67504, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:19, time_cost(all): 15:37:15/2 days, 2:14:15, loss=0.521262687704473, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=1.7270024011865193, lr=0.40951561257758884
2023-12-06 02:33:12   INFO  epoch: 17/72, acc_iter=67554, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:23, time_cost(all): 15:37:57/2 days, 2:02:56, loss=0.521203490278532, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=3.9252814113124623, lr=0.409401645502221
2023-12-06 02:33:54   INFO  epoch: 17/72, acc_iter=67604, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:34, time_cost(all): 15:38:39/2 days, 1:30:12, loss=0.521144292852591, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=3.1703187232800736, lr=0.4092876784268532
2023-12-06 02:34:36   INFO  epoch: 17/72, acc_iter=67654, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:10, time_cost(all): 15:39:21/2 days, 2:34:13, loss=0.52108509542665, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=3.083286423727555, lr=0.4091737113514854
2023-12-06 02:35:17   INFO  epoch: 17/72, acc_iter=67704, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:16, time_cost(all): 15:40:02/2 days, 1:35:45, loss=0.521025898000709, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=1.7818442910485244, lr=0.40905974427611763
2023-12-06 02:35:59   INFO  epoch: 17/72, acc_iter=67754, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:38, time_cost(all): 15:40:44/2 days, 1:28:47, loss=0.520966700574768, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=2.0608299038391333, lr=0.4089457772007498
2023-12-06 02:36:41   INFO  epoch: 17/72, acc_iter=67804, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:05, time_cost(all): 15:41:26/2 days, 1:03:30, loss=0.520907503148827, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=3.376620859817977, lr=0.40883181012538206
2023-12-06 02:37:23   INFO  epoch: 17/72, acc_iter=67854, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:49, time_cost(all): 15:42:08/2 days, 0:24:15, loss=0.520848305722886, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=3.0253656695552986, lr=0.40871784305001424
2023-12-06 02:38:05   INFO  epoch: 17/72, acc_iter=67904, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:23, time_cost(all): 15:42:50/1 day, 23:59:48, loss=0.520789108296945, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=0.527491239198532, lr=0.4086038759746464
2023-12-06 02:38:46   INFO  epoch: 17/72, acc_iter=67954, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:36, time_cost(all): 15:43:31/2 days, 0:25:00, loss=0.520729910871005, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=4.656093535682431, lr=0.40848990889927866
2023-12-06 02:39:28   INFO  epoch: 17/72, acc_iter=68004, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:57, time_cost(all): 15:44:13/2 days, 0:31:50, loss=0.520670713445064, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=2.3785964999881486, lr=0.40837594182391085
2023-12-06 02:40:10   INFO  epoch: 17/72, acc_iter=68054, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:38, time_cost(all): 15:44:55/2 days, 2:37:56, loss=0.520611516019123, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=1.8828727740020692, lr=0.4082619747485431
2023-12-06 02:40:52   INFO  epoch: 17/72, acc_iter=68104, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:07, time_cost(all): 15:45:37/2 days, 3:02:40, loss=0.520552318593182, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=4.540688060358231, lr=0.4081480076731753
2023-12-06 02:41:33   INFO  epoch: 17/72, acc_iter=68154, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:44, time_cost(all): 15:46:18/2 days, 0:06:50, loss=0.520493121167241, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=1.1995391956633803, lr=0.40803404059780746
2023-12-06 02:42:15   INFO  epoch: 17/72, acc_iter=68204, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:58, time_cost(all): 15:47:00/2 days, 1:29:53, loss=0.5204339237413, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=4.586149353209588, lr=0.4079200735224397
2023-12-06 02:42:57   INFO  epoch: 17/72, acc_iter=68254, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:42, time_cost(all): 15:47:42/1 day, 22:34:49, loss=0.520374726315359, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=1.8497714017025713, lr=0.4078061064470719
2023-12-06 02:43:39   INFO  epoch: 17/72, acc_iter=68304, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:25, time_cost(all): 15:48:24/1 day, 22:45:18, loss=0.520315528889418, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=1.023313679219562, lr=0.4076921393717041
2023-12-06 02:44:21   INFO  epoch: 17/72, acc_iter=68354, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:25, time_cost(all): 15:49:06/2 days, 0:16:49, loss=0.520256331463477, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=4.2315796074660765, lr=0.4075781722963363
2023-12-06 02:45:02   INFO  epoch: 17/72, acc_iter=68404, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:57, time_cost(all): 15:49:47/1 day, 23:04:55, loss=0.520197134037536, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=4.861620362807039, lr=0.4074642052209685
2023-12-06 02:45:44   INFO  epoch: 17/72, acc_iter=68454, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:10, time_cost(all): 15:50:29/2 days, 0:12:31, loss=0.520137936611595, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=1.939676717999911, lr=0.4073502381456007
2023-12-06 02:46:26   INFO  epoch: 17/72, acc_iter=68504, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:14, time_cost(all): 15:51:11/2 days, 0:50:43, loss=0.520078739185654, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=1.8176472915749164, lr=0.4072362710702329
2023-12-06 02:47:08   INFO  epoch: 17/72, acc_iter=68554, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:35, time_cost(all): 15:51:53/1 day, 23:47:13, loss=0.520019541759713, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=3.4417890931008324, lr=0.4071223039948651
2023-12-06 02:47:49   INFO  epoch: 17/72, acc_iter=68604, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:20, time_cost(all): 15:52:34/2 days, 2:03:07, loss=0.519960344333772, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=4.715461063226453, lr=0.40700833691949734
2023-12-06 02:48:31   INFO  epoch: 17/72, acc_iter=68654, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:07, time_cost(all): 15:53:16/1 day, 23:11:42, loss=0.519901146907831, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=2.9604563497865426, lr=0.4068943698441295
2023-12-06 02:49:13   INFO  epoch: 17/72, acc_iter=68704, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:30, time_cost(all): 15:53:58/2 days, 1:42:11, loss=0.51984194948189, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=0.7201309562988338, lr=0.4067804027687617
2023-12-06 02:49:55   INFO  epoch: 17/72, acc_iter=68754, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:56, time_cost(all): 15:54:40/1 day, 22:15:16, loss=0.519782752055949, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=2.4323588696939855, lr=0.40666643569339395
2023-12-06 02:50:37   INFO  epoch: 17/72, acc_iter=68804, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:47, time_cost(all): 15:55:22/2 days, 1:09:55, loss=0.519723554630009, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=2.5933305460631524, lr=0.40655246861802613
2023-12-06 02:51:18   INFO  epoch: 17/72, acc_iter=68854, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:03, time_cost(all): 15:56:03/1 day, 23:49:28, loss=0.519664357204068, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=2.8797840916349973, lr=0.40643850154265837
2023-12-06 02:52:00   INFO  epoch: 17/72, acc_iter=68904, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:25, time_cost(all): 15:56:45/2 days, 2:07:36, loss=0.519605159778127, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=2.0143566600054026, lr=0.40632453446729055
2023-12-06 02:52:42   INFO  epoch: 17/72, acc_iter=68954, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:44, time_cost(all): 15:57:27/1 day, 23:35:41, loss=0.519545962352186, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=0.6852245109183008, lr=0.40621056739192274
2023-12-06 02:53:24   INFO  epoch: 17/72, acc_iter=69004, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:00, time_cost(all): 15:58:09/1 day, 23:14:50, loss=0.519486764926245, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=2.341308769117722, lr=0.406096600316555
2023-12-06 02:54:05   INFO  epoch: 17/72, acc_iter=69054, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:40, time_cost(all): 15:58:50/2 days, 2:57:36, loss=0.519427567500304, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=4.936616080003099, lr=0.40598263324118716
2023-12-06 02:54:47   INFO  epoch: 17/72, acc_iter=69104, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:40, time_cost(all): 15:59:32/1 day, 22:17:47, loss=0.519368370074363, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=2.5366539900564566, lr=0.4058686661658194
2023-12-06 02:55:29   INFO  epoch: 17/72, acc_iter=69154, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:03, time_cost(all): 16:00:14/1 day, 22:30:55, loss=0.519309172648422, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=3.9799184414994064, lr=0.4057546990904516
2023-12-06 02:56:11   INFO  epoch: 17/72, acc_iter=69204, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:19, time_cost(all): 16:00:56/1 day, 22:18:03, loss=0.519249975222481, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=3.375300010341684, lr=0.40564073201508377
2023-12-06 02:56:53   INFO  epoch: 17/72, acc_iter=69254, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:43, time_cost(all): 16:01:38/2 days, 1:31:27, loss=0.51919077779654, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.801225296915028, lr=0.405526764939716
2023-12-06 02:57:34   INFO  epoch: 17/72, acc_iter=69304, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:55, time_cost(all): 16:02:19/1 day, 23:25:50, loss=0.519131580370599, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=1.729184517102707, lr=0.4054127978643482
2023-12-06 02:58:16   INFO  epoch: 17/72, acc_iter=69354, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:15, time_cost(all): 16:03:01/1 day, 23:35:58, loss=0.519072382944658, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=1.180995305298499, lr=0.4052988307889804
2023-12-06 02:58:58   INFO  epoch: 17/72, acc_iter=69404, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:33, time_cost(all): 16:03:43/2 days, 0:15:10, loss=0.519013185518717, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=2.9922389754914898, lr=0.4051848637136126
2023-12-06 02:59:40   INFO  epoch: 17/72, acc_iter=69454, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 16:04:25/1 day, 22:04:17, loss=0.518953988092776, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.868061851891019, lr=0.4050708966382448
2023-12-06 03:00:22   INFO  epoch: 17/72, acc_iter=69504, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 16:05:07/2 days, 1:48:02, loss=0.518894790666835, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=4.232405882927987, lr=0.404956929562877
2023-12-06 03:01:03   INFO  epoch: 18/72, acc_iter=69566, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:39, time_cost(all): 16:05:48/1 day, 23:31:18, loss=0.518821385858669, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=1.6577750109337448, lr=0.40481561038942093
2023-12-06 03:01:45   INFO  epoch: 18/72, acc_iter=69616, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:39, time_cost(all): 16:06:30/1 day, 23:23:54, loss=0.518762188432728, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=0.8540264203408277, lr=0.40470164331405317
2023-12-06 03:02:27   INFO  epoch: 18/72, acc_iter=69666, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:32, time_cost(all): 16:07:12/1 day, 23:03:52, loss=0.518702991006787, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=4.233980497608203, lr=0.40458767623868536
2023-12-06 03:03:09   INFO  epoch: 18/72, acc_iter=69716, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:15, time_cost(all): 16:07:54/1 day, 23:40:55, loss=0.518643793580846, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=2.58411726541517, lr=0.4044737091633176
2023-12-06 03:03:50   INFO  epoch: 18/72, acc_iter=69766, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:51, time_cost(all): 16:08:35/2 days, 1:24:23, loss=0.518584596154905, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=1.7388527632343522, lr=0.4043597420879498
2023-12-06 03:04:32   INFO  epoch: 18/72, acc_iter=69816, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:58, time_cost(all): 16:09:17/1 day, 23:09:14, loss=0.518525398728964, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.04(1.03), norm=3.9007171790215, lr=0.40424577501258196
2023-12-06 03:05:14   INFO  epoch: 18/72, acc_iter=69866, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:32, time_cost(all): 16:09:59/2 days, 0:50:19, loss=0.518466201303023, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=4.811151566598163, lr=0.40413180793721415
2023-12-06 03:05:56   INFO  epoch: 18/72, acc_iter=69916, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:46, time_cost(all): 16:10:41/2 days, 1:08:15, loss=0.518407003877082, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=2.0795171877191256, lr=0.4040178408618464
2023-12-06 03:06:38   INFO  epoch: 18/72, acc_iter=69966, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:13, time_cost(all): 16:11:23/1 day, 23:48:35, loss=0.518347806451141, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=3.5080643458041103, lr=0.40390387378647863
2023-12-06 03:07:19   INFO  epoch: 18/72, acc_iter=70016, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:12, time_cost(all): 16:12:04/2 days, 0:59:49, loss=0.5182886090252, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=4.464895148129119, lr=0.4037899067111108
2023-12-06 03:08:01   INFO  epoch: 18/72, acc_iter=70066, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:05, time_cost(all): 16:12:46/2 days, 1:46:44, loss=0.518229411599259, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=3.2928808079535585, lr=0.403675939635743
2023-12-06 03:08:43   INFO  epoch: 18/72, acc_iter=70116, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:11, time_cost(all): 16:13:28/2 days, 0:17:48, loss=0.518170214173318, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=1.9027871016935056, lr=0.4035619725603752
2023-12-06 03:09:25   INFO  epoch: 18/72, acc_iter=70166, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:37, time_cost(all): 16:14:10/2 days, 1:55:27, loss=0.518111016747377, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.017248001537805, lr=0.4034480054850074
2023-12-06 03:10:06   INFO  epoch: 18/72, acc_iter=70216, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:06, time_cost(all): 16:14:51/1 day, 23:40:20, loss=0.518051819321436, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=2.427592841652098, lr=0.40333403840963966
2023-12-06 03:10:48   INFO  epoch: 18/72, acc_iter=70266, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:25, time_cost(all): 16:15:33/2 days, 2:34:52, loss=0.517992621895495, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=1.6497460410219098, lr=0.40322007133427185
2023-12-06 03:11:30   INFO  epoch: 18/72, acc_iter=70316, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:29, time_cost(all): 16:16:15/2 days, 0:17:01, loss=0.517933424469554, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=0.7009052013362633, lr=0.40310610425890403
2023-12-06 03:12:12   INFO  epoch: 18/72, acc_iter=70366, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:08, time_cost(all): 16:16:57/2 days, 0:26:16, loss=0.517874227043614, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=1.2930734609011758, lr=0.4029921371835362
2023-12-06 03:12:54   INFO  epoch: 18/72, acc_iter=70416, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:17, time_cost(all): 16:17:39/1 day, 22:18:57, loss=0.517815029617673, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=2.6207615748808237, lr=0.40287817010816845
2023-12-06 03:13:35   INFO  epoch: 18/72, acc_iter=70466, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:19, time_cost(all): 16:18:20/2 days, 0:35:00, loss=0.517755832191732, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=4.9320082942066215, lr=0.40276420303280064
2023-12-06 03:14:17   INFO  epoch: 18/72, acc_iter=70516, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:35, time_cost(all): 16:19:02/1 day, 22:17:28, loss=0.517696634765791, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=3.270334363853862, lr=0.4026502359574329
2023-12-06 03:14:59   INFO  epoch: 18/72, acc_iter=70566, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:21, time_cost(all): 16:19:44/2 days, 2:32:17, loss=0.51763743733985, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=2.431170244336239, lr=0.40253626888206506
2023-12-06 03:15:41   INFO  epoch: 18/72, acc_iter=70616, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:53, time_cost(all): 16:20:26/2 days, 1:09:51, loss=0.517578239913909, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=4.562621152688425, lr=0.40242230180669725
2023-12-06 03:16:22   INFO  epoch: 18/72, acc_iter=70666, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:19, time_cost(all): 16:21:07/1 day, 23:37:25, loss=0.517519042487968, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=4.919564481869728, lr=0.4023083347313295
2023-12-06 03:17:04   INFO  epoch: 18/72, acc_iter=70716, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:25, time_cost(all): 16:21:49/1 day, 23:04:07, loss=0.517459845062027, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=0.6035409162002805, lr=0.40219436765596167
2023-12-06 03:17:46   INFO  epoch: 18/72, acc_iter=70766, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:01, time_cost(all): 16:22:31/2 days, 0:40:48, loss=0.517400647636086, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=2.946304468199808, lr=0.4020804005805939
2023-12-06 03:18:28   INFO  epoch: 18/72, acc_iter=70816, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:25, time_cost(all): 16:23:13/2 days, 1:28:26, loss=0.517341450210145, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=4.891982245183771, lr=0.4019664335052261
2023-12-06 03:19:10   INFO  epoch: 18/72, acc_iter=70866, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:00, time_cost(all): 16:23:55/1 day, 23:11:42, loss=0.517282252784204, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=4.1166597245975165, lr=0.4018524664298583
2023-12-06 03:19:51   INFO  epoch: 18/72, acc_iter=70916, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:08, time_cost(all): 16:24:36/1 day, 23:38:26, loss=0.517223055358263, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=4.2904925073484055, lr=0.4017384993544905
2023-12-06 03:20:33   INFO  epoch: 18/72, acc_iter=70966, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:35, time_cost(all): 16:25:18/1 day, 22:54:51, loss=0.517163857932322, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=3.270247180266052, lr=0.4016245322791227
2023-12-06 03:21:15   INFO  epoch: 18/72, acc_iter=71016, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:16, time_cost(all): 16:26:00/2 days, 0:05:04, loss=0.517104660506381, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=1.1905590394167231, lr=0.40151056520375494
2023-12-06 03:21:57   INFO  epoch: 18/72, acc_iter=71066, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:33, time_cost(all): 16:26:42/1 day, 23:36:29, loss=0.51704546308044, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=3.1416771277082356, lr=0.4013965981283871
2023-12-06 03:22:38   INFO  epoch: 18/72, acc_iter=71116, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:38, time_cost(all): 16:27:23/1 day, 23:13:21, loss=0.516986265654499, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=2.4294748043647325, lr=0.4012826310530193
2023-12-06 03:23:20   INFO  epoch: 18/72, acc_iter=71166, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:37, time_cost(all): 16:28:05/1 day, 23:17:37, loss=0.516927068228558, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=1.2382424106044847, lr=0.40116866397765155
2023-12-06 03:24:02   INFO  epoch: 18/72, acc_iter=71216, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:19, time_cost(all): 16:28:47/1 day, 22:48:00, loss=0.516867870802617, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=4.568879549640619, lr=0.40105469690228374
2023-12-06 03:24:44   INFO  epoch: 18/72, acc_iter=71266, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:35, time_cost(all): 16:29:29/2 days, 0:04:46, loss=0.516808673376677, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=4.892987761266626, lr=0.400940729826916
2023-12-06 03:25:26   INFO  epoch: 18/72, acc_iter=71316, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:58, time_cost(all): 16:30:11/2 days, 0:57:40, loss=0.516749475950736, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=2.6583865312912542, lr=0.40082676275154816
2023-12-06 03:26:07   INFO  epoch: 18/72, acc_iter=71366, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:34, time_cost(all): 16:30:52/2 days, 1:19:12, loss=0.516690278524795, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=4.184271290349464, lr=0.40071279567618034
2023-12-06 03:26:49   INFO  epoch: 18/72, acc_iter=71416, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:38, time_cost(all): 16:31:34/1 day, 23:10:51, loss=0.516631081098854, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=2.762494700284059, lr=0.40059882860081253
2023-12-06 03:27:31   INFO  epoch: 18/72, acc_iter=71466, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:29, time_cost(all): 16:32:16/2 days, 0:38:54, loss=0.516571883672913, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=2.4610110462886983, lr=0.40048486152544477
2023-12-06 03:28:13   INFO  epoch: 18/72, acc_iter=71516, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:56, time_cost(all): 16:32:58/1 day, 22:38:13, loss=0.516512686246972, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=2.317440040632758, lr=0.40037089445007695
2023-12-06 03:28:54   INFO  epoch: 18/72, acc_iter=71566, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:22, time_cost(all): 16:33:39/1 day, 23:16:54, loss=0.516453488821031, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=0.563473609700712, lr=0.4002569273747092
2023-12-06 03:29:36   INFO  epoch: 18/72, acc_iter=71616, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:25, time_cost(all): 16:34:21/1 day, 22:30:19, loss=0.51639429139509, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=3.3463848127627074, lr=0.4001429602993414
2023-12-06 03:30:18   INFO  epoch: 18/72, acc_iter=71666, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:53, time_cost(all): 16:35:03/1 day, 23:00:22, loss=0.516335093969149, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=1.0364424314507619, lr=0.40002899322397356
2023-12-06 03:31:00   INFO  epoch: 18/72, acc_iter=71716, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:12, time_cost(all): 16:35:45/1 day, 21:54:56, loss=0.516275896543208, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=2.881431570705192, lr=0.3999150261486058
2023-12-06 03:31:42   INFO  epoch: 18/72, acc_iter=71766, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:10, time_cost(all): 16:36:27/1 day, 22:12:23, loss=0.516216699117267, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=4.017809047877648, lr=0.399801059073238
2023-12-06 03:32:23   INFO  epoch: 18/72, acc_iter=71816, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:31, time_cost(all): 16:37:08/2 days, 1:31:38, loss=0.516157501691326, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=1.4221789788620864, lr=0.3996870919978702
2023-12-06 03:33:05   INFO  epoch: 18/72, acc_iter=71866, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:09, time_cost(all): 16:37:50/2 days, 1:49:07, loss=0.516098304265385, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=2.484905988732572, lr=0.3995731249225024
2023-12-06 03:33:47   INFO  epoch: 18/72, acc_iter=71916, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:59, time_cost(all): 16:38:32/1 day, 22:44:31, loss=0.516039106839444, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=3.4611424142373384, lr=0.3994591578471346
2023-12-06 03:34:29   INFO  epoch: 18/72, acc_iter=71966, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:09, time_cost(all): 16:39:14/2 days, 0:20:22, loss=0.515979909413503, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=4.102133644470857, lr=0.39934519077176683
2023-12-06 03:35:11   INFO  epoch: 18/72, acc_iter=72016, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:44, time_cost(all): 16:39:56/2 days, 0:17:38, loss=0.515920711987562, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=1.9304412015893682, lr=0.399231223696399
2023-12-06 03:35:52   INFO  epoch: 18/72, acc_iter=72066, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:27, time_cost(all): 16:40:37/2 days, 1:30:02, loss=0.515861514561621, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=0.8506637366215108, lr=0.39911725662103126
2023-12-06 03:36:34   INFO  epoch: 18/72, acc_iter=72116, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:21, time_cost(all): 16:41:19/1 day, 21:28:15, loss=0.515802317135681, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=1.766525386973749, lr=0.39900328954566344
2023-12-06 03:37:16   INFO  epoch: 18/72, acc_iter=72166, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:12, time_cost(all): 16:42:01/1 day, 23:01:36, loss=0.51574311970974, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=1.4968513617576553, lr=0.3988893224702956
2023-12-06 03:37:58   INFO  epoch: 18/72, acc_iter=72216, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:03, time_cost(all): 16:42:43/2 days, 0:14:23, loss=0.515683922283799, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=3.295387228061277, lr=0.39877535539492787
2023-12-06 03:38:39   INFO  epoch: 18/72, acc_iter=72266, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:47, time_cost(all): 16:43:24/1 day, 23:23:32, loss=0.515624724857858, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=1.7750546399159708, lr=0.39866138831956005
2023-12-06 03:39:21   INFO  epoch: 18/72, acc_iter=72316, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:13, time_cost(all): 16:44:06/2 days, 0:18:48, loss=0.515565527431917, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=1.4000509087259592, lr=0.39854742124419223
2023-12-06 03:40:03   INFO  epoch: 18/72, acc_iter=72366, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:40, time_cost(all): 16:44:48/2 days, 0:44:08, loss=0.515506330005976, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=3.3306789516564472, lr=0.3984334541688245
2023-12-06 03:40:45   INFO  epoch: 18/72, acc_iter=72416, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:15, time_cost(all): 16:45:30/1 day, 21:40:38, loss=0.515447132580035, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=2.5449900056071364, lr=0.39831948709345666
2023-12-06 03:41:27   INFO  epoch: 18/72, acc_iter=72466, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:45, time_cost(all): 16:46:12/2 days, 0:54:17, loss=0.515387935154094, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=1.7622857278620159, lr=0.39820552001808884
2023-12-06 03:42:08   INFO  epoch: 18/72, acc_iter=72516, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:01, time_cost(all): 16:46:53/1 day, 21:46:24, loss=0.515328737728153, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=2.4458294141560253, lr=0.3980915529427211
2023-12-06 03:42:50   INFO  epoch: 18/72, acc_iter=72566, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:16, time_cost(all): 16:47:35/1 day, 22:41:30, loss=0.515269540302212, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=2.601590090345005, lr=0.39797758586735327
2023-12-06 03:43:32   INFO  epoch: 18/72, acc_iter=72616, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:17, time_cost(all): 16:48:17/1 day, 21:38:55, loss=0.515210342876271, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=1.7245104640112359, lr=0.3978636187919855
2023-12-06 03:44:14   INFO  epoch: 18/72, acc_iter=72666, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:08, time_cost(all): 16:48:59/2 days, 1:41:14, loss=0.51515114545033, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=3.913799041421743, lr=0.3977496517166177
2023-12-06 03:44:55   INFO  epoch: 18/72, acc_iter=72716, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:47, time_cost(all): 16:49:40/2 days, 0:49:21, loss=0.515091948024389, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=0.855880356788056, lr=0.3976356846412499
2023-12-06 03:45:37   INFO  epoch: 18/72, acc_iter=72766, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:37, time_cost(all): 16:50:22/2 days, 0:53:51, loss=0.515032750598448, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=2.648434244306317, lr=0.3975217175658821
2023-12-06 03:46:19   INFO  epoch: 18/72, acc_iter=72816, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:08, time_cost(all): 16:51:04/1 day, 22:30:11, loss=0.514973553172507, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=2.120221105771483, lr=0.3974077504905143
2023-12-06 03:47:01   INFO  epoch: 18/72, acc_iter=72866, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:06, time_cost(all): 16:51:46/1 day, 21:34:01, loss=0.514914355746566, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=4.2045488857867, lr=0.39729378341514654
2023-12-06 03:47:43   INFO  epoch: 18/72, acc_iter=72916, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:34, time_cost(all): 16:52:28/2 days, 1:27:08, loss=0.514855158320625, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=0.9037126541218332, lr=0.3971798163397787
2023-12-06 03:48:24   INFO  epoch: 18/72, acc_iter=72966, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:41, time_cost(all): 16:53:09/1 day, 21:23:19, loss=0.514795960894684, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=4.979080164384795, lr=0.3970658492644109
2023-12-06 03:49:06   INFO  epoch: 18/72, acc_iter=73016, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:08, time_cost(all): 16:53:51/1 day, 23:20:15, loss=0.514736763468744, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=1.9471522071519018, lr=0.39695188218904315
2023-12-06 03:49:48   INFO  epoch: 18/72, acc_iter=73066, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:27, time_cost(all): 16:54:33/2 days, 0:37:07, loss=0.514677566042803, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=1.948655983507643, lr=0.39683791511367533
2023-12-06 03:50:30   INFO  epoch: 18/72, acc_iter=73116, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:49, time_cost(all): 16:55:15/1 day, 22:20:23, loss=0.514618368616862, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=3.171763531640524, lr=0.3967239480383076
2023-12-06 03:51:11   INFO  epoch: 18/72, acc_iter=73166, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:49, time_cost(all): 16:55:56/1 day, 22:35:08, loss=0.514559171190921, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=1.9364431968244786, lr=0.39660998096293976
2023-12-06 03:51:53   INFO  epoch: 18/72, acc_iter=73216, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:10, time_cost(all): 16:56:38/1 day, 22:06:50, loss=0.51449997376498, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=1.5452251901076637, lr=0.39649601388757194
2023-12-06 03:52:35   INFO  epoch: 18/72, acc_iter=73266, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 16:57:20/1 day, 23:20:34, loss=0.514440776339039, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=1.8060261322712432, lr=0.3963820468122041
2023-12-06 03:53:17   INFO  epoch: 18/72, acc_iter=73316, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:49, time_cost(all): 16:58:02/1 day, 23:52:12, loss=0.514381578913098, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=3.9348865796794907, lr=0.39626807973683636
2023-12-06 03:53:59   INFO  epoch: 18/72, acc_iter=73366, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 16:58:44/1 day, 21:47:56, loss=0.514322381487157, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=4.694201332158453, lr=0.39615411266146855
2023-12-06 03:54:40   INFO  epoch: 19/72, acc_iter=73428, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:45, time_cost(all): 16:59:25/1 day, 23:49:10, loss=0.51424897667899, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=3.43315189567529, lr=0.3960127934880125
2023-12-06 03:55:22   INFO  epoch: 19/72, acc_iter=73478, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:51, time_cost(all): 17:00:07/2 days, 0:51:29, loss=0.514189779253049, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=3.7579480486282053, lr=0.39589882641264473
2023-12-06 03:56:04   INFO  epoch: 19/72, acc_iter=73528, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:17, time_cost(all): 17:00:49/1 day, 23:59:12, loss=0.514130581827108, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=4.24157139320447, lr=0.3957848593372769
2023-12-06 03:56:46   INFO  epoch: 19/72, acc_iter=73578, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:51, time_cost(all): 17:01:31/2 days, 0:16:17, loss=0.514071384401167, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=4.199962936158113, lr=0.3956708922619091
2023-12-06 03:57:27   INFO  epoch: 19/72, acc_iter=73628, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:55, time_cost(all): 17:02:12/2 days, 0:28:54, loss=0.514012186975226, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=2.201605057786318, lr=0.39555692518654134
2023-12-06 03:58:09   INFO  epoch: 19/72, acc_iter=73678, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:28, time_cost(all): 17:02:54/1 day, 21:26:36, loss=0.513952989549286, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=4.660165018464989, lr=0.3954429581111735
2023-12-06 03:58:51   INFO  epoch: 19/72, acc_iter=73728, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:07, time_cost(all): 17:03:36/1 day, 23:36:49, loss=0.513893792123345, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=3.916703740945982, lr=0.39532899103580577
2023-12-06 03:59:33   INFO  epoch: 19/72, acc_iter=73778, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:50:05, time_cost(all): 17:04:18/1 day, 23:14:10, loss=0.513834594697404, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=2.511984399872533, lr=0.39521502396043795
2023-12-06 04:00:15   INFO  epoch: 19/72, acc_iter=73828, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:42, time_cost(all): 17:05:00/1 day, 23:18:43, loss=0.513775397271463, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=4.685262705216025, lr=0.39510105688507013
2023-12-06 04:00:56   INFO  epoch: 19/72, acc_iter=73878, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:34, time_cost(all): 17:05:41/1 day, 22:41:14, loss=0.513716199845522, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=3.6924259001878035, lr=0.3949870898097023
2023-12-06 04:01:38   INFO  epoch: 19/72, acc_iter=73928, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:51, time_cost(all): 17:06:23/2 days, 1:15:14, loss=0.513657002419581, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=4.468927733649394, lr=0.39487312273433456
2023-12-06 04:02:20   INFO  epoch: 19/72, acc_iter=73978, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:45, time_cost(all): 17:07:05/1 day, 21:10:52, loss=0.51359780499364, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=3.459194284534557, lr=0.3947591556589668
2023-12-06 04:03:02   INFO  epoch: 19/72, acc_iter=74028, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:12, time_cost(all): 17:07:47/1 day, 21:51:05, loss=0.513538607567699, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=1.53530349841399, lr=0.394645188583599
2023-12-06 04:03:43   INFO  epoch: 19/72, acc_iter=74078, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:11, time_cost(all): 17:08:28/1 day, 22:51:34, loss=0.513479410141758, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=1.094119794163928, lr=0.39453122150823117
2023-12-06 04:04:25   INFO  epoch: 19/72, acc_iter=74128, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:40, time_cost(all): 17:09:10/1 day, 22:15:38, loss=0.513420212715817, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=2.193257156907093, lr=0.39441725443286335
2023-12-06 04:05:07   INFO  epoch: 19/72, acc_iter=74178, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:14, time_cost(all): 17:09:52/2 days, 0:38:50, loss=0.513361015289876, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=4.434745627115399, lr=0.3943032873574956
2023-12-06 04:05:49   INFO  epoch: 19/72, acc_iter=74228, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:00, time_cost(all): 17:10:34/2 days, 0:07:19, loss=0.513301817863935, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=2.3432057796023327, lr=0.39418932028212783
2023-12-06 04:06:31   INFO  epoch: 19/72, acc_iter=74278, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:44, time_cost(all): 17:11:16/1 day, 21:10:03, loss=0.513242620437994, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=2.9839142888351593, lr=0.39407535320676
2023-12-06 04:07:12   INFO  epoch: 19/72, acc_iter=74328, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:21, time_cost(all): 17:11:57/1 day, 23:40:19, loss=0.513183423012053, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=1.0531580710787256, lr=0.3939613861313922
2023-12-06 04:07:54   INFO  epoch: 19/72, acc_iter=74378, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:19, time_cost(all): 17:12:39/1 day, 21:34:22, loss=0.513124225586112, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=1.4540668974957038, lr=0.3938474190560244
2023-12-06 04:08:36   INFO  epoch: 19/72, acc_iter=74428, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:02, time_cost(all): 17:13:21/1 day, 23:18:31, loss=0.513065028160171, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=3.2359166062797606, lr=0.3937334519806566
2023-12-06 04:09:18   INFO  epoch: 19/72, acc_iter=74478, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:23, time_cost(all): 17:14:03/1 day, 22:42:37, loss=0.51300583073423, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=3.527447085414114, lr=0.3936194849052888
2023-12-06 04:10:00   INFO  epoch: 19/72, acc_iter=74528, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:57, time_cost(all): 17:14:45/2 days, 1:02:25, loss=0.512946633308289, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=0.9986783954657226, lr=0.39350551782992105
2023-12-06 04:10:41   INFO  epoch: 19/72, acc_iter=74578, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:18, time_cost(all): 17:15:26/1 day, 23:34:51, loss=0.512887435882349, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=2.1284573449468316, lr=0.39339155075455323
2023-12-06 04:11:23   INFO  epoch: 19/72, acc_iter=74628, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:50, time_cost(all): 17:16:08/2 days, 0:31:23, loss=0.512828238456408, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=4.985475421701038, lr=0.3932775836791854
2023-12-06 04:12:05   INFO  epoch: 19/72, acc_iter=74678, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:53, time_cost(all): 17:16:50/1 day, 21:22:34, loss=0.512769041030467, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=1.4394639458418608, lr=0.39316361660381766
2023-12-06 04:12:47   INFO  epoch: 19/72, acc_iter=74728, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:18, time_cost(all): 17:17:32/1 day, 23:48:25, loss=0.512709843604526, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=1.1081079672435639, lr=0.39304964952844984
2023-12-06 04:13:28   INFO  epoch: 19/72, acc_iter=74778, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:44, time_cost(all): 17:18:13/1 day, 21:03:53, loss=0.512650646178585, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=3.446410861457049, lr=0.3929356824530821
2023-12-06 04:14:10   INFO  epoch: 19/72, acc_iter=74828, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:29, time_cost(all): 17:18:55/2 days, 0:56:46, loss=0.512591448752644, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=3.075946325134422, lr=0.39282171537771426
2023-12-06 04:14:52   INFO  epoch: 19/72, acc_iter=74878, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:26, time_cost(all): 17:19:37/1 day, 23:43:54, loss=0.512532251326703, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=2.3175023573953744, lr=0.39270774830234645
2023-12-06 04:15:34   INFO  epoch: 19/72, acc_iter=74928, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:47, time_cost(all): 17:20:19/2 days, 0:55:55, loss=0.512473053900762, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=3.8344320338253812, lr=0.3925937812269787
2023-12-06 04:16:16   INFO  epoch: 19/72, acc_iter=74978, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:07, time_cost(all): 17:21:01/2 days, 0:27:43, loss=0.512413856474821, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=3.4718395941442353, lr=0.3924798141516109
2023-12-06 04:16:57   INFO  epoch: 19/72, acc_iter=75028, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:51, time_cost(all): 17:21:42/2 days, 1:13:05, loss=0.51235465904888, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=1.473129725573683, lr=0.3923658470762431
2023-12-06 04:17:39   INFO  epoch: 19/72, acc_iter=75078, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:37, time_cost(all): 17:22:24/1 day, 22:39:49, loss=0.512295461622939, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=2.434800967205092, lr=0.3922518800008753
2023-12-06 04:18:21   INFO  epoch: 19/72, acc_iter=75128, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:02, time_cost(all): 17:23:06/1 day, 21:21:19, loss=0.512236264196998, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=2.6781054695744966, lr=0.3921379129255075
2023-12-06 04:19:03   INFO  epoch: 19/72, acc_iter=75178, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:30:07, time_cost(all): 17:23:48/1 day, 21:46:15, loss=0.512177066771057, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=4.793157470351172, lr=0.3920239458501397
2023-12-06 04:19:44   INFO  epoch: 19/72, acc_iter=75228, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:39, time_cost(all): 17:24:29/2 days, 0:08:34, loss=0.512117869345116, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=3.6587043586399597, lr=0.3919099787747719
2023-12-06 04:20:26   INFO  epoch: 19/72, acc_iter=75278, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:51, time_cost(all): 17:25:11/1 day, 22:10:52, loss=0.512058671919175, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=1.748995245703337, lr=0.3917960116994041
2023-12-06 04:21:08   INFO  epoch: 19/72, acc_iter=75328, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:18, time_cost(all): 17:25:53/1 day, 23:39:51, loss=0.511999474493234, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=3.902837115278345, lr=0.39168204462403633
2023-12-06 04:21:50   INFO  epoch: 19/72, acc_iter=75378, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:02, time_cost(all): 17:26:35/1 day, 22:39:42, loss=0.511940277067294, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=1.030255887876359, lr=0.3915680775486685
2023-12-06 04:22:32   INFO  epoch: 19/72, acc_iter=75428, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:38, time_cost(all): 17:27:17/2 days, 0:50:40, loss=0.511881079641353, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=0.8068436324020529, lr=0.3914541104733007
2023-12-06 04:23:13   INFO  epoch: 19/72, acc_iter=75478, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:24, time_cost(all): 17:27:58/1 day, 20:57:39, loss=0.511821882215412, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=4.639273128295375, lr=0.39134014339793294
2023-12-06 04:23:55   INFO  epoch: 19/72, acc_iter=75528, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:42, time_cost(all): 17:28:40/1 day, 22:39:36, loss=0.511762684789471, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=1.0006361270548303, lr=0.3912261763225651
2023-12-06 04:24:37   INFO  epoch: 19/72, acc_iter=75578, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:46, time_cost(all): 17:29:22/2 days, 0:34:28, loss=0.51170348736353, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=0.8709987631265618, lr=0.39111220924719736
2023-12-06 04:25:19   INFO  epoch: 19/72, acc_iter=75628, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:23, time_cost(all): 17:30:04/2 days, 0:45:26, loss=0.511644289937589, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=2.14425747517887, lr=0.39099824217182955
2023-12-06 04:26:00   INFO  epoch: 19/72, acc_iter=75678, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:42, time_cost(all): 17:30:45/2 days, 0:30:43, loss=0.511585092511648, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=2.5396397193635725, lr=0.39088427509646173
2023-12-06 04:26:42   INFO  epoch: 19/72, acc_iter=75728, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:59, time_cost(all): 17:31:27/1 day, 23:44:19, loss=0.511525895085707, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=2.3517912859663954, lr=0.39077030802109397
2023-12-06 04:27:24   INFO  epoch: 19/72, acc_iter=75778, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:16, time_cost(all): 17:32:09/1 day, 21:47:13, loss=0.511466697659766, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=1.5351583861535205, lr=0.39065634094572615
2023-12-06 04:28:06   INFO  epoch: 19/72, acc_iter=75828, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:35, time_cost(all): 17:32:51/1 day, 21:12:13, loss=0.511407500233825, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=4.631014845587885, lr=0.3905423738703584
2023-12-06 04:28:48   INFO  epoch: 19/72, acc_iter=75878, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:38, time_cost(all): 17:33:33/1 day, 21:11:51, loss=0.511348302807884, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=0.6005121769789006, lr=0.3904284067949906
2023-12-06 04:29:29   INFO  epoch: 19/72, acc_iter=75928, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:13, time_cost(all): 17:34:14/1 day, 21:05:05, loss=0.511289105381943, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=1.853013513933073, lr=0.39031443971962276
2023-12-06 04:30:11   INFO  epoch: 19/72, acc_iter=75978, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:05, time_cost(all): 17:34:56/1 day, 22:56:08, loss=0.511229907956002, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=2.1671255930123468, lr=0.390200472644255
2023-12-06 04:30:53   INFO  epoch: 19/72, acc_iter=76028, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:16, time_cost(all): 17:35:38/1 day, 22:20:55, loss=0.511170710530061, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=0.7994631100384446, lr=0.3900865055688872
2023-12-06 04:31:35   INFO  epoch: 19/72, acc_iter=76078, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:38, time_cost(all): 17:36:20/2 days, 1:10:21, loss=0.51111151310412, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=3.4603824188961347, lr=0.3899725384935194
2023-12-06 04:32:16   INFO  epoch: 19/72, acc_iter=76128, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:43, time_cost(all): 17:37:01/1 day, 22:26:39, loss=0.511052315678179, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=2.7122382152114244, lr=0.3898585714181516
2023-12-06 04:32:58   INFO  epoch: 19/72, acc_iter=76178, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:27, time_cost(all): 17:37:43/1 day, 20:58:13, loss=0.510993118252238, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=4.229741000120282, lr=0.3897446043427838
2023-12-06 04:33:40   INFO  epoch: 19/72, acc_iter=76228, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:31, time_cost(all): 17:38:25/1 day, 22:06:08, loss=0.510933920826298, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=4.059462078274779, lr=0.389630637267416
2023-12-06 04:34:22   INFO  epoch: 19/72, acc_iter=76278, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:10, time_cost(all): 17:39:07/1 day, 22:22:59, loss=0.510874723400357, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=4.75794089148228, lr=0.3895166701920482
2023-12-06 04:35:04   INFO  epoch: 19/72, acc_iter=76328, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:17, time_cost(all): 17:39:49/1 day, 23:43:20, loss=0.510815525974416, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=2.2189829506270513, lr=0.3894027031166804
2023-12-06 04:35:45   INFO  epoch: 19/72, acc_iter=76378, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:28, time_cost(all): 17:40:30/1 day, 21:43:52, loss=0.510756328548475, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=1.9326380762758215, lr=0.38928873604131264
2023-12-06 04:36:27   INFO  epoch: 19/72, acc_iter=76428, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:00, time_cost(all): 17:41:12/1 day, 21:31:44, loss=0.510697131122534, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=2.492261784001545, lr=0.38917476896594483
2023-12-06 04:37:09   INFO  epoch: 19/72, acc_iter=76478, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:14, time_cost(all): 17:41:54/2 days, 1:03:08, loss=0.510637933696593, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=3.333154760876085, lr=0.389060801890577
2023-12-06 04:37:51   INFO  epoch: 19/72, acc_iter=76528, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:29, time_cost(all): 17:42:36/1 day, 21:33:19, loss=0.510578736270652, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=4.827209367327667, lr=0.38894683481520925
2023-12-06 04:38:32   INFO  epoch: 19/72, acc_iter=76578, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:03, time_cost(all): 17:43:17/1 day, 20:30:50, loss=0.510519538844711, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=4.502495218855587, lr=0.38883286773984144
2023-12-06 04:39:14   INFO  epoch: 19/72, acc_iter=76628, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:39, time_cost(all): 17:43:59/1 day, 23:26:25, loss=0.51046034141877, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=4.6938336205037325, lr=0.3887189006644737
2023-12-06 04:39:56   INFO  epoch: 19/72, acc_iter=76678, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:54, time_cost(all): 17:44:41/1 day, 21:16:53, loss=0.510401143992829, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=0.8960080917836974, lr=0.38860493358910586
2023-12-06 04:40:38   INFO  epoch: 19/72, acc_iter=76728, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:21, time_cost(all): 17:45:23/1 day, 21:40:15, loss=0.510341946566888, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=4.18629531902224, lr=0.38849096651373805
2023-12-06 04:41:20   INFO  epoch: 19/72, acc_iter=76778, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:21, time_cost(all): 17:46:05/1 day, 21:32:23, loss=0.510282749140947, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=2.3013920590730144, lr=0.3883769994383703
2023-12-06 04:42:01   INFO  epoch: 19/72, acc_iter=76828, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:36, time_cost(all): 17:46:46/2 days, 0:22:59, loss=0.510223551715006, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=3.339887918563148, lr=0.38826303236300247
2023-12-06 04:42:43   INFO  epoch: 19/72, acc_iter=76878, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:16, time_cost(all): 17:47:28/1 day, 23:31:46, loss=0.510164354289065, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=4.107980857787546, lr=0.3881490652876347
2023-12-06 04:43:25   INFO  epoch: 19/72, acc_iter=76928, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:15, time_cost(all): 17:48:10/1 day, 23:40:36, loss=0.510105156863124, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.95(1.03), norm=2.323369401153819, lr=0.3880350982122669
2023-12-06 04:44:07   INFO  epoch: 19/72, acc_iter=76978, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:28, time_cost(all): 17:48:52/1 day, 23:48:14, loss=0.510045959437183, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=0.6736609753064917, lr=0.3879211311368991
2023-12-06 04:44:49   INFO  epoch: 19/72, acc_iter=77028, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:49, time_cost(all): 17:49:34/1 day, 20:47:53, loss=0.509986762011242, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=1.6939880165060996, lr=0.38780716406153126
2023-12-06 04:45:30   INFO  epoch: 19/72, acc_iter=77078, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:15, time_cost(all): 17:50:15/2 days, 0:21:23, loss=0.509927564585302, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.23(1.03), norm=1.966178272844191, lr=0.3876931969861635
2023-12-06 04:46:12   INFO  epoch: 19/72, acc_iter=77128, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 17:50:57/1 day, 22:21:27, loss=0.509868367159361, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=4.913859564387945, lr=0.3875792299107957
2023-12-06 04:46:54   INFO  epoch: 19/72, acc_iter=77178, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:49, time_cost(all): 17:51:39/1 day, 23:51:22, loss=0.50980916973342, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=2.791225505846966, lr=0.3874652628354279
2023-12-06 04:47:36   INFO  epoch: 19/72, acc_iter=77228, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 17:52:21/1 day, 20:23:14, loss=0.509749972307479, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=2.5096651688142826, lr=0.3873512957600601
2023-12-06 04:48:17   INFO  epoch: 20/72, acc_iter=77290, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:52, time_cost(all): 17:53:02/1 day, 22:23:28, loss=0.509676567499312, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=4.637302393850015, lr=0.38720997658660405
2023-12-06 04:48:59   INFO  epoch: 20/72, acc_iter=77340, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:54, time_cost(all): 17:53:44/1 day, 21:39:07, loss=0.509617370073371, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=0.6808001513364532, lr=0.38709600951123624
2023-12-06 04:49:41   INFO  epoch: 20/72, acc_iter=77390, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:25, time_cost(all): 17:54:26/1 day, 22:15:42, loss=0.50955817264743, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=2.004140575013305, lr=0.3869820424358685
2023-12-06 04:50:23   INFO  epoch: 20/72, acc_iter=77440, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:48:58, time_cost(all): 17:55:08/1 day, 22:11:08, loss=0.509498975221489, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=3.884696230221641, lr=0.38686807536050066
2023-12-06 04:51:05   INFO  epoch: 20/72, acc_iter=77490, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:10, time_cost(all): 17:55:50/1 day, 22:04:31, loss=0.509439777795548, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=1.833502324002512, lr=0.38675410828513285
2023-12-06 04:51:46   INFO  epoch: 20/72, acc_iter=77540, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:17, time_cost(all): 17:56:31/1 day, 23:42:26, loss=0.509380580369607, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=3.0663582743554554, lr=0.3866401412097651
2023-12-06 04:52:28   INFO  epoch: 20/72, acc_iter=77590, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:29, time_cost(all): 17:57:13/1 day, 21:13:48, loss=0.509321382943666, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=1.6899633698614678, lr=0.38652617413439727
2023-12-06 04:53:10   INFO  epoch: 20/72, acc_iter=77640, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:20, time_cost(all): 17:57:55/2 days, 0:01:51, loss=0.509262185517725, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=4.91393858884064, lr=0.38641220705902946
2023-12-06 04:53:52   INFO  epoch: 20/72, acc_iter=77690, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:13, time_cost(all): 17:58:37/1 day, 22:14:20, loss=0.509202988091784, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=4.569673790313012, lr=0.3862982399836617
2023-12-06 04:54:33   INFO  epoch: 20/72, acc_iter=77740, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:12, time_cost(all): 17:59:18/1 day, 21:00:36, loss=0.509143790665843, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=3.361841108908834, lr=0.38618427290829394
2023-12-06 04:55:15   INFO  epoch: 20/72, acc_iter=77790, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:43:52, time_cost(all): 18:00:00/1 day, 23:34:38, loss=0.509084593239903, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=2.656882374843884, lr=0.3860703058329261
2023-12-06 04:55:57   INFO  epoch: 20/72, acc_iter=77840, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:19, time_cost(all): 18:00:42/1 day, 21:36:57, loss=0.509025395813962, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=3.2777843723920914, lr=0.3859563387575583
2023-12-06 04:56:39   INFO  epoch: 20/72, acc_iter=77890, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:54, time_cost(all): 18:01:24/1 day, 21:38:39, loss=0.508966198388021, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=3.262021789780734, lr=0.38584237168219054
2023-12-06 04:57:21   INFO  epoch: 20/72, acc_iter=77940, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:37, time_cost(all): 18:02:06/1 day, 22:26:18, loss=0.50890700096208, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=1.8187755320290717, lr=0.38572840460682273
2023-12-06 04:58:02   INFO  epoch: 20/72, acc_iter=77990, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:37, time_cost(all): 18:02:47/1 day, 23:28:46, loss=0.508847803536139, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=3.3657893791602107, lr=0.3856144375314549
2023-12-06 04:58:44   INFO  epoch: 20/72, acc_iter=78040, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:12, time_cost(all): 18:03:29/1 day, 23:21:08, loss=0.508788606110198, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=3.1980405745534877, lr=0.38550047045608715
2023-12-06 04:59:26   INFO  epoch: 20/72, acc_iter=78090, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:06, time_cost(all): 18:04:11/1 day, 22:39:40, loss=0.508729408684257, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=3.9355023000016778, lr=0.38538650338071934
2023-12-06 05:00:08   INFO  epoch: 20/72, acc_iter=78140, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:12, time_cost(all): 18:04:53/1 day, 21:33:32, loss=0.508670211258316, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=3.3507279909038648, lr=0.3852725363053515
2023-12-06 05:00:49   INFO  epoch: 20/72, acc_iter=78190, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:27, time_cost(all): 18:05:34/1 day, 20:08:25, loss=0.508611013832375, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=4.131565619328358, lr=0.38515856922998376
2023-12-06 05:01:31   INFO  epoch: 20/72, acc_iter=78240, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:37:58, time_cost(all): 18:06:16/1 day, 23:44:21, loss=0.508551816406434, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=0.5582543807776016, lr=0.38504460215461594
2023-12-06 05:02:13   INFO  epoch: 20/72, acc_iter=78290, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:21, time_cost(all): 18:06:58/1 day, 22:46:59, loss=0.508492618980493, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=2.1293454027108947, lr=0.3849306350792482
2023-12-06 05:02:55   INFO  epoch: 20/72, acc_iter=78340, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:23, time_cost(all): 18:07:40/1 day, 23:04:23, loss=0.508433421554552, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.23(1.03), norm=2.407906474507722, lr=0.38481666800388037
2023-12-06 05:03:37   INFO  epoch: 20/72, acc_iter=78390, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:32, time_cost(all): 18:08:22/2 days, 0:34:11, loss=0.508374224128611, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=2.5640286883908425, lr=0.38470270092851255
2023-12-06 05:04:18   INFO  epoch: 20/72, acc_iter=78440, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:19, time_cost(all): 18:09:03/1 day, 20:09:06, loss=0.50831502670267, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=3.358420020695573, lr=0.3845887338531448
2023-12-06 05:05:00   INFO  epoch: 20/72, acc_iter=78490, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:12, time_cost(all): 18:09:45/1 day, 23:22:48, loss=0.508255829276729, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=2.8175997559781645, lr=0.384474766777777
2023-12-06 05:05:42   INFO  epoch: 20/72, acc_iter=78540, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:37:19, time_cost(all): 18:10:27/1 day, 22:08:31, loss=0.508196631850788, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=4.053711468891726, lr=0.3843607997024092
2023-12-06 05:06:24   INFO  epoch: 20/72, acc_iter=78590, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:41, time_cost(all): 18:11:09/1 day, 20:40:03, loss=0.508137434424847, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=2.6707069437895523, lr=0.3842468326270414
2023-12-06 05:07:05   INFO  epoch: 20/72, acc_iter=78640, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:33, time_cost(all): 18:11:50/1 day, 20:09:34, loss=0.508078236998907, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=1.5150660416515727, lr=0.3841328655516736
2023-12-06 05:07:47   INFO  epoch: 20/72, acc_iter=78690, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:14, time_cost(all): 18:12:32/1 day, 23:09:43, loss=0.508019039572966, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=4.740804522945032, lr=0.38401889847630577
2023-12-06 05:08:29   INFO  epoch: 20/72, acc_iter=78740, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:50, time_cost(all): 18:13:14/1 day, 20:19:54, loss=0.507959842147025, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.5476789666163775, lr=0.383904931400938
2023-12-06 05:09:11   INFO  epoch: 20/72, acc_iter=78790, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:35, time_cost(all): 18:13:56/1 day, 23:07:33, loss=0.507900644721084, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=3.0782535896552283, lr=0.38379096432557025
2023-12-06 05:09:53   INFO  epoch: 20/72, acc_iter=78840, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:34, time_cost(all): 18:14:38/1 day, 22:17:47, loss=0.507841447295143, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=3.3943459619437046, lr=0.38367699725020243
2023-12-06 05:10:34   INFO  epoch: 20/72, acc_iter=78890, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:29, time_cost(all): 18:15:19/1 day, 23:26:44, loss=0.507782249869202, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=0.8272013114256749, lr=0.3835630301748346
2023-12-06 05:11:16   INFO  epoch: 20/72, acc_iter=78940, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:37, time_cost(all): 18:16:01/1 day, 21:02:54, loss=0.507723052443261, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=4.005580076182659, lr=0.38344906309946686
2023-12-06 05:11:58   INFO  epoch: 20/72, acc_iter=78990, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:01, time_cost(all): 18:16:43/1 day, 23:18:40, loss=0.50766385501732, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=3.5288303239510923, lr=0.38333509602409904
2023-12-06 05:12:40   INFO  epoch: 20/72, acc_iter=79040, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:43, time_cost(all): 18:17:25/1 day, 21:23:06, loss=0.507604657591379, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=4.983002894787337, lr=0.3832211289487312
2023-12-06 05:13:21   INFO  epoch: 20/72, acc_iter=79090, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:50, time_cost(all): 18:18:06/1 day, 23:40:18, loss=0.507545460165438, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=2.2834867051940364, lr=0.38310716187336347
2023-12-06 05:14:03   INFO  epoch: 20/72, acc_iter=79140, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:05, time_cost(all): 18:18:48/1 day, 20:56:04, loss=0.507486262739497, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=4.731167201719214, lr=0.38299319479799565
2023-12-06 05:14:45   INFO  epoch: 20/72, acc_iter=79190, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:44, time_cost(all): 18:19:30/1 day, 23:53:31, loss=0.507427065313556, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=1.0976724816742967, lr=0.38287922772262784
2023-12-06 05:15:27   INFO  epoch: 20/72, acc_iter=79240, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:41, time_cost(all): 18:20:12/1 day, 23:58:28, loss=0.507367867887615, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=3.458614153757543, lr=0.3827652606472601
2023-12-06 05:16:09   INFO  epoch: 20/72, acc_iter=79290, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:13, time_cost(all): 18:20:54/2 days, 0:03:29, loss=0.507308670461674, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=4.268631432660131, lr=0.38265129357189226
2023-12-06 05:16:50   INFO  epoch: 20/72, acc_iter=79340, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:10, time_cost(all): 18:21:35/1 day, 21:26:27, loss=0.507249473035733, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=3.5669890396020194, lr=0.3825373264965245
2023-12-06 05:17:32   INFO  epoch: 20/72, acc_iter=79390, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:54, time_cost(all): 18:22:17/1 day, 21:32:39, loss=0.507190275609792, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=2.544526281693387, lr=0.3824233594211567
2023-12-06 05:18:14   INFO  epoch: 20/72, acc_iter=79440, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:00, time_cost(all): 18:22:59/1 day, 20:11:36, loss=0.507131078183852, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=0.7757265346015201, lr=0.38230939234578887
2023-12-06 05:18:56   INFO  epoch: 20/72, acc_iter=79490, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:16, time_cost(all): 18:23:41/1 day, 20:58:57, loss=0.507071880757911, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=1.653088037419837, lr=0.3821954252704211
2023-12-06 05:19:38   INFO  epoch: 20/72, acc_iter=79540, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:21, time_cost(all): 18:24:23/1 day, 22:42:19, loss=0.50701268333197, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=3.510578085768722, lr=0.3820814581950533
2023-12-06 05:20:19   INFO  epoch: 20/72, acc_iter=79590, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:01, time_cost(all): 18:25:04/1 day, 22:27:45, loss=0.506953485906029, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=1.8565702533887194, lr=0.38196749111968553
2023-12-06 05:21:01   INFO  epoch: 20/72, acc_iter=79640, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:04, time_cost(all): 18:25:46/1 day, 21:50:39, loss=0.506894288480088, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=1.3875958145104326, lr=0.3818535240443177
2023-12-06 05:21:43   INFO  epoch: 20/72, acc_iter=79690, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:19, time_cost(all): 18:26:28/1 day, 21:30:07, loss=0.506835091054147, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=3.4653262528791235, lr=0.3817395569689499
2023-12-06 05:22:25   INFO  epoch: 20/72, acc_iter=79740, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:41, time_cost(all): 18:27:10/1 day, 23:43:27, loss=0.506775893628206, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=2.702328796235391, lr=0.38162558989358214
2023-12-06 05:23:06   INFO  epoch: 20/72, acc_iter=79790, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:50, time_cost(all): 18:27:51/1 day, 23:17:24, loss=0.506716696202265, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=2.014455585789532, lr=0.3815116228182143
2023-12-06 05:23:48   INFO  epoch: 20/72, acc_iter=79840, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:06, time_cost(all): 18:28:33/1 day, 20:29:41, loss=0.506657498776324, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=4.785167298791175, lr=0.38139765574284656
2023-12-06 05:24:30   INFO  epoch: 20/72, acc_iter=79890, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:27, time_cost(all): 18:29:15/1 day, 22:17:47, loss=0.506598301350383, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=4.431992724835341, lr=0.38128368866747875
2023-12-06 05:25:12   INFO  epoch: 20/72, acc_iter=79940, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:48, time_cost(all): 18:29:57/1 day, 22:10:44, loss=0.506539103924442, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=0.7812304039936144, lr=0.38116972159211093
2023-12-06 05:25:54   INFO  epoch: 20/72, acc_iter=79990, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:07, time_cost(all): 18:30:39/1 day, 23:53:59, loss=0.506479906498501, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=1.120780767001425, lr=0.3810557545167431
2023-12-06 05:26:35   INFO  epoch: 20/72, acc_iter=80040, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:27, time_cost(all): 18:31:20/2 days, 0:15:12, loss=0.50642070907256, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=1.7096656673894128, lr=0.38094178744137536
2023-12-06 05:27:17   INFO  epoch: 20/72, acc_iter=80090, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:39, time_cost(all): 18:32:02/1 day, 20:51:26, loss=0.506361511646619, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=1.431136722787964, lr=0.3808278203660076
2023-12-06 05:27:59   INFO  epoch: 20/72, acc_iter=80140, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:07, time_cost(all): 18:32:44/2 days, 0:00:42, loss=0.506302314220678, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=4.2584237116196695, lr=0.3807138532906398
2023-12-06 05:28:41   INFO  epoch: 20/72, acc_iter=80190, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:17, time_cost(all): 18:33:26/1 day, 21:52:02, loss=0.506243116794737, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=0.9781980036112485, lr=0.38059988621527197
2023-12-06 05:29:22   INFO  epoch: 20/72, acc_iter=80240, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:51, time_cost(all): 18:34:07/1 day, 23:59:47, loss=0.506183919368796, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=3.0266315889044284, lr=0.38048591913990415
2023-12-06 05:30:04   INFO  epoch: 20/72, acc_iter=80290, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:56, time_cost(all): 18:34:49/1 day, 21:19:39, loss=0.506124721942856, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=0.8397273344794212, lr=0.3803719520645364
2023-12-06 05:30:46   INFO  epoch: 20/72, acc_iter=80340, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:16, time_cost(all): 18:35:31/1 day, 22:54:10, loss=0.506065524516915, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.185327802374146, lr=0.3802579849891686
2023-12-06 05:31:28   INFO  epoch: 20/72, acc_iter=80390, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:59, time_cost(all): 18:36:13/1 day, 23:48:01, loss=0.506006327090974, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=2.3568554908906445, lr=0.3801440179138008
2023-12-06 05:32:10   INFO  epoch: 20/72, acc_iter=80440, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:08, time_cost(all): 18:36:55/1 day, 22:41:45, loss=0.505947129665033, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=4.098831114947746, lr=0.380030050838433
2023-12-06 05:32:51   INFO  epoch: 20/72, acc_iter=80490, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:42, time_cost(all): 18:37:36/1 day, 22:08:09, loss=0.505887932239092, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=3.1841079441509645, lr=0.3799160837630652
2023-12-06 05:33:33   INFO  epoch: 20/72, acc_iter=80540, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:27, time_cost(all): 18:38:18/1 day, 22:52:13, loss=0.505828734813151, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=2.7764907915089316, lr=0.3798021166876974
2023-12-06 05:34:15   INFO  epoch: 20/72, acc_iter=80590, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:06, time_cost(all): 18:39:00/1 day, 22:49:16, loss=0.50576953738721, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=2.3679249817422217, lr=0.37968814961232966
2023-12-06 05:34:57   INFO  epoch: 20/72, acc_iter=80640, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:11, time_cost(all): 18:39:42/1 day, 23:55:24, loss=0.505710339961269, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=3.6927598075645283, lr=0.37957418253696185
2023-12-06 05:35:38   INFO  epoch: 20/72, acc_iter=80690, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:50, time_cost(all): 18:40:23/1 day, 19:53:55, loss=0.505651142535328, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=3.3260408546381726, lr=0.37946021546159403
2023-12-06 05:36:20   INFO  epoch: 20/72, acc_iter=80740, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:57, time_cost(all): 18:41:05/1 day, 22:56:58, loss=0.505591945109387, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=3.3800188100426536, lr=0.3793462483862262
2023-12-06 05:37:02   INFO  epoch: 20/72, acc_iter=80790, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:15, time_cost(all): 18:41:47/1 day, 23:03:14, loss=0.505532747683446, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=3.5659304999441663, lr=0.37923228131085845
2023-12-06 05:37:44   INFO  epoch: 20/72, acc_iter=80840, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:42, time_cost(all): 18:42:29/1 day, 23:06:01, loss=0.505473550257505, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=0.6112990163117813, lr=0.37911831423549064
2023-12-06 05:38:26   INFO  epoch: 20/72, acc_iter=80890, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:05, time_cost(all): 18:43:11/1 day, 21:36:14, loss=0.505414352831564, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=3.937561010379806, lr=0.3790043471601228
2023-12-06 05:39:07   INFO  epoch: 20/72, acc_iter=80940, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:13, time_cost(all): 18:43:52/1 day, 21:51:47, loss=0.505355155405623, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=4.2423477199061015, lr=0.37889038008475506
2023-12-06 05:39:49   INFO  epoch: 20/72, acc_iter=80990, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:36, time_cost(all): 18:44:34/1 day, 23:25:46, loss=0.505295957979682, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=2.872439002659667, lr=0.37877641300938725
2023-12-06 05:40:31   INFO  epoch: 20/72, acc_iter=81040, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 18:45:16/1 day, 19:30:22, loss=0.505236760553741, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=0.5028568059341365, lr=0.37866244593401943
2023-12-06 05:41:13   INFO  epoch: 20/72, acc_iter=81090, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 18:45:58/1 day, 19:31:33, loss=0.5051775631278, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=3.542802750611398, lr=0.37854847885865167
2023-12-06 05:41:54   INFO  epoch: 21/72, acc_iter=81152, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:31, time_cost(all): 18:46:39/1 day, 19:46:54, loss=0.505104158319634, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=1.799135102027252, lr=0.3784071596851956
2023-12-06 05:42:36   INFO  epoch: 21/72, acc_iter=81202, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:03, time_cost(all): 18:47:21/1 day, 22:41:45, loss=0.505044960893693, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.9906930754162655, lr=0.3782931926098278
2023-12-06 05:43:18   INFO  epoch: 21/72, acc_iter=81252, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:13, time_cost(all): 18:48:03/1 day, 23:52:41, loss=0.504985763467752, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=4.449178635342987, lr=0.37817922553446004
2023-12-06 05:44:00   INFO  epoch: 21/72, acc_iter=81302, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:52:06, time_cost(all): 18:48:45/1 day, 22:11:34, loss=0.504926566041811, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=2.4607420296545244, lr=0.3780652584590922
2023-12-06 05:44:42   INFO  epoch: 21/72, acc_iter=81352, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:29, time_cost(all): 18:49:27/1 day, 21:35:45, loss=0.50486736861587, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.759364886820901, lr=0.3779512913837244
2023-12-06 05:45:23   INFO  epoch: 21/72, acc_iter=81402, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:09, time_cost(all): 18:50:08/1 day, 19:48:12, loss=0.504808171189929, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=4.603134632652968, lr=0.37783732430835665
2023-12-06 05:46:05   INFO  epoch: 21/72, acc_iter=81452, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:26, time_cost(all): 18:50:50/1 day, 22:52:27, loss=0.504748973763988, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=1.41458468021425, lr=0.37772335723298883
2023-12-06 05:46:47   INFO  epoch: 21/72, acc_iter=81502, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:14, time_cost(all): 18:51:32/1 day, 21:49:36, loss=0.504689776338047, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=1.8527079245285558, lr=0.377609390157621
2023-12-06 05:47:29   INFO  epoch: 21/72, acc_iter=81552, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:31, time_cost(all): 18:52:14/1 day, 20:12:44, loss=0.504630578912106, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.5158341532834188, lr=0.37749542308225326
2023-12-06 05:48:10   INFO  epoch: 21/72, acc_iter=81602, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:06, time_cost(all): 18:52:55/1 day, 20:09:22, loss=0.504571381486165, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=4.875195120782388, lr=0.37738145600688544
2023-12-06 05:48:52   INFO  epoch: 21/72, acc_iter=81652, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:58, time_cost(all): 18:53:37/1 day, 22:47:51, loss=0.504512184060224, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=3.296049673540128, lr=0.3772674889315176
2023-12-06 05:49:34   INFO  epoch: 21/72, acc_iter=81702, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:25, time_cost(all): 18:54:19/1 day, 22:51:11, loss=0.504452986634283, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=4.070347703207004, lr=0.37715352185614986
2023-12-06 05:50:16   INFO  epoch: 21/72, acc_iter=81752, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:48, time_cost(all): 18:55:01/1 day, 21:50:51, loss=0.504393789208342, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=0.9569264201097326, lr=0.3770395547807821
2023-12-06 05:50:58   INFO  epoch: 21/72, acc_iter=81802, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:02, time_cost(all): 18:55:43/1 day, 22:28:16, loss=0.504334591782401, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=2.456225526418871, lr=0.3769255877054143
2023-12-06 05:51:39   INFO  epoch: 21/72, acc_iter=81852, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:36, time_cost(all): 18:56:24/1 day, 22:03:08, loss=0.504275394356461, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=1.913050826533648, lr=0.3768116206300465
2023-12-06 05:52:21   INFO  epoch: 21/72, acc_iter=81902, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:38, time_cost(all): 18:57:06/1 day, 19:30:50, loss=0.50421619693052, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=4.388801914838, lr=0.3766976535546787
2023-12-06 05:53:03   INFO  epoch: 21/72, acc_iter=81952, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:36, time_cost(all): 18:57:48/1 day, 20:17:03, loss=0.504156999504579, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=1.1460774489294132, lr=0.3765836864793109
2023-12-06 05:53:45   INFO  epoch: 21/72, acc_iter=82002, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:22, time_cost(all): 18:58:30/1 day, 20:03:51, loss=0.504097802078638, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=1.1313750833490797, lr=0.3764697194039431
2023-12-06 05:54:26   INFO  epoch: 21/72, acc_iter=82052, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:34, time_cost(all): 18:59:11/1 day, 21:35:49, loss=0.504038604652697, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=3.6602877986437243, lr=0.3763557523285753
2023-12-06 05:55:08   INFO  epoch: 21/72, acc_iter=82102, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:22, time_cost(all): 18:59:53/1 day, 21:05:42, loss=0.503979407226756, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=0.5520503414428217, lr=0.3762417852532075
2023-12-06 05:55:50   INFO  epoch: 21/72, acc_iter=82152, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:45, time_cost(all): 19:00:35/1 day, 20:05:30, loss=0.503920209800815, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=1.2996854540356282, lr=0.3761278181778397
2023-12-06 05:56:32   INFO  epoch: 21/72, acc_iter=82202, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:40:06, time_cost(all): 19:01:17/1 day, 21:06:23, loss=0.503861012374874, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=2.1288101670641053, lr=0.37601385110247193
2023-12-06 05:57:14   INFO  epoch: 21/72, acc_iter=82252, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:26, time_cost(all): 19:01:59/1 day, 19:21:46, loss=0.503801814948933, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=3.799117801587305, lr=0.3758998840271041
2023-12-06 05:57:55   INFO  epoch: 21/72, acc_iter=82302, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:52, time_cost(all): 19:02:40/1 day, 21:45:01, loss=0.503742617522992, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=2.5545187818411064, lr=0.37578591695173635
2023-12-06 05:58:37   INFO  epoch: 21/72, acc_iter=82352, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:19, time_cost(all): 19:03:22/1 day, 19:33:53, loss=0.503683420097051, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=2.842601055801342, lr=0.37567194987636854
2023-12-06 05:59:19   INFO  epoch: 21/72, acc_iter=82402, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:54, time_cost(all): 19:04:04/1 day, 22:17:42, loss=0.50362422267111, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=3.5573274678133977, lr=0.3755579828010007
2023-12-06 06:00:01   INFO  epoch: 21/72, acc_iter=82452, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:16, time_cost(all): 19:04:46/1 day, 23:12:40, loss=0.503565025245169, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=1.041557487616356, lr=0.37544401572563296
2023-12-06 06:00:43   INFO  epoch: 21/72, acc_iter=82502, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:35, time_cost(all): 19:05:28/1 day, 21:13:52, loss=0.503505827819228, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=4.79235729100141, lr=0.37533004865026515
2023-12-06 06:01:24   INFO  epoch: 21/72, acc_iter=82552, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:55, time_cost(all): 19:06:09/1 day, 23:25:45, loss=0.503446630393287, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=3.1210230905422716, lr=0.3752160815748974
2023-12-06 06:02:06   INFO  epoch: 21/72, acc_iter=82602, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:01, time_cost(all): 19:06:51/1 day, 20:12:53, loss=0.503387432967346, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=1.74966974770066, lr=0.37510211449952957
2023-12-06 06:02:48   INFO  epoch: 21/72, acc_iter=82652, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:15, time_cost(all): 19:07:33/1 day, 22:28:47, loss=0.503328235541405, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=3.1163693088834794, lr=0.37498814742416176
2023-12-06 06:03:30   INFO  epoch: 21/72, acc_iter=82702, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:33:01, time_cost(all): 19:08:15/1 day, 21:44:40, loss=0.503269038115464, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=4.189736891538738, lr=0.37487418034879394
2023-12-06 06:04:11   INFO  epoch: 21/72, acc_iter=82752, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:16, time_cost(all): 19:08:56/1 day, 20:48:36, loss=0.503209840689524, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=2.1652197751941484, lr=0.3747602132734262
2023-12-06 06:04:53   INFO  epoch: 21/72, acc_iter=82802, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:00, time_cost(all): 19:09:38/1 day, 20:01:30, loss=0.503150643263583, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=3.0357573141734644, lr=0.3746462461980584
2023-12-06 06:05:35   INFO  epoch: 21/72, acc_iter=82852, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:00, time_cost(all): 19:10:20/1 day, 23:25:28, loss=0.503091445837642, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=4.07689578719947, lr=0.3745322791226906
2023-12-06 06:06:17   INFO  epoch: 21/72, acc_iter=82902, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:23, time_cost(all): 19:11:02/1 day, 22:19:58, loss=0.503032248411701, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.15(1.03), norm=3.1259160901677387, lr=0.3744183120473228
2023-12-06 06:06:59   INFO  epoch: 21/72, acc_iter=82952, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:39, time_cost(all): 19:11:44/1 day, 23:17:32, loss=0.50297305098576, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=1.1701069992755535, lr=0.37430434497195497
2023-12-06 06:07:40   INFO  epoch: 21/72, acc_iter=83002, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:42, time_cost(all): 19:12:25/1 day, 20:35:37, loss=0.502913853559819, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=1.015171619704521, lr=0.3741903778965872
2023-12-06 06:08:22   INFO  epoch: 21/72, acc_iter=83052, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:25, time_cost(all): 19:13:07/1 day, 21:11:55, loss=0.502854656133878, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=4.161370523098872, lr=0.3740764108212194
2023-12-06 06:09:04   INFO  epoch: 21/72, acc_iter=83102, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:24:51, time_cost(all): 19:13:49/1 day, 21:58:30, loss=0.502795458707937, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=0.9609931775620995, lr=0.37396244374585164
2023-12-06 06:09:46   INFO  epoch: 21/72, acc_iter=83152, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:06, time_cost(all): 19:14:31/1 day, 21:44:36, loss=0.502736261281996, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=3.6408282092290865, lr=0.3738484766704838
2023-12-06 06:10:27   INFO  epoch: 21/72, acc_iter=83202, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:37, time_cost(all): 19:15:12/1 day, 22:15:01, loss=0.502677063856055, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=3.524969474437739, lr=0.373734509595116
2023-12-06 06:11:09   INFO  epoch: 21/72, acc_iter=83252, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:51, time_cost(all): 19:15:54/1 day, 22:36:47, loss=0.502617866430114, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=3.0739185731246694, lr=0.37362054251974824
2023-12-06 06:11:51   INFO  epoch: 21/72, acc_iter=83302, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:24:11, time_cost(all): 19:16:36/1 day, 21:24:57, loss=0.502558669004173, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=1.4656634461113096, lr=0.37350657544438043
2023-12-06 06:12:33   INFO  epoch: 21/72, acc_iter=83352, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:24, time_cost(all): 19:17:18/1 day, 23:23:01, loss=0.502499471578232, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.83(1.03), norm=3.214022588716472, lr=0.37339260836901267
2023-12-06 06:13:15   INFO  epoch: 21/72, acc_iter=83402, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:04, time_cost(all): 19:18:00/1 day, 20:10:40, loss=0.502440274152291, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=3.2215373378503065, lr=0.37327864129364485
2023-12-06 06:13:56   INFO  epoch: 21/72, acc_iter=83452, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:23, time_cost(all): 19:18:41/1 day, 19:30:42, loss=0.50238107672635, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=4.375464968396571, lr=0.37316467421827704
2023-12-06 06:14:38   INFO  epoch: 21/72, acc_iter=83502, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:10, time_cost(all): 19:19:23/1 day, 19:57:31, loss=0.502321879300409, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=1.0256830464885125, lr=0.3730507071429092
2023-12-06 06:15:20   INFO  epoch: 21/72, acc_iter=83552, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:13, time_cost(all): 19:20:05/1 day, 19:02:08, loss=0.502262681874468, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=4.103537307616125, lr=0.37293674006754146
2023-12-06 06:16:02   INFO  epoch: 21/72, acc_iter=83602, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:07, time_cost(all): 19:20:47/1 day, 23:09:11, loss=0.502203484448527, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.0151141476114431, lr=0.3728227729921737
2023-12-06 06:16:43   INFO  epoch: 21/72, acc_iter=83652, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:14, time_cost(all): 19:21:28/1 day, 19:07:59, loss=0.502144287022587, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=4.398950666428393, lr=0.3727088059168059
2023-12-06 06:17:25   INFO  epoch: 21/72, acc_iter=83702, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:26, time_cost(all): 19:22:10/1 day, 21:51:25, loss=0.502085089596646, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=4.692807240552273, lr=0.37259483884143807
2023-12-06 06:18:07   INFO  epoch: 21/72, acc_iter=83752, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:58, time_cost(all): 19:22:52/1 day, 20:03:35, loss=0.502025892170705, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=3.4624743516641856, lr=0.3724808717660703
2023-12-06 06:18:49   INFO  epoch: 21/72, acc_iter=83802, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:26, time_cost(all): 19:23:34/1 day, 19:29:16, loss=0.501966694744764, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.696855264589127, lr=0.3723669046907025
2023-12-06 06:19:31   INFO  epoch: 21/72, acc_iter=83852, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:02, time_cost(all): 19:24:16/1 day, 22:39:19, loss=0.501907497318823, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=2.6494544872506247, lr=0.37225293761533473
2023-12-06 06:20:12   INFO  epoch: 21/72, acc_iter=83902, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:20, time_cost(all): 19:24:57/1 day, 20:29:43, loss=0.501848299892882, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=4.934777176544439, lr=0.3721389705399669
2023-12-06 06:20:54   INFO  epoch: 21/72, acc_iter=83952, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:23, time_cost(all): 19:25:39/1 day, 19:02:29, loss=0.501789102466941, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=1.1672564205107014, lr=0.3720250034645991
2023-12-06 06:21:36   INFO  epoch: 21/72, acc_iter=84002, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:49, time_cost(all): 19:26:21/1 day, 20:25:51, loss=0.501729905041, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=0.5664120083616854, lr=0.3719110363892313
2023-12-06 06:22:18   INFO  epoch: 21/72, acc_iter=84052, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:17, time_cost(all): 19:27:03/1 day, 19:17:33, loss=0.501670707615059, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=0.9833583611391785, lr=0.3717970693138635
2023-12-06 06:22:59   INFO  epoch: 21/72, acc_iter=84102, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:00, time_cost(all): 19:27:44/1 day, 20:24:52, loss=0.501611510189118, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=0.5393073834846124, lr=0.37168310223849577
2023-12-06 06:23:41   INFO  epoch: 21/72, acc_iter=84152, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:26, time_cost(all): 19:28:26/1 day, 22:58:15, loss=0.501552312763177, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=4.823970388490129, lr=0.37156913516312795
2023-12-06 06:24:23   INFO  epoch: 21/72, acc_iter=84202, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:58, time_cost(all): 19:29:08/1 day, 21:18:44, loss=0.501493115337236, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.9645527056447176, lr=0.37145516808776013
2023-12-06 06:25:05   INFO  epoch: 21/72, acc_iter=84252, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:10, time_cost(all): 19:29:50/1 day, 22:41:36, loss=0.501433917911295, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=3.3351382271424637, lr=0.3713412010123923
2023-12-06 06:25:47   INFO  epoch: 21/72, acc_iter=84302, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:25, time_cost(all): 19:30:32/1 day, 21:11:02, loss=0.501374720485354, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=2.99532838618726, lr=0.37122723393702456
2023-12-06 06:26:28   INFO  epoch: 21/72, acc_iter=84352, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:19, time_cost(all): 19:31:13/1 day, 18:49:55, loss=0.501315523059413, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=1.4055315374355035, lr=0.37111326686165674
2023-12-06 06:27:10   INFO  epoch: 21/72, acc_iter=84402, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:32, time_cost(all): 19:31:55/1 day, 22:53:11, loss=0.501256325633472, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=3.6491379585981556, lr=0.370999299786289
2023-12-06 06:27:52   INFO  epoch: 21/72, acc_iter=84452, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:01, time_cost(all): 19:32:37/1 day, 19:12:30, loss=0.501197128207532, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=4.0172931942989765, lr=0.37088533271092117
2023-12-06 06:28:34   INFO  epoch: 21/72, acc_iter=84502, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:38, time_cost(all): 19:33:19/1 day, 19:31:54, loss=0.501137930781591, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=1.2151013552515775, lr=0.37077136563555335
2023-12-06 06:29:15   INFO  epoch: 21/72, acc_iter=84552, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:43, time_cost(all): 19:34:00/1 day, 22:12:36, loss=0.50107873335565, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=0.8588711336938267, lr=0.37065739856018554
2023-12-06 06:29:57   INFO  epoch: 21/72, acc_iter=84602, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:01, time_cost(all): 19:34:42/1 day, 22:01:51, loss=0.501019535929709, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=2.541563605783704, lr=0.37054343148481783
2023-12-06 06:30:39   INFO  epoch: 21/72, acc_iter=84652, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:26, time_cost(all): 19:35:24/1 day, 19:15:05, loss=0.500960338503768, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=4.459980149137557, lr=0.37042946440945
2023-12-06 06:31:21   INFO  epoch: 21/72, acc_iter=84702, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:45, time_cost(all): 19:36:06/1 day, 21:40:27, loss=0.500901141077827, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=0.594300488390175, lr=0.3703154973340822
2023-12-06 06:32:03   INFO  epoch: 21/72, acc_iter=84752, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:48, time_cost(all): 19:36:48/1 day, 21:11:53, loss=0.500841943651886, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=1.311635948127699, lr=0.3702015302587144
2023-12-06 06:32:44   INFO  epoch: 21/72, acc_iter=84802, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:18, time_cost(all): 19:37:29/1 day, 21:27:40, loss=0.500782746225945, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=2.016921291000884, lr=0.3700875631833466
2023-12-06 06:33:26   INFO  epoch: 21/72, acc_iter=84852, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:31, time_cost(all): 19:38:11/1 day, 23:05:28, loss=0.500723548800004, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=4.266408776235171, lr=0.3699735961079788
2023-12-06 06:34:08   INFO  epoch: 21/72, acc_iter=84902, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 19:38:53/1 day, 22:50:52, loss=0.500664351374063, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=2.903178940734686, lr=0.36985962903261105
2023-12-06 06:34:50   INFO  epoch: 21/72, acc_iter=84952, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 19:39:35/1 day, 18:39:06, loss=0.500605153948122, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=3.5809743741541267, lr=0.36974566195724323
2023-12-06 06:35:32   INFO  epoch: 22/72, acc_iter=85014, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:50:48, time_cost(all): 19:40:17/1 day, 22:08:16, loss=0.500531749139955, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=1.433595091933269, lr=0.3696043427837872
2023-12-06 06:36:13   INFO  epoch: 22/72, acc_iter=85064, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:31, time_cost(all): 19:40:58/1 day, 22:01:10, loss=0.500472551714014, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=1.255290109718866, lr=0.36949037570841936
2023-12-06 06:36:55   INFO  epoch: 22/72, acc_iter=85114, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:41, time_cost(all): 19:41:40/1 day, 18:52:37, loss=0.500413354288073, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=2.2784300481687563, lr=0.36937640863305155
2023-12-06 06:37:37   INFO  epoch: 22/72, acc_iter=85164, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:52:25, time_cost(all): 19:42:22/1 day, 21:04:29, loss=0.500354156862133, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=4.411111364700098, lr=0.36926244155768373
2023-12-06 06:38:19   INFO  epoch: 22/72, acc_iter=85214, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:03, time_cost(all): 19:43:04/1 day, 22:11:46, loss=0.500294959436192, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=4.443463870923986, lr=0.369148474482316
2023-12-06 06:39:00   INFO  epoch: 22/72, acc_iter=85264, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:52:03, time_cost(all): 19:43:45/1 day, 23:02:07, loss=0.500235762010251, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=2.2325906395868005, lr=0.3690345074069482
2023-12-06 06:39:42   INFO  epoch: 22/72, acc_iter=85314, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:44, time_cost(all): 19:44:27/1 day, 19:35:59, loss=0.50017656458431, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=3.566269698000715, lr=0.3689205403315804
2023-12-06 06:40:24   INFO  epoch: 22/72, acc_iter=85364, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:57, time_cost(all): 19:45:09/1 day, 18:43:32, loss=0.500117367158369, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=4.542200186294612, lr=0.3688065732562126
2023-12-06 06:41:06   INFO  epoch: 22/72, acc_iter=85414, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:45, time_cost(all): 19:45:51/1 day, 21:19:03, loss=0.500058169732428, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=4.608247601269684, lr=0.36869260618084476
2023-12-06 06:41:48   INFO  epoch: 22/72, acc_iter=85464, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:07, time_cost(all): 19:46:33/1 day, 20:14:00, loss=0.499998972306487, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=4.696649506728063, lr=0.368578639105477
2023-12-06 06:42:29   INFO  epoch: 22/72, acc_iter=85514, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:30, time_cost(all): 19:47:14/1 day, 22:59:27, loss=0.499939774880546, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=2.944534268888484, lr=0.3684646720301092
2023-12-06 06:43:11   INFO  epoch: 22/72, acc_iter=85564, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:42, time_cost(all): 19:47:56/1 day, 21:29:37, loss=0.499880577454605, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=1.4041595024520581, lr=0.3683507049547414
2023-12-06 06:43:53   INFO  epoch: 22/72, acc_iter=85614, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:12, time_cost(all): 19:48:38/1 day, 22:01:33, loss=0.499821380028664, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=2.9171052618445, lr=0.3682367378793736
2023-12-06 06:44:35   INFO  epoch: 22/72, acc_iter=85664, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:29, time_cost(all): 19:49:20/1 day, 22:49:27, loss=0.499762182602723, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=1.0601059447197527, lr=0.3681227708040058
2023-12-06 06:45:16   INFO  epoch: 22/72, acc_iter=85714, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:39, time_cost(all): 19:50:01/1 day, 22:04:00, loss=0.499702985176782, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.8074620090265276, lr=0.36800880372863803
2023-12-06 06:45:58   INFO  epoch: 22/72, acc_iter=85764, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:33, time_cost(all): 19:50:43/1 day, 20:16:25, loss=0.499643787750841, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=0.8302897952482515, lr=0.3678948366532703
2023-12-06 06:46:40   INFO  epoch: 22/72, acc_iter=85814, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:18, time_cost(all): 19:51:25/1 day, 21:55:48, loss=0.4995845903249, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=1.5458795063572874, lr=0.36778086957790246
2023-12-06 06:47:22   INFO  epoch: 22/72, acc_iter=85864, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:01, time_cost(all): 19:52:07/1 day, 20:41:25, loss=0.499525392898959, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=0.8173283414969632, lr=0.36766690250253464
2023-12-06 06:48:04   INFO  epoch: 22/72, acc_iter=85914, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:42:01, time_cost(all): 19:52:49/1 day, 22:26:06, loss=0.499466195473018, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=3.469914521856593, lr=0.3675529354271668
2023-12-06 06:48:45   INFO  epoch: 22/72, acc_iter=85964, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:37:54, time_cost(all): 19:53:30/1 day, 22:12:10, loss=0.499406998047077, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=0.7396366215389147, lr=0.36743896835179907
2023-12-06 06:49:27   INFO  epoch: 22/72, acc_iter=86014, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:41, time_cost(all): 19:54:12/1 day, 18:36:08, loss=0.499347800621137, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=4.095576152062763, lr=0.36732500127643125
2023-12-06 06:50:09   INFO  epoch: 22/72, acc_iter=86064, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:57, time_cost(all): 19:54:54/1 day, 20:32:04, loss=0.499288603195196, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=2.4186020634785708, lr=0.3672110342010635
2023-12-06 06:50:51   INFO  epoch: 22/72, acc_iter=86114, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:38, time_cost(all): 19:55:36/1 day, 22:16:32, loss=0.499229405769255, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=2.6091713534801233, lr=0.3670970671256957
2023-12-06 06:51:32   INFO  epoch: 22/72, acc_iter=86164, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:56, time_cost(all): 19:56:17/1 day, 21:51:41, loss=0.499170208343314, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.23(1.03), norm=1.8181684569649164, lr=0.36698310005032786
2023-12-06 06:52:14   INFO  epoch: 22/72, acc_iter=86214, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:38:10, time_cost(all): 19:56:59/1 day, 21:38:01, loss=0.499111010917373, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.3842827374098414, lr=0.3668691329749601
2023-12-06 06:52:56   INFO  epoch: 22/72, acc_iter=86264, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:37:03, time_cost(all): 19:57:41/1 day, 20:49:26, loss=0.499051813491432, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=1.6606011355562456, lr=0.3667551658995923
2023-12-06 06:53:38   INFO  epoch: 22/72, acc_iter=86314, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:20, time_cost(all): 19:58:23/1 day, 19:24:28, loss=0.498992616065491, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=4.191575879628974, lr=0.3666411988242245
2023-12-06 06:54:20   INFO  epoch: 22/72, acc_iter=86364, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:32:49, time_cost(all): 19:59:05/1 day, 19:18:55, loss=0.49893341863955, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.701501994758398, lr=0.3665272317488567
2023-12-06 06:55:01   INFO  epoch: 22/72, acc_iter=86414, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:57, time_cost(all): 19:59:46/1 day, 19:06:27, loss=0.498874221213609, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=4.332110258726571, lr=0.3664132646734889
2023-12-06 06:55:43   INFO  epoch: 22/72, acc_iter=86464, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:23, time_cost(all): 20:00:28/1 day, 18:48:54, loss=0.498815023787668, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=3.233367628995784, lr=0.3662992975981211
2023-12-06 06:56:25   INFO  epoch: 22/72, acc_iter=86514, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:53, time_cost(all): 20:01:10/1 day, 21:32:53, loss=0.498755826361727, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=3.643589164673508, lr=0.3661853305227533
2023-12-06 06:57:07   INFO  epoch: 22/72, acc_iter=86564, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:26, time_cost(all): 20:01:52/1 day, 19:22:02, loss=0.498696628935786, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.301437703354719, lr=0.36607136344738556
2023-12-06 06:57:48   INFO  epoch: 22/72, acc_iter=86614, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:45, time_cost(all): 20:02:33/1 day, 19:07:49, loss=0.498637431509845, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=0.5250512032317485, lr=0.36595739637201774
2023-12-06 06:58:30   INFO  epoch: 22/72, acc_iter=86664, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:57, time_cost(all): 20:03:15/1 day, 19:22:56, loss=0.498578234083904, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=1.162049555420362, lr=0.3658434292966499
2023-12-06 06:59:12   INFO  epoch: 22/72, acc_iter=86714, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:42, time_cost(all): 20:03:57/1 day, 21:44:29, loss=0.498519036657963, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=4.18255189653322, lr=0.3657294622212821
2023-12-06 06:59:54   INFO  epoch: 22/72, acc_iter=86764, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:50, time_cost(all): 20:04:39/1 day, 21:50:58, loss=0.498459839232022, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=3.8961231008009287, lr=0.36561549514591435
2023-12-06 07:00:36   INFO  epoch: 22/72, acc_iter=86814, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:41, time_cost(all): 20:05:21/1 day, 22:24:05, loss=0.498400641806081, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=4.812668870896414, lr=0.3655015280705466
2023-12-06 07:01:17   INFO  epoch: 22/72, acc_iter=86864, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:54, time_cost(all): 20:06:02/1 day, 21:33:15, loss=0.49834144438014, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=3.393424269374057, lr=0.3653875609951788
2023-12-06 07:01:59   INFO  epoch: 22/72, acc_iter=86914, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:03, time_cost(all): 20:06:44/1 day, 21:37:43, loss=0.4982822469542, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=1.9764992764705578, lr=0.36527359391981096
2023-12-06 07:02:41   INFO  epoch: 22/72, acc_iter=86964, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:27:01, time_cost(all): 20:07:26/1 day, 20:19:56, loss=0.498223049528259, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=3.318388257503752, lr=0.36515962684444314
2023-12-06 07:03:23   INFO  epoch: 22/72, acc_iter=87014, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:04, time_cost(all): 20:08:08/1 day, 19:14:10, loss=0.498163852102318, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=3.2780359038140485, lr=0.3650456597690754
2023-12-06 07:04:04   INFO  epoch: 22/72, acc_iter=87064, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:18, time_cost(all): 20:08:49/1 day, 20:08:05, loss=0.498104654676377, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=0.9956300414960602, lr=0.36493169269370757
2023-12-06 07:04:46   INFO  epoch: 22/72, acc_iter=87114, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:04, time_cost(all): 20:09:31/1 day, 19:36:06, loss=0.498045457250436, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=1.1087521768702988, lr=0.3648177256183398
2023-12-06 07:05:28   INFO  epoch: 22/72, acc_iter=87164, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:12, time_cost(all): 20:10:13/1 day, 21:43:57, loss=0.497986259824495, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=3.9529573959503357, lr=0.364703758542972
2023-12-06 07:06:10   INFO  epoch: 22/72, acc_iter=87214, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:35, time_cost(all): 20:10:55/1 day, 19:50:31, loss=0.497927062398554, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=3.1965715242279047, lr=0.3645897914676042
2023-12-06 07:06:52   INFO  epoch: 22/72, acc_iter=87264, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:41, time_cost(all): 20:11:37/1 day, 22:11:45, loss=0.497867864972613, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=2.843169318339935, lr=0.3644758243922364
2023-12-06 07:07:33   INFO  epoch: 22/72, acc_iter=87314, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:02, time_cost(all): 20:12:18/1 day, 20:34:21, loss=0.497808667546672, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=0.6412271685452653, lr=0.3643618573168686
2023-12-06 07:08:15   INFO  epoch: 22/72, acc_iter=87364, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:02, time_cost(all): 20:13:00/1 day, 20:33:14, loss=0.497749470120731, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=1.880043051150522, lr=0.36424789024150084
2023-12-06 07:08:57   INFO  epoch: 22/72, acc_iter=87414, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:34, time_cost(all): 20:13:42/1 day, 18:38:59, loss=0.49769027269479, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.4632106814133325, lr=0.364133923166133
2023-12-06 07:09:39   INFO  epoch: 22/72, acc_iter=87464, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:25, time_cost(all): 20:14:24/1 day, 21:29:01, loss=0.497631075268849, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=4.64779721267819, lr=0.3640199560907652
2023-12-06 07:10:21   INFO  epoch: 22/72, acc_iter=87514, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:25, time_cost(all): 20:15:06/1 day, 20:25:32, loss=0.497571877842908, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=2.490593527200458, lr=0.3639059890153974
2023-12-06 07:11:02   INFO  epoch: 22/72, acc_iter=87564, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:50, time_cost(all): 20:15:47/1 day, 20:37:49, loss=0.497512680416967, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=3.279407137944606, lr=0.36379202194002963
2023-12-06 07:11:44   INFO  epoch: 22/72, acc_iter=87614, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:14, time_cost(all): 20:16:29/1 day, 19:44:48, loss=0.497453482991026, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=4.047720308932109, lr=0.36367805486466187
2023-12-06 07:12:26   INFO  epoch: 22/72, acc_iter=87664, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:16, time_cost(all): 20:17:11/1 day, 18:55:09, loss=0.497394285565085, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=4.985206539906817, lr=0.36356408778929405
2023-12-06 07:13:08   INFO  epoch: 22/72, acc_iter=87714, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:59, time_cost(all): 20:17:53/1 day, 20:09:54, loss=0.497335088139145, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=3.1219629914786844, lr=0.36345012071392624
2023-12-06 07:13:49   INFO  epoch: 22/72, acc_iter=87764, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:29, time_cost(all): 20:18:34/1 day, 21:22:29, loss=0.497275890713204, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=2.36283485189451, lr=0.3633361536385585
2023-12-06 07:14:31   INFO  epoch: 22/72, acc_iter=87814, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:22, time_cost(all): 20:19:16/1 day, 22:03:55, loss=0.497216693287263, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=1.50198538037369, lr=0.36322218656319066
2023-12-06 07:15:13   INFO  epoch: 22/72, acc_iter=87864, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:26, time_cost(all): 20:19:58/1 day, 18:25:14, loss=0.497157495861322, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=3.23089542581446, lr=0.3631082194878229
2023-12-06 07:15:55   INFO  epoch: 22/72, acc_iter=87914, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:30, time_cost(all): 20:20:40/1 day, 18:27:07, loss=0.497098298435381, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=3.3290847479575185, lr=0.3629942524124551
2023-12-06 07:16:37   INFO  epoch: 22/72, acc_iter=87964, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:10, time_cost(all): 20:21:22/1 day, 21:31:00, loss=0.49703910100944, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=3.6540619085821433, lr=0.36288028533708727
2023-12-06 07:17:18   INFO  epoch: 22/72, acc_iter=88014, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:40, time_cost(all): 20:22:03/1 day, 19:01:17, loss=0.496979903583499, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=0.5035120725648767, lr=0.36276631826171946
2023-12-06 07:18:00   INFO  epoch: 22/72, acc_iter=88064, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:53, time_cost(all): 20:22:45/1 day, 22:16:40, loss=0.496920706157558, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=1.3009377070558563, lr=0.36265235118635164
2023-12-06 07:18:42   INFO  epoch: 22/72, acc_iter=88114, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:34, time_cost(all): 20:23:27/1 day, 22:03:08, loss=0.496861508731617, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=2.5281756760979897, lr=0.36253838411098394
2023-12-06 07:19:24   INFO  epoch: 22/72, acc_iter=88164, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:05, time_cost(all): 20:24:09/1 day, 20:23:28, loss=0.496802311305676, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=2.2695418106380547, lr=0.3624244170356161
2023-12-06 07:20:05   INFO  epoch: 22/72, acc_iter=88214, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:21, time_cost(all): 20:24:50/1 day, 22:12:19, loss=0.496743113879735, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=1.7337759465873748, lr=0.3623104499602483
2023-12-06 07:20:47   INFO  epoch: 22/72, acc_iter=88264, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:32, time_cost(all): 20:25:32/1 day, 21:49:06, loss=0.496683916453794, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=3.607955728737693, lr=0.3621964828848805
2023-12-06 07:21:29   INFO  epoch: 22/72, acc_iter=88314, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:09, time_cost(all): 20:26:14/1 day, 19:37:37, loss=0.496624719027853, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=1.7213946675331395, lr=0.36208251580951273
2023-12-06 07:22:11   INFO  epoch: 22/72, acc_iter=88364, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:41, time_cost(all): 20:26:56/1 day, 20:34:12, loss=0.496565521601912, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=3.2394515555900507, lr=0.3619685487341449
2023-12-06 07:22:53   INFO  epoch: 22/72, acc_iter=88414, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:51, time_cost(all): 20:27:38/1 day, 22:15:30, loss=0.496506324175971, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=3.4868946015451945, lr=0.36185458165877715
2023-12-06 07:23:34   INFO  epoch: 22/72, acc_iter=88464, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:00, time_cost(all): 20:28:19/1 day, 20:24:48, loss=0.49644712675003, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=3.840868464608713, lr=0.36174061458340934
2023-12-06 07:24:16   INFO  epoch: 22/72, acc_iter=88514, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:13, time_cost(all): 20:29:01/1 day, 21:51:34, loss=0.496387929324089, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=1.2201458916548105, lr=0.3616266475080415
2023-12-06 07:24:58   INFO  epoch: 22/72, acc_iter=88564, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:29, time_cost(all): 20:29:43/1 day, 21:34:38, loss=0.496328731898149, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=0.9920328951292593, lr=0.3615126804326737
2023-12-06 07:25:40   INFO  epoch: 22/72, acc_iter=88614, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:04, time_cost(all): 20:30:25/1 day, 20:07:37, loss=0.496269534472208, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=2.613390466570031, lr=0.361398713357306
2023-12-06 07:26:21   INFO  epoch: 22/72, acc_iter=88664, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:19, time_cost(all): 20:31:06/1 day, 18:23:34, loss=0.496210337046267, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=2.339816555492005, lr=0.3612847462819382
2023-12-06 07:27:03   INFO  epoch: 22/72, acc_iter=88714, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 20:31:48/1 day, 19:49:47, loss=0.496151139620326, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.873559121850865, lr=0.36117077920657037
2023-12-06 07:27:45   INFO  epoch: 22/72, acc_iter=88764, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 20:32:30/1 day, 21:25:04, loss=0.496091942194385, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=1.3457193785213288, lr=0.36105681213120255
2023-12-06 07:28:27   INFO  epoch: 22/72, acc_iter=88814, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 20:33:12/1 day, 18:50:34, loss=0.496032744768444, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=4.148296851160273, lr=0.36094284505583474
2023-12-06 07:29:09   INFO  epoch: 23/72, acc_iter=88876, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:17, time_cost(all): 20:33:54/1 day, 18:36:22, loss=0.495959339960277, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=3.227559604791556, lr=0.3608015258823787
2023-12-06 07:29:50   INFO  epoch: 23/72, acc_iter=88926, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:56, time_cost(all): 20:34:35/1 day, 17:48:38, loss=0.495900142534336, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=1.387270291296071, lr=0.3606875588070109
2023-12-06 07:30:32   INFO  epoch: 23/72, acc_iter=88976, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:58, time_cost(all): 20:35:17/1 day, 19:52:57, loss=0.495840945108395, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=4.7244259015916095, lr=0.3605735917316431
2023-12-06 07:31:14   INFO  epoch: 23/72, acc_iter=89026, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:54, time_cost(all): 20:35:59/1 day, 19:31:36, loss=0.495781747682454, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=4.9225691709102195, lr=0.36045962465627535
2023-12-06 07:31:56   INFO  epoch: 23/72, acc_iter=89076, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:07, time_cost(all): 20:36:41/1 day, 18:34:16, loss=0.495722550256513, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=0.9365291024655189, lr=0.36034565758090753
2023-12-06 07:32:37   INFO  epoch: 23/72, acc_iter=89126, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:41, time_cost(all): 20:37:22/1 day, 19:21:05, loss=0.495663352830572, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=4.639698346995734, lr=0.3602316905055397
2023-12-06 07:33:19   INFO  epoch: 23/72, acc_iter=89176, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:02, time_cost(all): 20:38:04/1 day, 21:51:05, loss=0.495604155404631, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.0904922553605014, lr=0.3601177234301719
2023-12-06 07:34:01   INFO  epoch: 23/72, acc_iter=89226, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:59, time_cost(all): 20:38:46/1 day, 17:59:35, loss=0.49554495797869, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=1.7692018735391337, lr=0.36000375635480414
2023-12-06 07:34:43   INFO  epoch: 23/72, acc_iter=89276, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:50, time_cost(all): 20:39:28/1 day, 22:00:41, loss=0.49548576055275, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=3.9321723492864993, lr=0.3598897892794364
2023-12-06 07:35:25   INFO  epoch: 23/72, acc_iter=89326, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:02, time_cost(all): 20:40:10/1 day, 20:32:41, loss=0.495426563126809, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=0.7398289002017404, lr=0.35977582220406856
2023-12-06 07:36:06   INFO  epoch: 23/72, acc_iter=89376, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:33, time_cost(all): 20:40:51/1 day, 21:48:15, loss=0.495367365700868, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=2.29956387893853, lr=0.35966185512870075
2023-12-06 07:36:48   INFO  epoch: 23/72, acc_iter=89426, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:14, time_cost(all): 20:41:33/1 day, 19:43:48, loss=0.495308168274927, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=3.950023706267749, lr=0.35954788805333293
2023-12-06 07:37:30   INFO  epoch: 23/72, acc_iter=89476, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:50, time_cost(all): 20:42:15/1 day, 18:41:43, loss=0.495248970848986, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.1887773810448388, lr=0.35943392097796517
2023-12-06 07:38:12   INFO  epoch: 23/72, acc_iter=89526, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:04, time_cost(all): 20:42:57/1 day, 19:45:24, loss=0.495189773423045, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=3.3092836865303075, lr=0.35931995390259736
2023-12-06 07:38:53   INFO  epoch: 23/72, acc_iter=89576, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:29, time_cost(all): 20:43:38/1 day, 21:51:30, loss=0.495130575997104, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=4.426411971831538, lr=0.3592059868272296
2023-12-06 07:39:35   INFO  epoch: 23/72, acc_iter=89626, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:31, time_cost(all): 20:44:20/1 day, 20:57:01, loss=0.495071378571163, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=4.360051290386181, lr=0.3590920197518618
2023-12-06 07:40:17   INFO  epoch: 23/72, acc_iter=89676, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:54, time_cost(all): 20:45:02/1 day, 19:46:47, loss=0.495012181145222, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=2.101749850754186, lr=0.35897805267649396
2023-12-06 07:40:59   INFO  epoch: 23/72, acc_iter=89726, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:47, time_cost(all): 20:45:44/1 day, 19:33:13, loss=0.494952983719281, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=3.0111631163271326, lr=0.3588640856011262
2023-12-06 07:41:41   INFO  epoch: 23/72, acc_iter=89776, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:34, time_cost(all): 20:46:26/1 day, 20:45:57, loss=0.49489378629334, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=4.711254608461127, lr=0.35875011852575844
2023-12-06 07:42:22   INFO  epoch: 23/72, acc_iter=89826, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:32, time_cost(all): 20:47:07/1 day, 17:36:16, loss=0.494834588867399, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=4.617785875297806, lr=0.35863615145039063
2023-12-06 07:43:04   INFO  epoch: 23/72, acc_iter=89876, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:04, time_cost(all): 20:47:49/1 day, 20:50:48, loss=0.494775391441458, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=2.5054840042145634, lr=0.3585221843750228
2023-12-06 07:43:46   INFO  epoch: 23/72, acc_iter=89926, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:14, time_cost(all): 20:48:31/1 day, 19:40:12, loss=0.494716194015517, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=1.0975422001928896, lr=0.358408217299655
2023-12-06 07:44:28   INFO  epoch: 23/72, acc_iter=89976, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:45, time_cost(all): 20:49:13/1 day, 17:44:51, loss=0.494656996589576, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=4.822328067392005, lr=0.35829425022428724
2023-12-06 07:45:10   INFO  epoch: 23/72, acc_iter=90026, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:49, time_cost(all): 20:49:55/1 day, 21:50:18, loss=0.494597799163635, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=3.266160057076109, lr=0.3581802831489194
2023-12-06 07:45:51   INFO  epoch: 23/72, acc_iter=90076, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:31, time_cost(all): 20:50:36/1 day, 19:07:25, loss=0.494538601737694, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=3.6665580498675823, lr=0.35806631607355166
2023-12-06 07:46:33   INFO  epoch: 23/72, acc_iter=90126, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:31, time_cost(all): 20:51:18/1 day, 20:58:55, loss=0.494479404311754, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=0.8511549982516684, lr=0.35795234899818384
2023-12-06 07:47:15   INFO  epoch: 23/72, acc_iter=90176, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:50, time_cost(all): 20:52:00/1 day, 20:55:15, loss=0.494420206885813, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=1.7768973830570072, lr=0.35783838192281603
2023-12-06 07:47:57   INFO  epoch: 23/72, acc_iter=90226, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:32, time_cost(all): 20:52:42/1 day, 21:34:18, loss=0.494361009459872, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=2.9601815168273524, lr=0.35772441484744827
2023-12-06 07:48:38   INFO  epoch: 23/72, acc_iter=90276, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:40, time_cost(all): 20:53:23/1 day, 21:14:38, loss=0.494301812033931, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=3.15721510250516, lr=0.35761044777208045
2023-12-06 07:49:20   INFO  epoch: 23/72, acc_iter=90326, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:50, time_cost(all): 20:54:05/1 day, 20:48:13, loss=0.49424261460799, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=0.6671372991475006, lr=0.3574964806967127
2023-12-06 07:50:02   INFO  epoch: 23/72, acc_iter=90376, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:03, time_cost(all): 20:54:47/1 day, 17:46:57, loss=0.494183417182049, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=2.969676875724658, lr=0.3573825136213449
2023-12-06 07:50:44   INFO  epoch: 23/72, acc_iter=90426, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:00, time_cost(all): 20:55:29/1 day, 21:21:26, loss=0.494124219756108, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=2.6486156177030082, lr=0.35726854654597706
2023-12-06 07:51:26   INFO  epoch: 23/72, acc_iter=90476, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:13, time_cost(all): 20:56:11/1 day, 17:51:29, loss=0.494065022330167, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=2.511596028384623, lr=0.35715457947060925
2023-12-06 07:52:07   INFO  epoch: 23/72, acc_iter=90526, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:33, time_cost(all): 20:56:52/1 day, 21:38:36, loss=0.494005824904226, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=0.7456449552490312, lr=0.3570406123952415
2023-12-06 07:52:49   INFO  epoch: 23/72, acc_iter=90576, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:14, time_cost(all): 20:57:34/1 day, 18:53:45, loss=0.493946627478285, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=1.87159860827698, lr=0.3569266453198737
2023-12-06 07:53:31   INFO  epoch: 23/72, acc_iter=90626, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:02, time_cost(all): 20:58:16/1 day, 19:03:20, loss=0.493887430052344, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=1.1495377310409225, lr=0.3568126782445059
2023-12-06 07:54:13   INFO  epoch: 23/72, acc_iter=90676, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:47, time_cost(all): 20:58:58/1 day, 18:52:01, loss=0.493828232626403, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=3.6870247567186305, lr=0.3566987111691381
2023-12-06 07:54:54   INFO  epoch: 23/72, acc_iter=90726, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:05, time_cost(all): 20:59:39/1 day, 20:58:41, loss=0.493769035200462, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.6986672984116638, lr=0.3565847440937703
2023-12-06 07:55:36   INFO  epoch: 23/72, acc_iter=90776, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:15, time_cost(all): 21:00:21/1 day, 20:54:05, loss=0.493709837774521, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=4.413531249752145, lr=0.3564707770184025
2023-12-06 07:56:18   INFO  epoch: 23/72, acc_iter=90826, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:57, time_cost(all): 21:01:03/1 day, 18:13:48, loss=0.49365064034858, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=4.058193161096458, lr=0.35635680994303476
2023-12-06 07:57:00   INFO  epoch: 23/72, acc_iter=90876, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:37, time_cost(all): 21:01:45/1 day, 18:13:44, loss=0.493591442922639, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=4.49491576318508, lr=0.35624284286766694
2023-12-06 07:57:42   INFO  epoch: 23/72, acc_iter=90926, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:18, time_cost(all): 21:02:27/1 day, 19:54:49, loss=0.493532245496698, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=2.5555266986840115, lr=0.3561288757922991
2023-12-06 07:58:23   INFO  epoch: 23/72, acc_iter=90976, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:01, time_cost(all): 21:03:08/1 day, 18:54:45, loss=0.493473048070758, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=2.630419042913074, lr=0.3560149087169313
2023-12-06 07:59:05   INFO  epoch: 23/72, acc_iter=91026, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:30, time_cost(all): 21:03:50/1 day, 17:49:43, loss=0.493413850644817, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=1.7279285886082163, lr=0.3559009416415635
2023-12-06 07:59:47   INFO  epoch: 23/72, acc_iter=91076, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:43, time_cost(all): 21:04:32/1 day, 17:34:21, loss=0.493354653218876, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=4.196225527805749, lr=0.3557869745661958
2023-12-06 08:00:29   INFO  epoch: 23/72, acc_iter=91126, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:38, time_cost(all): 21:05:14/1 day, 19:26:17, loss=0.493295455792935, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=0.5253296714561764, lr=0.355673007490828
2023-12-06 08:01:10   INFO  epoch: 23/72, acc_iter=91176, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:16, time_cost(all): 21:05:55/1 day, 20:56:53, loss=0.493236258366994, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=0.6306643391134488, lr=0.35555904041546016
2023-12-06 08:01:52   INFO  epoch: 23/72, acc_iter=91226, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:30, time_cost(all): 21:06:37/1 day, 20:06:44, loss=0.493177060941053, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=3.2154011448159654, lr=0.35544507334009234
2023-12-06 08:02:34   INFO  epoch: 23/72, acc_iter=91276, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:19, time_cost(all): 21:07:19/1 day, 17:22:38, loss=0.493117863515112, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=1.4773436766111883, lr=0.3553311062647246
2023-12-06 08:03:16   INFO  epoch: 23/72, acc_iter=91326, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:16, time_cost(all): 21:08:01/1 day, 20:53:51, loss=0.493058666089171, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=3.36822582246049, lr=0.35521713918935677
2023-12-06 08:03:58   INFO  epoch: 23/72, acc_iter=91376, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:19:02, time_cost(all): 21:08:43/1 day, 19:32:15, loss=0.49299946866323, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=0.55138830101162, lr=0.355103172113989
2023-12-06 08:04:39   INFO  epoch: 23/72, acc_iter=91426, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:42, time_cost(all): 21:09:24/1 day, 18:30:00, loss=0.492940271237289, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=1.9021611372921692, lr=0.3549892050386212
2023-12-06 08:05:21   INFO  epoch: 23/72, acc_iter=91476, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:07, time_cost(all): 21:10:06/1 day, 19:00:26, loss=0.492881073811348, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=3.2382248684779227, lr=0.3548752379632534
2023-12-06 08:06:03   INFO  epoch: 23/72, acc_iter=91526, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:26, time_cost(all): 21:10:48/1 day, 20:55:15, loss=0.492821876385407, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.05(1.03), norm=2.124659942617972, lr=0.35476127088788556
2023-12-06 08:06:45   INFO  epoch: 23/72, acc_iter=91576, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:09, time_cost(all): 21:11:30/1 day, 19:14:30, loss=0.492762678959466, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=2.3600992529552145, lr=0.3546473038125178
2023-12-06 08:07:26   INFO  epoch: 23/72, acc_iter=91626, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:30, time_cost(all): 21:12:11/1 day, 20:20:56, loss=0.492703481533525, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=0.9720904348180532, lr=0.35453333673715004
2023-12-06 08:08:08   INFO  epoch: 23/72, acc_iter=91676, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:52, time_cost(all): 21:12:53/1 day, 17:48:17, loss=0.492644284107584, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=3.849639214990072, lr=0.3544193696617822
2023-12-06 08:08:50   INFO  epoch: 23/72, acc_iter=91726, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:50, time_cost(all): 21:13:35/1 day, 19:41:41, loss=0.492585086681643, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=2.539769303656516, lr=0.3543054025864144
2023-12-06 08:09:32   INFO  epoch: 23/72, acc_iter=91776, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:26, time_cost(all): 21:14:17/1 day, 21:18:14, loss=0.492525889255702, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=3.6539595419293436, lr=0.3541914355110466
2023-12-06 08:10:14   INFO  epoch: 23/72, acc_iter=91826, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:06, time_cost(all): 21:14:59/1 day, 18:07:30, loss=0.492466691829762, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=3.16083436192071, lr=0.35407746843567883
2023-12-06 08:10:55   INFO  epoch: 23/72, acc_iter=91876, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:47, time_cost(all): 21:15:40/1 day, 19:43:10, loss=0.492407494403821, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=2.101485761073899, lr=0.353963501360311
2023-12-06 08:11:37   INFO  epoch: 23/72, acc_iter=91926, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:50, time_cost(all): 21:16:22/1 day, 20:30:02, loss=0.49234829697788, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=4.885331122451954, lr=0.35384953428494326
2023-12-06 08:12:19   INFO  epoch: 23/72, acc_iter=91976, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:46, time_cost(all): 21:17:04/1 day, 18:56:52, loss=0.492289099551939, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=4.155561034523208, lr=0.35373556720957544
2023-12-06 08:13:01   INFO  epoch: 23/72, acc_iter=92026, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:07, time_cost(all): 21:17:46/1 day, 17:30:45, loss=0.492229902125998, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=3.3486637541688498, lr=0.3536216001342076
2023-12-06 08:13:42   INFO  epoch: 23/72, acc_iter=92076, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:53, time_cost(all): 21:18:27/1 day, 20:20:05, loss=0.492170704700057, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=3.9113832526581547, lr=0.3535076330588398
2023-12-06 08:14:24   INFO  epoch: 23/72, acc_iter=92126, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:43, time_cost(all): 21:19:09/1 day, 17:26:51, loss=0.492111507274116, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.07(1.03), norm=3.8117331726957246, lr=0.3533936659834721
2023-12-06 08:15:06   INFO  epoch: 23/72, acc_iter=92176, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:29, time_cost(all): 21:19:51/1 day, 17:58:27, loss=0.492052309848175, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=1.5931270671160138, lr=0.3532796989081043
2023-12-06 08:15:48   INFO  epoch: 23/72, acc_iter=92226, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:41, time_cost(all): 21:20:33/1 day, 17:59:13, loss=0.491993112422234, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=0.7246869685606137, lr=0.3531657318327365
2023-12-06 08:16:30   INFO  epoch: 23/72, acc_iter=92276, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:31, time_cost(all): 21:21:15/1 day, 19:50:10, loss=0.491933914996293, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=1.1943556571717027, lr=0.35305176475736866
2023-12-06 08:17:11   INFO  epoch: 23/72, acc_iter=92326, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:09, time_cost(all): 21:21:56/1 day, 18:32:37, loss=0.491874717570352, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=1.6656918429630458, lr=0.3529377976820009
2023-12-06 08:17:53   INFO  epoch: 23/72, acc_iter=92376, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:11, time_cost(all): 21:22:38/1 day, 19:14:59, loss=0.491815520144411, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=0.8174294897890387, lr=0.3528238306066331
2023-12-06 08:18:35   INFO  epoch: 23/72, acc_iter=92426, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:40, time_cost(all): 21:23:20/1 day, 18:23:54, loss=0.49175632271847, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=2.3622595180919106, lr=0.3527098635312653
2023-12-06 08:19:17   INFO  epoch: 23/72, acc_iter=92476, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:02, time_cost(all): 21:24:02/1 day, 18:51:50, loss=0.491697125292529, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=1.9742399282844267, lr=0.3525958964558975
2023-12-06 08:19:59   INFO  epoch: 23/72, acc_iter=92526, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:19, time_cost(all): 21:24:44/1 day, 20:15:44, loss=0.491637927866588, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=3.6504343617968833, lr=0.3524819293805297
2023-12-06 08:20:40   INFO  epoch: 23/72, acc_iter=92576, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:37, time_cost(all): 21:25:25/1 day, 19:33:21, loss=0.491578730440647, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=0.7024410846446474, lr=0.3523679623051619
2023-12-06 08:21:22   INFO  epoch: 23/72, acc_iter=92626, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 21:26:07/1 day, 17:11:55, loss=0.491519533014706, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=3.7993674640373434, lr=0.3522539952297941
2023-12-06 08:22:04   INFO  epoch: 23/72, acc_iter=92676, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 21:26:49/1 day, 17:44:19, loss=0.491460335588766, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=2.44070027520101, lr=0.35214002815442635
2023-12-06 08:22:46   INFO  epoch: 24/72, acc_iter=92738, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:25, time_cost(all): 21:27:31/1 day, 17:55:12, loss=0.491386930780599, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=3.2738611304800735, lr=0.3519987089809703
2023-12-06 08:23:27   INFO  epoch: 24/72, acc_iter=92788, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:39, time_cost(all): 21:28:12/1 day, 19:17:51, loss=0.491327733354658, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=4.209229963083917, lr=0.3518847419056025
2023-12-06 08:24:09   INFO  epoch: 24/72, acc_iter=92838, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:03, time_cost(all): 21:28:54/1 day, 18:42:04, loss=0.491268535928717, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=2.8112185522951205, lr=0.35177077483023467
2023-12-06 08:24:51   INFO  epoch: 24/72, acc_iter=92888, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:43, time_cost(all): 21:29:36/1 day, 17:53:30, loss=0.491209338502776, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.2086100976095673, lr=0.35165680775486685
2023-12-06 08:25:33   INFO  epoch: 24/72, acc_iter=92938, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:47, time_cost(all): 21:30:18/1 day, 17:46:09, loss=0.491150141076835, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=2.658585009529168, lr=0.3515428406794991
2023-12-06 08:26:15   INFO  epoch: 24/72, acc_iter=92988, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:51, time_cost(all): 21:31:00/1 day, 18:56:30, loss=0.491090943650894, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=3.2279484211881755, lr=0.3514288736041313
2023-12-06 08:26:56   INFO  epoch: 24/72, acc_iter=93038, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:47, time_cost(all): 21:31:41/1 day, 19:22:52, loss=0.491031746224953, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=3.852904132538615, lr=0.3513149065287635
2023-12-06 08:27:38   INFO  epoch: 24/72, acc_iter=93088, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:50, time_cost(all): 21:32:23/1 day, 19:13:21, loss=0.490972548799012, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=0.608702012198584, lr=0.3512009394533957
2023-12-06 08:28:20   INFO  epoch: 24/72, acc_iter=93138, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:24, time_cost(all): 21:33:05/1 day, 19:18:44, loss=0.490913351373071, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=2.7960666548691644, lr=0.3510869723780279
2023-12-06 08:29:02   INFO  epoch: 24/72, acc_iter=93188, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:57, time_cost(all): 21:33:47/1 day, 20:11:16, loss=0.49085415394713, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=1.8102454360948679, lr=0.35097300530266007
2023-12-06 08:29:43   INFO  epoch: 24/72, acc_iter=93238, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:44, time_cost(all): 21:34:28/1 day, 20:38:53, loss=0.490794956521189, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=4.884918575056154, lr=0.3508590382272923
2023-12-06 08:30:25   INFO  epoch: 24/72, acc_iter=93288, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:47, time_cost(all): 21:35:10/1 day, 16:52:18, loss=0.490735759095248, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=1.8961882238647583, lr=0.35074507115192455
2023-12-06 08:31:07   INFO  epoch: 24/72, acc_iter=93338, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:14, time_cost(all): 21:35:52/1 day, 19:43:33, loss=0.490676561669307, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=1.5009151745368199, lr=0.35063110407655673
2023-12-06 08:31:49   INFO  epoch: 24/72, acc_iter=93388, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:46, time_cost(all): 21:36:34/1 day, 18:57:47, loss=0.490617364243367, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.3567873762249665, lr=0.3505171370011889
2023-12-06 08:32:31   INFO  epoch: 24/72, acc_iter=93438, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:41, time_cost(all): 21:37:16/1 day, 18:45:34, loss=0.490558166817426, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=3.5317652069052516, lr=0.3504031699258211
2023-12-06 08:33:12   INFO  epoch: 24/72, acc_iter=93488, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:31, time_cost(all): 21:37:57/1 day, 19:29:10, loss=0.490498969391485, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=3.438122654047812, lr=0.35028920285045334
2023-12-06 08:33:54   INFO  epoch: 24/72, acc_iter=93538, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:48, time_cost(all): 21:38:39/1 day, 20:08:31, loss=0.490439771965544, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=1.517303605348658, lr=0.3501752357750855
2023-12-06 08:34:36   INFO  epoch: 24/72, acc_iter=93588, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:13, time_cost(all): 21:39:21/1 day, 16:49:31, loss=0.490380574539603, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=2.6837274888427283, lr=0.35006126869971776
2023-12-06 08:35:18   INFO  epoch: 24/72, acc_iter=93638, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:55, time_cost(all): 21:40:03/1 day, 20:47:37, loss=0.490321377113662, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=1.694044856284, lr=0.34994730162434995
2023-12-06 08:35:59   INFO  epoch: 24/72, acc_iter=93688, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:37:57, time_cost(all): 21:40:44/1 day, 17:47:14, loss=0.490262179687721, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=1.2451080332339983, lr=0.34983333454898213
2023-12-06 08:36:41   INFO  epoch: 24/72, acc_iter=93738, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:06, time_cost(all): 21:41:26/1 day, 17:27:30, loss=0.49020298226178, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=2.420135088977769, lr=0.3497193674736143
2023-12-06 08:37:23   INFO  epoch: 24/72, acc_iter=93788, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:40:14, time_cost(all): 21:42:08/1 day, 19:22:31, loss=0.490143784835839, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=3.8451662586033692, lr=0.34960540039824656
2023-12-06 08:38:05   INFO  epoch: 24/72, acc_iter=93838, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:30, time_cost(all): 21:42:50/1 day, 20:08:25, loss=0.490084587409898, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=4.898386805729463, lr=0.3494914333228788
2023-12-06 08:38:47   INFO  epoch: 24/72, acc_iter=93888, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:14, time_cost(all): 21:43:32/1 day, 17:13:47, loss=0.490025389983957, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=2.4595454646510087, lr=0.349377466247511
2023-12-06 08:39:28   INFO  epoch: 24/72, acc_iter=93938, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:48, time_cost(all): 21:44:13/1 day, 17:59:56, loss=0.489966192558016, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=3.850383499531252, lr=0.34926349917214317
2023-12-06 08:40:10   INFO  epoch: 24/72, acc_iter=93988, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:37:03, time_cost(all): 21:44:55/1 day, 19:25:15, loss=0.489906995132075, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=2.531320511972974, lr=0.34914953209677535
2023-12-06 08:40:52   INFO  epoch: 24/72, acc_iter=94038, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:01, time_cost(all): 21:45:37/1 day, 20:09:10, loss=0.489847797706134, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=3.371596557427471, lr=0.3490355650214076
2023-12-06 08:41:34   INFO  epoch: 24/72, acc_iter=94088, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:48, time_cost(all): 21:46:19/1 day, 19:05:10, loss=0.489788600280193, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=1.9227359853087878, lr=0.3489215979460398
2023-12-06 08:42:15   INFO  epoch: 24/72, acc_iter=94138, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:35, time_cost(all): 21:47:00/1 day, 19:56:23, loss=0.489729402854252, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=1.384348261426033, lr=0.348807630870672
2023-12-06 08:42:57   INFO  epoch: 24/72, acc_iter=94188, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:47, time_cost(all): 21:47:42/1 day, 17:45:32, loss=0.489670205428311, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=4.450056875812706, lr=0.3486936637953042
2023-12-06 08:43:39   INFO  epoch: 24/72, acc_iter=94238, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:53, time_cost(all): 21:48:24/1 day, 18:06:18, loss=0.489611008002371, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=3.202255805311081, lr=0.3485796967199364
2023-12-06 08:44:21   INFO  epoch: 24/72, acc_iter=94288, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:40, time_cost(all): 21:49:06/1 day, 19:33:31, loss=0.48955181057643, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=4.3784938912454265, lr=0.34846572964456857
2023-12-06 08:45:03   INFO  epoch: 24/72, acc_iter=94338, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:38, time_cost(all): 21:49:48/1 day, 17:03:31, loss=0.489492613150489, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=2.1017948665015393, lr=0.3483517625692008
2023-12-06 08:45:44   INFO  epoch: 24/72, acc_iter=94388, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:44, time_cost(all): 21:50:29/1 day, 19:07:58, loss=0.489433415724548, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=2.30461259378531, lr=0.34823779549383305
2023-12-06 08:46:26   INFO  epoch: 24/72, acc_iter=94438, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:58, time_cost(all): 21:51:11/1 day, 18:21:24, loss=0.489374218298607, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=4.208092919608601, lr=0.34812382841846523
2023-12-06 08:47:08   INFO  epoch: 24/72, acc_iter=94488, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:30:06, time_cost(all): 21:51:53/1 day, 18:33:42, loss=0.489315020872666, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=1.0587425461554294, lr=0.3480098613430974
2023-12-06 08:47:50   INFO  epoch: 24/72, acc_iter=94538, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:29, time_cost(all): 21:52:35/1 day, 20:14:06, loss=0.489255823446725, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=2.812593685163661, lr=0.34789589426772966
2023-12-06 08:48:31   INFO  epoch: 24/72, acc_iter=94588, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:19, time_cost(all): 21:53:16/1 day, 17:50:25, loss=0.489196626020784, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=1.6780079234589325, lr=0.34778192719236184
2023-12-06 08:49:13   INFO  epoch: 24/72, acc_iter=94638, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:21, time_cost(all): 21:53:58/1 day, 20:05:36, loss=0.489137428594843, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=4.380742061032429, lr=0.3476679601169941
2023-12-06 08:49:55   INFO  epoch: 24/72, acc_iter=94688, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:51, time_cost(all): 21:54:40/1 day, 19:34:31, loss=0.489078231168902, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=4.023526917284215, lr=0.34755399304162626
2023-12-06 08:50:37   INFO  epoch: 24/72, acc_iter=94738, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:28, time_cost(all): 21:55:22/1 day, 19:54:26, loss=0.489019033742961, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=1.788226369077996, lr=0.34744002596625845
2023-12-06 08:51:19   INFO  epoch: 24/72, acc_iter=94788, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:03, time_cost(all): 21:56:04/1 day, 19:55:03, loss=0.48895983631702, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=4.814903823754916, lr=0.34732605889089063
2023-12-06 08:52:00   INFO  epoch: 24/72, acc_iter=94838, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:09, time_cost(all): 21:56:45/1 day, 17:27:28, loss=0.488900638891079, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=1.2782798471395025, lr=0.34721209181552287
2023-12-06 08:52:42   INFO  epoch: 24/72, acc_iter=94888, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:12, time_cost(all): 21:57:27/1 day, 17:32:13, loss=0.488841441465138, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=1.5300764332825318, lr=0.3470981247401551
2023-12-06 08:53:24   INFO  epoch: 24/72, acc_iter=94938, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:49, time_cost(all): 21:58:09/1 day, 18:15:19, loss=0.488782244039197, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=3.595308357485306, lr=0.3469841576647873
2023-12-06 08:54:06   INFO  epoch: 24/72, acc_iter=94988, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:03, time_cost(all): 21:58:51/1 day, 18:04:25, loss=0.488723046613256, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=0.9218746446131676, lr=0.3468701905894195
2023-12-06 08:54:48   INFO  epoch: 24/72, acc_iter=95038, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:35, time_cost(all): 21:59:33/1 day, 20:22:46, loss=0.488663849187315, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=1.6715252240629048, lr=0.34675622351405166
2023-12-06 08:55:29   INFO  epoch: 24/72, acc_iter=95088, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:15, time_cost(all): 22:00:14/1 day, 17:55:43, loss=0.488604651761375, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=2.81648332402287, lr=0.3466422564386839
2023-12-06 08:56:11   INFO  epoch: 24/72, acc_iter=95138, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:46, time_cost(all): 22:00:56/1 day, 20:33:17, loss=0.488545454335434, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=1.1538008794606844, lr=0.3465282893633161
2023-12-06 08:56:53   INFO  epoch: 24/72, acc_iter=95188, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:55, time_cost(all): 22:01:38/1 day, 19:14:30, loss=0.488486256909493, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=2.727970367184946, lr=0.34641432228794833
2023-12-06 08:57:35   INFO  epoch: 24/72, acc_iter=95238, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:50, time_cost(all): 22:02:20/1 day, 17:43:07, loss=0.488427059483552, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=1.2189088251831088, lr=0.3463003552125805
2023-12-06 08:58:16   INFO  epoch: 24/72, acc_iter=95288, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:59, time_cost(all): 22:03:01/1 day, 20:00:01, loss=0.488367862057611, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=0.7402436829213146, lr=0.3461863881372127
2023-12-06 08:58:58   INFO  epoch: 24/72, acc_iter=95338, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:22, time_cost(all): 22:03:43/1 day, 19:40:33, loss=0.48830866463167, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=1.491452400315783, lr=0.3460724210618449
2023-12-06 08:59:40   INFO  epoch: 24/72, acc_iter=95388, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:27, time_cost(all): 22:04:25/1 day, 18:41:49, loss=0.488249467205729, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.05(1.03), norm=3.5409536571176092, lr=0.3459584539864772
2023-12-06 09:00:22   INFO  epoch: 24/72, acc_iter=95438, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:47, time_cost(all): 22:05:07/1 day, 19:11:59, loss=0.488190269779788, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=2.0785014631822882, lr=0.34584448691110936
2023-12-06 09:01:04   INFO  epoch: 24/72, acc_iter=95488, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:16, time_cost(all): 22:05:49/1 day, 18:27:24, loss=0.488131072353847, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=2.2456252282330555, lr=0.34573051983574155
2023-12-06 09:01:45   INFO  epoch: 24/72, acc_iter=95538, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:03, time_cost(all): 22:06:30/1 day, 19:16:57, loss=0.488071874927906, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=3.980689372077488, lr=0.34561655276037373
2023-12-06 09:02:27   INFO  epoch: 24/72, acc_iter=95588, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:47, time_cost(all): 22:07:12/1 day, 17:48:13, loss=0.488012677501965, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=0.6137663799136672, lr=0.34550258568500597
2023-12-06 09:03:09   INFO  epoch: 24/72, acc_iter=95638, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:11, time_cost(all): 22:07:54/1 day, 18:53:42, loss=0.487953480076024, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.51618270275823, lr=0.34538861860963815
2023-12-06 09:03:51   INFO  epoch: 24/72, acc_iter=95688, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:58, time_cost(all): 22:08:36/1 day, 16:40:30, loss=0.487894282650083, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=4.376214202575734, lr=0.3452746515342704
2023-12-06 09:04:32   INFO  epoch: 24/72, acc_iter=95738, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:52, time_cost(all): 22:09:17/1 day, 16:21:20, loss=0.487835085224142, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=4.325390936186085, lr=0.3451606844589026
2023-12-06 09:05:14   INFO  epoch: 24/72, acc_iter=95788, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:11:02, time_cost(all): 22:09:59/1 day, 17:07:07, loss=0.487775887798201, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=2.644520952621991, lr=0.34504671738353476
2023-12-06 09:05:56   INFO  epoch: 24/72, acc_iter=95838, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:31, time_cost(all): 22:10:41/1 day, 18:31:20, loss=0.48771669037226, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=2.648746042920361, lr=0.34493275030816695
2023-12-06 09:06:38   INFO  epoch: 24/72, acc_iter=95888, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:54, time_cost(all): 22:11:23/1 day, 20:07:04, loss=0.487657492946319, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=2.7203123743790303, lr=0.3448187832327992
2023-12-06 09:07:20   INFO  epoch: 24/72, acc_iter=95938, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:55, time_cost(all): 22:12:05/1 day, 16:21:27, loss=0.487598295520379, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=4.335703403891276, lr=0.3447048161574314
2023-12-06 09:08:01   INFO  epoch: 24/72, acc_iter=95988, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:38, time_cost(all): 22:12:46/1 day, 19:51:32, loss=0.487539098094438, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=2.7349877326456284, lr=0.3445908490820636
2023-12-06 09:08:43   INFO  epoch: 24/72, acc_iter=96038, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:03, time_cost(all): 22:13:28/1 day, 20:25:22, loss=0.487479900668497, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.846434348081628, lr=0.3444768820066958
2023-12-06 09:09:25   INFO  epoch: 24/72, acc_iter=96088, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:11, time_cost(all): 22:14:10/1 day, 16:21:16, loss=0.487420703242556, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=2.18399684009158, lr=0.344362914931328
2023-12-06 09:10:07   INFO  epoch: 24/72, acc_iter=96138, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:35, time_cost(all): 22:14:52/1 day, 18:18:52, loss=0.487361505816615, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=1.4335188287609752, lr=0.3442489478559602
2023-12-06 09:10:48   INFO  epoch: 24/72, acc_iter=96188, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:14, time_cost(all): 22:15:33/1 day, 17:55:12, loss=0.487302308390674, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=2.6558537947943015, lr=0.3441349807805924
2023-12-06 09:11:30   INFO  epoch: 24/72, acc_iter=96238, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:13, time_cost(all): 22:16:15/1 day, 16:18:54, loss=0.487243110964733, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=2.5220374191211916, lr=0.34402101370522464
2023-12-06 09:12:12   INFO  epoch: 24/72, acc_iter=96288, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:43, time_cost(all): 22:16:57/1 day, 18:57:59, loss=0.487183913538792, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=1.4989054389104561, lr=0.3439070466298568
2023-12-06 09:12:54   INFO  epoch: 24/72, acc_iter=96338, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:58, time_cost(all): 22:17:39/1 day, 18:48:00, loss=0.487124716112851, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=3.9762547077564925, lr=0.343793079554489
2023-12-06 09:13:36   INFO  epoch: 24/72, acc_iter=96388, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:11, time_cost(all): 22:18:21/1 day, 17:37:51, loss=0.48706551868691, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=1.2031013530351813, lr=0.3436791124791212
2023-12-06 09:14:17   INFO  epoch: 24/72, acc_iter=96438, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 22:19:02/1 day, 18:43:34, loss=0.487006321260969, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=4.7095134889527115, lr=0.3435651454037535
2023-12-06 09:14:59   INFO  epoch: 24/72, acc_iter=96488, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 22:19:44/1 day, 18:32:57, loss=0.486947123835028, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=2.112207383445183, lr=0.3434511783283857
2023-12-06 09:15:41   INFO  epoch: 24/72, acc_iter=96538, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 22:20:26/1 day, 17:05:36, loss=0.486887926409087, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=1.4578420234114593, lr=0.34333721125301786
2023-12-06 09:16:23   INFO  epoch: 25/72, acc_iter=96600, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:41, time_cost(all): 22:21:08/1 day, 18:38:00, loss=0.48681452160092, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=2.866495714255304, lr=0.3431958920795618
2023-12-06 09:17:04   INFO  epoch: 25/72, acc_iter=96650, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:31, time_cost(all): 22:21:49/1 day, 19:12:46, loss=0.48675532417498, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=1.161908545106694, lr=0.343081925004194
2023-12-06 09:17:46   INFO  epoch: 25/72, acc_iter=96700, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:41, time_cost(all): 22:22:31/1 day, 19:10:16, loss=0.486696126749039, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=0.5775105255071316, lr=0.3429679579288262
2023-12-06 09:18:28   INFO  epoch: 25/72, acc_iter=96750, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:52:26, time_cost(all): 22:23:13/1 day, 18:43:04, loss=0.486636929323098, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=1.2793132075373528, lr=0.3428539908534584
2023-12-06 09:19:10   INFO  epoch: 25/72, acc_iter=96800, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:36, time_cost(all): 22:23:55/1 day, 18:56:45, loss=0.486577731897157, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.08(1.03), norm=3.6976883197012453, lr=0.3427400237780906
2023-12-06 09:19:52   INFO  epoch: 25/72, acc_iter=96850, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:52:03, time_cost(all): 22:24:37/1 day, 16:52:08, loss=0.486518534471216, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=2.1703576207668966, lr=0.34262605670272284
2023-12-06 09:20:33   INFO  epoch: 25/72, acc_iter=96900, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:56, time_cost(all): 22:25:18/1 day, 17:13:58, loss=0.486459337045275, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=3.2791148498097042, lr=0.342512089627355
2023-12-06 09:21:15   INFO  epoch: 25/72, acc_iter=96950, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:19, time_cost(all): 22:26:00/1 day, 16:39:18, loss=0.486400139619334, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=3.533029162276384, lr=0.3423981225519872
2023-12-06 09:21:57   INFO  epoch: 25/72, acc_iter=97000, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:02, time_cost(all): 22:26:42/1 day, 16:46:14, loss=0.486340942193393, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=0.5681537853911663, lr=0.34228415547661944
2023-12-06 09:22:39   INFO  epoch: 25/72, acc_iter=97050, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:06, time_cost(all): 22:27:24/1 day, 16:53:52, loss=0.486281744767452, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=3.9797019145208488, lr=0.34217018840125163
2023-12-06 09:23:20   INFO  epoch: 25/72, acc_iter=97100, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:52, time_cost(all): 22:28:05/1 day, 19:13:09, loss=0.486222547341511, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=4.9895423932675085, lr=0.34205622132588387
2023-12-06 09:24:02   INFO  epoch: 25/72, acc_iter=97150, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:32, time_cost(all): 22:28:47/1 day, 16:19:58, loss=0.48616334991557, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=0.8343273869279553, lr=0.34194225425051605
2023-12-06 09:24:44   INFO  epoch: 25/72, acc_iter=97200, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:04, time_cost(all): 22:29:29/1 day, 16:35:44, loss=0.486104152489629, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=1.5552666670509785, lr=0.34182828717514824
2023-12-06 09:25:26   INFO  epoch: 25/72, acc_iter=97250, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:11, time_cost(all): 22:30:11/1 day, 16:56:40, loss=0.486044955063688, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=2.3452648408740213, lr=0.3417143200997804
2023-12-06 09:26:08   INFO  epoch: 25/72, acc_iter=97300, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:02, time_cost(all): 22:30:53/1 day, 19:01:10, loss=0.485985757637747, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=3.91409866146195, lr=0.34160035302441266
2023-12-06 09:26:49   INFO  epoch: 25/72, acc_iter=97350, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:53, time_cost(all): 22:31:34/1 day, 17:19:29, loss=0.485926560211806, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=3.951831888706064, lr=0.3414863859490449
2023-12-06 09:27:31   INFO  epoch: 25/72, acc_iter=97400, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:36, time_cost(all): 22:32:16/1 day, 16:05:16, loss=0.485867362785865, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=2.426718115471695, lr=0.3413724188736771
2023-12-06 09:28:13   INFO  epoch: 25/72, acc_iter=97450, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:46, time_cost(all): 22:32:58/1 day, 17:01:57, loss=0.485808165359924, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=2.8968324627224993, lr=0.34125845179830927
2023-12-06 09:28:55   INFO  epoch: 25/72, acc_iter=97500, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:42:17, time_cost(all): 22:33:40/1 day, 16:38:37, loss=0.485748967933984, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=4.182277510335676, lr=0.3411444847229415
2023-12-06 09:29:37   INFO  epoch: 25/72, acc_iter=97550, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:55, time_cost(all): 22:34:22/1 day, 19:34:46, loss=0.485689770508043, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=3.0656994752197, lr=0.3410305176475737
2023-12-06 09:30:18   INFO  epoch: 25/72, acc_iter=97600, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:09, time_cost(all): 22:35:03/1 day, 17:14:10, loss=0.485630573082102, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=1.7987646651778424, lr=0.34091655057220593
2023-12-06 09:31:00   INFO  epoch: 25/72, acc_iter=97650, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:54, time_cost(all): 22:35:45/1 day, 16:17:59, loss=0.485571375656161, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=3.0050975601146774, lr=0.3408025834968381
2023-12-06 09:31:42   INFO  epoch: 25/72, acc_iter=97700, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:19, time_cost(all): 22:36:27/1 day, 19:22:49, loss=0.48551217823022, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=3.7503411173515273, lr=0.3406886164214703
2023-12-06 09:32:24   INFO  epoch: 25/72, acc_iter=97750, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:06, time_cost(all): 22:37:09/1 day, 17:51:24, loss=0.485452980804279, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=3.7826786283027523, lr=0.3405746493461025
2023-12-06 09:33:05   INFO  epoch: 25/72, acc_iter=97800, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:42, time_cost(all): 22:37:50/1 day, 19:30:55, loss=0.485393783378338, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=1.1895433554079644, lr=0.3404606822707347
2023-12-06 09:33:47   INFO  epoch: 25/72, acc_iter=97850, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:19, time_cost(all): 22:38:32/1 day, 16:54:27, loss=0.485334585952397, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=4.004094527049524, lr=0.34034671519536697
2023-12-06 09:34:29   INFO  epoch: 25/72, acc_iter=97900, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:46, time_cost(all): 22:39:14/1 day, 17:33:53, loss=0.485275388526456, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=1.8882440883002247, lr=0.34023274811999915
2023-12-06 09:35:11   INFO  epoch: 25/72, acc_iter=97950, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:51, time_cost(all): 22:39:56/1 day, 17:08:06, loss=0.485216191100515, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=3.666080428633209, lr=0.34011878104463134
2023-12-06 09:35:53   INFO  epoch: 25/72, acc_iter=98000, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:54, time_cost(all): 22:40:38/1 day, 16:24:47, loss=0.485156993674574, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=1.9318697685632877, lr=0.3400048139692635
2023-12-06 09:36:34   INFO  epoch: 25/72, acc_iter=98050, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:27, time_cost(all): 22:41:19/1 day, 18:45:26, loss=0.485097796248633, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.6303208714832236, lr=0.33989084689389576
2023-12-06 09:37:16   INFO  epoch: 25/72, acc_iter=98100, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:23, time_cost(all): 22:42:01/1 day, 18:04:32, loss=0.485038598822692, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=2.930796944049382, lr=0.33977687981852794
2023-12-06 09:37:58   INFO  epoch: 25/72, acc_iter=98150, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:48, time_cost(all): 22:42:43/1 day, 17:40:23, loss=0.484979401396751, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=4.28963174786904, lr=0.3396629127431602
2023-12-06 09:38:40   INFO  epoch: 25/72, acc_iter=98200, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:03, time_cost(all): 22:43:25/1 day, 16:03:10, loss=0.48492020397081, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=2.508782525384958, lr=0.33954894566779237
2023-12-06 09:39:21   INFO  epoch: 25/72, acc_iter=98250, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:06, time_cost(all): 22:44:06/1 day, 17:43:18, loss=0.484861006544869, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=2.3068165592760472, lr=0.33943497859242455
2023-12-06 09:40:03   INFO  epoch: 25/72, acc_iter=98300, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:32, time_cost(all): 22:44:48/1 day, 17:22:52, loss=0.484801809118928, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=2.991759795064875, lr=0.33932101151705674
2023-12-06 09:40:45   INFO  epoch: 25/72, acc_iter=98350, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:36, time_cost(all): 22:45:30/1 day, 17:31:19, loss=0.484742611692987, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=4.390336853364676, lr=0.33920704444168903
2023-12-06 09:41:27   INFO  epoch: 25/72, acc_iter=98400, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:21, time_cost(all): 22:46:12/1 day, 16:15:06, loss=0.484683414267047, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=2.3604826307364517, lr=0.3390930773663212
2023-12-06 09:42:09   INFO  epoch: 25/72, acc_iter=98450, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:49, time_cost(all): 22:46:54/1 day, 17:41:23, loss=0.484624216841106, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=4.889063947921889, lr=0.3389791102909534
2023-12-06 09:42:50   INFO  epoch: 25/72, acc_iter=98500, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:36, time_cost(all): 22:47:35/1 day, 18:45:05, loss=0.484565019415165, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=0.9292536259366158, lr=0.3388651432155856
2023-12-06 09:43:32   INFO  epoch: 25/72, acc_iter=98550, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:59, time_cost(all): 22:48:17/1 day, 19:24:30, loss=0.484505821989224, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=2.4772591383349782, lr=0.3387511761402178
2023-12-06 09:44:14   INFO  epoch: 25/72, acc_iter=98600, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:02, time_cost(all): 22:48:59/1 day, 15:47:06, loss=0.484446624563283, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=4.884886260053712, lr=0.33863720906485
2023-12-06 09:44:56   INFO  epoch: 25/72, acc_iter=98650, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:46, time_cost(all): 22:49:41/1 day, 16:41:47, loss=0.484387427137342, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.925299609888805, lr=0.33852324198948225
2023-12-06 09:45:37   INFO  epoch: 25/72, acc_iter=98700, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:04, time_cost(all): 22:50:22/1 day, 16:36:26, loss=0.484328229711401, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=3.6274904389940055, lr=0.33840927491411443
2023-12-06 09:46:19   INFO  epoch: 25/72, acc_iter=98750, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:24, time_cost(all): 22:51:04/1 day, 18:47:10, loss=0.48426903228546, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=0.6165607784443057, lr=0.3382953078387466
2023-12-06 09:47:01   INFO  epoch: 25/72, acc_iter=98800, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:37, time_cost(all): 22:51:46/1 day, 18:48:33, loss=0.484209834859519, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=4.328141958740182, lr=0.3381813407633788
2023-12-06 09:47:43   INFO  epoch: 25/72, acc_iter=98850, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:47, time_cost(all): 22:52:28/1 day, 16:33:33, loss=0.484150637433578, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=0.9265398544499596, lr=0.338067373688011
2023-12-06 09:48:25   INFO  epoch: 25/72, acc_iter=98900, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:34, time_cost(all): 22:53:10/1 day, 17:40:14, loss=0.484091440007637, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=1.145150594790643, lr=0.3379534066126433
2023-12-06 09:49:06   INFO  epoch: 25/72, acc_iter=98950, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:23, time_cost(all): 22:53:51/1 day, 18:38:17, loss=0.484032242581696, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=3.67553144763042, lr=0.33783943953727547
2023-12-06 09:49:48   INFO  epoch: 25/72, acc_iter=99000, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:14, time_cost(all): 22:54:33/1 day, 17:53:01, loss=0.483973045155755, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=3.3107843484940895, lr=0.33772547246190765
2023-12-06 09:50:30   INFO  epoch: 25/72, acc_iter=99050, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:42, time_cost(all): 22:55:15/1 day, 18:59:33, loss=0.483913847729814, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=1.9293586697314666, lr=0.33761150538653983
2023-12-06 09:51:12   INFO  epoch: 25/72, acc_iter=99100, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:09, time_cost(all): 22:55:57/1 day, 17:14:20, loss=0.483854650303873, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=4.85171873769876, lr=0.3374975383111721
2023-12-06 09:51:53   INFO  epoch: 25/72, acc_iter=99150, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:42, time_cost(all): 22:56:38/1 day, 15:53:12, loss=0.483795452877932, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=2.3118692670730656, lr=0.33738357123580426
2023-12-06 09:52:35   INFO  epoch: 25/72, acc_iter=99200, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:10, time_cost(all): 22:57:20/1 day, 17:01:52, loss=0.483736255451992, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=3.5149967455225375, lr=0.3372696041604365
2023-12-06 09:53:17   INFO  epoch: 25/72, acc_iter=99250, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:47, time_cost(all): 22:58:02/1 day, 15:43:33, loss=0.483677058026051, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=4.7821507981173825, lr=0.3371556370850687
2023-12-06 09:53:59   INFO  epoch: 25/72, acc_iter=99300, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:52, time_cost(all): 22:58:44/1 day, 15:42:47, loss=0.48361786060011, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=2.8321276657413805, lr=0.33704167000970087
2023-12-06 09:54:41   INFO  epoch: 25/72, acc_iter=99350, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:25, time_cost(all): 22:59:26/1 day, 19:31:58, loss=0.483558663174169, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.162011355769943, lr=0.33692770293433305
2023-12-06 09:55:22   INFO  epoch: 25/72, acc_iter=99400, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:07, time_cost(all): 23:00:07/1 day, 18:50:31, loss=0.483499465748228, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.6075431776798847, lr=0.33681373585896535
2023-12-06 09:56:04   INFO  epoch: 25/72, acc_iter=99450, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:05, time_cost(all): 23:00:49/1 day, 17:14:47, loss=0.483440268322287, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=4.154604321901401, lr=0.33669976878359753
2023-12-06 09:56:46   INFO  epoch: 25/72, acc_iter=99500, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:49, time_cost(all): 23:01:31/1 day, 18:19:06, loss=0.483381070896346, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=1.3678123325637639, lr=0.3365858017082297
2023-12-06 09:57:28   INFO  epoch: 25/72, acc_iter=99550, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:17, time_cost(all): 23:02:13/1 day, 15:59:43, loss=0.483321873470405, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=1.9282585026906454, lr=0.3364718346328619
2023-12-06 09:58:09   INFO  epoch: 25/72, acc_iter=99600, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:22, time_cost(all): 23:02:54/1 day, 19:12:10, loss=0.483262676044464, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=2.8132258913769297, lr=0.3363578675574941
2023-12-06 09:58:51   INFO  epoch: 25/72, acc_iter=99650, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:58, time_cost(all): 23:03:36/1 day, 15:35:51, loss=0.483203478618523, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=3.443113026534703, lr=0.3362439004821263
2023-12-06 09:59:33   INFO  epoch: 25/72, acc_iter=99700, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:34, time_cost(all): 23:04:18/1 day, 15:57:03, loss=0.483144281192582, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=0.9554032196430655, lr=0.3361299334067585
2023-12-06 10:00:15   INFO  epoch: 25/72, acc_iter=99750, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:50, time_cost(all): 23:05:00/1 day, 15:56:58, loss=0.483085083766641, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=3.261272478377947, lr=0.33601596633139075
2023-12-06 10:00:57   INFO  epoch: 25/72, acc_iter=99800, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:48, time_cost(all): 23:05:42/1 day, 17:40:52, loss=0.4830258863407, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=0.5846069823115321, lr=0.33590199925602293
2023-12-06 10:01:38   INFO  epoch: 25/72, acc_iter=99850, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:59, time_cost(all): 23:06:23/1 day, 19:08:59, loss=0.482966688914759, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=2.3951309116846686, lr=0.3357880321806551
2023-12-06 10:02:20   INFO  epoch: 25/72, acc_iter=99900, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:20, time_cost(all): 23:07:05/1 day, 15:25:59, loss=0.482907491488818, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=3.3300090560613063, lr=0.33567406510528736
2023-12-06 10:03:02   INFO  epoch: 25/72, acc_iter=99950, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:30, time_cost(all): 23:07:47/1 day, 19:15:45, loss=0.482848294062877, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=1.7155433909928226, lr=0.3355600980299196
2023-12-06 10:03:44   INFO  epoch: 25/72, acc_iter=100000, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:43, time_cost(all): 23:08:29/1 day, 17:04:49, loss=0.482789096636936, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=4.933744158598791, lr=0.3354461309545518
2023-12-06 10:04:26   INFO  epoch: 25/72, acc_iter=100050, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:53, time_cost(all): 23:09:11/1 day, 16:40:16, loss=0.482729899210996, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=3.952268732030182, lr=0.33533216387918396
2023-12-06 10:05:07   INFO  epoch: 25/72, acc_iter=100100, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:31, time_cost(all): 23:09:52/1 day, 15:43:05, loss=0.482670701785055, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.23(1.03), norm=4.105809758469309, lr=0.33521819680381615
2023-12-06 10:05:49   INFO  epoch: 25/72, acc_iter=100150, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:38, time_cost(all): 23:10:34/1 day, 17:47:06, loss=0.482611504359114, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=1.2346067292452463, lr=0.3351042297284484
2023-12-06 10:06:31   INFO  epoch: 25/72, acc_iter=100200, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:58, time_cost(all): 23:11:16/1 day, 15:32:27, loss=0.482552306933173, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=2.4545088941985407, lr=0.3349902626530806
2023-12-06 10:07:13   INFO  epoch: 25/72, acc_iter=100250, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:10, time_cost(all): 23:11:58/1 day, 16:02:48, loss=0.482493109507232, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=3.467449484603686, lr=0.3348762955777128
2023-12-06 10:07:54   INFO  epoch: 25/72, acc_iter=100300, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:35, time_cost(all): 23:12:39/1 day, 16:32:11, loss=0.482433912081291, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=3.845953997164321, lr=0.334762328502345
2023-12-06 10:08:36   INFO  epoch: 25/72, acc_iter=100350, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 23:13:21/1 day, 16:22:33, loss=0.48237471465535, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=0.6476290907494141, lr=0.3346483614269772
2023-12-06 10:09:18   INFO  epoch: 25/72, acc_iter=100400, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 23:14:03/1 day, 15:16:22, loss=0.482315517229409, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=4.100409668758113, lr=0.33453439435160937
2023-12-06 10:10:00   INFO  epoch: 26/72, acc_iter=100462, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:20, time_cost(all): 23:14:45/1 day, 16:47:10, loss=0.482242112421242, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=2.340003592777685, lr=0.3343930751781533
2023-12-06 10:10:42   INFO  epoch: 26/72, acc_iter=100512, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:52, time_cost(all): 23:15:27/1 day, 18:30:47, loss=0.482182914995301, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=1.2455913987764216, lr=0.33427910810278555
2023-12-06 10:11:23   INFO  epoch: 26/72, acc_iter=100562, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:26, time_cost(all): 23:16:08/1 day, 16:35:34, loss=0.48212371756936, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=1.192368170590692, lr=0.3341651410274178
2023-12-06 10:12:05   INFO  epoch: 26/72, acc_iter=100612, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:56, time_cost(all): 23:16:50/1 day, 18:48:00, loss=0.482064520143419, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=3.352327188567793, lr=0.33405117395205
2023-12-06 10:12:47   INFO  epoch: 26/72, acc_iter=100662, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:07, time_cost(all): 23:17:32/1 day, 17:14:17, loss=0.482005322717478, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=2.9209310256551024, lr=0.33393720687668216
2023-12-06 10:13:29   INFO  epoch: 26/72, acc_iter=100712, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:29, time_cost(all): 23:18:14/1 day, 19:05:46, loss=0.481946125291537, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=4.541171546525919, lr=0.33382323980131434
2023-12-06 10:14:10   INFO  epoch: 26/72, acc_iter=100762, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:00, time_cost(all): 23:18:55/1 day, 18:06:06, loss=0.481886927865597, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=3.7191333235603485, lr=0.3337092727259466
2023-12-06 10:14:52   INFO  epoch: 26/72, acc_iter=100812, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:39, time_cost(all): 23:19:37/1 day, 18:33:02, loss=0.481827730439656, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=4.729294518287233, lr=0.33359530565057877
2023-12-06 10:15:34   INFO  epoch: 26/72, acc_iter=100862, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:43, time_cost(all): 23:20:19/1 day, 15:18:07, loss=0.481768533013715, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=2.9367222975798715, lr=0.333481338575211
2023-12-06 10:16:16   INFO  epoch: 26/72, acc_iter=100912, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:54, time_cost(all): 23:21:01/1 day, 15:24:03, loss=0.481709335587774, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=2.7208265661862656, lr=0.3333673714998432
2023-12-06 10:16:58   INFO  epoch: 26/72, acc_iter=100962, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:28, time_cost(all): 23:21:43/1 day, 16:03:57, loss=0.481650138161833, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=4.267245044738449, lr=0.3332534044244754
2023-12-06 10:17:39   INFO  epoch: 26/72, acc_iter=101012, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:35, time_cost(all): 23:22:24/1 day, 15:42:22, loss=0.481590940735892, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=2.9385257576478945, lr=0.3331394373491076
2023-12-06 10:18:21   INFO  epoch: 26/72, acc_iter=101062, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:15, time_cost(all): 23:23:06/1 day, 15:47:26, loss=0.481531743309951, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.19(1.03), norm=1.7661927750848592, lr=0.3330254702737398
2023-12-06 10:19:03   INFO  epoch: 26/72, acc_iter=101112, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:55, time_cost(all): 23:23:48/1 day, 17:53:58, loss=0.48147254588401, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=1.169058485104795, lr=0.33291150319837204
2023-12-06 10:19:45   INFO  epoch: 26/72, acc_iter=101162, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:21, time_cost(all): 23:24:30/1 day, 17:38:24, loss=0.481413348458069, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=4.475691417142189, lr=0.3327975361230042
2023-12-06 10:20:26   INFO  epoch: 26/72, acc_iter=101212, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:25, time_cost(all): 23:25:11/1 day, 18:30:51, loss=0.481354151032128, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=3.3862271826358668, lr=0.3326835690476364
2023-12-06 10:21:08   INFO  epoch: 26/72, acc_iter=101262, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:09, time_cost(all): 23:25:53/1 day, 17:25:47, loss=0.481294953606187, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=2.46729118910411, lr=0.3325696019722686
2023-12-06 10:21:50   INFO  epoch: 26/72, acc_iter=101312, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:38, time_cost(all): 23:26:35/1 day, 15:11:29, loss=0.481235756180246, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=3.7760577516990157, lr=0.33245563489690083
2023-12-06 10:22:32   INFO  epoch: 26/72, acc_iter=101362, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:42:00, time_cost(all): 23:27:17/1 day, 15:27:35, loss=0.481176558754305, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=3.9244058941295124, lr=0.33234166782153307
2023-12-06 10:23:14   INFO  epoch: 26/72, acc_iter=101412, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:58, time_cost(all): 23:27:59/1 day, 17:03:12, loss=0.481117361328364, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=3.8848871124266604, lr=0.33222770074616526
2023-12-06 10:23:55   INFO  epoch: 26/72, acc_iter=101462, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:48, time_cost(all): 23:28:40/1 day, 17:24:29, loss=0.481058163902423, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=0.9822587370669857, lr=0.33211373367079744
2023-12-06 10:24:37   INFO  epoch: 26/72, acc_iter=101512, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:26, time_cost(all): 23:29:22/1 day, 17:03:52, loss=0.480998966476482, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=3.6834295314371577, lr=0.3319997665954297
2023-12-06 10:25:19   INFO  epoch: 26/72, acc_iter=101562, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:22, time_cost(all): 23:30:04/1 day, 18:39:11, loss=0.480939769050541, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=2.1859306857610274, lr=0.33188579952006186
2023-12-06 10:26:01   INFO  epoch: 26/72, acc_iter=101612, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:12, time_cost(all): 23:30:46/1 day, 16:37:04, loss=0.480880571624601, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=1.0643967772592722, lr=0.3317718324446941
2023-12-06 10:26:42   INFO  epoch: 26/72, acc_iter=101662, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:55, time_cost(all): 23:31:27/1 day, 16:23:08, loss=0.48082137419866, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=1.0318349200492718, lr=0.3316578653693263
2023-12-06 10:27:24   INFO  epoch: 26/72, acc_iter=101712, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:37, time_cost(all): 23:32:09/1 day, 17:18:17, loss=0.480762176772719, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=0.6352535182822513, lr=0.33154389829395847
2023-12-06 10:28:06   INFO  epoch: 26/72, acc_iter=101762, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:38, time_cost(all): 23:32:51/1 day, 17:28:03, loss=0.480702979346778, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=3.3102496948341162, lr=0.33142993121859066
2023-12-06 10:28:48   INFO  epoch: 26/72, acc_iter=101812, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:11, time_cost(all): 23:33:33/1 day, 18:37:21, loss=0.480643781920837, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=1.1263207722377682, lr=0.33131596414322284
2023-12-06 10:29:30   INFO  epoch: 26/72, acc_iter=101862, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:38, time_cost(all): 23:34:15/1 day, 18:37:10, loss=0.480584584494896, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=3.164878354120263, lr=0.33120199706785514
2023-12-06 10:30:11   INFO  epoch: 26/72, acc_iter=101912, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:01, time_cost(all): 23:34:56/1 day, 18:44:09, loss=0.480525387068955, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.7019737789618867, lr=0.3310880299924873
2023-12-06 10:30:53   INFO  epoch: 26/72, acc_iter=101962, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:30, time_cost(all): 23:35:38/1 day, 16:33:34, loss=0.480466189643014, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=3.4488269940220118, lr=0.3309740629171195
2023-12-06 10:31:35   INFO  epoch: 26/72, acc_iter=102012, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:33:01, time_cost(all): 23:36:20/1 day, 17:05:45, loss=0.480406992217073, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=3.7659693587845817, lr=0.3308600958417517
2023-12-06 10:32:17   INFO  epoch: 26/72, acc_iter=102062, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:51, time_cost(all): 23:37:02/1 day, 16:55:30, loss=0.480347794791132, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=1.1278531535410392, lr=0.33074612876638393
2023-12-06 10:32:58   INFO  epoch: 26/72, acc_iter=102112, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:40, time_cost(all): 23:37:43/1 day, 18:13:02, loss=0.480288597365191, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=1.2650519452215607, lr=0.3306321616910161
2023-12-06 10:33:40   INFO  epoch: 26/72, acc_iter=102162, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:51, time_cost(all): 23:38:25/1 day, 16:59:00, loss=0.48022939993925, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=3.163715062439895, lr=0.33051819461564835
2023-12-06 10:34:22   INFO  epoch: 26/72, acc_iter=102212, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:26, time_cost(all): 23:39:07/1 day, 15:56:16, loss=0.480170202513309, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.3388448405049176, lr=0.33040422754028054
2023-12-06 10:35:04   INFO  epoch: 26/72, acc_iter=102262, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:31, time_cost(all): 23:39:49/1 day, 15:55:17, loss=0.480111005087368, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=4.700506666677228, lr=0.3302902604649127
2023-12-06 10:35:46   INFO  epoch: 26/72, acc_iter=102312, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:23, time_cost(all): 23:40:31/1 day, 15:55:13, loss=0.480051807661427, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=2.677354240967544, lr=0.3301762933895449
2023-12-06 10:36:27   INFO  epoch: 26/72, acc_iter=102362, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:20, time_cost(all): 23:41:12/1 day, 15:46:40, loss=0.479992610235486, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=1.124909140468887, lr=0.3300623263141772
2023-12-06 10:37:09   INFO  epoch: 26/72, acc_iter=102412, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:38, time_cost(all): 23:41:54/1 day, 18:18:24, loss=0.479933412809545, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=3.3394512632661675, lr=0.3299483592388094
2023-12-06 10:37:51   INFO  epoch: 26/72, acc_iter=102462, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:44, time_cost(all): 23:42:36/1 day, 18:01:14, loss=0.479874215383605, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.83(1.03), norm=4.760261186697887, lr=0.32983439216344157
2023-12-06 10:38:33   INFO  epoch: 26/72, acc_iter=102512, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:36, time_cost(all): 23:43:18/1 day, 16:52:45, loss=0.479815017957664, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=2.9690038525429285, lr=0.32972042508807375
2023-12-06 10:39:14   INFO  epoch: 26/72, acc_iter=102562, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:04, time_cost(all): 23:43:59/1 day, 17:50:21, loss=0.479755820531723, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=2.5270836698512142, lr=0.32960645801270594
2023-12-06 10:39:56   INFO  epoch: 26/72, acc_iter=102612, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:24:02, time_cost(all): 23:44:41/1 day, 16:40:43, loss=0.479696623105782, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=3.229685687096655, lr=0.3294924909373382
2023-12-06 10:40:38   INFO  epoch: 26/72, acc_iter=102662, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:22, time_cost(all): 23:45:23/1 day, 18:26:13, loss=0.479637425679841, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=2.8943399234389404, lr=0.32937852386197036
2023-12-06 10:41:20   INFO  epoch: 26/72, acc_iter=102712, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:43, time_cost(all): 23:46:05/1 day, 15:22:33, loss=0.4795782282539, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=4.111853481064398, lr=0.3292645567866026
2023-12-06 10:42:02   INFO  epoch: 26/72, acc_iter=102762, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:12, time_cost(all): 23:46:47/1 day, 15:15:29, loss=0.479519030827959, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=3.2986861637353124, lr=0.3291505897112348
2023-12-06 10:42:43   INFO  epoch: 26/72, acc_iter=102812, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:49, time_cost(all): 23:47:28/1 day, 17:55:15, loss=0.479459833402018, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=2.4841083130898296, lr=0.32903662263586697
2023-12-06 10:43:25   INFO  epoch: 26/72, acc_iter=102862, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:47, time_cost(all): 23:48:10/1 day, 17:36:44, loss=0.479400635976077, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=0.5661862917164082, lr=0.32892265556049916
2023-12-06 10:44:07   INFO  epoch: 26/72, acc_iter=102912, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:28, time_cost(all): 23:48:52/1 day, 15:57:29, loss=0.479341438550136, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=0.6451260428982657, lr=0.32880868848513145
2023-12-06 10:44:49   INFO  epoch: 26/72, acc_iter=102962, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:30, time_cost(all): 23:49:34/1 day, 18:37:49, loss=0.479282241124195, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=2.5775100612418256, lr=0.32869472140976363
2023-12-06 10:45:31   INFO  epoch: 26/72, acc_iter=103012, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:03, time_cost(all): 23:50:16/1 day, 17:50:35, loss=0.479223043698254, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=1.901578640284872, lr=0.3285807543343958
2023-12-06 10:46:12   INFO  epoch: 26/72, acc_iter=103062, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:34, time_cost(all): 23:50:57/1 day, 15:00:11, loss=0.479163846272313, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=2.483771335498294, lr=0.328466787259028
2023-12-06 10:46:54   INFO  epoch: 26/72, acc_iter=103112, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:27, time_cost(all): 23:51:39/1 day, 14:54:04, loss=0.479104648846372, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=2.3619247814381765, lr=0.32835282018366024
2023-12-06 10:47:36   INFO  epoch: 26/72, acc_iter=103162, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:08, time_cost(all): 23:52:21/1 day, 18:27:21, loss=0.479045451420431, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=2.0090322223395174, lr=0.3282388531082924
2023-12-06 10:48:18   INFO  epoch: 26/72, acc_iter=103212, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:12, time_cost(all): 23:53:03/1 day, 17:42:52, loss=0.47898625399449, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=0.829158302671394, lr=0.32812488603292467
2023-12-06 10:48:59   INFO  epoch: 26/72, acc_iter=103262, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:07, time_cost(all): 23:53:44/1 day, 16:13:55, loss=0.478927056568549, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=3.501161583387942, lr=0.32801091895755685
2023-12-06 10:49:41   INFO  epoch: 26/72, acc_iter=103312, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:11, time_cost(all): 23:54:26/1 day, 17:27:37, loss=0.478867859142609, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=3.3810197899910355, lr=0.32789695188218904
2023-12-06 10:50:23   INFO  epoch: 26/72, acc_iter=103362, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:32, time_cost(all): 23:55:08/1 day, 16:55:05, loss=0.478808661716668, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=2.8543489024974797, lr=0.3277829848068212
2023-12-06 10:51:05   INFO  epoch: 26/72, acc_iter=103412, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:02, time_cost(all): 23:55:50/1 day, 16:10:35, loss=0.478749464290727, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=4.648776144332155, lr=0.32766901773145346
2023-12-06 10:51:47   INFO  epoch: 26/72, acc_iter=103462, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:46, time_cost(all): 23:56:32/1 day, 17:55:08, loss=0.478690266864786, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=4.557125088588911, lr=0.3275550506560857
2023-12-06 10:52:28   INFO  epoch: 26/72, acc_iter=103512, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:59, time_cost(all): 23:57:13/1 day, 15:21:09, loss=0.478631069438845, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=1.5687171719858966, lr=0.3274410835807179
2023-12-06 10:53:10   INFO  epoch: 26/72, acc_iter=103562, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:40, time_cost(all): 23:57:55/1 day, 17:26:44, loss=0.478571872012904, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.83(1.03), norm=1.5886887544418233, lr=0.32732711650535007
2023-12-06 10:53:52   INFO  epoch: 26/72, acc_iter=103612, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:56, time_cost(all): 23:58:37/1 day, 14:57:44, loss=0.478512674586963, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=4.82006188701797, lr=0.32721314942998225
2023-12-06 10:54:34   INFO  epoch: 26/72, acc_iter=103662, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:28, time_cost(all): 23:59:19/1 day, 15:30:52, loss=0.478453477161022, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=3.8154182087387736, lr=0.3270991823546145
2023-12-06 10:55:15   INFO  epoch: 26/72, acc_iter=103712, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:44, time_cost(all): 1 day, 0:00:00/1 day, 14:43:44, loss=0.478394279735081, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.8115362016251573, lr=0.3269852152792467
2023-12-06 10:55:57   INFO  epoch: 26/72, acc_iter=103762, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:01, time_cost(all): 1 day, 0:00:42/1 day, 15:56:28, loss=0.47833508230914, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=0.5602363211802377, lr=0.3268712482038789
2023-12-06 10:56:39   INFO  epoch: 26/72, acc_iter=103812, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:10, time_cost(all): 1 day, 0:01:24/1 day, 18:23:07, loss=0.478275884883199, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=3.604099876163391, lr=0.3267572811285111
2023-12-06 10:57:21   INFO  epoch: 26/72, acc_iter=103862, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:27, time_cost(all): 1 day, 0:02:06/1 day, 18:12:51, loss=0.478216687457258, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=4.598418429586601, lr=0.3266433140531433
2023-12-06 10:58:03   INFO  epoch: 26/72, acc_iter=103912, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:06, time_cost(all): 1 day, 0:02:48/1 day, 15:50:17, loss=0.478157490031317, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=1.2685798470107135, lr=0.3265293469777755
2023-12-06 10:58:44   INFO  epoch: 26/72, acc_iter=103962, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:25, time_cost(all): 1 day, 0:03:29/1 day, 15:37:45, loss=0.478098292605376, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=3.2508827619323197, lr=0.32641537990240777
2023-12-06 10:59:26   INFO  epoch: 26/72, acc_iter=104012, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:44, time_cost(all): 1 day, 0:04:11/1 day, 15:25:31, loss=0.478039095179435, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=3.58885735143805, lr=0.32630141282703995
2023-12-06 11:00:08   INFO  epoch: 26/72, acc_iter=104062, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:03, time_cost(all): 1 day, 0:04:53/1 day, 18:26:36, loss=0.477979897753494, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=1.3945285218535182, lr=0.32618744575167213
2023-12-06 11:00:50   INFO  epoch: 26/72, acc_iter=104112, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:18, time_cost(all): 1 day, 0:05:35/1 day, 14:44:47, loss=0.477920700327553, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.21(1.03), norm=4.076106812860713, lr=0.3260734786763043
2023-12-06 11:01:31   INFO  epoch: 26/72, acc_iter=104162, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 1 day, 0:06:16/1 day, 17:47:41, loss=0.477861502901613, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=4.669469524098506, lr=0.32595951160093656
2023-12-06 11:02:13   INFO  epoch: 26/72, acc_iter=104212, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 1 day, 0:06:58/1 day, 16:48:58, loss=0.477802305475672, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=3.7350185337463997, lr=0.32584554452556874
2023-12-06 11:02:55   INFO  epoch: 26/72, acc_iter=104262, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 0:07:40/1 day, 14:40:58, loss=0.477743108049731, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.008174844064953, lr=0.325731577450201
2023-12-06 11:03:37   INFO  epoch: 27/72, acc_iter=104324, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:40, time_cost(all): 1 day, 0:08:22/1 day, 14:33:45, loss=0.477669703241564, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=2.8694238477641028, lr=0.3255902582767449
2023-12-06 11:04:19   INFO  epoch: 27/72, acc_iter=104374, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:28, time_cost(all): 1 day, 0:09:04/1 day, 15:34:16, loss=0.477610505815623, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=2.6343239987469236, lr=0.3254762912013771
2023-12-06 11:05:00   INFO  epoch: 27/72, acc_iter=104424, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:00, time_cost(all): 1 day, 0:09:45/1 day, 14:53:25, loss=0.477551308389682, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=1.7797041382577587, lr=0.3253623241260093
2023-12-06 11:05:42   INFO  epoch: 27/72, acc_iter=104474, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:57, time_cost(all): 1 day, 0:10:27/1 day, 17:51:22, loss=0.477492110963741, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=1.7077934514804942, lr=0.3252483570506415
2023-12-06 11:06:24   INFO  epoch: 27/72, acc_iter=104524, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:14, time_cost(all): 1 day, 0:11:09/1 day, 16:48:46, loss=0.4774329135378, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=3.1686261377202545, lr=0.3251343899752737
2023-12-06 11:07:06   INFO  epoch: 27/72, acc_iter=104574, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:34, time_cost(all): 1 day, 0:11:51/1 day, 18:17:22, loss=0.477373716111859, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=1.589722294659113, lr=0.32502042289990596
2023-12-06 11:07:47   INFO  epoch: 27/72, acc_iter=104624, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:51, time_cost(all): 1 day, 0:12:32/1 day, 14:23:05, loss=0.477314518685918, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=1.193391914173625, lr=0.32490645582453814
2023-12-06 11:08:29   INFO  epoch: 27/72, acc_iter=104674, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:22, time_cost(all): 1 day, 0:13:14/1 day, 14:41:15, loss=0.477255321259977, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=4.192041696998672, lr=0.3247924887491703
2023-12-06 11:09:11   INFO  epoch: 27/72, acc_iter=104724, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:36, time_cost(all): 1 day, 0:13:56/1 day, 15:53:31, loss=0.477196123834036, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=4.773276825178042, lr=0.3246785216738025
2023-12-06 11:09:53   INFO  epoch: 27/72, acc_iter=104774, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:41, time_cost(all): 1 day, 0:14:38/1 day, 14:54:22, loss=0.477136926408095, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=3.972680450058235, lr=0.3245645545984347
2023-12-06 11:10:35   INFO  epoch: 27/72, acc_iter=104824, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:14, time_cost(all): 1 day, 0:15:20/1 day, 17:06:43, loss=0.477077728982154, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.83(1.03), norm=2.6392794051238933, lr=0.32445058752306694
2023-12-06 11:11:16   INFO  epoch: 27/72, acc_iter=104874, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:42, time_cost(all): 1 day, 0:16:01/1 day, 14:40:06, loss=0.477018531556214, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=4.17845971909736, lr=0.3243366204476992
2023-12-06 11:11:58   INFO  epoch: 27/72, acc_iter=104924, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:58, time_cost(all): 1 day, 0:16:43/1 day, 15:11:25, loss=0.476959334130273, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=2.9526035067081895, lr=0.32422265337233136
2023-12-06 11:12:40   INFO  epoch: 27/72, acc_iter=104974, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:11, time_cost(all): 1 day, 0:17:25/1 day, 16:21:02, loss=0.476900136704332, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=0.7540515363694154, lr=0.32410868629696354
2023-12-06 11:13:22   INFO  epoch: 27/72, acc_iter=105024, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:12, time_cost(all): 1 day, 0:18:07/1 day, 14:45:48, loss=0.476840939278391, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=2.5187320895798773, lr=0.3239947192215958
2023-12-06 11:14:03   INFO  epoch: 27/72, acc_iter=105074, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:29, time_cost(all): 1 day, 0:18:48/1 day, 17:34:21, loss=0.47678174185245, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=1.7794200190343001, lr=0.32388075214622797
2023-12-06 11:14:45   INFO  epoch: 27/72, acc_iter=105124, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:56, time_cost(all): 1 day, 0:19:30/1 day, 15:37:45, loss=0.476722544426509, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=1.935928196315313, lr=0.3237667850708602
2023-12-06 11:15:27   INFO  epoch: 27/72, acc_iter=105174, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:16, time_cost(all): 1 day, 0:20:12/1 day, 17:51:26, loss=0.476663347000568, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=4.024122454038345, lr=0.3236528179954924
2023-12-06 11:16:09   INFO  epoch: 27/72, acc_iter=105224, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:59, time_cost(all): 1 day, 0:20:54/1 day, 14:54:15, loss=0.476604149574627, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=1.8928734202046646, lr=0.3235388509201246
2023-12-06 11:16:51   INFO  epoch: 27/72, acc_iter=105274, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:48, time_cost(all): 1 day, 0:21:36/1 day, 14:34:18, loss=0.476544952148686, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=3.532173367812999, lr=0.32342488384475676
2023-12-06 11:17:32   INFO  epoch: 27/72, acc_iter=105324, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:05, time_cost(all): 1 day, 0:22:17/1 day, 14:15:00, loss=0.476485754722745, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=3.6162208912398466, lr=0.323310916769389
2023-12-06 11:18:14   INFO  epoch: 27/72, acc_iter=105374, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:51, time_cost(all): 1 day, 0:22:59/1 day, 14:11:00, loss=0.476426557296804, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=1.9173250790311074, lr=0.32319694969402124
2023-12-06 11:18:56   INFO  epoch: 27/72, acc_iter=105424, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:59, time_cost(all): 1 day, 0:23:41/1 day, 14:19:09, loss=0.476367359870863, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=1.7680256051876126, lr=0.3230829826186534
2023-12-06 11:19:38   INFO  epoch: 27/72, acc_iter=105474, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:51, time_cost(all): 1 day, 0:24:23/1 day, 16:13:48, loss=0.476308162444922, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=3.3038267477129697, lr=0.3229690155432856
2023-12-06 11:20:20   INFO  epoch: 27/72, acc_iter=105524, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:51, time_cost(all): 1 day, 0:25:05/1 day, 15:22:24, loss=0.476248965018981, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=1.113714311177856, lr=0.3228550484679178
2023-12-06 11:21:01   INFO  epoch: 27/72, acc_iter=105574, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:20, time_cost(all): 1 day, 0:25:46/1 day, 16:23:43, loss=0.47618976759304, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=1.0876062018860353, lr=0.32274108139255003
2023-12-06 11:21:43   INFO  epoch: 27/72, acc_iter=105624, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:01, time_cost(all): 1 day, 0:26:28/1 day, 16:05:35, loss=0.476130570167099, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.02(1.03), norm=3.8274342051231023, lr=0.3226271143171822
2023-12-06 11:22:25   INFO  epoch: 27/72, acc_iter=105674, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:27, time_cost(all): 1 day, 0:27:10/1 day, 17:35:22, loss=0.476071372741158, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.86(1.03), norm=1.2417359147289317, lr=0.32251314724181446
2023-12-06 11:23:07   INFO  epoch: 27/72, acc_iter=105724, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:49, time_cost(all): 1 day, 0:27:52/1 day, 17:33:53, loss=0.476012175315218, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=3.238959346963762, lr=0.32239918016644664
2023-12-06 11:23:48   INFO  epoch: 27/72, acc_iter=105774, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:14, time_cost(all): 1 day, 0:28:33/1 day, 17:04:36, loss=0.475952977889277, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=1.5738182369251639, lr=0.3222852130910788
2023-12-06 11:24:30   INFO  epoch: 27/72, acc_iter=105824, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:06, time_cost(all): 1 day, 0:29:15/1 day, 16:15:46, loss=0.475893780463336, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=2.607152127744964, lr=0.322171246015711
2023-12-06 11:25:12   INFO  epoch: 27/72, acc_iter=105874, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:14, time_cost(all): 1 day, 0:29:57/1 day, 17:31:18, loss=0.475834583037395, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=3.2517079453541564, lr=0.3220572789403433
2023-12-06 11:25:54   INFO  epoch: 27/72, acc_iter=105924, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:46, time_cost(all): 1 day, 0:30:39/1 day, 14:16:54, loss=0.475775385611454, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=1.091736110224378, lr=0.3219433118649755
2023-12-06 11:26:36   INFO  epoch: 27/72, acc_iter=105974, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:01, time_cost(all): 1 day, 0:31:21/1 day, 17:42:54, loss=0.475716188185513, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=0.9848314452389197, lr=0.3218293447896077
2023-12-06 11:27:17   INFO  epoch: 27/72, acc_iter=106024, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:37, time_cost(all): 1 day, 0:32:02/1 day, 15:38:51, loss=0.475656990759572, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=2.511656298667597, lr=0.32171537771423986
2023-12-06 11:27:59   INFO  epoch: 27/72, acc_iter=106074, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:35, time_cost(all): 1 day, 0:32:44/1 day, 16:58:03, loss=0.475597793333631, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.4706587176541994, lr=0.3216014106388721
2023-12-06 11:28:41   INFO  epoch: 27/72, acc_iter=106124, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:12, time_cost(all): 1 day, 0:33:26/1 day, 14:40:30, loss=0.47553859590769, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=0.8118241768733416, lr=0.3214874435635043
2023-12-06 11:29:23   INFO  epoch: 27/72, acc_iter=106174, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:03, time_cost(all): 1 day, 0:34:08/1 day, 14:32:09, loss=0.475479398481749, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=0.9076747530032109, lr=0.3213734764881365
2023-12-06 11:30:04   INFO  epoch: 27/72, acc_iter=106224, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:45, time_cost(all): 1 day, 0:34:49/1 day, 15:08:16, loss=0.475420201055808, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=4.098572838717687, lr=0.3212595094127687
2023-12-06 11:30:46   INFO  epoch: 27/72, acc_iter=106274, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:16, time_cost(all): 1 day, 0:35:31/1 day, 15:57:37, loss=0.475361003629867, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=2.2795651637576952, lr=0.3211455423374009
2023-12-06 11:31:28   INFO  epoch: 27/72, acc_iter=106324, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:26, time_cost(all): 1 day, 0:36:13/1 day, 14:38:48, loss=0.475301806203926, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=3.3428287232164138, lr=0.3210315752620331
2023-12-06 11:32:10   INFO  epoch: 27/72, acc_iter=106374, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:32, time_cost(all): 1 day, 0:36:55/1 day, 15:36:32, loss=0.475242608777985, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=3.1248817451473183, lr=0.3209176081866653
2023-12-06 11:32:52   INFO  epoch: 27/72, acc_iter=106424, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:56, time_cost(all): 1 day, 0:37:37/1 day, 15:27:27, loss=0.475183411352044, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=2.2094588873186867, lr=0.32080364111129755
2023-12-06 11:33:33   INFO  epoch: 27/72, acc_iter=106474, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:48, time_cost(all): 1 day, 0:38:18/1 day, 14:46:48, loss=0.475124213926103, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=2.6787218335651173, lr=0.32068967403592974
2023-12-06 11:34:15   INFO  epoch: 27/72, acc_iter=106524, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:58, time_cost(all): 1 day, 0:39:00/1 day, 14:19:07, loss=0.475065016500162, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=3.05851997977571, lr=0.3205757069605619
2023-12-06 11:34:57   INFO  epoch: 27/72, acc_iter=106574, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:11, time_cost(all): 1 day, 0:39:42/1 day, 14:04:09, loss=0.475005819074222, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=1.856488710905258, lr=0.3204617398851941
2023-12-06 11:35:39   INFO  epoch: 27/72, acc_iter=106624, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:07, time_cost(all): 1 day, 0:40:24/1 day, 16:07:42, loss=0.474946621648281, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=2.9038856489472487, lr=0.32034777280982635
2023-12-06 11:36:20   INFO  epoch: 27/72, acc_iter=106674, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:18, time_cost(all): 1 day, 0:41:05/1 day, 16:40:43, loss=0.47488742422234, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=4.836950026603045, lr=0.32023380573445853
2023-12-06 11:37:02   INFO  epoch: 27/72, acc_iter=106724, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:00, time_cost(all): 1 day, 0:41:47/1 day, 17:17:59, loss=0.474828226796399, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=0.6958660594598185, lr=0.32011983865909077
2023-12-06 11:37:44   INFO  epoch: 27/72, acc_iter=106774, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:01, time_cost(all): 1 day, 0:42:29/1 day, 15:05:29, loss=0.474769029370458, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=1.717975580630147, lr=0.32000587158372296
2023-12-06 11:38:26   INFO  epoch: 27/72, acc_iter=106824, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:52, time_cost(all): 1 day, 0:43:11/1 day, 14:28:08, loss=0.474709831944517, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=2.4272417488211078, lr=0.31989190450835514
2023-12-06 11:39:08   INFO  epoch: 27/72, acc_iter=106874, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:38, time_cost(all): 1 day, 0:43:53/1 day, 14:29:30, loss=0.474650634518576, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=2.5053542117711896, lr=0.3197779374329873
2023-12-06 11:39:49   INFO  epoch: 27/72, acc_iter=106924, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:07, time_cost(all): 1 day, 0:44:34/1 day, 17:36:09, loss=0.474591437092635, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=1.360987428757604, lr=0.3196639703576196
2023-12-06 11:40:31   INFO  epoch: 27/72, acc_iter=106974, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:28, time_cost(all): 1 day, 0:45:16/1 day, 14:31:29, loss=0.474532239666694, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=1.3238317474532963, lr=0.3195500032822518
2023-12-06 11:41:13   INFO  epoch: 27/72, acc_iter=107024, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:47, time_cost(all): 1 day, 0:45:58/1 day, 15:05:26, loss=0.474473042240753, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=1.3006259915070708, lr=0.319436036206884
2023-12-06 11:41:55   INFO  epoch: 27/72, acc_iter=107074, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:15, time_cost(all): 1 day, 0:46:40/1 day, 15:47:44, loss=0.474413844814812, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=2.7840979440103633, lr=0.3193220691315162
2023-12-06 11:42:36   INFO  epoch: 27/72, acc_iter=107124, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:35, time_cost(all): 1 day, 0:47:21/1 day, 15:09:11, loss=0.474354647388871, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=2.728952378127722, lr=0.3192081020561484
2023-12-06 11:43:18   INFO  epoch: 27/72, acc_iter=107174, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:21, time_cost(all): 1 day, 0:48:03/1 day, 13:51:41, loss=0.47429544996293, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.667387814711368, lr=0.3190941349807806
2023-12-06 11:44:00   INFO  epoch: 27/72, acc_iter=107224, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:09, time_cost(all): 1 day, 0:48:45/1 day, 16:58:02, loss=0.474236252536989, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.333475223596274, lr=0.31898016790541284
2023-12-06 11:44:42   INFO  epoch: 27/72, acc_iter=107274, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:07, time_cost(all): 1 day, 0:49:27/1 day, 17:26:19, loss=0.474177055111048, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=2.9046030021241234, lr=0.318866200830045
2023-12-06 11:45:24   INFO  epoch: 27/72, acc_iter=107324, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:00, time_cost(all): 1 day, 0:50:09/1 day, 17:40:28, loss=0.474117857685107, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=4.048803032425926, lr=0.3187522337546772
2023-12-06 11:46:05   INFO  epoch: 27/72, acc_iter=107374, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:54, time_cost(all): 1 day, 0:50:50/1 day, 14:13:57, loss=0.474058660259166, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.7938270881680465, lr=0.3186382666793094
2023-12-06 11:46:47   INFO  epoch: 27/72, acc_iter=107424, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:27, time_cost(all): 1 day, 0:51:32/1 day, 17:12:33, loss=0.473999462833226, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=1.7119230390090339, lr=0.31852429960394163
2023-12-06 11:47:29   INFO  epoch: 27/72, acc_iter=107474, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:02, time_cost(all): 1 day, 0:52:14/1 day, 15:41:46, loss=0.473940265407285, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=3.6813193855227064, lr=0.31841033252857387
2023-12-06 11:48:11   INFO  epoch: 27/72, acc_iter=107524, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:17, time_cost(all): 1 day, 0:52:56/1 day, 16:13:29, loss=0.473881067981344, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=3.008302590856184, lr=0.31829636545320605
2023-12-06 11:48:52   INFO  epoch: 27/72, acc_iter=107574, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:04, time_cost(all): 1 day, 0:53:37/1 day, 13:53:51, loss=0.473821870555403, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=0.9202325903405693, lr=0.31818239837783824
2023-12-06 11:49:34   INFO  epoch: 27/72, acc_iter=107624, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:22, time_cost(all): 1 day, 0:54:19/1 day, 14:10:45, loss=0.473762673129462, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=0.5954385965936451, lr=0.3180684313024704
2023-12-06 11:50:16   INFO  epoch: 27/72, acc_iter=107674, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:23, time_cost(all): 1 day, 0:55:01/1 day, 15:59:01, loss=0.473703475703521, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=2.4601587245771217, lr=0.31795446422710266
2023-12-06 11:50:58   INFO  epoch: 27/72, acc_iter=107724, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:46, time_cost(all): 1 day, 0:55:43/1 day, 15:52:24, loss=0.47364427827758, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=4.760766172457179, lr=0.31784049715173485
2023-12-06 11:51:40   INFO  epoch: 27/72, acc_iter=107774, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:48, time_cost(all): 1 day, 0:56:25/1 day, 13:38:06, loss=0.473585080851639, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=0.8741440471696894, lr=0.3177265300763671
2023-12-06 11:52:21   INFO  epoch: 27/72, acc_iter=107824, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:27, time_cost(all): 1 day, 0:57:06/1 day, 14:32:24, loss=0.473525883425698, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=2.558930443809641, lr=0.31761256300099927
2023-12-06 11:53:03   INFO  epoch: 27/72, acc_iter=107874, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:34, time_cost(all): 1 day, 0:57:48/1 day, 15:34:05, loss=0.473466685999757, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=4.718668414144348, lr=0.31749859592563145
2023-12-06 11:53:45   INFO  epoch: 27/72, acc_iter=107924, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:00, time_cost(all): 1 day, 0:58:30/1 day, 13:40:36, loss=0.473407488573816, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=3.5125961476949574, lr=0.3173846288502637
2023-12-06 11:54:27   INFO  epoch: 27/72, acc_iter=107974, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:14, time_cost(all): 1 day, 0:59:12/1 day, 14:40:32, loss=0.473348291147875, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=2.5798228583985385, lr=0.31727066177489593
2023-12-06 11:55:09   INFO  epoch: 27/72, acc_iter=108024, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:32, time_cost(all): 1 day, 0:59:54/1 day, 16:24:03, loss=0.473289093721934, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=1.2624760570095026, lr=0.3171566946995281
2023-12-06 11:55:50   INFO  epoch: 27/72, acc_iter=108074, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 1 day, 1:00:35/1 day, 15:14:18, loss=0.473229896295993, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=1.9084818923089062, lr=0.3170427276241603
2023-12-06 11:56:32   INFO  epoch: 27/72, acc_iter=108124, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 1:01:17/1 day, 17:20:07, loss=0.473170698870052, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=2.61739578869089, lr=0.3169287605487925
2023-12-06 11:57:14   INFO  epoch: 28/72, acc_iter=108186, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:20, time_cost(all): 1 day, 1:01:59/1 day, 15:32:45, loss=0.473097294061886, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=4.884089495865331, lr=0.31678744137533643
2023-12-06 11:57:56   INFO  epoch: 28/72, acc_iter=108236, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:38, time_cost(all): 1 day, 1:02:41/1 day, 15:02:13, loss=0.473038096635945, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=4.3401656069450585, lr=0.3166734742999686
2023-12-06 11:58:37   INFO  epoch: 28/72, acc_iter=108286, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:54:08, time_cost(all): 1 day, 1:03:22/1 day, 17:11:14, loss=0.472978899210004, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=4.556474392248694, lr=0.31655950722460086
2023-12-06 11:59:19   INFO  epoch: 28/72, acc_iter=108336, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:13, time_cost(all): 1 day, 1:04:04/1 day, 15:13:16, loss=0.472919701784063, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=1.039958277095872, lr=0.3164455401492331
2023-12-06 12:00:01   INFO  epoch: 28/72, acc_iter=108386, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:04, time_cost(all): 1 day, 1:04:46/1 day, 16:09:49, loss=0.472860504358122, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=4.89837004896539, lr=0.3163315730738653
2023-12-06 12:00:43   INFO  epoch: 28/72, acc_iter=108436, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:32, time_cost(all): 1 day, 1:05:28/1 day, 13:51:56, loss=0.472801306932181, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=3.422829502379139, lr=0.31621760599849746
2023-12-06 12:01:25   INFO  epoch: 28/72, acc_iter=108486, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:59, time_cost(all): 1 day, 1:06:10/1 day, 14:24:22, loss=0.47274210950624, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=2.2079715422634045, lr=0.31610363892312965
2023-12-06 12:02:06   INFO  epoch: 28/72, acc_iter=108536, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:25, time_cost(all): 1 day, 1:06:51/1 day, 14:12:50, loss=0.472682912080299, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=2.6993006681383407, lr=0.3159896718477619
2023-12-06 12:02:48   INFO  epoch: 28/72, acc_iter=108586, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:52, time_cost(all): 1 day, 1:07:33/1 day, 17:07:05, loss=0.472623714654358, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=0.7508819174531302, lr=0.3158757047723941
2023-12-06 12:03:30   INFO  epoch: 28/72, acc_iter=108636, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:23, time_cost(all): 1 day, 1:08:15/1 day, 15:14:56, loss=0.472564517228417, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=1.2368207789219734, lr=0.3157617376970263
2023-12-06 12:04:12   INFO  epoch: 28/72, acc_iter=108686, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:50, time_cost(all): 1 day, 1:08:57/1 day, 13:49:06, loss=0.472505319802476, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=1.4501268494161417, lr=0.3156477706216585
2023-12-06 12:04:53   INFO  epoch: 28/72, acc_iter=108736, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:13, time_cost(all): 1 day, 1:09:38/1 day, 16:02:45, loss=0.472446122376535, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=3.458375865826384, lr=0.3155338035462907
2023-12-06 12:05:35   INFO  epoch: 28/72, acc_iter=108786, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:47, time_cost(all): 1 day, 1:10:20/1 day, 16:15:52, loss=0.472386924950594, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=4.732159947621276, lr=0.31541983647092287
2023-12-06 12:06:17   INFO  epoch: 28/72, acc_iter=108836, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:13, time_cost(all): 1 day, 1:11:02/1 day, 14:53:09, loss=0.472327727524653, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=2.268584395644584, lr=0.31530586939555516
2023-12-06 12:06:59   INFO  epoch: 28/72, acc_iter=108886, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:03, time_cost(all): 1 day, 1:11:44/1 day, 14:36:37, loss=0.472268530098712, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=4.534696772238422, lr=0.31519190232018734
2023-12-06 12:07:41   INFO  epoch: 28/72, acc_iter=108936, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:16, time_cost(all): 1 day, 1:12:26/1 day, 16:16:00, loss=0.472209332672771, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=3.059601332262692, lr=0.31507793524481953
2023-12-06 12:08:22   INFO  epoch: 28/72, acc_iter=108986, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:53, time_cost(all): 1 day, 1:13:07/1 day, 17:12:09, loss=0.472150135246831, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=0.948634274944923, lr=0.3149639681694517
2023-12-06 12:09:04   INFO  epoch: 28/72, acc_iter=109036, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:55, time_cost(all): 1 day, 1:13:49/1 day, 16:01:58, loss=0.47209093782089, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=0.6868259029644491, lr=0.31485000109408395
2023-12-06 12:09:46   INFO  epoch: 28/72, acc_iter=109086, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:26, time_cost(all): 1 day, 1:14:31/1 day, 16:58:33, loss=0.472031740394949, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=2.980882481154488, lr=0.31473603401871614
2023-12-06 12:10:28   INFO  epoch: 28/72, acc_iter=109136, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:44, time_cost(all): 1 day, 1:15:13/1 day, 14:21:08, loss=0.471972542969008, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=1.5908441109365052, lr=0.3146220669433484
2023-12-06 12:11:09   INFO  epoch: 28/72, acc_iter=109186, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:51, time_cost(all): 1 day, 1:15:54/1 day, 14:11:21, loss=0.471913345543067, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=1.9833082034849365, lr=0.31450809986798056
2023-12-06 12:11:51   INFO  epoch: 28/72, acc_iter=109236, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:27, time_cost(all): 1 day, 1:16:36/1 day, 16:04:31, loss=0.471854148117126, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=1.8303418108748624, lr=0.31439413279261275
2023-12-06 12:12:33   INFO  epoch: 28/72, acc_iter=109286, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:41, time_cost(all): 1 day, 1:17:18/1 day, 15:39:51, loss=0.471794950691185, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=1.0805404623215225, lr=0.31428016571724493
2023-12-06 12:13:15   INFO  epoch: 28/72, acc_iter=109336, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:26, time_cost(all): 1 day, 1:18:00/1 day, 16:40:07, loss=0.471735753265244, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=2.0289174796869602, lr=0.31416619864187717
2023-12-06 12:13:57   INFO  epoch: 28/72, acc_iter=109386, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:04, time_cost(all): 1 day, 1:18:42/1 day, 13:44:10, loss=0.471676555839303, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=2.239771811325276, lr=0.3140522315665094
2023-12-06 12:14:38   INFO  epoch: 28/72, acc_iter=109436, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:17, time_cost(all): 1 day, 1:19:23/1 day, 17:05:28, loss=0.471617358413362, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.9991609549686913, lr=0.3139382644911416
2023-12-06 12:15:20   INFO  epoch: 28/72, acc_iter=109486, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:31, time_cost(all): 1 day, 1:20:05/1 day, 15:06:51, loss=0.471558160987421, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=2.0930195696008678, lr=0.3138242974157738
2023-12-06 12:16:02   INFO  epoch: 28/72, acc_iter=109536, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:35, time_cost(all): 1 day, 1:20:47/1 day, 16:53:13, loss=0.47149896356148, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=1.9839849673746832, lr=0.31371033034040596
2023-12-06 12:16:44   INFO  epoch: 28/72, acc_iter=109586, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:05, time_cost(all): 1 day, 1:21:29/1 day, 13:47:04, loss=0.471439766135539, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=1.5403149987236198, lr=0.3135963632650382
2023-12-06 12:17:25   INFO  epoch: 28/72, acc_iter=109636, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:30, time_cost(all): 1 day, 1:22:10/1 day, 16:40:33, loss=0.471380568709598, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.23(1.03), norm=0.8069425860237436, lr=0.3134823961896704
2023-12-06 12:18:07   INFO  epoch: 28/72, acc_iter=109686, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:32, time_cost(all): 1 day, 1:22:52/1 day, 15:09:40, loss=0.471321371283657, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.9628072562269896, lr=0.3133684291143026
2023-12-06 12:18:49   INFO  epoch: 28/72, acc_iter=109736, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:31, time_cost(all): 1 day, 1:23:34/1 day, 13:29:56, loss=0.471262173857716, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=2.7264473833827036, lr=0.3132544620389348
2023-12-06 12:19:31   INFO  epoch: 28/72, acc_iter=109786, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:48, time_cost(all): 1 day, 1:24:16/1 day, 14:59:52, loss=0.471202976431775, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=2.7937219328240657, lr=0.313140494963567
2023-12-06 12:20:13   INFO  epoch: 28/72, acc_iter=109836, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:21, time_cost(all): 1 day, 1:24:58/1 day, 15:09:29, loss=0.471143779005834, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=3.112158385559789, lr=0.3130265278881992
2023-12-06 12:20:54   INFO  epoch: 28/72, acc_iter=109886, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:07, time_cost(all): 1 day, 1:25:39/1 day, 15:09:26, loss=0.471084581579894, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=1.9745789443886002, lr=0.3129125608128315
2023-12-06 12:21:36   INFO  epoch: 28/72, acc_iter=109936, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:20, time_cost(all): 1 day, 1:26:21/1 day, 16:16:08, loss=0.471025384153953, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=3.8499221471890377, lr=0.31279859373746366
2023-12-06 12:22:18   INFO  epoch: 28/72, acc_iter=109986, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:29:18, time_cost(all): 1 day, 1:27:03/1 day, 16:27:21, loss=0.470966186728012, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=3.58301431684899, lr=0.31268462666209584
2023-12-06 12:23:00   INFO  epoch: 28/72, acc_iter=110036, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:22, time_cost(all): 1 day, 1:27:45/1 day, 16:49:29, loss=0.470906989302071, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=4.862448472608967, lr=0.31257065958672803
2023-12-06 12:23:41   INFO  epoch: 28/72, acc_iter=110086, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:50, time_cost(all): 1 day, 1:28:26/1 day, 16:27:06, loss=0.47084779187613, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=0.5218013139130337, lr=0.31245669251136027
2023-12-06 12:24:23   INFO  epoch: 28/72, acc_iter=110136, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:08, time_cost(all): 1 day, 1:29:08/1 day, 14:45:23, loss=0.470788594450189, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=2.8950266270283347, lr=0.31234272543599245
2023-12-06 12:25:05   INFO  epoch: 28/72, acc_iter=110186, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:58, time_cost(all): 1 day, 1:29:50/1 day, 14:40:49, loss=0.470729397024248, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=1.428848226217688, lr=0.3122287583606247
2023-12-06 12:25:47   INFO  epoch: 28/72, acc_iter=110236, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:27, time_cost(all): 1 day, 1:30:32/1 day, 15:41:35, loss=0.470670199598307, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=3.658649074787388, lr=0.3121147912852569
2023-12-06 12:26:29   INFO  epoch: 28/72, acc_iter=110286, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:49, time_cost(all): 1 day, 1:31:14/1 day, 14:22:56, loss=0.470611002172366, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=3.8935411577352514, lr=0.31200082420988906
2023-12-06 12:27:10   INFO  epoch: 28/72, acc_iter=110336, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:00, time_cost(all): 1 day, 1:31:55/1 day, 15:57:19, loss=0.470551804746425, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=1.6533201325510194, lr=0.31188685713452124
2023-12-06 12:27:52   INFO  epoch: 28/72, acc_iter=110386, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:29, time_cost(all): 1 day, 1:32:37/1 day, 16:38:31, loss=0.470492607320484, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=1.993821443921622, lr=0.3117728900591535
2023-12-06 12:28:34   INFO  epoch: 28/72, acc_iter=110436, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:43, time_cost(all): 1 day, 1:33:19/1 day, 15:58:28, loss=0.470433409894543, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=2.6074565992192626, lr=0.3116589229837857
2023-12-06 12:29:16   INFO  epoch: 28/72, acc_iter=110486, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:30, time_cost(all): 1 day, 1:34:01/1 day, 13:16:54, loss=0.470374212468602, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=1.4485509217386494, lr=0.3115449559084179
2023-12-06 12:29:58   INFO  epoch: 28/72, acc_iter=110536, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:07, time_cost(all): 1 day, 1:34:43/1 day, 15:39:41, loss=0.470315015042661, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=2.439603427649574, lr=0.3114309888330501
2023-12-06 12:30:39   INFO  epoch: 28/72, acc_iter=110586, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:30, time_cost(all): 1 day, 1:35:24/1 day, 13:04:34, loss=0.47025581761672, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=2.677421600777898, lr=0.3113170217576823
2023-12-06 12:31:21   INFO  epoch: 28/72, acc_iter=110636, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:08, time_cost(all): 1 day, 1:36:06/1 day, 16:11:59, loss=0.470196620190779, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=3.200838201857628, lr=0.3112030546823145
2023-12-06 12:32:03   INFO  epoch: 28/72, acc_iter=110686, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:45, time_cost(all): 1 day, 1:36:48/1 day, 13:01:20, loss=0.470137422764838, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=2.359585663685908, lr=0.3110890876069467
2023-12-06 12:32:45   INFO  epoch: 28/72, acc_iter=110736, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:01, time_cost(all): 1 day, 1:37:30/1 day, 14:13:43, loss=0.470078225338898, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=3.943196568273285, lr=0.31097512053157894
2023-12-06 12:33:26   INFO  epoch: 28/72, acc_iter=110786, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:29, time_cost(all): 1 day, 1:38:11/1 day, 16:49:18, loss=0.470019027912957, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=0.5738445010466331, lr=0.3108611534562111
2023-12-06 12:34:08   INFO  epoch: 28/72, acc_iter=110836, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:55, time_cost(all): 1 day, 1:38:53/1 day, 15:19:52, loss=0.469959830487016, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=1.569356548522407, lr=0.3107471863808433
2023-12-06 12:34:50   INFO  epoch: 28/72, acc_iter=110886, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:25, time_cost(all): 1 day, 1:39:35/1 day, 16:18:46, loss=0.469900633061075, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=0.7420015985261471, lr=0.3106332193054755
2023-12-06 12:35:32   INFO  epoch: 28/72, acc_iter=110936, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:04, time_cost(all): 1 day, 1:40:17/1 day, 13:50:05, loss=0.469841435635134, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=1.9974002908394421, lr=0.3105192522301078
2023-12-06 12:36:14   INFO  epoch: 28/72, acc_iter=110986, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:25, time_cost(all): 1 day, 1:40:59/1 day, 14:57:20, loss=0.469782238209193, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=2.12942669628346, lr=0.31040528515474
2023-12-06 12:36:55   INFO  epoch: 28/72, acc_iter=111036, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:35, time_cost(all): 1 day, 1:41:40/1 day, 14:31:50, loss=0.469723040783252, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=2.5230081353566014, lr=0.31029131807937216
2023-12-06 12:37:37   INFO  epoch: 28/72, acc_iter=111086, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:17, time_cost(all): 1 day, 1:42:22/1 day, 14:30:34, loss=0.469663843357311, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=1.0134359312340586, lr=0.31017735100400434
2023-12-06 12:38:19   INFO  epoch: 28/72, acc_iter=111136, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:26, time_cost(all): 1 day, 1:43:04/1 day, 16:23:41, loss=0.46960464593137, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=2.4502566411669067, lr=0.3100633839286365
2023-12-06 12:39:01   INFO  epoch: 28/72, acc_iter=111186, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:37, time_cost(all): 1 day, 1:43:46/1 day, 13:03:52, loss=0.469545448505429, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=2.114556872606644, lr=0.30994941685326877
2023-12-06 12:39:42   INFO  epoch: 28/72, acc_iter=111236, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:29, time_cost(all): 1 day, 1:44:27/1 day, 13:07:27, loss=0.469486251079488, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=2.5915177805959635, lr=0.309835449777901
2023-12-06 12:40:24   INFO  epoch: 28/72, acc_iter=111286, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:28, time_cost(all): 1 day, 1:45:09/1 day, 13:26:39, loss=0.469427053653547, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=1.3077350638191714, lr=0.3097214827025332
2023-12-06 12:41:06   INFO  epoch: 28/72, acc_iter=111336, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:15, time_cost(all): 1 day, 1:45:51/1 day, 15:56:34, loss=0.469367856227606, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=4.924597781297216, lr=0.3096075156271654
2023-12-06 12:41:48   INFO  epoch: 28/72, acc_iter=111386, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:30, time_cost(all): 1 day, 1:46:33/1 day, 13:52:46, loss=0.469308658801665, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=4.534624175607393, lr=0.30949354855179756
2023-12-06 12:42:30   INFO  epoch: 28/72, acc_iter=111436, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:07, time_cost(all): 1 day, 1:47:15/1 day, 14:59:02, loss=0.469249461375724, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=3.5480733360289847, lr=0.3093795814764298
2023-12-06 12:43:11   INFO  epoch: 28/72, acc_iter=111486, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:06, time_cost(all): 1 day, 1:47:56/1 day, 14:34:32, loss=0.469190263949783, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=4.871870019738387, lr=0.30926561440106204
2023-12-06 12:43:53   INFO  epoch: 28/72, acc_iter=111536, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:40, time_cost(all): 1 day, 1:48:38/1 day, 14:06:04, loss=0.469131066523843, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=0.8074153004093689, lr=0.3091516473256942
2023-12-06 12:44:35   INFO  epoch: 28/72, acc_iter=111586, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:31, time_cost(all): 1 day, 1:49:20/1 day, 13:27:51, loss=0.469071869097902, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.23(1.03), norm=1.9866468303535723, lr=0.3090376802503264
2023-12-06 12:45:17   INFO  epoch: 28/72, acc_iter=111636, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:17, time_cost(all): 1 day, 1:50:02/1 day, 13:07:01, loss=0.469012671671961, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=1.1051790542387605, lr=0.3089237131749586
2023-12-06 12:45:58   INFO  epoch: 28/72, acc_iter=111686, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:28, time_cost(all): 1 day, 1:50:43/1 day, 13:50:56, loss=0.46895347424602, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=3.3341502101090428, lr=0.30880974609959083
2023-12-06 12:46:40   INFO  epoch: 28/72, acc_iter=111736, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:46, time_cost(all): 1 day, 1:51:25/1 day, 15:12:51, loss=0.468894276820079, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=2.3387459427198514, lr=0.308695779024223
2023-12-06 12:47:22   INFO  epoch: 28/72, acc_iter=111786, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:04, time_cost(all): 1 day, 1:52:07/1 day, 14:19:24, loss=0.468835079394138, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=2.6490337820092202, lr=0.30858181194885526
2023-12-06 12:48:04   INFO  epoch: 28/72, acc_iter=111836, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:19, time_cost(all): 1 day, 1:52:49/1 day, 13:30:57, loss=0.468775881968197, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=3.4740307112859887, lr=0.30846784487348744
2023-12-06 12:48:46   INFO  epoch: 28/72, acc_iter=111886, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 1 day, 1:53:31/1 day, 14:27:29, loss=0.468716684542256, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=1.0614806666796577, lr=0.3083538777981196
2023-12-06 12:49:27   INFO  epoch: 28/72, acc_iter=111936, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 1 day, 1:54:12/1 day, 13:22:34, loss=0.468657487116315, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=1.3351034594694966, lr=0.30823991072275186
2023-12-06 12:50:09   INFO  epoch: 28/72, acc_iter=111986, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 1:54:54/1 day, 13:45:12, loss=0.468598289690374, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=4.6303883194601925, lr=0.30812594364738405
2023-12-06 12:50:51   INFO  epoch: 29/72, acc_iter=112048, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:36, time_cost(all): 1 day, 1:55:36/1 day, 15:47:09, loss=0.468524884882207, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=2.5622655379691097, lr=0.307984624473928
2023-12-06 12:51:33   INFO  epoch: 29/72, acc_iter=112098, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:54, time_cost(all): 1 day, 1:56:18/1 day, 15:58:55, loss=0.468465687456266, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=4.865000814979596, lr=0.30787065739856023
2023-12-06 12:52:14   INFO  epoch: 29/72, acc_iter=112148, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:31, time_cost(all): 1 day, 1:56:59/1 day, 13:38:51, loss=0.468406490030325, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=0.5467122426997072, lr=0.3077566903231924
2023-12-06 12:52:56   INFO  epoch: 29/72, acc_iter=112198, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:48, time_cost(all): 1 day, 1:57:41/1 day, 15:50:19, loss=0.468347292604384, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=2.8491617301058363, lr=0.3076427232478246
2023-12-06 12:53:38   INFO  epoch: 29/72, acc_iter=112248, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:27, time_cost(all): 1 day, 1:58:23/1 day, 13:45:09, loss=0.468288095178443, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=1.0008744901277815, lr=0.3075287561724568
2023-12-06 12:54:20   INFO  epoch: 29/72, acc_iter=112298, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:57, time_cost(all): 1 day, 1:59:05/1 day, 14:39:39, loss=0.468228897752503, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=2.6298497254810154, lr=0.307414789097089
2023-12-06 12:55:02   INFO  epoch: 29/72, acc_iter=112348, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:52, time_cost(all): 1 day, 1:59:47/1 day, 15:28:00, loss=0.468169700326562, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=2.2818579489064548, lr=0.30730082202172126
2023-12-06 12:55:43   INFO  epoch: 29/72, acc_iter=112398, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:22, time_cost(all): 1 day, 2:00:28/1 day, 13:06:51, loss=0.468110502900621, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=0.7263601825442054, lr=0.30718685494635345
2023-12-06 12:56:25   INFO  epoch: 29/72, acc_iter=112448, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:46, time_cost(all): 1 day, 2:01:10/1 day, 16:03:19, loss=0.46805130547468, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=0.7394514866634153, lr=0.30707288787098563
2023-12-06 12:57:07   INFO  epoch: 29/72, acc_iter=112498, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:27, time_cost(all): 1 day, 2:01:52/1 day, 16:15:29, loss=0.467992108048739, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=2.77656020949334, lr=0.3069589207956178
2023-12-06 12:57:49   INFO  epoch: 29/72, acc_iter=112548, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:23, time_cost(all): 1 day, 2:02:34/1 day, 14:49:52, loss=0.467932910622798, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=0.5470861271266163, lr=0.30684495372025006
2023-12-06 12:58:30   INFO  epoch: 29/72, acc_iter=112598, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:25, time_cost(all): 1 day, 2:03:15/1 day, 15:08:53, loss=0.467873713196857, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=4.410989308281505, lr=0.30673098664488224
2023-12-06 12:59:12   INFO  epoch: 29/72, acc_iter=112648, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:22, time_cost(all): 1 day, 2:03:57/1 day, 13:39:43, loss=0.467814515770916, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=2.7888525470168286, lr=0.3066170195695145
2023-12-06 12:59:54   INFO  epoch: 29/72, acc_iter=112698, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:53, time_cost(all): 1 day, 2:04:39/1 day, 12:54:13, loss=0.467755318344975, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=2.2230150500207024, lr=0.30650305249414667
2023-12-06 13:00:36   INFO  epoch: 29/72, acc_iter=112748, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:03, time_cost(all): 1 day, 2:05:21/1 day, 12:51:45, loss=0.467696120919034, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=4.5877851776631635, lr=0.30638908541877885
2023-12-06 13:01:18   INFO  epoch: 29/72, acc_iter=112798, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:36, time_cost(all): 1 day, 2:06:03/1 day, 13:52:03, loss=0.467636923493093, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=3.943529474912814, lr=0.30627511834341103
2023-12-06 13:01:59   INFO  epoch: 29/72, acc_iter=112848, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:21, time_cost(all): 1 day, 2:06:44/1 day, 14:26:16, loss=0.467577726067152, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=4.611507236055149, lr=0.30616115126804333
2023-12-06 13:02:41   INFO  epoch: 29/72, acc_iter=112898, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:45, time_cost(all): 1 day, 2:07:26/1 day, 15:23:42, loss=0.467518528641211, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=2.6421168169228437, lr=0.3060471841926755
2023-12-06 13:03:23   INFO  epoch: 29/72, acc_iter=112948, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:46, time_cost(all): 1 day, 2:08:08/1 day, 13:24:30, loss=0.46745933121527, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=4.209549864243332, lr=0.3059332171173077
2023-12-06 13:04:05   INFO  epoch: 29/72, acc_iter=112998, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:16, time_cost(all): 1 day, 2:08:50/1 day, 13:24:00, loss=0.467400133789329, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=1.566335682516292, lr=0.3058192500419399
2023-12-06 13:04:47   INFO  epoch: 29/72, acc_iter=113048, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:16, time_cost(all): 1 day, 2:09:32/1 day, 12:46:05, loss=0.467340936363388, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=1.2607918140733216, lr=0.3057052829665721
2023-12-06 13:05:28   INFO  epoch: 29/72, acc_iter=113098, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:04, time_cost(all): 1 day, 2:10:13/1 day, 12:54:37, loss=0.467281738937448, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=0.8933518194296679, lr=0.3055913158912043
2023-12-06 13:06:10   INFO  epoch: 29/72, acc_iter=113148, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:17, time_cost(all): 1 day, 2:10:55/1 day, 14:26:27, loss=0.467222541511507, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=1.9082469494619159, lr=0.30547734881583655
2023-12-06 13:06:52   INFO  epoch: 29/72, acc_iter=113198, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:43, time_cost(all): 1 day, 2:11:37/1 day, 15:41:35, loss=0.467163344085566, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=4.805671990775125, lr=0.30536338174046873
2023-12-06 13:07:34   INFO  epoch: 29/72, acc_iter=113248, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:56, time_cost(all): 1 day, 2:12:19/1 day, 15:52:30, loss=0.467104146659625, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=1.47262355598063, lr=0.3052494146651009
2023-12-06 13:08:15   INFO  epoch: 29/72, acc_iter=113298, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:33:54, time_cost(all): 1 day, 2:13:00/1 day, 15:20:39, loss=0.467044949233684, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=1.9843752183698848, lr=0.3051354475897331
2023-12-06 13:08:57   INFO  epoch: 29/72, acc_iter=113348, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:51, time_cost(all): 1 day, 2:13:42/1 day, 14:45:26, loss=0.466985751807743, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=1.9389430371406848, lr=0.3050214805143653
2023-12-06 13:09:39   INFO  epoch: 29/72, acc_iter=113398, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:50, time_cost(all): 1 day, 2:14:24/1 day, 15:18:22, loss=0.466926554381802, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=2.4466854373090663, lr=0.3049075134389976
2023-12-06 13:10:21   INFO  epoch: 29/72, acc_iter=113448, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:19, time_cost(all): 1 day, 2:15:06/1 day, 15:50:07, loss=0.466867356955861, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=0.9127112774166379, lr=0.30479354636362976
2023-12-06 13:11:03   INFO  epoch: 29/72, acc_iter=113498, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:44, time_cost(all): 1 day, 2:15:48/1 day, 12:38:29, loss=0.46680815952992, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=1.0140622047029377, lr=0.30467957928826195
2023-12-06 13:11:44   INFO  epoch: 29/72, acc_iter=113548, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:52, time_cost(all): 1 day, 2:16:29/1 day, 13:23:09, loss=0.466748962103979, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=3.319988125153161, lr=0.30456561221289413
2023-12-06 13:12:26   INFO  epoch: 29/72, acc_iter=113598, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:27, time_cost(all): 1 day, 2:17:11/1 day, 14:34:17, loss=0.466689764678038, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=4.805775064712658, lr=0.30445164513752637
2023-12-06 13:13:08   INFO  epoch: 29/72, acc_iter=113648, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:15, time_cost(all): 1 day, 2:17:53/1 day, 13:22:24, loss=0.466630567252097, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=2.481915813687201, lr=0.30433767806215856
2023-12-06 13:13:50   INFO  epoch: 29/72, acc_iter=113698, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:34, time_cost(all): 1 day, 2:18:35/1 day, 13:14:24, loss=0.466571369826156, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=4.958779874255747, lr=0.3042237109867908
2023-12-06 13:14:31   INFO  epoch: 29/72, acc_iter=113748, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:30, time_cost(all): 1 day, 2:19:16/1 day, 14:48:28, loss=0.466512172400215, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=2.9403188872540937, lr=0.304109743911423
2023-12-06 13:15:13   INFO  epoch: 29/72, acc_iter=113798, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:38, time_cost(all): 1 day, 2:19:58/1 day, 14:09:46, loss=0.466452974974274, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=1.3350074397478964, lr=0.30399577683605516
2023-12-06 13:15:55   INFO  epoch: 29/72, acc_iter=113848, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:58, time_cost(all): 1 day, 2:20:40/1 day, 14:58:47, loss=0.466393777548333, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=3.3591683024999024, lr=0.30388180976068735
2023-12-06 13:16:37   INFO  epoch: 29/72, acc_iter=113898, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:43, time_cost(all): 1 day, 2:21:22/1 day, 14:08:55, loss=0.466334580122392, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=1.551186611178752, lr=0.30376784268531964
2023-12-06 13:17:19   INFO  epoch: 29/72, acc_iter=113948, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:16, time_cost(all): 1 day, 2:22:04/1 day, 12:50:34, loss=0.466275382696452, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=1.4769045312297429, lr=0.30365387560995183
2023-12-06 13:18:00   INFO  epoch: 29/72, acc_iter=113998, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:20, time_cost(all): 1 day, 2:22:45/1 day, 12:20:25, loss=0.466216185270511, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=3.4966011850527154, lr=0.303539908534584
2023-12-06 13:18:42   INFO  epoch: 29/72, acc_iter=114048, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:21, time_cost(all): 1 day, 2:23:27/1 day, 13:03:25, loss=0.46615698784457, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=1.916642046241388, lr=0.3034259414592162
2023-12-06 13:19:24   INFO  epoch: 29/72, acc_iter=114098, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:38, time_cost(all): 1 day, 2:24:09/1 day, 15:19:42, loss=0.466097790418629, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=1.1587004401104908, lr=0.3033119743838484
2023-12-06 13:20:06   INFO  epoch: 29/72, acc_iter=114148, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:35, time_cost(all): 1 day, 2:24:51/1 day, 16:01:17, loss=0.466038592992688, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=2.2471880663882473, lr=0.3031980073084806
2023-12-06 13:20:47   INFO  epoch: 29/72, acc_iter=114198, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:39, time_cost(all): 1 day, 2:25:32/1 day, 14:27:48, loss=0.465979395566747, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=0.8381172314023307, lr=0.3030840402331128
2023-12-06 13:21:29   INFO  epoch: 29/72, acc_iter=114248, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:31, time_cost(all): 1 day, 2:26:14/1 day, 12:19:14, loss=0.465920198140806, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=1.5746061561427316, lr=0.30297007315774505
2023-12-06 13:22:11   INFO  epoch: 29/72, acc_iter=114298, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:07, time_cost(all): 1 day, 2:26:56/1 day, 12:56:24, loss=0.465861000714865, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=1.888287703370762, lr=0.30285610608237723
2023-12-06 13:22:53   INFO  epoch: 29/72, acc_iter=114348, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:41, time_cost(all): 1 day, 2:27:38/1 day, 13:40:37, loss=0.465801803288924, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=2.7942493716457815, lr=0.3027421390070094
2023-12-06 13:23:35   INFO  epoch: 29/72, acc_iter=114398, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:24, time_cost(all): 1 day, 2:28:20/1 day, 13:37:55, loss=0.465742605862983, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=3.1094901045740255, lr=0.30262817193164165
2023-12-06 13:24:16   INFO  epoch: 29/72, acc_iter=114448, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:19, time_cost(all): 1 day, 2:29:01/1 day, 14:09:57, loss=0.465683408437042, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=4.883476380556318, lr=0.3025142048562739
2023-12-06 13:24:58   INFO  epoch: 29/72, acc_iter=114498, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:23, time_cost(all): 1 day, 2:29:43/1 day, 14:28:46, loss=0.465624211011101, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=0.7229959668629508, lr=0.3024002377809061
2023-12-06 13:25:40   INFO  epoch: 29/72, acc_iter=114548, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:45, time_cost(all): 1 day, 2:30:25/1 day, 12:29:41, loss=0.46556501358516, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=3.915859568374292, lr=0.30228627070553826
2023-12-06 13:26:22   INFO  epoch: 29/72, acc_iter=114598, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:06, time_cost(all): 1 day, 2:31:07/1 day, 14:31:14, loss=0.465505816159219, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=1.8884404194593345, lr=0.30217230363017045
2023-12-06 13:27:03   INFO  epoch: 29/72, acc_iter=114648, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:16, time_cost(all): 1 day, 2:31:48/1 day, 15:18:49, loss=0.465446618733278, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=3.0560977701336722, lr=0.3020583365548027
2023-12-06 13:27:45   INFO  epoch: 29/72, acc_iter=114698, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:28, time_cost(all): 1 day, 2:32:30/1 day, 12:09:17, loss=0.465387421307337, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=3.613701517657833, lr=0.30194436947943487
2023-12-06 13:28:27   INFO  epoch: 29/72, acc_iter=114748, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:14, time_cost(all): 1 day, 2:33:12/1 day, 14:09:18, loss=0.465328223881396, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.83(1.03), norm=1.5518967416230414, lr=0.3018304024040671
2023-12-06 13:29:09   INFO  epoch: 29/72, acc_iter=114798, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:01, time_cost(all): 1 day, 2:33:54/1 day, 14:35:10, loss=0.465269026455456, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=0.8429664496356826, lr=0.3017164353286993
2023-12-06 13:29:51   INFO  epoch: 29/72, acc_iter=114848, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:22, time_cost(all): 1 day, 2:34:36/1 day, 14:07:03, loss=0.465209829029515, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=1.8765377461780186, lr=0.3016024682533315
2023-12-06 13:30:32   INFO  epoch: 29/72, acc_iter=114898, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:50, time_cost(all): 1 day, 2:35:17/1 day, 13:13:10, loss=0.465150631603574, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=2.395603899364195, lr=0.30148850117796366
2023-12-06 13:31:14   INFO  epoch: 29/72, acc_iter=114948, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:21, time_cost(all): 1 day, 2:35:59/1 day, 14:27:32, loss=0.465091434177633, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=3.2183645579881848, lr=0.3013745341025959
2023-12-06 13:31:56   INFO  epoch: 29/72, acc_iter=114998, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:43, time_cost(all): 1 day, 2:36:41/1 day, 13:13:47, loss=0.465032236751692, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=1.9768466310266726, lr=0.30126056702722814
2023-12-06 13:32:38   INFO  epoch: 29/72, acc_iter=115048, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:22, time_cost(all): 1 day, 2:37:23/1 day, 15:43:13, loss=0.464973039325751, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=3.9660381852835833, lr=0.3011465999518603
2023-12-06 13:33:19   INFO  epoch: 29/72, acc_iter=115098, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:25, time_cost(all): 1 day, 2:38:04/1 day, 12:28:51, loss=0.46491384189981, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=3.7916473611755417, lr=0.3010326328764925
2023-12-06 13:34:01   INFO  epoch: 29/72, acc_iter=115148, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:54, time_cost(all): 1 day, 2:38:46/1 day, 13:00:10, loss=0.464854644473869, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=1.00605350460022, lr=0.3009186658011247
2023-12-06 13:34:43   INFO  epoch: 29/72, acc_iter=115198, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:26, time_cost(all): 1 day, 2:39:28/1 day, 15:34:19, loss=0.464795447047928, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=2.287891668409453, lr=0.30080469872575694
2023-12-06 13:35:25   INFO  epoch: 29/72, acc_iter=115248, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:43, time_cost(all): 1 day, 2:40:10/1 day, 14:56:28, loss=0.464736249621987, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=4.518056570463556, lr=0.3006907316503892
2023-12-06 13:36:07   INFO  epoch: 29/72, acc_iter=115298, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:06, time_cost(all): 1 day, 2:40:52/1 day, 12:24:48, loss=0.464677052196046, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=1.5393968966171145, lr=0.30057676457502136
2023-12-06 13:36:48   INFO  epoch: 29/72, acc_iter=115348, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:59, time_cost(all): 1 day, 2:41:33/1 day, 14:16:17, loss=0.464617854770105, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=2.6118320533737727, lr=0.30046279749965354
2023-12-06 13:37:30   INFO  epoch: 29/72, acc_iter=115398, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:16, time_cost(all): 1 day, 2:42:15/1 day, 13:55:36, loss=0.464558657344164, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=0.9788735460866675, lr=0.30034883042428573
2023-12-06 13:38:12   INFO  epoch: 29/72, acc_iter=115448, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:39, time_cost(all): 1 day, 2:42:57/1 day, 13:24:01, loss=0.464499459918223, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=2.9482067400253644, lr=0.30023486334891797
2023-12-06 13:38:54   INFO  epoch: 29/72, acc_iter=115498, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:54, time_cost(all): 1 day, 2:43:39/1 day, 12:54:55, loss=0.464440262492282, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.23(1.03), norm=4.049569059925066, lr=0.3001208962735502
2023-12-06 13:39:36   INFO  epoch: 29/72, acc_iter=115548, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:14, time_cost(all): 1 day, 2:44:21/1 day, 12:38:24, loss=0.464381065066341, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.4775889132329836, lr=0.3000069291981824
2023-12-06 13:40:17   INFO  epoch: 29/72, acc_iter=115598, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:43, time_cost(all): 1 day, 2:45:02/1 day, 12:28:52, loss=0.4643218676404, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=4.417021992430524, lr=0.2998929621228146
2023-12-06 13:40:59   INFO  epoch: 29/72, acc_iter=115648, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:05, time_cost(all): 1 day, 2:45:44/1 day, 14:03:14, loss=0.46426267021446, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=3.499233347454357, lr=0.29977899504744676
2023-12-06 13:41:41   INFO  epoch: 29/72, acc_iter=115698, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:08, time_cost(all): 1 day, 2:46:26/1 day, 13:45:17, loss=0.464203472788519, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.04(1.03), norm=3.098948295868548, lr=0.299665027972079
2023-12-06 13:42:23   INFO  epoch: 29/72, acc_iter=115748, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:36, time_cost(all): 1 day, 2:47:08/1 day, 14:55:40, loss=0.464144275362578, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=4.756297999420839, lr=0.2995510608967112
2023-12-06 13:43:04   INFO  epoch: 29/72, acc_iter=115798, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 1 day, 2:47:49/1 day, 15:09:21, loss=0.464085077936637, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=2.856568831878911, lr=0.2994370938213434
2023-12-06 13:43:46   INFO  epoch: 29/72, acc_iter=115848, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 2:48:31/1 day, 12:06:11, loss=0.464025880510696, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=3.135214319628933, lr=0.2993231267459756
2023-12-06 13:44:28   INFO  epoch: 30/72, acc_iter=115910, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:10, time_cost(all): 1 day, 2:49:13/1 day, 14:43:04, loss=0.463952475702529, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=2.08647759621159, lr=0.29918180757251955
2023-12-06 13:45:10   INFO  epoch: 30/72, acc_iter=115960, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:40, time_cost(all): 1 day, 2:49:55/1 day, 12:08:46, loss=0.463893278276588, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.705047595949422, lr=0.29906784049715174
2023-12-06 13:45:52   INFO  epoch: 30/72, acc_iter=116010, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:30, time_cost(all): 1 day, 2:50:37/1 day, 15:18:44, loss=0.463834080850647, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=4.098442761800895, lr=0.298953873421784
2023-12-06 13:46:33   INFO  epoch: 30/72, acc_iter=116060, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:09, time_cost(all): 1 day, 2:51:18/1 day, 13:34:52, loss=0.463774883424706, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=4.530718049601667, lr=0.29883990634641616
2023-12-06 13:47:15   INFO  epoch: 30/72, acc_iter=116110, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:54, time_cost(all): 1 day, 2:52:00/1 day, 13:18:35, loss=0.463715685998765, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=3.7206328359536167, lr=0.2987259392710484
2023-12-06 13:47:57   INFO  epoch: 30/72, acc_iter=116160, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:12, time_cost(all): 1 day, 2:52:42/1 day, 14:45:29, loss=0.463656488572824, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=3.0391210462326717, lr=0.2986119721956806
2023-12-06 13:48:39   INFO  epoch: 30/72, acc_iter=116210, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:51, time_cost(all): 1 day, 2:53:24/1 day, 15:05:44, loss=0.463597291146883, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=1.3839973454434733, lr=0.29849800512031277
2023-12-06 13:49:20   INFO  epoch: 30/72, acc_iter=116260, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:54, time_cost(all): 1 day, 2:54:05/1 day, 12:11:52, loss=0.463538093720942, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=4.843438518305559, lr=0.29838403804494495
2023-12-06 13:50:02   INFO  epoch: 30/72, acc_iter=116310, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:43, time_cost(all): 1 day, 2:54:47/1 day, 14:36:22, loss=0.463478896295001, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=2.9601207788560138, lr=0.29827007096957714
2023-12-06 13:50:44   INFO  epoch: 30/72, acc_iter=116360, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:38, time_cost(all): 1 day, 2:55:29/1 day, 11:51:06, loss=0.463419698869061, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=3.6392477644569565, lr=0.29815610389420943
2023-12-06 13:51:26   INFO  epoch: 30/72, acc_iter=116410, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:23, time_cost(all): 1 day, 2:56:11/1 day, 12:45:10, loss=0.46336050144312, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=0.8728381192607817, lr=0.2980421368188416
2023-12-06 13:52:08   INFO  epoch: 30/72, acc_iter=116460, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:49, time_cost(all): 1 day, 2:56:53/1 day, 12:36:25, loss=0.463301304017179, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=3.813340733415864, lr=0.2979281697434738
2023-12-06 13:52:49   INFO  epoch: 30/72, acc_iter=116510, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:44, time_cost(all): 1 day, 2:57:34/1 day, 15:08:20, loss=0.463242106591238, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=3.4319327330703238, lr=0.297814202668106
2023-12-06 13:53:31   INFO  epoch: 30/72, acc_iter=116560, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:50, time_cost(all): 1 day, 2:58:16/1 day, 15:25:16, loss=0.463182909165297, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=3.615336608399155, lr=0.2977002355927382
2023-12-06 13:54:13   INFO  epoch: 30/72, acc_iter=116610, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:06, time_cost(all): 1 day, 2:58:58/1 day, 15:14:22, loss=0.463123711739356, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.8074785070494324, lr=0.2975862685173704
2023-12-06 13:54:55   INFO  epoch: 30/72, acc_iter=116660, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:47, time_cost(all): 1 day, 2:59:40/1 day, 14:54:43, loss=0.463064514313415, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=2.7722242234521683, lr=0.29747230144200265
2023-12-06 13:55:36   INFO  epoch: 30/72, acc_iter=116710, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:40, time_cost(all): 1 day, 3:00:21/1 day, 12:54:44, loss=0.463005316887474, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=3.8041501641352697, lr=0.29735833436663484
2023-12-06 13:56:18   INFO  epoch: 30/72, acc_iter=116760, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:11, time_cost(all): 1 day, 3:01:03/1 day, 14:16:01, loss=0.462946119461533, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=1.8868719321078258, lr=0.297244367291267
2023-12-06 13:57:00   INFO  epoch: 30/72, acc_iter=116810, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:38:34, time_cost(all): 1 day, 3:01:45/1 day, 11:50:01, loss=0.462886922035592, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=0.5765460564821426, lr=0.2971304002158992
2023-12-06 13:57:42   INFO  epoch: 30/72, acc_iter=116860, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:52, time_cost(all): 1 day, 3:02:27/1 day, 14:32:26, loss=0.462827724609651, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=3.0122113267691626, lr=0.2970164331405315
2023-12-06 13:58:24   INFO  epoch: 30/72, acc_iter=116910, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:03, time_cost(all): 1 day, 3:03:09/1 day, 14:52:44, loss=0.46276852718371, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.86(1.03), norm=4.7554444246194185, lr=0.2969024660651637
2023-12-06 13:59:05   INFO  epoch: 30/72, acc_iter=116960, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:40:06, time_cost(all): 1 day, 3:03:50/1 day, 12:59:55, loss=0.462709329757769, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=2.0224247798066424, lr=0.29678849898979587
2023-12-06 13:59:47   INFO  epoch: 30/72, acc_iter=117010, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:15, time_cost(all): 1 day, 3:04:32/1 day, 12:27:05, loss=0.462650132331828, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=2.738764290630357, lr=0.29667453191442805
2023-12-06 14:00:29   INFO  epoch: 30/72, acc_iter=117060, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:12, time_cost(all): 1 day, 3:05:14/1 day, 14:06:37, loss=0.462590934905887, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=4.654119643223879, lr=0.29656056483906024
2023-12-06 14:01:11   INFO  epoch: 30/72, acc_iter=117110, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:28, time_cost(all): 1 day, 3:05:56/1 day, 14:30:10, loss=0.462531737479946, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=1.7744484090243238, lr=0.2964465977636925
2023-12-06 14:01:52   INFO  epoch: 30/72, acc_iter=117160, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:26, time_cost(all): 1 day, 3:06:37/1 day, 14:15:46, loss=0.462472540054005, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=1.5444604099736714, lr=0.29633263068832466
2023-12-06 14:02:34   INFO  epoch: 30/72, acc_iter=117210, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:42, time_cost(all): 1 day, 3:07:19/1 day, 12:40:15, loss=0.462413342628065, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=4.085148603482457, lr=0.2962186636129569
2023-12-06 14:03:16   INFO  epoch: 30/72, acc_iter=117260, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:53, time_cost(all): 1 day, 3:08:01/1 day, 12:17:10, loss=0.462354145202124, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=1.5877786063730606, lr=0.2961046965375891
2023-12-06 14:03:58   INFO  epoch: 30/72, acc_iter=117310, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:03, time_cost(all): 1 day, 3:08:43/1 day, 14:00:40, loss=0.462294947776183, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=3.986197740693062, lr=0.29599072946222127
2023-12-06 14:04:40   INFO  epoch: 30/72, acc_iter=117360, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:03, time_cost(all): 1 day, 3:09:25/1 day, 13:18:45, loss=0.462235750350242, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=1.1666892740376589, lr=0.29587676238685345
2023-12-06 14:05:21   INFO  epoch: 30/72, acc_iter=117410, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:30, time_cost(all): 1 day, 3:10:06/1 day, 11:45:09, loss=0.462176552924301, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=1.3664706191307312, lr=0.29576279531148575
2023-12-06 14:06:03   INFO  epoch: 30/72, acc_iter=117460, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:35, time_cost(all): 1 day, 3:10:48/1 day, 14:49:28, loss=0.46211735549836, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.746899028134816, lr=0.29564882823611793
2023-12-06 14:06:45   INFO  epoch: 30/72, acc_iter=117510, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:37, time_cost(all): 1 day, 3:11:30/1 day, 13:05:44, loss=0.462058158072419, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=1.1031818743728876, lr=0.2955348611607501
2023-12-06 14:07:27   INFO  epoch: 30/72, acc_iter=117560, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:07, time_cost(all): 1 day, 3:12:12/1 day, 14:35:07, loss=0.461998960646478, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.23(1.03), norm=4.8088181582648435, lr=0.2954208940853823
2023-12-06 14:08:08   INFO  epoch: 30/72, acc_iter=117610, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:50, time_cost(all): 1 day, 3:12:53/1 day, 12:14:47, loss=0.461939763220537, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.83(1.03), norm=1.3234697857400193, lr=0.29530692701001454
2023-12-06 14:08:50   INFO  epoch: 30/72, acc_iter=117660, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:54, time_cost(all): 1 day, 3:13:35/1 day, 13:38:58, loss=0.461880565794596, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=2.645312157214134, lr=0.2951929599346467
2023-12-06 14:09:32   INFO  epoch: 30/72, acc_iter=117710, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:06, time_cost(all): 1 day, 3:14:17/1 day, 14:12:50, loss=0.461821368368655, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=0.8478345888511831, lr=0.29507899285927897
2023-12-06 14:10:14   INFO  epoch: 30/72, acc_iter=117760, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:42, time_cost(all): 1 day, 3:14:59/1 day, 12:13:59, loss=0.461762170942714, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=3.759728181571685, lr=0.29496502578391115
2023-12-06 14:10:56   INFO  epoch: 30/72, acc_iter=117810, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:01, time_cost(all): 1 day, 3:15:41/1 day, 14:49:27, loss=0.461702973516773, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=2.1212051561397676, lr=0.29485105870854333
2023-12-06 14:11:37   INFO  epoch: 30/72, acc_iter=117860, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:24:42, time_cost(all): 1 day, 3:16:22/1 day, 11:27:20, loss=0.461643776090832, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=4.199543402182147, lr=0.2947370916331755
2023-12-06 14:12:19   INFO  epoch: 30/72, acc_iter=117910, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:09, time_cost(all): 1 day, 3:17:04/1 day, 11:56:37, loss=0.461584578664891, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=2.8808338727068525, lr=0.29462312455780776
2023-12-06 14:13:01   INFO  epoch: 30/72, acc_iter=117960, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:20, time_cost(all): 1 day, 3:17:46/1 day, 11:24:51, loss=0.46152538123895, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.6893113327526135, lr=0.29450915748244
2023-12-06 14:13:43   INFO  epoch: 30/72, acc_iter=118010, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:54, time_cost(all): 1 day, 3:18:28/1 day, 13:52:31, loss=0.461466183813009, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.3366048918337436, lr=0.2943951904070722
2023-12-06 14:14:25   INFO  epoch: 30/72, acc_iter=118060, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:56, time_cost(all): 1 day, 3:19:10/1 day, 12:31:29, loss=0.461406986387069, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=4.2968435918730155, lr=0.29428122333170437
2023-12-06 14:15:06   INFO  epoch: 30/72, acc_iter=118110, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:28, time_cost(all): 1 day, 3:19:51/1 day, 13:41:39, loss=0.461347788961128, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=3.4833072413692934, lr=0.29416725625633655
2023-12-06 14:15:48   INFO  epoch: 30/72, acc_iter=118160, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:04, time_cost(all): 1 day, 3:20:33/1 day, 14:36:48, loss=0.461288591535187, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=1.6536356504847582, lr=0.2940532891809688
2023-12-06 14:16:30   INFO  epoch: 30/72, acc_iter=118210, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:23, time_cost(all): 1 day, 3:21:15/1 day, 14:12:52, loss=0.461229394109246, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=2.1456629745690305, lr=0.293939322105601
2023-12-06 14:17:12   INFO  epoch: 30/72, acc_iter=118260, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:37, time_cost(all): 1 day, 3:21:57/1 day, 13:15:44, loss=0.461170196683305, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=1.5234763550459498, lr=0.2938253550302332
2023-12-06 14:17:53   INFO  epoch: 30/72, acc_iter=118310, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:29, time_cost(all): 1 day, 3:22:38/1 day, 13:48:19, loss=0.461110999257364, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=4.745721431101915, lr=0.2937113879548654
2023-12-06 14:18:35   INFO  epoch: 30/72, acc_iter=118360, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:48, time_cost(all): 1 day, 3:23:20/1 day, 12:35:56, loss=0.461051801831423, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=3.9292284377237365, lr=0.2935974208794976
2023-12-06 14:19:17   INFO  epoch: 30/72, acc_iter=118410, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:41, time_cost(all): 1 day, 3:24:02/1 day, 11:49:56, loss=0.460992604405482, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=3.2784106330243747, lr=0.2934834538041298
2023-12-06 14:19:59   INFO  epoch: 30/72, acc_iter=118460, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:12, time_cost(all): 1 day, 3:24:44/1 day, 11:16:28, loss=0.460933406979541, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=4.153143225394538, lr=0.29336948672876206
2023-12-06 14:20:41   INFO  epoch: 30/72, acc_iter=118510, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:04, time_cost(all): 1 day, 3:25:26/1 day, 12:04:02, loss=0.4608742095536, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=2.5510194795495487, lr=0.29325551965339425
2023-12-06 14:21:22   INFO  epoch: 30/72, acc_iter=118560, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:27, time_cost(all): 1 day, 3:26:07/1 day, 13:47:23, loss=0.460815012127659, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=3.663536760403885, lr=0.29314155257802643
2023-12-06 14:22:04   INFO  epoch: 30/72, acc_iter=118610, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:10, time_cost(all): 1 day, 3:26:49/1 day, 14:39:30, loss=0.460755814701718, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=2.053895829020743, lr=0.2930275855026586
2023-12-06 14:22:46   INFO  epoch: 30/72, acc_iter=118660, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:30, time_cost(all): 1 day, 3:27:31/1 day, 14:22:02, loss=0.460696617275777, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=2.0644541423842324, lr=0.29291361842729086
2023-12-06 14:23:28   INFO  epoch: 30/72, acc_iter=118710, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:29, time_cost(all): 1 day, 3:28:13/1 day, 12:20:01, loss=0.460637419849836, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=1.6506583997518427, lr=0.29279965135192304
2023-12-06 14:24:09   INFO  epoch: 30/72, acc_iter=118760, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:59, time_cost(all): 1 day, 3:28:54/1 day, 11:18:42, loss=0.460578222423895, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=3.374993805169158, lr=0.2926856842765553
2023-12-06 14:24:51   INFO  epoch: 30/72, acc_iter=118810, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:55, time_cost(all): 1 day, 3:29:36/1 day, 12:39:42, loss=0.460519024997954, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=0.954949415257373, lr=0.29257171720118746
2023-12-06 14:25:33   INFO  epoch: 30/72, acc_iter=118860, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:22, time_cost(all): 1 day, 3:30:18/1 day, 14:52:58, loss=0.460459827572013, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=3.2690456819232123, lr=0.29245775012581965
2023-12-06 14:26:15   INFO  epoch: 30/72, acc_iter=118910, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:35, time_cost(all): 1 day, 3:31:00/1 day, 13:52:51, loss=0.460400630146073, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=1.989892042371053, lr=0.2923437830504519
2023-12-06 14:26:57   INFO  epoch: 30/72, acc_iter=118960, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:28, time_cost(all): 1 day, 3:31:42/1 day, 11:32:20, loss=0.460341432720132, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=3.980023527094225, lr=0.2922298159750841
2023-12-06 14:27:38   INFO  epoch: 30/72, acc_iter=119010, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:04, time_cost(all): 1 day, 3:32:23/1 day, 13:23:09, loss=0.460282235294191, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.9(1.03), norm=2.8461535720844715, lr=0.2921158488997163
2023-12-06 14:28:20   INFO  epoch: 30/72, acc_iter=119060, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:46, time_cost(all): 1 day, 3:33:05/1 day, 14:05:26, loss=0.46022303786825, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=3.3285067769363894, lr=0.2920018818243485
2023-12-06 14:29:02   INFO  epoch: 30/72, acc_iter=119110, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:26, time_cost(all): 1 day, 3:33:47/1 day, 11:52:11, loss=0.460163840442309, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=4.646227148351398, lr=0.2918879147489807
2023-12-06 14:29:44   INFO  epoch: 30/72, acc_iter=119160, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:50, time_cost(all): 1 day, 3:34:29/1 day, 13:06:59, loss=0.460104643016368, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=2.2005442977004828, lr=0.29177394767361287
2023-12-06 14:30:25   INFO  epoch: 30/72, acc_iter=119210, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:20, time_cost(all): 1 day, 3:35:10/1 day, 14:15:27, loss=0.460045445590427, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=2.872798273402037, lr=0.2916599805982451
2023-12-06 14:31:07   INFO  epoch: 30/72, acc_iter=119260, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:31, time_cost(all): 1 day, 3:35:52/1 day, 12:57:33, loss=0.459986248164486, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=2.7803657338844987, lr=0.29154601352287735
2023-12-06 14:31:49   INFO  epoch: 30/72, acc_iter=119310, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:39, time_cost(all): 1 day, 3:36:34/1 day, 11:10:57, loss=0.459927050738545, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=0.7986860679882202, lr=0.29143204644750953
2023-12-06 14:32:31   INFO  epoch: 30/72, acc_iter=119360, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:01, time_cost(all): 1 day, 3:37:16/1 day, 13:08:44, loss=0.459867853312604, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=1.134165354855447, lr=0.2913180793721417
2023-12-06 14:33:13   INFO  epoch: 30/72, acc_iter=119410, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:15, time_cost(all): 1 day, 3:37:58/1 day, 13:05:01, loss=0.459808655886663, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=3.28052014457883, lr=0.2912041122967739
2023-12-06 14:33:54   INFO  epoch: 30/72, acc_iter=119460, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:38, time_cost(all): 1 day, 3:38:39/1 day, 12:04:40, loss=0.459749458460722, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=2.1146349749160507, lr=0.29109014522140614
2023-12-06 14:34:36   INFO  epoch: 30/72, acc_iter=119510, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:05, time_cost(all): 1 day, 3:39:21/1 day, 12:38:06, loss=0.459690261034781, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=4.021382172591071, lr=0.2909761781460384
2023-12-06 14:35:18   INFO  epoch: 30/72, acc_iter=119560, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:15, time_cost(all): 1 day, 3:40:03/1 day, 12:53:43, loss=0.45963106360884, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=3.215464656998451, lr=0.29086221107067056
2023-12-06 14:36:00   INFO  epoch: 30/72, acc_iter=119610, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:34, time_cost(all): 1 day, 3:40:45/1 day, 11:39:53, loss=0.459571866182899, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=0.908220203349153, lr=0.29074824399530275
2023-12-06 14:36:41   INFO  epoch: 30/72, acc_iter=119660, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:49, time_cost(all): 1 day, 3:41:26/1 day, 12:53:34, loss=0.459512668756958, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=1.70325215951366, lr=0.29063427691993493
2023-12-06 14:37:23   INFO  epoch: 30/72, acc_iter=119710, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 3:42:08/1 day, 12:48:58, loss=0.459453471331017, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=0.5025500602631642, lr=0.2905203098445671
2023-12-06 14:38:05   INFO  epoch: 31/72, acc_iter=119772, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:07, time_cost(all): 1 day, 3:42:50/1 day, 13:28:21, loss=0.459380066522851, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=2.2816691398507536, lr=0.29037899067111106
2023-12-06 14:38:47   INFO  epoch: 31/72, acc_iter=119822, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:23, time_cost(all): 1 day, 3:43:32/1 day, 12:22:42, loss=0.45932086909691, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=2.253753621163037, lr=0.2902650235957433
2023-12-06 14:39:29   INFO  epoch: 31/72, acc_iter=119872, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:07, time_cost(all): 1 day, 3:44:14/1 day, 12:49:45, loss=0.459261671670969, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=2.4801820492852746, lr=0.29015105652037554
2023-12-06 14:40:10   INFO  epoch: 31/72, acc_iter=119922, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:29, time_cost(all): 1 day, 3:44:55/1 day, 12:14:06, loss=0.459202474245028, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=1.8507519541333557, lr=0.2900370894450077
2023-12-06 14:40:52   INFO  epoch: 31/72, acc_iter=119972, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:26, time_cost(all): 1 day, 3:45:37/1 day, 13:13:00, loss=0.459143276819087, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=1.2587857543820782, lr=0.2899231223696399
2023-12-06 14:41:34   INFO  epoch: 31/72, acc_iter=120022, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:26, time_cost(all): 1 day, 3:46:19/1 day, 12:24:09, loss=0.459084079393146, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=3.7000848629204075, lr=0.2898091552942721
2023-12-06 14:42:16   INFO  epoch: 31/72, acc_iter=120072, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:02, time_cost(all): 1 day, 3:47:01/1 day, 13:19:48, loss=0.459024881967205, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=3.547731146967213, lr=0.28969518821890433
2023-12-06 14:42:57   INFO  epoch: 31/72, acc_iter=120122, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:32, time_cost(all): 1 day, 3:47:42/1 day, 10:59:58, loss=0.458965684541264, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=2.981098424097948, lr=0.2895812211435365
2023-12-06 14:43:39   INFO  epoch: 31/72, acc_iter=120172, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:38, time_cost(all): 1 day, 3:48:24/1 day, 11:40:14, loss=0.458906487115323, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=2.0309516410516233, lr=0.28946725406816876
2023-12-06 14:44:21   INFO  epoch: 31/72, acc_iter=120222, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:19, time_cost(all): 1 day, 3:49:06/1 day, 13:00:52, loss=0.458847289689382, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=3.014836181629042, lr=0.28935328699280094
2023-12-06 14:45:03   INFO  epoch: 31/72, acc_iter=120272, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:48:01, time_cost(all): 1 day, 3:49:48/1 day, 14:25:51, loss=0.458788092263441, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=4.9119233279316825, lr=0.2892393199174331
2023-12-06 14:45:45   INFO  epoch: 31/72, acc_iter=120322, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:40, time_cost(all): 1 day, 3:50:30/1 day, 13:39:47, loss=0.4587288948375, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=1.5267558112441562, lr=0.2891253528420653
2023-12-06 14:46:26   INFO  epoch: 31/72, acc_iter=120372, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:17, time_cost(all): 1 day, 3:51:11/1 day, 10:54:46, loss=0.458669697411559, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=1.4007237274168012, lr=0.2890113857666976
2023-12-06 14:47:08   INFO  epoch: 31/72, acc_iter=120422, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:31, time_cost(all): 1 day, 3:51:53/1 day, 13:30:51, loss=0.458610499985618, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=2.49086347363126, lr=0.2888974186913298
2023-12-06 14:47:50   INFO  epoch: 31/72, acc_iter=120472, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:38, time_cost(all): 1 day, 3:52:35/1 day, 12:27:21, loss=0.458551302559678, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.607340315963275, lr=0.288783451615962
2023-12-06 14:48:32   INFO  epoch: 31/72, acc_iter=120522, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:55, time_cost(all): 1 day, 3:53:17/1 day, 12:35:16, loss=0.458492105133737, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=4.673251175879705, lr=0.28866948454059416
2023-12-06 14:49:13   INFO  epoch: 31/72, acc_iter=120572, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:03, time_cost(all): 1 day, 3:53:58/1 day, 12:06:03, loss=0.458432907707796, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=4.777867800131089, lr=0.2885555174652264
2023-12-06 14:49:55   INFO  epoch: 31/72, acc_iter=120622, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:38, time_cost(all): 1 day, 3:54:40/1 day, 12:28:58, loss=0.458373710281855, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=4.837059113423661, lr=0.2884415503898586
2023-12-06 14:50:37   INFO  epoch: 31/72, acc_iter=120672, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:38, time_cost(all): 1 day, 3:55:22/1 day, 13:46:21, loss=0.458314512855914, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=0.9483018057326836, lr=0.2883275833144908
2023-12-06 14:51:19   INFO  epoch: 31/72, acc_iter=120722, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:32, time_cost(all): 1 day, 3:56:04/1 day, 13:49:51, loss=0.458255315429973, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=2.040659611339943, lr=0.288213616239123
2023-12-06 14:52:01   INFO  epoch: 31/72, acc_iter=120772, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:41:00, time_cost(all): 1 day, 3:56:46/1 day, 11:27:17, loss=0.458196118004032, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=3.947493614777631, lr=0.2880996491637552
2023-12-06 14:52:42   INFO  epoch: 31/72, acc_iter=120822, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:22, time_cost(all): 1 day, 3:57:27/1 day, 13:35:19, loss=0.458136920578091, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=1.0947070378151866, lr=0.2879856820883874
2023-12-06 14:53:24   INFO  epoch: 31/72, acc_iter=120872, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:01, time_cost(all): 1 day, 3:58:09/1 day, 12:46:07, loss=0.45807772315215, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=3.484831342026045, lr=0.2878717150130196
2023-12-06 14:54:06   INFO  epoch: 31/72, acc_iter=120922, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:31, time_cost(all): 1 day, 3:58:51/1 day, 13:34:43, loss=0.458018525726209, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=0.9037203807551619, lr=0.28775774793765185
2023-12-06 14:54:48   INFO  epoch: 31/72, acc_iter=120972, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:38, time_cost(all): 1 day, 3:59:33/1 day, 13:32:16, loss=0.457959328300268, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=1.9331296719396829, lr=0.28764378086228404
2023-12-06 14:55:30   INFO  epoch: 31/72, acc_iter=121022, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:55, time_cost(all): 1 day, 4:00:15/1 day, 11:48:45, loss=0.457900130874327, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=4.0257519152924495, lr=0.2875298137869162
2023-12-06 14:56:11   INFO  epoch: 31/72, acc_iter=121072, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:25, time_cost(all): 1 day, 4:00:56/1 day, 11:56:16, loss=0.457840933448386, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=3.7855651527851526, lr=0.2874158467115484
2023-12-06 14:56:53   INFO  epoch: 31/72, acc_iter=121122, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:45, time_cost(all): 1 day, 4:01:38/1 day, 13:51:07, loss=0.457781736022445, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=1.5274234492181737, lr=0.28730187963618065
2023-12-06 14:57:35   INFO  epoch: 31/72, acc_iter=121172, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:44, time_cost(all): 1 day, 4:02:20/1 day, 12:34:40, loss=0.457722538596504, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=1.825994826826907, lr=0.28718791256081283
2023-12-06 14:58:17   INFO  epoch: 31/72, acc_iter=121222, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:02, time_cost(all): 1 day, 4:03:02/1 day, 13:12:08, loss=0.457663341170563, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=2.275881560939151, lr=0.28707394548544507
2023-12-06 14:58:58   INFO  epoch: 31/72, acc_iter=121272, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:38, time_cost(all): 1 day, 4:03:43/1 day, 14:13:09, loss=0.457604143744622, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=2.5500693475042793, lr=0.28695997841007725
2023-12-06 14:59:40   INFO  epoch: 31/72, acc_iter=121322, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:30, time_cost(all): 1 day, 4:04:25/1 day, 11:34:27, loss=0.457544946318681, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=1.9942233820326827, lr=0.28684601133470944
2023-12-06 15:00:22   INFO  epoch: 31/72, acc_iter=121372, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:21, time_cost(all): 1 day, 4:05:07/1 day, 10:48:25, loss=0.457485748892741, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=3.766167074657914, lr=0.2867320442593416
2023-12-06 15:01:04   INFO  epoch: 31/72, acc_iter=121422, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:26, time_cost(all): 1 day, 4:05:49/1 day, 12:16:07, loss=0.4574265514668, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=0.8519664780695728, lr=0.2866180771839739
2023-12-06 15:01:46   INFO  epoch: 31/72, acc_iter=121472, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:39, time_cost(all): 1 day, 4:06:31/1 day, 13:52:26, loss=0.457367354040859, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=1.5954553351659568, lr=0.2865041101086061
2023-12-06 15:02:27   INFO  epoch: 31/72, acc_iter=121522, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:16, time_cost(all): 1 day, 4:07:12/1 day, 11:41:41, loss=0.457308156614918, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=3.9676876478739582, lr=0.2863901430332383
2023-12-06 15:03:09   INFO  epoch: 31/72, acc_iter=121572, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:38, time_cost(all): 1 day, 4:07:54/1 day, 11:18:06, loss=0.457248959188977, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=0.7352980477852419, lr=0.28627617595787047
2023-12-06 15:03:51   INFO  epoch: 31/72, acc_iter=121622, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:52, time_cost(all): 1 day, 4:08:36/1 day, 13:58:17, loss=0.457189761763036, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=1.3965767454506623, lr=0.2861622088825027
2023-12-06 15:04:33   INFO  epoch: 31/72, acc_iter=121672, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:23, time_cost(all): 1 day, 4:09:18/1 day, 10:36:34, loss=0.457130564337095, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=1.5635610170174545, lr=0.2860482418071349
2023-12-06 15:05:14   INFO  epoch: 31/72, acc_iter=121722, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:24:58, time_cost(all): 1 day, 4:09:59/1 day, 12:24:53, loss=0.457071366911154, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=2.43664828501987, lr=0.28593427473176714
2023-12-06 15:05:56   INFO  epoch: 31/72, acc_iter=121772, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:00, time_cost(all): 1 day, 4:10:41/1 day, 10:39:45, loss=0.457012169485213, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=0.5073632626069033, lr=0.2858203076563993
2023-12-06 15:06:38   INFO  epoch: 31/72, acc_iter=121822, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:45, time_cost(all): 1 day, 4:11:23/1 day, 12:05:21, loss=0.456952972059272, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=0.837189797636203, lr=0.2857063405810315
2023-12-06 15:07:20   INFO  epoch: 31/72, acc_iter=121872, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:59, time_cost(all): 1 day, 4:12:05/1 day, 13:12:16, loss=0.456893774633331, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=2.461759630179487, lr=0.2855923735056637
2023-12-06 15:08:02   INFO  epoch: 31/72, acc_iter=121922, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:59, time_cost(all): 1 day, 4:12:47/1 day, 11:23:09, loss=0.45683457720739, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=4.963944611476713, lr=0.28547840643029593
2023-12-06 15:08:43   INFO  epoch: 31/72, acc_iter=121972, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:00, time_cost(all): 1 day, 4:13:28/1 day, 13:15:34, loss=0.456775379781449, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=2.8921513918714554, lr=0.28536443935492817
2023-12-06 15:09:25   INFO  epoch: 31/72, acc_iter=122022, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:30, time_cost(all): 1 day, 4:14:10/1 day, 13:07:04, loss=0.456716182355508, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=4.646720159300244, lr=0.28525047227956035
2023-12-06 15:10:07   INFO  epoch: 31/72, acc_iter=122072, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:58, time_cost(all): 1 day, 4:14:52/1 day, 11:33:54, loss=0.456656984929567, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=3.362159751042042, lr=0.28513650520419254
2023-12-06 15:10:49   INFO  epoch: 31/72, acc_iter=122122, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:17, time_cost(all): 1 day, 4:15:34/1 day, 13:47:55, loss=0.456597787503626, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=1.3515192963697313, lr=0.2850225381288247
2023-12-06 15:11:30   INFO  epoch: 31/72, acc_iter=122172, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:57, time_cost(all): 1 day, 4:16:15/1 day, 13:42:26, loss=0.456538590077685, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=3.3485702113757667, lr=0.28490857105345696
2023-12-06 15:12:12   INFO  epoch: 31/72, acc_iter=122222, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:51, time_cost(all): 1 day, 4:16:57/1 day, 13:29:16, loss=0.456479392651745, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=0.5519357997665705, lr=0.28479460397808914
2023-12-06 15:12:54   INFO  epoch: 31/72, acc_iter=122272, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:34, time_cost(all): 1 day, 4:17:39/1 day, 13:01:42, loss=0.456420195225804, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=3.9797487366815285, lr=0.2846806369027214
2023-12-06 15:13:36   INFO  epoch: 31/72, acc_iter=122322, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:08, time_cost(all): 1 day, 4:18:21/1 day, 10:36:26, loss=0.456360997799863, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=3.2286198718828722, lr=0.28456666982735357
2023-12-06 15:14:18   INFO  epoch: 31/72, acc_iter=122372, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:04, time_cost(all): 1 day, 4:19:03/1 day, 11:14:11, loss=0.456301800373922, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=3.6757075942437742, lr=0.28445270275198575
2023-12-06 15:14:59   INFO  epoch: 31/72, acc_iter=122422, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:27, time_cost(all): 1 day, 4:19:44/1 day, 13:01:46, loss=0.456242602947981, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=2.564943330021132, lr=0.284338735676618
2023-12-06 15:15:41   INFO  epoch: 31/72, acc_iter=122472, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:29, time_cost(all): 1 day, 4:20:26/1 day, 12:29:35, loss=0.45618340552204, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=0.8673785080585483, lr=0.28422476860125023
2023-12-06 15:16:23   INFO  epoch: 31/72, acc_iter=122522, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:05, time_cost(all): 1 day, 4:21:08/1 day, 12:23:00, loss=0.456124208096099, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=3.428576130652065, lr=0.2841108015258824
2023-12-06 15:17:05   INFO  epoch: 31/72, acc_iter=122572, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:15, time_cost(all): 1 day, 4:21:50/1 day, 12:28:16, loss=0.456065010670158, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=4.105769338893783, lr=0.2839968344505146
2023-12-06 15:17:46   INFO  epoch: 31/72, acc_iter=122622, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:47, time_cost(all): 1 day, 4:22:31/1 day, 13:40:07, loss=0.456005813244217, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=3.8288324128828912, lr=0.2838828673751468
2023-12-06 15:18:28   INFO  epoch: 31/72, acc_iter=122672, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:50, time_cost(all): 1 day, 4:23:13/1 day, 12:53:11, loss=0.455946615818276, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.873389425672425, lr=0.28376890029977897
2023-12-06 15:19:10   INFO  epoch: 31/72, acc_iter=122722, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:01, time_cost(all): 1 day, 4:23:55/1 day, 12:15:57, loss=0.455887418392335, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=2.5688726806863507, lr=0.2836549332244112
2023-12-06 15:19:52   INFO  epoch: 31/72, acc_iter=122772, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:22, time_cost(all): 1 day, 4:24:37/1 day, 10:34:47, loss=0.455828220966394, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=0.5534712681546139, lr=0.28354096614904345
2023-12-06 15:20:34   INFO  epoch: 31/72, acc_iter=122822, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:29, time_cost(all): 1 day, 4:25:19/1 day, 11:49:11, loss=0.455769023540453, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.082604526881443, lr=0.28342699907367563
2023-12-06 15:21:15   INFO  epoch: 31/72, acc_iter=122872, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:40, time_cost(all): 1 day, 4:26:00/1 day, 13:09:02, loss=0.455709826114512, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=1.1246418165814982, lr=0.2833130319983078
2023-12-06 15:21:57   INFO  epoch: 31/72, acc_iter=122922, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:30, time_cost(all): 1 day, 4:26:42/1 day, 13:32:36, loss=0.455650628688571, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=2.7577151956004253, lr=0.28319906492294006
2023-12-06 15:22:39   INFO  epoch: 31/72, acc_iter=122972, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:24, time_cost(all): 1 day, 4:27:24/1 day, 13:34:44, loss=0.45559143126263, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=2.807956048092727, lr=0.28308509784757224
2023-12-06 15:23:21   INFO  epoch: 31/72, acc_iter=123022, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:35, time_cost(all): 1 day, 4:28:06/1 day, 12:25:56, loss=0.45553223383669, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=1.4702874964940411, lr=0.2829711307722045
2023-12-06 15:24:02   INFO  epoch: 31/72, acc_iter=123072, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:04, time_cost(all): 1 day, 4:28:47/1 day, 12:23:10, loss=0.455473036410749, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=4.127231405866359, lr=0.28285716369683667
2023-12-06 15:24:44   INFO  epoch: 31/72, acc_iter=123122, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:23, time_cost(all): 1 day, 4:29:29/1 day, 11:13:39, loss=0.455413838984808, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=1.8644529050399046, lr=0.28274319662146885
2023-12-06 15:25:26   INFO  epoch: 31/72, acc_iter=123172, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:42, time_cost(all): 1 day, 4:30:11/1 day, 13:13:26, loss=0.455354641558867, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.8319381619742696, lr=0.28262922954610104
2023-12-06 15:26:08   INFO  epoch: 31/72, acc_iter=123222, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:52, time_cost(all): 1 day, 4:30:53/1 day, 10:49:44, loss=0.455295444132926, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=3.897923562045281, lr=0.2825152624707333
2023-12-06 15:26:50   INFO  epoch: 31/72, acc_iter=123272, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:10, time_cost(all): 1 day, 4:31:35/1 day, 11:34:24, loss=0.455236246706985, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=2.738396145490117, lr=0.2824012953953655
2023-12-06 15:27:31   INFO  epoch: 31/72, acc_iter=123322, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:36, time_cost(all): 1 day, 4:32:16/1 day, 10:42:44, loss=0.455177049281044, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=2.5573074922887287, lr=0.2822873283199977
2023-12-06 15:28:13   INFO  epoch: 31/72, acc_iter=123372, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:51, time_cost(all): 1 day, 4:32:58/1 day, 12:24:39, loss=0.455117851855103, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=1.1565360669693, lr=0.2821733612446299
2023-12-06 15:28:55   INFO  epoch: 31/72, acc_iter=123422, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:09, time_cost(all): 1 day, 4:33:40/1 day, 11:25:09, loss=0.455058654429162, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=0.6277609067229754, lr=0.28205939416926207
2023-12-06 15:29:37   INFO  epoch: 31/72, acc_iter=123472, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:31, time_cost(all): 1 day, 4:34:22/1 day, 12:47:24, loss=0.454999457003221, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=0.6964769982370436, lr=0.2819454270938943
2023-12-06 15:30:19   INFO  epoch: 31/72, acc_iter=123522, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:49, time_cost(all): 1 day, 4:35:04/1 day, 13:23:15, loss=0.45494025957728, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=2.7762800021240963, lr=0.2818314600185265
2023-12-06 15:31:00   INFO  epoch: 31/72, acc_iter=123572, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 4:35:45/1 day, 11:12:23, loss=0.454881062151339, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=1.3266947931947357, lr=0.28171749294315873
2023-12-06 15:31:42   INFO  epoch: 32/72, acc_iter=123634, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:03, time_cost(all): 1 day, 4:36:27/1 day, 12:17:50, loss=0.454807657343172, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=2.9609189429186973, lr=0.2815761737697027
2023-12-06 15:32:24   INFO  epoch: 32/72, acc_iter=123684, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:18, time_cost(all): 1 day, 4:37:09/1 day, 11:02:58, loss=0.454748459917231, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=4.0341303452204045, lr=0.28146220669433486
2023-12-06 15:33:06   INFO  epoch: 32/72, acc_iter=123734, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:10, time_cost(all): 1 day, 4:37:51/1 day, 12:12:15, loss=0.45468926249129, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=2.7133422049495897, lr=0.28134823961896704
2023-12-06 15:33:47   INFO  epoch: 32/72, acc_iter=123784, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:41, time_cost(all): 1 day, 4:38:32/1 day, 13:37:24, loss=0.45463006506535, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.7385732636476523, lr=0.28123427254359923
2023-12-06 15:34:29   INFO  epoch: 32/72, acc_iter=123834, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:23, time_cost(all): 1 day, 4:39:14/1 day, 10:27:59, loss=0.454570867639409, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.2729298922621535, lr=0.28112030546823147
2023-12-06 15:35:11   INFO  epoch: 32/72, acc_iter=123884, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:53, time_cost(all): 1 day, 4:39:56/1 day, 13:36:47, loss=0.454511670213468, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=3.7818225373208705, lr=0.2810063383928637
2023-12-06 15:35:53   INFO  epoch: 32/72, acc_iter=123934, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:02, time_cost(all): 1 day, 4:40:38/1 day, 10:37:31, loss=0.454452472787527, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=0.6795849983006118, lr=0.2808923713174959
2023-12-06 15:36:35   INFO  epoch: 32/72, acc_iter=123984, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:38, time_cost(all): 1 day, 4:41:20/1 day, 12:01:53, loss=0.454393275361586, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=3.3509932978574044, lr=0.2807784042421281
2023-12-06 15:37:16   INFO  epoch: 32/72, acc_iter=124034, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:08, time_cost(all): 1 day, 4:42:01/1 day, 11:25:38, loss=0.454334077935645, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=2.9111668558859707, lr=0.28066443716676026
2023-12-06 15:37:58   INFO  epoch: 32/72, acc_iter=124084, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:25, time_cost(all): 1 day, 4:42:43/1 day, 12:50:58, loss=0.454274880509704, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.677559559097165, lr=0.2805504700913925
2023-12-06 15:38:40   INFO  epoch: 32/72, acc_iter=124134, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:21, time_cost(all): 1 day, 4:43:25/1 day, 11:32:25, loss=0.454215683083763, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=4.341858939293839, lr=0.2804365030160247
2023-12-06 15:39:22   INFO  epoch: 32/72, acc_iter=124184, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:42, time_cost(all): 1 day, 4:44:07/1 day, 10:28:36, loss=0.454156485657822, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=1.2223912018052827, lr=0.2803225359406569
2023-12-06 15:40:03   INFO  epoch: 32/72, acc_iter=124234, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:12, time_cost(all): 1 day, 4:44:48/1 day, 10:52:03, loss=0.454097288231881, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=1.5143369600687373, lr=0.2802085688652891
2023-12-06 15:40:45   INFO  epoch: 32/72, acc_iter=124284, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:36, time_cost(all): 1 day, 4:45:30/1 day, 12:35:49, loss=0.45403809080594, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=1.5934549714347739, lr=0.2800946017899213
2023-12-06 15:41:27   INFO  epoch: 32/72, acc_iter=124334, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:33, time_cost(all): 1 day, 4:46:12/1 day, 11:40:55, loss=0.453978893379999, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=4.1938161200448025, lr=0.2799806347145535
2023-12-06 15:42:09   INFO  epoch: 32/72, acc_iter=124384, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:03, time_cost(all): 1 day, 4:46:54/1 day, 13:09:47, loss=0.453919695954058, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=3.426823987015927, lr=0.2798666676391858
2023-12-06 15:42:51   INFO  epoch: 32/72, acc_iter=124434, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:11, time_cost(all): 1 day, 4:47:36/1 day, 11:10:47, loss=0.453860498528117, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=4.374656399124185, lr=0.27975270056381796
2023-12-06 15:43:32   INFO  epoch: 32/72, acc_iter=124484, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:30, time_cost(all): 1 day, 4:48:17/1 day, 12:06:33, loss=0.453801301102176, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=3.539121137079571, lr=0.27963873348845014
2023-12-06 15:44:14   INFO  epoch: 32/72, acc_iter=124534, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:01, time_cost(all): 1 day, 4:48:59/1 day, 13:23:13, loss=0.453742103676235, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=1.237506242022222, lr=0.2795247664130823
2023-12-06 15:44:56   INFO  epoch: 32/72, acc_iter=124584, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:17, time_cost(all): 1 day, 4:49:41/1 day, 13:22:13, loss=0.453682906250295, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=2.6458670501728543, lr=0.27941079933771457
2023-12-06 15:45:38   INFO  epoch: 32/72, acc_iter=124634, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:40, time_cost(all): 1 day, 4:50:23/1 day, 13:05:44, loss=0.453623708824354, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=4.070333659649453, lr=0.27929683226234675
2023-12-06 15:46:19   INFO  epoch: 32/72, acc_iter=124684, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:59, time_cost(all): 1 day, 4:51:04/1 day, 13:02:11, loss=0.453564511398413, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=2.565206750636795, lr=0.279182865186979
2023-12-06 15:47:01   INFO  epoch: 32/72, acc_iter=124734, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:41, time_cost(all): 1 day, 4:51:46/1 day, 11:31:09, loss=0.453505313972472, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=1.8613108282426478, lr=0.2790688981116112
2023-12-06 15:47:43   INFO  epoch: 32/72, acc_iter=124784, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:48, time_cost(all): 1 day, 4:52:28/1 day, 11:14:16, loss=0.453446116546531, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=1.269777063748627, lr=0.27895493103624336
2023-12-06 15:48:25   INFO  epoch: 32/72, acc_iter=124834, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:01, time_cost(all): 1 day, 4:53:10/1 day, 10:13:00, loss=0.45338691912059, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=3.8331502475665413, lr=0.27884096396087554
2023-12-06 15:49:07   INFO  epoch: 32/72, acc_iter=124884, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:43, time_cost(all): 1 day, 4:53:52/1 day, 13:16:09, loss=0.453327721694649, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=2.201197702791644, lr=0.2787269968855078
2023-12-06 15:49:48   INFO  epoch: 32/72, acc_iter=124934, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:31, time_cost(all): 1 day, 4:54:33/1 day, 12:43:06, loss=0.453268524268708, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=4.377570044683299, lr=0.27861302981014
2023-12-06 15:50:30   INFO  epoch: 32/72, acc_iter=124984, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:42, time_cost(all): 1 day, 4:55:15/1 day, 11:15:40, loss=0.453209326842767, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=1.6998594397734625, lr=0.2784990627347722
2023-12-06 15:51:12   INFO  epoch: 32/72, acc_iter=125034, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:07, time_cost(all): 1 day, 4:55:57/1 day, 11:10:23, loss=0.453150129416826, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=1.7216949201901524, lr=0.2783850956594044
2023-12-06 15:51:54   INFO  epoch: 32/72, acc_iter=125084, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:52, time_cost(all): 1 day, 4:56:39/1 day, 12:51:31, loss=0.453090931990885, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=2.5617866284599837, lr=0.2782711285840366
2023-12-06 15:52:35   INFO  epoch: 32/72, acc_iter=125134, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:25, time_cost(all): 1 day, 4:57:20/1 day, 11:44:29, loss=0.453031734564944, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=4.991010716392897, lr=0.2781571615086688
2023-12-06 15:53:17   INFO  epoch: 32/72, acc_iter=125184, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:59, time_cost(all): 1 day, 4:58:02/1 day, 13:18:50, loss=0.452972537139003, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=1.0690859022834087, lr=0.278043194433301
2023-12-06 15:53:59   INFO  epoch: 32/72, acc_iter=125234, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:27, time_cost(all): 1 day, 4:58:44/1 day, 10:22:51, loss=0.452913339713062, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=2.3518434486513096, lr=0.27792922735793324
2023-12-06 15:54:41   INFO  epoch: 32/72, acc_iter=125284, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:28, time_cost(all): 1 day, 4:59:26/1 day, 9:51:00, loss=0.452854142287121, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=4.017510741798972, lr=0.2778152602825654
2023-12-06 15:55:23   INFO  epoch: 32/72, acc_iter=125334, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:39, time_cost(all): 1 day, 5:00:08/1 day, 10:17:30, loss=0.45279494486118, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=3.3297755800417064, lr=0.2777012932071976
2023-12-06 15:56:04   INFO  epoch: 32/72, acc_iter=125384, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:12, time_cost(all): 1 day, 5:00:49/1 day, 11:36:34, loss=0.452735747435239, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=3.36646975215944, lr=0.27758732613182985
2023-12-06 15:56:46   INFO  epoch: 32/72, acc_iter=125434, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:44, time_cost(all): 1 day, 5:01:31/1 day, 10:16:41, loss=0.452676550009299, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=4.778589491969222, lr=0.2774733590564621
2023-12-06 15:57:28   INFO  epoch: 32/72, acc_iter=125484, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:45, time_cost(all): 1 day, 5:02:13/1 day, 9:58:09, loss=0.452617352583358, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=2.4178190339362526, lr=0.27735939198109427
2023-12-06 15:58:10   INFO  epoch: 32/72, acc_iter=125534, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:27, time_cost(all): 1 day, 5:02:55/1 day, 10:07:33, loss=0.452558155157417, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=0.6086026477589785, lr=0.27724542490572646
2023-12-06 15:58:51   INFO  epoch: 32/72, acc_iter=125584, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:13, time_cost(all): 1 day, 5:03:36/1 day, 10:20:04, loss=0.452498957731476, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=4.535932875076869, lr=0.27713145783035864
2023-12-06 15:59:33   INFO  epoch: 32/72, acc_iter=125634, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:53, time_cost(all): 1 day, 5:04:18/1 day, 13:09:08, loss=0.452439760305535, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=3.5925509752606617, lr=0.2770174907549908
2023-12-06 16:00:15   INFO  epoch: 32/72, acc_iter=125684, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:05, time_cost(all): 1 day, 5:05:00/1 day, 12:55:11, loss=0.452380562879594, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=3.572752065084857, lr=0.27690352367962306
2023-12-06 16:00:57   INFO  epoch: 32/72, acc_iter=125734, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:09, time_cost(all): 1 day, 5:05:42/1 day, 12:21:55, loss=0.452321365453653, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=4.723658431554392, lr=0.2767895566042553
2023-12-06 16:01:39   INFO  epoch: 32/72, acc_iter=125784, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:56, time_cost(all): 1 day, 5:06:24/1 day, 13:07:57, loss=0.452262168027712, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=4.649327275167338, lr=0.2766755895288875
2023-12-06 16:02:20   INFO  epoch: 32/72, acc_iter=125834, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:32, time_cost(all): 1 day, 5:07:05/1 day, 10:56:04, loss=0.452202970601771, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=2.760686535613273, lr=0.2765616224535197
2023-12-06 16:03:02   INFO  epoch: 32/72, acc_iter=125884, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:56, time_cost(all): 1 day, 5:07:47/1 day, 11:39:37, loss=0.45214377317583, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=3.105657142265106, lr=0.27644765537815186
2023-12-06 16:03:44   INFO  epoch: 32/72, acc_iter=125934, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:50, time_cost(all): 1 day, 5:08:29/1 day, 10:18:28, loss=0.452084575749889, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=4.923900362130242, lr=0.2763336883027841
2023-12-06 16:04:26   INFO  epoch: 32/72, acc_iter=125984, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:38, time_cost(all): 1 day, 5:09:11/1 day, 12:25:22, loss=0.452025378323948, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=4.282075450288575, lr=0.27621972122741634
2023-12-06 16:05:08   INFO  epoch: 32/72, acc_iter=126034, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:09, time_cost(all): 1 day, 5:09:53/1 day, 10:55:16, loss=0.451966180898007, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.131348119196237, lr=0.2761057541520485
2023-12-06 16:05:49   INFO  epoch: 32/72, acc_iter=126084, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:59, time_cost(all): 1 day, 5:10:34/1 day, 11:48:49, loss=0.451906983472066, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=2.9011646690190855, lr=0.2759917870766807
2023-12-06 16:06:31   INFO  epoch: 32/72, acc_iter=126134, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:28, time_cost(all): 1 day, 5:11:16/1 day, 10:07:11, loss=0.451847786046125, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=1.5376545235621593, lr=0.2758778200013129
2023-12-06 16:07:13   INFO  epoch: 32/72, acc_iter=126184, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:30, time_cost(all): 1 day, 5:11:58/1 day, 11:14:59, loss=0.451788588620184, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=1.4877798137099774, lr=0.27576385292594513
2023-12-06 16:07:55   INFO  epoch: 32/72, acc_iter=126234, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:08, time_cost(all): 1 day, 5:12:40/1 day, 9:54:26, loss=0.451729391194243, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=4.11896814059916, lr=0.2756498858505773
2023-12-06 16:08:36   INFO  epoch: 32/72, acc_iter=126284, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:01, time_cost(all): 1 day, 5:13:21/1 day, 11:48:14, loss=0.451670193768303, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=3.258604509628172, lr=0.27553591877520955
2023-12-06 16:09:18   INFO  epoch: 32/72, acc_iter=126334, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:41, time_cost(all): 1 day, 5:14:03/1 day, 12:55:39, loss=0.451610996342362, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=1.1631749944155243, lr=0.27542195169984174
2023-12-06 16:10:00   INFO  epoch: 32/72, acc_iter=126384, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:13, time_cost(all): 1 day, 5:14:45/1 day, 10:03:38, loss=0.451551798916421, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=3.4430540427204304, lr=0.2753079846244739
2023-12-06 16:10:42   INFO  epoch: 32/72, acc_iter=126434, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:13, time_cost(all): 1 day, 5:15:27/1 day, 12:32:06, loss=0.45149260149048, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=2.025625090847779, lr=0.27519401754910616
2023-12-06 16:11:24   INFO  epoch: 32/72, acc_iter=126484, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:48, time_cost(all): 1 day, 5:16:09/1 day, 10:01:16, loss=0.451433404064539, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=3.7078348857082606, lr=0.27508005047373835
2023-12-06 16:12:05   INFO  epoch: 32/72, acc_iter=126534, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:42, time_cost(all): 1 day, 5:16:50/1 day, 12:54:46, loss=0.451374206638598, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=0.7153188561619779, lr=0.2749660833983706
2023-12-06 16:12:47   INFO  epoch: 32/72, acc_iter=126584, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:18, time_cost(all): 1 day, 5:17:32/1 day, 12:30:19, loss=0.451315009212657, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=3.7949520455092323, lr=0.27485211632300277
2023-12-06 16:13:29   INFO  epoch: 32/72, acc_iter=126634, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:12, time_cost(all): 1 day, 5:18:14/1 day, 9:51:56, loss=0.451255811786716, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=2.2148068155973073, lr=0.27473814924763496
2023-12-06 16:14:11   INFO  epoch: 32/72, acc_iter=126684, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:13, time_cost(all): 1 day, 5:18:56/1 day, 12:46:51, loss=0.451196614360775, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=0.9217320127072683, lr=0.2746241821722672
2023-12-06 16:14:52   INFO  epoch: 32/72, acc_iter=126734, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:20, time_cost(all): 1 day, 5:19:37/1 day, 9:33:34, loss=0.451137416934834, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=3.877956285386126, lr=0.2745102150968994
2023-12-06 16:15:34   INFO  epoch: 32/72, acc_iter=126784, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:31, time_cost(all): 1 day, 5:20:19/1 day, 11:47:49, loss=0.451078219508893, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=1.5847429980012842, lr=0.2743962480215316
2023-12-06 16:16:16   INFO  epoch: 32/72, acc_iter=126834, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:09, time_cost(all): 1 day, 5:21:01/1 day, 9:57:26, loss=0.451019022082952, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=2.520456635300184, lr=0.2742822809461638
2023-12-06 16:16:58   INFO  epoch: 32/72, acc_iter=126884, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:11, time_cost(all): 1 day, 5:21:43/1 day, 9:26:55, loss=0.450959824657011, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=4.639523290231183, lr=0.274168313870796
2023-12-06 16:17:40   INFO  epoch: 32/72, acc_iter=126934, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:12, time_cost(all): 1 day, 5:22:25/1 day, 9:49:57, loss=0.45090062723107, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=3.1732666975534563, lr=0.2740543467954282
2023-12-06 16:18:21   INFO  epoch: 32/72, acc_iter=126984, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:39, time_cost(all): 1 day, 5:23:06/1 day, 10:43:46, loss=0.450841429805129, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=3.522291118465693, lr=0.27394037972006036
2023-12-06 16:19:03   INFO  epoch: 32/72, acc_iter=127034, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:41, time_cost(all): 1 day, 5:23:48/1 day, 12:11:22, loss=0.450782232379188, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=1.2551183321986554, lr=0.27382641264469265
2023-12-06 16:19:45   INFO  epoch: 32/72, acc_iter=127084, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:10, time_cost(all): 1 day, 5:24:30/1 day, 11:16:30, loss=0.450723034953247, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=0.7978510928159306, lr=0.27371244556932484
2023-12-06 16:20:27   INFO  epoch: 32/72, acc_iter=127134, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:19, time_cost(all): 1 day, 5:25:12/1 day, 12:37:06, loss=0.450663837527307, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=1.6130261462276008, lr=0.273598478493957
2023-12-06 16:21:08   INFO  epoch: 32/72, acc_iter=127184, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:48, time_cost(all): 1 day, 5:25:53/1 day, 12:41:20, loss=0.450604640101366, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=2.8956655309335333, lr=0.2734845114185892
2023-12-06 16:21:50   INFO  epoch: 32/72, acc_iter=127234, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:56, time_cost(all): 1 day, 5:26:35/1 day, 9:38:51, loss=0.450545442675425, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=4.214331340139466, lr=0.27337054434322144
2023-12-06 16:22:32   INFO  epoch: 32/72, acc_iter=127284, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:19, time_cost(all): 1 day, 5:27:17/1 day, 12:48:50, loss=0.450486245249484, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=0.8861187894964606, lr=0.27325657726785363
2023-12-06 16:23:14   INFO  epoch: 32/72, acc_iter=127334, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 1 day, 5:27:59/1 day, 11:02:21, loss=0.450427047823543, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=2.9067872206553647, lr=0.27314261019248587
2023-12-06 16:23:56   INFO  epoch: 32/72, acc_iter=127384, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 1 day, 5:28:41/1 day, 11:52:10, loss=0.450367850397602, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=0.8122629391684884, lr=0.27302864311711805
2023-12-06 16:24:37   INFO  epoch: 32/72, acc_iter=127434, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 5:29:22/1 day, 11:19:52, loss=0.450308652971661, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=1.3722777621447428, lr=0.27291467604175024
2023-12-06 16:25:19   INFO  epoch: 33/72, acc_iter=127496, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:41, time_cost(all): 1 day, 5:30:04/1 day, 10:13:41, loss=0.450235248163494, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=4.294759699979305, lr=0.2727733568682942
2023-12-06 16:26:01   INFO  epoch: 33/72, acc_iter=127546, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:24, time_cost(all): 1 day, 5:30:46/1 day, 10:13:37, loss=0.450176050737553, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=3.959579716570146, lr=0.27265938979292637
2023-12-06 16:26:43   INFO  epoch: 33/72, acc_iter=127596, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:52, time_cost(all): 1 day, 5:31:28/1 day, 10:11:01, loss=0.450116853311612, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=3.7885662540771294, lr=0.2725454227175586
2023-12-06 16:27:24   INFO  epoch: 33/72, acc_iter=127646, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:04, time_cost(all): 1 day, 5:32:09/1 day, 9:32:06, loss=0.450057655885671, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=1.272138285490568, lr=0.27243145564219085
2023-12-06 16:28:06   INFO  epoch: 33/72, acc_iter=127696, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:33, time_cost(all): 1 day, 5:32:51/1 day, 11:09:19, loss=0.44999845845973, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=0.7919575616344932, lr=0.27231748856682303
2023-12-06 16:28:48   INFO  epoch: 33/72, acc_iter=127746, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:52:02, time_cost(all): 1 day, 5:33:33/1 day, 10:21:34, loss=0.449939261033789, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=1.2199140593817748, lr=0.2722035214914552
2023-12-06 16:29:30   INFO  epoch: 33/72, acc_iter=127796, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:41, time_cost(all): 1 day, 5:34:15/1 day, 9:25:35, loss=0.449880063607848, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=3.070018435195563, lr=0.27208955441608745
2023-12-06 16:30:12   INFO  epoch: 33/72, acc_iter=127846, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:19, time_cost(all): 1 day, 5:34:57/1 day, 10:04:40, loss=0.449820866181908, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.5535508345371039, lr=0.2719755873407196
2023-12-06 16:30:53   INFO  epoch: 33/72, acc_iter=127896, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:41, time_cost(all): 1 day, 5:35:38/1 day, 10:57:06, loss=0.449761668755967, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=4.225593554421575, lr=0.2718616202653519
2023-12-06 16:31:35   INFO  epoch: 33/72, acc_iter=127946, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:56, time_cost(all): 1 day, 5:36:20/1 day, 9:47:43, loss=0.449702471330026, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.01(1.03), norm=1.104217279404714, lr=0.27174765318998406
2023-12-06 16:32:17   INFO  epoch: 33/72, acc_iter=127996, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:31, time_cost(all): 1 day, 5:37:02/1 day, 12:37:50, loss=0.449643273904085, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.07(1.03), norm=4.38260675726925, lr=0.27163368611461625
2023-12-06 16:32:59   INFO  epoch: 33/72, acc_iter=128046, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:59, time_cost(all): 1 day, 5:37:44/1 day, 11:44:08, loss=0.449584076478144, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.0646467642760413, lr=0.27151971903924843
2023-12-06 16:33:40   INFO  epoch: 33/72, acc_iter=128096, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:34, time_cost(all): 1 day, 5:38:25/1 day, 12:07:26, loss=0.449524879052203, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=2.000910875925764, lr=0.2714057519638806
2023-12-06 16:34:22   INFO  epoch: 33/72, acc_iter=128146, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:38, time_cost(all): 1 day, 5:39:07/1 day, 9:55:16, loss=0.449465681626262, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=3.056663342381922, lr=0.27129178488851285
2023-12-06 16:35:04   INFO  epoch: 33/72, acc_iter=128196, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:19, time_cost(all): 1 day, 5:39:49/1 day, 11:56:42, loss=0.449406484200321, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=4.986445426107122, lr=0.2711778178131451
2023-12-06 16:35:46   INFO  epoch: 33/72, acc_iter=128246, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:22, time_cost(all): 1 day, 5:40:31/1 day, 11:08:50, loss=0.44934728677438, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=2.444364177183536, lr=0.2710638507377773
2023-12-06 16:36:28   INFO  epoch: 33/72, acc_iter=128296, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:16, time_cost(all): 1 day, 5:41:13/1 day, 9:40:20, loss=0.449288089348439, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=4.330125887949974, lr=0.27094988366240946
2023-12-06 16:37:09   INFO  epoch: 33/72, acc_iter=128346, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:56, time_cost(all): 1 day, 5:41:54/1 day, 11:59:03, loss=0.449228891922498, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=3.69913379605777, lr=0.2708359165870417
2023-12-06 16:37:51   INFO  epoch: 33/72, acc_iter=128396, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:53, time_cost(all): 1 day, 5:42:36/1 day, 11:44:51, loss=0.449169694496557, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=3.952398455627294, lr=0.2707219495116739
2023-12-06 16:38:33   INFO  epoch: 33/72, acc_iter=128446, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:42, time_cost(all): 1 day, 5:43:18/1 day, 10:01:55, loss=0.449110497070616, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=1.3022935601197216, lr=0.2706079824363061
2023-12-06 16:39:15   INFO  epoch: 33/72, acc_iter=128496, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:41:07, time_cost(all): 1 day, 5:44:00/1 day, 11:53:54, loss=0.449051299644675, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=4.280066282946086, lr=0.2704940153609383
2023-12-06 16:39:57   INFO  epoch: 33/72, acc_iter=128546, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:16, time_cost(all): 1 day, 5:44:42/1 day, 9:08:18, loss=0.448992102218734, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.006787089565286, lr=0.2703800482855705
2023-12-06 16:40:38   INFO  epoch: 33/72, acc_iter=128596, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:22, time_cost(all): 1 day, 5:45:23/1 day, 9:47:28, loss=0.448932904792793, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=1.6052787191947289, lr=0.2702660812102027
2023-12-06 16:41:20   INFO  epoch: 33/72, acc_iter=128646, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:08, time_cost(all): 1 day, 5:46:05/1 day, 10:42:49, loss=0.448873707366852, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=1.0402805112126776, lr=0.270152114134835
2023-12-06 16:42:02   INFO  epoch: 33/72, acc_iter=128696, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:02, time_cost(all): 1 day, 5:46:47/1 day, 12:07:28, loss=0.448814509940912, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=1.100763221980164, lr=0.2700381470594671
2023-12-06 16:42:44   INFO  epoch: 33/72, acc_iter=128746, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:37:08, time_cost(all): 1 day, 5:47:29/1 day, 10:32:39, loss=0.448755312514971, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=4.267655322401917, lr=0.26992417998409934
2023-12-06 16:43:25   INFO  epoch: 33/72, acc_iter=128796, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:33, time_cost(all): 1 day, 5:48:10/1 day, 10:39:42, loss=0.44869611508903, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=1.8936258910459023, lr=0.26981021290873153
2023-12-06 16:44:07   INFO  epoch: 33/72, acc_iter=128846, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:56, time_cost(all): 1 day, 5:48:52/1 day, 9:33:20, loss=0.448636917663089, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=0.716369247671625, lr=0.2696962458333637
2023-12-06 16:44:49   INFO  epoch: 33/72, acc_iter=128896, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:41, time_cost(all): 1 day, 5:49:34/1 day, 9:30:52, loss=0.448577720237148, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=2.1362141824274703, lr=0.26958227875799595
2023-12-06 16:45:31   INFO  epoch: 33/72, acc_iter=128946, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:05, time_cost(all): 1 day, 5:50:16/1 day, 12:24:27, loss=0.448518522811207, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=4.714491276363864, lr=0.26946831168262814
2023-12-06 16:46:13   INFO  epoch: 33/72, acc_iter=128996, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:49, time_cost(all): 1 day, 5:50:58/1 day, 9:01:03, loss=0.448459325385266, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=2.952337887250932, lr=0.2693543446072604
2023-12-06 16:46:54   INFO  epoch: 33/72, acc_iter=129046, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:06, time_cost(all): 1 day, 5:51:39/1 day, 11:53:44, loss=0.448400127959325, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=3.647842839949089, lr=0.26924037753189256
2023-12-06 16:47:36   INFO  epoch: 33/72, acc_iter=129096, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:42, time_cost(all): 1 day, 5:52:21/1 day, 10:54:32, loss=0.448340930533384, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=4.6829658756128, lr=0.26912641045652475
2023-12-06 16:48:18   INFO  epoch: 33/72, acc_iter=129146, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:35, time_cost(all): 1 day, 5:53:03/1 day, 11:01:48, loss=0.448281733107443, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=1.9400972489799286, lr=0.269012443381157
2023-12-06 16:49:00   INFO  epoch: 33/72, acc_iter=129196, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:59, time_cost(all): 1 day, 5:53:45/1 day, 10:07:05, loss=0.448222535681502, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=3.9166155722999156, lr=0.2688984763057892
2023-12-06 16:49:41   INFO  epoch: 33/72, acc_iter=129246, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:30, time_cost(all): 1 day, 5:54:26/1 day, 11:13:17, loss=0.448163338255561, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=4.612311470226711, lr=0.2687845092304214
2023-12-06 16:50:23   INFO  epoch: 33/72, acc_iter=129296, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:34, time_cost(all): 1 day, 5:55:08/1 day, 9:04:59, loss=0.44810414082962, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=4.995959521016314, lr=0.2686705421550536
2023-12-06 16:51:05   INFO  epoch: 33/72, acc_iter=129346, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:17, time_cost(all): 1 day, 5:55:50/1 day, 9:43:49, loss=0.448044943403679, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.83(1.03), norm=1.7817301916041777, lr=0.2685565750796858
2023-12-06 16:51:47   INFO  epoch: 33/72, acc_iter=129396, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:58, time_cost(all): 1 day, 5:56:32/1 day, 9:23:00, loss=0.447985745977738, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=2.4279847070050486, lr=0.26844260800431796
2023-12-06 16:52:29   INFO  epoch: 33/72, acc_iter=129446, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:06, time_cost(all): 1 day, 5:57:14/1 day, 10:55:14, loss=0.447926548551797, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=4.281081945687038, lr=0.2683286409289502
2023-12-06 16:53:10   INFO  epoch: 33/72, acc_iter=129496, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:30, time_cost(all): 1 day, 5:57:55/1 day, 8:58:55, loss=0.447867351125856, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=4.966767918210332, lr=0.26821467385358244
2023-12-06 16:53:52   INFO  epoch: 33/72, acc_iter=129546, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:31, time_cost(all): 1 day, 5:58:37/1 day, 11:50:33, loss=0.447808153699916, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=3.697095653313215, lr=0.2681007067782146
2023-12-06 16:54:34   INFO  epoch: 33/72, acc_iter=129596, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:56, time_cost(all): 1 day, 5:59:19/1 day, 10:01:04, loss=0.447748956273975, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=0.8975204355807148, lr=0.2679867397028468
2023-12-06 16:55:16   INFO  epoch: 33/72, acc_iter=129646, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:53, time_cost(all): 1 day, 6:00:01/1 day, 9:48:55, loss=0.447689758848034, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=4.450814074458693, lr=0.267872772627479
2023-12-06 16:55:57   INFO  epoch: 33/72, acc_iter=129696, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:23, time_cost(all): 1 day, 6:00:42/1 day, 8:53:35, loss=0.447630561422093, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.615305407242171, lr=0.26775880555211123
2023-12-06 16:56:39   INFO  epoch: 33/72, acc_iter=129746, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:22, time_cost(all): 1 day, 6:01:24/1 day, 10:22:29, loss=0.447571363996152, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=2.6054120547962225, lr=0.2676448384767435
2023-12-06 16:57:21   INFO  epoch: 33/72, acc_iter=129796, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:07, time_cost(all): 1 day, 6:02:06/1 day, 11:40:11, loss=0.447512166570211, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=2.2080884606275863, lr=0.26753087140137566
2023-12-06 16:58:03   INFO  epoch: 33/72, acc_iter=129846, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:35, time_cost(all): 1 day, 6:02:48/1 day, 10:58:54, loss=0.44745296914427, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=2.3638662262626093, lr=0.26741690432600784
2023-12-06 16:58:45   INFO  epoch: 33/72, acc_iter=129896, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:50, time_cost(all): 1 day, 6:03:30/1 day, 11:07:01, loss=0.447393771718329, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=4.022142318384601, lr=0.2673029372506401
2023-12-06 16:59:26   INFO  epoch: 33/72, acc_iter=129946, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:17, time_cost(all): 1 day, 6:04:11/1 day, 11:19:46, loss=0.447334574292388, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=1.890000218325619, lr=0.2671889701752722
2023-12-06 17:00:08   INFO  epoch: 33/72, acc_iter=129996, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:28, time_cost(all): 1 day, 6:04:53/1 day, 10:31:36, loss=0.447275376866447, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=2.4549869696847972, lr=0.2670750030999045
2023-12-06 17:00:50   INFO  epoch: 33/72, acc_iter=130046, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:19, time_cost(all): 1 day, 6:05:35/1 day, 10:52:13, loss=0.447216179440506, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=3.4753777391685894, lr=0.2669610360245367
2023-12-06 17:01:32   INFO  epoch: 33/72, acc_iter=130096, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:46, time_cost(all): 1 day, 6:06:17/1 day, 11:48:49, loss=0.447156982014565, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=1.4739789571817241, lr=0.2668470689491689
2023-12-06 17:02:13   INFO  epoch: 33/72, acc_iter=130146, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:10, time_cost(all): 1 day, 6:06:58/1 day, 10:16:13, loss=0.447097784588624, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=2.4725409865527532, lr=0.26673310187380106
2023-12-06 17:02:55   INFO  epoch: 33/72, acc_iter=130196, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:54, time_cost(all): 1 day, 6:07:40/1 day, 11:41:40, loss=0.447038587162683, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=3.8708308502785127, lr=0.2666191347984333
2023-12-06 17:03:37   INFO  epoch: 33/72, acc_iter=130246, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:23, time_cost(all): 1 day, 6:08:22/1 day, 10:14:19, loss=0.446979389736742, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=1.8445934646634279, lr=0.2665051677230655
2023-12-06 17:04:19   INFO  epoch: 33/72, acc_iter=130296, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:20, time_cost(all): 1 day, 6:09:04/1 day, 9:40:21, loss=0.446920192310801, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=1.4246631565180254, lr=0.2663912006476977
2023-12-06 17:05:01   INFO  epoch: 33/72, acc_iter=130346, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:27, time_cost(all): 1 day, 6:09:46/1 day, 10:14:53, loss=0.44686099488486, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=2.318315419304427, lr=0.2662772335723299
2023-12-06 17:05:42   INFO  epoch: 33/72, acc_iter=130396, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:52, time_cost(all): 1 day, 6:10:27/1 day, 9:02:25, loss=0.44680179745892, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=3.2657156238171265, lr=0.2661632664969621
2023-12-06 17:06:24   INFO  epoch: 33/72, acc_iter=130446, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:31, time_cost(all): 1 day, 6:11:09/1 day, 11:01:18, loss=0.446742600032979, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=3.320379010369227, lr=0.26604929942159433
2023-12-06 17:07:06   INFO  epoch: 33/72, acc_iter=130496, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:39, time_cost(all): 1 day, 6:11:51/1 day, 9:57:17, loss=0.446683402607038, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=4.530890325896513, lr=0.26593533234622646
2023-12-06 17:07:48   INFO  epoch: 33/72, acc_iter=130546, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:38, time_cost(all): 1 day, 6:12:33/1 day, 11:19:20, loss=0.446624205181097, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=3.1914942575918057, lr=0.26582136527085876
2023-12-06 17:08:29   INFO  epoch: 33/72, acc_iter=130596, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:47, time_cost(all): 1 day, 6:13:14/1 day, 9:33:32, loss=0.446565007755156, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=4.654265309817157, lr=0.26570739819549094
2023-12-06 17:09:11   INFO  epoch: 33/72, acc_iter=130646, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:54, time_cost(all): 1 day, 6:13:56/1 day, 10:23:56, loss=0.446505810329215, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=4.6198122986348285, lr=0.2655934311201231
2023-12-06 17:09:53   INFO  epoch: 33/72, acc_iter=130696, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:10, time_cost(all): 1 day, 6:14:38/1 day, 11:02:17, loss=0.446446612903274, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=4.028862164168691, lr=0.2654794640447553
2023-12-06 17:10:35   INFO  epoch: 33/72, acc_iter=130746, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:29, time_cost(all): 1 day, 6:15:20/1 day, 8:39:00, loss=0.446387415477333, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=2.603281681387718, lr=0.2653654969693876
2023-12-06 17:11:17   INFO  epoch: 33/72, acc_iter=130796, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:21, time_cost(all): 1 day, 6:16:02/1 day, 9:50:28, loss=0.446328218051392, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=1.6347025942539657, lr=0.26525152989401973
2023-12-06 17:11:58   INFO  epoch: 33/72, acc_iter=130846, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:17, time_cost(all): 1 day, 6:16:43/1 day, 11:22:29, loss=0.446269020625451, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=2.4365296949249773, lr=0.265137562818652
2023-12-06 17:12:40   INFO  epoch: 33/72, acc_iter=130896, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:59, time_cost(all): 1 day, 6:17:25/1 day, 11:19:11, loss=0.44620982319951, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=0.7762304154446862, lr=0.26502359574328416
2023-12-06 17:13:22   INFO  epoch: 33/72, acc_iter=130946, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:51, time_cost(all): 1 day, 6:18:07/1 day, 8:46:02, loss=0.446150625773569, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=1.4973909982159732, lr=0.26490962866791634
2023-12-06 17:14:04   INFO  epoch: 33/72, acc_iter=130996, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:12, time_cost(all): 1 day, 6:18:49/1 day, 8:59:35, loss=0.446091428347628, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=1.4629986796764163, lr=0.2647956615925486
2023-12-06 17:14:46   INFO  epoch: 33/72, acc_iter=131046, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:41, time_cost(all): 1 day, 6:19:31/1 day, 8:44:47, loss=0.446032230921687, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.7181258988581147, lr=0.2646816945171808
2023-12-06 17:15:27   INFO  epoch: 33/72, acc_iter=131096, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:57, time_cost(all): 1 day, 6:20:12/1 day, 11:24:03, loss=0.445973033495746, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=3.2327924254182516, lr=0.264567727441813
2023-12-06 17:16:09   INFO  epoch: 33/72, acc_iter=131146, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:20, time_cost(all): 1 day, 6:20:54/1 day, 10:54:30, loss=0.445913836069805, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=2.9182110391986114, lr=0.2644537603664452
2023-12-06 17:16:51   INFO  epoch: 33/72, acc_iter=131196, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:32, time_cost(all): 1 day, 6:21:36/1 day, 9:09:11, loss=0.445854638643864, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=1.0002401329435824, lr=0.26433979329107743
2023-12-06 17:17:33   INFO  epoch: 33/72, acc_iter=131246, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 1 day, 6:22:18/1 day, 11:10:24, loss=0.445795441217924, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=1.9849109876328288, lr=0.26422582621570956
2023-12-06 17:18:14   INFO  epoch: 33/72, acc_iter=131296, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 6:22:59/1 day, 11:29:04, loss=0.445736243791983, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=2.5414558810153784, lr=0.26411185914034185
2023-12-06 17:18:56   INFO  epoch: 34/72, acc_iter=131358, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:54, time_cost(all): 1 day, 6:23:41/1 day, 10:50:57, loss=0.445662838983816, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=3.416652803486368, lr=0.26397053996688574
2023-12-06 17:19:38   INFO  epoch: 34/72, acc_iter=131408, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:25, time_cost(all): 1 day, 6:24:23/1 day, 11:05:32, loss=0.445603641557875, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.83(1.03), norm=3.169150289530169, lr=0.263856572891518
2023-12-06 17:20:20   INFO  epoch: 34/72, acc_iter=131458, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:34, time_cost(all): 1 day, 6:25:05/1 day, 9:15:45, loss=0.445544444131934, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=2.4785405932546674, lr=0.26374260581615017
2023-12-06 17:21:02   INFO  epoch: 34/72, acc_iter=131508, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:28, time_cost(all): 1 day, 6:25:47/1 day, 8:29:04, loss=0.445485246705993, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=4.398358466376189, lr=0.26362863874078235
2023-12-06 17:21:43   INFO  epoch: 34/72, acc_iter=131558, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:39, time_cost(all): 1 day, 6:26:28/1 day, 9:55:04, loss=0.445426049280052, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=1.412644365350776, lr=0.26351467166541453
2023-12-06 17:22:25   INFO  epoch: 34/72, acc_iter=131608, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:02, time_cost(all): 1 day, 6:27:10/1 day, 10:44:54, loss=0.445366851854111, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=1.186744996658583, lr=0.26340070459004683
2023-12-06 17:23:07   INFO  epoch: 34/72, acc_iter=131658, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:10, time_cost(all): 1 day, 6:27:52/1 day, 8:43:13, loss=0.44530765442817, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=1.7230413547774235, lr=0.26328673751467896
2023-12-06 17:23:49   INFO  epoch: 34/72, acc_iter=131708, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:21, time_cost(all): 1 day, 6:28:34/1 day, 11:33:55, loss=0.445248457002229, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=1.33806663087426, lr=0.2631727704393112
2023-12-06 17:24:30   INFO  epoch: 34/72, acc_iter=131758, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:19, time_cost(all): 1 day, 6:29:15/1 day, 11:24:25, loss=0.445189259576288, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=4.545019676072873, lr=0.2630588033639434
2023-12-06 17:25:12   INFO  epoch: 34/72, acc_iter=131808, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:00, time_cost(all): 1 day, 6:29:57/1 day, 8:50:47, loss=0.445130062150347, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=3.2438410978074423, lr=0.26294483628857557
2023-12-06 17:25:54   INFO  epoch: 34/72, acc_iter=131858, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:48:00, time_cost(all): 1 day, 6:30:39/1 day, 9:12:26, loss=0.445070864724406, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=2.9740902173340644, lr=0.2628308692132078
2023-12-06 17:26:36   INFO  epoch: 34/72, acc_iter=131908, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:23, time_cost(all): 1 day, 6:31:21/1 day, 9:14:05, loss=0.445011667298465, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=0.5674994468431078, lr=0.26271690213784
2023-12-06 17:27:18   INFO  epoch: 34/72, acc_iter=131958, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:27, time_cost(all): 1 day, 6:32:03/1 day, 8:24:47, loss=0.444952469872525, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=1.5848872020564637, lr=0.26260293506247223
2023-12-06 17:27:59   INFO  epoch: 34/72, acc_iter=132008, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:30, time_cost(all): 1 day, 6:32:44/1 day, 9:29:30, loss=0.444893272446584, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=1.7583181761259283, lr=0.2624889679871044
2023-12-06 17:28:41   INFO  epoch: 34/72, acc_iter=132058, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:42, time_cost(all): 1 day, 6:33:26/1 day, 9:46:12, loss=0.444834075020643, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=4.362645318088992, lr=0.2623750009117366
2023-12-06 17:29:23   INFO  epoch: 34/72, acc_iter=132108, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:36, time_cost(all): 1 day, 6:34:08/1 day, 8:58:04, loss=0.444774877594702, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=3.2882884868067155, lr=0.26226103383636884
2023-12-06 17:30:05   INFO  epoch: 34/72, acc_iter=132158, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:00, time_cost(all): 1 day, 6:34:50/1 day, 8:46:20, loss=0.444715680168761, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=2.6102347836913866, lr=0.2621470667610011
2023-12-06 17:30:46   INFO  epoch: 34/72, acc_iter=132208, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:26, time_cost(all): 1 day, 6:35:31/1 day, 9:35:50, loss=0.44465648274282, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.076951850844793, lr=0.26203309968563326
2023-12-06 17:31:28   INFO  epoch: 34/72, acc_iter=132258, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:52, time_cost(all): 1 day, 6:36:13/1 day, 8:42:25, loss=0.444597285316879, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=2.0707446793237088, lr=0.26191913261026545
2023-12-06 17:32:10   INFO  epoch: 34/72, acc_iter=132308, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:51, time_cost(all): 1 day, 6:36:55/1 day, 9:52:00, loss=0.444538087890938, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=4.718813914523596, lr=0.26180516553489763
2023-12-06 17:32:52   INFO  epoch: 34/72, acc_iter=132358, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:18, time_cost(all): 1 day, 6:37:37/1 day, 8:52:00, loss=0.444478890464997, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=4.817450163386333, lr=0.2616911984595298
2023-12-06 17:33:34   INFO  epoch: 34/72, acc_iter=132408, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:40:21, time_cost(all): 1 day, 6:38:19/1 day, 8:13:45, loss=0.444419693039056, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=1.406352313660669, lr=0.26157723138416206
2023-12-06 17:34:15   INFO  epoch: 34/72, acc_iter=132458, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:11, time_cost(all): 1 day, 6:39:00/1 day, 8:26:54, loss=0.444360495613115, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=1.6329016693076743, lr=0.26146326430879424
2023-12-06 17:34:57   INFO  epoch: 34/72, acc_iter=132508, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:16, time_cost(all): 1 day, 6:39:42/1 day, 8:46:19, loss=0.444301298187174, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=4.682032062512494, lr=0.2613492972334265
2023-12-06 17:35:39   INFO  epoch: 34/72, acc_iter=132558, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:50, time_cost(all): 1 day, 6:40:24/1 day, 9:49:32, loss=0.444242100761233, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=0.8644085429431565, lr=0.26123533015805867
2023-12-06 17:36:21   INFO  epoch: 34/72, acc_iter=132608, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:00, time_cost(all): 1 day, 6:41:06/1 day, 10:18:19, loss=0.444182903335292, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=4.170665480770101, lr=0.26112136308269085
2023-12-06 17:37:02   INFO  epoch: 34/72, acc_iter=132658, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:57, time_cost(all): 1 day, 6:41:47/1 day, 8:56:32, loss=0.444123705909351, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=0.5477762876111005, lr=0.2610073960073231
2023-12-06 17:37:44   INFO  epoch: 34/72, acc_iter=132708, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:07, time_cost(all): 1 day, 6:42:29/1 day, 9:45:55, loss=0.44406450848341, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=4.677541634883089, lr=0.26089342893195533
2023-12-06 17:38:26   INFO  epoch: 34/72, acc_iter=132758, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:01, time_cost(all): 1 day, 6:43:11/1 day, 10:10:08, loss=0.444005311057469, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=1.4724920416603064, lr=0.2607794618565875
2023-12-06 17:39:08   INFO  epoch: 34/72, acc_iter=132808, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:34, time_cost(all): 1 day, 6:43:53/1 day, 10:11:32, loss=0.443946113631528, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=3.1349713901797163, lr=0.2606654947812197
2023-12-06 17:39:50   INFO  epoch: 34/72, acc_iter=132858, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:59, time_cost(all): 1 day, 6:44:35/1 day, 9:17:51, loss=0.443886916205588, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=2.701047784939589, lr=0.26055152770585194
2023-12-06 17:40:31   INFO  epoch: 34/72, acc_iter=132908, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:08, time_cost(all): 1 day, 6:45:16/1 day, 11:22:38, loss=0.443827718779647, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=1.4509619658679358, lr=0.26043756063048407
2023-12-06 17:41:13   INFO  epoch: 34/72, acc_iter=132958, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:16, time_cost(all): 1 day, 6:45:58/1 day, 9:21:48, loss=0.443768521353706, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=0.5345182945735718, lr=0.26032359355511636
2023-12-06 17:41:55   INFO  epoch: 34/72, acc_iter=133008, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:58, time_cost(all): 1 day, 6:46:40/1 day, 11:06:41, loss=0.443709323927765, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.0920997287483505, lr=0.26020962647974855
2023-12-06 17:42:37   INFO  epoch: 34/72, acc_iter=133058, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:37, time_cost(all): 1 day, 6:47:22/1 day, 9:20:10, loss=0.443650126501824, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=1.5929547754225148, lr=0.26009565940438073
2023-12-06 17:43:18   INFO  epoch: 34/72, acc_iter=133108, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:56, time_cost(all): 1 day, 6:48:03/1 day, 9:42:47, loss=0.443590929075883, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=2.028943411514027, lr=0.2599816923290129
2023-12-06 17:44:00   INFO  epoch: 34/72, acc_iter=133158, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:45, time_cost(all): 1 day, 6:48:45/1 day, 10:20:13, loss=0.443531731649942, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=1.7973124490247243, lr=0.25986772525364515
2023-12-06 17:44:42   INFO  epoch: 34/72, acc_iter=133208, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:18, time_cost(all): 1 day, 6:49:27/1 day, 8:55:03, loss=0.443472534224001, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=2.160516516181854, lr=0.25975375817827734
2023-12-06 17:45:24   INFO  epoch: 34/72, acc_iter=133258, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:21, time_cost(all): 1 day, 6:50:09/1 day, 8:59:24, loss=0.44341333679806, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=3.4599834798143654, lr=0.2596397911029096
2023-12-06 17:46:06   INFO  epoch: 34/72, acc_iter=133308, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:24:44, time_cost(all): 1 day, 6:50:51/1 day, 10:07:43, loss=0.443354139372119, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=4.7775800795061, lr=0.25952582402754176
2023-12-06 17:46:47   INFO  epoch: 34/72, acc_iter=133358, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:41, time_cost(all): 1 day, 6:51:32/1 day, 10:44:55, loss=0.443294941946178, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=2.970152377058914, lr=0.25941185695217395
2023-12-06 17:47:29   INFO  epoch: 34/72, acc_iter=133408, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:01, time_cost(all): 1 day, 6:52:14/1 day, 10:32:57, loss=0.443235744520237, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=3.752729510050694, lr=0.2592978898768062
2023-12-06 17:48:11   INFO  epoch: 34/72, acc_iter=133458, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:42, time_cost(all): 1 day, 6:52:56/1 day, 9:19:32, loss=0.443176547094296, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=1.1696019161727622, lr=0.2591839228014383
2023-12-06 17:48:53   INFO  epoch: 34/72, acc_iter=133508, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:53, time_cost(all): 1 day, 6:53:38/1 day, 10:38:23, loss=0.443117349668355, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=4.6716884983619815, lr=0.2590699557260706
2023-12-06 17:49:35   INFO  epoch: 34/72, acc_iter=133558, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:03, time_cost(all): 1 day, 6:54:20/1 day, 9:16:23, loss=0.443058152242414, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=4.826914138686093, lr=0.2589559886507028
2023-12-06 17:50:16   INFO  epoch: 34/72, acc_iter=133608, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:28, time_cost(all): 1 day, 6:55:01/1 day, 11:11:45, loss=0.442998954816473, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=4.202708797596703, lr=0.258842021575335
2023-12-06 17:50:58   INFO  epoch: 34/72, acc_iter=133658, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:11, time_cost(all): 1 day, 6:55:43/1 day, 11:01:36, loss=0.442939757390532, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=1.2501243008622485, lr=0.25872805449996716
2023-12-06 17:51:40   INFO  epoch: 34/72, acc_iter=133708, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:22, time_cost(all): 1 day, 6:56:25/1 day, 11:15:37, loss=0.442880559964592, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=2.643304481375445, lr=0.25861408742459946
2023-12-06 17:52:22   INFO  epoch: 34/72, acc_iter=133758, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:18, time_cost(all): 1 day, 6:57:07/1 day, 10:06:42, loss=0.442821362538651, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=0.8198667815454608, lr=0.2585001203492316
2023-12-06 17:53:03   INFO  epoch: 34/72, acc_iter=133808, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:49, time_cost(all): 1 day, 6:57:48/1 day, 9:09:04, loss=0.44276216511271, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=3.8609660022242616, lr=0.25838615327386383
2023-12-06 17:53:45   INFO  epoch: 34/72, acc_iter=133858, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:33, time_cost(all): 1 day, 6:58:30/1 day, 10:43:05, loss=0.442702967686769, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=1.7086881796896072, lr=0.258272186198496
2023-12-06 17:54:27   INFO  epoch: 34/72, acc_iter=133908, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:40, time_cost(all): 1 day, 6:59:12/1 day, 8:01:59, loss=0.442643770260828, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=1.297024492375202, lr=0.2581582191231282
2023-12-06 17:55:09   INFO  epoch: 34/72, acc_iter=133958, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:12, time_cost(all): 1 day, 6:59:54/1 day, 8:49:25, loss=0.442584572834887, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=4.744938090805503, lr=0.25804425204776044
2023-12-06 17:55:51   INFO  epoch: 34/72, acc_iter=134008, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:20, time_cost(all): 1 day, 7:00:36/1 day, 8:56:07, loss=0.442525375408946, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=3.0932428193263797, lr=0.2579302849723927
2023-12-06 17:56:32   INFO  epoch: 34/72, acc_iter=134058, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:55, time_cost(all): 1 day, 7:01:17/1 day, 9:41:47, loss=0.442466177983005, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=1.375632591720841, lr=0.25781631789702486
2023-12-06 17:57:14   INFO  epoch: 34/72, acc_iter=134108, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:12, time_cost(all): 1 day, 7:01:59/1 day, 11:01:20, loss=0.442406980557064, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=1.8159701690197374, lr=0.25770235082165704
2023-12-06 17:57:56   INFO  epoch: 34/72, acc_iter=134158, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:26, time_cost(all): 1 day, 7:02:41/1 day, 8:36:46, loss=0.442347783131123, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=4.530195073208149, lr=0.25758838374628923
2023-12-06 17:58:38   INFO  epoch: 34/72, acc_iter=134208, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:41, time_cost(all): 1 day, 7:03:23/1 day, 10:14:44, loss=0.442288585705182, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=0.8754598233316138, lr=0.2574744166709214
2023-12-06 17:59:19   INFO  epoch: 34/72, acc_iter=134258, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:37, time_cost(all): 1 day, 7:04:04/1 day, 10:08:40, loss=0.442229388279241, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=3.734665830458064, lr=0.2573604495955537
2023-12-06 18:00:01   INFO  epoch: 34/72, acc_iter=134308, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:31, time_cost(all): 1 day, 7:04:46/1 day, 9:10:10, loss=0.4421701908533, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=4.245553923504508, lr=0.25724648252018584
2023-12-06 18:00:43   INFO  epoch: 34/72, acc_iter=134358, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:58, time_cost(all): 1 day, 7:05:28/1 day, 9:13:52, loss=0.442110993427359, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=4.730979452254713, lr=0.2571325154448181
2023-12-06 18:01:25   INFO  epoch: 34/72, acc_iter=134408, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:11:03, time_cost(all): 1 day, 7:06:10/1 day, 7:50:20, loss=0.442051796001418, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=0.737763963798137, lr=0.25701854836945026
2023-12-06 18:02:07   INFO  epoch: 34/72, acc_iter=134458, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:55, time_cost(all): 1 day, 7:06:52/1 day, 10:03:40, loss=0.441992598575477, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=3.710912629688059, lr=0.25690458129408245
2023-12-06 18:02:48   INFO  epoch: 34/72, acc_iter=134508, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:47, time_cost(all): 1 day, 7:07:33/1 day, 9:30:35, loss=0.441933401149536, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.0(1.03), norm=3.5012995877778503, lr=0.2567906142187147
2023-12-06 18:03:30   INFO  epoch: 34/72, acc_iter=134558, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:13, time_cost(all): 1 day, 7:08:15/1 day, 9:49:27, loss=0.441874203723596, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=2.7303481944053463, lr=0.2566766471433469
2023-12-06 18:04:12   INFO  epoch: 34/72, acc_iter=134608, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:27, time_cost(all): 1 day, 7:08:57/1 day, 8:10:01, loss=0.441815006297655, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=0.6765253409055741, lr=0.2565626800679791
2023-12-06 18:04:54   INFO  epoch: 34/72, acc_iter=134658, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:28, time_cost(all): 1 day, 7:09:39/1 day, 8:36:07, loss=0.441755808871714, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=3.0672796376294174, lr=0.2564487129926113
2023-12-06 18:05:35   INFO  epoch: 34/72, acc_iter=134708, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:16, time_cost(all): 1 day, 7:10:20/1 day, 7:48:55, loss=0.441696611445773, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=2.4894847089480647, lr=0.25633474591724353
2023-12-06 18:06:17   INFO  epoch: 34/72, acc_iter=134758, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:34, time_cost(all): 1 day, 7:11:02/1 day, 9:56:42, loss=0.441637414019832, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=1.302609191563481, lr=0.2562207788418757
2023-12-06 18:06:59   INFO  epoch: 34/72, acc_iter=134808, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:52, time_cost(all): 1 day, 7:11:44/1 day, 10:12:11, loss=0.441578216593891, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=4.318837505906384, lr=0.25610681176650796
2023-12-06 18:07:41   INFO  epoch: 34/72, acc_iter=134858, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:25, time_cost(all): 1 day, 7:12:26/1 day, 9:45:39, loss=0.44151901916795, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=3.8789189662659975, lr=0.25599284469114014
2023-12-06 18:08:23   INFO  epoch: 34/72, acc_iter=134908, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:38, time_cost(all): 1 day, 7:13:08/1 day, 7:57:05, loss=0.441459821742009, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=1.9558235014939722, lr=0.2558788776157723
2023-12-06 18:09:04   INFO  epoch: 34/72, acc_iter=134958, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:00, time_cost(all): 1 day, 7:13:49/1 day, 9:54:12, loss=0.441400624316068, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=1.1706238763275394, lr=0.2557649105404045
2023-12-06 18:09:46   INFO  epoch: 34/72, acc_iter=135008, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:18, time_cost(all): 1 day, 7:14:31/1 day, 8:18:42, loss=0.441341426890127, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=4.937700891225428, lr=0.2556509434650367
2023-12-06 18:10:28   INFO  epoch: 34/72, acc_iter=135058, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 1 day, 7:15:13/1 day, 9:53:07, loss=0.441282229464186, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.3067453288853996, lr=0.25553697638966894
2023-12-06 18:11:10   INFO  epoch: 34/72, acc_iter=135108, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 1 day, 7:15:55/1 day, 9:43:18, loss=0.441223032038245, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=2.0891091573675133, lr=0.2554230093143012
2023-12-06 18:11:51   INFO  epoch: 34/72, acc_iter=135158, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 7:16:36/1 day, 9:30:19, loss=0.441163834612304, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=2.1928000041160343, lr=0.25530904223893336
2023-12-06 18:12:33   INFO  epoch: 35/72, acc_iter=135220, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:50:44, time_cost(all): 1 day, 7:17:18/1 day, 8:13:07, loss=0.441090429804137, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=1.7626019001515254, lr=0.2551677230654773
2023-12-06 18:13:15   INFO  epoch: 35/72, acc_iter=135270, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:57, time_cost(all): 1 day, 7:18:00/1 day, 10:46:48, loss=0.441031232378197, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=1.6585109227179387, lr=0.2550537559901095
2023-12-06 18:13:57   INFO  epoch: 35/72, acc_iter=135320, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:01, time_cost(all): 1 day, 7:18:42/1 day, 7:41:38, loss=0.440972034952256, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=3.428178049042042, lr=0.25493978891474167
2023-12-06 18:14:39   INFO  epoch: 35/72, acc_iter=135370, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:35, time_cost(all): 1 day, 7:19:24/1 day, 9:48:11, loss=0.440912837526315, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.11(1.03), norm=1.3885955413384905, lr=0.2548258218393739
2023-12-06 18:15:20   INFO  epoch: 35/72, acc_iter=135420, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:50, time_cost(all): 1 day, 7:20:05/1 day, 8:48:57, loss=0.440853640100374, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=3.440593005705613, lr=0.2547118547640061
2023-12-06 18:16:02   INFO  epoch: 35/72, acc_iter=135470, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:41, time_cost(all): 1 day, 7:20:47/1 day, 9:59:56, loss=0.440794442674433, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=2.992980873426771, lr=0.25459788768863834
2023-12-06 18:16:44   INFO  epoch: 35/72, acc_iter=135520, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:29, time_cost(all): 1 day, 7:21:29/1 day, 10:14:53, loss=0.440735245248492, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=4.369313565014538, lr=0.2544839206132705
2023-12-06 18:17:26   INFO  epoch: 35/72, acc_iter=135570, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:47, time_cost(all): 1 day, 7:22:11/1 day, 7:56:22, loss=0.440676047822551, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=3.401705756866439, lr=0.2543699535379027
2023-12-06 18:18:07   INFO  epoch: 35/72, acc_iter=135620, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:43, time_cost(all): 1 day, 7:22:52/1 day, 7:46:25, loss=0.44061685039661, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=3.682103091294791, lr=0.25425598646253494
2023-12-06 18:18:49   INFO  epoch: 35/72, acc_iter=135670, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:44:41, time_cost(all): 1 day, 7:23:34/1 day, 10:12:17, loss=0.440557652970669, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=0.8201195560266688, lr=0.2541420193871672
2023-12-06 18:19:31   INFO  epoch: 35/72, acc_iter=135720, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:42, time_cost(all): 1 day, 7:24:16/1 day, 10:31:49, loss=0.440498455544728, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=2.4047144715736017, lr=0.25402805231179937
2023-12-06 18:20:13   INFO  epoch: 35/72, acc_iter=135770, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:11, time_cost(all): 1 day, 7:24:58/1 day, 10:23:29, loss=0.440439258118787, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=4.579647684023257, lr=0.25391408523643155
2023-12-06 18:20:55   INFO  epoch: 35/72, acc_iter=135820, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:10, time_cost(all): 1 day, 7:25:40/1 day, 10:34:57, loss=0.440380060692846, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.6275805055847874, lr=0.2538001181610638
2023-12-06 18:21:36   INFO  epoch: 35/72, acc_iter=135870, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:26, time_cost(all): 1 day, 7:26:21/1 day, 7:53:33, loss=0.440320863266905, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=4.409688466472163, lr=0.2536861510856959
2023-12-06 18:22:18   INFO  epoch: 35/72, acc_iter=135920, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:43, time_cost(all): 1 day, 7:27:03/1 day, 7:56:34, loss=0.440261665840964, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.99(1.03), norm=1.3344475949740642, lr=0.2535721840103282
2023-12-06 18:23:00   INFO  epoch: 35/72, acc_iter=135970, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:43, time_cost(all): 1 day, 7:27:45/1 day, 9:43:39, loss=0.440202468415023, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=4.98186496211083, lr=0.2534582169349604
2023-12-06 18:23:42   INFO  epoch: 35/72, acc_iter=136020, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:54, time_cost(all): 1 day, 7:28:27/1 day, 7:48:48, loss=0.440143270989082, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=1.910379634108474, lr=0.2533442498595926
2023-12-06 18:24:24   INFO  epoch: 35/72, acc_iter=136070, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:02, time_cost(all): 1 day, 7:29:09/1 day, 8:05:10, loss=0.440084073563141, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=1.7632380389795532, lr=0.25323028278422477
2023-12-06 18:25:05   INFO  epoch: 35/72, acc_iter=136120, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:59, time_cost(all): 1 day, 7:29:50/1 day, 9:44:24, loss=0.440024876137201, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=4.84038596144236, lr=0.25311631570885695
2023-12-06 18:25:47   INFO  epoch: 35/72, acc_iter=136170, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:24, time_cost(all): 1 day, 7:30:32/1 day, 10:04:21, loss=0.43996567871126, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=2.989972050579971, lr=0.2530023486334892
2023-12-06 18:26:29   INFO  epoch: 35/72, acc_iter=136220, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:31, time_cost(all): 1 day, 7:31:14/1 day, 7:39:15, loss=0.439906481285319, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=0.599342392788784, lr=0.25288838155812143
2023-12-06 18:27:11   INFO  epoch: 35/72, acc_iter=136270, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:41, time_cost(all): 1 day, 7:31:56/1 day, 7:25:48, loss=0.439847283859378, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=2.765857605403214, lr=0.2527744144827536
2023-12-06 18:27:52   INFO  epoch: 35/72, acc_iter=136320, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:00, time_cost(all): 1 day, 7:32:37/1 day, 7:22:14, loss=0.439788086433437, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=0.5379612942277614, lr=0.2526604474073858
2023-12-06 18:28:34   INFO  epoch: 35/72, acc_iter=136370, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:26, time_cost(all): 1 day, 7:33:19/1 day, 7:38:34, loss=0.439728889007496, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=4.714635845959047, lr=0.25254648033201804
2023-12-06 18:29:16   INFO  epoch: 35/72, acc_iter=136420, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:28, time_cost(all): 1 day, 7:34:01/1 day, 9:49:39, loss=0.439669691581555, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=3.177437467702733, lr=0.25243251325665017
2023-12-06 18:29:58   INFO  epoch: 35/72, acc_iter=136470, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:10, time_cost(all): 1 day, 7:34:43/1 day, 10:29:01, loss=0.439610494155614, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.23(1.03), norm=1.5629185646682615, lr=0.25231854618128247
2023-12-06 18:30:40   INFO  epoch: 35/72, acc_iter=136520, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:58, time_cost(all): 1 day, 7:35:25/1 day, 9:24:26, loss=0.439551296729673, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=3.102827453832093, lr=0.25220457910591465
2023-12-06 18:31:21   INFO  epoch: 35/72, acc_iter=136570, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:07, time_cost(all): 1 day, 7:36:06/1 day, 10:08:36, loss=0.439492099303732, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=0.5144348489507731, lr=0.25209061203054683
2023-12-06 18:32:03   INFO  epoch: 35/72, acc_iter=136620, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:03, time_cost(all): 1 day, 7:36:48/1 day, 9:02:55, loss=0.439432901877791, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=4.137488315009032, lr=0.251976644955179
2023-12-06 18:32:45   INFO  epoch: 35/72, acc_iter=136670, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:37, time_cost(all): 1 day, 7:37:30/1 day, 9:38:26, loss=0.43937370445185, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=4.361700920542272, lr=0.2518626778798113
2023-12-06 18:33:27   INFO  epoch: 35/72, acc_iter=136720, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:00, time_cost(all): 1 day, 7:38:12/1 day, 9:43:06, loss=0.439314507025909, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=4.004876329924243, lr=0.25174871080444344
2023-12-06 18:34:08   INFO  epoch: 35/72, acc_iter=136770, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:46, time_cost(all): 1 day, 7:38:53/1 day, 10:26:54, loss=0.439255309599968, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=1.4643082051443264, lr=0.2516347437290757
2023-12-06 18:34:50   INFO  epoch: 35/72, acc_iter=136820, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:18, time_cost(all): 1 day, 7:39:35/1 day, 8:06:45, loss=0.439196112174027, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=2.031842066329156, lr=0.25152077665370787
2023-12-06 18:35:32   INFO  epoch: 35/72, acc_iter=136870, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:13, time_cost(all): 1 day, 7:40:17/1 day, 9:04:06, loss=0.439136914748086, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=1.16022573050516, lr=0.25140680957834005
2023-12-06 18:36:14   INFO  epoch: 35/72, acc_iter=136920, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:24, time_cost(all): 1 day, 7:40:59/1 day, 9:58:02, loss=0.439077717322146, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=1.454656960581222, lr=0.2512928425029723
2023-12-06 18:36:56   INFO  epoch: 35/72, acc_iter=136970, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:46, time_cost(all): 1 day, 7:41:41/1 day, 7:43:30, loss=0.439018519896205, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=4.609373550724848, lr=0.2511788754276045
2023-12-06 18:37:37   INFO  epoch: 35/72, acc_iter=137020, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:29:15, time_cost(all): 1 day, 7:42:22/1 day, 10:13:40, loss=0.438959322470264, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=3.2159681057807594, lr=0.2510649083522367
2023-12-06 18:38:19   INFO  epoch: 35/72, acc_iter=137070, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:13, time_cost(all): 1 day, 7:43:04/1 day, 7:26:33, loss=0.438900125044323, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=4.334705603011783, lr=0.2509509412768689
2023-12-06 18:39:01   INFO  epoch: 35/72, acc_iter=137120, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:41, time_cost(all): 1 day, 7:43:46/1 day, 10:09:25, loss=0.438840927618382, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=4.164059322542286, lr=0.2508369742015011
2023-12-06 18:39:43   INFO  epoch: 35/72, acc_iter=137170, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:36, time_cost(all): 1 day, 7:44:28/1 day, 9:26:11, loss=0.438781730192441, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=4.538342076734171, lr=0.25072300712613327
2023-12-06 18:40:24   INFO  epoch: 35/72, acc_iter=137220, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:00, time_cost(all): 1 day, 7:45:09/1 day, 7:50:22, loss=0.4387225327665, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=3.0649515545297783, lr=0.25060904005076556
2023-12-06 18:41:06   INFO  epoch: 35/72, acc_iter=137270, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:19, time_cost(all): 1 day, 7:45:51/1 day, 8:08:48, loss=0.438663335340559, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=1.0692137831237973, lr=0.2504950729753977
2023-12-06 18:41:48   INFO  epoch: 35/72, acc_iter=137320, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:27, time_cost(all): 1 day, 7:46:33/1 day, 9:03:39, loss=0.438604137914618, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.88(1.03), norm=3.411400819169796, lr=0.25038110590002993
2023-12-06 18:42:30   INFO  epoch: 35/72, acc_iter=137370, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:24:17, time_cost(all): 1 day, 7:47:15/1 day, 8:12:05, loss=0.438544940488677, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=3.489289818107209, lr=0.2502671388246621
2023-12-06 18:43:12   INFO  epoch: 35/72, acc_iter=137420, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:48, time_cost(all): 1 day, 7:47:57/1 day, 10:01:48, loss=0.438485743062736, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=3.482374279376949, lr=0.2501531717492943
2023-12-06 18:43:53   INFO  epoch: 35/72, acc_iter=137470, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:58, time_cost(all): 1 day, 7:48:38/1 day, 8:10:06, loss=0.438426545636795, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=4.960988926157802, lr=0.25003920467392654
2023-12-06 18:44:35   INFO  epoch: 35/72, acc_iter=137520, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:05, time_cost(all): 1 day, 7:49:20/1 day, 7:11:04, loss=0.438367348210854, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=3.103596053172101, lr=0.24992523759855878
2023-12-06 18:45:17   INFO  epoch: 35/72, acc_iter=137570, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:40, time_cost(all): 1 day, 7:50:02/1 day, 9:39:55, loss=0.438308150784913, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=1.434401516411239, lr=0.24981127052319096
2023-12-06 18:45:59   INFO  epoch: 35/72, acc_iter=137620, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:48, time_cost(all): 1 day, 7:50:44/1 day, 9:09:03, loss=0.438248953358972, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=2.6440165654754635, lr=0.24969730344782315
2023-12-06 18:46:40   INFO  epoch: 35/72, acc_iter=137670, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:43, time_cost(all): 1 day, 7:51:25/1 day, 7:35:54, loss=0.438189755933031, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=0.6787787452705198, lr=0.2495833363724554
2023-12-06 18:47:22   INFO  epoch: 35/72, acc_iter=137720, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:41, time_cost(all): 1 day, 7:52:07/1 day, 7:55:56, loss=0.43813055850709, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=2.685232496899298, lr=0.24946936929708757
2023-12-06 18:48:04   INFO  epoch: 35/72, acc_iter=137770, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:16, time_cost(all): 1 day, 7:52:49/1 day, 10:13:50, loss=0.43807136108115, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=0.5471723863355814, lr=0.24935540222171976
2023-12-06 18:48:46   INFO  epoch: 35/72, acc_iter=137820, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:22, time_cost(all): 1 day, 7:53:31/1 day, 7:26:53, loss=0.438012163655209, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=3.147097338620584, lr=0.24924143514635194
2023-12-06 18:49:28   INFO  epoch: 35/72, acc_iter=137870, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:16, time_cost(all): 1 day, 7:54:13/1 day, 7:08:08, loss=0.437952966229268, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.83(1.03), norm=4.185802819271459, lr=0.24912746807098418
2023-12-06 18:50:09   INFO  epoch: 35/72, acc_iter=137920, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:09, time_cost(all): 1 day, 7:54:54/1 day, 8:48:50, loss=0.437893768803327, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.809226639025591, lr=0.24901350099561642
2023-12-06 18:50:51   INFO  epoch: 35/72, acc_iter=137970, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:04, time_cost(all): 1 day, 7:55:36/1 day, 8:04:19, loss=0.437834571377386, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=1.647565241994352, lr=0.24889953392024855
2023-12-06 18:51:33   INFO  epoch: 35/72, acc_iter=138020, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:31, time_cost(all): 1 day, 7:56:18/1 day, 7:37:21, loss=0.437775373951445, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=2.346263851518149, lr=0.2487855668448808
2023-12-06 18:52:15   INFO  epoch: 35/72, acc_iter=138070, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:00, time_cost(all): 1 day, 7:57:00/1 day, 10:10:25, loss=0.437716176525504, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=1.3813623375275756, lr=0.24867159976951303
2023-12-06 18:52:56   INFO  epoch: 35/72, acc_iter=138120, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:11, time_cost(all): 1 day, 7:57:41/1 day, 7:02:26, loss=0.437656979099563, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=0.5955070759067751, lr=0.24855763269414521
2023-12-06 18:53:38   INFO  epoch: 35/72, acc_iter=138170, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:30, time_cost(all): 1 day, 7:58:23/1 day, 7:59:03, loss=0.437597781673622, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=1.8985862970214356, lr=0.2484436656187774
2023-12-06 18:54:20   INFO  epoch: 35/72, acc_iter=138220, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:45, time_cost(all): 1 day, 7:59:05/1 day, 9:51:35, loss=0.437538584247681, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=3.5234057326798514, lr=0.24832969854340964
2023-12-06 18:55:02   INFO  epoch: 35/72, acc_iter=138270, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:39, time_cost(all): 1 day, 7:59:47/1 day, 8:10:12, loss=0.43747938682174, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=1.6113051748724263, lr=0.24821573146804182
2023-12-06 18:55:44   INFO  epoch: 35/72, acc_iter=138320, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:31, time_cost(all): 1 day, 8:00:29/1 day, 7:51:46, loss=0.437420189395799, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=3.5557307381584784, lr=0.24810176439267406
2023-12-06 18:56:25   INFO  epoch: 35/72, acc_iter=138370, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:10, time_cost(all): 1 day, 8:01:10/1 day, 8:25:56, loss=0.437360991969858, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=2.0077950121931503, lr=0.24798779731730625
2023-12-06 18:57:07   INFO  epoch: 35/72, acc_iter=138420, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:10, time_cost(all): 1 day, 8:01:52/1 day, 9:28:09, loss=0.437301794543917, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.89(1.03), norm=1.9281797615334957, lr=0.24787383024193843
2023-12-06 18:57:49   INFO  epoch: 35/72, acc_iter=138470, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:39, time_cost(all): 1 day, 8:02:34/1 day, 8:24:16, loss=0.437242597117976, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=2.460146460351812, lr=0.24775986316657067
2023-12-06 18:58:31   INFO  epoch: 35/72, acc_iter=138520, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:09, time_cost(all): 1 day, 8:03:16/1 day, 7:56:59, loss=0.437183399692035, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=4.720078021713039, lr=0.24764589609120286
2023-12-06 18:59:13   INFO  epoch: 35/72, acc_iter=138570, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:42, time_cost(all): 1 day, 8:03:58/1 day, 9:08:40, loss=0.437124202266094, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=2.844922922335141, lr=0.24753192901583504
2023-12-06 18:59:54   INFO  epoch: 35/72, acc_iter=138620, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:52, time_cost(all): 1 day, 8:04:39/1 day, 9:06:10, loss=0.437065004840154, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=0.5735869699592697, lr=0.24741796194046728
2023-12-06 19:00:36   INFO  epoch: 35/72, acc_iter=138670, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:00, time_cost(all): 1 day, 8:05:21/1 day, 9:25:38, loss=0.437005807414213, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=2.1471249512186303, lr=0.24730399486509946
2023-12-06 19:01:18   INFO  epoch: 35/72, acc_iter=138720, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:09, time_cost(all): 1 day, 8:06:03/1 day, 7:18:47, loss=0.436946609988272, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=2.4154083548057326, lr=0.24719002778973165
2023-12-06 19:02:00   INFO  epoch: 35/72, acc_iter=138770, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:30, time_cost(all): 1 day, 8:06:45/1 day, 7:37:28, loss=0.436887412562331, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=3.2999408919097855, lr=0.2470760607143639
2023-12-06 19:02:41   INFO  epoch: 35/72, acc_iter=138820, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:00, time_cost(all): 1 day, 8:07:26/1 day, 9:59:42, loss=0.43682821513639, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=4.943712488372442, lr=0.24696209363899607
2023-12-06 19:03:23   INFO  epoch: 35/72, acc_iter=138870, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:13, time_cost(all): 1 day, 8:08:08/1 day, 7:20:31, loss=0.436769017710449, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=3.8724936966701526, lr=0.2468481265636283
2023-12-06 19:04:05   INFO  epoch: 35/72, acc_iter=138920, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:32, time_cost(all): 1 day, 8:08:50/1 day, 9:44:27, loss=0.436709820284508, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=1.0993263878806838, lr=0.2467341594882605
2023-12-06 19:04:47   INFO  epoch: 35/72, acc_iter=138970, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 1 day, 8:09:32/1 day, 8:11:30, loss=0.436650622858567, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=3.270551336642203, lr=0.24662019241289268
2023-12-06 19:05:29   INFO  epoch: 35/72, acc_iter=139020, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 8:10:14/1 day, 8:50:11, loss=0.436591425432626, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=4.24538574908866, lr=0.24650622533752492
2023-12-06 19:06:10   INFO  epoch: 36/72, acc_iter=139082, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:12, time_cost(all): 1 day, 8:10:55/1 day, 7:56:41, loss=0.436518020624459, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=1.3088023030435132, lr=0.2463649061640688
2023-12-06 19:06:52   INFO  epoch: 36/72, acc_iter=139132, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:59, time_cost(all): 1 day, 8:11:37/1 day, 9:53:02, loss=0.436458823198518, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=3.8135083650442105, lr=0.24625093908870105
2023-12-06 19:07:34   INFO  epoch: 36/72, acc_iter=139182, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:07, time_cost(all): 1 day, 8:12:19/1 day, 8:27:18, loss=0.436399625772577, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=3.8457160157234545, lr=0.2461369720133333
2023-12-06 19:08:16   INFO  epoch: 36/72, acc_iter=139232, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:52:45, time_cost(all): 1 day, 8:13:01/1 day, 9:10:32, loss=0.436340428346636, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=2.688910095532109, lr=0.24602300493796542
2023-12-06 19:08:57   INFO  epoch: 36/72, acc_iter=139282, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:09, time_cost(all): 1 day, 8:13:42/1 day, 9:50:01, loss=0.436281230920695, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=4.021445215595554, lr=0.24590903786259766
2023-12-06 19:09:39   INFO  epoch: 36/72, acc_iter=139332, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:21, time_cost(all): 1 day, 8:14:24/1 day, 7:47:47, loss=0.436222033494755, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=2.916517185749373, lr=0.2457950707872299
2023-12-06 19:10:21   INFO  epoch: 36/72, acc_iter=139382, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:13, time_cost(all): 1 day, 8:15:06/1 day, 8:22:19, loss=0.436162836068814, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=0.8861925529952468, lr=0.24568110371186208
2023-12-06 19:11:03   INFO  epoch: 36/72, acc_iter=139432, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:53, time_cost(all): 1 day, 8:15:48/1 day, 6:58:50, loss=0.436103638642873, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=1.7756334502247955, lr=0.24556713663649427
2023-12-06 19:11:45   INFO  epoch: 36/72, acc_iter=139482, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:25, time_cost(all): 1 day, 8:16:30/1 day, 9:13:18, loss=0.436044441216932, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=2.1341500916673217, lr=0.2454531695611265
2023-12-06 19:12:26   INFO  epoch: 36/72, acc_iter=139532, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:44, time_cost(all): 1 day, 8:17:11/1 day, 7:25:37, loss=0.435985243790991, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=3.7464919557420426, lr=0.2453392024857587
2023-12-06 19:13:08   INFO  epoch: 36/72, acc_iter=139582, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:40, time_cost(all): 1 day, 8:17:53/1 day, 8:23:46, loss=0.43592604636505, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=4.803001254064775, lr=0.24522523541039093
2023-12-06 19:13:50   INFO  epoch: 36/72, acc_iter=139632, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:18, time_cost(all): 1 day, 8:18:35/1 day, 9:16:32, loss=0.435866848939109, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=3.78959626763762, lr=0.2451112683350231
2023-12-06 19:14:32   INFO  epoch: 36/72, acc_iter=139682, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:12, time_cost(all): 1 day, 8:19:17/1 day, 9:04:18, loss=0.435807651513168, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=2.007934943207764, lr=0.2449973012596553
2023-12-06 19:15:13   INFO  epoch: 36/72, acc_iter=139732, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:42, time_cost(all): 1 day, 8:19:58/1 day, 6:57:24, loss=0.435748454087227, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=4.100946486539473, lr=0.24488333418428754
2023-12-06 19:15:55   INFO  epoch: 36/72, acc_iter=139782, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:16, time_cost(all): 1 day, 8:20:40/1 day, 7:48:43, loss=0.435689256661286, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=2.57725618214975, lr=0.24476936710891972
2023-12-06 19:16:37   INFO  epoch: 36/72, acc_iter=139832, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:33, time_cost(all): 1 day, 8:21:22/1 day, 7:50:20, loss=0.435630059235345, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=3.069979976620407, lr=0.2446554000335519
2023-12-06 19:17:19   INFO  epoch: 36/72, acc_iter=139882, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:50, time_cost(all): 1 day, 8:22:04/1 day, 8:53:35, loss=0.435570861809404, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=4.554957494608196, lr=0.24454143295818415
2023-12-06 19:18:01   INFO  epoch: 36/72, acc_iter=139932, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:43, time_cost(all): 1 day, 8:22:46/1 day, 7:27:57, loss=0.435511664383463, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=0.645360106487977, lr=0.24442746588281633
2023-12-06 19:18:42   INFO  epoch: 36/72, acc_iter=139982, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:54, time_cost(all): 1 day, 8:23:27/1 day, 9:36:26, loss=0.435452466957522, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=2.0905949567236, lr=0.24431349880744851
2023-12-06 19:19:24   INFO  epoch: 36/72, acc_iter=140032, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:17, time_cost(all): 1 day, 8:24:09/1 day, 7:26:49, loss=0.435393269531581, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=3.4558822693624194, lr=0.24419953173208075
2023-12-06 19:20:06   INFO  epoch: 36/72, acc_iter=140082, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:32, time_cost(all): 1 day, 8:24:51/1 day, 7:14:09, loss=0.43533407210564, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=4.000378335138214, lr=0.24408556465671294
2023-12-06 19:20:48   INFO  epoch: 36/72, acc_iter=140132, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:12, time_cost(all): 1 day, 8:25:33/1 day, 9:21:53, loss=0.435274874679699, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=3.1697459817507556, lr=0.24397159758134518
2023-12-06 19:21:29   INFO  epoch: 36/72, acc_iter=140182, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:21, time_cost(all): 1 day, 8:26:14/1 day, 7:42:32, loss=0.435215677253759, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=3.3493394639097955, lr=0.24385763050597736
2023-12-06 19:22:11   INFO  epoch: 36/72, acc_iter=140232, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:10, time_cost(all): 1 day, 8:26:56/1 day, 9:08:25, loss=0.435156479827818, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=1.8550073626480532, lr=0.24374366343060955
2023-12-06 19:22:53   INFO  epoch: 36/72, acc_iter=140282, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:50, time_cost(all): 1 day, 8:27:38/1 day, 7:17:37, loss=0.435097282401877, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=1.3933330588019932, lr=0.2436296963552418
2023-12-06 19:23:35   INFO  epoch: 36/72, acc_iter=140332, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:12, time_cost(all): 1 day, 8:28:20/1 day, 7:33:09, loss=0.435038084975936, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=2.3218331158820833, lr=0.24351572927987403
2023-12-06 19:24:17   INFO  epoch: 36/72, acc_iter=140382, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:46, time_cost(all): 1 day, 8:29:02/1 day, 6:29:27, loss=0.434978887549995, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=4.370027497011207, lr=0.24340176220450616
2023-12-06 19:24:58   INFO  epoch: 36/72, acc_iter=140432, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:15, time_cost(all): 1 day, 8:29:43/1 day, 7:49:21, loss=0.434919690124054, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=3.9292766829729033, lr=0.2432877951291384
2023-12-06 19:25:40   INFO  epoch: 36/72, acc_iter=140482, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:37, time_cost(all): 1 day, 8:30:25/1 day, 8:49:59, loss=0.434860492698113, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=3.6820931310066474, lr=0.24317382805377064
2023-12-06 19:26:22   INFO  epoch: 36/72, acc_iter=140532, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:22, time_cost(all): 1 day, 8:31:07/1 day, 7:01:52, loss=0.434801295272172, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=0.8516129245429881, lr=0.24305986097840282
2023-12-06 19:27:04   INFO  epoch: 36/72, acc_iter=140582, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:17, time_cost(all): 1 day, 8:31:49/1 day, 9:14:53, loss=0.434742097846231, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=4.146639840870415, lr=0.242945893903035
2023-12-06 19:27:45   INFO  epoch: 36/72, acc_iter=140632, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:32, time_cost(all): 1 day, 8:32:30/1 day, 9:16:09, loss=0.43468290042029, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=4.987570842641063, lr=0.2428319268276672
2023-12-06 19:28:27   INFO  epoch: 36/72, acc_iter=140682, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:23, time_cost(all): 1 day, 8:33:12/1 day, 7:38:34, loss=0.434623702994349, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=4.817550677114555, lr=0.24271795975229943
2023-12-06 19:29:09   INFO  epoch: 36/72, acc_iter=140732, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:23, time_cost(all): 1 day, 8:33:54/1 day, 6:44:59, loss=0.434564505568408, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=2.048434844754567, lr=0.2426039926769316
2023-12-06 19:29:51   INFO  epoch: 36/72, acc_iter=140782, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:24, time_cost(all): 1 day, 8:34:36/1 day, 9:17:22, loss=0.434505308142467, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=3.545880259419296, lr=0.2424900256015638
2023-12-06 19:30:33   INFO  epoch: 36/72, acc_iter=140832, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:56, time_cost(all): 1 day, 8:35:18/1 day, 7:06:53, loss=0.434446110716526, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=1.7813827978021577, lr=0.24237605852619604
2023-12-06 19:31:14   INFO  epoch: 36/72, acc_iter=140882, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:14, time_cost(all): 1 day, 8:35:59/1 day, 7:52:25, loss=0.434386913290585, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=2.208901699969065, lr=0.24226209145082828
2023-12-06 19:31:56   INFO  epoch: 36/72, acc_iter=140932, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:12, time_cost(all): 1 day, 8:36:41/1 day, 6:54:50, loss=0.434327715864644, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=2.9710792539725293, lr=0.2421481243754604
2023-12-06 19:32:38   INFO  epoch: 36/72, acc_iter=140982, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:39, time_cost(all): 1 day, 8:37:23/1 day, 7:26:52, loss=0.434268518438703, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=2.706143997190544, lr=0.24203415730009264
2023-12-06 19:33:20   INFO  epoch: 36/72, acc_iter=141032, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:22, time_cost(all): 1 day, 8:38:05/1 day, 8:08:34, loss=0.434209321012763, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=1.9882477869212143, lr=0.24192019022472488
2023-12-06 19:34:01   INFO  epoch: 36/72, acc_iter=141082, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:44, time_cost(all): 1 day, 8:38:46/1 day, 8:17:03, loss=0.434150123586822, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=1.5773361921744327, lr=0.24180622314935707
2023-12-06 19:34:43   INFO  epoch: 36/72, acc_iter=141132, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:47, time_cost(all): 1 day, 8:39:28/1 day, 9:17:04, loss=0.434090926160881, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=3.4826339301485927, lr=0.24169225607398925
2023-12-06 19:35:25   INFO  epoch: 36/72, acc_iter=141182, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:17, time_cost(all): 1 day, 8:40:10/1 day, 7:05:16, loss=0.43403172873494, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.4941439125498013, lr=0.2415782889986215
2023-12-06 19:36:07   INFO  epoch: 36/72, acc_iter=141232, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:02, time_cost(all): 1 day, 8:40:52/1 day, 8:55:23, loss=0.433972531308999, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=3.4695814254695327, lr=0.24146432192325368
2023-12-06 19:36:49   INFO  epoch: 36/72, acc_iter=141282, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:20, time_cost(all): 1 day, 8:41:34/1 day, 7:42:58, loss=0.433913333883058, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=3.9700185858310237, lr=0.24135035484788592
2023-12-06 19:37:30   INFO  epoch: 36/72, acc_iter=141332, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:48, time_cost(all): 1 day, 8:42:15/1 day, 7:21:37, loss=0.433854136457117, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=4.0251851246005295, lr=0.24123638777251805
2023-12-06 19:38:12   INFO  epoch: 36/72, acc_iter=141382, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:18, time_cost(all): 1 day, 8:42:57/1 day, 7:37:25, loss=0.433794939031176, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=4.181856914575151, lr=0.24112242069715029
2023-12-06 19:38:54   INFO  epoch: 36/72, acc_iter=141432, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:16, time_cost(all): 1 day, 8:43:39/1 day, 7:34:14, loss=0.433735741605235, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=1.0614331495949418, lr=0.24100845362178253
2023-12-06 19:39:36   INFO  epoch: 36/72, acc_iter=141482, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:14, time_cost(all): 1 day, 8:44:21/1 day, 7:00:25, loss=0.433676544179294, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=0.668528558955134, lr=0.2408944865464147
2023-12-06 19:40:18   INFO  epoch: 36/72, acc_iter=141532, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:13, time_cost(all): 1 day, 8:45:03/1 day, 7:30:58, loss=0.433617346753353, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=2.2334648206314944, lr=0.2407805194710469
2023-12-06 19:40:59   INFO  epoch: 36/72, acc_iter=141582, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:27, time_cost(all): 1 day, 8:45:44/1 day, 9:01:27, loss=0.433558149327412, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=4.465064830972568, lr=0.24066655239567913
2023-12-06 19:41:41   INFO  epoch: 36/72, acc_iter=141632, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:15, time_cost(all): 1 day, 8:46:26/1 day, 6:24:56, loss=0.433498951901471, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=1.0752209157250248, lr=0.24055258532031132
2023-12-06 19:42:23   INFO  epoch: 36/72, acc_iter=141682, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:25, time_cost(all): 1 day, 8:47:08/1 day, 6:41:23, loss=0.43343975447553, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=1.3053144292767478, lr=0.2404386182449435
2023-12-06 19:43:05   INFO  epoch: 36/72, acc_iter=141732, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:57, time_cost(all): 1 day, 8:47:50/1 day, 8:02:42, loss=0.433380557049589, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=1.711585301473205, lr=0.24032465116957574
2023-12-06 19:43:46   INFO  epoch: 36/72, acc_iter=141782, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:52, time_cost(all): 1 day, 8:48:31/1 day, 7:37:33, loss=0.433321359623648, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=2.329487955927708, lr=0.24021068409420793
2023-12-06 19:44:28   INFO  epoch: 36/72, acc_iter=141832, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:11, time_cost(all): 1 day, 8:49:13/1 day, 7:03:18, loss=0.433262162197707, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.5171165374299564, lr=0.24009671701884017
2023-12-06 19:45:10   INFO  epoch: 36/72, acc_iter=141882, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:18, time_cost(all): 1 day, 8:49:55/1 day, 7:04:19, loss=0.433202964771767, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=3.4100088508031496, lr=0.23998274994347235
2023-12-06 19:45:52   INFO  epoch: 36/72, acc_iter=141932, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:40, time_cost(all): 1 day, 8:50:37/1 day, 6:29:41, loss=0.433143767345826, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=3.3502775525608444, lr=0.23986878286810454
2023-12-06 19:46:34   INFO  epoch: 36/72, acc_iter=141982, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:46, time_cost(all): 1 day, 8:51:19/1 day, 6:08:39, loss=0.433084569919885, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=0.5436258887737236, lr=0.23975481579273678
2023-12-06 19:47:15   INFO  epoch: 36/72, acc_iter=142032, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:14, time_cost(all): 1 day, 8:52:00/1 day, 6:52:49, loss=0.433025372493944, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=2.1277719529404395, lr=0.23964084871736901
2023-12-06 19:47:57   INFO  epoch: 36/72, acc_iter=142082, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:57, time_cost(all): 1 day, 8:52:42/1 day, 8:51:00, loss=0.432966175068003, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=1.7688583351178115, lr=0.23952688164200114
2023-12-06 19:48:39   INFO  epoch: 36/72, acc_iter=142132, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:13, time_cost(all): 1 day, 8:53:24/1 day, 6:26:48, loss=0.432906977642062, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=4.552164469378889, lr=0.23941291456663338
2023-12-06 19:49:21   INFO  epoch: 36/72, acc_iter=142182, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:30, time_cost(all): 1 day, 8:54:06/1 day, 7:57:45, loss=0.432847780216121, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=0.7946366367734635, lr=0.23929894749126557
2023-12-06 19:50:02   INFO  epoch: 36/72, acc_iter=142232, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:23, time_cost(all): 1 day, 8:54:47/1 day, 8:02:37, loss=0.43278858279018, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=2.847276239342615, lr=0.2391849804158978
2023-12-06 19:50:44   INFO  epoch: 36/72, acc_iter=142282, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:24, time_cost(all): 1 day, 8:55:29/1 day, 7:43:53, loss=0.432729385364239, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=1.405684202673214, lr=0.23907101334053
2023-12-06 19:51:26   INFO  epoch: 36/72, acc_iter=142332, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:52, time_cost(all): 1 day, 8:56:11/1 day, 8:48:45, loss=0.432670187938298, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=2.5901481175464527, lr=0.23895704626516218
2023-12-06 19:52:08   INFO  epoch: 36/72, acc_iter=142382, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:02, time_cost(all): 1 day, 8:56:53/1 day, 7:53:52, loss=0.432610990512357, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=3.760547247387167, lr=0.23884307918979442
2023-12-06 19:52:50   INFO  epoch: 36/72, acc_iter=142432, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:41, time_cost(all): 1 day, 8:57:35/1 day, 7:21:33, loss=0.432551793086416, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=3.1688237631907117, lr=0.2387291121144266
2023-12-06 19:53:31   INFO  epoch: 36/72, acc_iter=142482, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:56, time_cost(all): 1 day, 8:58:16/1 day, 8:29:32, loss=0.432492595660475, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=3.0173296716581626, lr=0.23861514503905878
2023-12-06 19:54:13   INFO  epoch: 36/72, acc_iter=142532, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:51, time_cost(all): 1 day, 8:58:58/1 day, 6:47:11, loss=0.432433398234534, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=1.3194653460061834, lr=0.23850117796369102
2023-12-06 19:54:55   INFO  epoch: 36/72, acc_iter=142582, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:13, time_cost(all): 1 day, 8:59:40/1 day, 6:15:43, loss=0.432374200808593, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=3.2130142198438754, lr=0.23838721088832326
2023-12-06 19:55:37   INFO  epoch: 36/72, acc_iter=142632, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:30, time_cost(all): 1 day, 9:00:22/1 day, 6:11:20, loss=0.432315003382652, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=1.06565514815915, lr=0.2382732438129554
2023-12-06 19:56:18   INFO  epoch: 36/72, acc_iter=142682, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:53, time_cost(all): 1 day, 9:01:03/1 day, 5:57:13, loss=0.432255805956711, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=4.844914327649548, lr=0.23815927673758763
2023-12-06 19:57:00   INFO  epoch: 36/72, acc_iter=142732, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:17, time_cost(all): 1 day, 9:01:45/1 day, 8:12:50, loss=0.43219660853077, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=3.3361760243654555, lr=0.23804530966221987
2023-12-06 19:57:42   INFO  epoch: 36/72, acc_iter=142782, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 1 day, 9:02:27/1 day, 8:00:08, loss=0.43213741110483, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=1.0663265695965742, lr=0.23793134258685206
2023-12-06 19:58:24   INFO  epoch: 36/72, acc_iter=142832, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 1 day, 9:03:09/1 day, 7:35:09, loss=0.432078213678889, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=2.5111525040974993, lr=0.23781737551148424
2023-12-06 19:59:06   INFO  epoch: 36/72, acc_iter=142882, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 9:03:51/1 day, 8:41:43, loss=0.432019016252948, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=3.2425820193053205, lr=0.23770340843611648
2023-12-06 19:59:47   INFO  epoch: 37/72, acc_iter=142944, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:22, time_cost(all): 1 day, 9:04:32/1 day, 6:46:23, loss=0.431945611444781, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=2.450955551778873, lr=0.23756208926266037
2023-12-06 20:00:29   INFO  epoch: 37/72, acc_iter=142994, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:07, time_cost(all): 1 day, 9:05:14/1 day, 6:00:27, loss=0.43188641401884, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=1.3580182653055264, lr=0.2374481221872926
2023-12-06 20:01:11   INFO  epoch: 37/72, acc_iter=143044, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:27, time_cost(all): 1 day, 9:05:56/1 day, 7:00:24, loss=0.431827216592899, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=0.7451761054231085, lr=0.2373341551119248
2023-12-06 20:01:53   INFO  epoch: 37/72, acc_iter=143094, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:13, time_cost(all): 1 day, 9:06:38/1 day, 8:18:49, loss=0.431768019166958, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=2.711547976010851, lr=0.23722018803655703
2023-12-06 20:02:34   INFO  epoch: 37/72, acc_iter=143144, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:56, time_cost(all): 1 day, 9:07:19/1 day, 8:19:44, loss=0.431708821741017, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=2.0809953048893197, lr=0.23710622096118922
2023-12-06 20:03:16   INFO  epoch: 37/72, acc_iter=143194, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:07, time_cost(all): 1 day, 9:08:01/1 day, 8:11:07, loss=0.431649624315076, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=0.9879119369805709, lr=0.2369922538858214
2023-12-06 20:03:58   INFO  epoch: 37/72, acc_iter=143244, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:36, time_cost(all): 1 day, 9:08:43/1 day, 6:28:24, loss=0.431590426889135, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=3.8245716055492105, lr=0.23687828681045364
2023-12-06 20:04:40   INFO  epoch: 37/72, acc_iter=143294, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:29, time_cost(all): 1 day, 9:09:25/1 day, 7:06:33, loss=0.431531229463194, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=0.9901285995351479, lr=0.23676431973508588
2023-12-06 20:05:22   INFO  epoch: 37/72, acc_iter=143344, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:40, time_cost(all): 1 day, 9:10:07/1 day, 8:28:56, loss=0.431472032037253, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=3.116039970644258, lr=0.236650352659718
2023-12-06 20:06:03   INFO  epoch: 37/72, acc_iter=143394, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:16, time_cost(all): 1 day, 9:10:48/1 day, 7:14:26, loss=0.431412834611312, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=2.650251591900548, lr=0.23653638558435025
2023-12-06 20:06:45   INFO  epoch: 37/72, acc_iter=143444, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:21, time_cost(all): 1 day, 9:11:30/1 day, 8:46:43, loss=0.431353637185372, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=1.9157229516185605, lr=0.23642241850898243
2023-12-06 20:07:27   INFO  epoch: 37/72, acc_iter=143494, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:55, time_cost(all): 1 day, 9:12:12/1 day, 7:05:05, loss=0.431294439759431, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=3.1543094947553145, lr=0.23630845143361467
2023-12-06 20:08:09   INFO  epoch: 37/72, acc_iter=143544, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:45, time_cost(all): 1 day, 9:12:54/1 day, 7:15:03, loss=0.43123524233349, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=4.765646199987442, lr=0.23619448435824686
2023-12-06 20:08:50   INFO  epoch: 37/72, acc_iter=143594, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:02, time_cost(all): 1 day, 9:13:35/1 day, 5:57:13, loss=0.431176044907549, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=3.953287075154883, lr=0.23608051728287904
2023-12-06 20:09:32   INFO  epoch: 37/72, acc_iter=143644, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:13, time_cost(all): 1 day, 9:14:17/1 day, 6:14:42, loss=0.431116847481608, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=0.7341333622037924, lr=0.23596655020751128
2023-12-06 20:10:14   INFO  epoch: 37/72, acc_iter=143694, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:12, time_cost(all): 1 day, 9:14:59/1 day, 6:06:21, loss=0.431057650055667, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=1.543507526812463, lr=0.23585258313214347
2023-12-06 20:10:56   INFO  epoch: 37/72, acc_iter=143744, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:07, time_cost(all): 1 day, 9:15:41/1 day, 6:43:18, loss=0.430998452629726, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=3.7297094345827064, lr=0.23573861605677565
2023-12-06 20:11:38   INFO  epoch: 37/72, acc_iter=143794, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:56, time_cost(all): 1 day, 9:16:23/1 day, 7:23:24, loss=0.430939255203785, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.23(1.03), norm=3.5894578768988326, lr=0.2356246489814079
2023-12-06 20:12:19   INFO  epoch: 37/72, acc_iter=143844, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:36, time_cost(all): 1 day, 9:17:04/1 day, 7:22:00, loss=0.430880057777844, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=4.93533828907725, lr=0.23551068190604013
2023-12-06 20:13:01   INFO  epoch: 37/72, acc_iter=143894, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:19, time_cost(all): 1 day, 9:17:46/1 day, 8:33:42, loss=0.430820860351903, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=3.331487718461815, lr=0.23539671483067226
2023-12-06 20:13:43   INFO  epoch: 37/72, acc_iter=143944, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:39, time_cost(all): 1 day, 9:18:28/1 day, 8:02:31, loss=0.430761662925962, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=0.7464883458501155, lr=0.2352827477553045
2023-12-06 20:14:25   INFO  epoch: 37/72, acc_iter=143994, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:55, time_cost(all): 1 day, 9:19:10/1 day, 7:18:25, loss=0.430702465500021, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=4.148445807600208, lr=0.23516878067993674
2023-12-06 20:15:07   INFO  epoch: 37/72, acc_iter=144044, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:03, time_cost(all): 1 day, 9:19:52/1 day, 8:01:54, loss=0.43064326807408, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=2.062089730679251, lr=0.23505481360456892
2023-12-06 20:15:48   INFO  epoch: 37/72, acc_iter=144094, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:43, time_cost(all): 1 day, 9:20:33/1 day, 8:05:16, loss=0.430584070648139, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=3.2616001365491663, lr=0.2349408465292011
2023-12-06 20:16:30   INFO  epoch: 37/72, acc_iter=144144, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:04, time_cost(all): 1 day, 9:21:15/1 day, 7:12:54, loss=0.430524873222198, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=2.506911731111425, lr=0.23482687945383335
2023-12-06 20:17:12   INFO  epoch: 37/72, acc_iter=144194, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:13, time_cost(all): 1 day, 9:21:57/1 day, 8:12:24, loss=0.430465675796257, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=2.162879552188417, lr=0.23471291237846553
2023-12-06 20:17:54   INFO  epoch: 37/72, acc_iter=144244, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:03, time_cost(all): 1 day, 9:22:39/1 day, 7:43:12, loss=0.430406478370316, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=3.732214116780314, lr=0.23459894530309777
2023-12-06 20:18:35   INFO  epoch: 37/72, acc_iter=144294, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:32:38, time_cost(all): 1 day, 9:23:20/1 day, 6:34:57, loss=0.430347280944376, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=2.8307072884057187, lr=0.2344849782277299
2023-12-06 20:19:17   INFO  epoch: 37/72, acc_iter=144344, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:07, time_cost(all): 1 day, 9:24:02/1 day, 6:57:57, loss=0.430288083518435, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=4.332222684913017, lr=0.23437101115236214
2023-12-06 20:19:59   INFO  epoch: 37/72, acc_iter=144394, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:35, time_cost(all): 1 day, 9:24:44/1 day, 5:51:34, loss=0.430228886092494, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.6550190330991594, lr=0.23425704407699438
2023-12-06 20:20:41   INFO  epoch: 37/72, acc_iter=144444, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:34, time_cost(all): 1 day, 9:25:26/1 day, 6:27:14, loss=0.430169688666553, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=2.9036961121114695, lr=0.23414307700162657
2023-12-06 20:21:23   INFO  epoch: 37/72, acc_iter=144494, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:10, time_cost(all): 1 day, 9:26:08/1 day, 7:03:00, loss=0.430110491240612, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=1.8197207953765289, lr=0.23402910992625875
2023-12-06 20:22:04   INFO  epoch: 37/72, acc_iter=144544, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:54, time_cost(all): 1 day, 9:26:49/1 day, 6:26:04, loss=0.430051293814671, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=4.189943514336577, lr=0.233915142850891
2023-12-06 20:22:46   INFO  epoch: 37/72, acc_iter=144594, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:13, time_cost(all): 1 day, 9:27:31/1 day, 8:36:59, loss=0.42999209638873, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=3.344982095868037, lr=0.23380117577552317
2023-12-06 20:23:28   INFO  epoch: 37/72, acc_iter=144644, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:21, time_cost(all): 1 day, 9:28:13/1 day, 8:35:00, loss=0.429932898962789, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=0.649182892449452, lr=0.23368720870015536
2023-12-06 20:24:10   INFO  epoch: 37/72, acc_iter=144694, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:51, time_cost(all): 1 day, 9:28:55/1 day, 8:25:05, loss=0.429873701536848, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=1.5627560158056495, lr=0.2335732416247876
2023-12-06 20:24:51   INFO  epoch: 37/72, acc_iter=144744, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:51, time_cost(all): 1 day, 9:29:36/1 day, 7:31:27, loss=0.429814504110907, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=0.7030664817260582, lr=0.23345927454941978
2023-12-06 20:25:33   INFO  epoch: 37/72, acc_iter=144794, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:52, time_cost(all): 1 day, 9:30:18/1 day, 5:39:34, loss=0.429755306684966, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=0.5112831330075434, lr=0.23334530747405202
2023-12-06 20:26:15   INFO  epoch: 37/72, acc_iter=144844, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:24, time_cost(all): 1 day, 9:31:00/1 day, 8:28:57, loss=0.429696109259025, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=2.0558142468197236, lr=0.2332313403986842
2023-12-06 20:26:57   INFO  epoch: 37/72, acc_iter=144894, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:21, time_cost(all): 1 day, 9:31:42/1 day, 6:34:02, loss=0.429636911833084, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=2.0363436051258903, lr=0.2331173733233164
2023-12-06 20:27:39   INFO  epoch: 37/72, acc_iter=144944, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:55, time_cost(all): 1 day, 9:32:24/1 day, 6:57:14, loss=0.429577714407143, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=2.606525698291121, lr=0.23300340624794863
2023-12-06 20:28:20   INFO  epoch: 37/72, acc_iter=144994, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:10, time_cost(all): 1 day, 9:33:05/1 day, 7:09:24, loss=0.429518516981202, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=4.475464037992161, lr=0.23288943917258081
2023-12-06 20:29:02   INFO  epoch: 37/72, acc_iter=145044, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:48, time_cost(all): 1 day, 9:33:47/1 day, 8:26:59, loss=0.429459319555261, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=2.7751515099264563, lr=0.232775472097213
2023-12-06 20:29:44   INFO  epoch: 37/72, acc_iter=145094, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:43, time_cost(all): 1 day, 9:34:29/1 day, 7:46:14, loss=0.42940012212932, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=0.6232566110414288, lr=0.23266150502184524
2023-12-06 20:30:26   INFO  epoch: 37/72, acc_iter=145144, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:03, time_cost(all): 1 day, 9:35:11/1 day, 5:57:18, loss=0.429340924703379, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=1.6103255497172635, lr=0.23254753794647742
2023-12-06 20:31:07   INFO  epoch: 37/72, acc_iter=145194, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:42, time_cost(all): 1 day, 9:35:52/1 day, 6:13:20, loss=0.429281727277439, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.2736034453165144, lr=0.23243357087110966
2023-12-06 20:31:49   INFO  epoch: 37/72, acc_iter=145244, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:49, time_cost(all): 1 day, 9:36:34/1 day, 8:15:25, loss=0.429222529851498, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=2.911530911331097, lr=0.23231960379574185
2023-12-06 20:32:31   INFO  epoch: 37/72, acc_iter=145294, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:24, time_cost(all): 1 day, 9:37:16/1 day, 6:49:05, loss=0.429163332425557, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=0.8999902850112806, lr=0.23220563672037403
2023-12-06 20:33:13   INFO  epoch: 37/72, acc_iter=145344, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:41, time_cost(all): 1 day, 9:37:58/1 day, 5:35:38, loss=0.429104134999616, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=3.1862147603178457, lr=0.23209166964500627
2023-12-06 20:33:55   INFO  epoch: 37/72, acc_iter=145394, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:49, time_cost(all): 1 day, 9:38:40/1 day, 5:53:14, loss=0.429044937573675, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=2.306071930925996, lr=0.23197770256963846
2023-12-06 20:34:36   INFO  epoch: 37/72, acc_iter=145444, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:13, time_cost(all): 1 day, 9:39:21/1 day, 5:42:35, loss=0.428985740147734, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=2.3765668871576464, lr=0.23186373549427064
2023-12-06 20:35:18   INFO  epoch: 37/72, acc_iter=145494, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:54, time_cost(all): 1 day, 9:40:03/1 day, 6:19:25, loss=0.428926542721793, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=1.621323648335053, lr=0.23174976841890288
2023-12-06 20:36:00   INFO  epoch: 37/72, acc_iter=145544, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:16, time_cost(all): 1 day, 9:40:45/1 day, 8:12:13, loss=0.428867345295852, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=4.60735940853039, lr=0.23163580134353512
2023-12-06 20:36:42   INFO  epoch: 37/72, acc_iter=145594, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:35, time_cost(all): 1 day, 9:41:27/1 day, 7:16:46, loss=0.428808147869911, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=0.7256849427139933, lr=0.23152183426816725
2023-12-06 20:37:23   INFO  epoch: 37/72, acc_iter=145644, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:07, time_cost(all): 1 day, 9:42:08/1 day, 6:47:06, loss=0.42874895044397, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=3.152294297098982, lr=0.2314078671927995
2023-12-06 20:38:05   INFO  epoch: 37/72, acc_iter=145694, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:27, time_cost(all): 1 day, 9:42:50/1 day, 7:54:02, loss=0.428689753018029, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=0.5220172520959836, lr=0.23129390011743173
2023-12-06 20:38:47   INFO  epoch: 37/72, acc_iter=145744, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:36, time_cost(all): 1 day, 9:43:32/1 day, 7:00:04, loss=0.428630555592088, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=1.816151706375145, lr=0.2311799330420639
2023-12-06 20:39:29   INFO  epoch: 37/72, acc_iter=145794, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:35, time_cost(all): 1 day, 9:44:14/1 day, 7:40:18, loss=0.428571358166147, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=2.786006810247057, lr=0.2310659659666961
2023-12-06 20:40:11   INFO  epoch: 37/72, acc_iter=145844, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:17, time_cost(all): 1 day, 9:44:56/1 day, 6:52:43, loss=0.428512160740206, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=3.1003072099963087, lr=0.23095199889132828
2023-12-06 20:40:52   INFO  epoch: 37/72, acc_iter=145894, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:40, time_cost(all): 1 day, 9:45:37/1 day, 7:41:35, loss=0.428452963314265, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=2.780565019453516, lr=0.23083803181596052
2023-12-06 20:41:34   INFO  epoch: 37/72, acc_iter=145944, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:02, time_cost(all): 1 day, 9:46:19/1 day, 7:56:29, loss=0.428393765888324, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=2.269159044106631, lr=0.23072406474059276
2023-12-06 20:42:16   INFO  epoch: 37/72, acc_iter=145994, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:07, time_cost(all): 1 day, 9:47:01/1 day, 6:33:33, loss=0.428334568462383, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=0.7310136340921343, lr=0.2306100976652249
2023-12-06 20:42:58   INFO  epoch: 37/72, acc_iter=146044, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:25, time_cost(all): 1 day, 9:47:43/1 day, 5:34:01, loss=0.428275371036443, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=1.8965471773874063, lr=0.23049613058985713
2023-12-06 20:43:39   INFO  epoch: 37/72, acc_iter=146094, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:20, time_cost(all): 1 day, 9:48:24/1 day, 5:50:10, loss=0.428216173610502, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.05(1.03), norm=3.254424459684777, lr=0.23038216351448937
2023-12-06 20:44:21   INFO  epoch: 37/72, acc_iter=146144, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:33, time_cost(all): 1 day, 9:49:06/1 day, 7:35:13, loss=0.428156976184561, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=2.5719004355735215, lr=0.23026819643912155
2023-12-06 20:45:03   INFO  epoch: 37/72, acc_iter=146194, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:03, time_cost(all): 1 day, 9:49:48/1 day, 6:15:56, loss=0.42809777875862, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=4.60259263057676, lr=0.23015422936375374
2023-12-06 20:45:45   INFO  epoch: 37/72, acc_iter=146244, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:17, time_cost(all): 1 day, 9:50:30/1 day, 6:11:10, loss=0.428038581332679, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=4.681315400378408, lr=0.23004026228838598
2023-12-06 20:46:27   INFO  epoch: 37/72, acc_iter=146294, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:11, time_cost(all): 1 day, 9:51:12/1 day, 7:56:33, loss=0.427979383906738, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=3.646323334677632, lr=0.22992629521301816
2023-12-06 20:47:08   INFO  epoch: 37/72, acc_iter=146344, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:06:01, time_cost(all): 1 day, 9:51:53/1 day, 7:23:38, loss=0.427920186480797, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=0.6787383485480611, lr=0.22981232813765035
2023-12-06 20:47:50   INFO  epoch: 37/72, acc_iter=146394, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:01, time_cost(all): 1 day, 9:52:35/1 day, 5:36:36, loss=0.427860989054856, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=2.9782352928829927, lr=0.22969836106228259
2023-12-06 20:48:32   INFO  epoch: 37/72, acc_iter=146444, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:21, time_cost(all): 1 day, 9:53:17/1 day, 6:12:59, loss=0.427801791628915, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=1.2112227751669333, lr=0.22958439398691477
2023-12-06 20:49:14   INFO  epoch: 37/72, acc_iter=146494, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:36, time_cost(all): 1 day, 9:53:59/1 day, 6:02:04, loss=0.427742594202974, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=4.275525568548455, lr=0.229470426911547
2023-12-06 20:49:56   INFO  epoch: 37/72, acc_iter=146544, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:50, time_cost(all): 1 day, 9:54:41/1 day, 6:21:41, loss=0.427683396777033, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=4.058034938123155, lr=0.2293564598361792
2023-12-06 20:50:37   INFO  epoch: 37/72, acc_iter=146594, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:13, time_cost(all): 1 day, 9:55:22/1 day, 6:16:53, loss=0.427624199351092, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=3.175862688897081, lr=0.22924249276081138
2023-12-06 20:51:19   INFO  epoch: 37/72, acc_iter=146644, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:32, time_cost(all): 1 day, 9:56:04/1 day, 6:52:25, loss=0.427565001925151, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.02(1.03), norm=1.7916585773030949, lr=0.22912852568544362
2023-12-06 20:52:01   INFO  epoch: 37/72, acc_iter=146694, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 1 day, 9:56:46/1 day, 7:27:16, loss=0.42750580449921, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=3.7121288121746963, lr=0.2290145586100758
2023-12-06 20:52:43   INFO  epoch: 37/72, acc_iter=146744, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 9:57:28/1 day, 7:04:58, loss=0.427446607073269, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=0.5408358289837458, lr=0.228900591534708
2023-12-06 20:53:24   INFO  epoch: 38/72, acc_iter=146806, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:37, time_cost(all): 1 day, 9:58:09/1 day, 6:00:06, loss=0.427373202265103, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=2.097721669116698, lr=0.228759272361252
2023-12-06 20:54:06   INFO  epoch: 38/72, acc_iter=146856, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:11, time_cost(all): 1 day, 9:58:51/1 day, 5:40:46, loss=0.427314004839162, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=3.0003709837049346, lr=0.22864530528588412
2023-12-06 20:54:48   INFO  epoch: 38/72, acc_iter=146906, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:25, time_cost(all): 1 day, 9:59:33/1 day, 5:45:33, loss=0.427254807413221, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=1.2239074506991523, lr=0.22853133821051635
2023-12-06 20:55:30   INFO  epoch: 38/72, acc_iter=146956, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:29, time_cost(all): 1 day, 10:00:15/1 day, 7:27:04, loss=0.42719560998728, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=3.9218395250968223, lr=0.2284173711351486
2023-12-06 20:56:12   INFO  epoch: 38/72, acc_iter=147006, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:23, time_cost(all): 1 day, 10:00:57/1 day, 6:48:59, loss=0.427136412561339, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=4.508092707201066, lr=0.22830340405978078
2023-12-06 20:56:53   INFO  epoch: 38/72, acc_iter=147056, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:04, time_cost(all): 1 day, 10:01:38/1 day, 5:28:32, loss=0.427077215135398, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=4.4439204845984275, lr=0.22818943698441296
2023-12-06 20:57:35   INFO  epoch: 38/72, acc_iter=147106, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:56, time_cost(all): 1 day, 10:02:20/1 day, 6:00:59, loss=0.427018017709457, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=3.3462133027667487, lr=0.22807546990904515
2023-12-06 20:58:17   INFO  epoch: 38/72, acc_iter=147156, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:10, time_cost(all): 1 day, 10:03:02/1 day, 7:48:48, loss=0.426958820283516, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=2.263438049041048, lr=0.2279615028336774
2023-12-06 20:58:59   INFO  epoch: 38/72, acc_iter=147206, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:28, time_cost(all): 1 day, 10:03:44/1 day, 6:46:28, loss=0.426899622857575, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=4.1177937156569, lr=0.22784753575830963
2023-12-06 20:59:40   INFO  epoch: 38/72, acc_iter=147256, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:49:06, time_cost(all): 1 day, 10:04:25/1 day, 7:07:25, loss=0.426840425431634, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=3.1263842783519715, lr=0.22773356868294176
2023-12-06 21:00:22   INFO  epoch: 38/72, acc_iter=147306, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:26, time_cost(all): 1 day, 10:05:07/1 day, 7:01:33, loss=0.426781228005693, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=4.299405674987154, lr=0.227619601607574
2023-12-06 21:01:04   INFO  epoch: 38/72, acc_iter=147356, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:06, time_cost(all): 1 day, 10:05:49/1 day, 4:56:31, loss=0.426722030579752, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=1.736346690000035, lr=0.22750563453220624
2023-12-06 21:01:46   INFO  epoch: 38/72, acc_iter=147406, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:36, time_cost(all): 1 day, 10:06:31/1 day, 7:32:22, loss=0.426662833153811, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=2.7440978230153816, lr=0.22739166745683842
2023-12-06 21:02:28   INFO  epoch: 38/72, acc_iter=147456, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:03, time_cost(all): 1 day, 10:07:13/1 day, 7:16:36, loss=0.42660363572787, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=4.2597364799154835, lr=0.2272777003814706
2023-12-06 21:03:09   INFO  epoch: 38/72, acc_iter=147506, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:25, time_cost(all): 1 day, 10:07:54/1 day, 6:53:24, loss=0.426544438301929, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=4.161415422736048, lr=0.22716373330610284
2023-12-06 21:03:51   INFO  epoch: 38/72, acc_iter=147556, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:17, time_cost(all): 1 day, 10:08:36/1 day, 7:30:03, loss=0.426485240875988, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=2.198877094762045, lr=0.22704976623073503
2023-12-06 21:04:33   INFO  epoch: 38/72, acc_iter=147606, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:46, time_cost(all): 1 day, 10:09:18/1 day, 6:12:57, loss=0.426426043450048, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.23(1.03), norm=0.9130410296900713, lr=0.2269357991553672
2023-12-06 21:05:15   INFO  epoch: 38/72, acc_iter=147656, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:32, time_cost(all): 1 day, 10:10:00/1 day, 7:36:33, loss=0.426366846024107, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=2.2430535903063804, lr=0.22682183207999945
2023-12-06 21:05:56   INFO  epoch: 38/72, acc_iter=147706, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:36, time_cost(all): 1 day, 10:10:41/1 day, 7:13:44, loss=0.426307648598166, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=2.0801600271087093, lr=0.22670786500463164
2023-12-06 21:06:38   INFO  epoch: 38/72, acc_iter=147756, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:29, time_cost(all): 1 day, 10:11:23/1 day, 5:19:57, loss=0.426248451172225, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=4.306566196695189, lr=0.22659389792926388
2023-12-06 21:07:20   INFO  epoch: 38/72, acc_iter=147806, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:30, time_cost(all): 1 day, 10:12:05/1 day, 7:00:05, loss=0.426189253746284, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=3.304092795992899, lr=0.226479930853896
2023-12-06 21:08:02   INFO  epoch: 38/72, acc_iter=147856, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:47, time_cost(all): 1 day, 10:12:47/1 day, 5:34:02, loss=0.426130056320343, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=2.761188608553994, lr=0.22636596377852825
2023-12-06 21:08:44   INFO  epoch: 38/72, acc_iter=147906, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:34, time_cost(all): 1 day, 10:13:29/1 day, 6:34:10, loss=0.426070858894402, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=1.4210541258176608, lr=0.22625199670316049
2023-12-06 21:09:25   INFO  epoch: 38/72, acc_iter=147956, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:54, time_cost(all): 1 day, 10:14:10/1 day, 5:10:30, loss=0.426011661468461, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=4.7543762766127475, lr=0.22613802962779267
2023-12-06 21:10:07   INFO  epoch: 38/72, acc_iter=148006, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:38:07, time_cost(all): 1 day, 10:14:52/1 day, 4:51:50, loss=0.42595246404252, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=3.953566877543892, lr=0.22602406255242485
2023-12-06 21:10:49   INFO  epoch: 38/72, acc_iter=148056, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:51, time_cost(all): 1 day, 10:15:34/1 day, 7:28:06, loss=0.425893266616579, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=1.7060494435177316, lr=0.2259100954770571
2023-12-06 21:11:31   INFO  epoch: 38/72, acc_iter=148106, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:11, time_cost(all): 1 day, 10:16:16/1 day, 5:55:14, loss=0.425834069190638, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=1.3790421740810566, lr=0.22579612840168928
2023-12-06 21:12:12   INFO  epoch: 38/72, acc_iter=148156, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:20, time_cost(all): 1 day, 10:16:57/1 day, 5:49:05, loss=0.425774871764697, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=3.6158117794444817, lr=0.22568216132632152
2023-12-06 21:12:54   INFO  epoch: 38/72, acc_iter=148206, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:16, time_cost(all): 1 day, 10:17:39/1 day, 6:02:06, loss=0.425715674338756, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=1.851791543965955, lr=0.2255681942509537
2023-12-06 21:13:36   INFO  epoch: 38/72, acc_iter=148256, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:39, time_cost(all): 1 day, 10:18:21/1 day, 6:59:04, loss=0.425656476912815, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=2.014661509058511, lr=0.2254542271755859
2023-12-06 21:14:18   INFO  epoch: 38/72, acc_iter=148306, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:33, time_cost(all): 1 day, 10:19:03/1 day, 5:50:51, loss=0.425597279486874, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=3.2547060393353684, lr=0.22534026010021813
2023-12-06 21:15:00   INFO  epoch: 38/72, acc_iter=148356, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:43, time_cost(all): 1 day, 10:19:45/1 day, 4:47:53, loss=0.425538082060933, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=0.7917588382835479, lr=0.2252262930248503
2023-12-06 21:15:41   INFO  epoch: 38/72, acc_iter=148406, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:14, time_cost(all): 1 day, 10:20:26/1 day, 6:02:49, loss=0.425478884634993, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=4.026299256190811, lr=0.2251123259494825
2023-12-06 21:16:23   INFO  epoch: 38/72, acc_iter=148456, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:24, time_cost(all): 1 day, 10:21:08/1 day, 6:56:34, loss=0.425419687209052, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=4.31663124036581, lr=0.22499835887411473
2023-12-06 21:17:05   INFO  epoch: 38/72, acc_iter=148506, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:22, time_cost(all): 1 day, 10:21:50/1 day, 5:26:37, loss=0.425360489783111, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=4.765463871380782, lr=0.22488439179874697
2023-12-06 21:17:47   INFO  epoch: 38/72, acc_iter=148556, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:41, time_cost(all): 1 day, 10:22:32/1 day, 5:24:33, loss=0.42530129235717, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=2.1913475597251013, lr=0.2247704247233791
2023-12-06 21:18:28   INFO  epoch: 38/72, acc_iter=148606, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:47, time_cost(all): 1 day, 10:23:13/1 day, 6:47:07, loss=0.425242094931229, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=1.0493644313442578, lr=0.22465645764801134
2023-12-06 21:19:10   INFO  epoch: 38/72, acc_iter=148656, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:23, time_cost(all): 1 day, 10:23:55/1 day, 4:48:56, loss=0.425182897505288, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=1.428874203669491, lr=0.22454249057264353
2023-12-06 21:19:52   INFO  epoch: 38/72, acc_iter=148706, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:49, time_cost(all): 1 day, 10:24:37/1 day, 6:10:33, loss=0.425123700079347, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=4.864117841617294, lr=0.22442852349727577
2023-12-06 21:20:34   INFO  epoch: 38/72, acc_iter=148756, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:03, time_cost(all): 1 day, 10:25:19/1 day, 5:02:02, loss=0.425064502653406, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=2.430492664394758, lr=0.22431455642190795
2023-12-06 21:21:16   INFO  epoch: 38/72, acc_iter=148806, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:03, time_cost(all): 1 day, 10:26:01/1 day, 4:50:24, loss=0.425005305227465, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.23(1.03), norm=0.5341787500942302, lr=0.22420058934654014
2023-12-06 21:21:57   INFO  epoch: 38/72, acc_iter=148856, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:22, time_cost(all): 1 day, 10:26:42/1 day, 5:57:14, loss=0.424946107801524, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=4.741403596359501, lr=0.22408662227117238
2023-12-06 21:22:39   INFO  epoch: 38/72, acc_iter=148906, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:16, time_cost(all): 1 day, 10:27:24/1 day, 6:04:41, loss=0.424886910375583, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=1.5011716588813404, lr=0.22397265519580462
2023-12-06 21:23:21   INFO  epoch: 38/72, acc_iter=148956, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:55, time_cost(all): 1 day, 10:28:06/1 day, 5:50:25, loss=0.424827712949642, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=2.0024154286968114, lr=0.22385868812043674
2023-12-06 21:24:03   INFO  epoch: 38/72, acc_iter=149006, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:39, time_cost(all): 1 day, 10:28:48/1 day, 6:34:41, loss=0.424768515523701, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=4.74660900719322, lr=0.22374472104506898
2023-12-06 21:24:45   INFO  epoch: 38/72, acc_iter=149056, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:03, time_cost(all): 1 day, 10:29:30/1 day, 7:21:46, loss=0.42470931809776, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=2.150738606060495, lr=0.22363075396970122
2023-12-06 21:25:26   INFO  epoch: 38/72, acc_iter=149106, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:25, time_cost(all): 1 day, 10:30:11/1 day, 5:36:31, loss=0.424650120671819, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=0.7868039581990199, lr=0.2235167868943334
2023-12-06 21:26:08   INFO  epoch: 38/72, acc_iter=149156, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:12, time_cost(all): 1 day, 10:30:53/1 day, 6:02:05, loss=0.424590923245878, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=4.554834537471137, lr=0.2234028198189656
2023-12-06 21:26:50   INFO  epoch: 38/72, acc_iter=149206, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:02, time_cost(all): 1 day, 10:31:35/1 day, 6:16:32, loss=0.424531725819937, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=1.2891977761458886, lr=0.22328885274359783
2023-12-06 21:27:32   INFO  epoch: 38/72, acc_iter=149256, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:43, time_cost(all): 1 day, 10:32:17/1 day, 6:32:28, loss=0.424472528393997, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=1.721274845118638, lr=0.22317488566823002
2023-12-06 21:28:13   INFO  epoch: 38/72, acc_iter=149306, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:11, time_cost(all): 1 day, 10:32:58/1 day, 4:56:22, loss=0.424413330968056, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=3.2721126121251025, lr=0.2230609185928622
2023-12-06 21:28:55   INFO  epoch: 38/72, acc_iter=149356, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:51, time_cost(all): 1 day, 10:33:40/1 day, 6:50:10, loss=0.424354133542115, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.4901499231573136, lr=0.22294695151749444
2023-12-06 21:29:37   INFO  epoch: 38/72, acc_iter=149406, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:13, time_cost(all): 1 day, 10:34:22/1 day, 6:57:12, loss=0.424294936116174, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=3.8101976842180267, lr=0.22283298444212662
2023-12-06 21:30:19   INFO  epoch: 38/72, acc_iter=149456, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:49, time_cost(all): 1 day, 10:35:04/1 day, 5:43:12, loss=0.424235738690233, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=4.220925753504748, lr=0.22271901736675886
2023-12-06 21:31:01   INFO  epoch: 38/72, acc_iter=149506, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:29, time_cost(all): 1 day, 10:35:46/1 day, 7:20:29, loss=0.424176541264292, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=1.8507472834663556, lr=0.222605050291391
2023-12-06 21:31:42   INFO  epoch: 38/72, acc_iter=149556, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:10, time_cost(all): 1 day, 10:36:27/1 day, 5:07:50, loss=0.424117343838351, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=2.7555663892317535, lr=0.22249108321602323
2023-12-06 21:32:24   INFO  epoch: 38/72, acc_iter=149606, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:21, time_cost(all): 1 day, 10:37:09/1 day, 4:34:41, loss=0.42405814641241, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=2.5023004841148966, lr=0.22237711614065547
2023-12-06 21:33:06   INFO  epoch: 38/72, acc_iter=149656, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:10, time_cost(all): 1 day, 10:37:51/1 day, 6:44:14, loss=0.423998948986469, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=3.716396485672873, lr=0.22226314906528766
2023-12-06 21:33:48   INFO  epoch: 38/72, acc_iter=149706, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:53, time_cost(all): 1 day, 10:38:33/1 day, 4:45:45, loss=0.423939751560528, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=0.9080453174433374, lr=0.22214918198991984
2023-12-06 21:34:29   INFO  epoch: 38/72, acc_iter=149756, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:54, time_cost(all): 1 day, 10:39:14/1 day, 6:18:49, loss=0.423880554134587, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=0.7284550678619774, lr=0.22203521491455208
2023-12-06 21:35:11   INFO  epoch: 38/72, acc_iter=149806, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:38, time_cost(all): 1 day, 10:39:56/1 day, 5:59:46, loss=0.423821356708646, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=1.7070298314951686, lr=0.22192124783918427
2023-12-06 21:35:53   INFO  epoch: 38/72, acc_iter=149856, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:27, time_cost(all): 1 day, 10:40:38/1 day, 6:20:51, loss=0.423762159282705, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=3.5230258730800825, lr=0.2218072807638165
2023-12-06 21:36:35   INFO  epoch: 38/72, acc_iter=149906, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:02, time_cost(all): 1 day, 10:41:20/1 day, 6:00:01, loss=0.423702961856764, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=4.520225491902031, lr=0.2216933136884487
2023-12-06 21:37:17   INFO  epoch: 38/72, acc_iter=149956, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:25, time_cost(all): 1 day, 10:42:02/1 day, 5:05:47, loss=0.423643764430823, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=2.32757689380177, lr=0.22157934661308087
2023-12-06 21:37:58   INFO  epoch: 38/72, acc_iter=150006, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:08, time_cost(all): 1 day, 10:42:43/1 day, 6:14:03, loss=0.423584567004882, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=2.3356489764044532, lr=0.22146537953771311
2023-12-06 21:38:40   INFO  epoch: 38/72, acc_iter=150056, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:49, time_cost(all): 1 day, 10:43:25/1 day, 5:33:30, loss=0.423525369578941, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=2.6012483716961903, lr=0.2213514124623453
2023-12-06 21:39:22   INFO  epoch: 38/72, acc_iter=150106, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:28, time_cost(all): 1 day, 10:44:07/1 day, 5:55:00, loss=0.423466172153001, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.3540951562285923, lr=0.22123744538697748
2023-12-06 21:40:04   INFO  epoch: 38/72, acc_iter=150156, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:25, time_cost(all): 1 day, 10:44:49/1 day, 4:47:14, loss=0.42340697472706, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=4.787409965438121, lr=0.22112347831160972
2023-12-06 21:40:45   INFO  epoch: 38/72, acc_iter=150206, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:48, time_cost(all): 1 day, 10:45:30/1 day, 4:53:35, loss=0.423347777301119, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=1.8226790659210668, lr=0.2210095112362419
2023-12-06 21:41:27   INFO  epoch: 38/72, acc_iter=150256, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:03, time_cost(all): 1 day, 10:46:12/1 day, 6:17:00, loss=0.423288579875178, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=2.978699278376734, lr=0.2208955441608741
2023-12-06 21:42:09   INFO  epoch: 38/72, acc_iter=150306, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:24, time_cost(all): 1 day, 10:46:54/1 day, 6:04:03, loss=0.423229382449237, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=1.9818616425747049, lr=0.22078157708550633
2023-12-06 21:42:51   INFO  epoch: 38/72, acc_iter=150356, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:39, time_cost(all): 1 day, 10:47:36/1 day, 6:03:53, loss=0.423170185023296, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=0.7590039706651033, lr=0.22066761001013852
2023-12-06 21:43:33   INFO  epoch: 38/72, acc_iter=150406, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:05, time_cost(all): 1 day, 10:48:18/1 day, 6:34:52, loss=0.423110987597355, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=1.143598172963496, lr=0.22055364293477075
2023-12-06 21:44:14   INFO  epoch: 38/72, acc_iter=150456, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:10, time_cost(all): 1 day, 10:48:59/1 day, 5:13:50, loss=0.423051790171414, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=4.312220813259036, lr=0.22043967585940294
2023-12-06 21:44:56   INFO  epoch: 38/72, acc_iter=150506, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:34, time_cost(all): 1 day, 10:49:41/1 day, 4:13:54, loss=0.422992592745473, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=3.834406248015056, lr=0.22032570878403512
2023-12-06 21:45:38   INFO  epoch: 38/72, acc_iter=150556, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 1 day, 10:50:23/1 day, 4:39:37, loss=0.422933395319532, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=0.7640774826022887, lr=0.22021174170866736
2023-12-06 21:46:20   INFO  epoch: 38/72, acc_iter=150606, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 10:51:05/1 day, 6:06:30, loss=0.422874197893591, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=2.206458313455204, lr=0.2200977746332996
2023-12-06 21:47:01   INFO  epoch: 39/72, acc_iter=150668, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:17, time_cost(all): 1 day, 10:51:46/1 day, 6:38:56, loss=0.422800793085424, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=0.5890591205148402, lr=0.2199564554598435
2023-12-06 21:47:43   INFO  epoch: 39/72, acc_iter=150718, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:03, time_cost(all): 1 day, 10:52:28/1 day, 5:04:20, loss=0.422741595659483, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=4.725085297507917, lr=0.21984248838447573
2023-12-06 21:48:25   INFO  epoch: 39/72, acc_iter=150768, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:54, time_cost(all): 1 day, 10:53:10/1 day, 6:27:45, loss=0.422682398233542, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=0.5066214554564388, lr=0.21972852130910786
2023-12-06 21:49:07   INFO  epoch: 39/72, acc_iter=150818, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:24, time_cost(all): 1 day, 10:53:52/1 day, 6:47:43, loss=0.422623200807602, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=1.5915754013766403, lr=0.2196145542337401
2023-12-06 21:49:49   INFO  epoch: 39/72, acc_iter=150868, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:48, time_cost(all): 1 day, 10:54:34/1 day, 6:56:39, loss=0.422564003381661, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=3.416648020910419, lr=0.21950058715837234
2023-12-06 21:50:30   INFO  epoch: 39/72, acc_iter=150918, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:17, time_cost(all): 1 day, 10:55:15/1 day, 4:49:24, loss=0.42250480595572, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=3.8065049230459063, lr=0.21938662008300452
2023-12-06 21:51:12   INFO  epoch: 39/72, acc_iter=150968, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:15, time_cost(all): 1 day, 10:55:57/1 day, 4:54:54, loss=0.422445608529779, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=0.5404243937577002, lr=0.2192726530076367
2023-12-06 21:51:54   INFO  epoch: 39/72, acc_iter=151018, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:50:33, time_cost(all): 1 day, 10:56:39/1 day, 5:24:25, loss=0.422386411103838, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=2.9246362347096726, lr=0.21915868593226895
2023-12-06 21:52:36   INFO  epoch: 39/72, acc_iter=151068, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:21, time_cost(all): 1 day, 10:57:21/1 day, 4:32:08, loss=0.422327213677897, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=1.4868261423200206, lr=0.21904471885690113
2023-12-06 21:53:17   INFO  epoch: 39/72, acc_iter=151118, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:28, time_cost(all): 1 day, 10:58:02/1 day, 5:30:45, loss=0.422268016251956, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=3.3768580484138266, lr=0.21893075178153337
2023-12-06 21:53:59   INFO  epoch: 39/72, acc_iter=151168, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:11, time_cost(all): 1 day, 10:58:44/1 day, 4:44:46, loss=0.422208818826015, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=3.5816635329866044, lr=0.21881678470616556
2023-12-06 21:54:41   INFO  epoch: 39/72, acc_iter=151218, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:46, time_cost(all): 1 day, 10:59:26/1 day, 5:40:37, loss=0.422149621400074, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=2.54461136629344, lr=0.21870281763079774
2023-12-06 21:55:23   INFO  epoch: 39/72, acc_iter=151268, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:45, time_cost(all): 1 day, 11:00:08/1 day, 5:25:32, loss=0.422090423974133, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=3.650447738646145, lr=0.21858885055542998
2023-12-06 21:56:05   INFO  epoch: 39/72, acc_iter=151318, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:47, time_cost(all): 1 day, 11:00:50/1 day, 6:26:55, loss=0.422031226548192, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=0.6370093582794216, lr=0.21847488348006217
2023-12-06 21:56:46   INFO  epoch: 39/72, acc_iter=151368, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:43, time_cost(all): 1 day, 11:01:31/1 day, 6:40:46, loss=0.421972029122251, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=2.4849986853773154, lr=0.21836091640469435
2023-12-06 21:57:28   INFO  epoch: 39/72, acc_iter=151418, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:16, time_cost(all): 1 day, 11:02:13/1 day, 4:45:11, loss=0.42191283169631, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=1.1594842532123533, lr=0.2182469493293266
2023-12-06 21:58:10   INFO  epoch: 39/72, acc_iter=151468, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:09, time_cost(all): 1 day, 11:02:55/1 day, 5:31:19, loss=0.421853634270369, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.5639510897833078, lr=0.21813298225395877
2023-12-06 21:58:52   INFO  epoch: 39/72, acc_iter=151518, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:25, time_cost(all): 1 day, 11:03:37/1 day, 6:40:11, loss=0.421794436844428, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=3.434736504660247, lr=0.21801901517859096
2023-12-06 21:59:34   INFO  epoch: 39/72, acc_iter=151568, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:55, time_cost(all): 1 day, 11:04:19/1 day, 4:55:19, loss=0.421735239418487, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=2.0545983418846765, lr=0.2179050481032232
2023-12-06 22:00:15   INFO  epoch: 39/72, acc_iter=151618, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:07, time_cost(all): 1 day, 11:05:00/1 day, 6:51:33, loss=0.421676041992546, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=4.048274716688088, lr=0.21779108102785538
2023-12-06 22:00:57   INFO  epoch: 39/72, acc_iter=151668, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:37, time_cost(all): 1 day, 11:05:42/1 day, 4:51:00, loss=0.421616844566606, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=1.4352408756375163, lr=0.21767711395248762
2023-12-06 22:01:39   INFO  epoch: 39/72, acc_iter=151718, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:35, time_cost(all): 1 day, 11:06:24/1 day, 4:29:10, loss=0.421557647140665, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=2.5468549576940602, lr=0.2175631468771198
2023-12-06 22:02:21   INFO  epoch: 39/72, acc_iter=151768, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:14, time_cost(all): 1 day, 11:07:06/1 day, 6:35:25, loss=0.421498449714724, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=3.4162240302080074, lr=0.217449179801752
2023-12-06 22:03:02   INFO  epoch: 39/72, acc_iter=151818, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:44, time_cost(all): 1 day, 11:07:47/1 day, 6:30:52, loss=0.421439252288783, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=0.8708749084373126, lr=0.21733521272638423
2023-12-06 22:03:44   INFO  epoch: 39/72, acc_iter=151868, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:39, time_cost(all): 1 day, 11:08:29/1 day, 5:35:11, loss=0.421380054862842, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=0.9770794338911262, lr=0.21722124565101647
2023-12-06 22:04:26   INFO  epoch: 39/72, acc_iter=151918, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:42, time_cost(all): 1 day, 11:09:11/1 day, 5:35:49, loss=0.421320857436901, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=1.402370751103777, lr=0.2171072785756486
2023-12-06 22:05:08   INFO  epoch: 39/72, acc_iter=151968, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:39, time_cost(all): 1 day, 11:09:53/1 day, 4:49:19, loss=0.42126166001096, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=3.484932532484649, lr=0.21699331150028084
2023-12-06 22:05:50   INFO  epoch: 39/72, acc_iter=152018, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:19, time_cost(all): 1 day, 11:10:35/1 day, 4:08:15, loss=0.421202462585019, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=3.0117003758625533, lr=0.21687934442491308
2023-12-06 22:06:31   INFO  epoch: 39/72, acc_iter=152068, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:34, time_cost(all): 1 day, 11:11:16/1 day, 6:10:04, loss=0.421143265159078, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=3.3542603741895594, lr=0.21676537734954526
2023-12-06 22:07:13   INFO  epoch: 39/72, acc_iter=152118, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:40, time_cost(all): 1 day, 11:11:58/1 day, 4:25:13, loss=0.421084067733137, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=4.382660874750589, lr=0.21665141027417745
2023-12-06 22:07:55   INFO  epoch: 39/72, acc_iter=152168, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:40, time_cost(all): 1 day, 11:12:40/1 day, 5:06:36, loss=0.421024870307196, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=1.3876883915563458, lr=0.2165374431988097
2023-12-06 22:08:37   INFO  epoch: 39/72, acc_iter=152218, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:51, time_cost(all): 1 day, 11:13:22/1 day, 4:47:57, loss=0.420965672881255, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=1.0199291601236768, lr=0.21642347612344187
2023-12-06 22:09:18   INFO  epoch: 39/72, acc_iter=152268, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:23, time_cost(all): 1 day, 11:14:03/1 day, 6:44:10, loss=0.420906475455314, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=2.813144893010474, lr=0.21630950904807406
2023-12-06 22:10:00   INFO  epoch: 39/72, acc_iter=152318, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:01, time_cost(all): 1 day, 11:14:45/1 day, 4:53:10, loss=0.420847278029373, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=1.4481027157988335, lr=0.21619554197270624
2023-12-06 22:10:42   INFO  epoch: 39/72, acc_iter=152368, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:08, time_cost(all): 1 day, 11:15:27/1 day, 4:37:35, loss=0.420788080603432, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=1.526101977022929, lr=0.21608157489733848
2023-12-06 22:11:24   INFO  epoch: 39/72, acc_iter=152418, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:26, time_cost(all): 1 day, 11:16:09/1 day, 5:27:01, loss=0.420728883177491, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=1.8399476531185672, lr=0.21596760782197072
2023-12-06 22:12:06   INFO  epoch: 39/72, acc_iter=152468, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:07, time_cost(all): 1 day, 11:16:51/1 day, 4:39:57, loss=0.42066968575155, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=1.945145603989284, lr=0.21585364074660285
2023-12-06 22:12:47   INFO  epoch: 39/72, acc_iter=152518, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:11, time_cost(all): 1 day, 11:17:32/1 day, 6:26:48, loss=0.420610488325609, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=2.20527709091071, lr=0.2157396736712351
2023-12-06 22:13:29   INFO  epoch: 39/72, acc_iter=152568, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:30, time_cost(all): 1 day, 11:18:14/1 day, 4:50:46, loss=0.420551290899669, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=0.6801860819399765, lr=0.21562570659586733
2023-12-06 22:14:11   INFO  epoch: 39/72, acc_iter=152618, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:13, time_cost(all): 1 day, 11:18:56/1 day, 5:02:34, loss=0.420492093473728, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=2.9522347754357527, lr=0.2155117395204995
2023-12-06 22:14:53   INFO  epoch: 39/72, acc_iter=152668, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:12, time_cost(all): 1 day, 11:19:38/1 day, 5:57:40, loss=0.420432896047787, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=2.5766734158031017, lr=0.2153977724451317
2023-12-06 22:15:34   INFO  epoch: 39/72, acc_iter=152718, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:48, time_cost(all): 1 day, 11:20:19/1 day, 5:55:40, loss=0.420373698621846, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=2.4492512358072345, lr=0.21528380536976394
2023-12-06 22:16:16   INFO  epoch: 39/72, acc_iter=152768, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:13, time_cost(all): 1 day, 11:21:01/1 day, 4:52:43, loss=0.420314501195905, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=2.514188779650761, lr=0.21516983829439612
2023-12-06 22:16:58   INFO  epoch: 39/72, acc_iter=152818, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:49, time_cost(all): 1 day, 11:21:43/1 day, 4:51:20, loss=0.420255303769964, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=0.8758167969231798, lr=0.21505587121902836
2023-12-06 22:17:40   INFO  epoch: 39/72, acc_iter=152868, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:27, time_cost(all): 1 day, 11:22:25/1 day, 4:03:14, loss=0.420196106344023, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=2.4198760229781513, lr=0.21494190414366054
2023-12-06 22:18:22   INFO  epoch: 39/72, acc_iter=152918, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:02, time_cost(all): 1 day, 11:23:07/1 day, 3:44:00, loss=0.420136908918082, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=0.6515951503494084, lr=0.21482793706829273
2023-12-06 22:19:03   INFO  epoch: 39/72, acc_iter=152968, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:54, time_cost(all): 1 day, 11:23:48/1 day, 6:19:30, loss=0.420077711492141, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=0.7789312897771389, lr=0.21471396999292497
2023-12-06 22:19:45   INFO  epoch: 39/72, acc_iter=153018, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:00, time_cost(all): 1 day, 11:24:30/1 day, 5:34:38, loss=0.4200185140662, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=1.541224382342404, lr=0.21460000291755715
2023-12-06 22:20:27   INFO  epoch: 39/72, acc_iter=153068, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:47, time_cost(all): 1 day, 11:25:12/1 day, 6:07:01, loss=0.419959316640259, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=4.9507489693116025, lr=0.21448603584218934
2023-12-06 22:21:09   INFO  epoch: 39/72, acc_iter=153118, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:36, time_cost(all): 1 day, 11:25:54/1 day, 5:35:58, loss=0.419900119214318, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=0.6656331128896535, lr=0.21437206876682158
2023-12-06 22:21:50   INFO  epoch: 39/72, acc_iter=153168, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:44, time_cost(all): 1 day, 11:26:35/1 day, 3:53:13, loss=0.419840921788377, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=4.757723270379829, lr=0.21425810169145376
2023-12-06 22:22:32   INFO  epoch: 39/72, acc_iter=153218, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:11, time_cost(all): 1 day, 11:27:17/1 day, 4:39:14, loss=0.419781724362436, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=4.679239221102508, lr=0.21414413461608595
2023-12-06 22:23:14   INFO  epoch: 39/72, acc_iter=153268, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:56, time_cost(all): 1 day, 11:27:59/1 day, 5:06:21, loss=0.419722526936495, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=1.8014053079698218, lr=0.21403016754071819
2023-12-06 22:23:56   INFO  epoch: 39/72, acc_iter=153318, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:01, time_cost(all): 1 day, 11:28:41/1 day, 3:50:46, loss=0.419663329510554, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=1.1069449068798174, lr=0.21391620046535037
2023-12-06 22:24:38   INFO  epoch: 39/72, acc_iter=153368, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:20, time_cost(all): 1 day, 11:29:23/1 day, 4:07:50, loss=0.419604132084614, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=0.790499346235048, lr=0.2138022333899826
2023-12-06 22:25:19   INFO  epoch: 39/72, acc_iter=153418, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:12, time_cost(all): 1 day, 11:30:04/1 day, 3:39:10, loss=0.419544934658673, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=2.892473277784705, lr=0.2136882663146148
2023-12-06 22:26:01   INFO  epoch: 39/72, acc_iter=153468, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:53, time_cost(all): 1 day, 11:30:46/1 day, 5:34:27, loss=0.419485737232732, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=1.9920039737319677, lr=0.21357429923924698
2023-12-06 22:26:43   INFO  epoch: 39/72, acc_iter=153518, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:23, time_cost(all): 1 day, 11:31:28/1 day, 4:34:34, loss=0.419426539806791, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=2.492524691879119, lr=0.21346033216387922
2023-12-06 22:27:25   INFO  epoch: 39/72, acc_iter=153568, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:35, time_cost(all): 1 day, 11:32:10/1 day, 4:27:41, loss=0.41936734238085, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=3.670483755310019, lr=0.21334636508851146
2023-12-06 22:28:06   INFO  epoch: 39/72, acc_iter=153618, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:57, time_cost(all): 1 day, 11:32:51/1 day, 5:52:35, loss=0.419308144954909, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=1.5433921170982152, lr=0.2132323980131436
2023-12-06 22:28:48   INFO  epoch: 39/72, acc_iter=153668, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:15, time_cost(all): 1 day, 11:33:33/1 day, 4:03:07, loss=0.419248947528968, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.08(1.03), norm=2.5786422950568144, lr=0.21311843093777583
2023-12-06 22:29:30   INFO  epoch: 39/72, acc_iter=153718, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:53, time_cost(all): 1 day, 11:34:15/1 day, 5:27:44, loss=0.419189750103027, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=3.163378311923037, lr=0.21300446386240807
2023-12-06 22:30:12   INFO  epoch: 39/72, acc_iter=153768, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:06, time_cost(all): 1 day, 11:34:57/1 day, 4:25:09, loss=0.419130552677086, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=4.2670596487862875, lr=0.21289049678704025
2023-12-06 22:30:54   INFO  epoch: 39/72, acc_iter=153818, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:29, time_cost(all): 1 day, 11:35:39/1 day, 5:58:14, loss=0.419071355251145, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=1.863761249677907, lr=0.21277652971167244
2023-12-06 22:31:35   INFO  epoch: 39/72, acc_iter=153868, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:21, time_cost(all): 1 day, 11:36:20/1 day, 4:14:15, loss=0.419012157825204, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=2.1113519184243312, lr=0.21266256263630462
2023-12-06 22:32:17   INFO  epoch: 39/72, acc_iter=153918, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:58, time_cost(all): 1 day, 11:37:02/1 day, 5:19:13, loss=0.418952960399263, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=2.895560800409538, lr=0.21254859556093686
2023-12-06 22:32:59   INFO  epoch: 39/72, acc_iter=153968, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:58, time_cost(all): 1 day, 11:37:44/1 day, 4:12:23, loss=0.418893762973322, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=2.5539340821423693, lr=0.21243462848556904
2023-12-06 22:33:41   INFO  epoch: 39/72, acc_iter=154018, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:26, time_cost(all): 1 day, 11:38:26/1 day, 3:28:50, loss=0.418834565547381, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=3.923785322766, lr=0.21232066141020123
2023-12-06 22:34:23   INFO  epoch: 39/72, acc_iter=154068, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:40, time_cost(all): 1 day, 11:39:08/1 day, 3:56:42, loss=0.41877536812144, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=2.647105096228459, lr=0.21220669433483347
2023-12-06 22:35:04   INFO  epoch: 39/72, acc_iter=154118, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:00, time_cost(all): 1 day, 11:39:49/1 day, 3:45:14, loss=0.418716170695499, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=2.9815551817726904, lr=0.2120927272594657
2023-12-06 22:35:46   INFO  epoch: 39/72, acc_iter=154168, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:11, time_cost(all): 1 day, 11:40:31/1 day, 6:01:12, loss=0.418656973269558, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=4.6766196076578765, lr=0.21197876018409784
2023-12-06 22:36:28   INFO  epoch: 39/72, acc_iter=154218, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:35, time_cost(all): 1 day, 11:41:13/1 day, 5:31:48, loss=0.418597775843617, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=2.2061890392308285, lr=0.21186479310873008
2023-12-06 22:37:10   INFO  epoch: 39/72, acc_iter=154268, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:58, time_cost(all): 1 day, 11:41:55/1 day, 5:36:34, loss=0.418538578417677, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=4.235044662261112, lr=0.21175082603336232
2023-12-06 22:37:51   INFO  epoch: 39/72, acc_iter=154318, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:11, time_cost(all): 1 day, 11:42:36/1 day, 6:12:46, loss=0.418479380991736, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=1.7690448564578027, lr=0.2116368589579945
2023-12-06 22:38:33   INFO  epoch: 39/72, acc_iter=154368, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:35, time_cost(all): 1 day, 11:43:18/1 day, 5:52:42, loss=0.418420183565795, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=0.7298768402417062, lr=0.21152289188262668
2023-12-06 22:39:15   INFO  epoch: 39/72, acc_iter=154418, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 1 day, 11:44:00/1 day, 4:34:10, loss=0.418360986139854, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=2.5252088391892995, lr=0.21140892480725892
2023-12-06 22:39:57   INFO  epoch: 39/72, acc_iter=154468, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 11:44:42/1 day, 4:19:44, loss=0.418301788713913, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=2.7221170086643736, lr=0.2112949577318911
2023-12-06 22:40:39   INFO  epoch: 40/72, acc_iter=154530, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:38, time_cost(all): 1 day, 11:45:24/1 day, 3:23:47, loss=0.418228383905746, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=1.8323910240761125, lr=0.21115363855843505
2023-12-06 22:41:20   INFO  epoch: 40/72, acc_iter=154580, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:02, time_cost(all): 1 day, 11:46:05/1 day, 4:48:35, loss=0.418169186479805, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=3.5728596593473014, lr=0.21103967148306724
2023-12-06 22:42:02   INFO  epoch: 40/72, acc_iter=154630, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:04, time_cost(all): 1 day, 11:46:47/1 day, 4:31:32, loss=0.418109989053864, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.859356025975731, lr=0.21092570440769948
2023-12-06 22:42:44   INFO  epoch: 40/72, acc_iter=154680, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:52:59, time_cost(all): 1 day, 11:47:29/1 day, 4:08:07, loss=0.418050791627923, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=2.805482095623802, lr=0.21081173733233166
2023-12-06 22:43:26   INFO  epoch: 40/72, acc_iter=154730, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:45, time_cost(all): 1 day, 11:48:11/1 day, 3:28:50, loss=0.417991594201982, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=1.5701370447095555, lr=0.21069777025696385
2023-12-06 22:44:07   INFO  epoch: 40/72, acc_iter=154780, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:41, time_cost(all): 1 day, 11:48:52/1 day, 3:37:15, loss=0.417932396776041, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=4.9685467521272955, lr=0.21058380318159609
2023-12-06 22:44:49   INFO  epoch: 40/72, acc_iter=154830, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:53, time_cost(all): 1 day, 11:49:34/1 day, 3:58:32, loss=0.4178731993501, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=0.5267153789173876, lr=0.21046983610622833
2023-12-06 22:45:31   INFO  epoch: 40/72, acc_iter=154880, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:37, time_cost(all): 1 day, 11:50:16/1 day, 4:16:08, loss=0.417814001924159, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=0.6612979500009495, lr=0.21035586903086045
2023-12-06 22:46:13   INFO  epoch: 40/72, acc_iter=154930, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:43, time_cost(all): 1 day, 11:50:58/1 day, 5:21:19, loss=0.417754804498218, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=2.100495934096765, lr=0.2102419019554927
2023-12-06 22:46:55   INFO  epoch: 40/72, acc_iter=154980, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:48, time_cost(all): 1 day, 11:51:40/1 day, 3:35:25, loss=0.417695607072278, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=0.9562041839890518, lr=0.21012793488012493
2023-12-06 22:47:36   INFO  epoch: 40/72, acc_iter=155030, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:03, time_cost(all): 1 day, 11:52:21/1 day, 3:26:46, loss=0.417636409646337, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.01(1.03), norm=4.207299529158546, lr=0.21001396780475712
2023-12-06 22:48:18   INFO  epoch: 40/72, acc_iter=155080, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:30, time_cost(all): 1 day, 11:53:03/1 day, 5:54:20, loss=0.417577212220396, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=4.537508088752846, lr=0.2099000007293893
2023-12-06 22:49:00   INFO  epoch: 40/72, acc_iter=155130, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:24, time_cost(all): 1 day, 11:53:45/1 day, 6:01:57, loss=0.417518014794455, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=3.719084396916287, lr=0.2097860336540215
2023-12-06 22:49:42   INFO  epoch: 40/72, acc_iter=155180, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:50, time_cost(all): 1 day, 11:54:27/1 day, 4:56:20, loss=0.417458817368514, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=2.8669182594014835, lr=0.20967206657865373
2023-12-06 22:50:23   INFO  epoch: 40/72, acc_iter=155230, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:22, time_cost(all): 1 day, 11:55:08/1 day, 4:57:23, loss=0.417399619942573, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=0.8864306188490132, lr=0.2095580995032859
2023-12-06 22:51:05   INFO  epoch: 40/72, acc_iter=155280, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:29, time_cost(all): 1 day, 11:55:50/1 day, 5:41:10, loss=0.417340422516632, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=1.2488529233021042, lr=0.2094441324279181
2023-12-06 22:51:47   INFO  epoch: 40/72, acc_iter=155330, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:48, time_cost(all): 1 day, 11:56:32/1 day, 3:16:28, loss=0.417281225090691, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=2.998962861525605, lr=0.20933016535255033
2023-12-06 22:52:29   INFO  epoch: 40/72, acc_iter=155380, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:33, time_cost(all): 1 day, 11:57:14/1 day, 3:57:06, loss=0.41722202766475, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=3.35745382089161, lr=0.20921619827718257
2023-12-06 22:53:11   INFO  epoch: 40/72, acc_iter=155430, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:52, time_cost(all): 1 day, 11:57:56/1 day, 3:44:52, loss=0.417162830238809, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=3.769573835333164, lr=0.2091022312018147
2023-12-06 22:53:52   INFO  epoch: 40/72, acc_iter=155480, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:03, time_cost(all): 1 day, 11:58:37/1 day, 5:14:56, loss=0.417103632812868, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=1.9787859127907677, lr=0.20898826412644694
2023-12-06 22:54:34   INFO  epoch: 40/72, acc_iter=155530, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:00, time_cost(all): 1 day, 11:59:19/1 day, 4:54:00, loss=0.417044435386927, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=2.4071663776449626, lr=0.20887429705107918
2023-12-06 22:55:16   INFO  epoch: 40/72, acc_iter=155580, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:50, time_cost(all): 1 day, 12:00:01/1 day, 4:23:27, loss=0.416985237960986, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=4.68834472601451, lr=0.20876032997571137
2023-12-06 22:55:58   INFO  epoch: 40/72, acc_iter=155630, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:49, time_cost(all): 1 day, 12:00:43/1 day, 5:01:16, loss=0.416926040535045, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=2.74759614160768, lr=0.20864636290034355
2023-12-06 22:56:39   INFO  epoch: 40/72, acc_iter=155680, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:42, time_cost(all): 1 day, 12:01:24/1 day, 5:15:48, loss=0.416866843109104, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=0.9605021358112567, lr=0.2085323958249758
2023-12-06 22:57:21   INFO  epoch: 40/72, acc_iter=155730, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:17, time_cost(all): 1 day, 12:02:06/1 day, 5:46:31, loss=0.416807645683163, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=1.987569955044969, lr=0.20841842874960798
2023-12-06 22:58:03   INFO  epoch: 40/72, acc_iter=155780, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:37, time_cost(all): 1 day, 12:02:48/1 day, 5:14:39, loss=0.416748448257222, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=0.7119723037953767, lr=0.20830446167424022
2023-12-06 22:58:45   INFO  epoch: 40/72, acc_iter=155830, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:35, time_cost(all): 1 day, 12:03:30/1 day, 3:46:12, loss=0.416689250831282, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=3.6362281933949667, lr=0.2081904945988724
2023-12-06 22:59:27   INFO  epoch: 40/72, acc_iter=155880, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:21, time_cost(all): 1 day, 12:04:12/1 day, 5:46:54, loss=0.416630053405341, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=4.225054684683624, lr=0.20807652752350458
2023-12-06 23:00:08   INFO  epoch: 40/72, acc_iter=155930, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:02, time_cost(all): 1 day, 12:04:53/1 day, 5:48:36, loss=0.4165708559794, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=4.205136900274531, lr=0.20796256044813682
2023-12-06 23:00:50   INFO  epoch: 40/72, acc_iter=155980, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:51, time_cost(all): 1 day, 12:05:35/1 day, 5:37:14, loss=0.416511658553459, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=2.3765684470485313, lr=0.207848593372769
2023-12-06 23:01:32   INFO  epoch: 40/72, acc_iter=156030, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:41, time_cost(all): 1 day, 12:06:17/1 day, 4:23:57, loss=0.416452461127518, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=2.5785517233738044, lr=0.2077346262974012
2023-12-06 23:02:14   INFO  epoch: 40/72, acc_iter=156080, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:04, time_cost(all): 1 day, 12:06:59/1 day, 5:32:31, loss=0.416393263701577, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=3.9329264525273673, lr=0.20762065922203343
2023-12-06 23:02:55   INFO  epoch: 40/72, acc_iter=156130, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:39, time_cost(all): 1 day, 12:07:40/1 day, 3:18:52, loss=0.416334066275636, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=4.73144211752409, lr=0.20750669214666562
2023-12-06 23:03:37   INFO  epoch: 40/72, acc_iter=156180, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:45, time_cost(all): 1 day, 12:08:22/1 day, 4:25:18, loss=0.416274868849695, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=2.9286073631858986, lr=0.2073927250712978
2023-12-06 23:04:19   INFO  epoch: 40/72, acc_iter=156230, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:19, time_cost(all): 1 day, 12:09:04/1 day, 4:09:33, loss=0.416215671423754, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=3.837103402542057, lr=0.20727875799593004
2023-12-06 23:05:01   INFO  epoch: 40/72, acc_iter=156280, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:55, time_cost(all): 1 day, 12:09:46/1 day, 5:21:49, loss=0.416156473997813, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=2.103039023150733, lr=0.20716479092056223
2023-12-06 23:05:43   INFO  epoch: 40/72, acc_iter=156330, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:29:16, time_cost(all): 1 day, 12:10:28/1 day, 4:44:51, loss=0.416097276571872, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=4.2493865288472445, lr=0.20705082384519446
2023-12-06 23:06:24   INFO  epoch: 40/72, acc_iter=156380, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:57, time_cost(all): 1 day, 12:11:09/1 day, 3:27:20, loss=0.416038079145931, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=4.00186689063066, lr=0.20693685676982665
2023-12-06 23:07:06   INFO  epoch: 40/72, acc_iter=156430, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:18, time_cost(all): 1 day, 12:11:51/1 day, 3:00:41, loss=0.41597888171999, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=3.7752515214861484, lr=0.20682288969445883
2023-12-06 23:07:48   INFO  epoch: 40/72, acc_iter=156480, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:54, time_cost(all): 1 day, 12:12:33/1 day, 3:13:13, loss=0.415919684294049, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=3.467452994811547, lr=0.20670892261909107
2023-12-06 23:08:30   INFO  epoch: 40/72, acc_iter=156530, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:46, time_cost(all): 1 day, 12:13:15/1 day, 5:02:32, loss=0.415860486868108, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=0.7350341056797276, lr=0.2065949555437233
2023-12-06 23:09:12   INFO  epoch: 40/72, acc_iter=156580, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:13, time_cost(all): 1 day, 12:13:57/1 day, 4:15:46, loss=0.415801289442167, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=3.0478701933498926, lr=0.20648098846835544
2023-12-06 23:09:53   INFO  epoch: 40/72, acc_iter=156630, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:20, time_cost(all): 1 day, 12:14:38/1 day, 4:00:49, loss=0.415742092016226, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=1.2726553597858778, lr=0.20636702139298768
2023-12-06 23:10:35   INFO  epoch: 40/72, acc_iter=156680, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:15, time_cost(all): 1 day, 12:15:20/1 day, 3:30:21, loss=0.415682894590286, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=3.916411467643623, lr=0.20625305431761987
2023-12-06 23:11:17   INFO  epoch: 40/72, acc_iter=156730, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:21, time_cost(all): 1 day, 12:16:02/1 day, 3:30:35, loss=0.415623697164345, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=3.1151908840507057, lr=0.2061390872422521
2023-12-06 23:11:59   INFO  epoch: 40/72, acc_iter=156780, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:42, time_cost(all): 1 day, 12:16:44/1 day, 4:03:37, loss=0.415564499738404, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=4.724835995275029, lr=0.2060251201668843
2023-12-06 23:12:40   INFO  epoch: 40/72, acc_iter=156830, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:22, time_cost(all): 1 day, 12:17:25/1 day, 4:49:29, loss=0.415505302312463, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=1.1223023464270843, lr=0.20591115309151647
2023-12-06 23:13:22   INFO  epoch: 40/72, acc_iter=156880, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:52, time_cost(all): 1 day, 12:18:07/1 day, 3:33:34, loss=0.415446104886522, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=0.6579956222496541, lr=0.20579718601614871
2023-12-06 23:14:04   INFO  epoch: 40/72, acc_iter=156930, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:23, time_cost(all): 1 day, 12:18:49/1 day, 3:47:45, loss=0.415386907460581, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=2.79662894771495, lr=0.2056832189407809
2023-12-06 23:14:46   INFO  epoch: 40/72, acc_iter=156980, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:37, time_cost(all): 1 day, 12:19:31/1 day, 4:03:16, loss=0.41532771003464, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=0.8710694933298235, lr=0.20556925186541308
2023-12-06 23:15:28   INFO  epoch: 40/72, acc_iter=157030, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:10, time_cost(all): 1 day, 12:20:13/1 day, 4:14:01, loss=0.415268512608699, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=3.329027061402221, lr=0.20545528479004532
2023-12-06 23:16:09   INFO  epoch: 40/72, acc_iter=157080, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:24, time_cost(all): 1 day, 12:20:54/1 day, 3:41:01, loss=0.415209315182758, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=4.607001301439109, lr=0.20534131771467756
2023-12-06 23:16:51   INFO  epoch: 40/72, acc_iter=157130, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:04, time_cost(all): 1 day, 12:21:36/1 day, 5:08:58, loss=0.415150117756817, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=3.2296764626863315, lr=0.2052273506393097
2023-12-06 23:17:33   INFO  epoch: 40/72, acc_iter=157180, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:46, time_cost(all): 1 day, 12:22:18/1 day, 4:47:41, loss=0.415090920330876, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=0.8424453149745647, lr=0.20511338356394193
2023-12-06 23:18:15   INFO  epoch: 40/72, acc_iter=157230, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:52, time_cost(all): 1 day, 12:23:00/1 day, 3:55:15, loss=0.415031722904935, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=2.6696660624018786, lr=0.20499941648857417
2023-12-06 23:18:56   INFO  epoch: 40/72, acc_iter=157280, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:24, time_cost(all): 1 day, 12:23:41/1 day, 5:19:45, loss=0.414972525478994, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=1.48858358105753, lr=0.20488544941320636
2023-12-06 23:19:38   INFO  epoch: 40/72, acc_iter=157330, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:51, time_cost(all): 1 day, 12:24:23/1 day, 2:50:00, loss=0.414913328053053, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.6416000155979673, lr=0.20477148233783854
2023-12-06 23:20:20   INFO  epoch: 40/72, acc_iter=157380, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:09, time_cost(all): 1 day, 12:25:05/1 day, 5:00:32, loss=0.414854130627112, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=2.2020948872895496, lr=0.20465751526247078
2023-12-06 23:21:02   INFO  epoch: 40/72, acc_iter=157430, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:43, time_cost(all): 1 day, 12:25:47/1 day, 3:44:28, loss=0.414794933201171, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=1.6969187009464268, lr=0.20454354818710296
2023-12-06 23:21:44   INFO  epoch: 40/72, acc_iter=157480, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:34, time_cost(all): 1 day, 12:26:29/1 day, 4:12:19, loss=0.41473573577523, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=4.0246076095689425, lr=0.2044295811117352
2023-12-06 23:22:25   INFO  epoch: 40/72, acc_iter=157530, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:44, time_cost(all): 1 day, 12:27:10/1 day, 4:10:40, loss=0.41467653834929, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=3.72556939673974, lr=0.20431561403636733
2023-12-06 23:23:07   INFO  epoch: 40/72, acc_iter=157580, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:18, time_cost(all): 1 day, 12:27:52/1 day, 3:47:10, loss=0.414617340923349, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=0.947243878309381, lr=0.20420164696099957
2023-12-06 23:23:49   INFO  epoch: 40/72, acc_iter=157630, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:44, time_cost(all): 1 day, 12:28:34/1 day, 5:11:04, loss=0.414558143497408, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=1.087206480067391, lr=0.2040876798856318
2023-12-06 23:24:31   INFO  epoch: 40/72, acc_iter=157680, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:20, time_cost(all): 1 day, 12:29:16/1 day, 4:58:11, loss=0.414498946071467, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=0.545924770766467, lr=0.203973712810264
2023-12-06 23:25:12   INFO  epoch: 40/72, acc_iter=157730, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:22, time_cost(all): 1 day, 12:29:57/1 day, 4:55:26, loss=0.414439748645526, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=3.2905159025835027, lr=0.20385974573489618
2023-12-06 23:25:54   INFO  epoch: 40/72, acc_iter=157780, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:47, time_cost(all): 1 day, 12:30:39/1 day, 3:36:19, loss=0.414380551219585, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.09(1.03), norm=2.4976573736704086, lr=0.20374577865952842
2023-12-06 23:26:36   INFO  epoch: 40/72, acc_iter=157830, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:50, time_cost(all): 1 day, 12:31:21/1 day, 4:24:58, loss=0.414321353793644, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=3.8509085943342214, lr=0.2036318115841606
2023-12-06 23:27:18   INFO  epoch: 40/72, acc_iter=157880, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:15, time_cost(all): 1 day, 12:32:03/1 day, 2:44:40, loss=0.414262156367703, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=1.520413514136493, lr=0.2035178445087928
2023-12-06 23:28:00   INFO  epoch: 40/72, acc_iter=157930, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:35, time_cost(all): 1 day, 12:32:45/1 day, 3:42:50, loss=0.414202958941762, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=2.05524088133784, lr=0.20340387743342503
2023-12-06 23:28:41   INFO  epoch: 40/72, acc_iter=157980, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:00, time_cost(all): 1 day, 12:33:26/1 day, 3:28:55, loss=0.414143761515821, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.2829786772710112, lr=0.2032899103580572
2023-12-06 23:29:23   INFO  epoch: 40/72, acc_iter=158030, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:16, time_cost(all): 1 day, 12:34:08/1 day, 4:42:55, loss=0.41408456408988, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=1.3918258206480543, lr=0.20317594328268945
2023-12-06 23:30:05   INFO  epoch: 40/72, acc_iter=158080, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:41, time_cost(all): 1 day, 12:34:50/1 day, 4:39:37, loss=0.414025366663939, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=1.5555134125125385, lr=0.20306197620732164
2023-12-06 23:30:47   INFO  epoch: 40/72, acc_iter=158130, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:51, time_cost(all): 1 day, 12:35:32/1 day, 3:29:22, loss=0.413966169237998, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=4.393982151351882, lr=0.20294800913195382
2023-12-06 23:31:28   INFO  epoch: 40/72, acc_iter=158180, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:17, time_cost(all): 1 day, 12:36:13/1 day, 3:52:56, loss=0.413906971812057, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=2.719415670054294, lr=0.20283404205658606
2023-12-06 23:32:10   INFO  epoch: 40/72, acc_iter=158230, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:32, time_cost(all): 1 day, 12:36:55/1 day, 4:07:22, loss=0.413847774386116, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=0.903245614397689, lr=0.2027200749812183
2023-12-06 23:32:52   INFO  epoch: 40/72, acc_iter=158280, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 1 day, 12:37:37/1 day, 5:06:07, loss=0.413788576960175, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=4.991855012215593, lr=0.20260610790585043
2023-12-06 23:33:34   INFO  epoch: 40/72, acc_iter=158330, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 12:38:19/1 day, 4:01:30, loss=0.413729379534234, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=1.775920592263989, lr=0.20249214083048267
2023-12-06 23:34:16   INFO  epoch: 41/72, acc_iter=158392, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:30, time_cost(all): 1 day, 12:39:01/1 day, 3:48:20, loss=0.413655974726068, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=2.982086030254536, lr=0.20235082165702656
2023-12-06 23:34:57   INFO  epoch: 41/72, acc_iter=158442, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:05, time_cost(all): 1 day, 12:39:42/1 day, 3:01:39, loss=0.413596777300127, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=4.317069690087291, lr=0.2022368545816588
2023-12-06 23:35:39   INFO  epoch: 41/72, acc_iter=158492, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:39, time_cost(all): 1 day, 12:40:24/1 day, 3:16:27, loss=0.413537579874186, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=4.925318688339636, lr=0.20212288750629104
2023-12-06 23:36:21   INFO  epoch: 41/72, acc_iter=158542, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:34, time_cost(all): 1 day, 12:41:06/1 day, 3:31:29, loss=0.413478382448245, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=0.6862206872211805, lr=0.20200892043092322
2023-12-06 23:37:03   INFO  epoch: 41/72, acc_iter=158592, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:22, time_cost(all): 1 day, 12:41:48/1 day, 3:08:20, loss=0.413419185022304, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=4.319334692611856, lr=0.2018949533555554
2023-12-06 23:37:44   INFO  epoch: 41/72, acc_iter=158642, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:31, time_cost(all): 1 day, 12:42:29/1 day, 2:33:55, loss=0.413359987596363, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=2.1902880263214697, lr=0.20178098628018765
2023-12-06 23:38:26   INFO  epoch: 41/72, acc_iter=158692, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:42, time_cost(all): 1 day, 12:43:11/1 day, 4:43:46, loss=0.413300790170422, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=4.649902942124298, lr=0.20166701920481983
2023-12-06 23:39:08   INFO  epoch: 41/72, acc_iter=158742, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:13, time_cost(all): 1 day, 12:43:53/1 day, 3:13:21, loss=0.413241592744481, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=2.8441612030667005, lr=0.20155305212945207
2023-12-06 23:39:50   INFO  epoch: 41/72, acc_iter=158792, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:32, time_cost(all): 1 day, 12:44:35/1 day, 3:38:21, loss=0.41318239531854, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=1.8243028317063872, lr=0.2014390850540842
2023-12-06 23:40:32   INFO  epoch: 41/72, acc_iter=158842, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:44:32, time_cost(all): 1 day, 12:45:17/1 day, 4:51:25, loss=0.413123197892599, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=3.6449845208305485, lr=0.20132511797871644
2023-12-06 23:41:13   INFO  epoch: 41/72, acc_iter=158892, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:46, time_cost(all): 1 day, 12:45:58/1 day, 2:51:23, loss=0.413064000466658, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=1.1069007343228647, lr=0.20121115090334868
2023-12-06 23:41:55   INFO  epoch: 41/72, acc_iter=158942, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:19, time_cost(all): 1 day, 12:46:40/1 day, 2:53:07, loss=0.413004803040717, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=4.113102038758083, lr=0.20109718382798086
2023-12-06 23:42:37   INFO  epoch: 41/72, acc_iter=158992, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:52, time_cost(all): 1 day, 12:47:22/1 day, 3:57:10, loss=0.412945605614776, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.07(1.03), norm=4.00437511503324, lr=0.20098321675261305
2023-12-06 23:43:19   INFO  epoch: 41/72, acc_iter=159042, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:37, time_cost(all): 1 day, 12:48:04/1 day, 2:58:22, loss=0.412886408188835, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=3.038253513890831, lr=0.2008692496772453
2023-12-06 23:44:01   INFO  epoch: 41/72, acc_iter=159092, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:52, time_cost(all): 1 day, 12:48:46/1 day, 4:42:11, loss=0.412827210762895, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=1.032872577978877, lr=0.20075528260187747
2023-12-06 23:44:42   INFO  epoch: 41/72, acc_iter=159142, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:36, time_cost(all): 1 day, 12:49:27/1 day, 3:01:08, loss=0.412768013336954, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=3.8421113282578645, lr=0.20064131552650966
2023-12-06 23:45:24   INFO  epoch: 41/72, acc_iter=159192, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:30, time_cost(all): 1 day, 12:50:09/1 day, 3:50:42, loss=0.412708815911013, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=2.9338938726264963, lr=0.2005273484511419
2023-12-06 23:46:06   INFO  epoch: 41/72, acc_iter=159242, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:29, time_cost(all): 1 day, 12:50:51/1 day, 4:05:17, loss=0.412649618485072, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=3.6760296327193602, lr=0.20041338137577408
2023-12-06 23:46:48   INFO  epoch: 41/72, acc_iter=159292, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:16, time_cost(all): 1 day, 12:51:33/1 day, 4:45:20, loss=0.412590421059131, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=2.687434824648488, lr=0.20029941430040632
2023-12-06 23:47:29   INFO  epoch: 41/72, acc_iter=159342, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:20, time_cost(all): 1 day, 12:52:14/1 day, 3:34:27, loss=0.41253122363319, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=1.0075677492538164, lr=0.2001854472250385
2023-12-06 23:48:11   INFO  epoch: 41/72, acc_iter=159392, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:16, time_cost(all): 1 day, 12:52:56/1 day, 4:11:48, loss=0.412472026207249, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.8988426788204045, lr=0.2000714801496707
2023-12-06 23:48:53   INFO  epoch: 41/72, acc_iter=159442, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:10, time_cost(all): 1 day, 12:53:38/1 day, 3:06:06, loss=0.412412828781308, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=4.588823062340677, lr=0.19995751307430293
2023-12-06 23:49:35   INFO  epoch: 41/72, acc_iter=159492, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:42, time_cost(all): 1 day, 12:54:20/1 day, 3:02:37, loss=0.412353631355367, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=4.3973752957105585, lr=0.1998435459989351
2023-12-06 23:50:17   INFO  epoch: 41/72, acc_iter=159542, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:00, time_cost(all): 1 day, 12:55:02/1 day, 3:49:01, loss=0.412294433929426, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=3.5629593360209872, lr=0.1997295789235673
2023-12-06 23:50:58   INFO  epoch: 41/72, acc_iter=159592, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:19, time_cost(all): 1 day, 12:55:43/1 day, 3:25:36, loss=0.412235236503485, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=0.5054372213730801, lr=0.19961561184819954
2023-12-06 23:51:40   INFO  epoch: 41/72, acc_iter=159642, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:41, time_cost(all): 1 day, 12:56:25/1 day, 3:29:31, loss=0.412176039077544, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.2908475986548438, lr=0.19950164477283172
2023-12-06 23:52:22   INFO  epoch: 41/72, acc_iter=159692, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:29, time_cost(all): 1 day, 12:57:07/1 day, 4:46:44, loss=0.412116841651603, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=1.8803123004464009, lr=0.19938767769746396
2023-12-06 23:53:04   INFO  epoch: 41/72, acc_iter=159742, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:32, time_cost(all): 1 day, 12:57:49/1 day, 2:51:30, loss=0.412057644225662, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=3.169449359641884, lr=0.19927371062209615
2023-12-06 23:53:45   INFO  epoch: 41/72, acc_iter=159792, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:07, time_cost(all): 1 day, 12:58:30/1 day, 3:31:16, loss=0.411998446799721, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=2.029447871709242, lr=0.19915974354672833
2023-12-06 23:54:27   INFO  epoch: 41/72, acc_iter=159842, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:43, time_cost(all): 1 day, 12:59:12/1 day, 3:28:26, loss=0.41193924937378, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=0.6381191694098025, lr=0.19904577647136057
2023-12-06 23:55:09   INFO  epoch: 41/72, acc_iter=159892, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:58, time_cost(all): 1 day, 12:59:54/1 day, 3:16:02, loss=0.411880051947839, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=4.316739474914481, lr=0.19893180939599275
2023-12-06 23:55:51   INFO  epoch: 41/72, acc_iter=159942, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:14, time_cost(all): 1 day, 13:00:36/1 day, 4:48:37, loss=0.411820854521899, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=3.7141535315848135, lr=0.19881784232062494
2023-12-06 23:56:33   INFO  epoch: 41/72, acc_iter=159992, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:06, time_cost(all): 1 day, 13:01:18/1 day, 3:50:36, loss=0.411761657095958, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=1.2665103840643308, lr=0.19870387524525718
2023-12-06 23:57:14   INFO  epoch: 41/72, acc_iter=160042, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:52, time_cost(all): 1 day, 13:01:59/1 day, 2:21:29, loss=0.411702459670017, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.554247591897862, lr=0.19858990816988942
2023-12-06 23:57:56   INFO  epoch: 41/72, acc_iter=160092, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:34, time_cost(all): 1 day, 13:02:41/1 day, 4:39:45, loss=0.411643262244076, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=2.9136372450360666, lr=0.19847594109452155
2023-12-06 23:58:38   INFO  epoch: 41/72, acc_iter=160142, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:54, time_cost(all): 1 day, 13:03:23/1 day, 2:25:52, loss=0.411584064818135, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=2.167598167442407, lr=0.1983619740191538
2023-12-06 23:59:20   INFO  epoch: 41/72, acc_iter=160192, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:02, time_cost(all): 1 day, 13:04:05/1 day, 3:23:33, loss=0.411524867392194, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=4.075587002541575, lr=0.19824800694378603
2023-12-07 00:00:01   INFO  epoch: 41/72, acc_iter=160242, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:32, time_cost(all): 1 day, 13:04:46/1 day, 3:04:37, loss=0.411465669966253, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=4.332853936915937, lr=0.1981340398684182
2023-12-07 00:00:43   INFO  epoch: 41/72, acc_iter=160292, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:34, time_cost(all): 1 day, 13:05:28/1 day, 4:26:27, loss=0.411406472540312, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=4.4038483015026575, lr=0.1980200727930504
2023-12-07 00:01:25   INFO  epoch: 41/72, acc_iter=160342, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:45, time_cost(all): 1 day, 13:06:10/1 day, 2:43:14, loss=0.411347275114371, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=3.821982293944707, lr=0.19790610571768258
2023-12-07 00:02:07   INFO  epoch: 41/72, acc_iter=160392, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:20, time_cost(all): 1 day, 13:06:52/1 day, 2:13:39, loss=0.41128807768843, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=0.508209413076582, lr=0.19779213864231482
2023-12-07 00:02:49   INFO  epoch: 41/72, acc_iter=160442, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:37, time_cost(all): 1 day, 13:07:34/1 day, 3:01:08, loss=0.411228880262489, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=3.6077767383056867, lr=0.19767817156694706
2023-12-07 00:03:30   INFO  epoch: 41/72, acc_iter=160492, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:15, time_cost(all): 1 day, 13:08:15/1 day, 3:21:28, loss=0.411169682836548, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=2.5180422090996664, lr=0.1975642044915792
2023-12-07 00:04:12   INFO  epoch: 41/72, acc_iter=160542, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:24:01, time_cost(all): 1 day, 13:08:57/1 day, 2:30:04, loss=0.411110485410607, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=2.8307504947245943, lr=0.19745023741621143
2023-12-07 00:04:54   INFO  epoch: 41/72, acc_iter=160592, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:24, time_cost(all): 1 day, 13:09:39/1 day, 4:41:26, loss=0.411051287984666, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=2.33925436866998, lr=0.19733627034084367
2023-12-07 00:05:36   INFO  epoch: 41/72, acc_iter=160642, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:47, time_cost(all): 1 day, 13:10:21/1 day, 4:19:07, loss=0.410992090558725, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=1.5760206632929163, lr=0.19722230326547585
2023-12-07 00:06:17   INFO  epoch: 41/72, acc_iter=160692, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:11, time_cost(all): 1 day, 13:11:02/1 day, 3:27:41, loss=0.410932893132784, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=1.3245638284491394, lr=0.19710833619010804
2023-12-07 00:06:59   INFO  epoch: 41/72, acc_iter=160742, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:44, time_cost(all): 1 day, 13:11:44/1 day, 4:29:02, loss=0.410873695706844, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=3.9330642151133066, lr=0.19699436911474028
2023-12-07 00:07:41   INFO  epoch: 41/72, acc_iter=160792, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:34, time_cost(all): 1 day, 13:12:26/1 day, 2:23:52, loss=0.410814498280903, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=1.2295511156992252, lr=0.19688040203937246
2023-12-07 00:08:23   INFO  epoch: 41/72, acc_iter=160842, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:48, time_cost(all): 1 day, 13:13:08/1 day, 3:10:28, loss=0.410755300854962, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=1.277795676281422, lr=0.19676643496400464
2023-12-07 00:09:05   INFO  epoch: 41/72, acc_iter=160892, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:27, time_cost(all): 1 day, 13:13:50/1 day, 3:45:49, loss=0.410696103429021, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=1.963458111275824, lr=0.19665246788863688
2023-12-07 00:09:46   INFO  epoch: 41/72, acc_iter=160942, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:08, time_cost(all): 1 day, 13:14:31/1 day, 1:56:44, loss=0.41063690600308, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=4.887204177113169, lr=0.19653850081326907
2023-12-07 00:10:28   INFO  epoch: 41/72, acc_iter=160992, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:38, time_cost(all): 1 day, 13:15:13/1 day, 3:12:43, loss=0.410577708577139, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=3.826724635627794, lr=0.1964245337379013
2023-12-07 00:11:10   INFO  epoch: 41/72, acc_iter=161042, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:43, time_cost(all): 1 day, 13:15:55/1 day, 3:54:19, loss=0.410518511151198, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=2.35143598680583, lr=0.1963105666625335
2023-12-07 00:11:52   INFO  epoch: 41/72, acc_iter=161092, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:26, time_cost(all): 1 day, 13:16:37/1 day, 4:31:11, loss=0.410459313725257, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=3.009702371523324, lr=0.19619659958716568
2023-12-07 00:12:33   INFO  epoch: 41/72, acc_iter=161142, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:24, time_cost(all): 1 day, 13:17:18/1 day, 3:25:30, loss=0.410400116299316, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=3.81713801796882, lr=0.19608263251179792
2023-12-07 00:13:15   INFO  epoch: 41/72, acc_iter=161192, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:57, time_cost(all): 1 day, 13:18:00/1 day, 2:09:08, loss=0.410340918873375, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=4.704837289538587, lr=0.1959686654364301
2023-12-07 00:13:57   INFO  epoch: 41/72, acc_iter=161242, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:56, time_cost(all): 1 day, 13:18:42/1 day, 4:24:00, loss=0.410281721447434, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=1.4862704272864231, lr=0.19585469836106228
2023-12-07 00:14:39   INFO  epoch: 41/72, acc_iter=161292, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:16, time_cost(all): 1 day, 13:19:24/1 day, 4:22:43, loss=0.410222524021493, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=3.0550894097163175, lr=0.19574073128569452
2023-12-07 00:15:21   INFO  epoch: 41/72, acc_iter=161342, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:16, time_cost(all): 1 day, 13:20:06/1 day, 3:03:00, loss=0.410163326595552, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=1.7836696895270374, lr=0.1956267642103267
2023-12-07 00:16:02   INFO  epoch: 41/72, acc_iter=161392, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:54, time_cost(all): 1 day, 13:20:47/1 day, 2:49:44, loss=0.410104129169611, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=2.5935120203209716, lr=0.19551279713495895
2023-12-07 00:16:44   INFO  epoch: 41/72, acc_iter=161442, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:07, time_cost(all): 1 day, 13:21:29/1 day, 3:32:54, loss=0.41004493174367, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=1.373070530460017, lr=0.19539883005959113
2023-12-07 00:17:26   INFO  epoch: 41/72, acc_iter=161492, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:52, time_cost(all): 1 day, 13:22:11/1 day, 3:00:54, loss=0.409985734317729, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=0.8576721569753989, lr=0.19528486298422332
2023-12-07 00:18:08   INFO  epoch: 41/72, acc_iter=161542, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:40, time_cost(all): 1 day, 13:22:53/1 day, 3:58:44, loss=0.409926536891788, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=4.8608676933017865, lr=0.19517089590885556
2023-12-07 00:18:49   INFO  epoch: 41/72, acc_iter=161592, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:15, time_cost(all): 1 day, 13:23:34/1 day, 2:11:45, loss=0.409867339465848, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=3.4663217646708437, lr=0.19505692883348774
2023-12-07 00:19:31   INFO  epoch: 41/72, acc_iter=161642, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:52, time_cost(all): 1 day, 13:24:16/1 day, 2:13:43, loss=0.409808142039907, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=1.7734365655906759, lr=0.19494296175811993
2023-12-07 00:20:13   INFO  epoch: 41/72, acc_iter=161692, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:13, time_cost(all): 1 day, 13:24:58/1 day, 2:28:40, loss=0.409748944613966, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=0.9708843095610378, lr=0.19482899468275217
2023-12-07 00:20:55   INFO  epoch: 41/72, acc_iter=161742, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:42, time_cost(all): 1 day, 13:25:40/1 day, 3:53:34, loss=0.409689747188025, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=0.9363085458483336, lr=0.1947150276073844
2023-12-07 00:21:37   INFO  epoch: 41/72, acc_iter=161792, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:30, time_cost(all): 1 day, 13:26:22/1 day, 4:08:40, loss=0.409630549762084, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=4.599344885815538, lr=0.19460106053201653
2023-12-07 00:22:18   INFO  epoch: 41/72, acc_iter=161842, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:53, time_cost(all): 1 day, 13:27:03/1 day, 2:29:18, loss=0.409571352336143, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=2.9373550284698253, lr=0.19448709345664877
2023-12-07 00:23:00   INFO  epoch: 41/72, acc_iter=161892, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:17, time_cost(all): 1 day, 13:27:45/1 day, 2:39:10, loss=0.409512154910202, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=3.0184606595826375, lr=0.19437312638128096
2023-12-07 00:23:42   INFO  epoch: 41/72, acc_iter=161942, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:33, time_cost(all): 1 day, 13:28:27/1 day, 1:51:08, loss=0.409452957484261, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=0.7959725619255292, lr=0.1942591593059132
2023-12-07 00:24:24   INFO  epoch: 41/72, acc_iter=161992, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:59, time_cost(all): 1 day, 13:29:09/1 day, 4:24:09, loss=0.40939376005832, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=3.9480339078167166, lr=0.19414519223054538
2023-12-07 00:25:06   INFO  epoch: 41/72, acc_iter=162042, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:11, time_cost(all): 1 day, 13:29:51/1 day, 2:53:39, loss=0.409334562632379, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=4.41510487263258, lr=0.19403122515517757
2023-12-07 00:25:47   INFO  epoch: 41/72, acc_iter=162092, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:37, time_cost(all): 1 day, 13:30:32/1 day, 2:52:14, loss=0.409275365206438, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.87(1.03), norm=1.4905316245142104, lr=0.1939172580798098
2023-12-07 00:26:29   INFO  epoch: 41/72, acc_iter=162142, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 1 day, 13:31:14/1 day, 2:19:27, loss=0.409216167780497, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=2.249635989559674, lr=0.19380329100444205
2023-12-07 00:27:11   INFO  epoch: 41/72, acc_iter=162192, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 13:31:56/1 day, 3:41:14, loss=0.409156970354556, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=1.443269134563633, lr=0.19368932392907418
2023-12-07 00:27:53   INFO  epoch: 42/72, acc_iter=162254, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:01, time_cost(all): 1 day, 13:32:38/1 day, 1:46:00, loss=0.409083565546389, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=2.25560191008969, lr=0.19354800475561817
2023-12-07 00:28:34   INFO  epoch: 42/72, acc_iter=162304, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:56, time_cost(all): 1 day, 13:33:19/1 day, 2:36:29, loss=0.409024368120449, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=3.705506300368715, lr=0.19343403768025036
2023-12-07 00:29:16   INFO  epoch: 42/72, acc_iter=162354, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:54, time_cost(all): 1 day, 13:34:01/1 day, 2:03:48, loss=0.408965170694508, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=0.8523149148790212, lr=0.19332007060488254
2023-12-07 00:29:58   INFO  epoch: 42/72, acc_iter=162404, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:57, time_cost(all): 1 day, 13:34:43/1 day, 2:19:07, loss=0.408905973268567, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=4.033059407712487, lr=0.19320610352951478
2023-12-07 00:30:40   INFO  epoch: 42/72, acc_iter=162454, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:31, time_cost(all): 1 day, 13:35:25/1 day, 2:25:36, loss=0.408846775842626, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=3.961746600372107, lr=0.19309213645414697
2023-12-07 00:31:22   INFO  epoch: 42/72, acc_iter=162504, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:23, time_cost(all): 1 day, 13:36:07/1 day, 3:07:44, loss=0.408787578416685, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=0.7804887665272181, lr=0.19297816937877915
2023-12-07 00:32:03   INFO  epoch: 42/72, acc_iter=162554, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:55, time_cost(all): 1 day, 13:36:48/1 day, 3:17:54, loss=0.408728380990744, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=4.426359858992885, lr=0.1928642023034114
2023-12-07 00:32:45   INFO  epoch: 42/72, acc_iter=162604, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:49, time_cost(all): 1 day, 13:37:30/1 day, 1:56:01, loss=0.408669183564803, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.6301419040140526, lr=0.19275023522804358
2023-12-07 00:33:27   INFO  epoch: 42/72, acc_iter=162654, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:00, time_cost(all): 1 day, 13:38:12/1 day, 2:21:21, loss=0.408609986138862, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=2.02495491468831, lr=0.19263626815267582
2023-12-07 00:34:09   INFO  epoch: 42/72, acc_iter=162704, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:36, time_cost(all): 1 day, 13:38:54/1 day, 3:35:43, loss=0.408550788712921, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=3.2977116246297156, lr=0.192522301077308
2023-12-07 00:34:50   INFO  epoch: 42/72, acc_iter=162754, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:16, time_cost(all): 1 day, 13:39:35/1 day, 2:06:05, loss=0.40849159128698, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=4.059126867483992, lr=0.19240833400194018
2023-12-07 00:35:32   INFO  epoch: 42/72, acc_iter=162804, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:47, time_cost(all): 1 day, 13:40:17/1 day, 3:44:17, loss=0.408432393861039, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=1.911803105348986, lr=0.19229436692657242
2023-12-07 00:36:14   INFO  epoch: 42/72, acc_iter=162854, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:32, time_cost(all): 1 day, 13:40:59/1 day, 1:41:50, loss=0.408373196435098, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=3.251162343323857, lr=0.1921803998512046
2023-12-07 00:36:56   INFO  epoch: 42/72, acc_iter=162904, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:13, time_cost(all): 1 day, 13:41:41/1 day, 3:32:04, loss=0.408313999009157, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=4.48652081969956, lr=0.1920664327758368
2023-12-07 00:37:38   INFO  epoch: 42/72, acc_iter=162954, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:20, time_cost(all): 1 day, 13:42:23/1 day, 2:50:57, loss=0.408254801583216, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=2.600267755528418, lr=0.19195246570046903
2023-12-07 00:38:19   INFO  epoch: 42/72, acc_iter=163004, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:42, time_cost(all): 1 day, 13:43:04/1 day, 2:32:28, loss=0.408195604157275, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=3.3744277533046403, lr=0.19183849862510127
2023-12-07 00:39:01   INFO  epoch: 42/72, acc_iter=163054, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:59, time_cost(all): 1 day, 13:43:46/1 day, 2:33:00, loss=0.408136406731334, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=2.65247280854286, lr=0.1917245315497334
2023-12-07 00:39:43   INFO  epoch: 42/72, acc_iter=163104, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:58, time_cost(all): 1 day, 13:44:28/1 day, 2:47:39, loss=0.408077209305393, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=0.5878129824491524, lr=0.19161056447436564
2023-12-07 00:40:25   INFO  epoch: 42/72, acc_iter=163154, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:42:06, time_cost(all): 1 day, 13:45:10/1 day, 1:35:38, loss=0.408018011879453, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=0.7334653500180777, lr=0.19149659739899783
2023-12-07 00:41:06   INFO  epoch: 42/72, acc_iter=163204, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:41, time_cost(all): 1 day, 13:45:51/1 day, 3:39:05, loss=0.407958814453512, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=4.201513589502655, lr=0.19138263032363007
2023-12-07 00:41:48   INFO  epoch: 42/72, acc_iter=163254, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:23, time_cost(all): 1 day, 13:46:33/1 day, 1:34:33, loss=0.407899617027571, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.5005001286432322, lr=0.19126866324826225
2023-12-07 00:42:30   INFO  epoch: 42/72, acc_iter=163304, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:22, time_cost(all): 1 day, 13:47:15/1 day, 3:33:05, loss=0.40784041960163, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=1.1962692610084427, lr=0.19115469617289443
2023-12-07 00:43:12   INFO  epoch: 42/72, acc_iter=163354, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:23, time_cost(all): 1 day, 13:47:57/1 day, 1:55:51, loss=0.407781222175689, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=0.9916600598326808, lr=0.19104072909752667
2023-12-07 00:43:54   INFO  epoch: 42/72, acc_iter=163404, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:55, time_cost(all): 1 day, 13:48:39/1 day, 3:22:01, loss=0.407722024749748, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=3.532090064040637, lr=0.1909267620221589
2023-12-07 00:44:35   INFO  epoch: 42/72, acc_iter=163454, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:52, time_cost(all): 1 day, 13:49:20/1 day, 3:27:39, loss=0.407662827323807, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=3.531586204539198, lr=0.19081279494679104
2023-12-07 00:45:17   INFO  epoch: 42/72, acc_iter=163504, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:22, time_cost(all): 1 day, 13:50:02/1 day, 3:04:27, loss=0.407603629897866, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=2.4550007375427896, lr=0.19069882787142328
2023-12-07 00:45:59   INFO  epoch: 42/72, acc_iter=163554, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:35, time_cost(all): 1 day, 13:50:44/1 day, 3:21:32, loss=0.407544432471925, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=0.6634599920249741, lr=0.19058486079605552
2023-12-07 00:46:41   INFO  epoch: 42/72, acc_iter=163604, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:21, time_cost(all): 1 day, 13:51:26/1 day, 2:15:19, loss=0.407485235045984, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=4.462369549847818, lr=0.1904708937206877
2023-12-07 00:47:22   INFO  epoch: 42/72, acc_iter=163654, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:47, time_cost(all): 1 day, 13:52:07/1 day, 3:53:44, loss=0.407426037620043, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=4.968345317641166, lr=0.1903569266453199
2023-12-07 00:48:04   INFO  epoch: 42/72, acc_iter=163704, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:45, time_cost(all): 1 day, 13:52:49/1 day, 3:57:43, loss=0.407366840194102, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=4.2576421671536, lr=0.19024295956995213
2023-12-07 00:48:46   INFO  epoch: 42/72, acc_iter=163754, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:22, time_cost(all): 1 day, 13:53:31/1 day, 1:54:13, loss=0.407307642768161, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=4.140756618955056, lr=0.19012899249458431
2023-12-07 00:49:28   INFO  epoch: 42/72, acc_iter=163804, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:29, time_cost(all): 1 day, 13:54:13/1 day, 3:44:06, loss=0.40724844534222, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=1.4907143640444274, lr=0.1900150254192165
2023-12-07 00:50:10   INFO  epoch: 42/72, acc_iter=163854, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:58, time_cost(all): 1 day, 13:54:55/1 day, 2:34:07, loss=0.407189247916279, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=1.7223334957000735, lr=0.18990105834384874
2023-12-07 00:50:51   INFO  epoch: 42/72, acc_iter=163904, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:10, time_cost(all): 1 day, 13:55:36/1 day, 3:51:18, loss=0.407130050490338, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=1.765668356518513, lr=0.18978709126848092
2023-12-07 00:51:33   INFO  epoch: 42/72, acc_iter=163954, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:46, time_cost(all): 1 day, 13:56:18/1 day, 3:32:36, loss=0.407070853064397, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=2.355848012703983, lr=0.18967312419311316
2023-12-07 00:52:15   INFO  epoch: 42/72, acc_iter=164004, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:18, time_cost(all): 1 day, 13:57:00/1 day, 3:15:07, loss=0.407011655638456, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.0974546388035837, lr=0.1895591571177453
2023-12-07 00:52:57   INFO  epoch: 42/72, acc_iter=164054, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:11, time_cost(all): 1 day, 13:57:42/1 day, 1:57:52, loss=0.406952458212516, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=4.934609607670422, lr=0.18944519004237753
2023-12-07 00:53:38   INFO  epoch: 42/72, acc_iter=164104, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:30, time_cost(all): 1 day, 13:58:23/1 day, 3:19:01, loss=0.406893260786575, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=4.592109873656689, lr=0.18933122296700977
2023-12-07 00:54:20   INFO  epoch: 42/72, acc_iter=164154, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:24, time_cost(all): 1 day, 13:59:05/1 day, 3:31:30, loss=0.406834063360634, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=1.0518503555661307, lr=0.18921725589164196
2023-12-07 00:55:02   INFO  epoch: 42/72, acc_iter=164204, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:24:44, time_cost(all): 1 day, 13:59:47/1 day, 1:24:55, loss=0.406774865934693, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=2.13153565317361, lr=0.18910328881627414
2023-12-07 00:55:44   INFO  epoch: 42/72, acc_iter=164254, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:04, time_cost(all): 1 day, 14:00:29/1 day, 2:48:55, loss=0.406715668508752, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=0.6190600157767375, lr=0.18898932174090638
2023-12-07 00:56:26   INFO  epoch: 42/72, acc_iter=164304, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:56, time_cost(all): 1 day, 14:01:11/1 day, 2:52:53, loss=0.406656471082811, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=3.8428123850590126, lr=0.18887535466553856
2023-12-07 00:57:07   INFO  epoch: 42/72, acc_iter=164354, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:03, time_cost(all): 1 day, 14:01:52/1 day, 2:00:39, loss=0.40659727365687, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=0.958645334642254, lr=0.1887613875901708
2023-12-07 00:57:49   INFO  epoch: 42/72, acc_iter=164404, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:01, time_cost(all): 1 day, 14:02:34/1 day, 3:35:56, loss=0.406538076230929, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=2.9076835278126745, lr=0.188647420514803
2023-12-07 00:58:31   INFO  epoch: 42/72, acc_iter=164454, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:09, time_cost(all): 1 day, 14:03:16/1 day, 3:01:14, loss=0.406478878804988, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=3.627286417019316, lr=0.18853345343943517
2023-12-07 00:59:13   INFO  epoch: 42/72, acc_iter=164504, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:26, time_cost(all): 1 day, 14:03:58/1 day, 3:16:44, loss=0.406419681379047, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.92(1.03), norm=3.9048953213001356, lr=0.1884194863640674
2023-12-07 00:59:55   INFO  epoch: 42/72, acc_iter=164554, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:42, time_cost(all): 1 day, 14:04:40/1 day, 3:45:30, loss=0.406360483953106, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=0.9825664376131065, lr=0.1883055192886996
2023-12-07 01:00:36   INFO  epoch: 42/72, acc_iter=164604, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:49, time_cost(all): 1 day, 14:05:21/1 day, 3:44:08, loss=0.406301286527165, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=1.6316555145354144, lr=0.18819155221333178
2023-12-07 01:01:18   INFO  epoch: 42/72, acc_iter=164654, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:41, time_cost(all): 1 day, 14:06:03/1 day, 3:13:46, loss=0.406242089101224, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=3.054136469666197, lr=0.18807758513796402
2023-12-07 01:02:00   INFO  epoch: 42/72, acc_iter=164704, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:45, time_cost(all): 1 day, 14:06:45/1 day, 1:37:28, loss=0.406182891675283, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=0.7744905637434086, lr=0.18796361806259626
2023-12-07 01:02:42   INFO  epoch: 42/72, acc_iter=164754, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:37, time_cost(all): 1 day, 14:07:27/1 day, 2:11:48, loss=0.406123694249342, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.81168871858398, lr=0.1878496509872284
2023-12-07 01:03:23   INFO  epoch: 42/72, acc_iter=164804, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:07, time_cost(all): 1 day, 14:08:08/1 day, 2:11:39, loss=0.406064496823401, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=2.2281104861408143, lr=0.18773568391186063
2023-12-07 01:04:05   INFO  epoch: 42/72, acc_iter=164854, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:29, time_cost(all): 1 day, 14:08:50/1 day, 1:27:55, loss=0.40600529939746, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=4.809425423945466, lr=0.1876217168364928
2023-12-07 01:04:47   INFO  epoch: 42/72, acc_iter=164904, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:53, time_cost(all): 1 day, 14:09:32/1 day, 2:59:30, loss=0.40594610197152, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=0.5087931449645194, lr=0.18750774976112505
2023-12-07 01:05:29   INFO  epoch: 42/72, acc_iter=164954, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:46, time_cost(all): 1 day, 14:10:14/1 day, 3:20:05, loss=0.405886904545579, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.1089182944787, lr=0.18739378268575724
2023-12-07 01:06:11   INFO  epoch: 42/72, acc_iter=165004, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:31, time_cost(all): 1 day, 14:10:56/1 day, 2:10:46, loss=0.405827707119638, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=1.0245381967453955, lr=0.18727981561038942
2023-12-07 01:06:52   INFO  epoch: 42/72, acc_iter=165054, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:43, time_cost(all): 1 day, 14:11:37/1 day, 2:49:11, loss=0.405768509693697, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=1.2383240609383463, lr=0.18716584853502166
2023-12-07 01:07:34   INFO  epoch: 42/72, acc_iter=165104, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:48, time_cost(all): 1 day, 14:12:19/1 day, 2:57:26, loss=0.405709312267756, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=0.8786147387440745, lr=0.1870518814596539
2023-12-07 01:08:16   INFO  epoch: 42/72, acc_iter=165154, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:05, time_cost(all): 1 day, 14:13:01/1 day, 2:56:47, loss=0.405650114841815, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=3.2816743047774777, lr=0.18693791438428603
2023-12-07 01:08:58   INFO  epoch: 42/72, acc_iter=165204, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:00, time_cost(all): 1 day, 14:13:43/1 day, 1:16:20, loss=0.405590917415874, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=3.646585515653992, lr=0.18682394730891827
2023-12-07 01:09:39   INFO  epoch: 42/72, acc_iter=165254, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:11, time_cost(all): 1 day, 14:14:24/1 day, 2:52:44, loss=0.405531719989933, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=2.989344433228737, lr=0.1867099802335505
2023-12-07 01:10:21   INFO  epoch: 42/72, acc_iter=165304, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:39, time_cost(all): 1 day, 14:15:06/1 day, 1:50:37, loss=0.405472522563992, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=1.5987477128995464, lr=0.1865960131581827
2023-12-07 01:11:03   INFO  epoch: 42/72, acc_iter=165354, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:34, time_cost(all): 1 day, 14:15:48/1 day, 1:42:14, loss=0.405413325138051, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=1.1824627392204334, lr=0.18648204608281488
2023-12-07 01:11:45   INFO  epoch: 42/72, acc_iter=165404, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:13, time_cost(all): 1 day, 14:16:30/1 day, 2:09:38, loss=0.40535412771211, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.0796044434954637, lr=0.18636807900744712
2023-12-07 01:12:27   INFO  epoch: 42/72, acc_iter=165454, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:44, time_cost(all): 1 day, 14:17:12/1 day, 2:49:40, loss=0.405294930286169, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=1.068686776908309, lr=0.1862541119320793
2023-12-07 01:13:08   INFO  epoch: 42/72, acc_iter=165504, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:54, time_cost(all): 1 day, 14:17:53/1 day, 3:09:44, loss=0.405235732860228, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=2.0488870196370477, lr=0.1861401448567115
2023-12-07 01:13:50   INFO  epoch: 42/72, acc_iter=165554, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:27, time_cost(all): 1 day, 14:18:35/1 day, 2:37:00, loss=0.405176535434287, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=0.6462546016959781, lr=0.18602617778134367
2023-12-07 01:14:32   INFO  epoch: 42/72, acc_iter=165604, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:30, time_cost(all): 1 day, 14:19:17/1 day, 2:08:22, loss=0.405117338008346, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=2.391869278675563, lr=0.1859122107059759
2023-12-07 01:15:14   INFO  epoch: 42/72, acc_iter=165654, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:32, time_cost(all): 1 day, 14:19:59/1 day, 2:18:24, loss=0.405058140582405, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=4.483687570034829, lr=0.18579824363060815
2023-12-07 01:15:55   INFO  epoch: 42/72, acc_iter=165704, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:15, time_cost(all): 1 day, 14:20:40/1 day, 3:05:51, loss=0.404998943156465, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=4.957903760541499, lr=0.18568427655524028
2023-12-07 01:16:37   INFO  epoch: 42/72, acc_iter=165754, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:31, time_cost(all): 1 day, 14:21:22/1 day, 2:05:11, loss=0.404939745730524, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=3.041149744793352, lr=0.18557030947987252
2023-12-07 01:17:19   INFO  epoch: 42/72, acc_iter=165804, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:32, time_cost(all): 1 day, 14:22:04/1 day, 3:16:26, loss=0.404880548304583, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=2.0939776715336214, lr=0.18545634240450476
2023-12-07 01:18:01   INFO  epoch: 42/72, acc_iter=165854, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:58, time_cost(all): 1 day, 14:22:46/1 day, 1:29:17, loss=0.404821350878642, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=1.4448447940788467, lr=0.18534237532913694
2023-12-07 01:18:43   INFO  epoch: 42/72, acc_iter=165904, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:10, time_cost(all): 1 day, 14:23:28/1 day, 2:05:18, loss=0.404762153452701, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=1.4866381392236327, lr=0.18522840825376913
2023-12-07 01:19:24   INFO  epoch: 42/72, acc_iter=165954, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:35, time_cost(all): 1 day, 14:24:09/1 day, 1:03:54, loss=0.40470295602676, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=1.7579379682300518, lr=0.18511444117840137
2023-12-07 01:20:06   INFO  epoch: 42/72, acc_iter=166004, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 1 day, 14:24:51/1 day, 2:04:13, loss=0.404643758600819, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=1.6834983244609376, lr=0.18500047410303355
2023-12-07 01:20:48   INFO  epoch: 42/72, acc_iter=166054, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 14:25:33/1 day, 2:12:32, loss=0.404584561174878, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=1.789903759175608, lr=0.1848865070276658
2023-12-07 01:21:30   INFO  epoch: 43/72, acc_iter=166116, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:06, time_cost(all): 1 day, 14:26:15/1 day, 2:55:57, loss=0.404511156366711, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=0.8503173888630882, lr=0.18474518785420968
2023-12-07 01:22:11   INFO  epoch: 43/72, acc_iter=166166, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:36, time_cost(all): 1 day, 14:26:56/1 day, 2:17:40, loss=0.40445195894077, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=4.9837669854646265, lr=0.18463122077884192
2023-12-07 01:22:53   INFO  epoch: 43/72, acc_iter=166216, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:25, time_cost(all): 1 day, 14:27:38/1 day, 1:53:39, loss=0.404392761514829, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=0.6658155080058266, lr=0.1845172537034741
2023-12-07 01:23:35   INFO  epoch: 43/72, acc_iter=166266, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:24, time_cost(all): 1 day, 14:28:20/1 day, 0:56:43, loss=0.404333564088888, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=4.613615333255621, lr=0.1844032866281063
2023-12-07 01:24:17   INFO  epoch: 43/72, acc_iter=166316, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:10, time_cost(all): 1 day, 14:29:02/1 day, 1:33:07, loss=0.404274366662947, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.84(1.03), norm=4.132510693429813, lr=0.18428931955273853
2023-12-07 01:24:59   INFO  epoch: 43/72, acc_iter=166366, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:11, time_cost(all): 1 day, 14:29:44/1 day, 1:33:54, loss=0.404215169237006, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=3.150707703023969, lr=0.18417535247737077
2023-12-07 01:25:40   INFO  epoch: 43/72, acc_iter=166416, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:56, time_cost(all): 1 day, 14:30:25/1 day, 1:45:09, loss=0.404155971811065, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=4.855172140350661, lr=0.1840613854020029
2023-12-07 01:26:22   INFO  epoch: 43/72, acc_iter=166466, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:30, time_cost(all): 1 day, 14:31:07/1 day, 1:26:03, loss=0.404096774385125, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=4.009494626731086, lr=0.18394741832663514
2023-12-07 01:27:04   INFO  epoch: 43/72, acc_iter=166516, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:33, time_cost(all): 1 day, 14:31:49/1 day, 3:13:48, loss=0.404037576959184, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=3.8830959561834884, lr=0.18383345125126738
2023-12-07 01:27:46   INFO  epoch: 43/72, acc_iter=166566, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:57, time_cost(all): 1 day, 14:32:31/1 day, 1:52:42, loss=0.403978379533243, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=1.58619886178069, lr=0.18371948417589956
2023-12-07 01:28:27   INFO  epoch: 43/72, acc_iter=166616, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:34, time_cost(all): 1 day, 14:33:12/1 day, 1:58:22, loss=0.403919182107302, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=4.071490027435328, lr=0.18360551710053175
2023-12-07 01:29:09   INFO  epoch: 43/72, acc_iter=166666, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:35, time_cost(all): 1 day, 14:33:54/1 day, 2:23:42, loss=0.403859984681361, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=1.5859894827871428, lr=0.18349155002516399
2023-12-07 01:29:51   INFO  epoch: 43/72, acc_iter=166716, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:13, time_cost(all): 1 day, 14:34:36/1 day, 2:28:39, loss=0.40380078725542, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=4.695022087363165, lr=0.18337758294979617
2023-12-07 01:30:33   INFO  epoch: 43/72, acc_iter=166766, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:41, time_cost(all): 1 day, 14:35:18/1 day, 2:55:53, loss=0.403741589829479, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=3.812298235365816, lr=0.18326361587442835
2023-12-07 01:31:15   INFO  epoch: 43/72, acc_iter=166816, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:44, time_cost(all): 1 day, 14:36:00/1 day, 0:42:30, loss=0.403682392403538, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=2.981587871050073, lr=0.18314964879906054
2023-12-07 01:31:56   INFO  epoch: 43/72, acc_iter=166866, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:46, time_cost(all): 1 day, 14:36:41/1 day, 1:42:57, loss=0.403623194977597, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=1.5533336369799888, lr=0.18303568172369278
2023-12-07 01:32:38   INFO  epoch: 43/72, acc_iter=166916, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:39:59, time_cost(all): 1 day, 14:37:23/1 day, 1:51:35, loss=0.403563997551656, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=2.4567572663255635, lr=0.18292171464832502
2023-12-07 01:33:20   INFO  epoch: 43/72, acc_iter=166966, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:40, time_cost(all): 1 day, 14:38:05/1 day, 1:39:42, loss=0.403504800125715, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=4.0783663213483985, lr=0.18280774757295715
2023-12-07 01:34:02   INFO  epoch: 43/72, acc_iter=167016, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:19, time_cost(all): 1 day, 14:38:47/1 day, 1:09:14, loss=0.403445602699774, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=1.4199376385751, lr=0.1826937804975894
2023-12-07 01:34:44   INFO  epoch: 43/72, acc_iter=167066, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:51, time_cost(all): 1 day, 14:39:29/1 day, 2:22:29, loss=0.403386405273833, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=2.835954194777417, lr=0.18257981342222163
2023-12-07 01:35:25   INFO  epoch: 43/72, acc_iter=167116, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:22, time_cost(all): 1 day, 14:40:10/1 day, 2:25:42, loss=0.403327207847892, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=2.216676819982836, lr=0.1824658463468538
2023-12-07 01:36:07   INFO  epoch: 43/72, acc_iter=167166, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:40:16, time_cost(all): 1 day, 14:40:52/1 day, 0:49:28, loss=0.403268010421951, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.1404954055066665, lr=0.182351879271486
2023-12-07 01:36:49   INFO  epoch: 43/72, acc_iter=167216, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:41, time_cost(all): 1 day, 14:41:34/1 day, 0:39:30, loss=0.40320881299601, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=3.133899791175036, lr=0.18223791219611823
2023-12-07 01:37:31   INFO  epoch: 43/72, acc_iter=167266, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:19, time_cost(all): 1 day, 14:42:16/1 day, 2:26:06, loss=0.40314961557007, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=3.936547782260628, lr=0.18212394512075042
2023-12-07 01:38:12   INFO  epoch: 43/72, acc_iter=167316, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:42, time_cost(all): 1 day, 14:42:57/1 day, 0:47:18, loss=0.403090418144129, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=4.9992683850897075, lr=0.18200997804538266
2023-12-07 01:38:54   INFO  epoch: 43/72, acc_iter=167366, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:29, time_cost(all): 1 day, 14:43:39/1 day, 1:38:29, loss=0.403031220718188, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=1.112985598618348, lr=0.18189601097001484
2023-12-07 01:39:36   INFO  epoch: 43/72, acc_iter=167416, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:11, time_cost(all): 1 day, 14:44:21/1 day, 1:37:53, loss=0.402972023292247, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=3.7038483669300986, lr=0.18178204389464703
2023-12-07 01:40:18   INFO  epoch: 43/72, acc_iter=167466, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:51, time_cost(all): 1 day, 14:45:03/1 day, 0:55:56, loss=0.402912825866306, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=2.919172190895484, lr=0.18166807681927927
2023-12-07 01:41:00   INFO  epoch: 43/72, acc_iter=167516, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:14, time_cost(all): 1 day, 14:45:45/1 day, 3:01:17, loss=0.402853628440365, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=1.8809075490730693, lr=0.18155410974391145
2023-12-07 01:41:41   INFO  epoch: 43/72, acc_iter=167566, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:31, time_cost(all): 1 day, 14:46:26/1 day, 1:00:47, loss=0.402794431014424, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=1.4121747487747316, lr=0.18144014266854364
2023-12-07 01:42:23   INFO  epoch: 43/72, acc_iter=167616, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:55, time_cost(all): 1 day, 14:47:08/1 day, 1:25:56, loss=0.402735233588483, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=2.099305542172602, lr=0.18132617559317588
2023-12-07 01:43:05   INFO  epoch: 43/72, acc_iter=167666, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:12, time_cost(all): 1 day, 14:47:50/1 day, 0:55:40, loss=0.402676036162542, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=0.5187027956648532, lr=0.18121220851780806
2023-12-07 01:43:47   INFO  epoch: 43/72, acc_iter=167716, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:06, time_cost(all): 1 day, 14:48:32/1 day, 0:54:34, loss=0.402616838736601, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=4.384287829021922, lr=0.18109824144244024
2023-12-07 01:44:28   INFO  epoch: 43/72, acc_iter=167766, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:07, time_cost(all): 1 day, 14:49:13/1 day, 2:10:52, loss=0.40255764131066, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=3.1244356655999668, lr=0.18098427436707248
2023-12-07 01:45:10   INFO  epoch: 43/72, acc_iter=167816, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:21, time_cost(all): 1 day, 14:49:55/1 day, 1:36:52, loss=0.402498443884719, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=0.9480414070723742, lr=0.18087030729170467
2023-12-07 01:45:52   INFO  epoch: 43/72, acc_iter=167866, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:31, time_cost(all): 1 day, 14:50:37/1 day, 1:01:17, loss=0.402439246458778, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=3.4139183534867077, lr=0.1807563402163369
2023-12-07 01:46:34   INFO  epoch: 43/72, acc_iter=167916, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:17, time_cost(all): 1 day, 14:51:19/1 day, 2:00:26, loss=0.402380049032837, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=1.3854903295783283, lr=0.1806423731409691
2023-12-07 01:47:16   INFO  epoch: 43/72, acc_iter=167966, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:37, time_cost(all): 1 day, 14:52:01/1 day, 1:08:10, loss=0.402320851606896, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=3.199807558673272, lr=0.18052840606560128
2023-12-07 01:47:57   INFO  epoch: 43/72, acc_iter=168016, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:35, time_cost(all): 1 day, 14:52:42/1 day, 2:53:23, loss=0.402261654180955, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=2.8354029164678334, lr=0.18041443899023352
2023-12-07 01:48:39   INFO  epoch: 43/72, acc_iter=168066, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:26, time_cost(all): 1 day, 14:53:24/1 day, 2:29:33, loss=0.402202456755014, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=1.297157734339387, lr=0.18030047191486576
2023-12-07 01:49:21   INFO  epoch: 43/72, acc_iter=168116, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:33, time_cost(all): 1 day, 14:54:06/1 day, 0:42:55, loss=0.402143259329074, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=2.4480019133432656, lr=0.18018650483949789
2023-12-07 01:50:03   INFO  epoch: 43/72, acc_iter=168166, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:27, time_cost(all): 1 day, 14:54:48/1 day, 1:42:23, loss=0.402084061903133, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=4.558897398305878, lr=0.18007253776413013
2023-12-07 01:50:44   INFO  epoch: 43/72, acc_iter=168216, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:23, time_cost(all): 1 day, 14:55:29/1 day, 0:22:27, loss=0.402024864477192, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=3.911886912057583, lr=0.17995857068876236
2023-12-07 01:51:26   INFO  epoch: 43/72, acc_iter=168266, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:13, time_cost(all): 1 day, 14:56:11/1 day, 2:16:21, loss=0.401965667051251, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=2.2863810288103696, lr=0.17984460361339455
2023-12-07 01:52:08   INFO  epoch: 43/72, acc_iter=168316, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:39, time_cost(all): 1 day, 14:56:53/1 day, 0:50:48, loss=0.40190646962531, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=0.5264148740510415, lr=0.17973063653802673
2023-12-07 01:52:50   INFO  epoch: 43/72, acc_iter=168366, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:02, time_cost(all): 1 day, 14:57:35/1 day, 1:07:07, loss=0.401847272199369, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=4.560019026609498, lr=0.17961666946265892
2023-12-07 01:53:32   INFO  epoch: 43/72, acc_iter=168416, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:38, time_cost(all): 1 day, 14:58:17/1 day, 1:06:37, loss=0.401788074773428, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=4.9109524446632635, lr=0.17950270238729116
2023-12-07 01:54:13   INFO  epoch: 43/72, acc_iter=168466, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:25, time_cost(all): 1 day, 14:58:58/1 day, 2:08:51, loss=0.401728877347487, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=4.045883669948393, lr=0.17938873531192334
2023-12-07 01:54:55   INFO  epoch: 43/72, acc_iter=168516, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:26, time_cost(all): 1 day, 14:59:40/1 day, 1:10:03, loss=0.401669679921546, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=2.526231758711419, lr=0.17927476823655553
2023-12-07 01:55:37   INFO  epoch: 43/72, acc_iter=168566, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:48, time_cost(all): 1 day, 15:00:22/1 day, 2:30:10, loss=0.401610482495605, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=3.3580288303084833, lr=0.17916080116118777
2023-12-07 01:56:19   INFO  epoch: 43/72, acc_iter=168616, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:34, time_cost(all): 1 day, 15:01:04/1 day, 0:45:06, loss=0.401551285069664, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=4.408288262136123, lr=0.17904683408582
2023-12-07 01:57:00   INFO  epoch: 43/72, acc_iter=168666, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:47, time_cost(all): 1 day, 15:01:45/1 day, 1:15:45, loss=0.401492087643723, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=2.949691994162284, lr=0.17893286701045213
2023-12-07 01:57:42   INFO  epoch: 43/72, acc_iter=168716, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:40, time_cost(all): 1 day, 15:02:27/1 day, 0:51:40, loss=0.401432890217782, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=2.085159340074089, lr=0.17881889993508437
2023-12-07 01:58:24   INFO  epoch: 43/72, acc_iter=168766, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:56, time_cost(all): 1 day, 15:03:09/1 day, 1:25:37, loss=0.401373692791841, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=0.7073231237081539, lr=0.17870493285971661
2023-12-07 01:59:06   INFO  epoch: 43/72, acc_iter=168816, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:49, time_cost(all): 1 day, 15:03:51/1 day, 1:55:25, loss=0.4013144953659, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=2.558028368049759, lr=0.1785909657843488
2023-12-07 01:59:48   INFO  epoch: 43/72, acc_iter=168866, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:47, time_cost(all): 1 day, 15:04:33/1 day, 1:04:59, loss=0.401255297939959, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=2.932376573584146, lr=0.17847699870898098
2023-12-07 02:00:29   INFO  epoch: 43/72, acc_iter=168916, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:47, time_cost(all): 1 day, 15:05:14/1 day, 0:15:59, loss=0.401196100514018, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=2.386114583007261, lr=0.17836303163361322
2023-12-07 02:01:11   INFO  epoch: 43/72, acc_iter=168966, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:07, time_cost(all): 1 day, 15:05:56/1 day, 2:37:58, loss=0.401136903088077, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=2.030829935403464, lr=0.1782490645582454
2023-12-07 02:01:53   INFO  epoch: 43/72, acc_iter=169016, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:36, time_cost(all): 1 day, 15:06:38/1 day, 0:40:46, loss=0.401077705662137, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=2.086102637139586, lr=0.17813509748287765
2023-12-07 02:02:35   INFO  epoch: 43/72, acc_iter=169066, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:58, time_cost(all): 1 day, 15:07:20/1 day, 1:20:48, loss=0.401018508236196, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=2.6649531989359225, lr=0.17802113040750983
2023-12-07 02:03:16   INFO  epoch: 43/72, acc_iter=169116, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:38, time_cost(all): 1 day, 15:08:01/1 day, 1:44:36, loss=0.400959310810255, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=4.710179368786742, lr=0.17790716333214202
2023-12-07 02:03:58   INFO  epoch: 43/72, acc_iter=169166, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:11:03, time_cost(all): 1 day, 15:08:43/1 day, 0:54:39, loss=0.400900113384314, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=1.3093117888894508, lr=0.17779319625677426
2023-12-07 02:04:40   INFO  epoch: 43/72, acc_iter=169216, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:43, time_cost(all): 1 day, 15:09:25/1 day, 1:00:34, loss=0.400840915958373, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=2.4353366987521, lr=0.17767922918140644
2023-12-07 02:05:22   INFO  epoch: 43/72, acc_iter=169266, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:53, time_cost(all): 1 day, 15:10:07/1 day, 2:25:01, loss=0.400781718532432, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=3.6821698258007958, lr=0.17756526210603862
2023-12-07 02:06:04   INFO  epoch: 43/72, acc_iter=169316, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:33, time_cost(all): 1 day, 15:10:49/1 day, 0:14:00, loss=0.400722521106491, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=1.0268146566400427, lr=0.17745129503067086
2023-12-07 02:06:45   INFO  epoch: 43/72, acc_iter=169366, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:43, time_cost(all): 1 day, 15:11:30/1 day, 2:14:51, loss=0.40066332368055, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=4.496193440071296, lr=0.17733732795530305
2023-12-07 02:07:27   INFO  epoch: 43/72, acc_iter=169416, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:52, time_cost(all): 1 day, 15:12:12/1 day, 2:01:56, loss=0.400604126254609, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=2.5188391613906957, lr=0.17722336087993523
2023-12-07 02:08:09   INFO  epoch: 43/72, acc_iter=169466, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:13, time_cost(all): 1 day, 15:12:54/1 day, 1:00:31, loss=0.400544928828668, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=3.8094129042875124, lr=0.17710939380456747
2023-12-07 02:08:51   INFO  epoch: 43/72, acc_iter=169516, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:37, time_cost(all): 1 day, 15:13:36/1 day, 1:25:42, loss=0.400485731402727, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=3.0892814057583107, lr=0.17699542672919966
2023-12-07 02:09:33   INFO  epoch: 43/72, acc_iter=169566, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:53, time_cost(all): 1 day, 15:14:18/1 day, 1:35:57, loss=0.400426533976786, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=3.5312007819417284, lr=0.1768814596538319
2023-12-07 02:10:14   INFO  epoch: 43/72, acc_iter=169616, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:20, time_cost(all): 1 day, 15:14:59/1 day, 2:22:53, loss=0.400367336550845, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=4.714993275906674, lr=0.17676749257846408
2023-12-07 02:10:56   INFO  epoch: 43/72, acc_iter=169666, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:47, time_cost(all): 1 day, 15:15:41/1 day, 1:38:42, loss=0.400308139124904, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.91(1.03), norm=4.765078995759699, lr=0.17665352550309626
2023-12-07 02:11:38   INFO  epoch: 43/72, acc_iter=169716, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:59, time_cost(all): 1 day, 15:16:23/1 day, 1:55:33, loss=0.400248941698963, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=2.0606196432786166, lr=0.1765395584277285
2023-12-07 02:12:20   INFO  epoch: 43/72, acc_iter=169766, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:09, time_cost(all): 1 day, 15:17:05/1 day, 0:49:57, loss=0.400189744273022, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=2.501737354389459, lr=0.17642559135236074
2023-12-07 02:13:01   INFO  epoch: 43/72, acc_iter=169816, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:34, time_cost(all): 1 day, 15:17:46/1 day, 1:35:12, loss=0.400130546847081, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=2.181232043241448, lr=0.17631162427699287
2023-12-07 02:13:43   INFO  epoch: 43/72, acc_iter=169866, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 1 day, 15:18:28/1 day, 2:10:08, loss=0.400071349421141, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=2.2741673323951193, lr=0.1761976572016251
2023-12-07 02:14:25   INFO  epoch: 43/72, acc_iter=169916, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 15:19:10/1 day, 0:46:46, loss=0.4000121519952, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=2.889112819426052, lr=0.17608369012625735
2023-12-07 02:15:07   INFO  epoch: 44/72, acc_iter=169978, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:28, time_cost(all): 1 day, 15:19:52/1 day, 2:18:45, loss=0.399938747187033, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=2.9368133755637884, lr=0.17594237095280124
2023-12-07 02:15:49   INFO  epoch: 44/72, acc_iter=170028, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:58, time_cost(all): 1 day, 15:20:34/1 day, 1:41:39, loss=0.399879549761092, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=2.9799737577158885, lr=0.17582840387743348
2023-12-07 02:16:30   INFO  epoch: 44/72, acc_iter=170078, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:15, time_cost(all): 1 day, 15:21:15/1 day, 1:10:48, loss=0.399820352335151, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=2.2101719202598122, lr=0.17571443680206567
2023-12-07 02:17:12   INFO  epoch: 44/72, acc_iter=170128, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:48:59, time_cost(all): 1 day, 15:21:57/1 day, 0:27:24, loss=0.39976115490921, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=1.8908205121359725, lr=0.17560046972669785
2023-12-07 02:17:54   INFO  epoch: 44/72, acc_iter=170178, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:00, time_cost(all): 1 day, 15:22:39/1 day, 2:23:52, loss=0.399701957483269, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=3.00110992383819, lr=0.1754865026513301
2023-12-07 02:18:36   INFO  epoch: 44/72, acc_iter=170228, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:42, time_cost(all): 1 day, 15:23:21/1 day, 0:19:01, loss=0.399642760057328, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=2.76553203894651, lr=0.17537253557596227
2023-12-07 02:19:17   INFO  epoch: 44/72, acc_iter=170278, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:36, time_cost(all): 1 day, 15:24:02/1 day, 2:05:52, loss=0.399583562631387, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.15(1.03), norm=3.7952092969427236, lr=0.1752585685005945
2023-12-07 02:19:59   INFO  epoch: 44/72, acc_iter=170328, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:58, time_cost(all): 1 day, 15:24:44/1 day, 0:33:20, loss=0.399524365205446, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=3.233993694064992, lr=0.1751446014252267
2023-12-07 02:20:41   INFO  epoch: 44/72, acc_iter=170378, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:08, time_cost(all): 1 day, 15:25:26/1 day, 0:32:28, loss=0.399465167779505, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=3.673133263236447, lr=0.17503063434985888
2023-12-07 02:21:23   INFO  epoch: 44/72, acc_iter=170428, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:33, time_cost(all): 1 day, 15:26:08/1 day, 0:46:57, loss=0.399405970353564, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=1.6402715996919106, lr=0.17491666727449112
2023-12-07 02:22:05   INFO  epoch: 44/72, acc_iter=170478, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:56, time_cost(all): 1 day, 15:26:50/1 day, 1:41:50, loss=0.399346772927623, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=4.944987676579866, lr=0.1748027001991233
2023-12-07 02:22:46   INFO  epoch: 44/72, acc_iter=170528, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:52, time_cost(all): 1 day, 15:27:31/1 day, 0:17:53, loss=0.399287575501682, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=3.966541169291852, lr=0.1746887331237555
2023-12-07 02:23:28   INFO  epoch: 44/72, acc_iter=170578, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:55, time_cost(all): 1 day, 15:28:13/1 day, 2:07:27, loss=0.399228378075742, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=0.5236620424803, lr=0.17457476604838773
2023-12-07 02:24:10   INFO  epoch: 44/72, acc_iter=170628, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:55, time_cost(all): 1 day, 15:28:55/1 day, 0:42:04, loss=0.399169180649801, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=3.6282809100926414, lr=0.17446079897301991
2023-12-07 02:24:52   INFO  epoch: 44/72, acc_iter=170678, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:27, time_cost(all): 1 day, 15:29:37/1 day, 2:14:58, loss=0.39910998322386, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=1.2473085227214946, lr=0.1743468318976521
2023-12-07 02:25:33   INFO  epoch: 44/72, acc_iter=170728, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:32, time_cost(all): 1 day, 15:30:18/1 day, 0:17:57, loss=0.399050785797919, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=3.5295431473580035, lr=0.17423286482228434
2023-12-07 02:26:15   INFO  epoch: 44/72, acc_iter=170778, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:19, time_cost(all): 1 day, 15:31:00/1 day, 1:12:19, loss=0.398991588371978, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=1.1608721684199215, lr=0.17411889774691652
2023-12-07 02:26:57   INFO  epoch: 44/72, acc_iter=170828, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:25, time_cost(all): 1 day, 15:31:42/1 day, 0:58:43, loss=0.398932390946037, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=4.45795128934623, lr=0.17400493067154876
2023-12-07 02:27:39   INFO  epoch: 44/72, acc_iter=170878, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:50, time_cost(all): 1 day, 15:32:24/1 day, 0:16:49, loss=0.398873193520096, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=2.821084121286992, lr=0.17389096359618095
2023-12-07 02:28:21   INFO  epoch: 44/72, acc_iter=170928, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:54, time_cost(all): 1 day, 15:33:06/1 day, 1:26:39, loss=0.398813996094155, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=4.250825175404506, lr=0.17377699652081313
2023-12-07 02:29:02   INFO  epoch: 44/72, acc_iter=170978, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:18, time_cost(all): 1 day, 15:33:47/1 day, 2:11:34, loss=0.398754798668214, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=1.2201643326951468, lr=0.17366302944544537
2023-12-07 02:29:44   INFO  epoch: 44/72, acc_iter=171028, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:46, time_cost(all): 1 day, 15:34:29/1 day, 2:11:30, loss=0.398695601242273, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=3.500999468809695, lr=0.1735490623700776
2023-12-07 02:30:26   INFO  epoch: 44/72, acc_iter=171078, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:11, time_cost(all): 1 day, 15:35:11/1 day, 0:36:03, loss=0.398636403816332, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=1.3823696895495778, lr=0.17343509529470974
2023-12-07 02:31:08   INFO  epoch: 44/72, acc_iter=171128, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:39, time_cost(all): 1 day, 15:35:53/1 day, 1:57:05, loss=0.398577206390391, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.1(1.03), norm=0.504418664629916, lr=0.17332112821934198
2023-12-07 02:31:49   INFO  epoch: 44/72, acc_iter=171178, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:38:06, time_cost(all): 1 day, 15:36:34/1 day, 1:02:41, loss=0.39851800896445, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=1.9099403236512509, lr=0.17320716114397416
2023-12-07 02:32:31   INFO  epoch: 44/72, acc_iter=171228, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:58, time_cost(all): 1 day, 15:37:16/1 day, 0:27:55, loss=0.398458811538509, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=0.9769516038121446, lr=0.1730931940686064
2023-12-07 02:33:13   INFO  epoch: 44/72, acc_iter=171278, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:38, time_cost(all): 1 day, 15:37:58/1 day, 1:57:25, loss=0.398399614112568, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=2.1932084398313108, lr=0.1729792269932386
2023-12-07 02:33:55   INFO  epoch: 44/72, acc_iter=171328, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:02, time_cost(all): 1 day, 15:38:40/1 day, 0:36:32, loss=0.398340416686627, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=1.1264183606454634, lr=0.17286525991787077
2023-12-07 02:34:37   INFO  epoch: 44/72, acc_iter=171378, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:31:58, time_cost(all): 1 day, 15:39:22/1 day, 0:40:04, loss=0.398281219260686, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=1.5466395944308815, lr=0.172751292842503
2023-12-07 02:35:18   INFO  epoch: 44/72, acc_iter=171428, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:49, time_cost(all): 1 day, 15:40:03/1 day, 0:25:15, loss=0.398222021834746, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=3.247413035957246, lr=0.1726373257671352
2023-12-07 02:36:00   INFO  epoch: 44/72, acc_iter=171478, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:31, time_cost(all): 1 day, 15:40:45/23:57:46, loss=0.398162824408805, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=3.8943362088912297, lr=0.17252335869176738
2023-12-07 02:36:42   INFO  epoch: 44/72, acc_iter=171528, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:37, time_cost(all): 1 day, 15:41:27/1 day, 0:47:58, loss=0.398103626982864, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=0.6647538892687368, lr=0.17240939161639962
2023-12-07 02:37:24   INFO  epoch: 44/72, acc_iter=171578, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:45, time_cost(all): 1 day, 15:42:09/23:37:40, loss=0.398044429556923, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=4.347875683180564, lr=0.17229542454103186
2023-12-07 02:38:05   INFO  epoch: 44/72, acc_iter=171628, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:57, time_cost(all): 1 day, 15:42:50/1 day, 1:29:09, loss=0.397985232130982, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=2.607898942154156, lr=0.172181457465664
2023-12-07 02:38:47   INFO  epoch: 44/72, acc_iter=171678, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:41, time_cost(all): 1 day, 15:43:32/1 day, 0:01:19, loss=0.397926034705041, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=2.908130354542295, lr=0.17206749039029623
2023-12-07 02:39:29   INFO  epoch: 44/72, acc_iter=171728, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:21, time_cost(all): 1 day, 15:44:14/1 day, 2:00:56, loss=0.3978668372791, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=0.7589539239994187, lr=0.17195352331492847
2023-12-07 02:40:11   INFO  epoch: 44/72, acc_iter=171778, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:20, time_cost(all): 1 day, 15:44:56/1 day, 1:29:29, loss=0.397807639853159, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=1.4415239313776949, lr=0.17183955623956065
2023-12-07 02:40:53   INFO  epoch: 44/72, acc_iter=171828, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:30, time_cost(all): 1 day, 15:45:38/1 day, 0:49:26, loss=0.397748442427218, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=0.9599617569833361, lr=0.17172558916419284
2023-12-07 02:41:34   INFO  epoch: 44/72, acc_iter=171878, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:59, time_cost(all): 1 day, 15:46:19/1 day, 1:48:31, loss=0.397689245001277, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=3.253781181122906, lr=0.17161162208882508
2023-12-07 02:42:16   INFO  epoch: 44/72, acc_iter=171928, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:49, time_cost(all): 1 day, 15:47:01/1 day, 0:17:11, loss=0.397630047575336, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=2.7837501736255637, lr=0.17149765501345726
2023-12-07 02:42:58   INFO  epoch: 44/72, acc_iter=171978, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:03, time_cost(all): 1 day, 15:47:43/1 day, 0:44:01, loss=0.397570850149395, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=1.0352959405094349, lr=0.1713836879380895
2023-12-07 02:43:40   INFO  epoch: 44/72, acc_iter=172028, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:00, time_cost(all): 1 day, 15:48:25/1 day, 0:48:10, loss=0.397511652723454, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=2.180636734622299, lr=0.17126972086272163
2023-12-07 02:44:22   INFO  epoch: 44/72, acc_iter=172078, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:12, time_cost(all): 1 day, 15:49:07/1 day, 1:32:02, loss=0.397452455297513, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=1.9306805249679233, lr=0.17115575378735387
2023-12-07 02:45:03   INFO  epoch: 44/72, acc_iter=172128, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:25, time_cost(all): 1 day, 15:49:48/1 day, 1:41:25, loss=0.397393257871572, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.420739705973016, lr=0.1710417867119861
2023-12-07 02:45:45   INFO  epoch: 44/72, acc_iter=172178, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:59, time_cost(all): 1 day, 15:50:30/1 day, 0:33:14, loss=0.397334060445631, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=2.785727069273726, lr=0.1709278196366183
2023-12-07 02:46:27   INFO  epoch: 44/72, acc_iter=172228, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:32, time_cost(all): 1 day, 15:51:12/23:29:46, loss=0.397274863019691, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=3.554639559006483, lr=0.17081385256125048
2023-12-07 02:47:09   INFO  epoch: 44/72, acc_iter=172278, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:38, time_cost(all): 1 day, 15:51:54/23:46:36, loss=0.39721566559375, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=0.9417932230544215, lr=0.17069988548588272
2023-12-07 02:47:50   INFO  epoch: 44/72, acc_iter=172328, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:20, time_cost(all): 1 day, 15:52:35/23:47:05, loss=0.397156468167809, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=2.233969354555626, lr=0.1705859184105149
2023-12-07 02:48:32   INFO  epoch: 44/72, acc_iter=172378, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:00, time_cost(all): 1 day, 15:53:17/1 day, 1:43:01, loss=0.397097270741868, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=3.950891628340824, lr=0.1704719513351471
2023-12-07 02:49:14   INFO  epoch: 44/72, acc_iter=172428, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:08, time_cost(all): 1 day, 15:53:59/1 day, 0:20:35, loss=0.397038073315927, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=1.5827618362672782, lr=0.17035798425977933
2023-12-07 02:49:56   INFO  epoch: 44/72, acc_iter=172478, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:57, time_cost(all): 1 day, 15:54:41/1 day, 1:49:57, loss=0.396978875889986, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=4.650075182040708, lr=0.1702440171844115
2023-12-07 02:50:38   INFO  epoch: 44/72, acc_iter=172528, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:32, time_cost(all): 1 day, 15:55:23/1 day, 1:09:18, loss=0.396919678464045, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=0.5883413095673162, lr=0.17013005010904375
2023-12-07 02:51:19   INFO  epoch: 44/72, acc_iter=172578, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:41, time_cost(all): 1 day, 15:56:04/23:48:11, loss=0.396860481038104, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=4.956978070941938, lr=0.17001608303367594
2023-12-07 02:52:01   INFO  epoch: 44/72, acc_iter=172628, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:03, time_cost(all): 1 day, 15:56:46/1 day, 1:31:13, loss=0.396801283612163, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=4.467749055919291, lr=0.16990211595830812
2023-12-07 02:52:43   INFO  epoch: 44/72, acc_iter=172678, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:29, time_cost(all): 1 day, 15:57:28/1 day, 1:16:19, loss=0.396742086186222, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=4.690335532815118, lr=0.16978814888294036
2023-12-07 02:53:25   INFO  epoch: 44/72, acc_iter=172728, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:42, time_cost(all): 1 day, 15:58:10/1 day, 0:21:37, loss=0.396682888760281, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=3.0530036718465423, lr=0.1696741818075726
2023-12-07 02:54:06   INFO  epoch: 44/72, acc_iter=172778, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:36, time_cost(all): 1 day, 15:58:51/1 day, 1:32:22, loss=0.39662369133434, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=2.3987567666873884, lr=0.16956021473220473
2023-12-07 02:54:48   INFO  epoch: 44/72, acc_iter=172828, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:49, time_cost(all): 1 day, 15:59:33/1 day, 1:16:33, loss=0.396564493908399, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=2.2376669238659925, lr=0.16944624765683697
2023-12-07 02:55:30   INFO  epoch: 44/72, acc_iter=172878, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:11, time_cost(all): 1 day, 16:00:15/23:29:26, loss=0.396505296482458, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=2.1993484161398578, lr=0.16933228058146915
2023-12-07 02:56:12   INFO  epoch: 44/72, acc_iter=172928, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:27, time_cost(all): 1 day, 16:00:57/23:51:02, loss=0.396446099056517, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=1.477480212172438, lr=0.1692183135061014
2023-12-07 02:56:54   INFO  epoch: 44/72, acc_iter=172978, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:18, time_cost(all): 1 day, 16:01:39/1 day, 0:15:47, loss=0.396386901630576, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=3.9550519950805465, lr=0.16910434643073358
2023-12-07 02:57:35   INFO  epoch: 44/72, acc_iter=173028, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:38, time_cost(all): 1 day, 16:02:20/1 day, 0:57:34, loss=0.396327704204635, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=3.2571280748993456, lr=0.16899037935536576
2023-12-07 02:58:17   INFO  epoch: 44/72, acc_iter=173078, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:41, time_cost(all): 1 day, 16:03:02/1 day, 0:42:07, loss=0.396268506778695, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=3.1480589814472424, lr=0.168876412279998
2023-12-07 02:58:59   INFO  epoch: 44/72, acc_iter=173128, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:30, time_cost(all): 1 day, 16:03:44/23:46:27, loss=0.396209309352754, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.92(1.03), norm=4.557423666993846, lr=0.16876244520463018
2023-12-07 02:59:41   INFO  epoch: 44/72, acc_iter=173178, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:54, time_cost(all): 1 day, 16:04:26/1 day, 0:13:34, loss=0.396150111926813, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=0.6896095864018799, lr=0.16864847812926237
2023-12-07 03:00:22   INFO  epoch: 44/72, acc_iter=173228, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:04, time_cost(all): 1 day, 16:05:07/1 day, 1:17:54, loss=0.396090914500872, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=0.7160649488115541, lr=0.1685345110538946
2023-12-07 03:01:04   INFO  epoch: 44/72, acc_iter=173278, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:48, time_cost(all): 1 day, 16:05:49/1 day, 0:16:38, loss=0.396031717074931, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=0.5632140255669738, lr=0.16842054397852685
2023-12-07 03:01:46   INFO  epoch: 44/72, acc_iter=173328, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:13, time_cost(all): 1 day, 16:06:31/1 day, 1:37:24, loss=0.39597251964899, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=1.4481695200868798, lr=0.16830657690315898
2023-12-07 03:02:28   INFO  epoch: 44/72, acc_iter=173378, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:35, time_cost(all): 1 day, 16:07:13/1 day, 1:01:58, loss=0.395913322223049, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=1.6290714993214488, lr=0.16819260982779122
2023-12-07 03:03:10   INFO  epoch: 44/72, acc_iter=173428, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:12, time_cost(all): 1 day, 16:07:55/1 day, 0:41:39, loss=0.395854124797108, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=0.9320971584716614, lr=0.16807864275242346
2023-12-07 03:03:51   INFO  epoch: 44/72, acc_iter=173478, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:15, time_cost(all): 1 day, 16:08:36/1 day, 0:41:54, loss=0.395794927371167, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=0.8529172775069653, lr=0.16796467567705564
2023-12-07 03:04:33   INFO  epoch: 44/72, acc_iter=173528, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:42, time_cost(all): 1 day, 16:09:18/23:34:53, loss=0.395735729945226, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=2.9941779667089805, lr=0.16785070860168783
2023-12-07 03:05:15   INFO  epoch: 44/72, acc_iter=173578, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:48, time_cost(all): 1 day, 16:10:00/1 day, 0:22:00, loss=0.395676532519285, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=4.493162778573751, lr=0.16773674152632007
2023-12-07 03:05:57   INFO  epoch: 44/72, acc_iter=173628, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:19, time_cost(all): 1 day, 16:10:42/1 day, 0:37:59, loss=0.395617335093344, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=2.400845340523696, lr=0.16762277445095225
2023-12-07 03:06:38   INFO  epoch: 44/72, acc_iter=173678, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 1 day, 16:11:23/23:55:04, loss=0.395558137667403, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=1.2528380164356696, lr=0.1675088073755845
2023-12-07 03:07:20   INFO  epoch: 44/72, acc_iter=173728, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 1 day, 16:12:05/1 day, 1:13:41, loss=0.395498940241462, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=4.3285178730096465, lr=0.16739484030021662
2023-12-07 03:08:02   INFO  epoch: 44/72, acc_iter=173778, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 16:12:47/23:43:53, loss=0.395439742815521, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=4.52667125073659, lr=0.16728087322484886
2023-12-07 03:08:44   INFO  epoch: 45/72, acc_iter=173840, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:42, time_cost(all): 1 day, 16:13:29/1 day, 0:54:05, loss=0.395366338007355, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=3.226446460671285, lr=0.1671395540513928
2023-12-07 03:09:26   INFO  epoch: 45/72, acc_iter=173890, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:37, time_cost(all): 1 day, 16:14:11/1 day, 0:24:16, loss=0.395307140581414, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=1.81910843873428, lr=0.167025586976025
2023-12-07 03:10:07   INFO  epoch: 45/72, acc_iter=173940, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:35, time_cost(all): 1 day, 16:14:52/1 day, 0:26:07, loss=0.395247943155473, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.7202930661374558, lr=0.16691161990065723
2023-12-07 03:10:49   INFO  epoch: 45/72, acc_iter=173990, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:48:33, time_cost(all): 1 day, 16:15:34/1 day, 1:27:17, loss=0.395188745729532, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=3.1044758257720724, lr=0.16679765282528947
2023-12-07 03:11:31   INFO  epoch: 45/72, acc_iter=174040, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:08, time_cost(all): 1 day, 16:16:16/1 day, 0:28:41, loss=0.395129548303591, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=1.757190428847668, lr=0.1666836857499216
2023-12-07 03:12:13   INFO  epoch: 45/72, acc_iter=174090, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:54, time_cost(all): 1 day, 16:16:58/1 day, 0:57:57, loss=0.39507035087765, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=4.980543531207306, lr=0.16656971867455383
2023-12-07 03:12:54   INFO  epoch: 45/72, acc_iter=174140, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:51:09, time_cost(all): 1 day, 16:17:39/1 day, 1:25:30, loss=0.395011153451709, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=4.158592663823744, lr=0.16645575159918602
2023-12-07 03:13:36   INFO  epoch: 45/72, acc_iter=174190, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:57, time_cost(all): 1 day, 16:18:21/1 day, 1:12:47, loss=0.394951956025768, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=2.831015582436156, lr=0.16634178452381826
2023-12-07 03:14:18   INFO  epoch: 45/72, acc_iter=174240, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:38, time_cost(all): 1 day, 16:19:03/1 day, 0:43:41, loss=0.394892758599827, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=2.237774814727571, lr=0.16622781744845044
2023-12-07 03:15:00   INFO  epoch: 45/72, acc_iter=174290, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:47, time_cost(all): 1 day, 16:19:45/23:04:57, loss=0.394833561173886, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=4.139913811824443, lr=0.16611385037308263
2023-12-07 03:15:42   INFO  epoch: 45/72, acc_iter=174340, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:39, time_cost(all): 1 day, 16:20:27/23:15:25, loss=0.394774363747945, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=0.7322658110090787, lr=0.16599988329771487
2023-12-07 03:16:23   INFO  epoch: 45/72, acc_iter=174390, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:13, time_cost(all): 1 day, 16:21:08/23:15:56, loss=0.394715166322004, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.0(1.03), norm=3.6354280669444408, lr=0.16588591622234705
2023-12-07 03:17:05   INFO  epoch: 45/72, acc_iter=174440, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:40, time_cost(all): 1 day, 16:21:50/23:03:16, loss=0.394655968896063, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=2.829516380379408, lr=0.16577194914697924
2023-12-07 03:17:47   INFO  epoch: 45/72, acc_iter=174490, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:40, time_cost(all): 1 day, 16:22:32/23:32:59, loss=0.394596771470122, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=3.1397050970043408, lr=0.16565798207161148
2023-12-07 03:18:29   INFO  epoch: 45/72, acc_iter=174540, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:29, time_cost(all): 1 day, 16:23:14/23:55:11, loss=0.394537574044181, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=1.813690758220139, lr=0.16554401499624372
2023-12-07 03:19:11   INFO  epoch: 45/72, acc_iter=174590, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:32, time_cost(all): 1 day, 16:23:56/23:34:57, loss=0.39447837661824, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=1.39421546633482, lr=0.16543004792087584
2023-12-07 03:19:52   INFO  epoch: 45/72, acc_iter=174640, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:44, time_cost(all): 1 day, 16:24:37/23:55:20, loss=0.3944191791923, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=3.6924023028332984, lr=0.16531608084550808
2023-12-07 03:20:34   INFO  epoch: 45/72, acc_iter=174690, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:47, time_cost(all): 1 day, 16:25:19/23:39:22, loss=0.394359981766359, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=2.6715797497016895, lr=0.16520211377014032
2023-12-07 03:21:16   INFO  epoch: 45/72, acc_iter=174740, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:16, time_cost(all): 1 day, 16:26:01/1 day, 0:23:50, loss=0.394300784340418, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=0.6203347117739899, lr=0.1650881466947725
2023-12-07 03:21:58   INFO  epoch: 45/72, acc_iter=174790, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:32, time_cost(all): 1 day, 16:26:43/1 day, 1:13:13, loss=0.394241586914477, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=1.851292500597703, lr=0.1649741796194047
2023-12-07 03:22:39   INFO  epoch: 45/72, acc_iter=174840, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:00, time_cost(all): 1 day, 16:27:24/23:54:01, loss=0.394182389488536, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=4.9864772277906155, lr=0.16486021254403688
2023-12-07 03:23:21   INFO  epoch: 45/72, acc_iter=174890, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:49, time_cost(all): 1 day, 16:28:06/1 day, 0:47:33, loss=0.394123192062595, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.23(1.03), norm=2.5198012653405635, lr=0.16474624546866912
2023-12-07 03:24:03   INFO  epoch: 45/72, acc_iter=174940, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:23, time_cost(all): 1 day, 16:28:48/23:30:12, loss=0.394063994636654, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=4.51590999063975, lr=0.16463227839330136
2023-12-07 03:24:45   INFO  epoch: 45/72, acc_iter=174990, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:19, time_cost(all): 1 day, 16:29:30/22:56:58, loss=0.394004797210713, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=2.2901093128985384, lr=0.16451831131793349
2023-12-07 03:25:27   INFO  epoch: 45/72, acc_iter=175040, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:23, time_cost(all): 1 day, 16:30:12/1 day, 0:29:42, loss=0.393945599784772, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.785364177816694, lr=0.16440434424256573
2023-12-07 03:26:08   INFO  epoch: 45/72, acc_iter=175090, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:37:03, time_cost(all): 1 day, 16:30:53/1 day, 0:22:01, loss=0.393886402358831, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=2.163589788985634, lr=0.16429037716719797
2023-12-07 03:26:50   INFO  epoch: 45/72, acc_iter=175140, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:16, time_cost(all): 1 day, 16:31:35/22:59:54, loss=0.39382720493289, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=4.873090445868585, lr=0.16417641009183015
2023-12-07 03:27:32   INFO  epoch: 45/72, acc_iter=175190, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:29, time_cost(all): 1 day, 16:32:17/1 day, 0:55:04, loss=0.393768007506949, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=2.6377005233441193, lr=0.16406244301646233
2023-12-07 03:28:14   INFO  epoch: 45/72, acc_iter=175240, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:03, time_cost(all): 1 day, 16:32:59/23:43:57, loss=0.393708810081008, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=1.8127894770784831, lr=0.16394847594109457
2023-12-07 03:28:55   INFO  epoch: 45/72, acc_iter=175290, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:07, time_cost(all): 1 day, 16:33:40/1 day, 0:49:44, loss=0.393649612655067, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=1.7913949363630748, lr=0.16383450886572676
2023-12-07 03:29:37   INFO  epoch: 45/72, acc_iter=175340, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:56, time_cost(all): 1 day, 16:34:22/23:06:20, loss=0.393590415229126, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=0.7010278359362089, lr=0.16372054179035894
2023-12-07 03:30:19   INFO  epoch: 45/72, acc_iter=175390, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:45, time_cost(all): 1 day, 16:35:04/22:57:38, loss=0.393531217803185, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=2.8420321221508207, lr=0.16360657471499118
2023-12-07 03:31:01   INFO  epoch: 45/72, acc_iter=175440, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:55, time_cost(all): 1 day, 16:35:46/1 day, 0:36:40, loss=0.393472020377244, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=2.1611018830241857, lr=0.16349260763962337
2023-12-07 03:31:43   INFO  epoch: 45/72, acc_iter=175490, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:30, time_cost(all): 1 day, 16:36:28/1 day, 0:11:18, loss=0.393412822951304, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=4.905548305471535, lr=0.1633786405642556
2023-12-07 03:32:24   INFO  epoch: 45/72, acc_iter=175540, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:16, time_cost(all): 1 day, 16:37:09/23:04:04, loss=0.393353625525363, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=3.9810728878001753, lr=0.1632646734888878
2023-12-07 03:33:06   INFO  epoch: 45/72, acc_iter=175590, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:28, time_cost(all): 1 day, 16:37:51/1 day, 1:05:22, loss=0.393294428099422, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=1.400864875187025, lr=0.16315070641351997
2023-12-07 03:33:48   INFO  epoch: 45/72, acc_iter=175640, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:47, time_cost(all): 1 day, 16:38:33/23:07:30, loss=0.393235230673481, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=3.025271738384367, lr=0.16303673933815221
2023-12-07 03:34:30   INFO  epoch: 45/72, acc_iter=175690, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:23, time_cost(all): 1 day, 16:39:15/23:29:30, loss=0.39317603324754, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=2.1275772931759898, lr=0.1629227722627844
2023-12-07 03:35:11   INFO  epoch: 45/72, acc_iter=175740, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:36, time_cost(all): 1 day, 16:39:56/23:42:03, loss=0.393116835821599, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=3.286947716798986, lr=0.16280880518741658
2023-12-07 03:35:53   INFO  epoch: 45/72, acc_iter=175790, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:49, time_cost(all): 1 day, 16:40:38/1 day, 0:55:06, loss=0.393057638395658, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.2(1.03), norm=4.359161300118085, lr=0.16269483811204882
2023-12-07 03:36:35   INFO  epoch: 45/72, acc_iter=175840, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:20, time_cost(all): 1 day, 16:41:20/23:56:58, loss=0.392998440969717, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.166681636305079, lr=0.162580871036681
2023-12-07 03:37:17   INFO  epoch: 45/72, acc_iter=175890, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:28, time_cost(all): 1 day, 16:42:02/1 day, 0:58:34, loss=0.392939243543776, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=1.0273177325739073, lr=0.16246690396131325
2023-12-07 03:37:59   INFO  epoch: 45/72, acc_iter=175940, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:33, time_cost(all): 1 day, 16:42:44/23:12:18, loss=0.392880046117835, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.02(1.03), norm=4.595353726146725, lr=0.16235293688594543
2023-12-07 03:38:40   INFO  epoch: 45/72, acc_iter=175990, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:27, time_cost(all): 1 day, 16:43:25/22:52:22, loss=0.392820848691894, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=3.746295135670661, lr=0.16223896981057762
2023-12-07 03:39:22   INFO  epoch: 45/72, acc_iter=176040, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:39, time_cost(all): 1 day, 16:44:07/23:28:38, loss=0.392761651265953, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=1.6561863158234271, lr=0.16212500273520986
2023-12-07 03:40:04   INFO  epoch: 45/72, acc_iter=176090, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:22, time_cost(all): 1 day, 16:44:49/23:34:06, loss=0.392702453840012, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=1.3694913526812733, lr=0.16201103565984204
2023-12-07 03:40:46   INFO  epoch: 45/72, acc_iter=176140, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:25, time_cost(all): 1 day, 16:45:31/22:37:35, loss=0.392643256414071, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=2.6241807687637064, lr=0.16189706858447422
2023-12-07 03:41:27   INFO  epoch: 45/72, acc_iter=176190, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:07, time_cost(all): 1 day, 16:46:12/1 day, 0:24:40, loss=0.39258405898813, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.23(1.03), norm=0.9654461665244292, lr=0.16178310150910646
2023-12-07 03:42:09   INFO  epoch: 45/72, acc_iter=176240, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:47, time_cost(all): 1 day, 16:46:54/22:48:10, loss=0.392524861562189, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=3.7794181796203397, lr=0.1616691344337387
2023-12-07 03:42:51   INFO  epoch: 45/72, acc_iter=176290, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:01, time_cost(all): 1 day, 16:47:36/1 day, 0:49:01, loss=0.392465664136248, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=3.586476472303879, lr=0.16155516735837083
2023-12-07 03:43:33   INFO  epoch: 45/72, acc_iter=176340, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:59, time_cost(all): 1 day, 16:48:18/1 day, 0:06:16, loss=0.392406466710307, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=0.9100365166835147, lr=0.16144120028300307
2023-12-07 03:44:15   INFO  epoch: 45/72, acc_iter=176390, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:38, time_cost(all): 1 day, 16:49:00/23:50:13, loss=0.392347269284367, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=2.6603427569119016, lr=0.1613272332076353
2023-12-07 03:44:56   INFO  epoch: 45/72, acc_iter=176440, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:24, time_cost(all): 1 day, 16:49:41/23:26:49, loss=0.392288071858426, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=1.7904415230791935, lr=0.1612132661322675
2023-12-07 03:45:38   INFO  epoch: 45/72, acc_iter=176490, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:29, time_cost(all): 1 day, 16:50:23/1 day, 0:17:07, loss=0.392228874432485, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=2.128666679287937, lr=0.16109929905689968
2023-12-07 03:46:20   INFO  epoch: 45/72, acc_iter=176540, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:29, time_cost(all): 1 day, 16:51:05/23:50:38, loss=0.392169677006544, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=3.886475171403272, lr=0.16098533198153187
2023-12-07 03:47:02   INFO  epoch: 45/72, acc_iter=176590, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:12, time_cost(all): 1 day, 16:51:47/22:44:07, loss=0.392110479580603, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=3.4790271638807253, lr=0.1608713649061641
2023-12-07 03:47:43   INFO  epoch: 45/72, acc_iter=176640, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:25, time_cost(all): 1 day, 16:52:28/1 day, 0:36:40, loss=0.392051282154662, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=1.9724883634483716, lr=0.16075739783079634
2023-12-07 03:48:25   INFO  epoch: 45/72, acc_iter=176690, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:10, time_cost(all): 1 day, 16:53:10/23:06:11, loss=0.391992084728721, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=1.0976435019229895, lr=0.16064343075542847
2023-12-07 03:49:07   INFO  epoch: 45/72, acc_iter=176740, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:29, time_cost(all): 1 day, 16:53:52/23:24:14, loss=0.39193288730278, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=3.9393090036779275, lr=0.1605294636800607
2023-12-07 03:49:49   INFO  epoch: 45/72, acc_iter=176790, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:15, time_cost(all): 1 day, 16:54:34/1 day, 0:22:01, loss=0.391873689876839, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=2.528461385396848, lr=0.16041549660469295
2023-12-07 03:50:31   INFO  epoch: 45/72, acc_iter=176840, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:25, time_cost(all): 1 day, 16:55:16/23:01:51, loss=0.391814492450898, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=1.712400060153117, lr=0.16030152952932514
2023-12-07 03:51:12   INFO  epoch: 45/72, acc_iter=176890, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:57, time_cost(all): 1 day, 16:55:57/1 day, 0:10:07, loss=0.391755295024957, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=3.1493234117546174, lr=0.16018756245395732
2023-12-07 03:51:54   INFO  epoch: 45/72, acc_iter=176940, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:58, time_cost(all): 1 day, 16:56:39/23:57:04, loss=0.391696097599016, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=3.4126333822501915, lr=0.16007359537858956
2023-12-07 03:52:36   INFO  epoch: 45/72, acc_iter=176990, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:00, time_cost(all): 1 day, 16:57:21/1 day, 0:05:15, loss=0.391636900173075, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=1.9133362630046884, lr=0.15995962830322175
2023-12-07 03:53:18   INFO  epoch: 45/72, acc_iter=177040, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:51, time_cost(all): 1 day, 16:58:03/1 day, 0:44:09, loss=0.391577702747134, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=3.294693834507493, lr=0.15984566122785393
2023-12-07 03:54:00   INFO  epoch: 45/72, acc_iter=177090, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:00, time_cost(all): 1 day, 16:58:45/23:27:50, loss=0.391518505321193, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=0.5209791574648615, lr=0.15973169415248617
2023-12-07 03:54:41   INFO  epoch: 45/72, acc_iter=177140, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:51, time_cost(all): 1 day, 16:59:26/22:52:36, loss=0.391459307895252, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=3.3730078219035544, lr=0.15961772707711835
2023-12-07 03:55:23   INFO  epoch: 45/72, acc_iter=177190, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:17, time_cost(all): 1 day, 17:00:08/23:59:57, loss=0.391400110469311, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=4.660976994337765, lr=0.1595037600017506
2023-12-07 03:56:05   INFO  epoch: 45/72, acc_iter=177240, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:34, time_cost(all): 1 day, 17:00:50/23:09:52, loss=0.391340913043371, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=4.711674824895254, lr=0.15938979292638272
2023-12-07 03:56:47   INFO  epoch: 45/72, acc_iter=177290, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:07, time_cost(all): 1 day, 17:01:32/1 day, 0:11:21, loss=0.39128171561743, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=3.024958800370612, lr=0.15927582585101496
2023-12-07 03:57:28   INFO  epoch: 45/72, acc_iter=177340, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:27, time_cost(all): 1 day, 17:02:13/23:25:33, loss=0.391222518191489, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=1.4841187781311853, lr=0.1591618587756472
2023-12-07 03:58:10   INFO  epoch: 45/72, acc_iter=177390, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:28, time_cost(all): 1 day, 17:02:55/1 day, 0:11:52, loss=0.391163320765548, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=4.82607703019422, lr=0.1590478917002794
2023-12-07 03:58:52   INFO  epoch: 45/72, acc_iter=177440, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:49, time_cost(all): 1 day, 17:03:37/1 day, 0:07:13, loss=0.391104123339607, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=2.681343285154006, lr=0.15893392462491157
2023-12-07 03:59:34   INFO  epoch: 45/72, acc_iter=177490, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:18, time_cost(all): 1 day, 17:04:19/22:27:25, loss=0.391044925913666, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=0.886495440025042, lr=0.1588199575495438
2023-12-07 04:00:16   INFO  epoch: 45/72, acc_iter=177540, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:35, time_cost(all): 1 day, 17:05:01/23:00:16, loss=0.390985728487725, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=3.202386974993097, lr=0.158705990474176
2023-12-07 04:00:57   INFO  epoch: 45/72, acc_iter=177590, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 1 day, 17:05:42/23:55:21, loss=0.390926531061784, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=4.1200727274646525, lr=0.15859202339880824
2023-12-07 04:01:39   INFO  epoch: 45/72, acc_iter=177640, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 17:06:24/22:44:18, loss=0.390867333635843, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=1.7585658764295262, lr=0.15847805632344042
2023-12-07 04:02:21   INFO  epoch: 46/72, acc_iter=177702, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:30, time_cost(all): 1 day, 17:07:06/23:38:38, loss=0.390793928827676, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=1.6189312691564912, lr=0.15833673714998436
2023-12-07 04:03:03   INFO  epoch: 46/72, acc_iter=177752, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:54, time_cost(all): 1 day, 17:07:48/22:27:24, loss=0.390734731401735, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=4.1764817597288335, lr=0.15822277007461655
2023-12-07 04:03:44   INFO  epoch: 46/72, acc_iter=177802, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:00, time_cost(all): 1 day, 17:08:29/22:59:26, loss=0.390675533975794, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=4.352746568709458, lr=0.15810880299924873
2023-12-07 04:04:26   INFO  epoch: 46/72, acc_iter=177852, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:48:30, time_cost(all): 1 day, 17:09:11/23:26:14, loss=0.390616336549853, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=3.372052905363855, lr=0.15799483592388097
2023-12-07 04:05:08   INFO  epoch: 46/72, acc_iter=177902, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:47:59, time_cost(all): 1 day, 17:09:53/22:41:31, loss=0.390557139123912, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=1.8193860827490844, lr=0.1578808688485132
2023-12-07 04:05:50   INFO  epoch: 46/72, acc_iter=177952, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:44, time_cost(all): 1 day, 17:10:35/23:05:22, loss=0.390497941697972, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=3.3840151448268085, lr=0.15776690177314534
2023-12-07 04:06:32   INFO  epoch: 46/72, acc_iter=178002, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:10, time_cost(all): 1 day, 17:11:17/22:45:25, loss=0.390438744272031, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=1.0413058196982812, lr=0.15765293469777758
2023-12-07 04:07:13   INFO  epoch: 46/72, acc_iter=178052, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:16, time_cost(all): 1 day, 17:11:58/1 day, 0:13:28, loss=0.39037954684609, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=1.4193164049736569, lr=0.15753896762240982
2023-12-07 04:07:55   INFO  epoch: 46/72, acc_iter=178102, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:17, time_cost(all): 1 day, 17:12:40/23:20:56, loss=0.390320349420149, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=4.28039796805238, lr=0.157425000547042
2023-12-07 04:08:37   INFO  epoch: 46/72, acc_iter=178152, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:46, time_cost(all): 1 day, 17:13:22/23:26:13, loss=0.390261151994208, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=4.116579561557163, lr=0.1573110334716742
2023-12-07 04:09:19   INFO  epoch: 46/72, acc_iter=178202, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:40, time_cost(all): 1 day, 17:14:04/23:28:29, loss=0.390201954568267, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=3.4706953490531403, lr=0.15719706639630643
2023-12-07 04:10:00   INFO  epoch: 46/72, acc_iter=178252, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:34, time_cost(all): 1 day, 17:14:45/23:19:28, loss=0.390142757142326, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=4.4715054137585, lr=0.1570830993209386
2023-12-07 04:10:42   INFO  epoch: 46/72, acc_iter=178302, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:44, time_cost(all): 1 day, 17:15:27/23:53:35, loss=0.390083559716385, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=3.579379377740883, lr=0.1569691322455708
2023-12-07 04:11:24   INFO  epoch: 46/72, acc_iter=178352, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:21, time_cost(all): 1 day, 17:16:09/23:37:07, loss=0.390024362290444, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=1.4279236026565099, lr=0.15685516517020304
2023-12-07 04:12:06   INFO  epoch: 46/72, acc_iter=178402, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:14, time_cost(all): 1 day, 17:16:51/23:00:56, loss=0.389965164864503, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=3.840593301439102, lr=0.15674119809483522
2023-12-07 04:12:48   INFO  epoch: 46/72, acc_iter=178452, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:08, time_cost(all): 1 day, 17:17:33/22:37:22, loss=0.389905967438562, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=2.0170708625568263, lr=0.15662723101946746
2023-12-07 04:13:29   INFO  epoch: 46/72, acc_iter=178502, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:39:53, time_cost(all): 1 day, 17:18:14/23:23:49, loss=0.389846770012621, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=3.1483896897019585, lr=0.1565132639440996
2023-12-07 04:14:11   INFO  epoch: 46/72, acc_iter=178552, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:29, time_cost(all): 1 day, 17:18:56/1 day, 0:10:51, loss=0.38978757258668, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=1.6515556138253762, lr=0.15639929686873183
2023-12-07 04:14:53   INFO  epoch: 46/72, acc_iter=178602, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:42:00, time_cost(all): 1 day, 17:19:38/22:23:21, loss=0.389728375160739, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=1.6246921730485266, lr=0.15628532979336407
2023-12-07 04:15:35   INFO  epoch: 46/72, acc_iter=178652, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:54, time_cost(all): 1 day, 17:20:20/23:04:01, loss=0.389669177734798, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.623774310455342, lr=0.15617136271799625
2023-12-07 04:16:16   INFO  epoch: 46/72, acc_iter=178702, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:49, time_cost(all): 1 day, 17:21:01/23:22:57, loss=0.389609980308857, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=0.66224657262807, lr=0.15605739564262844
2023-12-07 04:16:58   INFO  epoch: 46/72, acc_iter=178752, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:48, time_cost(all): 1 day, 17:21:43/23:48:59, loss=0.389550782882916, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=4.156466192587519, lr=0.15594342856726068
2023-12-07 04:17:40   INFO  epoch: 46/72, acc_iter=178802, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:34, time_cost(all): 1 day, 17:22:25/22:33:57, loss=0.389491585456976, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=2.368344408553864, lr=0.15582946149189286
2023-12-07 04:18:22   INFO  epoch: 46/72, acc_iter=178852, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:22, time_cost(all): 1 day, 17:23:07/23:32:21, loss=0.389432388031035, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=1.9895551016846034, lr=0.1557154944165251
2023-12-07 04:19:04   INFO  epoch: 46/72, acc_iter=178902, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:34, time_cost(all): 1 day, 17:23:49/23:23:52, loss=0.389373190605094, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=3.8508753942351266, lr=0.1556015273411573
2023-12-07 04:19:45   INFO  epoch: 46/72, acc_iter=178952, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:47, time_cost(all): 1 day, 17:24:30/22:36:02, loss=0.389313993179153, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=4.619975384153321, lr=0.15548756026578947
2023-12-07 04:20:27   INFO  epoch: 46/72, acc_iter=179002, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:34, time_cost(all): 1 day, 17:25:12/22:43:18, loss=0.389254795753212, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=1.7126876557787039, lr=0.1553735931904217
2023-12-07 04:21:09   INFO  epoch: 46/72, acc_iter=179052, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:42, time_cost(all): 1 day, 17:25:54/22:02:34, loss=0.389195598327271, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=4.480202947442478, lr=0.1552596261150539
2023-12-07 04:21:51   INFO  epoch: 46/72, acc_iter=179102, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:44, time_cost(all): 1 day, 17:26:36/22:43:12, loss=0.38913640090133, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=4.604071958390599, lr=0.15514565903968608
2023-12-07 04:22:32   INFO  epoch: 46/72, acc_iter=179152, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:28, time_cost(all): 1 day, 17:27:17/1 day, 0:12:00, loss=0.389077203475389, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=2.6634304219379183, lr=0.15503169196431832
2023-12-07 04:23:14   INFO  epoch: 46/72, acc_iter=179202, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:59, time_cost(all): 1 day, 17:27:59/22:37:23, loss=0.389018006049448, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=4.20450747092472, lr=0.15491772488895056
2023-12-07 04:23:56   INFO  epoch: 46/72, acc_iter=179252, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:04, time_cost(all): 1 day, 17:28:41/22:12:01, loss=0.388958808623507, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=3.7009713687292076, lr=0.1548037578135827
2023-12-07 04:24:38   INFO  epoch: 46/72, acc_iter=179302, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:55, time_cost(all): 1 day, 17:29:23/23:04:25, loss=0.388899611197566, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=1.9116036326407948, lr=0.15468979073821493
2023-12-07 04:25:20   INFO  epoch: 46/72, acc_iter=179352, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:49, time_cost(all): 1 day, 17:30:05/22:31:17, loss=0.388840413771625, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=0.6508663189063348, lr=0.1545758236628471
2023-12-07 04:26:01   INFO  epoch: 46/72, acc_iter=179402, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:43, time_cost(all): 1 day, 17:30:46/21:57:28, loss=0.388781216345684, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=3.882186813493314, lr=0.15446185658747935
2023-12-07 04:26:43   INFO  epoch: 46/72, acc_iter=179452, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:51, time_cost(all): 1 day, 17:31:28/22:03:10, loss=0.388722018919743, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=1.0444927043057524, lr=0.15434788951211154
2023-12-07 04:27:25   INFO  epoch: 46/72, acc_iter=179502, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:49, time_cost(all): 1 day, 17:32:10/23:36:12, loss=0.388662821493802, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=1.5114131568062747, lr=0.15423392243674372
2023-12-07 04:28:07   INFO  epoch: 46/72, acc_iter=179552, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:13, time_cost(all): 1 day, 17:32:52/1 day, 0:02:09, loss=0.388603624067861, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=2.6768605904511094, lr=0.15411995536137596
2023-12-07 04:28:49   INFO  epoch: 46/72, acc_iter=179602, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:15, time_cost(all): 1 day, 17:33:34/22:04:17, loss=0.38854442664192, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=4.934207094057427, lr=0.1540059882860082
2023-12-07 04:29:30   INFO  epoch: 46/72, acc_iter=179652, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:02, time_cost(all): 1 day, 17:34:15/22:29:33, loss=0.38848522921598, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=0.7410767452738525, lr=0.15389202121064033
2023-12-07 04:30:12   INFO  epoch: 46/72, acc_iter=179702, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:22, time_cost(all): 1 day, 17:34:57/22:07:06, loss=0.388426031790039, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=1.3214436545814157, lr=0.15377805413527257
2023-12-07 04:30:54   INFO  epoch: 46/72, acc_iter=179752, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:44, time_cost(all): 1 day, 17:35:39/22:26:10, loss=0.388366834364098, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=2.6532252278293837, lr=0.1536640870599048
2023-12-07 04:31:36   INFO  epoch: 46/72, acc_iter=179802, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:30, time_cost(all): 1 day, 17:36:21/23:06:57, loss=0.388307636938157, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=1.6707497181485742, lr=0.153550119984537
2023-12-07 04:32:17   INFO  epoch: 46/72, acc_iter=179852, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:18, time_cost(all): 1 day, 17:37:02/23:25:54, loss=0.388248439512216, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=2.3966559505621334, lr=0.15343615290916918
2023-12-07 04:32:59   INFO  epoch: 46/72, acc_iter=179902, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:10, time_cost(all): 1 day, 17:37:44/23:11:28, loss=0.388189242086275, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=4.651671627326629, lr=0.15332218583380142
2023-12-07 04:33:41   INFO  epoch: 46/72, acc_iter=179952, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:37, time_cost(all): 1 day, 17:38:26/22:07:16, loss=0.388130044660334, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=2.1316673495750216, lr=0.1532082187584336
2023-12-07 04:34:23   INFO  epoch: 46/72, acc_iter=180002, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:12, time_cost(all): 1 day, 17:39:08/23:16:00, loss=0.388070847234393, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=2.284198956101137, lr=0.15309425168306579
2023-12-07 04:35:05   INFO  epoch: 46/72, acc_iter=180052, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:27, time_cost(all): 1 day, 17:39:50/23:58:51, loss=0.388011649808452, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=3.9660632051955047, lr=0.15298028460769797
2023-12-07 04:35:46   INFO  epoch: 46/72, acc_iter=180102, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:29, time_cost(all): 1 day, 17:40:31/23:40:21, loss=0.387952452382511, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=2.5956793580577115, lr=0.1528663175323302
2023-12-07 04:36:28   INFO  epoch: 46/72, acc_iter=180152, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:37, time_cost(all): 1 day, 17:41:13/23:14:50, loss=0.38789325495657, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=0.9399225366502292, lr=0.15275235045696245
2023-12-07 04:37:10   INFO  epoch: 46/72, acc_iter=180202, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:02, time_cost(all): 1 day, 17:41:55/22:19:29, loss=0.387834057530629, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=0.8706356740357233, lr=0.15263838338159458
2023-12-07 04:37:52   INFO  epoch: 46/72, acc_iter=180252, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:51, time_cost(all): 1 day, 17:42:37/21:44:39, loss=0.387774860104688, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=0.973998770873141, lr=0.15252441630622682
2023-12-07 04:38:33   INFO  epoch: 46/72, acc_iter=180302, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:02, time_cost(all): 1 day, 17:43:18/21:56:45, loss=0.387715662678747, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=1.956536147385832, lr=0.15241044923085906
2023-12-07 04:39:15   INFO  epoch: 46/72, acc_iter=180352, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:19, time_cost(all): 1 day, 17:44:00/22:11:22, loss=0.387656465252806, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=4.207712891054236, lr=0.15229648215549124
2023-12-07 04:39:57   INFO  epoch: 46/72, acc_iter=180402, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:07, time_cost(all): 1 day, 17:44:42/22:16:11, loss=0.387597267826865, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=3.3792859121943875, lr=0.15218251508012343
2023-12-07 04:40:39   INFO  epoch: 46/72, acc_iter=180452, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:27, time_cost(all): 1 day, 17:45:24/22:57:13, loss=0.387538070400924, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=0.5211216649556802, lr=0.15206854800475567
2023-12-07 04:41:21   INFO  epoch: 46/72, acc_iter=180502, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:42, time_cost(all): 1 day, 17:46:06/23:06:35, loss=0.387478872974984, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=3.5711263596459073, lr=0.15195458092938785
2023-12-07 04:42:02   INFO  epoch: 46/72, acc_iter=180552, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:16, time_cost(all): 1 day, 17:46:47/22:52:44, loss=0.387419675549043, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=3.855852055012629, lr=0.1518406138540201
2023-12-07 04:42:44   INFO  epoch: 46/72, acc_iter=180602, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:08, time_cost(all): 1 day, 17:47:29/22:09:05, loss=0.387360478123102, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=2.458140293085802, lr=0.15172664677865227
2023-12-07 04:43:26   INFO  epoch: 46/72, acc_iter=180652, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:39, time_cost(all): 1 day, 17:48:11/21:40:51, loss=0.387301280697161, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=1.2436932811271404, lr=0.15161267970328446
2023-12-07 04:44:08   INFO  epoch: 46/72, acc_iter=180702, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:49, time_cost(all): 1 day, 17:48:53/22:09:35, loss=0.38724208327122, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=2.44877229874824, lr=0.1514987126279167
2023-12-07 04:44:49   INFO  epoch: 46/72, acc_iter=180752, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:57, time_cost(all): 1 day, 17:49:34/23:26:21, loss=0.387182885845279, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=2.0316821189723644, lr=0.15138474555254888
2023-12-07 04:45:31   INFO  epoch: 46/72, acc_iter=180802, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:59, time_cost(all): 1 day, 17:50:16/22:24:36, loss=0.387123688419338, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=3.3549606690687033, lr=0.15127077847718107
2023-12-07 04:46:13   INFO  epoch: 46/72, acc_iter=180852, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:32, time_cost(all): 1 day, 17:50:58/22:26:44, loss=0.387064490993397, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=4.014712466200473, lr=0.1511568114018133
2023-12-07 04:46:55   INFO  epoch: 46/72, acc_iter=180902, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:38, time_cost(all): 1 day, 17:51:40/22:59:03, loss=0.387005293567456, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=2.6903298219091836, lr=0.1510428443264455
2023-12-07 04:47:37   INFO  epoch: 46/72, acc_iter=180952, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:46, time_cost(all): 1 day, 17:52:22/23:27:32, loss=0.386946096141515, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=0.6185811247976915, lr=0.15092887725107768
2023-12-07 04:48:18   INFO  epoch: 46/72, acc_iter=181002, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:28, time_cost(all): 1 day, 17:53:03/22:46:46, loss=0.386886898715574, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=0.6676417955624334, lr=0.15081491017570992
2023-12-07 04:49:00   INFO  epoch: 46/72, acc_iter=181052, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:41, time_cost(all): 1 day, 17:53:45/21:36:26, loss=0.386827701289633, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=2.918353786180908, lr=0.1507009431003421
2023-12-07 04:49:42   INFO  epoch: 46/72, acc_iter=181102, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:47, time_cost(all): 1 day, 17:54:27/22:17:32, loss=0.386768503863692, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=1.478566931601521, lr=0.15058697602497434
2023-12-07 04:50:24   INFO  epoch: 46/72, acc_iter=181152, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:09, time_cost(all): 1 day, 17:55:09/23:37:35, loss=0.386709306437751, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=0.772118763502712, lr=0.15047300894960652
2023-12-07 04:51:05   INFO  epoch: 46/72, acc_iter=181202, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:24, time_cost(all): 1 day, 17:55:50/22:03:19, loss=0.38665010901181, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=2.761300491349499, lr=0.1503590418742387
2023-12-07 04:51:47   INFO  epoch: 46/72, acc_iter=181252, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:33, time_cost(all): 1 day, 17:56:32/22:51:01, loss=0.386590911585869, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.7581639672321523, lr=0.15024507479887095
2023-12-07 04:52:29   INFO  epoch: 46/72, acc_iter=181302, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:03, time_cost(all): 1 day, 17:57:14/22:38:57, loss=0.386531714159928, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=4.026875008497129, lr=0.1501311077235032
2023-12-07 04:53:11   INFO  epoch: 46/72, acc_iter=181352, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:17, time_cost(all): 1 day, 17:57:56/22:58:33, loss=0.386472516733988, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=4.545749291991023, lr=0.15001714064813532
2023-12-07 04:53:53   INFO  epoch: 46/72, acc_iter=181402, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:33, time_cost(all): 1 day, 17:58:38/22:32:34, loss=0.386413319308047, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=2.403196014782355, lr=0.14990317357276756
2023-12-07 04:54:34   INFO  epoch: 46/72, acc_iter=181452, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 1 day, 17:59:19/23:02:11, loss=0.386354121882106, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=1.245159808310783, lr=0.1497892064973998
2023-12-07 04:55:16   INFO  epoch: 46/72, acc_iter=181502, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 18:00:01/23:39:20, loss=0.386294924456165, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=4.032234475680429, lr=0.14967523942203198
2023-12-07 04:55:58   INFO  epoch: 47/72, acc_iter=181564, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:51, time_cost(all): 1 day, 18:00:43/21:40:01, loss=0.386221519647998, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=0.6555418809666878, lr=0.14953392024857592
2023-12-07 04:56:40   INFO  epoch: 47/72, acc_iter=181614, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:43, time_cost(all): 1 day, 18:01:25/23:03:27, loss=0.386162322222057, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=3.586480937344417, lr=0.1494199531732081
2023-12-07 04:57:21   INFO  epoch: 47/72, acc_iter=181664, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:21, time_cost(all): 1 day, 18:02:06/22:37:51, loss=0.386103124796116, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=0.720574438117811, lr=0.1493059860978403
2023-12-07 04:58:03   INFO  epoch: 47/72, acc_iter=181714, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:08, time_cost(all): 1 day, 18:02:48/21:59:09, loss=0.386043927370175, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=3.687905570716734, lr=0.14919201902247253
2023-12-07 04:58:45   INFO  epoch: 47/72, acc_iter=181764, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:18, time_cost(all): 1 day, 18:03:30/23:12:40, loss=0.385984729944234, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=4.389498687047496, lr=0.14907805194710472
2023-12-07 04:59:27   INFO  epoch: 47/72, acc_iter=181814, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:37, time_cost(all): 1 day, 18:04:12/21:55:35, loss=0.385925532518293, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=4.12431898680777, lr=0.14896408487173696
2023-12-07 05:00:09   INFO  epoch: 47/72, acc_iter=181864, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:33, time_cost(all): 1 day, 18:04:54/21:23:03, loss=0.385866335092352, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=4.428239060538061, lr=0.14885011779636914
2023-12-07 05:00:50   INFO  epoch: 47/72, acc_iter=181914, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:41, time_cost(all): 1 day, 18:05:35/23:29:23, loss=0.385807137666411, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=4.829993322896348, lr=0.14873615072100133
2023-12-07 05:01:32   INFO  epoch: 47/72, acc_iter=181964, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:59, time_cost(all): 1 day, 18:06:17/22:36:09, loss=0.38574794024047, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=3.3096831023918307, lr=0.14862218364563357
2023-12-07 05:02:14   INFO  epoch: 47/72, acc_iter=182014, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:27, time_cost(all): 1 day, 18:06:59/22:06:11, loss=0.385688742814529, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=2.465583865205745, lr=0.14850821657026575
2023-12-07 05:02:56   INFO  epoch: 47/72, acc_iter=182064, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:28, time_cost(all): 1 day, 18:07:41/23:21:27, loss=0.385629545388589, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=0.8723300721010399, lr=0.14839424949489793
2023-12-07 05:03:37   INFO  epoch: 47/72, acc_iter=182114, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:32, time_cost(all): 1 day, 18:08:22/23:08:42, loss=0.385570347962648, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.6474849204309931, lr=0.14828028241953017
2023-12-07 05:04:19   INFO  epoch: 47/72, acc_iter=182164, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:57, time_cost(all): 1 day, 18:09:04/21:46:32, loss=0.385511150536707, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.901444318881292, lr=0.14816631534416236
2023-12-07 05:05:01   INFO  epoch: 47/72, acc_iter=182214, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:10, time_cost(all): 1 day, 18:09:46/21:15:50, loss=0.385451953110766, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=1.2197933987211333, lr=0.14805234826879454
2023-12-07 05:05:43   INFO  epoch: 47/72, acc_iter=182264, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:54, time_cost(all): 1 day, 18:10:28/21:46:50, loss=0.385392755684825, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=1.0851550357812845, lr=0.14793838119342678
2023-12-07 05:06:25   INFO  epoch: 47/72, acc_iter=182314, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:53, time_cost(all): 1 day, 18:11:10/21:36:19, loss=0.385333558258884, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=2.6801395709588833, lr=0.14782441411805897
2023-12-07 05:07:06   INFO  epoch: 47/72, acc_iter=182364, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:15, time_cost(all): 1 day, 18:11:51/23:07:49, loss=0.385274360832943, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=2.144277792751036, lr=0.1477104470426912
2023-12-07 05:07:48   INFO  epoch: 47/72, acc_iter=182414, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:04, time_cost(all): 1 day, 18:12:33/21:48:02, loss=0.385215163407002, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=1.2141785961561238, lr=0.1475964799673234
2023-12-07 05:08:30   INFO  epoch: 47/72, acc_iter=182464, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:41, time_cost(all): 1 day, 18:13:15/21:58:48, loss=0.385155965981061, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=2.741192616262912, lr=0.14748251289195558
2023-12-07 05:09:12   INFO  epoch: 47/72, acc_iter=182514, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:37:52, time_cost(all): 1 day, 18:13:57/23:16:29, loss=0.38509676855512, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=4.771574086840989, lr=0.14736854581658781
2023-12-07 05:09:54   INFO  epoch: 47/72, acc_iter=182564, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:25, time_cost(all): 1 day, 18:14:39/21:18:51, loss=0.385037571129179, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=3.968393622207184, lr=0.14725457874122005
2023-12-07 05:10:35   INFO  epoch: 47/72, acc_iter=182614, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:46, time_cost(all): 1 day, 18:15:20/23:12:52, loss=0.384978373703238, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.9(1.03), norm=4.8909446938735535, lr=0.14714061166585218
2023-12-07 05:11:17   INFO  epoch: 47/72, acc_iter=182664, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:02, time_cost(all): 1 day, 18:16:02/22:48:26, loss=0.384919176277297, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=3.415193593834226, lr=0.14702664459048442
2023-12-07 05:11:59   INFO  epoch: 47/72, acc_iter=182714, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:20, time_cost(all): 1 day, 18:16:44/21:10:13, loss=0.384859978851356, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=3.8110756798843153, lr=0.14691267751511666
2023-12-07 05:12:41   INFO  epoch: 47/72, acc_iter=182764, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:00, time_cost(all): 1 day, 18:17:26/21:53:16, loss=0.384800781425415, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=2.2307676214012355, lr=0.14679871043974885
2023-12-07 05:13:22   INFO  epoch: 47/72, acc_iter=182814, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:06, time_cost(all): 1 day, 18:18:07/22:55:50, loss=0.384741583999474, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=4.931851285059598, lr=0.14668474336438103
2023-12-07 05:14:04   INFO  epoch: 47/72, acc_iter=182864, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:55, time_cost(all): 1 day, 18:18:49/21:17:21, loss=0.384682386573533, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=1.8716320208464057, lr=0.14657077628901327
2023-12-07 05:14:46   INFO  epoch: 47/72, acc_iter=182914, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:04, time_cost(all): 1 day, 18:19:31/21:34:04, loss=0.384623189147593, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=3.310499260728055, lr=0.14645680921364546
2023-12-07 05:15:28   INFO  epoch: 47/72, acc_iter=182964, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:35:10, time_cost(all): 1 day, 18:20:13/22:51:23, loss=0.384563991721652, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=3.2567318667822223, lr=0.14634284213827764
2023-12-07 05:16:10   INFO  epoch: 47/72, acc_iter=183014, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:31, time_cost(all): 1 day, 18:20:55/22:35:38, loss=0.384504794295711, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=3.5279297389886044, lr=0.14622887506290982
2023-12-07 05:16:51   INFO  epoch: 47/72, acc_iter=183064, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:22, time_cost(all): 1 day, 18:21:36/21:48:54, loss=0.38444559686977, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=3.055760518338067, lr=0.14611490798754206
2023-12-07 05:17:33   INFO  epoch: 47/72, acc_iter=183114, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:46, time_cost(all): 1 day, 18:22:18/22:46:26, loss=0.384386399443829, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=4.800047694865896, lr=0.1460009409121743
2023-12-07 05:18:15   INFO  epoch: 47/72, acc_iter=183164, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:17, time_cost(all): 1 day, 18:23:00/22:26:22, loss=0.384327202017888, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=2.0876292539347174, lr=0.14588697383680643
2023-12-07 05:18:57   INFO  epoch: 47/72, acc_iter=183214, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:36, time_cost(all): 1 day, 18:23:42/22:12:57, loss=0.384268004591947, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=1.0929929680727724, lr=0.14577300676143867
2023-12-07 05:19:38   INFO  epoch: 47/72, acc_iter=183264, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:00, time_cost(all): 1 day, 18:24:23/21:42:42, loss=0.384208807166006, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=3.2334221532633367, lr=0.1456590396860709
2023-12-07 05:20:20   INFO  epoch: 47/72, acc_iter=183314, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:57, time_cost(all): 1 day, 18:25:05/21:44:33, loss=0.384149609740065, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=1.445473266542784, lr=0.1455450726107031
2023-12-07 05:21:02   INFO  epoch: 47/72, acc_iter=183364, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:20, time_cost(all): 1 day, 18:25:47/21:09:12, loss=0.384090412314124, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=2.172795424108822, lr=0.14543110553533528
2023-12-07 05:21:44   INFO  epoch: 47/72, acc_iter=183414, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:05, time_cost(all): 1 day, 18:26:29/21:17:23, loss=0.384031214888183, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=3.536816619270811, lr=0.14531713845996752
2023-12-07 05:22:26   INFO  epoch: 47/72, acc_iter=183464, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:32, time_cost(all): 1 day, 18:27:11/21:55:38, loss=0.383972017462242, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=4.696564190614343, lr=0.1452031713845997
2023-12-07 05:23:07   INFO  epoch: 47/72, acc_iter=183514, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:12, time_cost(all): 1 day, 18:27:52/21:26:53, loss=0.383912820036301, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=2.5518534885434674, lr=0.14508920430923195
2023-12-07 05:23:49   INFO  epoch: 47/72, acc_iter=183564, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:40, time_cost(all): 1 day, 18:28:34/21:28:00, loss=0.38385362261036, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=0.6000556613468817, lr=0.14497523723386413
2023-12-07 05:24:31   INFO  epoch: 47/72, acc_iter=183614, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:27, time_cost(all): 1 day, 18:29:16/20:59:53, loss=0.383794425184419, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=2.4874663210497947, lr=0.1448612701584963
2023-12-07 05:25:13   INFO  epoch: 47/72, acc_iter=183664, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:53, time_cost(all): 1 day, 18:29:58/21:18:38, loss=0.383735227758478, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=3.541175126292436, lr=0.14474730308312855
2023-12-07 05:25:54   INFO  epoch: 47/72, acc_iter=183714, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:20, time_cost(all): 1 day, 18:30:39/21:40:27, loss=0.383676030332537, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=0.8396303582939817, lr=0.14463333600776074
2023-12-07 05:26:36   INFO  epoch: 47/72, acc_iter=183764, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:53, time_cost(all): 1 day, 18:31:21/23:00:55, loss=0.383616832906597, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=1.6993616401315925, lr=0.14451936893239292
2023-12-07 05:27:18   INFO  epoch: 47/72, acc_iter=183814, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:56, time_cost(all): 1 day, 18:32:03/22:12:51, loss=0.383557635480656, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=3.5477959595524995, lr=0.14440540185702516
2023-12-07 05:28:00   INFO  epoch: 47/72, acc_iter=183864, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:29, time_cost(all): 1 day, 18:32:45/21:17:11, loss=0.383498438054715, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=2.1773092118624486, lr=0.14429143478165735
2023-12-07 05:28:42   INFO  epoch: 47/72, acc_iter=183914, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:04, time_cost(all): 1 day, 18:33:27/21:59:05, loss=0.383439240628774, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=1.008996592775547, lr=0.14417746770628953
2023-12-07 05:29:23   INFO  epoch: 47/72, acc_iter=183964, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:31, time_cost(all): 1 day, 18:34:08/22:36:28, loss=0.383380043202833, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=3.8243989358638615, lr=0.14406350063092177
2023-12-07 05:30:05   INFO  epoch: 47/72, acc_iter=184014, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:25, time_cost(all): 1 day, 18:34:50/22:43:25, loss=0.383320845776892, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=0.9567023006987903, lr=0.14394953355555395
2023-12-07 05:30:47   INFO  epoch: 47/72, acc_iter=184064, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:37, time_cost(all): 1 day, 18:35:32/22:22:12, loss=0.383261648350951, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=2.774735584405047, lr=0.1438355664801862
2023-12-07 05:31:29   INFO  epoch: 47/72, acc_iter=184114, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:51, time_cost(all): 1 day, 18:36:14/22:46:10, loss=0.38320245092501, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=1.3592734083554947, lr=0.14372159940481838
2023-12-07 05:32:10   INFO  epoch: 47/72, acc_iter=184164, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:29, time_cost(all): 1 day, 18:36:55/21:00:06, loss=0.383143253499069, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=4.960398219020499, lr=0.14360763232945056
2023-12-07 05:32:52   INFO  epoch: 47/72, acc_iter=184214, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:46, time_cost(all): 1 day, 18:37:37/21:18:28, loss=0.383084056073128, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=2.8001142626609057, lr=0.1434936652540828
2023-12-07 05:33:34   INFO  epoch: 47/72, acc_iter=184264, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:26, time_cost(all): 1 day, 18:38:19/22:37:39, loss=0.383024858647187, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=2.0893985593799034, lr=0.14337969817871504
2023-12-07 05:34:16   INFO  epoch: 47/72, acc_iter=184314, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:20, time_cost(all): 1 day, 18:39:01/22:00:50, loss=0.382965661221246, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=3.337580131674186, lr=0.14326573110334717
2023-12-07 05:34:58   INFO  epoch: 47/72, acc_iter=184364, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:51, time_cost(all): 1 day, 18:39:43/22:19:19, loss=0.382906463795305, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=3.348077328792731, lr=0.1431517640279794
2023-12-07 05:35:39   INFO  epoch: 47/72, acc_iter=184414, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:06, time_cost(all): 1 day, 18:40:24/22:46:23, loss=0.382847266369364, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=4.687863972639079, lr=0.14303779695261165
2023-12-07 05:36:21   INFO  epoch: 47/72, acc_iter=184464, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:13, time_cost(all): 1 day, 18:41:06/21:00:20, loss=0.382788068943423, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=2.125401269032362, lr=0.14292382987724384
2023-12-07 05:37:03   INFO  epoch: 47/72, acc_iter=184514, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:41, time_cost(all): 1 day, 18:41:48/21:39:20, loss=0.382728871517482, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.2(1.03), norm=4.53703449272796, lr=0.14280986280187602
2023-12-07 05:37:45   INFO  epoch: 47/72, acc_iter=184564, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:42, time_cost(all): 1 day, 18:42:30/21:15:02, loss=0.382669674091542, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=0.9065583412748739, lr=0.1426958957265082
2023-12-07 05:38:26   INFO  epoch: 47/72, acc_iter=184614, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:46, time_cost(all): 1 day, 18:43:11/21:16:14, loss=0.382610476665601, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=3.6540751320379483, lr=0.14258192865114044
2023-12-07 05:39:08   INFO  epoch: 47/72, acc_iter=184664, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:02, time_cost(all): 1 day, 18:43:53/20:59:50, loss=0.38255127923966, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=0.9149722509312471, lr=0.14246796157577263
2023-12-07 05:39:50   INFO  epoch: 47/72, acc_iter=184714, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:12, time_cost(all): 1 day, 18:44:35/22:20:16, loss=0.382492081813719, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=0.8822664390664503, lr=0.1423539945004048
2023-12-07 05:40:32   INFO  epoch: 47/72, acc_iter=184764, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:43, time_cost(all): 1 day, 18:45:17/21:27:08, loss=0.382432884387778, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=1.4438543955871166, lr=0.14224002742503705
2023-12-07 05:41:14   INFO  epoch: 47/72, acc_iter=184814, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:35, time_cost(all): 1 day, 18:45:59/22:38:26, loss=0.382373686961837, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=3.1187216171739913, lr=0.1421260603496693
2023-12-07 05:41:55   INFO  epoch: 47/72, acc_iter=184864, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:18, time_cost(all): 1 day, 18:46:40/22:11:56, loss=0.382314489535896, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=2.5709612629717453, lr=0.14201209327430142
2023-12-07 05:42:37   INFO  epoch: 47/72, acc_iter=184914, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:11, time_cost(all): 1 day, 18:47:22/21:10:09, loss=0.382255292109955, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=0.8100723019826639, lr=0.14189812619893366
2023-12-07 05:43:19   INFO  epoch: 47/72, acc_iter=184964, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:59, time_cost(all): 1 day, 18:48:04/21:53:27, loss=0.382196094684014, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=0.9700299808940399, lr=0.1417841591235659
2023-12-07 05:44:01   INFO  epoch: 47/72, acc_iter=185014, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:04, time_cost(all): 1 day, 18:48:46/21:56:33, loss=0.382136897258073, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=2.695288771732527, lr=0.14167019204819808
2023-12-07 05:44:43   INFO  epoch: 47/72, acc_iter=185064, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:28, time_cost(all): 1 day, 18:49:28/22:07:02, loss=0.382077699832132, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=1.843890083584452, lr=0.14155622497283027
2023-12-07 05:45:24   INFO  epoch: 47/72, acc_iter=185114, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:38, time_cost(all): 1 day, 18:50:09/22:31:08, loss=0.382018502406191, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=4.1518130913463125, lr=0.1414422578974625
2023-12-07 05:46:06   INFO  epoch: 47/72, acc_iter=185164, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:51, time_cost(all): 1 day, 18:50:51/22:36:07, loss=0.38195930498025, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=3.224782541030125, lr=0.1413282908220947
2023-12-07 05:46:48   INFO  epoch: 47/72, acc_iter=185214, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:11, time_cost(all): 1 day, 18:51:33/20:40:35, loss=0.381900107554309, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=0.6732242166416536, lr=0.14121432374672693
2023-12-07 05:47:30   INFO  epoch: 47/72, acc_iter=185264, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 1 day, 18:52:15/21:22:51, loss=0.381840910128368, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=1.2936371712924217, lr=0.14110035667135912
2023-12-07 05:48:11   INFO  epoch: 47/72, acc_iter=185314, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 1 day, 18:52:56/20:57:21, loss=0.381781712702427, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=1.6465497538068374, lr=0.1409863895959913
2023-12-07 05:48:53   INFO  epoch: 47/72, acc_iter=185364, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 18:53:38/22:33:28, loss=0.381722515276486, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=3.160704101945788, lr=0.14087242252062354
2023-12-07 05:49:35   INFO  epoch: 48/72, acc_iter=185426, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:50, time_cost(all): 1 day, 18:54:20/22:14:36, loss=0.38164911046832, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=3.020246121478788, lr=0.14073110334716743
2023-12-07 05:50:17   INFO  epoch: 48/72, acc_iter=185476, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:53, time_cost(all): 1 day, 18:55:02/22:31:21, loss=0.381589913042379, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=2.906101573235948, lr=0.14061713627179967
2023-12-07 05:50:59   INFO  epoch: 48/72, acc_iter=185526, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:32, time_cost(all): 1 day, 18:55:44/22:09:50, loss=0.381530715616438, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.2871627407366617, lr=0.1405031691964319
2023-12-07 05:51:40   INFO  epoch: 48/72, acc_iter=185576, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:02, time_cost(all): 1 day, 18:56:25/22:11:16, loss=0.381471518190497, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=4.588685353071818, lr=0.14038920212106404
2023-12-07 05:52:22   INFO  epoch: 48/72, acc_iter=185626, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:26, time_cost(all): 1 day, 18:57:07/22:34:14, loss=0.381412320764556, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=3.795299169292634, lr=0.14027523504569628
2023-12-07 05:53:04   INFO  epoch: 48/72, acc_iter=185676, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:08, time_cost(all): 1 day, 18:57:49/22:06:17, loss=0.381353123338615, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=0.8274392701240527, lr=0.14016126797032852
2023-12-07 05:53:46   INFO  epoch: 48/72, acc_iter=185726, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:56, time_cost(all): 1 day, 18:58:31/21:00:01, loss=0.381293925912674, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=2.899818495798358, lr=0.1400473008949607
2023-12-07 05:54:27   INFO  epoch: 48/72, acc_iter=185776, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:03, time_cost(all): 1 day, 18:59:12/21:26:54, loss=0.381234728486733, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=3.0728230215641643, lr=0.1399333338195929
2023-12-07 05:55:09   INFO  epoch: 48/72, acc_iter=185826, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:52, time_cost(all): 1 day, 18:59:54/21:47:35, loss=0.381175531060792, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=1.3250770769006222, lr=0.13981936674422507
2023-12-07 05:55:51   INFO  epoch: 48/72, acc_iter=185876, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:13, time_cost(all): 1 day, 19:00:36/21:04:35, loss=0.381116333634851, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=4.448510942183924, lr=0.1397053996688573
2023-12-07 05:56:33   INFO  epoch: 48/72, acc_iter=185926, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:03, time_cost(all): 1 day, 19:01:18/21:37:22, loss=0.38105713620891, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=1.5305222641989389, lr=0.1395914325934895
2023-12-07 05:57:15   INFO  epoch: 48/72, acc_iter=185976, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:36, time_cost(all): 1 day, 19:02:00/21:47:37, loss=0.380997938782969, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.5745557404148032, lr=0.13947746551812168
2023-12-07 05:57:56   INFO  epoch: 48/72, acc_iter=186026, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:40, time_cost(all): 1 day, 19:02:41/20:33:31, loss=0.380938741357028, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=2.3678319488731607, lr=0.13936349844275392
2023-12-07 05:58:38   INFO  epoch: 48/72, acc_iter=186076, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:33, time_cost(all): 1 day, 19:03:23/22:15:08, loss=0.380879543931087, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=1.610067048189124, lr=0.13924953136738616
2023-12-07 05:59:20   INFO  epoch: 48/72, acc_iter=186126, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:57, time_cost(all): 1 day, 19:04:05/21:02:09, loss=0.380820346505146, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=4.4326412970489, lr=0.1391355642920183
2023-12-07 06:00:02   INFO  epoch: 48/72, acc_iter=186176, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:28, time_cost(all): 1 day, 19:04:47/22:27:41, loss=0.380761149079206, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=4.828905829625312, lr=0.13902159721665053
2023-12-07 06:00:43   INFO  epoch: 48/72, acc_iter=186226, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:42, time_cost(all): 1 day, 19:05:28/21:39:18, loss=0.380701951653265, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=4.679519743651196, lr=0.13890763014128277
2023-12-07 06:01:25   INFO  epoch: 48/72, acc_iter=186276, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:17, time_cost(all): 1 day, 19:06:10/20:51:47, loss=0.380642754227324, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=1.4631212691051931, lr=0.13879366306591495
2023-12-07 06:02:07   INFO  epoch: 48/72, acc_iter=186326, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:59, time_cost(all): 1 day, 19:06:52/22:12:48, loss=0.380583556801383, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=3.7508662267086557, lr=0.13867969599054714
2023-12-07 06:02:49   INFO  epoch: 48/72, acc_iter=186376, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:28, time_cost(all): 1 day, 19:07:34/21:15:50, loss=0.380524359375442, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=3.893202465598194, lr=0.13856572891517938
2023-12-07 06:03:31   INFO  epoch: 48/72, acc_iter=186426, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:24, time_cost(all): 1 day, 19:08:16/20:39:15, loss=0.380465161949501, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=4.281418578470084, lr=0.13845176183981156
2023-12-07 06:04:12   INFO  epoch: 48/72, acc_iter=186476, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:25, time_cost(all): 1 day, 19:08:57/22:15:08, loss=0.38040596452356, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=4.434257164729404, lr=0.1383377947644438
2023-12-07 06:04:54   INFO  epoch: 48/72, acc_iter=186526, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:13, time_cost(all): 1 day, 19:09:39/22:16:09, loss=0.380346767097619, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.728963691469718, lr=0.13822382768907593
2023-12-07 06:05:36   INFO  epoch: 48/72, acc_iter=186576, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:59, time_cost(all): 1 day, 19:10:21/21:54:30, loss=0.380287569671678, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=4.340082826818231, lr=0.13810986061370817
2023-12-07 06:06:18   INFO  epoch: 48/72, acc_iter=186626, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:23, time_cost(all): 1 day, 19:11:03/20:30:16, loss=0.380228372245737, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=4.186767585799552, lr=0.1379958935383404
2023-12-07 06:06:59   INFO  epoch: 48/72, acc_iter=186676, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:58, time_cost(all): 1 day, 19:11:44/21:16:24, loss=0.380169174819796, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=1.1943421620313983, lr=0.1378819264629726
2023-12-07 06:07:41   INFO  epoch: 48/72, acc_iter=186726, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:42, time_cost(all): 1 day, 19:12:26/21:46:54, loss=0.380109977393855, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.86(1.03), norm=2.3970769059708212, lr=0.13776795938760478
2023-12-07 06:08:23   INFO  epoch: 48/72, acc_iter=186776, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:09, time_cost(all): 1 day, 19:13:08/20:44:42, loss=0.380050779967914, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=3.3648645064214766, lr=0.13765399231223702
2023-12-07 06:09:05   INFO  epoch: 48/72, acc_iter=186826, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:24, time_cost(all): 1 day, 19:13:50/20:49:07, loss=0.379991582541973, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=1.911352570645135, lr=0.1375400252368692
2023-12-07 06:09:47   INFO  epoch: 48/72, acc_iter=186876, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:23, time_cost(all): 1 day, 19:14:32/20:17:33, loss=0.379932385116032, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=2.4920971599525803, lr=0.13742605816150139
2023-12-07 06:10:28   INFO  epoch: 48/72, acc_iter=186926, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:38, time_cost(all): 1 day, 19:15:13/21:38:40, loss=0.379873187690091, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=2.120757526427969, lr=0.13731209108613363
2023-12-07 06:11:10   INFO  epoch: 48/72, acc_iter=186976, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:52, time_cost(all): 1 day, 19:15:55/20:28:49, loss=0.37981399026415, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=1.3033228005446087, lr=0.1371981240107658
2023-12-07 06:11:52   INFO  epoch: 48/72, acc_iter=187026, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:20, time_cost(all): 1 day, 19:16:37/21:02:18, loss=0.37975479283821, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=1.1650845264796805, lr=0.13708415693539805
2023-12-07 06:12:34   INFO  epoch: 48/72, acc_iter=187076, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:30, time_cost(all): 1 day, 19:17:19/20:35:20, loss=0.379695595412269, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=3.7776048693094846, lr=0.13697018986003023
2023-12-07 06:13:15   INFO  epoch: 48/72, acc_iter=187126, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:28, time_cost(all): 1 day, 19:18:00/20:25:09, loss=0.379636397986328, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=4.00684454395674, lr=0.13685622278466242
2023-12-07 06:13:57   INFO  epoch: 48/72, acc_iter=187176, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:48, time_cost(all): 1 day, 19:18:42/20:47:17, loss=0.379577200560387, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=2.570980752842547, lr=0.13674225570929466
2023-12-07 06:14:39   INFO  epoch: 48/72, acc_iter=187226, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:48, time_cost(all): 1 day, 19:19:24/21:35:10, loss=0.379518003134446, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=1.595181491008051, lr=0.1366282886339269
2023-12-07 06:15:21   INFO  epoch: 48/72, acc_iter=187276, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:24, time_cost(all): 1 day, 19:20:06/21:45:45, loss=0.379458805708505, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=0.5453590755233186, lr=0.13651432155855903
2023-12-07 06:16:03   INFO  epoch: 48/72, acc_iter=187326, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:41, time_cost(all): 1 day, 19:20:48/20:52:51, loss=0.379399608282564, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=3.627425149805802, lr=0.13640035448319127
2023-12-07 06:16:44   INFO  epoch: 48/72, acc_iter=187376, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:16, time_cost(all): 1 day, 19:21:29/22:05:57, loss=0.379340410856623, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=4.462764479482873, lr=0.13628638740782345
2023-12-07 06:17:26   INFO  epoch: 48/72, acc_iter=187426, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:03, time_cost(all): 1 day, 19:22:11/21:12:06, loss=0.379281213430682, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=4.873297646299733, lr=0.1361724203324557
2023-12-07 06:18:08   INFO  epoch: 48/72, acc_iter=187476, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:32, time_cost(all): 1 day, 19:22:53/20:27:32, loss=0.379222016004741, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=2.8642429403190928, lr=0.13605845325708787
2023-12-07 06:18:50   INFO  epoch: 48/72, acc_iter=187526, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:26, time_cost(all): 1 day, 19:23:35/20:28:05, loss=0.3791628185788, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=3.535323628059071, lr=0.13594448618172006
2023-12-07 06:19:32   INFO  epoch: 48/72, acc_iter=187576, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:00, time_cost(all): 1 day, 19:24:17/22:01:58, loss=0.379103621152859, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=4.50028380470145, lr=0.1358305191063523
2023-12-07 06:20:13   INFO  epoch: 48/72, acc_iter=187626, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:40, time_cost(all): 1 day, 19:24:58/21:28:32, loss=0.379044423726918, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=4.19721649982975, lr=0.13571655203098448
2023-12-07 06:20:55   INFO  epoch: 48/72, acc_iter=187676, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:11, time_cost(all): 1 day, 19:25:40/20:44:18, loss=0.378985226300977, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=1.8813820970676693, lr=0.13560258495561667
2023-12-07 06:21:37   INFO  epoch: 48/72, acc_iter=187726, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:58, time_cost(all): 1 day, 19:26:22/20:16:12, loss=0.378926028875036, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=2.275138901306059, lr=0.1354886178802489
2023-12-07 06:22:19   INFO  epoch: 48/72, acc_iter=187776, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:29, time_cost(all): 1 day, 19:27:04/21:12:51, loss=0.378866831449095, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=0.7224299405497506, lr=0.13537465080488115
2023-12-07 06:23:00   INFO  epoch: 48/72, acc_iter=187826, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:09, time_cost(all): 1 day, 19:27:45/20:54:19, loss=0.378807634023154, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=1.233722237021146, lr=0.13526068372951328
2023-12-07 06:23:42   INFO  epoch: 48/72, acc_iter=187876, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:24, time_cost(all): 1 day, 19:28:27/21:59:14, loss=0.378748436597214, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=3.8208902109728275, lr=0.13514671665414552
2023-12-07 06:24:24   INFO  epoch: 48/72, acc_iter=187926, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:11, time_cost(all): 1 day, 19:29:09/20:24:53, loss=0.378689239171273, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=2.4694664106735407, lr=0.13503274957877776
2023-12-07 06:25:06   INFO  epoch: 48/72, acc_iter=187976, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:12, time_cost(all): 1 day, 19:29:51/20:36:52, loss=0.378630041745332, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=2.519258131425846, lr=0.13491878250340994
2023-12-07 06:25:48   INFO  epoch: 48/72, acc_iter=188026, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:37, time_cost(all): 1 day, 19:30:33/20:04:41, loss=0.378570844319391, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=2.1630155384847654, lr=0.13480481542804212
2023-12-07 06:26:29   INFO  epoch: 48/72, acc_iter=188076, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:34, time_cost(all): 1 day, 19:31:14/21:36:26, loss=0.37851164689345, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=3.4650949548013887, lr=0.13469084835267436
2023-12-07 06:27:11   INFO  epoch: 48/72, acc_iter=188126, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:53, time_cost(all): 1 day, 19:31:56/20:13:10, loss=0.378452449467509, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=1.2360944575707102, lr=0.13457688127730655
2023-12-07 06:27:53   INFO  epoch: 48/72, acc_iter=188176, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:34, time_cost(all): 1 day, 19:32:38/21:56:26, loss=0.378393252041568, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=2.3962847226629287, lr=0.1344629142019388
2023-12-07 06:28:35   INFO  epoch: 48/72, acc_iter=188226, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:37, time_cost(all): 1 day, 19:33:20/20:21:56, loss=0.378334054615627, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=0.5058923931087168, lr=0.13434894712657092
2023-12-07 06:29:16   INFO  epoch: 48/72, acc_iter=188276, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:45, time_cost(all): 1 day, 19:34:01/21:33:06, loss=0.378274857189686, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=2.982679273297077, lr=0.13423498005120316
2023-12-07 06:29:58   INFO  epoch: 48/72, acc_iter=188326, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:22, time_cost(all): 1 day, 19:34:43/21:22:09, loss=0.378215659763745, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=1.6327311800821163, lr=0.1341210129758354
2023-12-07 06:30:40   INFO  epoch: 48/72, acc_iter=188376, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:53, time_cost(all): 1 day, 19:35:25/20:27:29, loss=0.378156462337804, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=3.087313598873428, lr=0.13400704590046758
2023-12-07 06:31:22   INFO  epoch: 48/72, acc_iter=188426, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:32, time_cost(all): 1 day, 19:36:07/21:49:08, loss=0.378097264911863, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=1.2153814075289202, lr=0.13389307882509976
2023-12-07 06:32:04   INFO  epoch: 48/72, acc_iter=188476, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:43, time_cost(all): 1 day, 19:36:49/20:25:19, loss=0.378038067485922, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=4.745978270446104, lr=0.133779111749732
2023-12-07 06:32:45   INFO  epoch: 48/72, acc_iter=188526, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:17, time_cost(all): 1 day, 19:37:30/21:10:25, loss=0.377978870059981, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=3.015738671224096, lr=0.1336651446743642
2023-12-07 06:33:27   INFO  epoch: 48/72, acc_iter=188576, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:11, time_cost(all): 1 day, 19:38:12/21:47:22, loss=0.37791967263404, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=2.3317740124760244, lr=0.13355117759899637
2023-12-07 06:34:09   INFO  epoch: 48/72, acc_iter=188626, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:35, time_cost(all): 1 day, 19:38:54/21:20:51, loss=0.377860475208099, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=4.770037565102849, lr=0.1334372105236286
2023-12-07 06:34:51   INFO  epoch: 48/72, acc_iter=188676, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:34, time_cost(all): 1 day, 19:39:36/19:50:30, loss=0.377801277782158, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=0.9478685098349531, lr=0.1333232434482608
2023-12-07 06:35:32   INFO  epoch: 48/72, acc_iter=188726, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:09, time_cost(all): 1 day, 19:40:17/21:21:42, loss=0.377742080356218, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=4.9362037812762365, lr=0.13320927637289304
2023-12-07 06:36:14   INFO  epoch: 48/72, acc_iter=188776, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:23, time_cost(all): 1 day, 19:40:59/20:36:17, loss=0.377682882930277, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=0.6116390001146624, lr=0.13309530929752522
2023-12-07 06:36:56   INFO  epoch: 48/72, acc_iter=188826, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:35, time_cost(all): 1 day, 19:41:41/20:38:43, loss=0.377623685504336, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=3.257125276392388, lr=0.1329813422221574
2023-12-07 06:37:38   INFO  epoch: 48/72, acc_iter=188876, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:07, time_cost(all): 1 day, 19:42:23/21:50:24, loss=0.377564488078395, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=0.699780122796305, lr=0.13286737514678965
2023-12-07 06:38:20   INFO  epoch: 48/72, acc_iter=188926, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:33, time_cost(all): 1 day, 19:43:05/20:37:22, loss=0.377505290652454, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=4.212458409922855, lr=0.13275340807142183
2023-12-07 06:39:01   INFO  epoch: 48/72, acc_iter=188976, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:44, time_cost(all): 1 day, 19:43:46/21:43:58, loss=0.377446093226513, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=1.240404664201991, lr=0.13263944099605401
2023-12-07 06:39:43   INFO  epoch: 48/72, acc_iter=189026, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:53, time_cost(all): 1 day, 19:44:28/19:54:42, loss=0.377386895800572, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.345539712548134, lr=0.13252547392068625
2023-12-07 06:40:25   INFO  epoch: 48/72, acc_iter=189076, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:13, time_cost(all): 1 day, 19:45:10/20:40:19, loss=0.377327698374631, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=0.9257381187130261, lr=0.13241150684531844
2023-12-07 06:41:07   INFO  epoch: 48/72, acc_iter=189126, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:32, time_cost(all): 1 day, 19:45:52/20:44:38, loss=0.37726850094869, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=4.54890437314091, lr=0.13229753976995068
2023-12-07 06:41:48   INFO  epoch: 48/72, acc_iter=189176, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 1 day, 19:46:33/20:32:18, loss=0.377209303522749, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=4.5283944070510085, lr=0.13218357269458286
2023-12-07 06:42:30   INFO  epoch: 48/72, acc_iter=189226, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 19:47:15/20:05:09, loss=0.377150106096808, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=0.6119864607845805, lr=0.13206960561921505
2023-12-07 06:43:12   INFO  epoch: 49/72, acc_iter=189288, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:50, time_cost(all): 1 day, 19:47:57/20:53:34, loss=0.377076701288641, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=4.90325882350968, lr=0.131928286445759
2023-12-07 06:43:54   INFO  epoch: 49/72, acc_iter=189338, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:25, time_cost(all): 1 day, 19:48:39/20:03:57, loss=0.3770175038627, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=1.2207401628894385, lr=0.13181431937039118
2023-12-07 06:44:36   INFO  epoch: 49/72, acc_iter=189388, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:23, time_cost(all): 1 day, 19:49:21/20:11:53, loss=0.376958306436759, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=0.5834073811787686, lr=0.13170035229502342
2023-12-07 06:45:17   INFO  epoch: 49/72, acc_iter=189438, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:54, time_cost(all): 1 day, 19:50:02/21:34:21, loss=0.376899109010819, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=0.7282291058011998, lr=0.13158638521965565
2023-12-07 06:45:59   INFO  epoch: 49/72, acc_iter=189488, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:55, time_cost(all): 1 day, 19:50:44/21:42:19, loss=0.376839911584878, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.23(1.03), norm=3.730040555835121, lr=0.13147241814428778
2023-12-07 06:46:41   INFO  epoch: 49/72, acc_iter=189538, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:03, time_cost(all): 1 day, 19:51:26/20:04:55, loss=0.376780714158937, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=2.8787966319955256, lr=0.13135845106892002
2023-12-07 06:47:23   INFO  epoch: 49/72, acc_iter=189588, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:25, time_cost(all): 1 day, 19:52:08/20:08:59, loss=0.376721516732996, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=2.5885951371420113, lr=0.13124448399355226
2023-12-07 06:48:04   INFO  epoch: 49/72, acc_iter=189638, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:30, time_cost(all): 1 day, 19:52:49/20:43:31, loss=0.376662319307055, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.83(1.03), norm=2.4801327594022995, lr=0.13113051691818445
2023-12-07 06:48:46   INFO  epoch: 49/72, acc_iter=189688, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:45, time_cost(all): 1 day, 19:53:31/19:57:09, loss=0.376603121881114, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=4.589470266401219, lr=0.13101654984281663
2023-12-07 06:49:28   INFO  epoch: 49/72, acc_iter=189738, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:36, time_cost(all): 1 day, 19:54:13/20:05:41, loss=0.376543924455173, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=4.452105140287022, lr=0.13090258276744887
2023-12-07 06:50:10   INFO  epoch: 49/72, acc_iter=189788, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:11, time_cost(all): 1 day, 19:54:55/20:21:34, loss=0.376484727029232, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=3.6294576107229823, lr=0.13078861569208106
2023-12-07 06:50:52   INFO  epoch: 49/72, acc_iter=189838, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:09, time_cost(all): 1 day, 19:55:37/21:34:43, loss=0.376425529603291, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=0.5657695310454016, lr=0.13067464861671324
2023-12-07 06:51:33   INFO  epoch: 49/72, acc_iter=189888, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:45, time_cost(all): 1 day, 19:56:18/20:59:57, loss=0.37636633217735, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=2.9183424737939907, lr=0.13056068154134548
2023-12-07 06:52:15   INFO  epoch: 49/72, acc_iter=189938, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:32, time_cost(all): 1 day, 19:57:00/20:15:45, loss=0.376307134751409, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=4.894760538997493, lr=0.13044671446597766
2023-12-07 06:52:57   INFO  epoch: 49/72, acc_iter=189988, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:11, time_cost(all): 1 day, 19:57:42/21:16:48, loss=0.376247937325468, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=1.459203954396798, lr=0.1303327473906099
2023-12-07 06:53:39   INFO  epoch: 49/72, acc_iter=190038, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:46, time_cost(all): 1 day, 19:58:24/21:14:09, loss=0.376188739899527, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=0.5142893667657217, lr=0.1302187803152421
2023-12-07 06:54:21   INFO  epoch: 49/72, acc_iter=190088, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:40, time_cost(all): 1 day, 19:59:06/19:32:15, loss=0.376129542473586, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=3.9165512265734845, lr=0.13010481323987427
2023-12-07 06:55:02   INFO  epoch: 49/72, acc_iter=190138, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:57, time_cost(all): 1 day, 19:59:47/19:47:53, loss=0.376070345047645, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=4.394549108686492, lr=0.1299908461645065
2023-12-07 06:55:44   INFO  epoch: 49/72, acc_iter=190188, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:32, time_cost(all): 1 day, 20:00:29/19:35:27, loss=0.376011147621704, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=2.5530074653294386, lr=0.1298768790891387
2023-12-07 06:56:26   INFO  epoch: 49/72, acc_iter=190238, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:56, time_cost(all): 1 day, 20:01:11/21:02:38, loss=0.375951950195763, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=4.324456470092095, lr=0.12976291201377088
2023-12-07 06:57:08   INFO  epoch: 49/72, acc_iter=190288, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:45, time_cost(all): 1 day, 20:01:53/21:18:58, loss=0.375892752769823, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=1.6768996790799073, lr=0.12964894493840312
2023-12-07 06:57:49   INFO  epoch: 49/72, acc_iter=190338, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:47, time_cost(all): 1 day, 20:02:34/20:04:29, loss=0.375833555343882, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=4.223455586596016, lr=0.1295349778630353
2023-12-07 06:58:31   INFO  epoch: 49/72, acc_iter=190388, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:34, time_cost(all): 1 day, 20:03:16/19:32:54, loss=0.375774357917941, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=1.1816403601707939, lr=0.12942101078766755
2023-12-07 06:59:13   INFO  epoch: 49/72, acc_iter=190438, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:37, time_cost(all): 1 day, 20:03:58/20:26:38, loss=0.375715160492, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=2.944303895726508, lr=0.12930704371229973
2023-12-07 06:59:55   INFO  epoch: 49/72, acc_iter=190488, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:43, time_cost(all): 1 day, 20:04:40/20:08:16, loss=0.375655963066059, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=4.835234515904386, lr=0.12919307663693191
2023-12-07 07:00:37   INFO  epoch: 49/72, acc_iter=190538, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:09, time_cost(all): 1 day, 20:05:22/20:58:17, loss=0.375596765640118, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=1.3002952164273371, lr=0.12907910956156415
2023-12-07 07:01:18   INFO  epoch: 49/72, acc_iter=190588, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:09, time_cost(all): 1 day, 20:06:03/20:00:01, loss=0.375537568214177, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=2.438419833563443, lr=0.12896514248619634
2023-12-07 07:02:00   INFO  epoch: 49/72, acc_iter=190638, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:58, time_cost(all): 1 day, 20:06:45/19:32:35, loss=0.375478370788236, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=3.2679189047446724, lr=0.12885117541082852
2023-12-07 07:02:42   INFO  epoch: 49/72, acc_iter=190688, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:33, time_cost(all): 1 day, 20:07:27/20:22:45, loss=0.375419173362295, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.23(1.03), norm=3.3476955907599333, lr=0.12873720833546076
2023-12-07 07:03:24   INFO  epoch: 49/72, acc_iter=190738, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:39, time_cost(all): 1 day, 20:08:09/21:20:46, loss=0.375359975936354, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=4.283093578614095, lr=0.128623241260093
2023-12-07 07:04:05   INFO  epoch: 49/72, acc_iter=190788, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:17, time_cost(all): 1 day, 20:08:50/20:08:00, loss=0.375300778510413, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=4.4114681284325235, lr=0.12850927418472513
2023-12-07 07:04:47   INFO  epoch: 49/72, acc_iter=190838, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:10, time_cost(all): 1 day, 20:09:32/21:21:08, loss=0.375241581084472, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=3.604970765292382, lr=0.12839530710935737
2023-12-07 07:05:29   INFO  epoch: 49/72, acc_iter=190888, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:58, time_cost(all): 1 day, 20:10:14/21:18:44, loss=0.375182383658531, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=4.6465445273277, lr=0.1282813400339896
2023-12-07 07:06:11   INFO  epoch: 49/72, acc_iter=190938, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:05, time_cost(all): 1 day, 20:10:56/20:32:40, loss=0.37512318623259, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=1.7685744737399534, lr=0.1281673729586218
2023-12-07 07:06:53   INFO  epoch: 49/72, acc_iter=190988, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:22, time_cost(all): 1 day, 20:11:38/20:31:49, loss=0.375063988806649, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=1.301678780385803, lr=0.12805340588325398
2023-12-07 07:07:34   INFO  epoch: 49/72, acc_iter=191038, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:33, time_cost(all): 1 day, 20:12:19/20:13:26, loss=0.375004791380708, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=0.8866326459723851, lr=0.12793943880788616
2023-12-07 07:08:16   INFO  epoch: 49/72, acc_iter=191088, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:11, time_cost(all): 1 day, 20:13:01/21:08:31, loss=0.374945593954767, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.885094392631137, lr=0.1278254717325184
2023-12-07 07:08:58   INFO  epoch: 49/72, acc_iter=191138, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:37, time_cost(all): 1 day, 20:13:43/19:25:54, loss=0.374886396528827, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=2.0381348731534144, lr=0.12771150465715064
2023-12-07 07:09:40   INFO  epoch: 49/72, acc_iter=191188, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:38, time_cost(all): 1 day, 20:14:25/20:31:29, loss=0.374827199102886, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=0.6110725149128042, lr=0.12759753758178277
2023-12-07 07:10:21   INFO  epoch: 49/72, acc_iter=191238, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:22, time_cost(all): 1 day, 20:15:06/20:19:52, loss=0.374768001676945, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=2.2884207330299864, lr=0.127483570506415
2023-12-07 07:11:03   INFO  epoch: 49/72, acc_iter=191288, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:33, time_cost(all): 1 day, 20:15:48/19:18:31, loss=0.374708804251004, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=1.7396171926427149, lr=0.12736960343104725
2023-12-07 07:11:45   INFO  epoch: 49/72, acc_iter=191338, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:07, time_cost(all): 1 day, 20:16:30/19:52:19, loss=0.374649606825063, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=0.5565855468163272, lr=0.12725563635567944
2023-12-07 07:12:27   INFO  epoch: 49/72, acc_iter=191388, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:30, time_cost(all): 1 day, 20:17:12/20:48:50, loss=0.374590409399122, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=2.88783119201799, lr=0.12714166928031162
2023-12-07 07:13:09   INFO  epoch: 49/72, acc_iter=191438, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:03, time_cost(all): 1 day, 20:17:54/20:09:27, loss=0.374531211973181, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=4.370554026235007, lr=0.12702770220494386
2023-12-07 07:13:50   INFO  epoch: 49/72, acc_iter=191488, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:32, time_cost(all): 1 day, 20:18:35/20:28:44, loss=0.37447201454724, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=3.079719556434992, lr=0.12691373512957604
2023-12-07 07:14:32   INFO  epoch: 49/72, acc_iter=191538, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:31, time_cost(all): 1 day, 20:19:17/20:38:52, loss=0.374412817121299, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=4.263083703134333, lr=0.12679976805420823
2023-12-07 07:15:14   INFO  epoch: 49/72, acc_iter=191588, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:03, time_cost(all): 1 day, 20:19:59/20:33:09, loss=0.374353619695358, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=2.0529966682625265, lr=0.12668580097884047
2023-12-07 07:15:56   INFO  epoch: 49/72, acc_iter=191638, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:46, time_cost(all): 1 day, 20:20:41/20:15:13, loss=0.374294422269417, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=2.6036075359545356, lr=0.12657183390347265
2023-12-07 07:16:37   INFO  epoch: 49/72, acc_iter=191688, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:20, time_cost(all): 1 day, 20:21:22/20:05:29, loss=0.374235224843476, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=2.6887511128797095, lr=0.1264578668281049
2023-12-07 07:17:19   INFO  epoch: 49/72, acc_iter=191738, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:53, time_cost(all): 1 day, 20:22:04/20:10:03, loss=0.374176027417535, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.21(1.03), norm=3.1926748122059507, lr=0.12634389975273708
2023-12-07 07:18:01   INFO  epoch: 49/72, acc_iter=191788, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:42, time_cost(all): 1 day, 20:22:46/19:31:06, loss=0.374116829991594, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=2.5020382809719166, lr=0.12622993267736926
2023-12-07 07:18:43   INFO  epoch: 49/72, acc_iter=191838, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:34, time_cost(all): 1 day, 20:23:28/20:26:32, loss=0.374057632565653, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=2.6396409142405792, lr=0.1261159656020015
2023-12-07 07:19:25   INFO  epoch: 49/72, acc_iter=191888, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:21, time_cost(all): 1 day, 20:24:10/19:08:02, loss=0.373998435139712, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=2.5493594486751427, lr=0.12600199852663369
2023-12-07 07:20:06   INFO  epoch: 49/72, acc_iter=191938, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:35, time_cost(all): 1 day, 20:24:51/19:32:52, loss=0.373939237713771, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=3.222352620618523, lr=0.12588803145126587
2023-12-07 07:20:48   INFO  epoch: 49/72, acc_iter=191988, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:01, time_cost(all): 1 day, 20:25:33/19:33:35, loss=0.373880040287831, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=0.9103186586037721, lr=0.1257740643758981
2023-12-07 07:21:30   INFO  epoch: 49/72, acc_iter=192038, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:59, time_cost(all): 1 day, 20:26:15/20:38:05, loss=0.37382084286189, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=4.525929997647216, lr=0.1256600973005303
2023-12-07 07:22:12   INFO  epoch: 49/72, acc_iter=192088, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:50, time_cost(all): 1 day, 20:26:57/20:52:43, loss=0.373761645435949, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=1.2539424775221686, lr=0.12554613022516253
2023-12-07 07:22:53   INFO  epoch: 49/72, acc_iter=192138, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:14:03, time_cost(all): 1 day, 20:27:38/19:19:15, loss=0.373702448010008, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=3.19203310091533, lr=0.12543216314979472
2023-12-07 07:23:35   INFO  epoch: 49/72, acc_iter=192188, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:10, time_cost(all): 1 day, 20:28:20/21:03:26, loss=0.373643250584067, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=2.939705465650395, lr=0.1253181960744269
2023-12-07 07:24:17   INFO  epoch: 49/72, acc_iter=192238, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:18, time_cost(all): 1 day, 20:29:02/20:45:39, loss=0.373584053158126, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=3.9841126275341097, lr=0.12520422899905914
2023-12-07 07:24:59   INFO  epoch: 49/72, acc_iter=192288, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:14, time_cost(all): 1 day, 20:29:44/20:29:52, loss=0.373524855732185, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=1.6284320666633225, lr=0.12509026192369133
2023-12-07 07:25:41   INFO  epoch: 49/72, acc_iter=192338, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:52, time_cost(all): 1 day, 20:30:26/20:59:37, loss=0.373465658306244, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=0.6198990984579111, lr=0.12497629484832351
2023-12-07 07:26:22   INFO  epoch: 49/72, acc_iter=192388, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:54, time_cost(all): 1 day, 20:31:07/19:43:46, loss=0.373406460880303, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=0.52023893775345, lr=0.12486232777295575
2023-12-07 07:27:04   INFO  epoch: 49/72, acc_iter=192438, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:09, time_cost(all): 1 day, 20:31:49/19:52:32, loss=0.373347263454362, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=0.7678918120027858, lr=0.12474836069758799
2023-12-07 07:27:46   INFO  epoch: 49/72, acc_iter=192488, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:19, time_cost(all): 1 day, 20:32:31/19:03:10, loss=0.373288066028421, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=1.5886237414922166, lr=0.12463439362222012
2023-12-07 07:28:28   INFO  epoch: 49/72, acc_iter=192538, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:38, time_cost(all): 1 day, 20:33:13/19:57:31, loss=0.37322886860248, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=2.4343729923914834, lr=0.12452042654685236
2023-12-07 07:29:10   INFO  epoch: 49/72, acc_iter=192588, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:13, time_cost(all): 1 day, 20:33:55/19:50:46, loss=0.373169671176539, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.921411685836627, lr=0.12440645947148454
2023-12-07 07:29:51   INFO  epoch: 49/72, acc_iter=192638, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:44, time_cost(all): 1 day, 20:34:36/20:51:11, loss=0.373110473750598, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=0.663510813682437, lr=0.12429249239611678
2023-12-07 07:30:33   INFO  epoch: 49/72, acc_iter=192688, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:51, time_cost(all): 1 day, 20:35:18/19:57:00, loss=0.373051276324657, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=4.573947981715195, lr=0.12417852532074897
2023-12-07 07:31:15   INFO  epoch: 49/72, acc_iter=192738, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:11, time_cost(all): 1 day, 20:36:00/19:58:37, loss=0.372992078898716, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=1.550247738106891, lr=0.12406455824538115
2023-12-07 07:31:57   INFO  epoch: 49/72, acc_iter=192788, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:16, time_cost(all): 1 day, 20:36:42/19:24:43, loss=0.372932881472775, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=4.96316852075096, lr=0.12395059117001339
2023-12-07 07:32:38   INFO  epoch: 49/72, acc_iter=192838, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:37, time_cost(all): 1 day, 20:37:23/19:21:01, loss=0.372873684046835, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=3.7076202472313353, lr=0.12383662409464563
2023-12-07 07:33:20   INFO  epoch: 49/72, acc_iter=192888, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:51, time_cost(all): 1 day, 20:38:05/19:50:50, loss=0.372814486620894, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=4.480641136808961, lr=0.12372265701927776
2023-12-07 07:34:02   INFO  epoch: 49/72, acc_iter=192938, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:10, time_cost(all): 1 day, 20:38:47/19:41:00, loss=0.372755289194953, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.1267626281248355, lr=0.12360868994391
2023-12-07 07:34:44   INFO  epoch: 49/72, acc_iter=192988, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:34, time_cost(all): 1 day, 20:39:29/20:43:21, loss=0.372696091769012, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=2.596952592991222, lr=0.12349472286854224
2023-12-07 07:35:26   INFO  epoch: 49/72, acc_iter=193038, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 1 day, 20:40:11/19:42:09, loss=0.372636894343071, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=1.9523998238654652, lr=0.12338075579317442
2023-12-07 07:36:07   INFO  epoch: 49/72, acc_iter=193088, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 20:40:52/20:45:12, loss=0.37257769691713, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=2.092099876338268, lr=0.12326678871780661
2023-12-07 07:36:49   INFO  epoch: 50/72, acc_iter=193150, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:19, time_cost(all): 1 day, 20:41:34/20:21:34, loss=0.372504292108963, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=1.768039739949707, lr=0.12312546954435055
2023-12-07 07:37:31   INFO  epoch: 50/72, acc_iter=193200, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:49:57, time_cost(all): 1 day, 20:42:16/19:30:46, loss=0.372445094683022, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=4.250468503161185, lr=0.12301150246898274
2023-12-07 07:38:13   INFO  epoch: 50/72, acc_iter=193250, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:38, time_cost(all): 1 day, 20:42:58/20:39:09, loss=0.372385897257081, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=1.7749343363424466, lr=0.12289753539361498
2023-12-07 07:38:54   INFO  epoch: 50/72, acc_iter=193300, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:37, time_cost(all): 1 day, 20:43:39/19:08:02, loss=0.37232669983114, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=1.810601377936214, lr=0.12278356831824716
2023-12-07 07:39:36   INFO  epoch: 50/72, acc_iter=193350, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:27, time_cost(all): 1 day, 20:44:21/19:01:03, loss=0.372267502405199, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=3.8167038846600705, lr=0.1226696012428794
2023-12-07 07:40:18   INFO  epoch: 50/72, acc_iter=193400, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:52, time_cost(all): 1 day, 20:45:03/18:56:55, loss=0.372208304979258, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=2.4233657807662583, lr=0.12255563416751158
2023-12-07 07:41:00   INFO  epoch: 50/72, acc_iter=193450, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:38, time_cost(all): 1 day, 20:45:45/20:27:26, loss=0.372149107553317, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=4.301004419717236, lr=0.12244166709214377
2023-12-07 07:41:42   INFO  epoch: 50/72, acc_iter=193500, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:50:33, time_cost(all): 1 day, 20:46:27/19:06:07, loss=0.372089910127376, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=3.2152476206097056, lr=0.12232770001677601
2023-12-07 07:42:23   INFO  epoch: 50/72, acc_iter=193550, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:14, time_cost(all): 1 day, 20:47:08/19:51:00, loss=0.372030712701436, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=0.7731818171439044, lr=0.12221373294140819
2023-12-07 07:43:05   INFO  epoch: 50/72, acc_iter=193600, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:12, time_cost(all): 1 day, 20:47:50/19:32:32, loss=0.371971515275495, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=3.2880913569170187, lr=0.12209976586604038
2023-12-07 07:43:47   INFO  epoch: 50/72, acc_iter=193650, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:48, time_cost(all): 1 day, 20:48:32/20:08:27, loss=0.371912317849554, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=1.6122905773108478, lr=0.12198579879067262
2023-12-07 07:44:29   INFO  epoch: 50/72, acc_iter=193700, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:21, time_cost(all): 1 day, 20:49:14/18:50:40, loss=0.371853120423613, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=0.7597306413967176, lr=0.12187183171530486
2023-12-07 07:45:10   INFO  epoch: 50/72, acc_iter=193750, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:55, time_cost(all): 1 day, 20:49:55/19:52:23, loss=0.371793922997672, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=0.8145997555763838, lr=0.12175786463993699
2023-12-07 07:45:52   INFO  epoch: 50/72, acc_iter=193800, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:34, time_cost(all): 1 day, 20:50:37/19:42:31, loss=0.371734725571731, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=3.4784623402734387, lr=0.12164389756456923
2023-12-07 07:46:34   INFO  epoch: 50/72, acc_iter=193850, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:51, time_cost(all): 1 day, 20:51:19/18:58:23, loss=0.37167552814579, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=2.4090087034205867, lr=0.12152993048920141
2023-12-07 07:47:16   INFO  epoch: 50/72, acc_iter=193900, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:22, time_cost(all): 1 day, 20:52:01/19:23:47, loss=0.371616330719849, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=0.9243237819431442, lr=0.12141596341383365
2023-12-07 07:47:58   INFO  epoch: 50/72, acc_iter=193950, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:27, time_cost(all): 1 day, 20:52:43/18:48:48, loss=0.371557133293908, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=0.7015402570597729, lr=0.12130199633846583
2023-12-07 07:48:39   INFO  epoch: 50/72, acc_iter=194000, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:13, time_cost(all): 1 day, 20:53:24/20:29:13, loss=0.371497935867967, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=4.799450326544207, lr=0.12118802926309802
2023-12-07 07:49:21   INFO  epoch: 50/72, acc_iter=194050, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:24, time_cost(all): 1 day, 20:54:06/19:21:19, loss=0.371438738442026, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=4.913822305436781, lr=0.12107406218773026
2023-12-07 07:50:03   INFO  epoch: 50/72, acc_iter=194100, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:40, time_cost(all): 1 day, 20:54:48/19:42:10, loss=0.371379541016085, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=1.471746086477976, lr=0.1209600951123625
2023-12-07 07:50:45   INFO  epoch: 50/72, acc_iter=194150, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:38, time_cost(all): 1 day, 20:55:30/19:25:21, loss=0.371320343590144, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=1.9648835726423943, lr=0.12084612803699463
2023-12-07 07:51:26   INFO  epoch: 50/72, acc_iter=194200, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:58, time_cost(all): 1 day, 20:56:11/19:17:12, loss=0.371261146164203, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=4.552254787862758, lr=0.12073216096162687
2023-12-07 07:52:08   INFO  epoch: 50/72, acc_iter=194250, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:15, time_cost(all): 1 day, 20:56:53/19:24:47, loss=0.371201948738262, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=0.6833086409845397, lr=0.1206181938862591
2023-12-07 07:52:50   INFO  epoch: 50/72, acc_iter=194300, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:03, time_cost(all): 1 day, 20:57:35/19:15:52, loss=0.371142751312321, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=2.986505661228386, lr=0.12050422681089129
2023-12-07 07:53:32   INFO  epoch: 50/72, acc_iter=194350, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:48, time_cost(all): 1 day, 20:58:17/19:18:03, loss=0.37108355388638, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=4.130734446727855, lr=0.12039025973552347
2023-12-07 07:54:14   INFO  epoch: 50/72, acc_iter=194400, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:57, time_cost(all): 1 day, 20:58:59/19:06:56, loss=0.37102435646044, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.581360824489487, lr=0.12027629266015571
2023-12-07 07:54:55   INFO  epoch: 50/72, acc_iter=194450, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:17, time_cost(all): 1 day, 20:59:40/19:36:55, loss=0.370965159034499, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=1.6692846342932666, lr=0.1201623255847879
2023-12-07 07:55:37   INFO  epoch: 50/72, acc_iter=194500, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:49, time_cost(all): 1 day, 21:00:22/19:21:23, loss=0.370905961608558, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=2.532159964452583, lr=0.12004835850942008
2023-12-07 07:56:19   INFO  epoch: 50/72, acc_iter=194550, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:44, time_cost(all): 1 day, 21:01:04/18:57:11, loss=0.370846764182617, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.5571013091428154, lr=0.11993439143405232
2023-12-07 07:57:01   INFO  epoch: 50/72, acc_iter=194600, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:32, time_cost(all): 1 day, 21:01:46/18:40:10, loss=0.370787566756676, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=2.5798767952609265, lr=0.11982042435868451
2023-12-07 07:57:42   INFO  epoch: 50/72, acc_iter=194650, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:00, time_cost(all): 1 day, 21:02:27/18:43:06, loss=0.370728369330735, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=0.7049905231945781, lr=0.11970645728331675
2023-12-07 07:58:24   INFO  epoch: 50/72, acc_iter=194700, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:27, time_cost(all): 1 day, 21:03:09/18:43:20, loss=0.370669171904794, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=4.606038298498101, lr=0.11959249020794888
2023-12-07 07:59:06   INFO  epoch: 50/72, acc_iter=194750, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:46, time_cost(all): 1 day, 21:03:51/19:34:32, loss=0.370609974478853, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=2.113874654059775, lr=0.11947852313258112
2023-12-07 07:59:48   INFO  epoch: 50/72, acc_iter=194800, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:14, time_cost(all): 1 day, 21:04:33/19:53:20, loss=0.370550777052912, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=0.8609790612235562, lr=0.11936455605721336
2023-12-07 08:00:30   INFO  epoch: 50/72, acc_iter=194850, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:22, time_cost(all): 1 day, 21:05:15/19:40:00, loss=0.370491579626971, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=0.8511270179817388, lr=0.11925058898184554
2023-12-07 08:01:11   INFO  epoch: 50/72, acc_iter=194900, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:01, time_cost(all): 1 day, 21:05:56/18:32:13, loss=0.37043238220103, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=3.654671642144477, lr=0.11913662190647772
2023-12-07 08:01:53   INFO  epoch: 50/72, acc_iter=194950, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:26, time_cost(all): 1 day, 21:06:38/18:55:10, loss=0.370373184775089, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=4.466776399620622, lr=0.11902265483110996
2023-12-07 08:02:35   INFO  epoch: 50/72, acc_iter=195000, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:13, time_cost(all): 1 day, 21:07:20/19:57:37, loss=0.370313987349148, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.5279673723755725, lr=0.11890868775574215
2023-12-07 08:03:17   INFO  epoch: 50/72, acc_iter=195050, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:58, time_cost(all): 1 day, 21:08:02/19:10:19, loss=0.370254789923207, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=2.5206465484434997, lr=0.11879472068037439
2023-12-07 08:03:59   INFO  epoch: 50/72, acc_iter=195100, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:56, time_cost(all): 1 day, 21:08:44/19:11:48, loss=0.370195592497266, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=2.739095111220782, lr=0.11868075360500657
2023-12-07 08:04:40   INFO  epoch: 50/72, acc_iter=195150, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:21, time_cost(all): 1 day, 21:09:25/18:58:53, loss=0.370136395071325, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=2.383591977526039, lr=0.11856678652963876
2023-12-07 08:05:22   INFO  epoch: 50/72, acc_iter=195200, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:38, time_cost(all): 1 day, 21:10:07/20:04:42, loss=0.370077197645384, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=4.426736011962237, lr=0.118452819454271
2023-12-07 08:06:04   INFO  epoch: 50/72, acc_iter=195250, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:42, time_cost(all): 1 day, 21:10:49/20:18:34, loss=0.370018000219444, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=0.838981465541567, lr=0.11833885237890318
2023-12-07 08:06:46   INFO  epoch: 50/72, acc_iter=195300, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:18, time_cost(all): 1 day, 21:11:31/18:52:15, loss=0.369958802793503, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=0.6485024137646468, lr=0.11822488530353537
2023-12-07 08:07:27   INFO  epoch: 50/72, acc_iter=195350, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:00, time_cost(all): 1 day, 21:12:12/18:24:03, loss=0.369899605367562, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=1.0023677034616387, lr=0.1181109182281676
2023-12-07 08:08:09   INFO  epoch: 50/72, acc_iter=195400, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:21, time_cost(all): 1 day, 21:12:54/18:43:00, loss=0.369840407941621, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=3.2459482016478995, lr=0.11799695115279979
2023-12-07 08:08:51   INFO  epoch: 50/72, acc_iter=195450, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:22, time_cost(all): 1 day, 21:13:36/20:01:20, loss=0.36978121051568, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=2.0540792013908886, lr=0.11788298407743197
2023-12-07 08:09:33   INFO  epoch: 50/72, acc_iter=195500, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:40, time_cost(all): 1 day, 21:14:18/20:07:20, loss=0.369722013089739, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=3.7468531615240117, lr=0.11776901700206421
2023-12-07 08:10:15   INFO  epoch: 50/72, acc_iter=195550, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:59, time_cost(all): 1 day, 21:15:00/18:50:54, loss=0.369662815663798, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=1.8833527986361767, lr=0.1176550499266964
2023-12-07 08:10:56   INFO  epoch: 50/72, acc_iter=195600, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:54, time_cost(all): 1 day, 21:15:41/19:38:24, loss=0.369603618237857, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=0.565864353591101, lr=0.11754108285132864
2023-12-07 08:11:38   INFO  epoch: 50/72, acc_iter=195650, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:45, time_cost(all): 1 day, 21:16:23/19:17:53, loss=0.369544420811916, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=4.851143881426528, lr=0.11742711577596082
2023-12-07 08:12:20   INFO  epoch: 50/72, acc_iter=195700, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:50, time_cost(all): 1 day, 21:17:05/19:32:26, loss=0.369485223385975, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=3.9143126829882604, lr=0.117313148700593
2023-12-07 08:13:02   INFO  epoch: 50/72, acc_iter=195750, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:16, time_cost(all): 1 day, 21:17:47/19:10:17, loss=0.369426025960034, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=3.666724778766697, lr=0.11719918162522525
2023-12-07 08:13:43   INFO  epoch: 50/72, acc_iter=195800, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:27, time_cost(all): 1 day, 21:18:28/18:43:51, loss=0.369366828534093, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=0.6134057975378763, lr=0.11708521454985749
2023-12-07 08:14:25   INFO  epoch: 50/72, acc_iter=195850, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:07, time_cost(all): 1 day, 21:19:10/19:32:51, loss=0.369307631108152, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=4.516100303240968, lr=0.11697124747448961
2023-12-07 08:15:07   INFO  epoch: 50/72, acc_iter=195900, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:06, time_cost(all): 1 day, 21:19:52/18:16:19, loss=0.369248433682211, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=0.9578463230236984, lr=0.11685728039912185
2023-12-07 08:15:49   INFO  epoch: 50/72, acc_iter=195950, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:40, time_cost(all): 1 day, 21:20:34/18:26:41, loss=0.36918923625627, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=1.401636353909577, lr=0.1167433133237541
2023-12-07 08:16:31   INFO  epoch: 50/72, acc_iter=196000, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:21, time_cost(all): 1 day, 21:21:16/19:51:48, loss=0.369130038830329, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=1.065414621546766, lr=0.11662934624838628
2023-12-07 08:17:12   INFO  epoch: 50/72, acc_iter=196050, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:55, time_cost(all): 1 day, 21:21:57/19:47:41, loss=0.369070841404389, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=4.702350361913982, lr=0.11651537917301846
2023-12-07 08:17:54   INFO  epoch: 50/72, acc_iter=196100, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:06, time_cost(all): 1 day, 21:22:39/19:14:27, loss=0.369011643978448, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=4.702616458941313, lr=0.1164014120976507
2023-12-07 08:18:36   INFO  epoch: 50/72, acc_iter=196150, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:20, time_cost(all): 1 day, 21:23:21/19:57:01, loss=0.368952446552507, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=4.893771722357427, lr=0.11628744502228289
2023-12-07 08:19:18   INFO  epoch: 50/72, acc_iter=196200, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:17, time_cost(all): 1 day, 21:24:03/19:11:27, loss=0.368893249126566, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=4.831819191202098, lr=0.11617347794691507
2023-12-07 08:19:59   INFO  epoch: 50/72, acc_iter=196250, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:30, time_cost(all): 1 day, 21:24:44/18:24:26, loss=0.368834051700625, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=3.595769313309069, lr=0.11605951087154726
2023-12-07 08:20:41   INFO  epoch: 50/72, acc_iter=196300, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:51, time_cost(all): 1 day, 21:25:26/19:44:22, loss=0.368774854274684, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=1.0731233052118443, lr=0.1159455437961795
2023-12-07 08:21:23   INFO  epoch: 50/72, acc_iter=196350, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:29, time_cost(all): 1 day, 21:26:08/19:33:23, loss=0.368715656848743, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=3.976167501465053, lr=0.11583157672081174
2023-12-07 08:22:05   INFO  epoch: 50/72, acc_iter=196400, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:31, time_cost(all): 1 day, 21:26:50/19:55:51, loss=0.368656459422802, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.445655521526742, lr=0.11571760964544386
2023-12-07 08:22:47   INFO  epoch: 50/72, acc_iter=196450, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:18, time_cost(all): 1 day, 21:27:32/19:40:52, loss=0.368597261996861, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.6419627471942646, lr=0.1156036425700761
2023-12-07 08:23:28   INFO  epoch: 50/72, acc_iter=196500, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:42, time_cost(all): 1 day, 21:28:13/18:47:27, loss=0.36853806457092, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=1.0177769317619267, lr=0.11548967549470834
2023-12-07 08:24:10   INFO  epoch: 50/72, acc_iter=196550, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:42, time_cost(all): 1 day, 21:28:55/18:13:24, loss=0.368478867144979, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=2.5372251037501528, lr=0.11537570841934053
2023-12-07 08:24:52   INFO  epoch: 50/72, acc_iter=196600, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:49, time_cost(all): 1 day, 21:29:37/19:58:18, loss=0.368419669719038, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=3.9282994720011333, lr=0.11526174134397271
2023-12-07 08:25:34   INFO  epoch: 50/72, acc_iter=196650, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:20, time_cost(all): 1 day, 21:30:19/19:20:58, loss=0.368360472293097, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=4.215346672916726, lr=0.11514777426860495
2023-12-07 08:26:15   INFO  epoch: 50/72, acc_iter=196700, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:34, time_cost(all): 1 day, 21:31:00/19:37:21, loss=0.368301274867156, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=1.5469235790154936, lr=0.11503380719323714
2023-12-07 08:26:57   INFO  epoch: 50/72, acc_iter=196750, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:02, time_cost(all): 1 day, 21:31:42/19:38:51, loss=0.368242077441215, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=2.263838823713403, lr=0.11491984011786938
2023-12-07 08:27:39   INFO  epoch: 50/72, acc_iter=196800, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:14, time_cost(all): 1 day, 21:32:24/18:55:24, loss=0.368182880015274, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=2.6920197968986814, lr=0.11480587304250156
2023-12-07 08:28:21   INFO  epoch: 50/72, acc_iter=196850, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 1 day, 21:33:06/19:36:02, loss=0.368123682589333, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=3.5646415585853926, lr=0.11469190596713374
2023-12-07 08:29:03   INFO  epoch: 50/72, acc_iter=196900, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 1 day, 21:33:48/18:14:25, loss=0.368064485163393, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=0.6170605100552475, lr=0.11457793889176598
2023-12-07 08:29:44   INFO  epoch: 50/72, acc_iter=196950, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 21:34:29/19:06:55, loss=0.368005287737452, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=3.439493985611464, lr=0.11446397181639817
2023-12-07 08:30:26   INFO  epoch: 51/72, acc_iter=197012, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:00, time_cost(all): 1 day, 21:35:11/19:42:10, loss=0.367931882929285, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=1.1004844463425147, lr=0.11432265264294211
2023-12-07 08:31:08   INFO  epoch: 51/72, acc_iter=197062, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:48, time_cost(all): 1 day, 21:35:53/19:31:12, loss=0.367872685503344, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=0.6165768160354663, lr=0.11420868556757435
2023-12-07 08:31:50   INFO  epoch: 51/72, acc_iter=197112, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:43, time_cost(all): 1 day, 21:36:35/19:02:13, loss=0.367813488077403, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=2.4159339428398026, lr=0.11409471849220648
2023-12-07 08:32:31   INFO  epoch: 51/72, acc_iter=197162, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:48:42, time_cost(all): 1 day, 21:37:16/18:29:50, loss=0.367754290651462, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=4.405054429976902, lr=0.11398075141683872
2023-12-07 08:33:13   INFO  epoch: 51/72, acc_iter=197212, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:47:48, time_cost(all): 1 day, 21:37:58/19:20:46, loss=0.367695093225521, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=0.7595325182879982, lr=0.11386678434147096
2023-12-07 08:33:55   INFO  epoch: 51/72, acc_iter=197262, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:10, time_cost(all): 1 day, 21:38:40/19:32:44, loss=0.36763589579958, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=3.1081044218736116, lr=0.11375281726610315
2023-12-07 08:34:37   INFO  epoch: 51/72, acc_iter=197312, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:25, time_cost(all): 1 day, 21:39:22/18:21:57, loss=0.367576698373639, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=1.1086516676165574, lr=0.11363885019073533
2023-12-07 08:35:19   INFO  epoch: 51/72, acc_iter=197362, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:16, time_cost(all): 1 day, 21:40:04/19:10:28, loss=0.367517500947698, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=0.8488389384342883, lr=0.11352488311536757
2023-12-07 08:36:00   INFO  epoch: 51/72, acc_iter=197412, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:50, time_cost(all): 1 day, 21:40:45/19:06:02, loss=0.367458303521757, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.293836286080391, lr=0.11341091603999975
2023-12-07 08:36:42   INFO  epoch: 51/72, acc_iter=197462, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:46, time_cost(all): 1 day, 21:41:27/18:46:33, loss=0.367399106095816, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.4937596945912643, lr=0.11329694896463194
2023-12-07 08:37:24   INFO  epoch: 51/72, acc_iter=197512, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:44, time_cost(all): 1 day, 21:42:09/18:11:45, loss=0.367339908669875, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=2.358888947873554, lr=0.11318298188926412
2023-12-07 08:38:06   INFO  epoch: 51/72, acc_iter=197562, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:53, time_cost(all): 1 day, 21:42:51/18:37:00, loss=0.367280711243934, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=4.467008030542541, lr=0.11306901481389636
2023-12-07 08:38:48   INFO  epoch: 51/72, acc_iter=197612, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:43, time_cost(all): 1 day, 21:43:33/18:03:19, loss=0.367221513817994, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=2.7857695965707867, lr=0.1129550477385286
2023-12-07 08:39:29   INFO  epoch: 51/72, acc_iter=197662, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:46:05, time_cost(all): 1 day, 21:44:14/19:25:00, loss=0.367162316392053, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=2.4792947232973637, lr=0.11284108066316073
2023-12-07 08:40:11   INFO  epoch: 51/72, acc_iter=197712, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:13, time_cost(all): 1 day, 21:44:56/18:40:41, loss=0.367103118966112, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=2.1876762754165506, lr=0.11272711358779297
2023-12-07 08:40:53   INFO  epoch: 51/72, acc_iter=197762, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:37, time_cost(all): 1 day, 21:45:38/17:57:55, loss=0.367043921540171, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=4.159518514546873, lr=0.11261314651242521
2023-12-07 08:41:35   INFO  epoch: 51/72, acc_iter=197812, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:21, time_cost(all): 1 day, 21:46:20/18:41:50, loss=0.36698472411423, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=0.5631285787438676, lr=0.1124991794370574
2023-12-07 08:42:16   INFO  epoch: 51/72, acc_iter=197862, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:29, time_cost(all): 1 day, 21:47:01/17:55:24, loss=0.366925526688289, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=2.3974752568567066, lr=0.11238521236168958
2023-12-07 08:42:58   INFO  epoch: 51/72, acc_iter=197912, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:19, time_cost(all): 1 day, 21:47:43/19:07:44, loss=0.366866329262348, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=2.2476019420250988, lr=0.11227124528632182
2023-12-07 08:43:40   INFO  epoch: 51/72, acc_iter=197962, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:45, time_cost(all): 1 day, 21:48:25/19:10:28, loss=0.366807131836407, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=3.5398480582340266, lr=0.112157278210954
2023-12-07 08:44:22   INFO  epoch: 51/72, acc_iter=198012, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:54, time_cost(all): 1 day, 21:49:07/18:04:34, loss=0.366747934410466, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=2.0998651823654884, lr=0.11204331113558624
2023-12-07 08:45:04   INFO  epoch: 51/72, acc_iter=198062, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:57, time_cost(all): 1 day, 21:49:49/18:01:36, loss=0.366688736984525, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=3.7477671756951447, lr=0.11192934406021843
2023-12-07 08:45:45   INFO  epoch: 51/72, acc_iter=198112, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:56, time_cost(all): 1 day, 21:50:30/19:12:52, loss=0.366629539558584, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=3.3478523934277886, lr=0.11181537698485061
2023-12-07 08:46:27   INFO  epoch: 51/72, acc_iter=198162, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:14, time_cost(all): 1 day, 21:51:12/17:55:04, loss=0.366570342132643, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=3.532027382765124, lr=0.11170140990948285
2023-12-07 08:47:09   INFO  epoch: 51/72, acc_iter=198212, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:40, time_cost(all): 1 day, 21:51:54/18:23:40, loss=0.366511144706702, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=4.9381589884913, lr=0.11158744283411504
2023-12-07 08:47:51   INFO  epoch: 51/72, acc_iter=198262, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:17, time_cost(all): 1 day, 21:52:36/18:50:53, loss=0.366451947280761, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=1.109360822080562, lr=0.11147347575874722
2023-12-07 08:48:32   INFO  epoch: 51/72, acc_iter=198312, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:08, time_cost(all): 1 day, 21:53:17/19:33:27, loss=0.36639274985482, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=1.7163035263589443, lr=0.11135950868337946
2023-12-07 08:49:14   INFO  epoch: 51/72, acc_iter=198362, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:58, time_cost(all): 1 day, 21:53:59/18:14:44, loss=0.366333552428879, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=4.018684115505826, lr=0.11124554160801164
2023-12-07 08:49:56   INFO  epoch: 51/72, acc_iter=198412, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:18, time_cost(all): 1 day, 21:54:41/19:22:45, loss=0.366274355002938, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=2.3107249231834217, lr=0.11113157453264383
2023-12-07 08:50:38   INFO  epoch: 51/72, acc_iter=198462, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:07, time_cost(all): 1 day, 21:55:23/17:53:15, loss=0.366215157576998, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=4.4445033930761415, lr=0.11101760745727607
2023-12-07 08:51:20   INFO  epoch: 51/72, acc_iter=198512, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:01, time_cost(all): 1 day, 21:56:05/18:36:22, loss=0.366155960151057, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=2.480464041485245, lr=0.11090364038190825
2023-12-07 08:52:01   INFO  epoch: 51/72, acc_iter=198562, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:13, time_cost(all): 1 day, 21:56:46/18:20:22, loss=0.366096762725116, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=2.7058351241121725, lr=0.11078967330654049
2023-12-07 08:52:43   INFO  epoch: 51/72, acc_iter=198612, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:54, time_cost(all): 1 day, 21:57:28/18:35:23, loss=0.366037565299175, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=4.734194229122983, lr=0.11067570623117268
2023-12-07 08:53:25   INFO  epoch: 51/72, acc_iter=198662, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:05, time_cost(all): 1 day, 21:58:10/17:42:13, loss=0.365978367873234, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=1.8439878755743795, lr=0.11056173915580486
2023-12-07 08:54:07   INFO  epoch: 51/72, acc_iter=198712, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:55, time_cost(all): 1 day, 21:58:52/18:27:49, loss=0.365919170447293, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=1.488370645840743, lr=0.1104477720804371
2023-12-07 08:54:48   INFO  epoch: 51/72, acc_iter=198762, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:23, time_cost(all): 1 day, 21:59:33/19:26:47, loss=0.365859973021352, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=3.526272967069059, lr=0.11033380500506934
2023-12-07 08:55:30   INFO  epoch: 51/72, acc_iter=198812, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:28, time_cost(all): 1 day, 22:00:15/18:26:33, loss=0.365800775595411, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=2.287234494842747, lr=0.11021983792970147
2023-12-07 08:56:12   INFO  epoch: 51/72, acc_iter=198862, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:30, time_cost(all): 1 day, 22:00:57/19:24:04, loss=0.36574157816947, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=2.6376280665354024, lr=0.11010587085433371
2023-12-07 08:56:54   INFO  epoch: 51/72, acc_iter=198912, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:11, time_cost(all): 1 day, 22:01:39/18:03:59, loss=0.365682380743529, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=1.2626546965660292, lr=0.10999190377896595
2023-12-07 08:57:36   INFO  epoch: 51/72, acc_iter=198962, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:38, time_cost(all): 1 day, 22:02:21/18:36:33, loss=0.365623183317588, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=0.6468581118829557, lr=0.10987793670359813
2023-12-07 08:58:17   INFO  epoch: 51/72, acc_iter=199012, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:06, time_cost(all): 1 day, 22:03:02/18:56:39, loss=0.365563985891647, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=2.7884879735291928, lr=0.10976396962823032
2023-12-07 08:58:59   INFO  epoch: 51/72, acc_iter=199062, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:32, time_cost(all): 1 day, 22:03:44/18:09:26, loss=0.365504788465706, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.91(1.03), norm=0.9758353189884599, lr=0.1096500025528625
2023-12-07 08:59:41   INFO  epoch: 51/72, acc_iter=199112, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:33, time_cost(all): 1 day, 22:04:26/17:38:54, loss=0.365445591039765, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=2.0316744581594044, lr=0.10953603547749474
2023-12-07 09:00:23   INFO  epoch: 51/72, acc_iter=199162, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:51, time_cost(all): 1 day, 22:05:08/18:03:06, loss=0.365386393613824, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=3.9763753925540994, lr=0.10942206840212693
2023-12-07 09:01:04   INFO  epoch: 51/72, acc_iter=199212, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:33, time_cost(all): 1 day, 22:05:49/18:28:10, loss=0.365327196187883, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=4.633194041573397, lr=0.10930810132675911
2023-12-07 09:01:46   INFO  epoch: 51/72, acc_iter=199262, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:06, time_cost(all): 1 day, 22:06:31/18:50:45, loss=0.365267998761942, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=1.3004077308605235, lr=0.10919413425139135
2023-12-07 09:02:28   INFO  epoch: 51/72, acc_iter=199312, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:49, time_cost(all): 1 day, 22:07:13/18:36:16, loss=0.365208801336002, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=4.845285298900934, lr=0.10908016717602359
2023-12-07 09:03:10   INFO  epoch: 51/72, acc_iter=199362, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:56, time_cost(all): 1 day, 22:07:55/18:05:40, loss=0.365149603910061, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=1.879644351087261, lr=0.10896620010065572
2023-12-07 09:03:52   INFO  epoch: 51/72, acc_iter=199412, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:53, time_cost(all): 1 day, 22:08:37/17:43:00, loss=0.36509040648412, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=2.5768073444213235, lr=0.10885223302528796
2023-12-07 09:04:33   INFO  epoch: 51/72, acc_iter=199462, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:11, time_cost(all): 1 day, 22:09:18/18:08:53, loss=0.365031209058179, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=0.985528318486931, lr=0.1087382659499202
2023-12-07 09:05:15   INFO  epoch: 51/72, acc_iter=199512, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:37, time_cost(all): 1 day, 22:10:00/17:40:59, loss=0.364972011632238, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.716077644195103, lr=0.10862429887455238
2023-12-07 09:05:57   INFO  epoch: 51/72, acc_iter=199562, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:56, time_cost(all): 1 day, 22:10:42/18:21:39, loss=0.364912814206297, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=4.353923271473052, lr=0.10851033179918457
2023-12-07 09:06:39   INFO  epoch: 51/72, acc_iter=199612, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:31, time_cost(all): 1 day, 22:11:24/18:14:29, loss=0.364853616780356, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=3.4687476031550353, lr=0.10839636472381681
2023-12-07 09:07:20   INFO  epoch: 51/72, acc_iter=199662, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:50, time_cost(all): 1 day, 22:12:05/19:02:12, loss=0.364794419354415, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=0.9908391529801575, lr=0.10828239764844899
2023-12-07 09:08:02   INFO  epoch: 51/72, acc_iter=199712, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:16, time_cost(all): 1 day, 22:12:47/17:25:18, loss=0.364735221928474, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=1.0281957261004129, lr=0.10816843057308123
2023-12-07 09:08:44   INFO  epoch: 51/72, acc_iter=199762, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:28, time_cost(all): 1 day, 22:13:29/18:35:18, loss=0.364676024502533, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=0.7303218791488011, lr=0.10805446349771342
2023-12-07 09:09:26   INFO  epoch: 51/72, acc_iter=199812, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:22, time_cost(all): 1 day, 22:14:11/17:52:47, loss=0.364616827076592, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.4789730485978576, lr=0.1079404964223456
2023-12-07 09:10:08   INFO  epoch: 51/72, acc_iter=199862, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:45, time_cost(all): 1 day, 22:14:53/17:23:24, loss=0.364557629650651, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=2.9445713408933707, lr=0.10782652934697784
2023-12-07 09:10:49   INFO  epoch: 51/72, acc_iter=199912, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:44, time_cost(all): 1 day, 22:15:34/18:41:10, loss=0.36449843222471, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=1.60069374826688, lr=0.10771256227161002
2023-12-07 09:11:31   INFO  epoch: 51/72, acc_iter=199962, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:24, time_cost(all): 1 day, 22:16:16/18:12:52, loss=0.364439234798769, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=2.437881263816571, lr=0.10759859519624221
2023-12-07 09:12:13   INFO  epoch: 51/72, acc_iter=200012, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:32, time_cost(all): 1 day, 22:16:58/17:52:32, loss=0.364380037372828, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=3.126413153878789, lr=0.10748462812087445
2023-12-07 09:12:55   INFO  epoch: 51/72, acc_iter=200062, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:32, time_cost(all): 1 day, 22:17:40/17:28:44, loss=0.364320839946887, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=0.7356559788935503, lr=0.10737066104550663
2023-12-07 09:13:37   INFO  epoch: 51/72, acc_iter=200112, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:43, time_cost(all): 1 day, 22:18:22/17:56:59, loss=0.364261642520946, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=2.5333662031159783, lr=0.10725669397013882
2023-12-07 09:14:18   INFO  epoch: 51/72, acc_iter=200162, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:08, time_cost(all): 1 day, 22:19:03/18:43:26, loss=0.364202445095005, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=4.987250142359651, lr=0.10714272689477106
2023-12-07 09:15:00   INFO  epoch: 51/72, acc_iter=200212, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:33, time_cost(all): 1 day, 22:19:45/18:24:37, loss=0.364143247669065, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=4.018332985644651, lr=0.10702875981940324
2023-12-07 09:15:42   INFO  epoch: 51/72, acc_iter=200262, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:09, time_cost(all): 1 day, 22:20:27/18:50:07, loss=0.364084050243124, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=0.7593606895814544, lr=0.10691479274403548
2023-12-07 09:16:24   INFO  epoch: 51/72, acc_iter=200312, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:12, time_cost(all): 1 day, 22:21:09/17:38:47, loss=0.364024852817183, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=1.2237563131915568, lr=0.10680082566866766
2023-12-07 09:17:05   INFO  epoch: 51/72, acc_iter=200362, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:28, time_cost(all): 1 day, 22:21:50/17:52:11, loss=0.363965655391242, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=0.6620817313080806, lr=0.10668685859329985
2023-12-07 09:17:47   INFO  epoch: 51/72, acc_iter=200412, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:57, time_cost(all): 1 day, 22:22:32/19:02:20, loss=0.363906457965301, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=3.424501068219879, lr=0.10657289151793209
2023-12-07 09:18:29   INFO  epoch: 51/72, acc_iter=200462, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:00, time_cost(all): 1 day, 22:23:14/17:24:38, loss=0.36384726053936, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=4.694485580976892, lr=0.10645892444256433
2023-12-07 09:19:11   INFO  epoch: 51/72, acc_iter=200512, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:33, time_cost(all): 1 day, 22:23:56/17:28:19, loss=0.363788063113419, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=2.325556363016876, lr=0.10634495736719646
2023-12-07 09:19:53   INFO  epoch: 51/72, acc_iter=200562, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:36, time_cost(all): 1 day, 22:24:38/18:17:21, loss=0.363728865687478, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=0.7545485364045821, lr=0.1062309902918287
2023-12-07 09:20:34   INFO  epoch: 51/72, acc_iter=200612, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:58, time_cost(all): 1 day, 22:25:19/18:43:08, loss=0.363669668261537, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=2.618028291074195, lr=0.10611702321646094
2023-12-07 09:21:16   INFO  epoch: 51/72, acc_iter=200662, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:09, time_cost(all): 1 day, 22:26:01/17:51:26, loss=0.363610470835596, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=0.7694916095732092, lr=0.10600305614109312
2023-12-07 09:21:58   INFO  epoch: 51/72, acc_iter=200712, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 1 day, 22:26:43/18:04:11, loss=0.363551273409655, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=2.874799143408486, lr=0.1058890890657253
2023-12-07 09:22:40   INFO  epoch: 51/72, acc_iter=200762, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 1 day, 22:27:25/17:32:39, loss=0.363492075983714, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=3.611065772400746, lr=0.10577512199035749
2023-12-07 09:23:21   INFO  epoch: 51/72, acc_iter=200812, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 1 day, 22:28:06/18:56:33, loss=0.363432878557773, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=3.173705873554291, lr=0.10566115491498973
2023-12-07 09:24:03   INFO  epoch: 52/72, acc_iter=200874, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:52, time_cost(all): 1 day, 22:28:48/17:48:02, loss=0.363359473749606, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=1.2100779049188695, lr=0.10551983574153367
2023-12-07 09:24:45   INFO  epoch: 52/72, acc_iter=200924, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:46, time_cost(all): 1 day, 22:29:30/17:45:19, loss=0.363300276323666, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=2.868490887749309, lr=0.10540586866616586
2023-12-07 09:25:27   INFO  epoch: 52/72, acc_iter=200974, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:34, time_cost(all): 1 day, 22:30:12/18:29:05, loss=0.363241078897725, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=4.930007136761063, lr=0.1052919015907981
2023-12-07 09:26:09   INFO  epoch: 52/72, acc_iter=201024, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:36, time_cost(all): 1 day, 22:30:54/18:24:34, loss=0.363181881471784, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.04(1.03), norm=1.7308100020641926, lr=0.10517793451543028
2023-12-07 09:26:50   INFO  epoch: 52/72, acc_iter=201074, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:58, time_cost(all): 1 day, 22:31:35/18:43:01, loss=0.363122684045843, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=1.4497025179816159, lr=0.10506396744006247
2023-12-07 09:27:32   INFO  epoch: 52/72, acc_iter=201124, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:52:01, time_cost(all): 1 day, 22:32:17/17:59:27, loss=0.363063486619902, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=4.289045953982422, lr=0.1049500003646947
2023-12-07 09:28:14   INFO  epoch: 52/72, acc_iter=201174, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:51:04, time_cost(all): 1 day, 22:32:59/18:29:10, loss=0.363004289193961, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=1.840318437255996, lr=0.10483603328932689
2023-12-07 09:28:56   INFO  epoch: 52/72, acc_iter=201224, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:06, time_cost(all): 1 day, 22:33:41/18:27:43, loss=0.36294509176802, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=4.247154343431913, lr=0.10472206621395908
2023-12-07 09:29:37   INFO  epoch: 52/72, acc_iter=201274, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:13, time_cost(all): 1 day, 22:34:22/17:25:42, loss=0.362885894342079, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=2.1175046513362235, lr=0.10460809913859132
2023-12-07 09:30:19   INFO  epoch: 52/72, acc_iter=201324, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:02, time_cost(all): 1 day, 22:35:04/18:24:12, loss=0.362826696916138, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=3.840308660351928, lr=0.1044941320632235
2023-12-07 09:31:01   INFO  epoch: 52/72, acc_iter=201374, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:45, time_cost(all): 1 day, 22:35:46/17:51:46, loss=0.362767499490197, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=3.44610502566122, lr=0.10438016498785568
2023-12-07 09:31:43   INFO  epoch: 52/72, acc_iter=201424, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:48, time_cost(all): 1 day, 22:36:28/18:18:28, loss=0.362708302064256, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=2.4993625104543766, lr=0.10426619791248792
2023-12-07 09:32:25   INFO  epoch: 52/72, acc_iter=201474, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:00, time_cost(all): 1 day, 22:37:10/18:40:09, loss=0.362649104638315, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.861900118360944, lr=0.10415223083712011
2023-12-07 09:33:06   INFO  epoch: 52/72, acc_iter=201524, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:33, time_cost(all): 1 day, 22:37:51/17:25:26, loss=0.362589907212374, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=4.398632098325859, lr=0.10403826376175235
2023-12-07 09:33:48   INFO  epoch: 52/72, acc_iter=201574, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:20, time_cost(all): 1 day, 22:38:33/18:07:43, loss=0.362530709786433, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=1.6908230431759717, lr=0.10392429668638453
2023-12-07 09:34:30   INFO  epoch: 52/72, acc_iter=201624, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:11, time_cost(all): 1 day, 22:39:15/17:44:46, loss=0.362471512360492, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=4.0907822481204565, lr=0.10381032961101672
2023-12-07 09:35:12   INFO  epoch: 52/72, acc_iter=201674, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:58, time_cost(all): 1 day, 22:39:57/17:13:18, loss=0.362412314934551, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=1.1008744858757649, lr=0.10369636253564896
2023-12-07 09:35:53   INFO  epoch: 52/72, acc_iter=201724, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:43, time_cost(all): 1 day, 22:40:38/18:14:01, loss=0.36235311750861, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=3.045050112479501, lr=0.1035823954602812
2023-12-07 09:36:35   INFO  epoch: 52/72, acc_iter=201774, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:28, time_cost(all): 1 day, 22:41:20/17:14:43, loss=0.36229392008267, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.9638614974138338, lr=0.10346842838491332
2023-12-07 09:37:17   INFO  epoch: 52/72, acc_iter=201824, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:16, time_cost(all): 1 day, 22:42:02/18:04:42, loss=0.362234722656729, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=3.1984055548770884, lr=0.10335446130954556
2023-12-07 09:37:59   INFO  epoch: 52/72, acc_iter=201874, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:48, time_cost(all): 1 day, 22:42:44/18:28:51, loss=0.362175525230788, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.8664219324923415, lr=0.10324049423417775
2023-12-07 09:38:41   INFO  epoch: 52/72, acc_iter=201924, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:04, time_cost(all): 1 day, 22:43:26/17:12:23, loss=0.362116327804847, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=4.2230903336455405, lr=0.10312652715880999
2023-12-07 09:39:22   INFO  epoch: 52/72, acc_iter=201974, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:21, time_cost(all): 1 day, 22:44:07/17:52:28, loss=0.362057130378906, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=0.7611767387152826, lr=0.10301256008344217
2023-12-07 09:40:04   INFO  epoch: 52/72, acc_iter=202024, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:31, time_cost(all): 1 day, 22:44:49/17:57:06, loss=0.361997932952965, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=1.2417791728396816, lr=0.10289859300807436
2023-12-07 09:40:46   INFO  epoch: 52/72, acc_iter=202074, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:47, time_cost(all): 1 day, 22:45:31/18:06:43, loss=0.361938735527024, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=3.9122153783117977, lr=0.1027846259327066
2023-12-07 09:41:28   INFO  epoch: 52/72, acc_iter=202124, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:15, time_cost(all): 1 day, 22:46:13/18:21:02, loss=0.361879538101083, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=0.9276688876356783, lr=0.10267065885733878
2023-12-07 09:42:09   INFO  epoch: 52/72, acc_iter=202174, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:46, time_cost(all): 1 day, 22:46:54/17:48:35, loss=0.361820340675142, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=4.180663097794495, lr=0.10255669178197097
2023-12-07 09:42:51   INFO  epoch: 52/72, acc_iter=202224, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:30, time_cost(all): 1 day, 22:47:36/17:55:16, loss=0.361761143249201, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=1.6385301634631293, lr=0.1024427247066032
2023-12-07 09:43:33   INFO  epoch: 52/72, acc_iter=202274, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:47, time_cost(all): 1 day, 22:48:18/17:02:30, loss=0.36170194582326, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=2.8920893720516903, lr=0.10232875763123545
2023-12-07 09:44:15   INFO  epoch: 52/72, acc_iter=202324, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:46, time_cost(all): 1 day, 22:49:00/18:09:41, loss=0.361642748397319, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=0.8859915459111074, lr=0.10221479055586757
2023-12-07 09:44:57   INFO  epoch: 52/72, acc_iter=202374, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:32, time_cost(all): 1 day, 22:49:42/17:11:06, loss=0.361583550971378, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.9057740619673114, lr=0.10210082348049981
2023-12-07 09:45:38   INFO  epoch: 52/72, acc_iter=202424, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:14, time_cost(all): 1 day, 22:50:23/17:45:51, loss=0.361524353545437, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=4.859498442315583, lr=0.10198685640513205
2023-12-07 09:46:20   INFO  epoch: 52/72, acc_iter=202474, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:18, time_cost(all): 1 day, 22:51:05/17:25:14, loss=0.361465156119496, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=2.296820969938833, lr=0.10187288932976424
2023-12-07 09:47:02   INFO  epoch: 52/72, acc_iter=202524, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:27, time_cost(all): 1 day, 22:51:47/17:00:21, loss=0.361405958693555, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=3.932624267747788, lr=0.10175892225439642
2023-12-07 09:47:44   INFO  epoch: 52/72, acc_iter=202574, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:52, time_cost(all): 1 day, 22:52:29/16:58:56, loss=0.361346761267614, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=3.8604781884705557, lr=0.10164495517902866
2023-12-07 09:48:25   INFO  epoch: 52/72, acc_iter=202624, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:26, time_cost(all): 1 day, 22:53:10/18:10:01, loss=0.361287563841674, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=0.5392669928378971, lr=0.10153098810366085
2023-12-07 09:49:07   INFO  epoch: 52/72, acc_iter=202674, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:16, time_cost(all): 1 day, 22:53:52/18:05:04, loss=0.361228366415733, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=3.0974455450562908, lr=0.10141702102829309
2023-12-07 09:49:49   INFO  epoch: 52/72, acc_iter=202724, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:52, time_cost(all): 1 day, 22:54:34/17:59:44, loss=0.361169168989792, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=0.7623972753307762, lr=0.10130305395292521
2023-12-07 09:50:31   INFO  epoch: 52/72, acc_iter=202774, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:40, time_cost(all): 1 day, 22:55:16/17:37:51, loss=0.361109971563851, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=2.123121602176811, lr=0.10118908687755745
2023-12-07 09:51:13   INFO  epoch: 52/72, acc_iter=202824, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:37, time_cost(all): 1 day, 22:55:58/16:54:00, loss=0.36105077413791, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=2.4818666537824168, lr=0.1010751198021897
2023-12-07 09:51:54   INFO  epoch: 52/72, acc_iter=202874, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:24, time_cost(all): 1 day, 22:56:39/17:40:37, loss=0.360991576711969, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=1.2072511134640485, lr=0.10096115272682188
2023-12-07 09:52:36   INFO  epoch: 52/72, acc_iter=202924, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:16, time_cost(all): 1 day, 22:57:21/17:58:35, loss=0.360932379286028, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=1.5064054602203907, lr=0.10084718565145406
2023-12-07 09:53:18   INFO  epoch: 52/72, acc_iter=202974, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:11, time_cost(all): 1 day, 22:58:03/18:25:25, loss=0.360873181860087, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=1.2222061245377296, lr=0.1007332185760863
2023-12-07 09:54:00   INFO  epoch: 52/72, acc_iter=203024, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:22, time_cost(all): 1 day, 22:58:45/16:45:22, loss=0.360813984434146, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=3.12155788438804, lr=0.10061925150071849
2023-12-07 09:54:42   INFO  epoch: 52/72, acc_iter=203074, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:36, time_cost(all): 1 day, 22:59:27/17:18:06, loss=0.360754787008205, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=1.618111393217001, lr=0.10050528442535067
2023-12-07 09:55:23   INFO  epoch: 52/72, acc_iter=203124, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:18, time_cost(all): 1 day, 23:00:08/18:02:35, loss=0.360695589582264, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=2.5844687998639713, lr=0.10039131734998291
2023-12-07 09:56:05   INFO  epoch: 52/72, acc_iter=203174, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:01, time_cost(all): 1 day, 23:00:50/17:56:27, loss=0.360636392156323, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.11(1.03), norm=0.6969652612731585, lr=0.1002773502746151
2023-12-07 09:56:47   INFO  epoch: 52/72, acc_iter=203224, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:12, time_cost(all): 1 day, 23:01:32/17:02:59, loss=0.360577194730382, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=2.7498254012166488, lr=0.10016338319924734
2023-12-07 09:57:29   INFO  epoch: 52/72, acc_iter=203274, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:33, time_cost(all): 1 day, 23:02:14/16:55:50, loss=0.360517997304441, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=1.814704453403767, lr=0.10004941612387952
2023-12-07 09:58:10   INFO  epoch: 52/72, acc_iter=203324, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:14, time_cost(all): 1 day, 23:02:55/18:14:46, loss=0.3604587998785, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=2.4151944640747365, lr=0.0999354490485117
2023-12-07 09:58:52   INFO  epoch: 52/72, acc_iter=203374, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:54, time_cost(all): 1 day, 23:03:37/17:39:56, loss=0.360399602452559, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=2.675951401208173, lr=0.09982148197314394
2023-12-07 09:59:34   INFO  epoch: 52/72, acc_iter=203424, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:00, time_cost(all): 1 day, 23:04:19/18:16:17, loss=0.360340405026618, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=1.9521282374858397, lr=0.09970751489777618
2023-12-07 10:00:16   INFO  epoch: 52/72, acc_iter=203474, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:03, time_cost(all): 1 day, 23:05:01/17:41:50, loss=0.360281207600677, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=4.065116810488585, lr=0.09959354782240831
2023-12-07 10:00:58   INFO  epoch: 52/72, acc_iter=203524, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:16, time_cost(all): 1 day, 23:05:43/16:36:51, loss=0.360222010174737, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=2.3859557912838905, lr=0.09947958074704055
2023-12-07 10:01:39   INFO  epoch: 52/72, acc_iter=203574, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:19, time_cost(all): 1 day, 23:06:24/17:06:07, loss=0.360162812748796, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=3.7965517100456183, lr=0.09936561367167274
2023-12-07 10:02:21   INFO  epoch: 52/72, acc_iter=203624, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:34, time_cost(all): 1 day, 23:07:06/18:07:37, loss=0.360103615322855, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=3.1723087556453406, lr=0.09925164659630498
2023-12-07 10:03:03   INFO  epoch: 52/72, acc_iter=203674, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:55, time_cost(all): 1 day, 23:07:48/18:02:54, loss=0.360044417896914, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=0.969487946571394, lr=0.09913767952093716
2023-12-07 10:03:45   INFO  epoch: 52/72, acc_iter=203724, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:48, time_cost(all): 1 day, 23:08:30/17:11:20, loss=0.359985220470973, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=1.5278493131218411, lr=0.09902371244556935
2023-12-07 10:04:26   INFO  epoch: 52/72, acc_iter=203774, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:12, time_cost(all): 1 day, 23:09:11/17:37:15, loss=0.359926023045032, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=4.1579639965961075, lr=0.09890974537020158
2023-12-07 10:05:08   INFO  epoch: 52/72, acc_iter=203824, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:26, time_cost(all): 1 day, 23:09:53/18:11:05, loss=0.359866825619091, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=0.5338634241861336, lr=0.09879577829483377
2023-12-07 10:05:50   INFO  epoch: 52/72, acc_iter=203874, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:29, time_cost(all): 1 day, 23:10:35/17:49:02, loss=0.35980762819315, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=3.2747767664246044, lr=0.09868181121946595
2023-12-07 10:06:32   INFO  epoch: 52/72, acc_iter=203924, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:43, time_cost(all): 1 day, 23:11:17/17:08:25, loss=0.359748430767209, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=4.282525712199657, lr=0.0985678441440982
2023-12-07 10:07:14   INFO  epoch: 52/72, acc_iter=203974, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:18, time_cost(all): 1 day, 23:11:59/17:16:01, loss=0.359689233341268, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=1.7243114441649707, lr=0.09845387706873043
2023-12-07 10:07:55   INFO  epoch: 52/72, acc_iter=204024, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:21, time_cost(all): 1 day, 23:12:40/16:44:53, loss=0.359630035915327, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=2.4501113312342757, lr=0.09833990999336256
2023-12-07 10:08:37   INFO  epoch: 52/72, acc_iter=204074, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:40, time_cost(all): 1 day, 23:13:22/17:23:19, loss=0.359570838489386, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=4.9699198287040804, lr=0.0982259429179948
2023-12-07 10:09:19   INFO  epoch: 52/72, acc_iter=204124, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:35, time_cost(all): 1 day, 23:14:04/17:18:13, loss=0.359511641063445, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=1.9186906024857429, lr=0.09811197584262704
2023-12-07 10:10:01   INFO  epoch: 52/72, acc_iter=204174, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:58, time_cost(all): 1 day, 23:14:46/16:37:55, loss=0.359452443637504, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=0.8820787238181859, lr=0.09799800876725923
2023-12-07 10:10:42   INFO  epoch: 52/72, acc_iter=204224, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:38, time_cost(all): 1 day, 23:15:27/17:26:59, loss=0.359393246211563, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=4.401398419183886, lr=0.09788404169189141
2023-12-07 10:11:24   INFO  epoch: 52/72, acc_iter=204274, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:31, time_cost(all): 1 day, 23:16:09/18:04:21, loss=0.359334048785622, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=4.022920616180176, lr=0.0977700746165236
2023-12-07 10:12:06   INFO  epoch: 52/72, acc_iter=204324, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:13, time_cost(all): 1 day, 23:16:51/17:59:02, loss=0.359274851359682, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=1.9953773913320527, lr=0.09765610754115583
2023-12-07 10:12:48   INFO  epoch: 52/72, acc_iter=204374, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:15, time_cost(all): 1 day, 23:17:33/17:55:37, loss=0.359215653933741, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=4.033446984282816, lr=0.09754214046578807
2023-12-07 10:13:30   INFO  epoch: 52/72, acc_iter=204424, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:33, time_cost(all): 1 day, 23:18:15/17:51:32, loss=0.3591564565078, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=3.3744911514204463, lr=0.0974281733904202
2023-12-07 10:14:11   INFO  epoch: 52/72, acc_iter=204474, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:54, time_cost(all): 1 day, 23:18:56/17:43:30, loss=0.359097259081859, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=3.9281546158769576, lr=0.09731420631505244
2023-12-07 10:14:53   INFO  epoch: 52/72, acc_iter=204524, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:20, time_cost(all): 1 day, 23:19:38/16:36:52, loss=0.359038061655918, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=3.024950642025189, lr=0.09720023923968468
2023-12-07 10:15:35   INFO  epoch: 52/72, acc_iter=204574, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:32, time_cost(all): 1 day, 23:20:20/16:20:40, loss=0.358978864229977, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=2.8759715987802457, lr=0.09708627216431687
2023-12-07 10:16:17   INFO  epoch: 52/72, acc_iter=204624, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 1 day, 23:21:02/17:50:23, loss=0.358919666804036, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=0.7682389353078243, lr=0.09697230508894905
2023-12-07 10:16:58   INFO  epoch: 52/72, acc_iter=204674, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 1 day, 23:21:43/16:50:07, loss=0.358860469378095, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=4.327389691437251, lr=0.09685833801358129
2023-12-07 10:17:40   INFO  epoch: 53/72, acc_iter=204736, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:16, time_cost(all): 1 day, 23:22:25/16:44:54, loss=0.358787064569928, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=1.9229188787077138, lr=0.09671701884012518
2023-12-07 10:18:22   INFO  epoch: 53/72, acc_iter=204786, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:51, time_cost(all): 1 day, 23:23:07/16:24:24, loss=0.358727867143987, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=4.178895533917737, lr=0.09660305176475742
2023-12-07 10:19:04   INFO  epoch: 53/72, acc_iter=204836, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:51, time_cost(all): 1 day, 23:23:49/16:34:29, loss=0.358668669718046, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=4.301702666329213, lr=0.0964890846893896
2023-12-07 10:19:46   INFO  epoch: 53/72, acc_iter=204886, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:03, time_cost(all): 1 day, 23:24:31/17:49:10, loss=0.358609472292105, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=2.53582768906775, lr=0.09637511761402184
2023-12-07 10:20:27   INFO  epoch: 53/72, acc_iter=204936, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:06, time_cost(all): 1 day, 23:25:12/17:35:24, loss=0.358550274866164, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.347841329552221, lr=0.09626115053865403
2023-12-07 10:21:09   INFO  epoch: 53/72, acc_iter=204986, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:54, time_cost(all): 1 day, 23:25:54/17:35:30, loss=0.358491077440223, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=4.9725218264327795, lr=0.09614718346328621
2023-12-07 10:21:51   INFO  epoch: 53/72, acc_iter=205036, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:55, time_cost(all): 1 day, 23:26:36/16:21:35, loss=0.358431880014283, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=1.592551987358584, lr=0.09603321638791845
2023-12-07 10:22:33   INFO  epoch: 53/72, acc_iter=205086, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:22, time_cost(all): 1 day, 23:27:18/17:21:53, loss=0.358372682588342, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.2270434787686613, lr=0.09591924931255064
2023-12-07 10:23:14   INFO  epoch: 53/72, acc_iter=205136, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:55, time_cost(all): 1 day, 23:27:59/17:54:18, loss=0.358313485162401, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=1.6402147601694204, lr=0.09580528223718282
2023-12-07 10:23:56   INFO  epoch: 53/72, acc_iter=205186, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:12, time_cost(all): 1 day, 23:28:41/16:36:28, loss=0.35825428773646, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.7609387192892383, lr=0.09569131516181506
2023-12-07 10:24:38   INFO  epoch: 53/72, acc_iter=205236, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:48:10, time_cost(all): 1 day, 23:29:23/17:27:58, loss=0.358195090310519, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=3.8382272905703902, lr=0.0955773480864473
2023-12-07 10:25:20   INFO  epoch: 53/72, acc_iter=205286, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:36, time_cost(all): 1 day, 23:30:05/17:02:08, loss=0.358135892884578, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=3.2503909964351543, lr=0.09546338101107943
2023-12-07 10:26:02   INFO  epoch: 53/72, acc_iter=205336, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:23, time_cost(all): 1 day, 23:30:47/17:49:14, loss=0.358076695458637, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=2.8162288664235704, lr=0.09534941393571167
2023-12-07 10:26:43   INFO  epoch: 53/72, acc_iter=205386, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:23, time_cost(all): 1 day, 23:31:28/16:52:36, loss=0.358017498032696, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=2.1176054498967365, lr=0.09523544686034391
2023-12-07 10:27:25   INFO  epoch: 53/72, acc_iter=205436, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:48, time_cost(all): 1 day, 23:32:10/17:37:09, loss=0.357958300606755, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=3.7627948923709518, lr=0.09512147978497609
2023-12-07 10:28:07   INFO  epoch: 53/72, acc_iter=205486, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:03, time_cost(all): 1 day, 23:32:52/16:36:09, loss=0.357899103180814, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=2.3221222001182564, lr=0.09500751270960828
2023-12-07 10:28:49   INFO  epoch: 53/72, acc_iter=205536, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:17, time_cost(all): 1 day, 23:33:34/17:28:24, loss=0.357839905754873, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=4.29947749330163, lr=0.09489354563424046
2023-12-07 10:29:31   INFO  epoch: 53/72, acc_iter=205586, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:43, time_cost(all): 1 day, 23:34:16/16:42:20, loss=0.357780708328932, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=2.2512317519905807, lr=0.0947795785588727
2023-12-07 10:30:12   INFO  epoch: 53/72, acc_iter=205636, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:37, time_cost(all): 1 day, 23:34:57/16:37:54, loss=0.357721510902991, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.21(1.03), norm=4.43409431368298, lr=0.09466561148350494
2023-12-07 10:30:54   INFO  epoch: 53/72, acc_iter=205686, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:19, time_cost(all): 1 day, 23:35:39/16:27:41, loss=0.35766231347705, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=3.2124165475677238, lr=0.09455164440813707
2023-12-07 10:31:36   INFO  epoch: 53/72, acc_iter=205736, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:47, time_cost(all): 1 day, 23:36:21/17:20:05, loss=0.357603116051109, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.23(1.03), norm=0.5207566376774673, lr=0.09443767733276931
2023-12-07 10:32:18   INFO  epoch: 53/72, acc_iter=205786, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:28, time_cost(all): 1 day, 23:37:03/16:34:44, loss=0.357543918625168, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=3.574073606146238, lr=0.09432371025740155
2023-12-07 10:32:59   INFO  epoch: 53/72, acc_iter=205836, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:04, time_cost(all): 1 day, 23:37:44/16:08:19, loss=0.357484721199227, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.950746433549959, lr=0.09420974318203373
2023-12-07 10:33:41   INFO  epoch: 53/72, acc_iter=205886, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:51, time_cost(all): 1 day, 23:38:26/16:57:09, loss=0.357425523773287, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=2.014666130523424, lr=0.09409577610666592
2023-12-07 10:34:23   INFO  epoch: 53/72, acc_iter=205936, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:38:08, time_cost(all): 1 day, 23:39:08/17:20:54, loss=0.357366326347346, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=3.7904892020219294, lr=0.09398180903129816
2023-12-07 10:35:05   INFO  epoch: 53/72, acc_iter=205986, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:47, time_cost(all): 1 day, 23:39:50/16:56:11, loss=0.357307128921405, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=1.0656652363366514, lr=0.09386784195593034
2023-12-07 10:35:47   INFO  epoch: 53/72, acc_iter=206036, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:50, time_cost(all): 1 day, 23:40:32/17:33:12, loss=0.357247931495464, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=1.579605025349966, lr=0.09375387488056253
2023-12-07 10:36:28   INFO  epoch: 53/72, acc_iter=206086, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:55, time_cost(all): 1 day, 23:41:13/16:12:26, loss=0.357188734069523, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=2.429250775641816, lr=0.09363990780519477
2023-12-07 10:37:10   INFO  epoch: 53/72, acc_iter=206136, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:14, time_cost(all): 1 day, 23:41:55/16:22:05, loss=0.357129536643582, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=4.046576447426263, lr=0.09352594072982695
2023-12-07 10:37:52   INFO  epoch: 53/72, acc_iter=206186, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:43, time_cost(all): 1 day, 23:42:37/17:28:50, loss=0.357070339217641, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=4.992227766830091, lr=0.09341197365445919
2023-12-07 10:38:34   INFO  epoch: 53/72, acc_iter=206236, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:52, time_cost(all): 1 day, 23:43:19/16:55:56, loss=0.3570111417917, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=3.5272786079646687, lr=0.09329800657909137
2023-12-07 10:39:15   INFO  epoch: 53/72, acc_iter=206286, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:55, time_cost(all): 1 day, 23:44:00/16:16:54, loss=0.356951944365759, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=4.286601421043425, lr=0.09318403950372356
2023-12-07 10:39:57   INFO  epoch: 53/72, acc_iter=206336, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:10, time_cost(all): 1 day, 23:44:42/17:19:52, loss=0.356892746939818, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=1.4066868458034683, lr=0.0930700724283558
2023-12-07 10:40:39   INFO  epoch: 53/72, acc_iter=206386, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:13, time_cost(all): 1 day, 23:45:24/17:11:40, loss=0.356833549513877, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=4.840442417060502, lr=0.09295610535298798
2023-12-07 10:41:21   INFO  epoch: 53/72, acc_iter=206436, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:05, time_cost(all): 1 day, 23:46:06/16:20:12, loss=0.356774352087936, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=3.4160030014065352, lr=0.09284213827762017
2023-12-07 10:42:03   INFO  epoch: 53/72, acc_iter=206486, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:38, time_cost(all): 1 day, 23:46:48/16:55:18, loss=0.356715154661995, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=0.913826469538684, lr=0.09272817120225241
2023-12-07 10:42:44   INFO  epoch: 53/72, acc_iter=206536, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:56, time_cost(all): 1 day, 23:47:29/16:48:15, loss=0.356655957236054, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=4.603941915292223, lr=0.09261420412688459
2023-12-07 10:43:26   INFO  epoch: 53/72, acc_iter=206586, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:12, time_cost(all): 1 day, 23:48:11/17:00:43, loss=0.356596759810113, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=3.3212555812722044, lr=0.09250023705151683
2023-12-07 10:44:08   INFO  epoch: 53/72, acc_iter=206636, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:27, time_cost(all): 1 day, 23:48:53/17:04:28, loss=0.356537562384172, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=4.364839604859547, lr=0.09238626997614902
2023-12-07 10:44:50   INFO  epoch: 53/72, acc_iter=206686, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:50, time_cost(all): 1 day, 23:49:35/16:58:50, loss=0.356478364958231, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=2.924060731836556, lr=0.0922723029007812
2023-12-07 10:45:31   INFO  epoch: 53/72, acc_iter=206736, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:28, time_cost(all): 1 day, 23:50:16/16:43:34, loss=0.356419167532291, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=3.8849868786280446, lr=0.09215833582541344
2023-12-07 10:46:13   INFO  epoch: 53/72, acc_iter=206786, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:20, time_cost(all): 1 day, 23:50:58/16:23:36, loss=0.35635997010635, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=1.9489330778965144, lr=0.09204436875004562
2023-12-07 10:46:55   INFO  epoch: 53/72, acc_iter=206836, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:31, time_cost(all): 1 day, 23:51:40/15:57:50, loss=0.356300772680409, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=4.453567072520995, lr=0.09193040167467781
2023-12-07 10:47:37   INFO  epoch: 53/72, acc_iter=206886, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:36, time_cost(all): 1 day, 23:52:22/17:08:37, loss=0.356241575254468, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=2.69465832460591, lr=0.09181643459931005
2023-12-07 10:48:19   INFO  epoch: 53/72, acc_iter=206936, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:59, time_cost(all): 1 day, 23:53:04/17:11:15, loss=0.356182377828527, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.0493778744057374, lr=0.09170246752394229
2023-12-07 10:49:00   INFO  epoch: 53/72, acc_iter=206986, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:45, time_cost(all): 1 day, 23:53:45/16:50:24, loss=0.356123180402586, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=4.062896561429698, lr=0.09158850044857442
2023-12-07 10:49:42   INFO  epoch: 53/72, acc_iter=207036, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:18, time_cost(all): 1 day, 23:54:27/16:59:21, loss=0.356063982976645, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=4.307122689667926, lr=0.09147453337320666
2023-12-07 10:50:24   INFO  epoch: 53/72, acc_iter=207086, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:27, time_cost(all): 1 day, 23:55:09/16:08:05, loss=0.356004785550704, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=4.088257836428685, lr=0.09136056629783884
2023-12-07 10:51:06   INFO  epoch: 53/72, acc_iter=207136, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:27, time_cost(all): 1 day, 23:55:51/16:17:36, loss=0.355945588124763, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=0.5398407154303801, lr=0.09124659922247108
2023-12-07 10:51:47   INFO  epoch: 53/72, acc_iter=207186, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:01, time_cost(all): 1 day, 23:56:32/16:06:07, loss=0.355886390698822, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=4.4716124992990025, lr=0.09113263214710327
2023-12-07 10:52:29   INFO  epoch: 53/72, acc_iter=207236, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:52, time_cost(all): 1 day, 23:57:14/17:02:18, loss=0.355827193272881, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.11443542992889, lr=0.09101866507173545
2023-12-07 10:53:11   INFO  epoch: 53/72, acc_iter=207286, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:12, time_cost(all): 1 day, 23:57:56/17:12:08, loss=0.35576799584694, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=2.8251892660805575, lr=0.09090469799636769
2023-12-07 10:53:53   INFO  epoch: 53/72, acc_iter=207336, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:28, time_cost(all): 1 day, 23:58:38/15:54:08, loss=0.355708798420999, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=2.471741788399604, lr=0.09079073092099993
2023-12-07 10:54:35   INFO  epoch: 53/72, acc_iter=207386, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:31, time_cost(all): 1 day, 23:59:20/16:23:52, loss=0.355649600995058, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=4.596281115594918, lr=0.09067676384563206
2023-12-07 10:55:16   INFO  epoch: 53/72, acc_iter=207436, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:26, time_cost(all): 2 days, 0:00:01/17:06:03, loss=0.355590403569117, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=0.9913728689998298, lr=0.0905627967702643
2023-12-07 10:55:58   INFO  epoch: 53/72, acc_iter=207486, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:56, time_cost(all): 2 days, 0:00:43/16:09:10, loss=0.355531206143176, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=3.568856457053289, lr=0.09044882969489654
2023-12-07 10:56:40   INFO  epoch: 53/72, acc_iter=207536, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:45, time_cost(all): 2 days, 0:01:25/16:51:40, loss=0.355472008717235, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=2.99803588677928, lr=0.09033486261952872
2023-12-07 10:57:22   INFO  epoch: 53/72, acc_iter=207586, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:33, time_cost(all): 2 days, 0:02:07/15:53:36, loss=0.355412811291295, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=3.023285749552618, lr=0.0902208955441609
2023-12-07 10:58:03   INFO  epoch: 53/72, acc_iter=207636, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:05, time_cost(all): 2 days, 0:02:48/16:58:48, loss=0.355353613865354, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.6402323475544445, lr=0.09010692846879315
2023-12-07 10:58:45   INFO  epoch: 53/72, acc_iter=207686, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:18, time_cost(all): 2 days, 0:03:30/16:06:05, loss=0.355294416439413, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=2.416879593348328, lr=0.08999296139342533
2023-12-07 10:59:27   INFO  epoch: 53/72, acc_iter=207736, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:56, time_cost(all): 2 days, 0:04:12/16:45:37, loss=0.355235219013472, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.585284313659429, lr=0.08987899431805751
2023-12-07 11:00:09   INFO  epoch: 53/72, acc_iter=207786, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:15, time_cost(all): 2 days, 0:04:54/16:55:46, loss=0.355176021587531, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=4.969823791475764, lr=0.08976502724268975
2023-12-07 11:00:51   INFO  epoch: 53/72, acc_iter=207836, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:44, time_cost(all): 2 days, 0:05:36/15:47:31, loss=0.35511682416159, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=4.622847206018111, lr=0.08965106016732194
2023-12-07 11:01:32   INFO  epoch: 53/72, acc_iter=207886, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:11, time_cost(all): 2 days, 0:06:17/17:00:15, loss=0.355057626735649, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=2.6281456054633816, lr=0.08953709309195418
2023-12-07 11:02:14   INFO  epoch: 53/72, acc_iter=207936, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:09, time_cost(all): 2 days, 0:06:59/15:45:54, loss=0.354998429309708, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=0.6643266179521112, lr=0.08942312601658631
2023-12-07 11:02:56   INFO  epoch: 53/72, acc_iter=207986, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:34, time_cost(all): 2 days, 0:07:41/16:43:13, loss=0.354939231883767, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=1.5647163740687744, lr=0.08930915894121855
2023-12-07 11:03:38   INFO  epoch: 53/72, acc_iter=208036, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:25, time_cost(all): 2 days, 0:08:23/16:39:52, loss=0.354880034457826, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=4.022867961876075, lr=0.08919519186585079
2023-12-07 11:04:20   INFO  epoch: 53/72, acc_iter=208086, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:15, time_cost(all): 2 days, 0:09:05/16:15:27, loss=0.354820837031885, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=4.786703787260695, lr=0.08908122479048297
2023-12-07 11:05:01   INFO  epoch: 53/72, acc_iter=208136, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:36, time_cost(all): 2 days, 0:09:46/16:13:46, loss=0.354761639605944, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=2.8382165425407844, lr=0.08896725771511516
2023-12-07 11:05:43   INFO  epoch: 53/72, acc_iter=208186, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:16, time_cost(all): 2 days, 0:10:28/16:23:34, loss=0.354702442180003, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=0.7107294727979706, lr=0.0888532906397474
2023-12-07 11:06:25   INFO  epoch: 53/72, acc_iter=208236, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:17, time_cost(all): 2 days, 0:11:10/16:13:30, loss=0.354643244754062, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=1.0689090792249458, lr=0.08873932356437958
2023-12-07 11:07:07   INFO  epoch: 53/72, acc_iter=208286, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:34, time_cost(all): 2 days, 0:11:52/16:53:52, loss=0.354584047328121, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.4648494489034225, lr=0.08862535648901182
2023-12-07 11:07:48   INFO  epoch: 53/72, acc_iter=208336, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:04, time_cost(all): 2 days, 0:12:33/16:12:32, loss=0.35452484990218, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=1.6177711098748082, lr=0.088511389413644
2023-12-07 11:08:30   INFO  epoch: 53/72, acc_iter=208386, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:10, time_cost(all): 2 days, 0:13:15/15:35:40, loss=0.354465652476239, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=3.944371474889982, lr=0.08839742233827619
2023-12-07 11:09:12   INFO  epoch: 53/72, acc_iter=208436, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 2 days, 0:13:57/17:00:34, loss=0.354406455050299, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=3.7363202445592987, lr=0.08828345526290843
2023-12-07 11:09:54   INFO  epoch: 53/72, acc_iter=208486, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 2 days, 0:14:39/16:56:07, loss=0.354347257624358, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=3.6550865238941865, lr=0.08816948818754061
2023-12-07 11:10:36   INFO  epoch: 53/72, acc_iter=208536, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 0:15:21/16:38:55, loss=0.354288060198417, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.5463843138540954, lr=0.0880555211121728
2023-12-07 11:11:17   INFO  epoch: 54/72, acc_iter=208598, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:46, time_cost(all): 2 days, 0:16:02/16:24:06, loss=0.35421465539025, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=1.9445565287351516, lr=0.0879142019387168
2023-12-07 11:11:59   INFO  epoch: 54/72, acc_iter=208648, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:04, time_cost(all): 2 days, 0:16:44/16:39:11, loss=0.354155457964309, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=2.575255875334034, lr=0.08780023486334892
2023-12-07 11:12:41   INFO  epoch: 54/72, acc_iter=208698, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:06, time_cost(all): 2 days, 0:17:26/16:52:35, loss=0.354096260538368, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=2.583761215309903, lr=0.08768626778798116
2023-12-07 11:13:23   INFO  epoch: 54/72, acc_iter=208748, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:52:33, time_cost(all): 2 days, 0:18:08/16:03:30, loss=0.354037063112427, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=3.615270902770515, lr=0.0875723007126134
2023-12-07 11:14:04   INFO  epoch: 54/72, acc_iter=208798, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:15, time_cost(all): 2 days, 0:18:49/15:38:48, loss=0.353977865686486, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=3.8621209116695785, lr=0.08745833363724559
2023-12-07 11:14:46   INFO  epoch: 54/72, acc_iter=208848, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:50, time_cost(all): 2 days, 0:19:31/16:11:12, loss=0.353918668260545, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=2.7813680583897114, lr=0.08734436656187777
2023-12-07 11:15:28   INFO  epoch: 54/72, acc_iter=208898, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:32, time_cost(all): 2 days, 0:20:13/15:42:00, loss=0.353859470834604, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=4.543769532584841, lr=0.08723039948651001
2023-12-07 11:16:10   INFO  epoch: 54/72, acc_iter=208948, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:26, time_cost(all): 2 days, 0:20:55/16:32:12, loss=0.353800273408663, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=4.322085493507656, lr=0.0871164324111422
2023-12-07 11:16:52   INFO  epoch: 54/72, acc_iter=208998, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:43, time_cost(all): 2 days, 0:21:37/15:34:06, loss=0.353741075982722, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=0.8642304309218938, lr=0.08700246533577438
2023-12-07 11:17:33   INFO  epoch: 54/72, acc_iter=209048, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:18, time_cost(all): 2 days, 0:22:18/16:39:06, loss=0.353681878556781, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.86(1.03), norm=1.5352280970884071, lr=0.08688849826040662
2023-12-07 11:18:15   INFO  epoch: 54/72, acc_iter=209098, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:56, time_cost(all): 2 days, 0:23:00/16:26:13, loss=0.35362268113084, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=1.5154159070850106, lr=0.0867745311850388
2023-12-07 11:18:57   INFO  epoch: 54/72, acc_iter=209148, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:34, time_cost(all): 2 days, 0:23:42/15:55:16, loss=0.3535634837049, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=3.9882321064657944, lr=0.08666056410967105
2023-12-07 11:19:39   INFO  epoch: 54/72, acc_iter=209198, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:13, time_cost(all): 2 days, 0:24:24/16:21:07, loss=0.353504286278959, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=4.686581767580405, lr=0.08654659703430317
2023-12-07 11:20:20   INFO  epoch: 54/72, acc_iter=209248, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:44, time_cost(all): 2 days, 0:25:05/16:13:21, loss=0.353445088853018, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=3.192726353885716, lr=0.08643262995893541
2023-12-07 11:21:02   INFO  epoch: 54/72, acc_iter=209298, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:34, time_cost(all): 2 days, 0:25:47/15:30:34, loss=0.353385891427077, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=2.232827698620744, lr=0.08631866288356765
2023-12-07 11:21:44   INFO  epoch: 54/72, acc_iter=209348, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:18, time_cost(all): 2 days, 0:26:29/16:05:51, loss=0.353326694001136, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=1.7481202474023363, lr=0.08620469580819984
2023-12-07 11:22:26   INFO  epoch: 54/72, acc_iter=209398, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:44, time_cost(all): 2 days, 0:27:11/15:55:43, loss=0.353267496575195, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=3.065158342908869, lr=0.08609072873283202
2023-12-07 11:23:08   INFO  epoch: 54/72, acc_iter=209448, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:44, time_cost(all): 2 days, 0:27:53/15:19:32, loss=0.353208299149254, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=4.617940384023417, lr=0.08597676165746426
2023-12-07 11:23:49   INFO  epoch: 54/72, acc_iter=209498, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:09, time_cost(all): 2 days, 0:28:34/16:08:55, loss=0.353149101723313, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=0.8854848412072849, lr=0.08586279458209645
2023-12-07 11:24:31   INFO  epoch: 54/72, acc_iter=209548, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:01, time_cost(all): 2 days, 0:29:16/16:07:17, loss=0.353089904297372, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=2.695766147530568, lr=0.08574882750672869
2023-12-07 11:25:13   INFO  epoch: 54/72, acc_iter=209598, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:44, time_cost(all): 2 days, 0:29:58/16:18:27, loss=0.353030706871431, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=1.2499451179475407, lr=0.08563486043136087
2023-12-07 11:25:55   INFO  epoch: 54/72, acc_iter=209648, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:40:06, time_cost(all): 2 days, 0:30:40/16:15:52, loss=0.35297150944549, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=2.619237768825141, lr=0.08552089335599306
2023-12-07 11:26:36   INFO  epoch: 54/72, acc_iter=209698, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:38, time_cost(all): 2 days, 0:31:21/15:43:21, loss=0.352912312019549, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=4.598777324439867, lr=0.0854069262806253
2023-12-07 11:27:18   INFO  epoch: 54/72, acc_iter=209748, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:56, time_cost(all): 2 days, 0:32:03/15:12:35, loss=0.352853114593608, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=2.054316016023554, lr=0.08529295920525748
2023-12-07 11:28:00   INFO  epoch: 54/72, acc_iter=209798, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:38, time_cost(all): 2 days, 0:32:45/16:15:57, loss=0.352793917167667, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=2.5143465112852033, lr=0.08517899212988966
2023-12-07 11:28:42   INFO  epoch: 54/72, acc_iter=209848, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:31, time_cost(all): 2 days, 0:33:27/16:33:49, loss=0.352734719741726, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=1.7826286138826046, lr=0.0850650250545219
2023-12-07 11:29:24   INFO  epoch: 54/72, acc_iter=209898, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:30, time_cost(all): 2 days, 0:34:09/15:39:16, loss=0.352675522315785, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=1.8291804831904583, lr=0.08495105797915414
2023-12-07 11:30:05   INFO  epoch: 54/72, acc_iter=209948, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:45, time_cost(all): 2 days, 0:34:50/16:26:07, loss=0.352616324889844, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=2.965754583099691, lr=0.08483709090378627
2023-12-07 11:30:47   INFO  epoch: 54/72, acc_iter=209998, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:10, time_cost(all): 2 days, 0:35:32/15:59:18, loss=0.352557127463904, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=1.0748264529043037, lr=0.08472312382841851
2023-12-07 11:31:29   INFO  epoch: 54/72, acc_iter=210048, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:53, time_cost(all): 2 days, 0:36:14/15:08:37, loss=0.352497930037963, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=2.039211733917943, lr=0.0846091567530507
2023-12-07 11:32:11   INFO  epoch: 54/72, acc_iter=210098, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:41, time_cost(all): 2 days, 0:36:56/15:55:28, loss=0.352438732612022, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=1.9074416865276977, lr=0.08449518967768294
2023-12-07 11:32:52   INFO  epoch: 54/72, acc_iter=210148, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:43, time_cost(all): 2 days, 0:37:37/16:01:27, loss=0.352379535186081, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=4.623060961697914, lr=0.08438122260231512
2023-12-07 11:33:34   INFO  epoch: 54/72, acc_iter=210198, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:46, time_cost(all): 2 days, 0:38:19/15:33:22, loss=0.35232033776014, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=2.763074491126849, lr=0.0842672555269473
2023-12-07 11:34:16   INFO  epoch: 54/72, acc_iter=210248, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:27, time_cost(all): 2 days, 0:39:01/16:40:02, loss=0.352261140334199, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=3.5894997124690025, lr=0.08415328845157954
2023-12-07 11:34:58   INFO  epoch: 54/72, acc_iter=210298, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:17, time_cost(all): 2 days, 0:39:43/15:30:43, loss=0.352201942908258, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=0.7814160120100359, lr=0.08403932137621178
2023-12-07 11:35:40   INFO  epoch: 54/72, acc_iter=210348, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:27, time_cost(all): 2 days, 0:40:25/15:09:00, loss=0.352142745482317, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=4.049708186620821, lr=0.08392535430084391
2023-12-07 11:36:21   INFO  epoch: 54/72, acc_iter=210398, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:42, time_cost(all): 2 days, 0:41:06/15:40:36, loss=0.352083548056376, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=4.304714579469961, lr=0.08381138722547615
2023-12-07 11:37:03   INFO  epoch: 54/72, acc_iter=210448, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:14, time_cost(all): 2 days, 0:41:48/15:48:40, loss=0.352024350630435, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=1.7167373002485795, lr=0.08369742015010839
2023-12-07 11:37:45   INFO  epoch: 54/72, acc_iter=210498, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:24, time_cost(all): 2 days, 0:42:30/16:11:01, loss=0.351965153204494, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=2.8890292789894643, lr=0.08358345307474058
2023-12-07 11:38:27   INFO  epoch: 54/72, acc_iter=210548, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:42, time_cost(all): 2 days, 0:43:12/16:16:44, loss=0.351905955778553, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.23(1.03), norm=4.488626577802002, lr=0.08346948599937276
2023-12-07 11:39:09   INFO  epoch: 54/72, acc_iter=210598, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:42, time_cost(all): 2 days, 0:43:54/15:14:07, loss=0.351846758352612, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=3.324296409366528, lr=0.083355518924005
2023-12-07 11:39:50   INFO  epoch: 54/72, acc_iter=210648, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:45, time_cost(all): 2 days, 0:44:35/16:35:00, loss=0.351787560926671, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=4.289137330127577, lr=0.08324155184863719
2023-12-07 11:40:32   INFO  epoch: 54/72, acc_iter=210698, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:52, time_cost(all): 2 days, 0:45:17/15:24:09, loss=0.35172836350073, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=4.848988579810522, lr=0.08312758477326937
2023-12-07 11:41:14   INFO  epoch: 54/72, acc_iter=210748, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:03, time_cost(all): 2 days, 0:45:59/15:10:57, loss=0.351669166074789, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=0.7917047537687447, lr=0.08301361769790155
2023-12-07 11:41:56   INFO  epoch: 54/72, acc_iter=210798, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:06, time_cost(all): 2 days, 0:46:41/15:49:42, loss=0.351609968648848, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=1.5062850903793525, lr=0.0828996506225338
2023-12-07 11:42:37   INFO  epoch: 54/72, acc_iter=210848, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:30, time_cost(all): 2 days, 0:47:22/14:58:29, loss=0.351550771222908, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=1.1907464767907465, lr=0.08278568354716603
2023-12-07 11:43:19   INFO  epoch: 54/72, acc_iter=210898, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:09, time_cost(all): 2 days, 0:48:04/16:07:17, loss=0.351491573796967, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=1.9615142796950786, lr=0.08267171647179816
2023-12-07 11:44:01   INFO  epoch: 54/72, acc_iter=210948, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:23, time_cost(all): 2 days, 0:48:46/15:24:46, loss=0.351432376371026, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=0.9145092407068118, lr=0.0825577493964304
2023-12-07 11:44:43   INFO  epoch: 54/72, acc_iter=210998, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:32, time_cost(all): 2 days, 0:49:28/15:38:47, loss=0.351373178945085, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=3.2015940140006345, lr=0.08244378232106264
2023-12-07 11:45:25   INFO  epoch: 54/72, acc_iter=211048, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:43, time_cost(all): 2 days, 0:50:10/16:10:15, loss=0.351313981519144, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=3.657314095613953, lr=0.08232981524569483
2023-12-07 11:46:06   INFO  epoch: 54/72, acc_iter=211098, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:31, time_cost(all): 2 days, 0:50:51/15:52:29, loss=0.351254784093203, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=3.2841777304412703, lr=0.08221584817032701
2023-12-07 11:46:48   INFO  epoch: 54/72, acc_iter=211148, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:02, time_cost(all): 2 days, 0:51:33/15:55:51, loss=0.351195586667262, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=2.1474733149472707, lr=0.08210188109495925
2023-12-07 11:47:30   INFO  epoch: 54/72, acc_iter=211198, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:03, time_cost(all): 2 days, 0:52:15/16:12:56, loss=0.351136389241321, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=4.456690245877985, lr=0.08198791401959143
2023-12-07 11:48:12   INFO  epoch: 54/72, acc_iter=211248, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:55, time_cost(all): 2 days, 0:52:57/15:48:40, loss=0.35107719181538, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=2.4051857132515866, lr=0.08187394694422367
2023-12-07 11:48:53   INFO  epoch: 54/72, acc_iter=211298, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:03, time_cost(all): 2 days, 0:53:38/15:00:03, loss=0.351017994389439, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=3.55570793957387, lr=0.08175997986885586
2023-12-07 11:49:35   INFO  epoch: 54/72, acc_iter=211348, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:48, time_cost(all): 2 days, 0:54:20/15:26:14, loss=0.350958796963498, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=2.865515015859638, lr=0.08164601279348804
2023-12-07 11:50:17   INFO  epoch: 54/72, acc_iter=211398, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:41, time_cost(all): 2 days, 0:55:02/16:16:18, loss=0.350899599537557, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=0.6388673859730176, lr=0.08153204571812028
2023-12-07 11:50:59   INFO  epoch: 54/72, acc_iter=211448, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:57, time_cost(all): 2 days, 0:55:44/15:44:32, loss=0.350840402111616, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=0.989985314018935, lr=0.08141807864275247
2023-12-07 11:51:41   INFO  epoch: 54/72, acc_iter=211498, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:21, time_cost(all): 2 days, 0:56:26/14:51:15, loss=0.350781204685675, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.0492706590525547, lr=0.08130411156738465
2023-12-07 11:52:22   INFO  epoch: 54/72, acc_iter=211548, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:24, time_cost(all): 2 days, 0:57:07/15:45:54, loss=0.350722007259734, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=2.147380071010544, lr=0.08119014449201689
2023-12-07 11:53:04   INFO  epoch: 54/72, acc_iter=211598, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:48, time_cost(all): 2 days, 0:57:49/15:02:53, loss=0.350662809833793, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=2.551081925561111, lr=0.08107617741664908
2023-12-07 11:53:46   INFO  epoch: 54/72, acc_iter=211648, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:08, time_cost(all): 2 days, 0:58:31/15:20:39, loss=0.350603612407852, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=4.853290199672324, lr=0.08096221034128126
2023-12-07 11:54:28   INFO  epoch: 54/72, acc_iter=211698, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:24, time_cost(all): 2 days, 0:59:13/15:35:55, loss=0.350544414981912, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=4.200345707867337, lr=0.0808482432659135
2023-12-07 11:55:09   INFO  epoch: 54/72, acc_iter=211748, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:38, time_cost(all): 2 days, 0:59:54/15:36:39, loss=0.350485217555971, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=2.1771252392151426, lr=0.08073427619054568
2023-12-07 11:55:51   INFO  epoch: 54/72, acc_iter=211798, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:32, time_cost(all): 2 days, 1:00:36/15:36:41, loss=0.35042602013003, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=1.5968754121863578, lr=0.08062030911517792
2023-12-07 11:56:33   INFO  epoch: 54/72, acc_iter=211848, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:48, time_cost(all): 2 days, 1:01:18/14:44:47, loss=0.350366822704089, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=1.7364691270294785, lr=0.08050634203981011
2023-12-07 11:57:15   INFO  epoch: 54/72, acc_iter=211898, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:56, time_cost(all): 2 days, 1:02:00/15:35:48, loss=0.350307625278148, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.8250369871586656, lr=0.08039237496444229
2023-12-07 11:57:57   INFO  epoch: 54/72, acc_iter=211948, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:40, time_cost(all): 2 days, 1:02:42/15:49:07, loss=0.350248427852207, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=3.6395767786517226, lr=0.08027840788907453
2023-12-07 11:58:38   INFO  epoch: 54/72, acc_iter=211998, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:53, time_cost(all): 2 days, 1:03:23/15:54:31, loss=0.350189230426266, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=2.7883168002469976, lr=0.08016444081370677
2023-12-07 11:59:20   INFO  epoch: 54/72, acc_iter=212048, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:57, time_cost(all): 2 days, 1:04:05/15:40:03, loss=0.350130033000325, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=1.2925519873177966, lr=0.0800504737383389
2023-12-07 12:00:02   INFO  epoch: 54/72, acc_iter=212098, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:13, time_cost(all): 2 days, 1:04:47/15:57:53, loss=0.350070835574384, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=4.0287700541847435, lr=0.07993650666297114
2023-12-07 12:00:44   INFO  epoch: 54/72, acc_iter=212148, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:31, time_cost(all): 2 days, 1:05:29/16:10:09, loss=0.350011638148443, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=2.0739737992922294, lr=0.07982253958760338
2023-12-07 12:01:25   INFO  epoch: 54/72, acc_iter=212198, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:52, time_cost(all): 2 days, 1:06:10/15:23:30, loss=0.349952440722502, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=4.4039243926580145, lr=0.07970857251223556
2023-12-07 12:02:07   INFO  epoch: 54/72, acc_iter=212248, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:16, time_cost(all): 2 days, 1:06:52/15:23:37, loss=0.349893243296561, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=3.491850077256, lr=0.07959460543686775
2023-12-07 12:02:49   INFO  epoch: 54/72, acc_iter=212298, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 2 days, 1:07:34/15:29:22, loss=0.34983404587062, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=4.877124613015511, lr=0.07948063836149999
2023-12-07 12:03:31   INFO  epoch: 54/72, acc_iter=212348, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:49, time_cost(all): 2 days, 1:08:16/15:13:32, loss=0.349774848444679, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=0.9167397140392213, lr=0.07936667128613217
2023-12-07 12:04:13   INFO  epoch: 54/72, acc_iter=212398, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 1:08:58/16:08:44, loss=0.349715651018738, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=2.208728398017545, lr=0.07925270421076436
2023-12-07 12:04:54   INFO  epoch: 55/72, acc_iter=212460, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:23, time_cost(all): 2 days, 1:09:39/15:06:00, loss=0.349642246210572, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=3.602333851426928, lr=0.0791113850373083
2023-12-07 12:05:36   INFO  epoch: 55/72, acc_iter=212510, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:38, time_cost(all): 2 days, 1:10:21/14:56:11, loss=0.349583048784631, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=0.5215796157231631, lr=0.07899741796194054
2023-12-07 12:06:18   INFO  epoch: 55/72, acc_iter=212560, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:44, time_cost(all): 2 days, 1:11:03/15:44:11, loss=0.34952385135869, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=4.374838468882398, lr=0.07888345088657273
2023-12-07 12:07:00   INFO  epoch: 55/72, acc_iter=212610, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:30, time_cost(all): 2 days, 1:11:45/15:29:08, loss=0.349464653932749, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=1.9375567603864687, lr=0.07876948381120491
2023-12-07 12:07:41   INFO  epoch: 55/72, acc_iter=212660, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:43, time_cost(all): 2 days, 1:12:26/15:44:31, loss=0.349405456506808, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=0.6713494211810731, lr=0.07865551673583715
2023-12-07 12:08:23   INFO  epoch: 55/72, acc_iter=212710, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:14, time_cost(all): 2 days, 1:13:08/16:03:08, loss=0.349346259080867, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=1.956249816970714, lr=0.07854154966046933
2023-12-07 12:09:05   INFO  epoch: 55/72, acc_iter=212760, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:51:09, time_cost(all): 2 days, 1:13:50/14:36:13, loss=0.349287061654926, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=1.4467814170161544, lr=0.07842758258510152
2023-12-07 12:09:47   INFO  epoch: 55/72, acc_iter=212810, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:04, time_cost(all): 2 days, 1:14:32/15:17:09, loss=0.349227864228985, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=2.5672998355474697, lr=0.07831361550973376
2023-12-07 12:10:29   INFO  epoch: 55/72, acc_iter=212860, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:41, time_cost(all): 2 days, 1:15:14/14:32:35, loss=0.349168666803044, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=2.5589979855690848, lr=0.07819964843436594
2023-12-07 12:11:10   INFO  epoch: 55/72, acc_iter=212910, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:49:02, time_cost(all): 2 days, 1:15:55/15:07:55, loss=0.349109469377103, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=1.9492494710927935, lr=0.07808568135899813
2023-12-07 12:11:52   INFO  epoch: 55/72, acc_iter=212960, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:48:11, time_cost(all): 2 days, 1:16:37/15:19:31, loss=0.349050271951162, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=4.626082463642643, lr=0.07797171428363037
2023-12-07 12:12:34   INFO  epoch: 55/72, acc_iter=213010, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:14, time_cost(all): 2 days, 1:17:19/15:28:42, loss=0.348991074525221, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=0.6290923991014469, lr=0.07785774720826255
2023-12-07 12:13:16   INFO  epoch: 55/72, acc_iter=213060, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:36, time_cost(all): 2 days, 1:18:01/15:50:04, loss=0.34893187709928, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=4.412504563509289, lr=0.07774378013289479
2023-12-07 12:13:58   INFO  epoch: 55/72, acc_iter=213110, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:12, time_cost(all): 2 days, 1:18:43/15:39:39, loss=0.348872679673339, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=1.7023607040803157, lr=0.07762981305752698
2023-12-07 12:14:39   INFO  epoch: 55/72, acc_iter=213160, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:27, time_cost(all): 2 days, 1:19:24/15:28:44, loss=0.348813482247398, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.0269178537227788, lr=0.07751584598215916
2023-12-07 12:15:21   INFO  epoch: 55/72, acc_iter=213210, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:33, time_cost(all): 2 days, 1:20:06/14:51:05, loss=0.348754284821457, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=2.8490642670428317, lr=0.0774018789067914
2023-12-07 12:16:03   INFO  epoch: 55/72, acc_iter=213260, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:09, time_cost(all): 2 days, 1:20:48/15:18:20, loss=0.348695087395517, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=0.9546142399830828, lr=0.07728791183142364
2023-12-07 12:16:45   INFO  epoch: 55/72, acc_iter=213310, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:08, time_cost(all): 2 days, 1:21:30/14:51:43, loss=0.348635889969576, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.83(1.03), norm=0.6365652186966342, lr=0.07717394475605577
2023-12-07 12:17:26   INFO  epoch: 55/72, acc_iter=213360, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:36, time_cost(all): 2 days, 1:22:11/15:06:49, loss=0.348576692543635, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=2.322125143094014, lr=0.07705997768068801
2023-12-07 12:18:08   INFO  epoch: 55/72, acc_iter=213410, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:45, time_cost(all): 2 days, 1:22:53/15:00:48, loss=0.348517495117694, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=2.1235925849959743, lr=0.07694601060532025
2023-12-07 12:18:50   INFO  epoch: 55/72, acc_iter=213460, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:44, time_cost(all): 2 days, 1:23:35/15:41:38, loss=0.348458297691753, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=0.7720433652049183, lr=0.07683204352995243
2023-12-07 12:19:32   INFO  epoch: 55/72, acc_iter=213510, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:41, time_cost(all): 2 days, 1:24:17/14:54:04, loss=0.348399100265812, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=2.519497500110326, lr=0.07671807645458462
2023-12-07 12:20:14   INFO  epoch: 55/72, acc_iter=213560, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:19, time_cost(all): 2 days, 1:24:59/15:02:39, loss=0.348339902839871, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=3.011172490236594, lr=0.0766041093792168
2023-12-07 12:20:55   INFO  epoch: 55/72, acc_iter=213610, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:10, time_cost(all): 2 days, 1:25:40/15:22:59, loss=0.34828070541393, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=3.8990339313770774, lr=0.07649014230384904
2023-12-07 12:21:37   INFO  epoch: 55/72, acc_iter=213660, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:28, time_cost(all): 2 days, 1:26:22/14:44:58, loss=0.348221507987989, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=2.0121547490355924, lr=0.07637617522848122
2023-12-07 12:22:19   INFO  epoch: 55/72, acc_iter=213710, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:53, time_cost(all): 2 days, 1:27:04/15:33:57, loss=0.348162310562048, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=1.3557198274606537, lr=0.07626220815311341
2023-12-07 12:23:01   INFO  epoch: 55/72, acc_iter=213760, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:05, time_cost(all): 2 days, 1:27:46/15:44:26, loss=0.348103113136107, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=2.1619028160189195, lr=0.07614824107774565
2023-12-07 12:23:42   INFO  epoch: 55/72, acc_iter=213810, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:32:35, time_cost(all): 2 days, 1:28:27/15:18:42, loss=0.348043915710166, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=3.252481464461035, lr=0.07603427400237789
2023-12-07 12:24:24   INFO  epoch: 55/72, acc_iter=213860, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:09, time_cost(all): 2 days, 1:29:09/15:14:58, loss=0.347984718284225, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=4.215039157504018, lr=0.07592030692701002
2023-12-07 12:25:06   INFO  epoch: 55/72, acc_iter=213910, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:33, time_cost(all): 2 days, 1:29:51/14:53:01, loss=0.347925520858284, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=2.8728485303304945, lr=0.07580633985164226
2023-12-07 12:25:48   INFO  epoch: 55/72, acc_iter=213960, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:37, time_cost(all): 2 days, 1:30:33/14:55:46, loss=0.347866323432343, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.7936884235987365, lr=0.0756923727762745
2023-12-07 12:26:30   INFO  epoch: 55/72, acc_iter=214010, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:16, time_cost(all): 2 days, 1:31:15/14:21:57, loss=0.347807126006402, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=3.470523415977661, lr=0.07557840570090668
2023-12-07 12:27:11   INFO  epoch: 55/72, acc_iter=214060, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:15, time_cost(all): 2 days, 1:31:56/15:41:01, loss=0.347747928580461, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=1.2924289748063171, lr=0.07546443862553887
2023-12-07 12:27:53   INFO  epoch: 55/72, acc_iter=214110, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:59, time_cost(all): 2 days, 1:32:38/15:29:24, loss=0.347688731154521, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=3.760114741102169, lr=0.0753504715501711
2023-12-07 12:28:35   INFO  epoch: 55/72, acc_iter=214160, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:22, time_cost(all): 2 days, 1:33:20/15:28:45, loss=0.34762953372858, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=1.3971527891906521, lr=0.07523650447480329
2023-12-07 12:29:17   INFO  epoch: 55/72, acc_iter=214210, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:33, time_cost(all): 2 days, 1:34:02/14:39:24, loss=0.347570336302639, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=3.2134606528311473, lr=0.07512253739943553
2023-12-07 12:29:58   INFO  epoch: 55/72, acc_iter=214260, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:02, time_cost(all): 2 days, 1:34:43/14:31:05, loss=0.347511138876698, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=4.19049364235792, lr=0.07500857032406771
2023-12-07 12:30:40   INFO  epoch: 55/72, acc_iter=214310, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:25, time_cost(all): 2 days, 1:35:25/14:45:24, loss=0.347451941450757, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=1.4813930678997735, lr=0.0748946032486999
2023-12-07 12:31:22   INFO  epoch: 55/72, acc_iter=214360, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:18, time_cost(all): 2 days, 1:36:07/14:14:43, loss=0.347392744024816, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=4.406821918550441, lr=0.07478063617333214
2023-12-07 12:32:04   INFO  epoch: 55/72, acc_iter=214410, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:24:58, time_cost(all): 2 days, 1:36:49/15:29:21, loss=0.347333546598875, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=1.2726086958004517, lr=0.07466666909796432
2023-12-07 12:32:46   INFO  epoch: 55/72, acc_iter=214460, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:14, time_cost(all): 2 days, 1:37:31/14:36:57, loss=0.347274349172934, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=1.157256697660928, lr=0.0745527020225965
2023-12-07 12:33:27   INFO  epoch: 55/72, acc_iter=214510, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:23, time_cost(all): 2 days, 1:38:12/14:23:40, loss=0.347215151746993, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=3.698683458280863, lr=0.07443873494722875
2023-12-07 12:34:09   INFO  epoch: 55/72, acc_iter=214560, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:42, time_cost(all): 2 days, 1:38:54/14:29:14, loss=0.347155954321052, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=3.977182286868277, lr=0.07432476787186093
2023-12-07 12:34:51   INFO  epoch: 55/72, acc_iter=214610, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:10, time_cost(all): 2 days, 1:39:36/15:32:24, loss=0.347096756895111, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=4.585986727633174, lr=0.07421080079649311
2023-12-07 12:35:33   INFO  epoch: 55/72, acc_iter=214660, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:58, time_cost(all): 2 days, 1:40:18/14:46:29, loss=0.34703755946917, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=2.2869244375534823, lr=0.07409683372112535
2023-12-07 12:36:14   INFO  epoch: 55/72, acc_iter=214710, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:36, time_cost(all): 2 days, 1:40:59/14:58:28, loss=0.346978362043229, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=0.883865505449565, lr=0.07398286664575754
2023-12-07 12:36:56   INFO  epoch: 55/72, acc_iter=214760, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:16, time_cost(all): 2 days, 1:41:41/15:26:19, loss=0.346919164617288, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=2.0555919715290836, lr=0.07386889957038978
2023-12-07 12:37:38   INFO  epoch: 55/72, acc_iter=214810, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:06, time_cost(all): 2 days, 1:42:23/14:38:27, loss=0.346859967191347, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=4.224053906159504, lr=0.07375493249502196
2023-12-07 12:38:20   INFO  epoch: 55/72, acc_iter=214860, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:00, time_cost(all): 2 days, 1:43:05/14:34:05, loss=0.346800769765406, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=4.082300437863617, lr=0.07364096541965415
2023-12-07 12:39:02   INFO  epoch: 55/72, acc_iter=214910, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:20, time_cost(all): 2 days, 1:43:47/15:18:49, loss=0.346741572339466, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=2.8641755841375725, lr=0.07352699834428639
2023-12-07 12:39:43   INFO  epoch: 55/72, acc_iter=214960, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:00, time_cost(all): 2 days, 1:44:28/14:50:54, loss=0.346682374913525, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=3.7437024395907006, lr=0.07341303126891863
2023-12-07 12:40:25   INFO  epoch: 55/72, acc_iter=215010, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:10, time_cost(all): 2 days, 1:45:10/14:45:35, loss=0.346623177487584, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=1.445087405934753, lr=0.07329906419355076
2023-12-07 12:41:07   INFO  epoch: 55/72, acc_iter=215060, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:35, time_cost(all): 2 days, 1:45:52/15:10:09, loss=0.346563980061643, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=1.679441718935268, lr=0.073185097118183
2023-12-07 12:41:49   INFO  epoch: 55/72, acc_iter=215110, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:23, time_cost(all): 2 days, 1:46:34/14:58:10, loss=0.346504782635702, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=1.2042751669626632, lr=0.07307113004281524
2023-12-07 12:42:30   INFO  epoch: 55/72, acc_iter=215160, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:13, time_cost(all): 2 days, 1:47:15/15:15:52, loss=0.346445585209761, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=1.5448114540780118, lr=0.07295716296744742
2023-12-07 12:43:12   INFO  epoch: 55/72, acc_iter=215210, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:12, time_cost(all): 2 days, 1:47:57/15:20:28, loss=0.34638638778382, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=4.64661396578248, lr=0.0728431958920796
2023-12-07 12:43:54   INFO  epoch: 55/72, acc_iter=215260, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:40, time_cost(all): 2 days, 1:48:39/14:19:21, loss=0.346327190357879, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=4.8835429943817985, lr=0.07272922881671179
2023-12-07 12:44:36   INFO  epoch: 55/72, acc_iter=215310, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:43, time_cost(all): 2 days, 1:49:21/14:07:31, loss=0.346267992931938, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=3.248306370545065, lr=0.07261526174134403
2023-12-07 12:45:18   INFO  epoch: 55/72, acc_iter=215360, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:42, time_cost(all): 2 days, 1:50:03/14:36:29, loss=0.346208795505997, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=0.6726488801984855, lr=0.07250129466597621
2023-12-07 12:45:59   INFO  epoch: 55/72, acc_iter=215410, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:16, time_cost(all): 2 days, 1:50:44/14:23:01, loss=0.346149598080056, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=2.3359627814335373, lr=0.0723873275906084
2023-12-07 12:46:41   INFO  epoch: 55/72, acc_iter=215460, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:53, time_cost(all): 2 days, 1:51:26/14:01:21, loss=0.346090400654115, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=1.123383341002525, lr=0.07227336051524064
2023-12-07 12:47:23   INFO  epoch: 55/72, acc_iter=215510, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:57, time_cost(all): 2 days, 1:52:08/14:39:18, loss=0.346031203228174, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=2.5034721534423907, lr=0.07215939343987288
2023-12-07 12:48:05   INFO  epoch: 55/72, acc_iter=215560, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:34, time_cost(all): 2 days, 1:52:50/15:15:28, loss=0.345972005802233, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=2.773943572224865, lr=0.072045426364505
2023-12-07 12:48:47   INFO  epoch: 55/72, acc_iter=215610, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:54, time_cost(all): 2 days, 1:53:32/15:17:37, loss=0.345912808376292, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=3.7797432975707945, lr=0.07193145928913725
2023-12-07 12:49:28   INFO  epoch: 55/72, acc_iter=215660, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:27, time_cost(all): 2 days, 1:54:13/13:58:59, loss=0.345853610950351, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=0.7356581977311709, lr=0.07181749221376948
2023-12-07 12:50:10   INFO  epoch: 55/72, acc_iter=215710, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:54, time_cost(all): 2 days, 1:54:55/15:15:28, loss=0.34579441352441, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=4.297378045881664, lr=0.07170352513840167
2023-12-07 12:50:52   INFO  epoch: 55/72, acc_iter=215760, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:19, time_cost(all): 2 days, 1:55:37/14:29:08, loss=0.345735216098469, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=3.8823607260378847, lr=0.07158955806303385
2023-12-07 12:51:34   INFO  epoch: 55/72, acc_iter=215810, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:12, time_cost(all): 2 days, 1:56:19/14:37:24, loss=0.345676018672529, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=0.876833883395862, lr=0.0714755909876661
2023-12-07 12:52:15   INFO  epoch: 55/72, acc_iter=215860, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:29, time_cost(all): 2 days, 1:57:00/15:18:36, loss=0.345616821246588, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=3.417684267284955, lr=0.07136162391229828
2023-12-07 12:52:57   INFO  epoch: 55/72, acc_iter=215910, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:49, time_cost(all): 2 days, 1:57:42/15:00:47, loss=0.345557623820647, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=4.363789535795174, lr=0.07124765683693052
2023-12-07 12:53:39   INFO  epoch: 55/72, acc_iter=215960, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:13, time_cost(all): 2 days, 1:58:24/14:59:17, loss=0.345498426394706, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=2.066883664052311, lr=0.07113368976156265
2023-12-07 12:54:21   INFO  epoch: 55/72, acc_iter=216010, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:43, time_cost(all): 2 days, 1:59:06/14:49:01, loss=0.345439228968765, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=4.376402222391592, lr=0.07101972268619489
2023-12-07 12:55:03   INFO  epoch: 55/72, acc_iter=216060, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:56, time_cost(all): 2 days, 1:59:48/14:27:08, loss=0.345380031542824, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=4.978879587659699, lr=0.07090575561082713
2023-12-07 12:55:44   INFO  epoch: 55/72, acc_iter=216110, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:09, time_cost(all): 2 days, 2:00:29/14:15:25, loss=0.345320834116883, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.23(1.03), norm=4.579714032041791, lr=0.07079178853545931
2023-12-07 12:56:26   INFO  epoch: 55/72, acc_iter=216160, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:31, time_cost(all): 2 days, 2:01:11/13:52:56, loss=0.345261636690942, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=0.9565273958949811, lr=0.0706778214600915
2023-12-07 12:57:08   INFO  epoch: 55/72, acc_iter=216210, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 2 days, 2:01:53/14:21:37, loss=0.345202439265001, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=4.813520888177809, lr=0.07056385438472373
2023-12-07 12:57:50   INFO  epoch: 55/72, acc_iter=216260, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 2:02:35/13:55:08, loss=0.34514324183906, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.2986155827203754, lr=0.07044988730935592
2023-12-07 12:58:31   INFO  epoch: 56/72, acc_iter=216322, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:35, time_cost(all): 2 days, 2:03:16/14:41:25, loss=0.345069837030893, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=1.1097982539091027, lr=0.07030856813589986
2023-12-07 12:59:13   INFO  epoch: 56/72, acc_iter=216372, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:52, time_cost(all): 2 days, 2:03:58/14:55:38, loss=0.345010639604952, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=2.7974513191034327, lr=0.07019460106053205
2023-12-07 12:59:55   INFO  epoch: 56/72, acc_iter=216422, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:32, time_cost(all): 2 days, 2:04:40/13:55:24, loss=0.344951442179011, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=3.0653854851349713, lr=0.07008063398516429
2023-12-07 13:00:37   INFO  epoch: 56/72, acc_iter=216472, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:25, time_cost(all): 2 days, 2:05:22/14:44:30, loss=0.34489224475307, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=3.3011335706893012, lr=0.06996666690979647
2023-12-07 13:01:19   INFO  epoch: 56/72, acc_iter=216522, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:48, time_cost(all): 2 days, 2:06:04/14:11:14, loss=0.34483304732713, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=0.7689666366319969, lr=0.06985269983442866
2023-12-07 13:02:00   INFO  epoch: 56/72, acc_iter=216572, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:21, time_cost(all): 2 days, 2:06:45/14:24:34, loss=0.344773849901189, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=4.589098402083249, lr=0.0697387327590609
2023-12-07 13:02:42   INFO  epoch: 56/72, acc_iter=216622, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:51:06, time_cost(all): 2 days, 2:07:27/14:08:37, loss=0.344714652475248, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=2.5764118070515316, lr=0.06962476568369308
2023-12-07 13:03:24   INFO  epoch: 56/72, acc_iter=216672, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:20, time_cost(all): 2 days, 2:08:09/14:46:57, loss=0.344655455049307, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=1.6692335056957621, lr=0.06951079860832526
2023-12-07 13:04:06   INFO  epoch: 56/72, acc_iter=216722, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:10, time_cost(all): 2 days, 2:08:51/14:37:46, loss=0.344596257623366, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=2.487028424443908, lr=0.0693968315329575
2023-12-07 13:04:47   INFO  epoch: 56/72, acc_iter=216772, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:10, time_cost(all): 2 days, 2:09:32/14:14:16, loss=0.344537060197425, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=2.638771671406399, lr=0.06928286445758974
2023-12-07 13:05:29   INFO  epoch: 56/72, acc_iter=216822, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:43, time_cost(all): 2 days, 2:10:14/14:03:45, loss=0.344477862771484, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=2.7119575478065823, lr=0.06916889738222187
2023-12-07 13:06:11   INFO  epoch: 56/72, acc_iter=216872, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:20, time_cost(all): 2 days, 2:10:56/14:52:48, loss=0.344418665345543, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=2.2565411699770133, lr=0.06905493030685411
2023-12-07 13:06:53   INFO  epoch: 56/72, acc_iter=216922, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:31, time_cost(all): 2 days, 2:11:38/14:48:26, loss=0.344359467919602, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=2.802775744834333, lr=0.06894096323148635
2023-12-07 13:07:35   INFO  epoch: 56/72, acc_iter=216972, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:38, time_cost(all): 2 days, 2:12:20/14:47:34, loss=0.344300270493661, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=4.869938796405002, lr=0.06882699615611854
2023-12-07 13:08:16   INFO  epoch: 56/72, acc_iter=217022, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:19, time_cost(all): 2 days, 2:13:01/14:55:52, loss=0.34424107306772, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=2.836329323180407, lr=0.06871302908075072
2023-12-07 13:08:58   INFO  epoch: 56/72, acc_iter=217072, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:51, time_cost(all): 2 days, 2:13:43/14:12:59, loss=0.344181875641779, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=1.676191064869138, lr=0.06859906200538296
2023-12-07 13:09:40   INFO  epoch: 56/72, acc_iter=217122, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:10, time_cost(all): 2 days, 2:14:25/14:13:49, loss=0.344122678215838, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=4.403379575311406, lr=0.06848509493001514
2023-12-07 13:10:22   INFO  epoch: 56/72, acc_iter=217172, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:31, time_cost(all): 2 days, 2:15:07/13:46:48, loss=0.344063480789897, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=4.996097075041634, lr=0.06837112785464738
2023-12-07 13:11:03   INFO  epoch: 56/72, acc_iter=217222, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:57, time_cost(all): 2 days, 2:15:48/14:01:29, loss=0.344004283363956, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.83(1.03), norm=1.1574019070883834, lr=0.06825716077927951
2023-12-07 13:11:45   INFO  epoch: 56/72, acc_iter=217272, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:00, time_cost(all): 2 days, 2:16:30/14:30:46, loss=0.343945085938015, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=3.9633760655318495, lr=0.06814319370391175
2023-12-07 13:12:27   INFO  epoch: 56/72, acc_iter=217322, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:02, time_cost(all): 2 days, 2:17:12/14:12:12, loss=0.343885888512074, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=1.8384864331812154, lr=0.06802922662854399
2023-12-07 13:13:09   INFO  epoch: 56/72, acc_iter=217372, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:20, time_cost(all): 2 days, 2:17:54/14:30:04, loss=0.343826691086134, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=4.37895548302269, lr=0.06791525955317618
2023-12-07 13:13:51   INFO  epoch: 56/72, acc_iter=217422, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:17, time_cost(all): 2 days, 2:18:36/14:28:34, loss=0.343767493660193, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=1.189444056274953, lr=0.06780129247780836
2023-12-07 13:14:32   INFO  epoch: 56/72, acc_iter=217472, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:33, time_cost(all): 2 days, 2:19:17/14:15:10, loss=0.343708296234252, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=2.190058232431176, lr=0.0676873254024406
2023-12-07 13:15:14   INFO  epoch: 56/72, acc_iter=217522, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:44, time_cost(all): 2 days, 2:19:59/14:26:35, loss=0.343649098808311, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=4.4671815794352545, lr=0.06757335832707279
2023-12-07 13:15:56   INFO  epoch: 56/72, acc_iter=217572, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:49, time_cost(all): 2 days, 2:20:41/14:51:44, loss=0.34358990138237, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=1.75384642976256, lr=0.06745939125170497
2023-12-07 13:16:38   INFO  epoch: 56/72, acc_iter=217622, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:40, time_cost(all): 2 days, 2:21:23/14:02:10, loss=0.343530703956429, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=2.4077957436850963, lr=0.06734542417633721
2023-12-07 13:17:19   INFO  epoch: 56/72, acc_iter=217672, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:25, time_cost(all): 2 days, 2:22:04/14:38:31, loss=0.343471506530488, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=0.87717107809083, lr=0.0672314571009694
2023-12-07 13:18:01   INFO  epoch: 56/72, acc_iter=217722, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:45, time_cost(all): 2 days, 2:22:46/14:04:59, loss=0.343412309104547, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=4.090172232845507, lr=0.06711749002560163
2023-12-07 13:18:43   INFO  epoch: 56/72, acc_iter=217772, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:47, time_cost(all): 2 days, 2:23:28/13:48:33, loss=0.343353111678606, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=2.702399612469394, lr=0.06700352295023382
2023-12-07 13:19:25   INFO  epoch: 56/72, acc_iter=217822, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:51, time_cost(all): 2 days, 2:24:10/13:35:17, loss=0.343293914252665, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=0.5867495279355892, lr=0.066889555874866
2023-12-07 13:20:07   INFO  epoch: 56/72, acc_iter=217872, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:05, time_cost(all): 2 days, 2:24:52/14:46:27, loss=0.343234716826724, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=1.4232507029136625, lr=0.06677558879949824
2023-12-07 13:20:48   INFO  epoch: 56/72, acc_iter=217922, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:01, time_cost(all): 2 days, 2:25:33/13:53:14, loss=0.343175519400783, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=2.555001908432514, lr=0.06666162172413048
2023-12-07 13:21:30   INFO  epoch: 56/72, acc_iter=217972, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:41, time_cost(all): 2 days, 2:26:15/14:17:08, loss=0.343116321974842, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=1.9125114172850175, lr=0.06654765464876261
2023-12-07 13:22:12   INFO  epoch: 56/72, acc_iter=218022, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:38, time_cost(all): 2 days, 2:26:57/14:42:47, loss=0.343057124548901, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=2.374347395670202, lr=0.06643368757339485
2023-12-07 13:22:54   INFO  epoch: 56/72, acc_iter=218072, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:49, time_cost(all): 2 days, 2:27:39/13:35:28, loss=0.34299792712296, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=1.7413722786019241, lr=0.06631972049802703
2023-12-07 13:23:36   INFO  epoch: 56/72, acc_iter=218122, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:59, time_cost(all): 2 days, 2:28:21/14:18:41, loss=0.342938729697019, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=1.4748251934390257, lr=0.06620575342265927
2023-12-07 13:24:17   INFO  epoch: 56/72, acc_iter=218172, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:07, time_cost(all): 2 days, 2:29:02/14:03:12, loss=0.342879532271078, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=4.982926875520804, lr=0.06609178634729146
2023-12-07 13:24:59   INFO  epoch: 56/72, acc_iter=218222, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:10, time_cost(all): 2 days, 2:29:44/14:40:44, loss=0.342820334845138, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=1.235727432798864, lr=0.06597781927192364
2023-12-07 13:25:41   INFO  epoch: 56/72, acc_iter=218272, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:27:08, time_cost(all): 2 days, 2:30:26/13:47:06, loss=0.342761137419197, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=2.2603963316058673, lr=0.06586385219655588
2023-12-07 13:26:23   INFO  epoch: 56/72, acc_iter=218322, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:04, time_cost(all): 2 days, 2:31:08/13:59:37, loss=0.342701939993256, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=2.070178521039193, lr=0.06574988512118807
2023-12-07 13:27:04   INFO  epoch: 56/72, acc_iter=218372, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:16, time_cost(all): 2 days, 2:31:49/13:50:21, loss=0.342642742567315, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=2.2543513234842316, lr=0.06563591804582025
2023-12-07 13:27:46   INFO  epoch: 56/72, acc_iter=218422, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:20, time_cost(all): 2 days, 2:32:31/13:48:02, loss=0.342583545141374, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=2.098935031136583, lr=0.06552195097045249
2023-12-07 13:28:28   INFO  epoch: 56/72, acc_iter=218472, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:31, time_cost(all): 2 days, 2:33:13/13:59:28, loss=0.342524347715433, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=4.299080369517788, lr=0.06540798389508473
2023-12-07 13:29:10   INFO  epoch: 56/72, acc_iter=218522, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:43, time_cost(all): 2 days, 2:33:55/14:35:10, loss=0.342465150289492, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=2.6401724579029717, lr=0.06529401681971686
2023-12-07 13:29:52   INFO  epoch: 56/72, acc_iter=218572, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:00, time_cost(all): 2 days, 2:34:37/13:48:19, loss=0.342405952863551, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=4.906282410539086, lr=0.0651800497443491
2023-12-07 13:30:33   INFO  epoch: 56/72, acc_iter=218622, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:09, time_cost(all): 2 days, 2:35:18/13:55:20, loss=0.34234675543761, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=1.6829523975710368, lr=0.06506608266898134
2023-12-07 13:31:15   INFO  epoch: 56/72, acc_iter=218672, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:33, time_cost(all): 2 days, 2:36:00/14:19:51, loss=0.342287558011669, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=2.7145727004856672, lr=0.06495211559361352
2023-12-07 13:31:57   INFO  epoch: 56/72, acc_iter=218722, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:17, time_cost(all): 2 days, 2:36:42/13:52:57, loss=0.342228360585728, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=3.261603552035393, lr=0.06483814851824571
2023-12-07 13:32:39   INFO  epoch: 56/72, acc_iter=218772, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:05, time_cost(all): 2 days, 2:37:24/13:47:11, loss=0.342169163159787, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=1.7202096949115742, lr=0.06472418144287789
2023-12-07 13:33:20   INFO  epoch: 56/72, acc_iter=218822, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:47, time_cost(all): 2 days, 2:38:05/14:09:58, loss=0.342109965733846, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=4.174841101312833, lr=0.06461021436751013
2023-12-07 13:34:02   INFO  epoch: 56/72, acc_iter=218872, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:43, time_cost(all): 2 days, 2:38:47/13:40:47, loss=0.342050768307905, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=4.124883364127356, lr=0.06449624729214237
2023-12-07 13:34:44   INFO  epoch: 56/72, acc_iter=218922, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:46, time_cost(all): 2 days, 2:39:29/13:42:47, loss=0.341991570881964, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=2.9372925940630483, lr=0.0643822802167745
2023-12-07 13:35:26   INFO  epoch: 56/72, acc_iter=218972, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:19, time_cost(all): 2 days, 2:40:11/14:15:28, loss=0.341932373456023, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=0.6432614771813021, lr=0.06426831314140674
2023-12-07 13:36:08   INFO  epoch: 56/72, acc_iter=219022, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:45, time_cost(all): 2 days, 2:40:53/13:15:17, loss=0.341873176030082, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.4715931647294351, lr=0.06415434606603898
2023-12-07 13:36:49   INFO  epoch: 56/72, acc_iter=219072, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:27, time_cost(all): 2 days, 2:41:34/14:08:02, loss=0.341813978604142, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=1.7784526756697059, lr=0.06404037899067117
2023-12-07 13:37:31   INFO  epoch: 56/72, acc_iter=219122, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:26, time_cost(all): 2 days, 2:42:16/14:11:33, loss=0.341754781178201, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.579205970899666, lr=0.06392641191530335
2023-12-07 13:38:13   INFO  epoch: 56/72, acc_iter=219172, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:29, time_cost(all): 2 days, 2:42:58/14:19:27, loss=0.34169558375226, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.543974550568394, lr=0.06381244483993559
2023-12-07 13:38:55   INFO  epoch: 56/72, acc_iter=219222, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:08, time_cost(all): 2 days, 2:43:40/14:06:18, loss=0.341636386326319, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.693847954422361, lr=0.06369847776456777
2023-12-07 13:39:36   INFO  epoch: 56/72, acc_iter=219272, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:18, time_cost(all): 2 days, 2:44:21/13:29:09, loss=0.341577188900378, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=2.7377789412665163, lr=0.06358451068919996
2023-12-07 13:40:18   INFO  epoch: 56/72, acc_iter=219322, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:03, time_cost(all): 2 days, 2:45:03/13:31:44, loss=0.341517991474437, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=0.5649374782028127, lr=0.0634705436138322
2023-12-07 13:41:00   INFO  epoch: 56/72, acc_iter=219372, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:37, time_cost(all): 2 days, 2:45:45/13:38:33, loss=0.341458794048496, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=0.6019166996960525, lr=0.06335657653846438
2023-12-07 13:41:42   INFO  epoch: 56/72, acc_iter=219422, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:46, time_cost(all): 2 days, 2:46:27/13:08:13, loss=0.341399596622555, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=1.5695476344252268, lr=0.06324260946309662
2023-12-07 13:42:24   INFO  epoch: 56/72, acc_iter=219472, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:04, time_cost(all): 2 days, 2:47:09/13:36:36, loss=0.341340399196614, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=4.850321718874584, lr=0.0631286423877288
2023-12-07 13:43:05   INFO  epoch: 56/72, acc_iter=219522, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:32, time_cost(all): 2 days, 2:47:50/14:23:51, loss=0.341281201770673, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=0.762546877671041, lr=0.06301467531236099
2023-12-07 13:43:47   INFO  epoch: 56/72, acc_iter=219572, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:59, time_cost(all): 2 days, 2:48:32/14:13:51, loss=0.341222004344732, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=1.5043302439037396, lr=0.06290070823699323
2023-12-07 13:44:29   INFO  epoch: 56/72, acc_iter=219622, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:06, time_cost(all): 2 days, 2:49:14/13:27:58, loss=0.341162806918791, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=4.444433245915009, lr=0.06278674116162541
2023-12-07 13:45:11   INFO  epoch: 56/72, acc_iter=219672, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:43, time_cost(all): 2 days, 2:49:56/13:10:59, loss=0.34110360949285, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=4.323342039220579, lr=0.0626727740862576
2023-12-07 13:45:52   INFO  epoch: 56/72, acc_iter=219722, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:34, time_cost(all): 2 days, 2:50:37/13:26:02, loss=0.341044412066909, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=3.18790749850171, lr=0.06255880701088984
2023-12-07 13:46:34   INFO  epoch: 56/72, acc_iter=219772, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:08, time_cost(all): 2 days, 2:51:19/13:47:37, loss=0.340985214640968, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=1.2910913353123323, lr=0.06244483993552202
2023-12-07 13:47:16   INFO  epoch: 56/72, acc_iter=219822, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:20, time_cost(all): 2 days, 2:52:01/13:38:28, loss=0.340926017215027, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=0.562126955022692, lr=0.06233087286015426
2023-12-07 13:47:58   INFO  epoch: 56/72, acc_iter=219872, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:37, time_cost(all): 2 days, 2:52:43/14:14:30, loss=0.340866819789086, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=3.8719900180596194, lr=0.06221690578478645
2023-12-07 13:48:40   INFO  epoch: 56/72, acc_iter=219922, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:00, time_cost(all): 2 days, 2:53:25/13:16:06, loss=0.340807622363146, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=4.447383254927069, lr=0.06210293870941863
2023-12-07 13:49:21   INFO  epoch: 56/72, acc_iter=219972, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:18, time_cost(all): 2 days, 2:54:06/13:59:35, loss=0.340748424937205, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=1.8247506314883337, lr=0.06198897163405087
2023-12-07 13:50:03   INFO  epoch: 56/72, acc_iter=220022, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 2 days, 2:54:48/13:14:48, loss=0.340689227511264, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=1.6425270198348099, lr=0.061875004558683055
2023-12-07 13:50:45   INFO  epoch: 56/72, acc_iter=220072, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 2 days, 2:55:30/14:05:56, loss=0.340630030085323, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.9699221343536166, lr=0.06176103748331524
2023-12-07 13:51:27   INFO  epoch: 56/72, acc_iter=220122, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 2:56:12/14:12:22, loss=0.340570832659382, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.1293980661697571, lr=0.06164707040794748
2023-12-07 13:52:08   INFO  epoch: 57/72, acc_iter=220184, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:50:53, time_cost(all): 2 days, 2:56:53/13:58:35, loss=0.340497427851215, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=3.3766179837403345, lr=0.06150575123449137
2023-12-07 13:52:50   INFO  epoch: 57/72, acc_iter=220234, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:05, time_cost(all): 2 days, 2:57:35/13:39:11, loss=0.340438230425274, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=2.9550521323529297, lr=0.06139178415912361
2023-12-07 13:53:32   INFO  epoch: 57/72, acc_iter=220284, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:50:57, time_cost(all): 2 days, 2:58:17/13:37:53, loss=0.340379032999333, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=3.1733381448289375, lr=0.06127781708375585
2023-12-07 13:54:14   INFO  epoch: 57/72, acc_iter=220334, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:29, time_cost(all): 2 days, 2:58:59/13:32:18, loss=0.340319835573392, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=2.5389502481694173, lr=0.06116385000838803
2023-12-07 13:54:56   INFO  epoch: 57/72, acc_iter=220384, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:59, time_cost(all): 2 days, 2:59:41/13:02:41, loss=0.340260638147451, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=1.7185736785840757, lr=0.061049882933020216
2023-12-07 13:55:37   INFO  epoch: 57/72, acc_iter=220434, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:52, time_cost(all): 2 days, 3:00:22/14:05:34, loss=0.34020144072151, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=0.7658866726845761, lr=0.060935915857652456
2023-12-07 13:56:19   INFO  epoch: 57/72, acc_iter=220484, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:49, time_cost(all): 2 days, 3:01:04/13:06:36, loss=0.340142243295569, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=4.189756948956314, lr=0.06082194878228464
2023-12-07 13:57:01   INFO  epoch: 57/72, acc_iter=220534, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:42, time_cost(all): 2 days, 3:01:46/13:53:39, loss=0.340083045869628, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=3.993266048894895, lr=0.060707981706916825
2023-12-07 13:57:43   INFO  epoch: 57/72, acc_iter=220584, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:58, time_cost(all): 2 days, 3:02:28/13:38:08, loss=0.340023848443687, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=1.8253044554363316, lr=0.060594014631549065
2023-12-07 13:58:25   INFO  epoch: 57/72, acc_iter=220634, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:18, time_cost(all): 2 days, 3:03:10/13:19:46, loss=0.339964651017747, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=2.4800174298851414, lr=0.06048004755618125
2023-12-07 13:59:06   INFO  epoch: 57/72, acc_iter=220684, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:16, time_cost(all): 2 days, 3:03:51/13:43:16, loss=0.339905453591806, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=3.220427291138805, lr=0.06036608048081349
2023-12-07 13:59:48   INFO  epoch: 57/72, acc_iter=220734, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:49, time_cost(all): 2 days, 3:04:33/13:27:03, loss=0.339846256165865, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=1.837181311188833, lr=0.06025211340544567
2023-12-07 14:00:30   INFO  epoch: 57/72, acc_iter=220784, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:24, time_cost(all): 2 days, 3:05:15/13:54:03, loss=0.339787058739924, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=4.95712659055595, lr=0.06013814633007786
2023-12-07 14:01:12   INFO  epoch: 57/72, acc_iter=220834, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:59, time_cost(all): 2 days, 3:05:57/12:46:39, loss=0.339727861313983, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=4.816325496563967, lr=0.0600241792547101
2023-12-07 14:01:53   INFO  epoch: 57/72, acc_iter=220884, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:59, time_cost(all): 2 days, 3:06:38/13:28:38, loss=0.339668663888042, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=4.17890875118753, lr=0.05991021217934228
2023-12-07 14:02:35   INFO  epoch: 57/72, acc_iter=220934, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:36, time_cost(all): 2 days, 3:07:20/13:57:21, loss=0.339609466462101, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=4.3736542804463046, lr=0.059796245103974466
2023-12-07 14:03:17   INFO  epoch: 57/72, acc_iter=220984, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:03, time_cost(all): 2 days, 3:08:02/13:01:20, loss=0.33955026903616, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=4.302970444050527, lr=0.059682278028606706
2023-12-07 14:03:59   INFO  epoch: 57/72, acc_iter=221034, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:43:00, time_cost(all): 2 days, 3:08:44/12:56:16, loss=0.339491071610219, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=1.5002731274826706, lr=0.05956831095323889
2023-12-07 14:04:41   INFO  epoch: 57/72, acc_iter=221084, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:54, time_cost(all): 2 days, 3:09:26/14:01:44, loss=0.339431874184278, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=3.453188583284058, lr=0.05945434387787113
2023-12-07 14:05:22   INFO  epoch: 57/72, acc_iter=221134, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:39, time_cost(all): 2 days, 3:10:07/13:42:17, loss=0.339372676758337, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=0.6602198162125676, lr=0.059340376802503314
2023-12-07 14:06:04   INFO  epoch: 57/72, acc_iter=221184, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:29, time_cost(all): 2 days, 3:10:49/13:16:31, loss=0.339313479332396, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=3.1762659157465025, lr=0.0592264097271355
2023-12-07 14:06:46   INFO  epoch: 57/72, acc_iter=221234, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:13, time_cost(all): 2 days, 3:11:31/13:36:28, loss=0.339254281906455, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=3.549769817173739, lr=0.05911244265176774
2023-12-07 14:07:28   INFO  epoch: 57/72, acc_iter=221284, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:48, time_cost(all): 2 days, 3:12:13/13:16:01, loss=0.339195084480514, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=2.912312009272756, lr=0.05899847557639992
2023-12-07 14:08:09   INFO  epoch: 57/72, acc_iter=221334, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:39, time_cost(all): 2 days, 3:12:54/13:45:38, loss=0.339135887054573, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=4.75965513085905, lr=0.05888450850103211
2023-12-07 14:08:51   INFO  epoch: 57/72, acc_iter=221384, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:59, time_cost(all): 2 days, 3:13:36/13:26:44, loss=0.339076689628632, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=0.5099987605232483, lr=0.05877054142566435
2023-12-07 14:09:33   INFO  epoch: 57/72, acc_iter=221434, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:10, time_cost(all): 2 days, 3:14:18/13:41:42, loss=0.339017492202691, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.23(1.03), norm=3.551712596042771, lr=0.058656574350296586
2023-12-07 14:10:15   INFO  epoch: 57/72, acc_iter=221484, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:36:06, time_cost(all): 2 days, 3:15:00/13:53:13, loss=0.338958294776751, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=4.219911602904229, lr=0.058542607274928715
2023-12-07 14:10:57   INFO  epoch: 57/72, acc_iter=221534, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:15, time_cost(all): 2 days, 3:15:42/12:43:27, loss=0.33889909735081, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=2.2269773778391935, lr=0.058428640199560955
2023-12-07 14:11:38   INFO  epoch: 57/72, acc_iter=221584, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:21, time_cost(all): 2 days, 3:16:23/13:26:41, loss=0.338839899924869, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=3.654030894968538, lr=0.058314673124193195
2023-12-07 14:12:20   INFO  epoch: 57/72, acc_iter=221634, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:22, time_cost(all): 2 days, 3:17:05/13:09:50, loss=0.338780702498928, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.11(1.03), norm=1.4640277314752466, lr=0.05820070604882538
2023-12-07 14:13:02   INFO  epoch: 57/72, acc_iter=221684, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:42, time_cost(all): 2 days, 3:17:47/12:54:27, loss=0.338721505072987, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=2.354929427555441, lr=0.05808673897345756
2023-12-07 14:13:44   INFO  epoch: 57/72, acc_iter=221734, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:54, time_cost(all): 2 days, 3:18:29/12:53:39, loss=0.338662307647046, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=1.0304477901764117, lr=0.05797277189808975
2023-12-07 14:14:25   INFO  epoch: 57/72, acc_iter=221784, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:59, time_cost(all): 2 days, 3:19:10/13:05:14, loss=0.338603110221105, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=1.5000681980707506, lr=0.05785880482272199
2023-12-07 14:15:07   INFO  epoch: 57/72, acc_iter=221834, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:18, time_cost(all): 2 days, 3:19:52/13:47:12, loss=0.338543912795164, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=1.9893304960554914, lr=0.05774483774735423
2023-12-07 14:15:49   INFO  epoch: 57/72, acc_iter=221884, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:17, time_cost(all): 2 days, 3:20:34/13:39:16, loss=0.338484715369223, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=4.957102815816511, lr=0.057630870671986356
2023-12-07 14:16:31   INFO  epoch: 57/72, acc_iter=221934, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:41, time_cost(all): 2 days, 3:21:16/13:19:43, loss=0.338425517943282, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=0.6678481102115403, lr=0.057516903596618596
2023-12-07 14:17:13   INFO  epoch: 57/72, acc_iter=221984, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:26:59, time_cost(all): 2 days, 3:21:58/12:59:37, loss=0.338366320517341, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=1.5182773301981711, lr=0.057402936521250836
2023-12-07 14:17:54   INFO  epoch: 57/72, acc_iter=222034, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:40, time_cost(all): 2 days, 3:22:39/13:06:58, loss=0.3383071230914, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=1.9315992644842894, lr=0.05728896944588302
2023-12-07 14:18:36   INFO  epoch: 57/72, acc_iter=222084, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:55, time_cost(all): 2 days, 3:23:21/13:01:28, loss=0.338247925665459, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=3.5506604095601704, lr=0.057175002370515204
2023-12-07 14:19:18   INFO  epoch: 57/72, acc_iter=222134, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:10, time_cost(all): 2 days, 3:24:03/13:38:11, loss=0.338188728239518, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=4.074719128536171, lr=0.057061035295147444
2023-12-07 14:20:00   INFO  epoch: 57/72, acc_iter=222184, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:23, time_cost(all): 2 days, 3:24:45/13:43:35, loss=0.338129530813577, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=0.9988798544477242, lr=0.05694706821977963
2023-12-07 14:20:41   INFO  epoch: 57/72, acc_iter=222234, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:06, time_cost(all): 2 days, 3:25:26/12:32:06, loss=0.338070333387636, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.3335299597906167, lr=0.05683310114441181
2023-12-07 14:21:23   INFO  epoch: 57/72, acc_iter=222284, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:37, time_cost(all): 2 days, 3:26:08/12:42:52, loss=0.338011135961695, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=0.6950292059562884, lr=0.05671913406904405
2023-12-07 14:22:05   INFO  epoch: 57/72, acc_iter=222334, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:04, time_cost(all): 2 days, 3:26:50/13:03:23, loss=0.337951938535755, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=0.9846992903715549, lr=0.05660516699367624
2023-12-07 14:22:47   INFO  epoch: 57/72, acc_iter=222384, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:30, time_cost(all): 2 days, 3:27:32/13:31:33, loss=0.337892741109814, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=3.89626957968698, lr=0.05649119991830848
2023-12-07 14:23:29   INFO  epoch: 57/72, acc_iter=222434, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:45, time_cost(all): 2 days, 3:28:14/13:00:28, loss=0.337833543683873, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=1.7388006144019619, lr=0.056377232842940606
2023-12-07 14:24:10   INFO  epoch: 57/72, acc_iter=222484, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:56, time_cost(all): 2 days, 3:28:55/13:24:11, loss=0.337774346257932, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=2.0476400857121178, lr=0.056263265767572845
2023-12-07 14:24:52   INFO  epoch: 57/72, acc_iter=222534, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:01, time_cost(all): 2 days, 3:29:37/13:19:47, loss=0.337715148831991, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=4.707510478502543, lr=0.056149298692205085
2023-12-07 14:25:34   INFO  epoch: 57/72, acc_iter=222584, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:14, time_cost(all): 2 days, 3:30:19/12:45:22, loss=0.33765595140605, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=0.7985286835601552, lr=0.05603533161683727
2023-12-07 14:26:16   INFO  epoch: 57/72, acc_iter=222634, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:43, time_cost(all): 2 days, 3:31:01/12:36:19, loss=0.337596753980109, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=4.5419703804569895, lr=0.055921364541469454
2023-12-07 14:26:57   INFO  epoch: 57/72, acc_iter=222684, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:19:09, time_cost(all): 2 days, 3:31:42/13:34:57, loss=0.337537556554168, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=0.6087807751351437, lr=0.055807397466101694
2023-12-07 14:27:39   INFO  epoch: 57/72, acc_iter=222734, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:16, time_cost(all): 2 days, 3:32:24/13:23:02, loss=0.337478359128227, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=2.8268255375208025, lr=0.05569343039073388
2023-12-07 14:28:21   INFO  epoch: 57/72, acc_iter=222784, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:27, time_cost(all): 2 days, 3:33:06/13:33:12, loss=0.337419161702286, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=2.933588777132584, lr=0.05557946331536612
2023-12-07 14:29:03   INFO  epoch: 57/72, acc_iter=222834, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:57, time_cost(all): 2 days, 3:33:48/13:25:29, loss=0.337359964276345, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.92(1.03), norm=2.843806480061905, lr=0.0554654962399983
2023-12-07 14:29:45   INFO  epoch: 57/72, acc_iter=222884, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:38, time_cost(all): 2 days, 3:34:30/13:25:09, loss=0.337300766850404, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=3.519747655859106, lr=0.055351529164630486
2023-12-07 14:30:26   INFO  epoch: 57/72, acc_iter=222934, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:34, time_cost(all): 2 days, 3:35:11/13:12:53, loss=0.337241569424463, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=3.6109623545068112, lr=0.055237562089262726
2023-12-07 14:31:08   INFO  epoch: 57/72, acc_iter=222984, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:25, time_cost(all): 2 days, 3:35:53/13:33:59, loss=0.337182371998522, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=4.988743064949128, lr=0.05512359501389491
2023-12-07 14:31:50   INFO  epoch: 57/72, acc_iter=223034, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:27, time_cost(all): 2 days, 3:36:35/12:32:35, loss=0.337123174572581, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.12(1.03), norm=2.5325656521508737, lr=0.055009627938527095
2023-12-07 14:32:32   INFO  epoch: 57/72, acc_iter=223084, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:09, time_cost(all): 2 days, 3:37:17/12:51:04, loss=0.33706397714664, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=1.0008881141503871, lr=0.054895660863159335
2023-12-07 14:33:13   INFO  epoch: 57/72, acc_iter=223134, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:41, time_cost(all): 2 days, 3:37:58/12:17:06, loss=0.337004779720699, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=4.089948099264143, lr=0.054781693787791574
2023-12-07 14:33:55   INFO  epoch: 57/72, acc_iter=223184, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:48, time_cost(all): 2 days, 3:38:40/13:25:09, loss=0.336945582294759, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=2.2373003847067414, lr=0.0546677267124237
2023-12-07 14:34:37   INFO  epoch: 57/72, acc_iter=223234, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:09, time_cost(all): 2 days, 3:39:22/13:17:59, loss=0.336886384868818, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=4.746314156478013, lr=0.05455375963705594
2023-12-07 14:35:19   INFO  epoch: 57/72, acc_iter=223284, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:18, time_cost(all): 2 days, 3:40:04/12:48:33, loss=0.336827187442877, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=2.8234209950867006, lr=0.05443979256168813
2023-12-07 14:36:01   INFO  epoch: 57/72, acc_iter=223334, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:11, time_cost(all): 2 days, 3:40:46/12:57:39, loss=0.336767990016936, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=4.600397040322702, lr=0.05432582548632037
2023-12-07 14:36:42   INFO  epoch: 57/72, acc_iter=223384, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:48, time_cost(all): 2 days, 3:41:27/13:07:33, loss=0.336708792590995, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.83(1.03), norm=4.763172122384961, lr=0.05421185841095255
2023-12-07 14:37:24   INFO  epoch: 57/72, acc_iter=223434, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:43, time_cost(all): 2 days, 3:42:09/13:22:47, loss=0.336649595165054, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=1.3295831541927405, lr=0.054097891335584736
2023-12-07 14:38:06   INFO  epoch: 57/72, acc_iter=223484, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:11, time_cost(all): 2 days, 3:42:51/12:50:24, loss=0.336590397739113, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.2231910144206894, lr=0.053983924260216976
2023-12-07 14:38:48   INFO  epoch: 57/72, acc_iter=223534, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:40, time_cost(all): 2 days, 3:43:33/12:56:48, loss=0.336531200313172, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=2.284989330193114, lr=0.053869957184849215
2023-12-07 14:39:30   INFO  epoch: 57/72, acc_iter=223584, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:35, time_cost(all): 2 days, 3:44:15/13:04:25, loss=0.336472002887231, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=1.7659593213529985, lr=0.053755990109481344
2023-12-07 14:40:11   INFO  epoch: 57/72, acc_iter=223634, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:08, time_cost(all): 2 days, 3:44:56/12:35:06, loss=0.33641280546129, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=2.7874692047278335, lr=0.053642023034113584
2023-12-07 14:40:53   INFO  epoch: 57/72, acc_iter=223684, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:13, time_cost(all): 2 days, 3:45:38/12:45:16, loss=0.336353608035349, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=0.50419721343402, lr=0.053528055958745824
2023-12-07 14:41:35   INFO  epoch: 57/72, acc_iter=223734, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:28, time_cost(all): 2 days, 3:46:20/13:17:47, loss=0.336294410609408, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=1.2492809515492356, lr=0.05341408888337801
2023-12-07 14:42:17   INFO  epoch: 57/72, acc_iter=223784, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:50, time_cost(all): 2 days, 3:47:02/12:08:19, loss=0.336235213183467, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=2.308473192771928, lr=0.05330012180801019
2023-12-07 14:42:58   INFO  epoch: 57/72, acc_iter=223834, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:17, time_cost(all): 2 days, 3:47:43/12:37:56, loss=0.336176015757526, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=4.988151773030762, lr=0.05318615473264243
2023-12-07 14:43:40   INFO  epoch: 57/72, acc_iter=223884, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 2 days, 3:48:25/12:43:55, loss=0.336116818331585, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=0.7748534410946063, lr=0.053072187657274617
2023-12-07 14:44:22   INFO  epoch: 57/72, acc_iter=223934, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 2 days, 3:49:07/12:30:36, loss=0.336057620905644, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=4.515722361214389, lr=0.0529582205819068
2023-12-07 14:45:04   INFO  epoch: 57/72, acc_iter=223984, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 3:49:49/12:37:01, loss=0.335998423479703, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=3.1936548573797507, lr=0.05284425350653904
2023-12-07 14:45:46   INFO  epoch: 58/72, acc_iter=224046, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:09, time_cost(all): 2 days, 3:50:31/12:52:13, loss=0.335925018671537, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=3.0854454511177725, lr=0.052702934333082985
2023-12-07 14:46:27   INFO  epoch: 58/72, acc_iter=224096, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:11, time_cost(all): 2 days, 3:51:12/12:35:25, loss=0.335865821245596, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.596234314852446, lr=0.05258896725771517
2023-12-07 14:47:09   INFO  epoch: 58/72, acc_iter=224146, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:29, time_cost(all): 2 days, 3:51:54/12:25:21, loss=0.335806623819655, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=4.511019192126014, lr=0.05247500018234735
2023-12-07 14:47:51   INFO  epoch: 58/72, acc_iter=224196, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:33, time_cost(all): 2 days, 3:52:36/12:07:19, loss=0.335747426393714, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=3.124698078665167, lr=0.05236103310697959
2023-12-07 14:48:33   INFO  epoch: 58/72, acc_iter=224246, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:21, time_cost(all): 2 days, 3:53:18/12:14:05, loss=0.335688228967773, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.6738809567052195, lr=0.05224706603161178
2023-12-07 14:49:14   INFO  epoch: 58/72, acc_iter=224296, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:41, time_cost(all): 2 days, 3:53:59/12:06:26, loss=0.335629031541832, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=4.153115072358867, lr=0.05213309895624396
2023-12-07 14:49:56   INFO  epoch: 58/72, acc_iter=224346, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:56, time_cost(all): 2 days, 3:54:41/12:22:24, loss=0.335569834115891, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=2.177809597393908, lr=0.0520191318808762
2023-12-07 14:50:38   INFO  epoch: 58/72, acc_iter=224396, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:50, time_cost(all): 2 days, 3:55:23/12:24:11, loss=0.33551063668995, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=4.215989018872781, lr=0.05190516480550844
2023-12-07 14:51:20   INFO  epoch: 58/72, acc_iter=224446, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:35, time_cost(all): 2 days, 3:56:05/12:35:55, loss=0.335451439264009, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=0.6835650383887647, lr=0.05179119773014057
2023-12-07 14:52:02   INFO  epoch: 58/72, acc_iter=224496, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:43, time_cost(all): 2 days, 3:56:47/12:33:13, loss=0.335392241838068, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=2.0272542268445766, lr=0.05167723065477281
2023-12-07 14:52:43   INFO  epoch: 58/72, acc_iter=224546, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:20, time_cost(all): 2 days, 3:57:28/12:48:59, loss=0.335333044412127, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.2103431107133518, lr=0.051563263579404994
2023-12-07 14:53:25   INFO  epoch: 58/72, acc_iter=224596, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:37, time_cost(all): 2 days, 3:58:10/12:02:23, loss=0.335273846986186, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=3.302091746838852, lr=0.051449296504037234
2023-12-07 14:54:07   INFO  epoch: 58/72, acc_iter=224646, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:57, time_cost(all): 2 days, 3:58:52/12:15:18, loss=0.335214649560245, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=4.274743041469491, lr=0.05133532942866942
2023-12-07 14:54:49   INFO  epoch: 58/72, acc_iter=224696, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:46, time_cost(all): 2 days, 3:59:34/12:15:19, loss=0.335155452134304, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=2.676768467648015, lr=0.0512213623533016
2023-12-07 14:55:30   INFO  epoch: 58/72, acc_iter=224746, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:14, time_cost(all): 2 days, 4:00:15/13:08:55, loss=0.335096254708364, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=3.4009600359263237, lr=0.05110739527793384
2023-12-07 14:56:12   INFO  epoch: 58/72, acc_iter=224796, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:37, time_cost(all): 2 days, 4:00:57/12:48:08, loss=0.335037057282423, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=3.1310189898779814, lr=0.05099342820256608
2023-12-07 14:56:54   INFO  epoch: 58/72, acc_iter=224846, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:51, time_cost(all): 2 days, 4:01:39/12:24:20, loss=0.334977859856482, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=2.254389583827703, lr=0.05087946112719821
2023-12-07 14:57:36   INFO  epoch: 58/72, acc_iter=224896, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:14, time_cost(all): 2 days, 4:02:21/12:45:38, loss=0.334918662430541, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=2.658657280041846, lr=0.05076549405183045
2023-12-07 14:58:18   INFO  epoch: 58/72, acc_iter=224946, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:11, time_cost(all): 2 days, 4:03:03/12:40:53, loss=0.3348594650046, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=4.574163057107746, lr=0.05065152697646269
2023-12-07 14:58:59   INFO  epoch: 58/72, acc_iter=224996, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:33, time_cost(all): 2 days, 4:03:44/12:22:45, loss=0.334800267578659, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=3.168946221375929, lr=0.050537559901094875
2023-12-07 14:59:41   INFO  epoch: 58/72, acc_iter=225046, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:19, time_cost(all): 2 days, 4:04:26/12:54:56, loss=0.334741070152718, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=1.9658157486838186, lr=0.05042359282572706
2023-12-07 15:00:23   INFO  epoch: 58/72, acc_iter=225096, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:41, time_cost(all): 2 days, 4:05:08/12:52:50, loss=0.334681872726777, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=3.639510831372786, lr=0.0503096257503593
2023-12-07 15:01:05   INFO  epoch: 58/72, acc_iter=225146, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:13, time_cost(all): 2 days, 4:05:50/12:13:46, loss=0.334622675300836, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=2.9338094578087603, lr=0.050195658674991483
2023-12-07 15:01:46   INFO  epoch: 58/72, acc_iter=225196, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:25, time_cost(all): 2 days, 4:06:31/12:01:34, loss=0.334563477874895, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=2.947284024010489, lr=0.05008169159962367
2023-12-07 15:02:28   INFO  epoch: 58/72, acc_iter=225246, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:07, time_cost(all): 2 days, 4:07:13/12:12:38, loss=0.334504280448954, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=3.7853457131755066, lr=0.049983150303104185
2023-12-07 15:03:10   INFO  epoch: 58/72, acc_iter=225296, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:27, time_cost(all): 2 days, 4:07:55/12:08:01, loss=0.334445083023013, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=4.800107983213659, lr=0.049923652785816584
2023-12-07 15:03:52   INFO  epoch: 58/72, acc_iter=225346, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:48, time_cost(all): 2 days, 4:08:37/11:58:38, loss=0.334385885597072, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=0.9696283418405078, lr=0.04986415526852899
2023-12-07 15:04:34   INFO  epoch: 58/72, acc_iter=225396, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:19, time_cost(all): 2 days, 4:09:19/12:56:31, loss=0.334326688171131, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=3.601387649361495, lr=0.04980465775124139
2023-12-07 15:05:15   INFO  epoch: 58/72, acc_iter=225446, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:23, time_cost(all): 2 days, 4:10:00/12:14:55, loss=0.33426749074519, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=4.505866021874369, lr=0.04974516023395379
2023-12-07 15:05:57   INFO  epoch: 58/72, acc_iter=225496, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:05, time_cost(all): 2 days, 4:10:42/11:59:35, loss=0.334208293319249, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=3.9267654370685454, lr=0.04968566271666619
2023-12-07 15:06:39   INFO  epoch: 58/72, acc_iter=225546, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:27, time_cost(all): 2 days, 4:11:24/12:46:50, loss=0.334149095893308, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=4.737998236704823, lr=0.04962616519937859
2023-12-07 15:07:21   INFO  epoch: 58/72, acc_iter=225596, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:30, time_cost(all): 2 days, 4:12:06/12:46:19, loss=0.334089898467368, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=1.2750114000660808, lr=0.049566667682091
2023-12-07 15:08:02   INFO  epoch: 58/72, acc_iter=225646, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:53, time_cost(all): 2 days, 4:12:47/12:38:56, loss=0.334030701041427, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=1.54102409684092, lr=0.049507170164803396
2023-12-07 15:08:44   INFO  epoch: 58/72, acc_iter=225696, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:44, time_cost(all): 2 days, 4:13:29/12:08:20, loss=0.333971503615486, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=2.1401031228825405, lr=0.049447672647515796
2023-12-07 15:09:26   INFO  epoch: 58/72, acc_iter=225746, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:11, time_cost(all): 2 days, 4:14:11/11:49:29, loss=0.333912306189545, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=3.3143697265020315, lr=0.049388175130228196
2023-12-07 15:10:08   INFO  epoch: 58/72, acc_iter=225796, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:35, time_cost(all): 2 days, 4:14:53/12:34:43, loss=0.333853108763604, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=3.559639455531378, lr=0.0493286776129406
2023-12-07 15:10:50   INFO  epoch: 58/72, acc_iter=225846, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:31, time_cost(all): 2 days, 4:15:35/11:54:57, loss=0.333793911337663, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=3.4127133133724907, lr=0.049269180095653
2023-12-07 15:11:31   INFO  epoch: 58/72, acc_iter=225896, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:07, time_cost(all): 2 days, 4:16:16/12:51:40, loss=0.333734713911722, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=2.3471717574703166, lr=0.0492096825783654
2023-12-07 15:12:13   INFO  epoch: 58/72, acc_iter=225946, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:03, time_cost(all): 2 days, 4:16:58/11:54:09, loss=0.333675516485781, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=1.8651281072029802, lr=0.0491501850610778
2023-12-07 15:12:55   INFO  epoch: 58/72, acc_iter=225996, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:33, time_cost(all): 2 days, 4:17:40/12:10:10, loss=0.33361631905984, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=2.37155462252463, lr=0.04909068754379021
2023-12-07 15:13:37   INFO  epoch: 58/72, acc_iter=226046, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:28, time_cost(all): 2 days, 4:18:22/12:03:50, loss=0.333557121633899, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=3.3154781491801115, lr=0.04903119002650261
2023-12-07 15:14:19   INFO  epoch: 58/72, acc_iter=226096, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:13, time_cost(all): 2 days, 4:19:04/12:11:57, loss=0.333497924207958, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=4.372094379112732, lr=0.04897169250921501
2023-12-07 15:15:00   INFO  epoch: 58/72, acc_iter=226146, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:34, time_cost(all): 2 days, 4:19:45/12:29:01, loss=0.333438726782017, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=4.045234318699379, lr=0.04891219499192741
2023-12-07 15:15:42   INFO  epoch: 58/72, acc_iter=226196, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:56, time_cost(all): 2 days, 4:20:27/11:38:57, loss=0.333379529356076, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=2.933272197404672, lr=0.04885269747463981
2023-12-07 15:16:24   INFO  epoch: 58/72, acc_iter=226246, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:49, time_cost(all): 2 days, 4:21:09/11:48:39, loss=0.333320331930135, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=1.9861892109081452, lr=0.048793199957352214
2023-12-07 15:17:06   INFO  epoch: 58/72, acc_iter=226296, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:52, time_cost(all): 2 days, 4:21:51/12:39:55, loss=0.333261134504194, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=4.937124322664497, lr=0.048733702440064613
2023-12-07 15:17:47   INFO  epoch: 58/72, acc_iter=226346, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:54, time_cost(all): 2 days, 4:22:32/12:22:43, loss=0.333201937078253, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=2.9234746944981262, lr=0.04867420492277701
2023-12-07 15:18:29   INFO  epoch: 58/72, acc_iter=226396, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:04, time_cost(all): 2 days, 4:23:14/11:50:13, loss=0.333142739652312, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=2.733811086858972, lr=0.04861470740548941
2023-12-07 15:19:11   INFO  epoch: 58/72, acc_iter=226446, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:48, time_cost(all): 2 days, 4:23:56/11:41:08, loss=0.333083542226372, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=3.891972094557376, lr=0.04855520988820182
2023-12-07 15:19:53   INFO  epoch: 58/72, acc_iter=226496, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:07, time_cost(all): 2 days, 4:24:38/11:41:20, loss=0.333024344800431, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=1.6247084923381703, lr=0.04849571237091422
2023-12-07 15:20:35   INFO  epoch: 58/72, acc_iter=226546, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:31, time_cost(all): 2 days, 4:25:20/12:22:47, loss=0.33296514737449, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=4.176153443493727, lr=0.04843621485362662
2023-12-07 15:21:16   INFO  epoch: 58/72, acc_iter=226596, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:40, time_cost(all): 2 days, 4:26:01/12:16:33, loss=0.332905949948549, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=3.300278961740875, lr=0.04837671733633902
2023-12-07 15:21:58   INFO  epoch: 58/72, acc_iter=226646, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:47, time_cost(all): 2 days, 4:26:43/12:17:42, loss=0.332846752522608, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=4.676247034736062, lr=0.048317219819051425
2023-12-07 15:22:40   INFO  epoch: 58/72, acc_iter=226696, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:54, time_cost(all): 2 days, 4:27:25/11:58:25, loss=0.332787555096667, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=0.9272177655497722, lr=0.048257722301763825
2023-12-07 15:23:22   INFO  epoch: 58/72, acc_iter=226746, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:24, time_cost(all): 2 days, 4:28:07/12:21:32, loss=0.332728357670726, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=2.3393276989894445, lr=0.048198224784476225
2023-12-07 15:24:03   INFO  epoch: 58/72, acc_iter=226796, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:39, time_cost(all): 2 days, 4:28:48/12:25:51, loss=0.332669160244785, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=4.096970613640531, lr=0.048138727267188625
2023-12-07 15:24:45   INFO  epoch: 58/72, acc_iter=226846, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:27, time_cost(all): 2 days, 4:29:30/12:08:00, loss=0.332609962818844, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=1.0739014983417354, lr=0.048079229749901024
2023-12-07 15:25:27   INFO  epoch: 58/72, acc_iter=226896, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:47, time_cost(all): 2 days, 4:30:12/11:51:22, loss=0.332550765392903, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=4.228066236595261, lr=0.04801973223261343
2023-12-07 15:26:09   INFO  epoch: 58/72, acc_iter=226946, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:21, time_cost(all): 2 days, 4:30:54/11:43:16, loss=0.332491567966962, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=3.6070814829139763, lr=0.04796023471532583
2023-12-07 15:26:51   INFO  epoch: 58/72, acc_iter=226996, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:29, time_cost(all): 2 days, 4:31:36/12:08:13, loss=0.332432370541021, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=4.6332592836168125, lr=0.04790073719803823
2023-12-07 15:27:32   INFO  epoch: 58/72, acc_iter=227046, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:46, time_cost(all): 2 days, 4:32:17/11:42:23, loss=0.33237317311508, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=3.895676751796566, lr=0.04784123968075063
2023-12-07 15:28:14   INFO  epoch: 58/72, acc_iter=227096, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:50, time_cost(all): 2 days, 4:32:59/12:22:20, loss=0.332313975689139, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=3.513626817740351, lr=0.04778174216346304
2023-12-07 15:28:56   INFO  epoch: 58/72, acc_iter=227146, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:04, time_cost(all): 2 days, 4:33:41/12:18:16, loss=0.332254778263198, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=1.5805998851316765, lr=0.04772224464617544
2023-12-07 15:29:38   INFO  epoch: 58/72, acc_iter=227196, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:46, time_cost(all): 2 days, 4:34:23/12:16:39, loss=0.332195580837257, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=1.2119553179123068, lr=0.047662747128887836
2023-12-07 15:30:19   INFO  epoch: 58/72, acc_iter=227246, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:51, time_cost(all): 2 days, 4:35:04/11:39:53, loss=0.332136383411316, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=2.783578665048949, lr=0.047603249611600236
2023-12-07 15:31:01   INFO  epoch: 58/72, acc_iter=227296, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:09, time_cost(all): 2 days, 4:35:46/11:58:48, loss=0.332077185985376, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=3.348529974536088, lr=0.047543752094312636
2023-12-07 15:31:43   INFO  epoch: 58/72, acc_iter=227346, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:19, time_cost(all): 2 days, 4:36:28/11:38:42, loss=0.332017988559435, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=1.5254481201799202, lr=0.04748425457702504
2023-12-07 15:32:25   INFO  epoch: 58/72, acc_iter=227396, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:44, time_cost(all): 2 days, 4:37:10/12:30:18, loss=0.331958791133494, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=2.8203329450901955, lr=0.04742475705973744
2023-12-07 15:33:07   INFO  epoch: 58/72, acc_iter=227446, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:38, time_cost(all): 2 days, 4:37:52/11:30:08, loss=0.331899593707553, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=3.008560500732851, lr=0.04736525954244984
2023-12-07 15:33:48   INFO  epoch: 58/72, acc_iter=227496, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:52, time_cost(all): 2 days, 4:38:33/12:18:19, loss=0.331840396281612, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=2.016539424714068, lr=0.04730576202516224
2023-12-07 15:34:30   INFO  epoch: 58/72, acc_iter=227546, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:32, time_cost(all): 2 days, 4:39:15/12:28:22, loss=0.331781198855671, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=4.412794744553608, lr=0.04724626450787465
2023-12-07 15:35:12   INFO  epoch: 58/72, acc_iter=227596, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:43, time_cost(all): 2 days, 4:39:57/12:21:15, loss=0.33172200142973, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.1255576216199015, lr=0.04718676699058705
2023-12-07 15:35:54   INFO  epoch: 58/72, acc_iter=227646, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:48, time_cost(all): 2 days, 4:40:39/12:05:12, loss=0.331662804003789, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=0.5762317110627317, lr=0.04712726947329945
2023-12-07 15:36:35   INFO  epoch: 58/72, acc_iter=227696, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:16, time_cost(all): 2 days, 4:41:20/12:26:30, loss=0.331603606577848, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=0.7925373622178078, lr=0.04706777195601185
2023-12-07 15:37:17   INFO  epoch: 58/72, acc_iter=227746, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 2 days, 4:42:02/12:00:18, loss=0.331544409151907, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=1.8871829951527233, lr=0.047008274438724254
2023-12-07 15:37:59   INFO  epoch: 58/72, acc_iter=227796, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 2 days, 4:42:44/11:54:11, loss=0.331485211725966, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=1.2292705755112325, lr=0.046948776921436654
2023-12-07 15:38:41   INFO  epoch: 58/72, acc_iter=227846, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 4:43:26/11:21:59, loss=0.331426014300025, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=1.49024995426476, lr=0.046889279404149053
2023-12-07 15:39:23   INFO  epoch: 59/72, acc_iter=227908, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:20, time_cost(all): 2 days, 4:44:08/11:39:22, loss=0.331352609491858, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=3.4012517857177773, lr=0.046815502482712434
2023-12-07 15:40:04   INFO  epoch: 59/72, acc_iter=227958, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:47, time_cost(all): 2 days, 4:44:49/12:08:52, loss=0.331293412065917, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=0.6522262278772994, lr=0.046756004965424834
2023-12-07 15:40:46   INFO  epoch: 59/72, acc_iter=228008, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:56, time_cost(all): 2 days, 4:45:31/11:46:34, loss=0.331234214639977, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=4.476323404321801, lr=0.046696507448137234
2023-12-07 15:41:28   INFO  epoch: 59/72, acc_iter=228058, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:08, time_cost(all): 2 days, 4:46:13/12:17:34, loss=0.331175017214036, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=3.0154671018985018, lr=0.046637009930849634
2023-12-07 15:42:10   INFO  epoch: 59/72, acc_iter=228108, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:43, time_cost(all): 2 days, 4:46:55/12:02:41, loss=0.331115819788095, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=1.077864169922626, lr=0.04657751241356203
2023-12-07 15:42:51   INFO  epoch: 59/72, acc_iter=228158, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:14, time_cost(all): 2 days, 4:47:36/11:49:43, loss=0.331056622362154, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=1.1209414597925604, lr=0.04651801489627444
2023-12-07 15:43:33   INFO  epoch: 59/72, acc_iter=228208, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:24, time_cost(all): 2 days, 4:48:18/11:19:24, loss=0.330997424936213, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=1.9288328597058433, lr=0.04645851737898684
2023-12-07 15:44:15   INFO  epoch: 59/72, acc_iter=228258, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:03, time_cost(all): 2 days, 4:49:00/12:03:56, loss=0.330938227510272, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=2.6115239617358776, lr=0.04639901986169924
2023-12-07 15:44:57   INFO  epoch: 59/72, acc_iter=228308, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:14, time_cost(all): 2 days, 4:49:42/11:17:49, loss=0.330879030084331, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=3.7743516920721927, lr=0.04633952234441164
2023-12-07 15:45:39   INFO  epoch: 59/72, acc_iter=228358, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:46:34, time_cost(all): 2 days, 4:50:24/11:15:08, loss=0.33081983265839, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=4.733584795613962, lr=0.046280024827124046
2023-12-07 15:46:20   INFO  epoch: 59/72, acc_iter=228408, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:35, time_cost(all): 2 days, 4:51:05/11:38:48, loss=0.330760635232449, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=2.4606984289618685, lr=0.046220527309836446
2023-12-07 15:47:02   INFO  epoch: 59/72, acc_iter=228458, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:34, time_cost(all): 2 days, 4:51:47/11:36:16, loss=0.330701437806508, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=2.7478433340470287, lr=0.046161029792548845
2023-12-07 15:47:44   INFO  epoch: 59/72, acc_iter=228508, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:07, time_cost(all): 2 days, 4:52:29/11:36:55, loss=0.330642240380567, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=0.8008129053331439, lr=0.046101532275261245
2023-12-07 15:48:26   INFO  epoch: 59/72, acc_iter=228558, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:46, time_cost(all): 2 days, 4:53:11/12:11:15, loss=0.330583042954626, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=2.658533558586133, lr=0.04604203475797365
2023-12-07 15:49:08   INFO  epoch: 59/72, acc_iter=228608, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:41:37, time_cost(all): 2 days, 4:53:53/11:19:06, loss=0.330523845528685, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=0.969440786449399, lr=0.04598253724068605
2023-12-07 15:49:49   INFO  epoch: 59/72, acc_iter=228658, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:19, time_cost(all): 2 days, 4:54:34/12:05:10, loss=0.330464648102744, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=2.96234137266437, lr=0.04592303972339845
2023-12-07 15:50:31   INFO  epoch: 59/72, acc_iter=228708, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:41, time_cost(all): 2 days, 4:55:16/11:43:14, loss=0.330405450676803, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=2.031993894279774, lr=0.04586354220611085
2023-12-07 15:51:13   INFO  epoch: 59/72, acc_iter=228758, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:01, time_cost(all): 2 days, 4:55:58/11:50:33, loss=0.330346253250862, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=4.7653172571668465, lr=0.04580404468882325
2023-12-07 15:51:55   INFO  epoch: 59/72, acc_iter=228808, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:42:06, time_cost(all): 2 days, 4:56:40/11:31:26, loss=0.330287055824921, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=1.4574386637442849, lr=0.04574454717153566
2023-12-07 15:52:36   INFO  epoch: 59/72, acc_iter=228858, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:11, time_cost(all): 2 days, 4:57:21/12:01:23, loss=0.330227858398981, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=1.0036223396541708, lr=0.04568504965424806
2023-12-07 15:53:18   INFO  epoch: 59/72, acc_iter=228908, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:54, time_cost(all): 2 days, 4:58:03/11:20:27, loss=0.33016866097304, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=2.2562858667856984, lr=0.04562555213696046
2023-12-07 15:54:00   INFO  epoch: 59/72, acc_iter=228958, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:55, time_cost(all): 2 days, 4:58:45/11:54:30, loss=0.330109463547099, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=4.267950059767012, lr=0.045566054619672856
2023-12-07 15:54:42   INFO  epoch: 59/72, acc_iter=229008, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:37, time_cost(all): 2 days, 4:59:27/11:49:07, loss=0.330050266121158, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=4.791549032387803, lr=0.04550655710238526
2023-12-07 15:55:24   INFO  epoch: 59/72, acc_iter=229058, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:36, time_cost(all): 2 days, 5:00:09/11:00:37, loss=0.329991068695217, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=4.112658069396481, lr=0.04544705958509766
2023-12-07 15:56:05   INFO  epoch: 59/72, acc_iter=229108, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:53, time_cost(all): 2 days, 5:00:50/11:28:36, loss=0.329931871269276, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=2.2497852908682905, lr=0.04538756206781006
2023-12-07 15:56:47   INFO  epoch: 59/72, acc_iter=229158, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:56, time_cost(all): 2 days, 5:01:32/11:10:12, loss=0.329872673843335, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=2.0072749232457725, lr=0.04532806455052246
2023-12-07 15:57:29   INFO  epoch: 59/72, acc_iter=229208, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:40, time_cost(all): 2 days, 5:02:14/10:59:49, loss=0.329813476417394, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=0.9870150694723465, lr=0.04526856703323487
2023-12-07 15:58:11   INFO  epoch: 59/72, acc_iter=229258, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:50, time_cost(all): 2 days, 5:02:56/11:59:31, loss=0.329754278991453, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=0.6277139959758631, lr=0.04520906951594727
2023-12-07 15:58:52   INFO  epoch: 59/72, acc_iter=229308, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:34, time_cost(all): 2 days, 5:03:37/11:49:13, loss=0.329695081565512, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=0.7578052658920107, lr=0.04514957199865967
2023-12-07 15:59:34   INFO  epoch: 59/72, acc_iter=229358, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:48, time_cost(all): 2 days, 5:04:19/11:47:08, loss=0.329635884139571, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=2.7281002296817167, lr=0.04509007448137207
2023-12-07 16:00:16   INFO  epoch: 59/72, acc_iter=229408, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:34, time_cost(all): 2 days, 5:05:01/11:10:22, loss=0.32957668671363, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=2.1075961632194007, lr=0.04503057696408447
2023-12-07 16:00:58   INFO  epoch: 59/72, acc_iter=229458, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:36, time_cost(all): 2 days, 5:05:43/11:16:17, loss=0.329517489287689, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=1.8685261676720761, lr=0.044971079446796874
2023-12-07 16:01:40   INFO  epoch: 59/72, acc_iter=229508, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:32:09, time_cost(all): 2 days, 5:06:25/11:25:22, loss=0.329458291861748, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=2.3229770824198415, lr=0.044911581929509274
2023-12-07 16:02:21   INFO  epoch: 59/72, acc_iter=229558, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:17, time_cost(all): 2 days, 5:07:06/11:22:21, loss=0.329399094435807, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=1.9852066462778653, lr=0.044852084412221674
2023-12-07 16:03:03   INFO  epoch: 59/72, acc_iter=229608, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:59, time_cost(all): 2 days, 5:07:48/11:19:45, loss=0.329339897009866, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=1.8831995397391643, lr=0.044792586894934074
2023-12-07 16:03:45   INFO  epoch: 59/72, acc_iter=229658, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:19, time_cost(all): 2 days, 5:08:30/11:26:00, loss=0.329280699583925, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=2.2379513010151375, lr=0.04473308937764648
2023-12-07 16:04:27   INFO  epoch: 59/72, acc_iter=229708, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:59, time_cost(all): 2 days, 5:09:12/11:23:38, loss=0.329221502157985, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=1.1853460444511246, lr=0.04467359186035888
2023-12-07 16:05:08   INFO  epoch: 59/72, acc_iter=229758, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:13, time_cost(all): 2 days, 5:09:53/11:22:06, loss=0.329162304732044, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=1.8451478787988191, lr=0.04461409434307128
2023-12-07 16:05:50   INFO  epoch: 59/72, acc_iter=229808, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:25, time_cost(all): 2 days, 5:10:35/11:28:49, loss=0.329103107306103, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=4.671184587500183, lr=0.04455459682578368
2023-12-07 16:06:32   INFO  epoch: 59/72, acc_iter=229858, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:50, time_cost(all): 2 days, 5:11:17/11:23:39, loss=0.329043909880162, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=2.7946710597850735, lr=0.044495099308496086
2023-12-07 16:07:14   INFO  epoch: 59/72, acc_iter=229908, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:08, time_cost(all): 2 days, 5:11:59/11:37:21, loss=0.328984712454221, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=4.655493751268635, lr=0.044435601791208486
2023-12-07 16:07:56   INFO  epoch: 59/72, acc_iter=229958, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:55, time_cost(all): 2 days, 5:12:41/11:20:24, loss=0.32892551502828, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=3.225457749903205, lr=0.044376104273920886
2023-12-07 16:08:37   INFO  epoch: 59/72, acc_iter=230008, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:54, time_cost(all): 2 days, 5:13:22/11:31:54, loss=0.328866317602339, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=4.042816801436423, lr=0.044316606756633285
2023-12-07 16:09:19   INFO  epoch: 59/72, acc_iter=230058, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:31, time_cost(all): 2 days, 5:14:04/10:58:26, loss=0.328807120176398, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=0.920228828455303, lr=0.04425710923934569
2023-12-07 16:10:01   INFO  epoch: 59/72, acc_iter=230108, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:12, time_cost(all): 2 days, 5:14:46/11:23:05, loss=0.328747922750457, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=1.0560463955816348, lr=0.04419761172205809
2023-12-07 16:10:43   INFO  epoch: 59/72, acc_iter=230158, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:59, time_cost(all): 2 days, 5:15:28/10:45:35, loss=0.328688725324516, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=3.273209268528038, lr=0.04413811420477049
2023-12-07 16:11:24   INFO  epoch: 59/72, acc_iter=230208, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:31, time_cost(all): 2 days, 5:16:09/11:06:00, loss=0.328629527898575, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.23(1.03), norm=4.742004755598526, lr=0.04407861668748289
2023-12-07 16:12:06   INFO  epoch: 59/72, acc_iter=230258, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:27, time_cost(all): 2 days, 5:16:51/11:06:08, loss=0.328570330472634, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=4.780719781827678, lr=0.04401911917019529
2023-12-07 16:12:48   INFO  epoch: 59/72, acc_iter=230308, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:01, time_cost(all): 2 days, 5:17:33/11:43:47, loss=0.328511133046693, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=1.3700611140395615, lr=0.0439596216529077
2023-12-07 16:13:30   INFO  epoch: 59/72, acc_iter=230358, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:48, time_cost(all): 2 days, 5:18:15/10:44:34, loss=0.328451935620752, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=4.695153087006694, lr=0.0439001241356201
2023-12-07 16:14:12   INFO  epoch: 59/72, acc_iter=230408, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:19:00, time_cost(all): 2 days, 5:18:57/11:16:02, loss=0.328392738194811, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=1.95585495709707, lr=0.0438406266183325
2023-12-07 16:14:53   INFO  epoch: 59/72, acc_iter=230458, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:21, time_cost(all): 2 days, 5:19:38/10:56:22, loss=0.32833354076887, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=4.207215946568931, lr=0.0437811291010449
2023-12-07 16:15:35   INFO  epoch: 59/72, acc_iter=230508, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:35, time_cost(all): 2 days, 5:20:20/11:15:43, loss=0.328274343342929, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=1.4102201897892392, lr=0.0437216315837573
2023-12-07 16:16:17   INFO  epoch: 59/72, acc_iter=230558, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:00, time_cost(all): 2 days, 5:21:02/11:34:04, loss=0.328215145916989, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=2.298150421337664, lr=0.0436621340664697
2023-12-07 16:16:59   INFO  epoch: 59/72, acc_iter=230608, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:58, time_cost(all): 2 days, 5:21:44/11:44:14, loss=0.328155948491048, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=1.9658827784797193, lr=0.0436026365491821
2023-12-07 16:17:40   INFO  epoch: 59/72, acc_iter=230658, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:31, time_cost(all): 2 days, 5:22:25/11:23:55, loss=0.328096751065107, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=4.453952008033415, lr=0.0435431390318945
2023-12-07 16:18:22   INFO  epoch: 59/72, acc_iter=230708, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:25, time_cost(all): 2 days, 5:23:07/11:17:03, loss=0.328037553639166, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=3.227937727362595, lr=0.0434836415146069
2023-12-07 16:19:04   INFO  epoch: 59/72, acc_iter=230758, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:12, time_cost(all): 2 days, 5:23:49/10:40:57, loss=0.327978356213225, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=0.5056561881051429, lr=0.04342414399731931
2023-12-07 16:19:46   INFO  epoch: 59/72, acc_iter=230808, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:42, time_cost(all): 2 days, 5:24:31/11:10:59, loss=0.327919158787284, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=2.8727046579940785, lr=0.04336464648003171
2023-12-07 16:20:28   INFO  epoch: 59/72, acc_iter=230858, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:29, time_cost(all): 2 days, 5:25:13/10:40:06, loss=0.327859961361343, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=4.2786218222668495, lr=0.04330514896274411
2023-12-07 16:21:09   INFO  epoch: 59/72, acc_iter=230908, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:38, time_cost(all): 2 days, 5:25:54/10:58:56, loss=0.327800763935402, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=4.8873415156605375, lr=0.04324565144545651
2023-12-07 16:21:51   INFO  epoch: 59/72, acc_iter=230958, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:52, time_cost(all): 2 days, 5:26:36/11:34:41, loss=0.327741566509461, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.5369122023514756, lr=0.043186153928168915
2023-12-07 16:22:33   INFO  epoch: 59/72, acc_iter=231008, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:09, time_cost(all): 2 days, 5:27:18/10:59:37, loss=0.32768236908352, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=4.473272709419071, lr=0.043126656410881314
2023-12-07 16:23:15   INFO  epoch: 59/72, acc_iter=231058, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:48, time_cost(all): 2 days, 5:28:00/10:47:52, loss=0.327623171657579, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=2.0603593521043133, lr=0.043067158893593714
2023-12-07 16:23:57   INFO  epoch: 59/72, acc_iter=231108, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:51, time_cost(all): 2 days, 5:28:42/11:12:49, loss=0.327563974231638, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=3.2396736259349797, lr=0.043007661376306114
2023-12-07 16:24:38   INFO  epoch: 59/72, acc_iter=231158, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:28, time_cost(all): 2 days, 5:29:23/10:48:50, loss=0.327504776805697, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=2.1556042280677925, lr=0.04294816385901852
2023-12-07 16:25:20   INFO  epoch: 59/72, acc_iter=231208, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:50, time_cost(all): 2 days, 5:30:05/10:45:23, loss=0.327445579379756, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.83(1.03), norm=2.322890215471155, lr=0.04288866634173092
2023-12-07 16:26:02   INFO  epoch: 59/72, acc_iter=231258, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:40, time_cost(all): 2 days, 5:30:47/10:33:00, loss=0.327386381953815, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=4.278208934978908, lr=0.04282916882444332
2023-12-07 16:26:44   INFO  epoch: 59/72, acc_iter=231308, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:33, time_cost(all): 2 days, 5:31:29/10:52:47, loss=0.327327184527874, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=3.059031784891181, lr=0.04276967130715572
2023-12-07 16:27:25   INFO  epoch: 59/72, acc_iter=231358, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:12, time_cost(all): 2 days, 5:32:10/11:27:18, loss=0.327267987101933, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=3.1532849953359223, lr=0.04271017378986812
2023-12-07 16:28:07   INFO  epoch: 59/72, acc_iter=231408, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:29, time_cost(all): 2 days, 5:32:52/11:09:07, loss=0.327208789675992, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=3.6266400462101416, lr=0.042650676272580526
2023-12-07 16:28:49   INFO  epoch: 59/72, acc_iter=231458, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:43, time_cost(all): 2 days, 5:33:34/11:25:57, loss=0.327149592250052, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.83(1.03), norm=2.904626678156809, lr=0.042591178755292926
2023-12-07 16:29:31   INFO  epoch: 59/72, acc_iter=231508, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:01, time_cost(all): 2 days, 5:34:16/11:13:20, loss=0.327090394824111, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=1.4126354067023459, lr=0.042531681238005326
2023-12-07 16:30:13   INFO  epoch: 59/72, acc_iter=231558, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:19, time_cost(all): 2 days, 5:34:58/10:30:08, loss=0.32703119739817, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=0.9552904120844481, lr=0.042472183720717725
2023-12-07 16:30:54   INFO  epoch: 59/72, acc_iter=231608, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:35, time_cost(all): 2 days, 5:35:39/11:09:16, loss=0.326971999972229, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=3.733910407757252, lr=0.04241268620343013
2023-12-07 16:31:36   INFO  epoch: 59/72, acc_iter=231658, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 2 days, 5:36:21/10:27:43, loss=0.326912802546288, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=3.372567974569959, lr=0.04235318868614253
2023-12-07 16:32:18   INFO  epoch: 59/72, acc_iter=231708, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 5:37:03/10:52:24, loss=0.326853605120347, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=0.9923570720317727, lr=0.04229369116885493
2023-12-07 16:33:00   INFO  epoch: 60/72, acc_iter=231770, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:50:49, time_cost(all): 2 days, 5:37:45/10:51:26, loss=0.32678020031218, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=0.6199260094287633, lr=0.04221991424741831
2023-12-07 16:33:41   INFO  epoch: 60/72, acc_iter=231820, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:17, time_cost(all): 2 days, 5:38:26/10:28:25, loss=0.326721002886239, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=1.5011874401631702, lr=0.04216041673013071
2023-12-07 16:34:23   INFO  epoch: 60/72, acc_iter=231870, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:53:21, time_cost(all): 2 days, 5:39:08/10:37:24, loss=0.326661805460298, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.212818533651461, lr=0.04210091921284311
2023-12-07 16:35:05   INFO  epoch: 60/72, acc_iter=231920, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:52:53, time_cost(all): 2 days, 5:39:50/11:21:57, loss=0.326602608034357, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=0.5556465243065023, lr=0.04204142169555551
2023-12-07 16:35:47   INFO  epoch: 60/72, acc_iter=231970, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:36, time_cost(all): 2 days, 5:40:32/10:54:50, loss=0.326543410608416, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=3.1470643037729715, lr=0.04198192417826792
2023-12-07 16:36:29   INFO  epoch: 60/72, acc_iter=232020, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:54, time_cost(all): 2 days, 5:41:14/11:13:41, loss=0.326484213182475, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=1.8882831156583824, lr=0.04192242666098032
2023-12-07 16:37:10   INFO  epoch: 60/72, acc_iter=232070, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:14, time_cost(all): 2 days, 5:41:55/10:52:48, loss=0.326425015756534, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=1.3942303158743488, lr=0.04186292914369272
2023-12-07 16:37:52   INFO  epoch: 60/72, acc_iter=232120, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:52, time_cost(all): 2 days, 5:42:37/11:19:05, loss=0.326365818330594, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=3.4866471202369023, lr=0.04180343162640512
2023-12-07 16:38:34   INFO  epoch: 60/72, acc_iter=232170, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:07, time_cost(all): 2 days, 5:43:19/11:14:39, loss=0.326306620904653, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=4.636968326608557, lr=0.04174393410911752
2023-12-07 16:39:16   INFO  epoch: 60/72, acc_iter=232220, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:32, time_cost(all): 2 days, 5:44:01/10:48:31, loss=0.326247423478712, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=2.7501079378158804, lr=0.041684436591829924
2023-12-07 16:39:57   INFO  epoch: 60/72, acc_iter=232270, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:29, time_cost(all): 2 days, 5:44:42/11:04:41, loss=0.326188226052771, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=4.317816255271242, lr=0.04162493907454232
2023-12-07 16:40:39   INFO  epoch: 60/72, acc_iter=232320, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:57, time_cost(all): 2 days, 5:45:24/10:50:03, loss=0.32612902862683, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=1.2022355302728247, lr=0.04156544155725472
2023-12-07 16:41:21   INFO  epoch: 60/72, acc_iter=232370, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:42:40, time_cost(all): 2 days, 5:46:06/11:15:49, loss=0.326069831200889, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=4.3579689866696825, lr=0.04150594403996712
2023-12-07 16:42:03   INFO  epoch: 60/72, acc_iter=232420, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:43, time_cost(all): 2 days, 5:46:48/10:32:22, loss=0.326010633774948, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=1.4828449943581141, lr=0.04144644652267952
2023-12-07 16:42:45   INFO  epoch: 60/72, acc_iter=232470, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:31, time_cost(all): 2 days, 5:47:30/10:24:03, loss=0.325951436349007, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=3.136334707237967, lr=0.04138694900539193
2023-12-07 16:43:26   INFO  epoch: 60/72, acc_iter=232520, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:48, time_cost(all): 2 days, 5:48:11/11:07:50, loss=0.325892238923066, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=3.7472829930202853, lr=0.04132745148810433
2023-12-07 16:44:08   INFO  epoch: 60/72, acc_iter=232570, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:39:58, time_cost(all): 2 days, 5:48:53/10:38:14, loss=0.325833041497125, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=4.010069554831874, lr=0.04126795397081673
2023-12-07 16:44:50   INFO  epoch: 60/72, acc_iter=232620, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:38, time_cost(all): 2 days, 5:49:35/10:54:17, loss=0.325773844071184, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=2.525922013838118, lr=0.041208456453529135
2023-12-07 16:45:32   INFO  epoch: 60/72, acc_iter=232670, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:43, time_cost(all): 2 days, 5:50:17/11:10:28, loss=0.325714646645243, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=4.897207166430809, lr=0.041148958936241535
2023-12-07 16:46:13   INFO  epoch: 60/72, acc_iter=232720, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:39, time_cost(all): 2 days, 5:50:58/10:17:02, loss=0.325655449219302, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=2.361141630065996, lr=0.041089461418953935
2023-12-07 16:46:55   INFO  epoch: 60/72, acc_iter=232770, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:27, time_cost(all): 2 days, 5:51:40/10:41:50, loss=0.325596251793361, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=3.1021547418560935, lr=0.041029963901666335
2023-12-07 16:47:37   INFO  epoch: 60/72, acc_iter=232820, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:07, time_cost(all): 2 days, 5:52:22/10:52:46, loss=0.32553705436742, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=2.066459149221693, lr=0.040970466384378734
2023-12-07 16:48:19   INFO  epoch: 60/72, acc_iter=232870, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:15, time_cost(all): 2 days, 5:53:04/10:51:16, loss=0.325477856941479, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=4.351171804693914, lr=0.04091096886709114
2023-12-07 16:49:01   INFO  epoch: 60/72, acc_iter=232920, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:09, time_cost(all): 2 days, 5:53:46/10:52:56, loss=0.325418659515538, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=0.5366429018247312, lr=0.04085147134980354
2023-12-07 16:49:42   INFO  epoch: 60/72, acc_iter=232970, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:09, time_cost(all): 2 days, 5:54:27/11:09:10, loss=0.325359462089597, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.83(1.03), norm=4.104277444433691, lr=0.04079197383251594
2023-12-07 16:50:24   INFO  epoch: 60/72, acc_iter=233020, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:01, time_cost(all): 2 days, 5:55:09/10:18:25, loss=0.325300264663657, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.9362500259859146, lr=0.04073247631522834
2023-12-07 16:51:06   INFO  epoch: 60/72, acc_iter=233070, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:32, time_cost(all): 2 days, 5:55:51/11:06:43, loss=0.325241067237716, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=1.0415698523586325, lr=0.04067297879794074
2023-12-07 16:51:48   INFO  epoch: 60/72, acc_iter=233120, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:00, time_cost(all): 2 days, 5:56:33/10:13:08, loss=0.325181869811775, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=3.835938499800165, lr=0.040613481280653146
2023-12-07 16:52:29   INFO  epoch: 60/72, acc_iter=233170, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:08, time_cost(all): 2 days, 5:57:14/10:07:22, loss=0.325122672385834, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=1.6180400846988199, lr=0.040553983763365546
2023-12-07 16:53:11   INFO  epoch: 60/72, acc_iter=233220, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:54, time_cost(all): 2 days, 5:57:56/10:37:59, loss=0.325063474959893, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=2.4973244449761154, lr=0.040494486246077946
2023-12-07 16:53:53   INFO  epoch: 60/72, acc_iter=233270, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:49, time_cost(all): 2 days, 5:58:38/10:37:19, loss=0.325004277533952, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=4.196996565403913, lr=0.04043498872879035
2023-12-07 16:54:35   INFO  epoch: 60/72, acc_iter=233320, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:49, time_cost(all): 2 days, 5:59:20/11:01:34, loss=0.324945080108011, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=2.71456255995502, lr=0.04037549121150275
2023-12-07 16:55:17   INFO  epoch: 60/72, acc_iter=233370, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:37, time_cost(all): 2 days, 6:00:02/10:04:54, loss=0.32488588268207, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.23(1.03), norm=2.251027918231638, lr=0.04031599369421515
2023-12-07 16:55:58   INFO  epoch: 60/72, acc_iter=233420, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:09, time_cost(all): 2 days, 6:00:43/10:35:08, loss=0.324826685256129, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=4.061439235741574, lr=0.04025649617692755
2023-12-07 16:56:40   INFO  epoch: 60/72, acc_iter=233470, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:25, time_cost(all): 2 days, 6:01:25/10:28:22, loss=0.324767487830188, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=3.1865645366294064, lr=0.04019699865963995
2023-12-07 16:57:22   INFO  epoch: 60/72, acc_iter=233520, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:02, time_cost(all): 2 days, 6:02:07/10:39:48, loss=0.324708290404247, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=2.491865192948514, lr=0.04013750114235236
2023-12-07 16:58:04   INFO  epoch: 60/72, acc_iter=233570, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:42, time_cost(all): 2 days, 6:02:49/10:06:00, loss=0.324649092978306, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=3.885184149039616, lr=0.04007800362506476
2023-12-07 16:58:46   INFO  epoch: 60/72, acc_iter=233620, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:49, time_cost(all): 2 days, 6:03:31/10:42:40, loss=0.324589895552365, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=2.3035717613881688, lr=0.04001850610777716
2023-12-07 16:59:27   INFO  epoch: 60/72, acc_iter=233670, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:51, time_cost(all): 2 days, 6:04:12/10:58:05, loss=0.324530698126424, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=1.6656252918578562, lr=0.03995900859048956
2023-12-07 17:00:09   INFO  epoch: 60/72, acc_iter=233720, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:27:10, time_cost(all): 2 days, 6:04:54/10:10:30, loss=0.324471500700483, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=4.515005424954772, lr=0.03989951107320196
2023-12-07 17:00:51   INFO  epoch: 60/72, acc_iter=233770, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:29, time_cost(all): 2 days, 6:05:36/10:34:01, loss=0.324412303274542, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=0.9411030382232255, lr=0.039840013555914364
2023-12-07 17:01:33   INFO  epoch: 60/72, acc_iter=233820, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:40, time_cost(all): 2 days, 6:06:18/10:48:25, loss=0.324353105848601, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=4.4535438556338995, lr=0.03978051603862676
2023-12-07 17:02:14   INFO  epoch: 60/72, acc_iter=233870, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:25, time_cost(all): 2 days, 6:06:59/10:14:48, loss=0.324293908422661, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=0.9872926458895408, lr=0.03972101852133916
2023-12-07 17:02:56   INFO  epoch: 60/72, acc_iter=233920, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:12, time_cost(all): 2 days, 6:07:41/10:42:40, loss=0.32423471099672, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=2.20271323443486, lr=0.03966152100405157
2023-12-07 17:03:38   INFO  epoch: 60/72, acc_iter=233970, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:45, time_cost(all): 2 days, 6:08:23/9:56:29, loss=0.324175513570779, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=3.666874263998375, lr=0.03960202348676397
2023-12-07 17:04:20   INFO  epoch: 60/72, acc_iter=234020, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:50, time_cost(all): 2 days, 6:09:05/9:57:43, loss=0.324116316144838, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=0.7800603643313453, lr=0.03954252596947637
2023-12-07 17:05:02   INFO  epoch: 60/72, acc_iter=234070, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:05, time_cost(all): 2 days, 6:09:47/10:37:47, loss=0.324057118718897, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.9360112478571625, lr=0.03948302845218877
2023-12-07 17:05:43   INFO  epoch: 60/72, acc_iter=234120, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:16, time_cost(all): 2 days, 6:10:28/10:00:27, loss=0.323997921292956, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=2.859892247265349, lr=0.03942353093490117
2023-12-07 17:06:25   INFO  epoch: 60/72, acc_iter=234170, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:24, time_cost(all): 2 days, 6:11:10/10:46:55, loss=0.323938723867015, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=0.9123922688663296, lr=0.039364033417613575
2023-12-07 17:07:07   INFO  epoch: 60/72, acc_iter=234220, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:53, time_cost(all): 2 days, 6:11:52/10:02:33, loss=0.323879526441074, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=4.733211864642909, lr=0.039304535900325975
2023-12-07 17:07:49   INFO  epoch: 60/72, acc_iter=234270, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:17, time_cost(all): 2 days, 6:12:34/9:59:29, loss=0.323820329015133, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=0.9713340228174079, lr=0.039245038383038375
2023-12-07 17:08:30   INFO  epoch: 60/72, acc_iter=234320, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:12, time_cost(all): 2 days, 6:13:15/10:36:46, loss=0.323761131589192, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=3.1919657129187726, lr=0.039185540865750774
2023-12-07 17:09:12   INFO  epoch: 60/72, acc_iter=234370, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:23, time_cost(all): 2 days, 6:13:57/10:38:01, loss=0.323701934163251, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=4.512407911606913, lr=0.039126043348463174
2023-12-07 17:09:54   INFO  epoch: 60/72, acc_iter=234420, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:22, time_cost(all): 2 days, 6:14:39/10:46:50, loss=0.32364273673731, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=3.6311353735427936, lr=0.03906654583117558
2023-12-07 17:10:36   INFO  epoch: 60/72, acc_iter=234470, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:13, time_cost(all): 2 days, 6:15:21/10:15:44, loss=0.323583539311369, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=1.1337966951049883, lr=0.03900704831388798
2023-12-07 17:11:18   INFO  epoch: 60/72, acc_iter=234520, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:28, time_cost(all): 2 days, 6:16:03/10:10:38, loss=0.323524341885428, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=1.12018085467487, lr=0.03894755079660038
2023-12-07 17:11:59   INFO  epoch: 60/72, acc_iter=234570, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:05, time_cost(all): 2 days, 6:16:44/9:53:35, loss=0.323465144459487, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=1.1402622493212475, lr=0.03888805327931279
2023-12-07 17:12:41   INFO  epoch: 60/72, acc_iter=234620, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:53, time_cost(all): 2 days, 6:17:26/9:52:46, loss=0.323405947033546, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=1.0033334496697899, lr=0.03882855576202519
2023-12-07 17:13:23   INFO  epoch: 60/72, acc_iter=234670, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:31, time_cost(all): 2 days, 6:18:08/10:30:56, loss=0.323346749607606, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=4.060905452886462, lr=0.038769058244737586
2023-12-07 17:14:05   INFO  epoch: 60/72, acc_iter=234720, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:30, time_cost(all): 2 days, 6:18:50/10:14:58, loss=0.323287552181665, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=1.1053467368507972, lr=0.038709560727449986
2023-12-07 17:14:46   INFO  epoch: 60/72, acc_iter=234770, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:36, time_cost(all): 2 days, 6:19:31/10:30:38, loss=0.323228354755724, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=2.7006814682302447, lr=0.038650063210162386
2023-12-07 17:15:28   INFO  epoch: 60/72, acc_iter=234820, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:42, time_cost(all): 2 days, 6:20:13/10:03:35, loss=0.323169157329783, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=1.1485668407163971, lr=0.03859056569287479
2023-12-07 17:16:10   INFO  epoch: 60/72, acc_iter=234870, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:07, time_cost(all): 2 days, 6:20:55/10:24:14, loss=0.323109959903842, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=3.54572059226944, lr=0.03853106817558719
2023-12-07 17:16:52   INFO  epoch: 60/72, acc_iter=234920, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:37, time_cost(all): 2 days, 6:21:37/9:47:20, loss=0.323050762477901, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=4.256152644772819, lr=0.03847157065829959
2023-12-07 17:17:34   INFO  epoch: 60/72, acc_iter=234970, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:22, time_cost(all): 2 days, 6:22:19/9:55:52, loss=0.32299156505196, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=1.2471145104735235, lr=0.038412073141012
2023-12-07 17:18:15   INFO  epoch: 60/72, acc_iter=235020, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:40, time_cost(all): 2 days, 6:23:00/10:05:59, loss=0.322932367626019, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=1.748056291040868, lr=0.03835257562372439
2023-12-07 17:18:57   INFO  epoch: 60/72, acc_iter=235070, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:46, time_cost(all): 2 days, 6:23:42/10:15:32, loss=0.322873170200078, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=1.5913225096883308, lr=0.0382930781064368
2023-12-07 17:19:39   INFO  epoch: 60/72, acc_iter=235120, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:16, time_cost(all): 2 days, 6:24:24/10:29:19, loss=0.322813972774137, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=0.9577577258885066, lr=0.0382335805891492
2023-12-07 17:20:21   INFO  epoch: 60/72, acc_iter=235170, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:29, time_cost(all): 2 days, 6:25:06/10:17:06, loss=0.322754775348196, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=4.831707988329946, lr=0.0381740830718616
2023-12-07 17:21:02   INFO  epoch: 60/72, acc_iter=235220, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:17, time_cost(all): 2 days, 6:25:47/10:03:17, loss=0.322695577922255, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=1.118351407077761, lr=0.038114585554574004
2023-12-07 17:21:44   INFO  epoch: 60/72, acc_iter=235270, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:08, time_cost(all): 2 days, 6:26:29/9:53:56, loss=0.322636380496314, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=2.7747172458635325, lr=0.038055088037286404
2023-12-07 17:22:26   INFO  epoch: 60/72, acc_iter=235320, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:42, time_cost(all): 2 days, 6:27:11/9:57:58, loss=0.322577183070373, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=4.5126954628107, lr=0.037995590519998804
2023-12-07 17:23:08   INFO  epoch: 60/72, acc_iter=235370, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:48, time_cost(all): 2 days, 6:27:53/10:08:10, loss=0.322517985644432, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=4.421063090445135, lr=0.0379360930027112
2023-12-07 17:23:50   INFO  epoch: 60/72, acc_iter=235420, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:09, time_cost(all): 2 days, 6:28:35/9:53:43, loss=0.322458788218491, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=2.2350718586626686, lr=0.0378765954854236
2023-12-07 17:24:31   INFO  epoch: 60/72, acc_iter=235470, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:35, time_cost(all): 2 days, 6:29:16/10:31:48, loss=0.32239959079255, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=2.0056882327467007, lr=0.03781709796813601
2023-12-07 17:25:13   INFO  epoch: 60/72, acc_iter=235520, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 2 days, 6:29:58/9:36:46, loss=0.32234039336661, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=3.6411003489176474, lr=0.03775760045084841
2023-12-07 17:25:55   INFO  epoch: 60/72, acc_iter=235570, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 6:30:40/10:21:51, loss=0.322281195940669, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=4.214965992034587, lr=0.03769810293356081
2023-12-07 17:26:37   INFO  epoch: 61/72, acc_iter=235632, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:03, time_cost(all): 2 days, 6:31:22/10:20:53, loss=0.322207791132502, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=4.234407893247945, lr=0.03762432601212419
2023-12-07 17:27:18   INFO  epoch: 61/72, acc_iter=235682, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:14, time_cost(all): 2 days, 6:32:03/10:04:29, loss=0.322148593706561, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=0.9065885049349336, lr=0.03756482849483659
2023-12-07 17:28:00   INFO  epoch: 61/72, acc_iter=235732, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:52:05, time_cost(all): 2 days, 6:32:45/9:34:49, loss=0.32208939628062, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=4.252256834556872, lr=0.03750533097754899
2023-12-07 17:28:42   INFO  epoch: 61/72, acc_iter=235782, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:15, time_cost(all): 2 days, 6:33:27/10:10:20, loss=0.322030198854679, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=4.704913184927193, lr=0.037445833460261396
2023-12-07 17:29:24   INFO  epoch: 61/72, acc_iter=235832, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:05, time_cost(all): 2 days, 6:34:09/9:33:54, loss=0.321971001428738, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=3.3259054548834017, lr=0.03738633594297379
2023-12-07 17:30:06   INFO  epoch: 61/72, acc_iter=235882, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:04, time_cost(all): 2 days, 6:34:51/9:45:19, loss=0.321911804002797, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=1.6645226171735754, lr=0.037326838425686196
2023-12-07 17:30:47   INFO  epoch: 61/72, acc_iter=235932, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:37, time_cost(all): 2 days, 6:35:32/9:35:46, loss=0.321852606576856, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=1.6987326133114098, lr=0.037267340908398595
2023-12-07 17:31:29   INFO  epoch: 61/72, acc_iter=235982, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:50:04, time_cost(all): 2 days, 6:36:14/10:17:19, loss=0.321793409150915, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=1.9653187139783306, lr=0.037207843391110995
2023-12-07 17:32:11   INFO  epoch: 61/72, acc_iter=236032, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:04, time_cost(all): 2 days, 6:36:56/10:04:01, loss=0.321734211724974, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=1.6901831194006152, lr=0.037148345873823395
2023-12-07 17:32:53   INFO  epoch: 61/72, acc_iter=236082, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:39, time_cost(all): 2 days, 6:37:38/9:26:43, loss=0.321675014299033, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=4.077314990721661, lr=0.0370888483565358
2023-12-07 17:33:35   INFO  epoch: 61/72, acc_iter=236132, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:48:10, time_cost(all): 2 days, 6:38:20/9:42:17, loss=0.321615816873092, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=3.348735591882648, lr=0.0370293508392482
2023-12-07 17:34:16   INFO  epoch: 61/72, acc_iter=236182, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:44:18, time_cost(all): 2 days, 6:39:01/9:27:56, loss=0.321556619447151, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=4.7113181102483335, lr=0.0369698533219606
2023-12-07 17:34:58   INFO  epoch: 61/72, acc_iter=236232, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:45, time_cost(all): 2 days, 6:39:43/10:07:57, loss=0.321497422021211, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=4.207868412107526, lr=0.036910355804673
2023-12-07 17:35:40   INFO  epoch: 61/72, acc_iter=236282, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:16, time_cost(all): 2 days, 6:40:25/9:49:59, loss=0.32143822459527, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.234669229923709, lr=0.03685085828738541
2023-12-07 17:36:22   INFO  epoch: 61/72, acc_iter=236332, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:46, time_cost(all): 2 days, 6:41:07/10:10:03, loss=0.321379027169329, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=1.8168925003974632, lr=0.03679136077009781
2023-12-07 17:37:03   INFO  epoch: 61/72, acc_iter=236382, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:03, time_cost(all): 2 days, 6:41:48/10:08:17, loss=0.321319829743388, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=1.9225322876146254, lr=0.03673186325281021
2023-12-07 17:37:45   INFO  epoch: 61/72, acc_iter=236432, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:08, time_cost(all): 2 days, 6:42:30/10:06:06, loss=0.321260632317447, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=2.0055998105391173, lr=0.03667236573552261
2023-12-07 17:38:27   INFO  epoch: 61/72, acc_iter=236482, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:14, time_cost(all): 2 days, 6:43:12/9:28:22, loss=0.321201434891506, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=1.114011192920604, lr=0.036612868218235006
2023-12-07 17:39:09   INFO  epoch: 61/72, acc_iter=236532, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:38:43, time_cost(all): 2 days, 6:43:54/9:58:22, loss=0.321142237465565, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=3.0529106388457112, lr=0.03655337070094741
2023-12-07 17:39:51   INFO  epoch: 61/72, acc_iter=236582, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:07, time_cost(all): 2 days, 6:44:36/10:04:47, loss=0.321083040039624, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=2.37889355371636, lr=0.03649387318365981
2023-12-07 17:40:32   INFO  epoch: 61/72, acc_iter=236632, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:41:02, time_cost(all): 2 days, 6:45:17/9:23:39, loss=0.321023842613683, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=1.1273006756464596, lr=0.03643437566637221
2023-12-07 17:41:14   INFO  epoch: 61/72, acc_iter=236682, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:44, time_cost(all): 2 days, 6:45:59/9:59:37, loss=0.320964645187742, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=3.087123838513837, lr=0.03637487814908461
2023-12-07 17:41:56   INFO  epoch: 61/72, acc_iter=236732, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:52, time_cost(all): 2 days, 6:46:41/9:47:34, loss=0.320905447761801, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=4.8468543380080416, lr=0.03631538063179702
2023-12-07 17:42:38   INFO  epoch: 61/72, acc_iter=236782, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:05, time_cost(all): 2 days, 6:47:23/10:02:05, loss=0.32084625033586, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=2.3311209016537604, lr=0.03625588311450942
2023-12-07 17:43:19   INFO  epoch: 61/72, acc_iter=236832, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:49, time_cost(all): 2 days, 6:48:04/9:39:36, loss=0.320787052909919, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=1.0829082174112752, lr=0.03619638559722182
2023-12-07 17:44:01   INFO  epoch: 61/72, acc_iter=236882, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:09, time_cost(all): 2 days, 6:48:46/9:34:49, loss=0.320727855483978, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=2.5196020094872518, lr=0.03613688807993422
2023-12-07 17:44:43   INFO  epoch: 61/72, acc_iter=236932, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:29, time_cost(all): 2 days, 6:49:28/9:27:07, loss=0.320668658058037, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=0.5950045859641593, lr=0.036077390562646625
2023-12-07 17:45:25   INFO  epoch: 61/72, acc_iter=236982, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:36, time_cost(all): 2 days, 6:50:10/9:23:28, loss=0.320609460632096, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=0.8575278010952848, lr=0.036017893045359024
2023-12-07 17:46:07   INFO  epoch: 61/72, acc_iter=237032, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:48, time_cost(all): 2 days, 6:50:52/9:55:53, loss=0.320550263206155, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=3.9890195706672587, lr=0.035958395528071424
2023-12-07 17:46:48   INFO  epoch: 61/72, acc_iter=237082, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:06, time_cost(all): 2 days, 6:51:33/9:24:54, loss=0.320491065780215, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=3.64741584000536, lr=0.035898898010783824
2023-12-07 17:47:30   INFO  epoch: 61/72, acc_iter=237132, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:04, time_cost(all): 2 days, 6:52:15/9:51:57, loss=0.320431868354274, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=2.77162159940217, lr=0.035839400493496223
2023-12-07 17:48:12   INFO  epoch: 61/72, acc_iter=237182, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:32, time_cost(all): 2 days, 6:52:57/9:52:35, loss=0.320372670928333, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=1.1207016432796524, lr=0.03577990297620863
2023-12-07 17:48:54   INFO  epoch: 61/72, acc_iter=237232, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:34, time_cost(all): 2 days, 6:53:39/9:46:11, loss=0.320313473502392, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=3.849142816271936, lr=0.03572040545892103
2023-12-07 17:49:35   INFO  epoch: 61/72, acc_iter=237282, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:54, time_cost(all): 2 days, 6:54:20/9:20:47, loss=0.320254276076451, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=3.390363950355489, lr=0.03566090794163343
2023-12-07 17:50:17   INFO  epoch: 61/72, acc_iter=237332, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:04, time_cost(all): 2 days, 6:55:02/9:43:40, loss=0.32019507865051, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=3.9887219131888716, lr=0.03560141042434583
2023-12-07 17:50:59   INFO  epoch: 61/72, acc_iter=237382, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:28, time_cost(all): 2 days, 6:55:44/9:10:40, loss=0.320135881224569, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=1.9260159121287272, lr=0.035541912907058236
2023-12-07 17:51:41   INFO  epoch: 61/72, acc_iter=237432, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:46, time_cost(all): 2 days, 6:56:26/9:56:14, loss=0.320076683798628, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=2.479462281347459, lr=0.035482415389770636
2023-12-07 17:52:23   INFO  epoch: 61/72, acc_iter=237482, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:05, time_cost(all): 2 days, 6:57:08/9:23:42, loss=0.320017486372687, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=4.758500109404, lr=0.035422917872483035
2023-12-07 17:53:04   INFO  epoch: 61/72, acc_iter=237532, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:05, time_cost(all): 2 days, 6:57:49/9:11:10, loss=0.319958288946746, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=2.6453754991927845, lr=0.035363420355195435
2023-12-07 17:53:46   INFO  epoch: 61/72, acc_iter=237582, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:19, time_cost(all): 2 days, 6:58:31/9:44:51, loss=0.319899091520805, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.727119361635061, lr=0.03530392283790784
2023-12-07 17:54:28   INFO  epoch: 61/72, acc_iter=237632, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:27, time_cost(all): 2 days, 6:59:13/9:24:32, loss=0.319839894094864, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=2.421563889482613, lr=0.03524442532062024
2023-12-07 17:55:10   INFO  epoch: 61/72, acc_iter=237682, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:28, time_cost(all): 2 days, 6:59:55/9:27:48, loss=0.319780696668923, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=3.7405848327680786, lr=0.03518492780333264
2023-12-07 17:55:51   INFO  epoch: 61/72, acc_iter=237732, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:24, time_cost(all): 2 days, 7:00:36/9:44:21, loss=0.319721499242982, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=0.7678831633463142, lr=0.03512543028604504
2023-12-07 17:56:33   INFO  epoch: 61/72, acc_iter=237782, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:59, time_cost(all): 2 days, 7:01:18/9:35:25, loss=0.319662301817041, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=2.4304758353022593, lr=0.03506593276875744
2023-12-07 17:57:15   INFO  epoch: 61/72, acc_iter=237832, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:29, time_cost(all): 2 days, 7:02:00/9:39:09, loss=0.3196031043911, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=3.0708206108213485, lr=0.03500643525146985
2023-12-07 17:57:57   INFO  epoch: 61/72, acc_iter=237882, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:59, time_cost(all): 2 days, 7:02:42/9:33:07, loss=0.319543906965159, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=2.4765470913080856, lr=0.03494693773418225
2023-12-07 17:58:39   INFO  epoch: 61/72, acc_iter=237932, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:43, time_cost(all): 2 days, 7:03:24/9:27:03, loss=0.319484709539219, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.1325345480670554, lr=0.03488744021689465
2023-12-07 17:59:20   INFO  epoch: 61/72, acc_iter=237982, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:53, time_cost(all): 2 days, 7:04:05/9:53:23, loss=0.319425512113278, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=4.30017050926463, lr=0.034827942699607047
2023-12-07 18:00:02   INFO  epoch: 61/72, acc_iter=238032, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:17, time_cost(all): 2 days, 7:04:47/9:02:32, loss=0.319366314687337, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=1.2376950746430688, lr=0.03476844518231945
2023-12-07 18:00:44   INFO  epoch: 61/72, acc_iter=238082, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:09, time_cost(all): 2 days, 7:05:29/9:21:00, loss=0.319307117261396, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=2.1189671852031484, lr=0.03470894766503185
2023-12-07 18:01:26   INFO  epoch: 61/72, acc_iter=238132, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:32, time_cost(all): 2 days, 7:06:11/8:58:51, loss=0.319247919835455, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=0.6761940383184843, lr=0.03464945014774425
2023-12-07 18:02:07   INFO  epoch: 61/72, acc_iter=238182, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:16, time_cost(all): 2 days, 7:06:52/9:51:27, loss=0.319188722409514, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=3.809320514406193, lr=0.03458995263045665
2023-12-07 18:02:49   INFO  epoch: 61/72, acc_iter=238232, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:08, time_cost(all): 2 days, 7:07:34/9:27:58, loss=0.319129524983573, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=2.4069768429089953, lr=0.03453045511316906
2023-12-07 18:03:31   INFO  epoch: 61/72, acc_iter=238282, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:31, time_cost(all): 2 days, 7:08:16/9:35:58, loss=0.319070327557632, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=1.0651559646744502, lr=0.03447095759588146
2023-12-07 18:04:13   INFO  epoch: 61/72, acc_iter=238332, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:58, time_cost(all): 2 days, 7:08:58/9:06:10, loss=0.319011130131691, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=3.082512570529282, lr=0.03441146007859386
2023-12-07 18:04:55   INFO  epoch: 61/72, acc_iter=238382, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:46, time_cost(all): 2 days, 7:09:40/9:06:25, loss=0.31895193270575, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=3.8402883169106525, lr=0.034351962561306265
2023-12-07 18:05:36   INFO  epoch: 61/72, acc_iter=238432, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:39, time_cost(all): 2 days, 7:10:21/9:17:48, loss=0.318892735279809, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=4.8460510013032, lr=0.03429246504401866
2023-12-07 18:06:18   INFO  epoch: 61/72, acc_iter=238482, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:10, time_cost(all): 2 days, 7:11:03/9:43:15, loss=0.318833537853868, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=0.5079307129781845, lr=0.034232967526731065
2023-12-07 18:07:00   INFO  epoch: 61/72, acc_iter=238532, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:00, time_cost(all): 2 days, 7:11:45/9:14:11, loss=0.318774340427927, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=1.8147090768430545, lr=0.034173470009443464
2023-12-07 18:07:42   INFO  epoch: 61/72, acc_iter=238582, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:48, time_cost(all): 2 days, 7:12:27/9:13:32, loss=0.318715143001986, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=4.088154927792873, lr=0.034113972492155864
2023-12-07 18:08:24   INFO  epoch: 61/72, acc_iter=238632, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:02, time_cost(all): 2 days, 7:13:09/9:20:37, loss=0.318655945576045, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=4.9618309280125805, lr=0.03405447497486827
2023-12-07 18:09:05   INFO  epoch: 61/72, acc_iter=238682, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:48, time_cost(all): 2 days, 7:13:50/9:26:21, loss=0.318596748150104, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=1.020909624495934, lr=0.03399497745758067
2023-12-07 18:09:47   INFO  epoch: 61/72, acc_iter=238732, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:16, time_cost(all): 2 days, 7:14:32/9:15:55, loss=0.318537550724163, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=4.132987013570628, lr=0.03393547994029307
2023-12-07 18:10:29   INFO  epoch: 61/72, acc_iter=238782, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:06, time_cost(all): 2 days, 7:15:14/9:33:32, loss=0.318478353298223, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=1.8579527070425639, lr=0.03387598242300547
2023-12-07 18:11:11   INFO  epoch: 61/72, acc_iter=238832, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:26, time_cost(all): 2 days, 7:15:56/9:04:25, loss=0.318419155872282, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=1.0251262739353857, lr=0.03381648490571787
2023-12-07 18:11:52   INFO  epoch: 61/72, acc_iter=238882, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:35, time_cost(all): 2 days, 7:16:37/9:11:46, loss=0.318359958446341, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=2.187972887136982, lr=0.033756987388430276
2023-12-07 18:12:34   INFO  epoch: 61/72, acc_iter=238932, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:54, time_cost(all): 2 days, 7:17:19/9:07:16, loss=0.3183007610204, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=3.011954665971238, lr=0.03369748987114267
2023-12-07 18:13:16   INFO  epoch: 61/72, acc_iter=238982, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:39, time_cost(all): 2 days, 7:18:01/9:25:30, loss=0.318241563594459, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=1.6404623637660858, lr=0.033637992353855076
2023-12-07 18:13:58   INFO  epoch: 61/72, acc_iter=239032, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:52, time_cost(all): 2 days, 7:18:43/9:36:41, loss=0.318182366168518, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=0.5539460406007197, lr=0.03357849483656748
2023-12-07 18:14:40   INFO  epoch: 61/72, acc_iter=239082, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:01, time_cost(all): 2 days, 7:19:25/9:08:09, loss=0.318123168742577, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=1.5787089058989103, lr=0.033518997319279875
2023-12-07 18:15:21   INFO  epoch: 61/72, acc_iter=239132, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:10, time_cost(all): 2 days, 7:20:06/9:14:54, loss=0.318063971316636, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=2.081715474742977, lr=0.03345949980199228
2023-12-07 18:16:03   INFO  epoch: 61/72, acc_iter=239182, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:49, time_cost(all): 2 days, 7:20:48/9:19:05, loss=0.318004773890695, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=2.385775161541582, lr=0.03340000228470468
2023-12-07 18:16:45   INFO  epoch: 61/72, acc_iter=239232, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:00, time_cost(all): 2 days, 7:21:30/9:34:40, loss=0.317945576464754, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=4.692129734937771, lr=0.03334050476741708
2023-12-07 18:17:27   INFO  epoch: 61/72, acc_iter=239282, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:12, time_cost(all): 2 days, 7:22:12/8:53:42, loss=0.317886379038813, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=4.9035630734833076, lr=0.03328100725012949
2023-12-07 18:18:08   INFO  epoch: 61/72, acc_iter=239332, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:32, time_cost(all): 2 days, 7:22:53/9:36:39, loss=0.317827181612872, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=4.618788181318419, lr=0.03322150973284189
2023-12-07 18:18:50   INFO  epoch: 61/72, acc_iter=239382, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:54, time_cost(all): 2 days, 7:23:35/9:31:10, loss=0.317767984186931, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=2.4190062923631013, lr=0.03316201221555429
2023-12-07 18:19:32   INFO  epoch: 61/72, acc_iter=239432, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 7:24:17/9:28:53, loss=0.31770878676099, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=2.793342580194826, lr=0.03310251469826669
2023-12-07 18:20:14   INFO  epoch: 62/72, acc_iter=239494, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:10, time_cost(all): 2 days, 7:24:59/8:49:42, loss=0.317635381952824, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=2.4144010794702746, lr=0.03302873777683006
2023-12-07 18:20:56   INFO  epoch: 62/72, acc_iter=239544, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:20, time_cost(all): 2 days, 7:25:41/8:45:16, loss=0.317576184526883, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=3.93171229437944, lr=0.03296924025954247
2023-12-07 18:21:37   INFO  epoch: 62/72, acc_iter=239594, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:07, time_cost(all): 2 days, 7:26:22/9:17:11, loss=0.317516987100942, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=1.9849469715762582, lr=0.03290974274225487
2023-12-07 18:22:19   INFO  epoch: 62/72, acc_iter=239644, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:48:31, time_cost(all): 2 days, 7:27:04/9:10:55, loss=0.317457789675001, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=4.047812196856627, lr=0.03285024522496727
2023-12-07 18:23:01   INFO  epoch: 62/72, acc_iter=239694, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:57, time_cost(all): 2 days, 7:27:46/9:29:15, loss=0.31739859224906, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.213630324884937, lr=0.032790747707679674
2023-12-07 18:23:43   INFO  epoch: 62/72, acc_iter=239744, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:44, time_cost(all): 2 days, 7:28:28/9:15:40, loss=0.317339394823119, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=1.7051045282887975, lr=0.03273125019039207
2023-12-07 18:24:24   INFO  epoch: 62/72, acc_iter=239794, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:49, time_cost(all): 2 days, 7:29:09/8:36:40, loss=0.317280197397178, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=3.469275477113673, lr=0.03267175267310447
2023-12-07 18:25:06   INFO  epoch: 62/72, acc_iter=239844, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:09, time_cost(all): 2 days, 7:29:51/9:14:36, loss=0.317220999971237, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=4.615426294786393, lr=0.03261225515581688
2023-12-07 18:25:48   INFO  epoch: 62/72, acc_iter=239894, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:59, time_cost(all): 2 days, 7:30:33/8:48:46, loss=0.317161802545296, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=1.1931507388283031, lr=0.03255275763852927
2023-12-07 18:26:30   INFO  epoch: 62/72, acc_iter=239944, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:44:36, time_cost(all): 2 days, 7:31:15/9:18:10, loss=0.317102605119355, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=2.772234738311953, lr=0.03249326012124168
2023-12-07 18:27:12   INFO  epoch: 62/72, acc_iter=239994, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:54, time_cost(all): 2 days, 7:31:57/8:44:01, loss=0.317043407693414, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=4.43565907284325, lr=0.03243376260395408
2023-12-07 18:27:53   INFO  epoch: 62/72, acc_iter=240044, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:24, time_cost(all): 2 days, 7:32:38/8:32:58, loss=0.316984210267473, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=4.2420955644533365, lr=0.03237426508666648
2023-12-07 18:28:35   INFO  epoch: 62/72, acc_iter=240094, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:09, time_cost(all): 2 days, 7:33:20/8:43:52, loss=0.316925012841532, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=0.6390126407316772, lr=0.032314767569378886
2023-12-07 18:29:17   INFO  epoch: 62/72, acc_iter=240144, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:19, time_cost(all): 2 days, 7:34:02/9:04:22, loss=0.316865815415591, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=2.215872631857824, lr=0.03225527005209128
2023-12-07 18:29:59   INFO  epoch: 62/72, acc_iter=240194, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:43:47, time_cost(all): 2 days, 7:34:44/8:33:28, loss=0.31680661798965, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=2.4162186428659567, lr=0.032195772534803685
2023-12-07 18:30:40   INFO  epoch: 62/72, acc_iter=240244, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:53, time_cost(all): 2 days, 7:35:25/9:20:56, loss=0.316747420563709, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=2.047953211947066, lr=0.03213627501751609
2023-12-07 18:31:22   INFO  epoch: 62/72, acc_iter=240294, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:06, time_cost(all): 2 days, 7:36:07/8:32:33, loss=0.316688223137768, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.785767058120009, lr=0.032076777500228484
2023-12-07 18:32:04   INFO  epoch: 62/72, acc_iter=240344, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:25, time_cost(all): 2 days, 7:36:49/9:00:19, loss=0.316629025711828, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=1.87830412175268, lr=0.03201727998294089
2023-12-07 18:32:46   INFO  epoch: 62/72, acc_iter=240394, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:29, time_cost(all): 2 days, 7:37:31/9:13:08, loss=0.316569828285887, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=4.732750062728645, lr=0.031957782465653284
2023-12-07 18:33:28   INFO  epoch: 62/72, acc_iter=240444, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:33, time_cost(all): 2 days, 7:38:13/8:51:04, loss=0.316510630859946, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=4.876713232340355, lr=0.03189828494836569
2023-12-07 18:34:09   INFO  epoch: 62/72, acc_iter=240494, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:48, time_cost(all): 2 days, 7:38:54/9:00:09, loss=0.316451433434005, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=0.7908227065993678, lr=0.0318387874310781
2023-12-07 18:34:51   INFO  epoch: 62/72, acc_iter=240544, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:36, time_cost(all): 2 days, 7:39:36/8:40:03, loss=0.316392236008064, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=4.274223867607112, lr=0.03177928991379049
2023-12-07 18:35:33   INFO  epoch: 62/72, acc_iter=240594, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:23, time_cost(all): 2 days, 7:40:18/8:34:21, loss=0.316333038582123, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=1.974138143926908, lr=0.0317197923965029
2023-12-07 18:36:15   INFO  epoch: 62/72, acc_iter=240644, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:53, time_cost(all): 2 days, 7:41:00/8:30:05, loss=0.316273841156182, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=3.1057668082099767, lr=0.031660294879215296
2023-12-07 18:36:56   INFO  epoch: 62/72, acc_iter=240694, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:04, time_cost(all): 2 days, 7:41:41/8:36:05, loss=0.316214643730241, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=2.9215103721596685, lr=0.031600797361927696
2023-12-07 18:37:38   INFO  epoch: 62/72, acc_iter=240744, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:57, time_cost(all): 2 days, 7:42:23/8:54:38, loss=0.3161554463043, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=2.517352801620598, lr=0.0315412998446401
2023-12-07 18:38:20   INFO  epoch: 62/72, acc_iter=240794, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:49, time_cost(all): 2 days, 7:43:05/8:56:40, loss=0.316096248878359, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=3.0284514321945997, lr=0.031481802327352496
2023-12-07 18:39:02   INFO  epoch: 62/72, acc_iter=240844, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:50, time_cost(all): 2 days, 7:43:47/8:27:27, loss=0.316037051452418, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=3.1728125243899377, lr=0.0314223048100649
2023-12-07 18:39:44   INFO  epoch: 62/72, acc_iter=240894, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:35:08, time_cost(all): 2 days, 7:44:29/8:22:09, loss=0.315977854026477, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=0.758610643331795, lr=0.03136280729277731
2023-12-07 18:40:25   INFO  epoch: 62/72, acc_iter=240944, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:28, time_cost(all): 2 days, 7:45:10/8:37:13, loss=0.315918656600536, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=4.923623508257503, lr=0.0313033097754897
2023-12-07 18:41:07   INFO  epoch: 62/72, acc_iter=240994, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:46, time_cost(all): 2 days, 7:45:52/9:05:54, loss=0.315859459174595, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.05(1.03), norm=2.7637182157326343, lr=0.031243812258202105
2023-12-07 18:41:49   INFO  epoch: 62/72, acc_iter=241044, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:18, time_cost(all): 2 days, 7:46:34/8:53:30, loss=0.315800261748654, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.5556435493699885, lr=0.031184314740914505
2023-12-07 18:42:31   INFO  epoch: 62/72, acc_iter=241094, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:11, time_cost(all): 2 days, 7:47:16/8:27:33, loss=0.315741064322713, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=4.0757263145611065, lr=0.031124817223626908
2023-12-07 18:43:12   INFO  epoch: 62/72, acc_iter=241144, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:50, time_cost(all): 2 days, 7:47:57/8:49:37, loss=0.315681866896772, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.85(1.03), norm=4.910937740694511, lr=0.031065319706339307
2023-12-07 18:43:54   INFO  epoch: 62/72, acc_iter=241194, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:00, time_cost(all): 2 days, 7:48:39/8:56:37, loss=0.315622669470832, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=3.5666715070607795, lr=0.03100582218905171
2023-12-07 18:44:36   INFO  epoch: 62/72, acc_iter=241244, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:31, time_cost(all): 2 days, 7:49:21/8:44:28, loss=0.315563472044891, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=1.025708680684864, lr=0.030946324671764114
2023-12-07 18:45:18   INFO  epoch: 62/72, acc_iter=241294, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:31, time_cost(all): 2 days, 7:50:03/8:38:46, loss=0.31550427461895, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.4388353308073083, lr=0.030886827154476514
2023-12-07 18:46:00   INFO  epoch: 62/72, acc_iter=241344, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:09, time_cost(all): 2 days, 7:50:45/8:32:10, loss=0.315445077193009, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=2.055673281131404, lr=0.030827329637188913
2023-12-07 18:46:41   INFO  epoch: 62/72, acc_iter=241394, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:52, time_cost(all): 2 days, 7:51:26/8:37:26, loss=0.315385879767068, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=4.711996480386245, lr=0.030767832119901316
2023-12-07 18:47:23   INFO  epoch: 62/72, acc_iter=241444, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:01, time_cost(all): 2 days, 7:52:08/8:24:24, loss=0.315326682341127, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=4.390747185641149, lr=0.030708334602613716
2023-12-07 18:48:05   INFO  epoch: 62/72, acc_iter=241494, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:57, time_cost(all): 2 days, 7:52:50/8:34:19, loss=0.315267484915186, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=4.506795532016694, lr=0.03064883708532612
2023-12-07 18:48:47   INFO  epoch: 62/72, acc_iter=241544, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:20, time_cost(all): 2 days, 7:53:32/8:39:53, loss=0.315208287489245, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=4.989271744599741, lr=0.030589339568038523
2023-12-07 18:49:29   INFO  epoch: 62/72, acc_iter=241594, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:56, time_cost(all): 2 days, 7:54:14/8:37:06, loss=0.315149090063304, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=4.910673248976506, lr=0.030529842050750922
2023-12-07 18:50:10   INFO  epoch: 62/72, acc_iter=241644, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:24:00, time_cost(all): 2 days, 7:54:55/8:39:41, loss=0.315089892637363, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=4.019122830349566, lr=0.030470344533463322
2023-12-07 18:50:52   INFO  epoch: 62/72, acc_iter=241694, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:22, time_cost(all): 2 days, 7:55:37/8:31:22, loss=0.315030695211422, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=4.345242619656531, lr=0.030410847016175722
2023-12-07 18:51:34   INFO  epoch: 62/72, acc_iter=241744, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:43, time_cost(all): 2 days, 7:56:19/8:34:42, loss=0.314971497785481, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=3.953623637002957, lr=0.030351349498888125
2023-12-07 18:52:16   INFO  epoch: 62/72, acc_iter=241794, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:32, time_cost(all): 2 days, 7:57:01/8:19:06, loss=0.31491230035954, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=1.0938207089058198, lr=0.030291851981600525
2023-12-07 18:52:57   INFO  epoch: 62/72, acc_iter=241844, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:14, time_cost(all): 2 days, 7:57:42/8:39:22, loss=0.314853102933599, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=1.084471069235784, lr=0.030232354464312928
2023-12-07 18:53:39   INFO  epoch: 62/72, acc_iter=241894, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:17, time_cost(all): 2 days, 7:58:24/8:11:24, loss=0.314793905507658, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=4.118743296535035, lr=0.03017285694702533
2023-12-07 18:54:21   INFO  epoch: 62/72, acc_iter=241944, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:38, time_cost(all): 2 days, 7:59:06/8:52:15, loss=0.314734708081717, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.5821676389679444, lr=0.03011335942973773
2023-12-07 18:55:03   INFO  epoch: 62/72, acc_iter=241994, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:33, time_cost(all): 2 days, 7:59:48/8:09:08, loss=0.314675510655776, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=1.8774532892194256, lr=0.03005386191245013
2023-12-07 18:55:45   INFO  epoch: 62/72, acc_iter=242044, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:26, time_cost(all): 2 days, 8:00:30/8:12:08, loss=0.314616313229836, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=1.780525241935901, lr=0.029994364395162534
2023-12-07 18:56:26   INFO  epoch: 62/72, acc_iter=242094, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:10, time_cost(all): 2 days, 8:01:11/8:54:40, loss=0.314557115803895, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=2.992672111125894, lr=0.029934866877874933
2023-12-07 18:57:08   INFO  epoch: 62/72, acc_iter=242144, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:48, time_cost(all): 2 days, 8:01:53/8:36:05, loss=0.314497918377954, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=1.0603612868539594, lr=0.029875369360587337
2023-12-07 18:57:50   INFO  epoch: 62/72, acc_iter=242194, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:52, time_cost(all): 2 days, 8:02:35/8:18:05, loss=0.314438720952013, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=1.8478440761909118, lr=0.02981587184329974
2023-12-07 18:58:32   INFO  epoch: 62/72, acc_iter=242244, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:30, time_cost(all): 2 days, 8:03:17/8:45:16, loss=0.314379523526072, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=2.567770338944528, lr=0.02975637432601214
2023-12-07 18:59:13   INFO  epoch: 62/72, acc_iter=242294, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:28, time_cost(all): 2 days, 8:03:58/8:14:04, loss=0.314320326100131, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=1.2736822727660644, lr=0.02969687680872454
2023-12-07 18:59:55   INFO  epoch: 62/72, acc_iter=242344, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:50, time_cost(all): 2 days, 8:04:40/8:07:19, loss=0.31426112867419, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=3.792329551042249, lr=0.02963737929143694
2023-12-07 19:00:37   INFO  epoch: 62/72, acc_iter=242394, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:11, time_cost(all): 2 days, 8:05:22/8:42:26, loss=0.314201931248249, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.8925631856548133, lr=0.029577881774149342
2023-12-07 19:01:19   INFO  epoch: 62/72, acc_iter=242444, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:28, time_cost(all): 2 days, 8:06:04/8:19:06, loss=0.314142733822308, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=4.347278233850345, lr=0.029518384256861742
2023-12-07 19:02:01   INFO  epoch: 62/72, acc_iter=242494, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:23, time_cost(all): 2 days, 8:06:46/8:27:57, loss=0.314083536396367, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=2.6269855750359454, lr=0.029458886739574145
2023-12-07 19:02:42   INFO  epoch: 62/72, acc_iter=242544, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:42, time_cost(all): 2 days, 8:07:27/8:31:09, loss=0.314024338970426, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=4.267765773937407, lr=0.02939938922228655
2023-12-07 19:03:24   INFO  epoch: 62/72, acc_iter=242594, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:38, time_cost(all): 2 days, 8:08:09/8:24:54, loss=0.313965141544485, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=1.4242640546100467, lr=0.029339891704998948
2023-12-07 19:04:06   INFO  epoch: 62/72, acc_iter=242644, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:00, time_cost(all): 2 days, 8:08:51/8:16:45, loss=0.313905944118544, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=3.6615894956117505, lr=0.029280394187711348
2023-12-07 19:04:48   INFO  epoch: 62/72, acc_iter=242694, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:14, time_cost(all): 2 days, 8:09:33/8:32:55, loss=0.313846746692603, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=0.7150950580791398, lr=0.02922089667042375
2023-12-07 19:05:29   INFO  epoch: 62/72, acc_iter=242744, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:58, time_cost(all): 2 days, 8:10:14/8:01:27, loss=0.313787549266662, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=2.9571768218340835, lr=0.02916139915313615
2023-12-07 19:06:11   INFO  epoch: 62/72, acc_iter=242794, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:12, time_cost(all): 2 days, 8:10:56/8:00:24, loss=0.313728351840721, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=3.1192499237157536, lr=0.029101901635848554
2023-12-07 19:06:53   INFO  epoch: 62/72, acc_iter=242844, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:33, time_cost(all): 2 days, 8:11:38/8:26:12, loss=0.31366915441478, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=1.2549408859841074, lr=0.029042404118560954
2023-12-07 19:07:35   INFO  epoch: 62/72, acc_iter=242894, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:46, time_cost(all): 2 days, 8:12:20/8:37:47, loss=0.31360995698884, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=1.5608057545275815, lr=0.028982906601273357
2023-12-07 19:08:17   INFO  epoch: 62/72, acc_iter=242944, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:49, time_cost(all): 2 days, 8:13:02/8:15:56, loss=0.313550759562899, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=4.859363836299337, lr=0.028923409083985756
2023-12-07 19:08:58   INFO  epoch: 62/72, acc_iter=242994, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:25, time_cost(all): 2 days, 8:13:43/8:09:54, loss=0.313491562136958, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=1.5640326136912441, lr=0.02886391156669816
2023-12-07 19:09:40   INFO  epoch: 62/72, acc_iter=243044, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:41, time_cost(all): 2 days, 8:14:25/8:07:35, loss=0.313432364711017, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=3.4406232719002636, lr=0.02880441404941056
2023-12-07 19:10:22   INFO  epoch: 62/72, acc_iter=243094, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:53, time_cost(all): 2 days, 8:15:07/8:29:20, loss=0.313373167285076, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=1.8528898072692268, lr=0.02874491653212296
2023-12-07 19:11:04   INFO  epoch: 62/72, acc_iter=243144, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:15, time_cost(all): 2 days, 8:15:49/8:19:55, loss=0.313313969859135, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=2.8868329775921877, lr=0.028685419014835362
2023-12-07 19:11:45   INFO  epoch: 62/72, acc_iter=243194, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:34, time_cost(all): 2 days, 8:16:30/7:52:26, loss=0.313254772433194, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=0.7008738177629508, lr=0.028625921497547765
2023-12-07 19:12:27   INFO  epoch: 62/72, acc_iter=243244, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 2 days, 8:17:12/7:55:22, loss=0.313195575007253, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=3.4269153940113153, lr=0.028566423980260165
2023-12-07 19:13:09   INFO  epoch: 62/72, acc_iter=243294, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 8:17:54/8:19:24, loss=0.313136377581312, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=1.4443692567376905, lr=0.028506926462972565
2023-12-07 19:13:51   INFO  epoch: 63/72, acc_iter=243356, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:51, time_cost(all): 2 days, 8:18:36/8:12:11, loss=0.313062972773145, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=3.787940474057617, lr=0.028433149541535946
2023-12-07 19:14:33   INFO  epoch: 63/72, acc_iter=243406, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:57, time_cost(all): 2 days, 8:19:18/8:03:25, loss=0.313003775347204, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=3.3994993086342875, lr=0.028373652024248346
2023-12-07 19:15:14   INFO  epoch: 63/72, acc_iter=243456, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:14, time_cost(all): 2 days, 8:19:59/8:24:05, loss=0.312944577921263, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=1.0619059678869296, lr=0.028314154506960745
2023-12-07 19:15:56   INFO  epoch: 63/72, acc_iter=243506, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:07, time_cost(all): 2 days, 8:20:41/7:53:40, loss=0.312885380495322, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=0.7905580980002498, lr=0.02825465698967315
2023-12-07 19:16:38   INFO  epoch: 63/72, acc_iter=243556, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:48:38, time_cost(all): 2 days, 8:21:23/8:32:48, loss=0.312826183069381, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=4.363874250766299, lr=0.028195159472385548
2023-12-07 19:17:20   INFO  epoch: 63/72, acc_iter=243606, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:48:54, time_cost(all): 2 days, 8:22:05/7:53:37, loss=0.312766985643441, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=1.9720169311760622, lr=0.028135661955097948
2023-12-07 19:18:01   INFO  epoch: 63/72, acc_iter=243656, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:51:18, time_cost(all): 2 days, 8:22:46/8:10:22, loss=0.3127077882175, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=3.9382523777321365, lr=0.02807616443781035
2023-12-07 19:18:43   INFO  epoch: 63/72, acc_iter=243706, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:50:11, time_cost(all): 2 days, 8:23:28/7:53:20, loss=0.312648590791559, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=2.43989645892051, lr=0.028016666920522754
2023-12-07 19:19:25   INFO  epoch: 63/72, acc_iter=243756, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:58, time_cost(all): 2 days, 8:24:10/7:53:19, loss=0.312589393365618, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=4.886669358360011, lr=0.027957169403235154
2023-12-07 19:20:07   INFO  epoch: 63/72, acc_iter=243806, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:01, time_cost(all): 2 days, 8:24:52/8:16:52, loss=0.312530195939677, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=4.966850900317947, lr=0.027897671885947557
2023-12-07 19:20:49   INFO  epoch: 63/72, acc_iter=243856, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:53, time_cost(all): 2 days, 8:25:34/7:54:23, loss=0.312470998513736, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.98(1.03), norm=1.7547021696959724, lr=0.027838174368659957
2023-12-07 19:21:30   INFO  epoch: 63/72, acc_iter=243906, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:58, time_cost(all): 2 days, 8:26:15/7:59:17, loss=0.312411801087795, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=1.649110338245582, lr=0.027778676851372357
2023-12-07 19:22:12   INFO  epoch: 63/72, acc_iter=243956, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:22, time_cost(all): 2 days, 8:26:57/8:15:59, loss=0.312352603661854, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=4.579781088535896, lr=0.02771917933408476
2023-12-07 19:22:54   INFO  epoch: 63/72, acc_iter=244006, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:45:54, time_cost(all): 2 days, 8:27:39/7:46:58, loss=0.312293406235913, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=4.413959271794089, lr=0.02765968181679716
2023-12-07 19:23:36   INFO  epoch: 63/72, acc_iter=244056, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:53, time_cost(all): 2 days, 8:28:21/8:08:42, loss=0.312234208809972, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=4.144180861645568, lr=0.027600184299509563
2023-12-07 19:24:18   INFO  epoch: 63/72, acc_iter=244106, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:49, time_cost(all): 2 days, 8:29:03/7:53:08, loss=0.312175011384031, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=4.884284809193273, lr=0.027540686782221963
2023-12-07 19:24:59   INFO  epoch: 63/72, acc_iter=244156, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:35, time_cost(all): 2 days, 8:29:44/7:58:56, loss=0.31211581395809, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=2.000213776416851, lr=0.027481189264934366
2023-12-07 19:25:41   INFO  epoch: 63/72, acc_iter=244206, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:33, time_cost(all): 2 days, 8:30:26/8:00:15, loss=0.312056616532149, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=1.4287489582430462, lr=0.027421691747646765
2023-12-07 19:26:23   INFO  epoch: 63/72, acc_iter=244256, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:38:46, time_cost(all): 2 days, 8:31:08/8:05:38, loss=0.311997419106208, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=0.7797149731792468, lr=0.027362194230359165
2023-12-07 19:27:05   INFO  epoch: 63/72, acc_iter=244306, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:39, time_cost(all): 2 days, 8:31:50/7:41:39, loss=0.311938221680267, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=4.0912661205311345, lr=0.02730269671307157
2023-12-07 19:27:46   INFO  epoch: 63/72, acc_iter=244356, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:41, time_cost(all): 2 days, 8:32:31/7:58:08, loss=0.311879024254326, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=4.771490483249839, lr=0.027243199195783968
2023-12-07 19:28:28   INFO  epoch: 63/72, acc_iter=244406, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:38:25, time_cost(all): 2 days, 8:33:13/8:05:10, loss=0.311819826828385, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=4.472456712244874, lr=0.027183701678496375
2023-12-07 19:29:10   INFO  epoch: 63/72, acc_iter=244456, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:15, time_cost(all): 2 days, 8:33:55/7:39:51, loss=0.311760629402445, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=3.6565131236089528, lr=0.027124204161208774
2023-12-07 19:29:52   INFO  epoch: 63/72, acc_iter=244506, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:26, time_cost(all): 2 days, 8:34:37/7:56:14, loss=0.311701431976504, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=2.180068246682146, lr=0.027064706643921174
2023-12-07 19:30:34   INFO  epoch: 63/72, acc_iter=244556, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:40, time_cost(all): 2 days, 8:35:19/8:18:07, loss=0.311642234550563, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=3.901745664584669, lr=0.027005209126633574
2023-12-07 19:31:15   INFO  epoch: 63/72, acc_iter=244606, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:45, time_cost(all): 2 days, 8:36:00/7:57:06, loss=0.311583037124622, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=2.994695746603774, lr=0.026945711609345977
2023-12-07 19:31:57   INFO  epoch: 63/72, acc_iter=244656, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:34, time_cost(all): 2 days, 8:36:42/7:35:49, loss=0.311523839698681, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=4.796790650378987, lr=0.026886214092058377
2023-12-07 19:32:39   INFO  epoch: 63/72, acc_iter=244706, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:45, time_cost(all): 2 days, 8:37:24/7:39:08, loss=0.31146464227274, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=2.455510685497527, lr=0.02682671657477078
2023-12-07 19:33:21   INFO  epoch: 63/72, acc_iter=244756, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:35:03, time_cost(all): 2 days, 8:38:06/7:57:58, loss=0.311405444846799, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=0.9858608775880842, lr=0.026767219057483183
2023-12-07 19:34:02   INFO  epoch: 63/72, acc_iter=244806, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:31:41, time_cost(all): 2 days, 8:38:47/7:49:10, loss=0.311346247420858, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=3.9279099455257125, lr=0.026707721540195583
2023-12-07 19:34:44   INFO  epoch: 63/72, acc_iter=244856, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:30:57, time_cost(all): 2 days, 8:39:29/8:10:33, loss=0.311287049994917, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=2.768640395303601, lr=0.026648224022907983
2023-12-07 19:35:26   INFO  epoch: 63/72, acc_iter=244906, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:46, time_cost(all): 2 days, 8:40:11/7:37:45, loss=0.311227852568976, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=3.3263627944188445, lr=0.026588726505620382
2023-12-07 19:36:08   INFO  epoch: 63/72, acc_iter=244956, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:58, time_cost(all): 2 days, 8:40:53/7:44:56, loss=0.311168655143035, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=1.4398924057817681, lr=0.026529228988332786
2023-12-07 19:36:50   INFO  epoch: 63/72, acc_iter=245006, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:22, time_cost(all): 2 days, 8:41:35/7:31:27, loss=0.311109457717094, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=2.3517075156951317, lr=0.02646973147104519
2023-12-07 19:37:31   INFO  epoch: 63/72, acc_iter=245056, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:02, time_cost(all): 2 days, 8:42:16/7:54:46, loss=0.311050260291153, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=0.884365629189533, lr=0.026410233953757592
2023-12-07 19:38:13   INFO  epoch: 63/72, acc_iter=245106, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:30:06, time_cost(all): 2 days, 8:42:58/8:03:24, loss=0.310991062865212, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=3.452715350390853, lr=0.02635073643646999
2023-12-07 19:38:55   INFO  epoch: 63/72, acc_iter=245156, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:53, time_cost(all): 2 days, 8:43:40/7:55:14, loss=0.310931865439271, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.233019036299535, lr=0.02629123891918239
2023-12-07 19:39:37   INFO  epoch: 63/72, acc_iter=245206, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:26, time_cost(all): 2 days, 8:44:22/7:40:55, loss=0.31087266801333, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.86(1.03), norm=0.8615610988554236, lr=0.02623174140189479
2023-12-07 19:40:18   INFO  epoch: 63/72, acc_iter=245256, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:40, time_cost(all): 2 days, 8:45:03/7:30:22, loss=0.310813470587389, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=1.9425677375669508, lr=0.02617224388460719
2023-12-07 19:41:00   INFO  epoch: 63/72, acc_iter=245306, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:03, time_cost(all): 2 days, 8:45:45/8:02:47, loss=0.310754273161449, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=2.2694399188181165, lr=0.026112746367319594
2023-12-07 19:41:42   INFO  epoch: 63/72, acc_iter=245356, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:35, time_cost(all): 2 days, 8:46:27/7:32:05, loss=0.310695075735508, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=3.25664378613956, lr=0.026053248850031997
2023-12-07 19:42:24   INFO  epoch: 63/72, acc_iter=245406, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:40, time_cost(all): 2 days, 8:47:09/7:35:51, loss=0.310635878309567, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=4.8855398752011485, lr=0.0259937513327444
2023-12-07 19:43:06   INFO  epoch: 63/72, acc_iter=245456, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:36, time_cost(all): 2 days, 8:47:51/7:58:20, loss=0.310576680883626, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=4.547268903600032, lr=0.0259342538154568
2023-12-07 19:43:47   INFO  epoch: 63/72, acc_iter=245506, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:23:08, time_cost(all): 2 days, 8:48:32/7:28:51, loss=0.310517483457685, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=0.6587293120426121, lr=0.0258747562981692
2023-12-07 19:44:29   INFO  epoch: 63/72, acc_iter=245556, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:13, time_cost(all): 2 days, 8:49:14/7:33:41, loss=0.310458286031744, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=0.5490774080511582, lr=0.0258152587808816
2023-12-07 19:45:11   INFO  epoch: 63/72, acc_iter=245606, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:12, time_cost(all): 2 days, 8:49:56/7:35:19, loss=0.310399088605803, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=4.093283208883825, lr=0.025755761263594003
2023-12-07 19:45:53   INFO  epoch: 63/72, acc_iter=245656, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:22:03, time_cost(all): 2 days, 8:50:38/7:47:00, loss=0.310339891179862, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=1.184313990973647, lr=0.025696263746306406
2023-12-07 19:46:34   INFO  epoch: 63/72, acc_iter=245706, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:14, time_cost(all): 2 days, 8:51:19/7:46:30, loss=0.310280693753921, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=1.2283621220766114, lr=0.025636766229018806
2023-12-07 19:47:16   INFO  epoch: 63/72, acc_iter=245756, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:51, time_cost(all): 2 days, 8:52:01/7:50:02, loss=0.31022149632798, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=3.208461519219287, lr=0.02557726871173121
2023-12-07 19:47:58   INFO  epoch: 63/72, acc_iter=245806, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:30, time_cost(all): 2 days, 8:52:43/7:17:05, loss=0.310162298902039, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=3.0062464508407563, lr=0.02551777119444361
2023-12-07 19:48:40   INFO  epoch: 63/72, acc_iter=245856, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:11, time_cost(all): 2 days, 8:53:25/7:40:35, loss=0.310103101476098, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=3.7416859169588377, lr=0.02545827367715601
2023-12-07 19:49:22   INFO  epoch: 63/72, acc_iter=245906, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:45, time_cost(all): 2 days, 8:54:07/7:46:56, loss=0.310043904050157, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=0.6129180735712418, lr=0.025398776159868408
2023-12-07 19:50:03   INFO  epoch: 63/72, acc_iter=245956, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:47, time_cost(all): 2 days, 8:54:48/7:42:14, loss=0.309984706624216, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=0.57667060558551, lr=0.02533927864258081
2023-12-07 19:50:45   INFO  epoch: 63/72, acc_iter=246006, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:00, time_cost(all): 2 days, 8:55:30/7:35:49, loss=0.309925509198275, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=3.3709902092236432, lr=0.025279781125293214
2023-12-07 19:51:27   INFO  epoch: 63/72, acc_iter=246056, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:14, time_cost(all): 2 days, 8:56:12/7:16:29, loss=0.309866311772334, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=4.678037236480422, lr=0.025220283608005618
2023-12-07 19:52:09   INFO  epoch: 63/72, acc_iter=246106, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:47, time_cost(all): 2 days, 8:56:54/7:37:06, loss=0.309807114346393, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=1.4667496486471872, lr=0.025160786090718017
2023-12-07 19:52:50   INFO  epoch: 63/72, acc_iter=246156, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:39, time_cost(all): 2 days, 8:57:35/7:35:20, loss=0.309747916920453, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=3.900787146753591, lr=0.025101288573430417
2023-12-07 19:53:32   INFO  epoch: 63/72, acc_iter=246206, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:49, time_cost(all): 2 days, 8:58:17/7:50:07, loss=0.309688719494512, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=1.4378616815194911, lr=0.025041791056142817
2023-12-07 19:54:14   INFO  epoch: 63/72, acc_iter=246256, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:09, time_cost(all): 2 days, 8:58:59/7:39:50, loss=0.309629522068571, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=3.297750190891546, lr=0.02498229353885522
2023-12-07 19:54:56   INFO  epoch: 63/72, acc_iter=246306, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:25, time_cost(all): 2 days, 8:59:41/7:15:53, loss=0.30957032464263, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=4.642097636676304, lr=0.024922796021567623
2023-12-07 19:55:38   INFO  epoch: 63/72, acc_iter=246356, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:59, time_cost(all): 2 days, 9:00:23/7:42:56, loss=0.309511127216689, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=0.8025942965520616, lr=0.024863298504280023
2023-12-07 19:56:19   INFO  epoch: 63/72, acc_iter=246406, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:29, time_cost(all): 2 days, 9:01:04/7:40:17, loss=0.309451929790748, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=4.2577542174762755, lr=0.024803800986992426
2023-12-07 19:57:01   INFO  epoch: 63/72, acc_iter=246456, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:33, time_cost(all): 2 days, 9:01:46/7:10:01, loss=0.309392732364807, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=2.1122634086965526, lr=0.024744303469704826
2023-12-07 19:57:43   INFO  epoch: 63/72, acc_iter=246506, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:40, time_cost(all): 2 days, 9:02:28/7:22:28, loss=0.309333534938866, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=2.3632983446374842, lr=0.024684805952417226
2023-12-07 19:58:25   INFO  epoch: 63/72, acc_iter=246556, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:45, time_cost(all): 2 days, 9:03:10/7:10:30, loss=0.309274337512925, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=0.9040381432801596, lr=0.024625308435129625
2023-12-07 19:59:07   INFO  epoch: 63/72, acc_iter=246606, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:58, time_cost(all): 2 days, 9:03:52/7:42:36, loss=0.309215140086984, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=2.8246010883631114, lr=0.02456581091784203
2023-12-07 19:59:48   INFO  epoch: 63/72, acc_iter=246656, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:51, time_cost(all): 2 days, 9:04:33/7:14:32, loss=0.309155942661043, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=1.5866528489452048, lr=0.02450631340055443
2023-12-07 20:00:30   INFO  epoch: 63/72, acc_iter=246706, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:25, time_cost(all): 2 days, 9:05:15/7:26:39, loss=0.309096745235102, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.1260321153142634, lr=0.024446815883266835
2023-12-07 20:01:12   INFO  epoch: 63/72, acc_iter=246756, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:57, time_cost(all): 2 days, 9:05:57/7:30:59, loss=0.309037547809161, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=3.015714861940587, lr=0.024387318365979235
2023-12-07 20:01:54   INFO  epoch: 63/72, acc_iter=246806, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:04, time_cost(all): 2 days, 9:06:39/7:11:17, loss=0.30897835038322, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=3.9481151316614445, lr=0.024327820848691634
2023-12-07 20:02:35   INFO  epoch: 63/72, acc_iter=246856, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:19, time_cost(all): 2 days, 9:07:20/7:39:14, loss=0.308919152957279, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=0.6572083075444141, lr=0.024268323331404034
2023-12-07 20:03:17   INFO  epoch: 63/72, acc_iter=246906, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:37, time_cost(all): 2 days, 9:08:02/7:11:54, loss=0.308859955531338, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=0.6037361642193221, lr=0.024208825814116437
2023-12-07 20:03:59   INFO  epoch: 63/72, acc_iter=246956, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:48, time_cost(all): 2 days, 9:08:44/7:23:35, loss=0.308800758105397, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=0.7647550944522509, lr=0.02414932829682884
2023-12-07 20:04:41   INFO  epoch: 63/72, acc_iter=247006, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:13, time_cost(all): 2 days, 9:09:26/7:17:26, loss=0.308741560679457, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=4.9779402046017776, lr=0.02408983077954124
2023-12-07 20:05:23   INFO  epoch: 63/72, acc_iter=247056, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:29, time_cost(all): 2 days, 9:10:08/7:00:20, loss=0.308682363253516, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=1.7353031458907375, lr=0.024030333262253643
2023-12-07 20:06:04   INFO  epoch: 63/72, acc_iter=247106, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:49, time_cost(all): 2 days, 9:10:49/7:32:57, loss=0.308623165827575, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=3.2950220868172497, lr=0.023970835744966043
2023-12-07 20:06:46   INFO  epoch: 63/72, acc_iter=247156, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 9:11:31/7:19:04, loss=0.308563968401634, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.93(1.03), norm=3.1276834582588133, lr=0.023911338227678443
2023-12-07 20:07:28   INFO  epoch: 64/72, acc_iter=247218, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:03, time_cost(all): 2 days, 9:12:13/7:16:56, loss=0.308490563593467, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=2.9746449048417762, lr=0.02383756130624182
2023-12-07 20:08:10   INFO  epoch: 64/72, acc_iter=247268, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:53:00, time_cost(all): 2 days, 9:12:55/7:39:34, loss=0.308431366167526, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=0.5982767429771529, lr=0.023778063788954227
2023-12-07 20:08:51   INFO  epoch: 64/72, acc_iter=247318, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:47, time_cost(all): 2 days, 9:13:36/7:35:08, loss=0.308372168741585, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=3.122103155565677, lr=0.023718566271666627
2023-12-07 20:09:33   INFO  epoch: 64/72, acc_iter=247368, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:14, time_cost(all): 2 days, 9:14:18/7:36:47, loss=0.308312971315644, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=1.690791002720323, lr=0.023659068754379026
2023-12-07 20:10:15   INFO  epoch: 64/72, acc_iter=247418, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:33, time_cost(all): 2 days, 9:15:00/7:37:44, loss=0.308253773889703, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=4.960291844084873, lr=0.023599571237091426
2023-12-07 20:10:57   INFO  epoch: 64/72, acc_iter=247468, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:09, time_cost(all): 2 days, 9:15:42/7:22:38, loss=0.308194576463762, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.6918904278808058, lr=0.02354007371980383
2023-12-07 20:11:39   INFO  epoch: 64/72, acc_iter=247518, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:33, time_cost(all): 2 days, 9:16:24/6:58:14, loss=0.308135379037821, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=2.794660166951081, lr=0.02348057620251623
2023-12-07 20:12:20   INFO  epoch: 64/72, acc_iter=247568, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:50:07, time_cost(all): 2 days, 9:17:05/7:32:29, loss=0.30807618161188, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=3.6691578803917793, lr=0.02342107868522863
2023-12-07 20:13:02   INFO  epoch: 64/72, acc_iter=247618, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:42, time_cost(all): 2 days, 9:17:47/6:55:08, loss=0.308016984185939, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=4.383040872386134, lr=0.023361581167941035
2023-12-07 20:13:44   INFO  epoch: 64/72, acc_iter=247668, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:33, time_cost(all): 2 days, 9:18:29/6:59:36, loss=0.307957786759998, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=2.358515338095388, lr=0.023302083650653435
2023-12-07 20:14:26   INFO  epoch: 64/72, acc_iter=247718, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:44, time_cost(all): 2 days, 9:19:11/7:09:12, loss=0.307898589334058, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=0.5401537218521637, lr=0.023242586133365835
2023-12-07 20:15:07   INFO  epoch: 64/72, acc_iter=247768, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:46:03, time_cost(all): 2 days, 9:19:52/7:03:01, loss=0.307839391908117, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=1.8288460763084713, lr=0.023183088616078235
2023-12-07 20:15:49   INFO  epoch: 64/72, acc_iter=247818, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:45:38, time_cost(all): 2 days, 9:20:34/7:30:48, loss=0.307780194482176, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=3.7633977758347013, lr=0.023123591098790638
2023-12-07 20:16:31   INFO  epoch: 64/72, acc_iter=247868, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:33, time_cost(all): 2 days, 9:21:16/7:09:06, loss=0.307720997056235, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.03(1.03), norm=2.247616949211926, lr=0.023064093581503037
2023-12-07 20:17:13   INFO  epoch: 64/72, acc_iter=247918, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:31, time_cost(all): 2 days, 9:21:58/7:17:12, loss=0.307661799630294, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=1.9234765529129052, lr=0.023004596064215444
2023-12-07 20:17:55   INFO  epoch: 64/72, acc_iter=247968, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:04, time_cost(all): 2 days, 9:22:40/6:56:29, loss=0.307602602204353, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=1.444377980328588, lr=0.022945098546927844
2023-12-07 20:18:36   INFO  epoch: 64/72, acc_iter=248018, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:47, time_cost(all): 2 days, 9:23:21/7:25:57, loss=0.307543404778412, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=2.2260096980072546, lr=0.022885601029640244
2023-12-07 20:19:18   INFO  epoch: 64/72, acc_iter=248068, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:17, time_cost(all): 2 days, 9:24:03/7:23:48, loss=0.307484207352471, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=3.074055828656272, lr=0.022826103512352643
2023-12-07 20:20:00   INFO  epoch: 64/72, acc_iter=248118, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:42:23, time_cost(all): 2 days, 9:24:45/7:23:27, loss=0.30742500992653, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=0.9095878675594673, lr=0.022766605995065043
2023-12-07 20:20:42   INFO  epoch: 64/72, acc_iter=248168, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:38:50, time_cost(all): 2 days, 9:25:27/7:14:56, loss=0.307365812500589, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=3.396166638003224, lr=0.022707108477777446
2023-12-07 20:21:23   INFO  epoch: 64/72, acc_iter=248218, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:41:04, time_cost(all): 2 days, 9:26:08/7:14:40, loss=0.307306615074648, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=3.602263667544519, lr=0.022647610960489846
2023-12-07 20:22:05   INFO  epoch: 64/72, acc_iter=248268, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:45, time_cost(all): 2 days, 9:26:50/6:48:19, loss=0.307247417648707, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=2.127107955691149, lr=0.022588113443202253
2023-12-07 20:22:47   INFO  epoch: 64/72, acc_iter=248318, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:11, time_cost(all): 2 days, 9:27:32/7:05:02, loss=0.307188220222766, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=2.508910684175518, lr=0.022528615925914652
2023-12-07 20:23:29   INFO  epoch: 64/72, acc_iter=248368, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:34, time_cost(all): 2 days, 9:28:14/7:16:58, loss=0.307129022796825, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=0.5166004073296224, lr=0.022469118408627052
2023-12-07 20:24:11   INFO  epoch: 64/72, acc_iter=248418, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:41, time_cost(all): 2 days, 9:28:56/6:48:04, loss=0.307069825370884, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=2.8677289621432873, lr=0.022409620891339452
2023-12-07 20:24:52   INFO  epoch: 64/72, acc_iter=248468, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:58, time_cost(all): 2 days, 9:29:37/6:58:03, loss=0.307010627944943, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=4.305143660381572, lr=0.022350123374051855
2023-12-07 20:25:34   INFO  epoch: 64/72, acc_iter=248518, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:43, time_cost(all): 2 days, 9:30:19/7:12:05, loss=0.306951430519002, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=3.848282254312852, lr=0.022290625856764255
2023-12-07 20:26:16   INFO  epoch: 64/72, acc_iter=248568, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:06, time_cost(all): 2 days, 9:31:01/6:48:48, loss=0.306892233093062, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=3.256806733797956, lr=0.02223112833947666
2023-12-07 20:26:58   INFO  epoch: 64/72, acc_iter=248618, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:33, time_cost(all): 2 days, 9:31:43/6:40:47, loss=0.306833035667121, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=2.012868618661296, lr=0.02217163082218906
2023-12-07 20:27:39   INFO  epoch: 64/72, acc_iter=248668, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:01, time_cost(all): 2 days, 9:32:24/7:10:39, loss=0.30677383824118, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=1.9365494181957434, lr=0.02211213330490146
2023-12-07 20:28:21   INFO  epoch: 64/72, acc_iter=248718, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:32:29, time_cost(all): 2 days, 9:33:06/7:06:27, loss=0.306714640815239, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=2.2164569231104383, lr=0.02205263578761386
2023-12-07 20:29:03   INFO  epoch: 64/72, acc_iter=248768, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:34, time_cost(all): 2 days, 9:33:48/6:50:11, loss=0.306655443389298, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=2.9014508344757557, lr=0.02199313827032626
2023-12-07 20:29:45   INFO  epoch: 64/72, acc_iter=248818, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:49, time_cost(all): 2 days, 9:34:30/7:02:34, loss=0.306596245963357, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=4.47625489588958, lr=0.021933640753038663
2023-12-07 20:30:27   INFO  epoch: 64/72, acc_iter=248868, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:23, time_cost(all): 2 days, 9:35:12/7:05:34, loss=0.306537048537416, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=2.5675324873451286, lr=0.021874143235751063
2023-12-07 20:31:08   INFO  epoch: 64/72, acc_iter=248918, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:25, time_cost(all): 2 days, 9:35:53/7:15:51, loss=0.306477851111475, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=1.8106567962577853, lr=0.02181464571846347
2023-12-07 20:31:50   INFO  epoch: 64/72, acc_iter=248968, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:19, time_cost(all): 2 days, 9:36:35/7:00:11, loss=0.306418653685534, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=2.7354051473959093, lr=0.02175514820117587
2023-12-07 20:32:32   INFO  epoch: 64/72, acc_iter=249018, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:22, time_cost(all): 2 days, 9:37:17/6:53:33, loss=0.306359456259593, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=1.6913491594439454, lr=0.02169565068388827
2023-12-07 20:33:14   INFO  epoch: 64/72, acc_iter=249068, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:51, time_cost(all): 2 days, 9:37:59/7:12:13, loss=0.306300258833652, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=3.159307558778124, lr=0.02163615316660067
2023-12-07 20:33:56   INFO  epoch: 64/72, acc_iter=249118, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:33, time_cost(all): 2 days, 9:38:41/7:12:26, loss=0.306241061407711, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=0.56407536548096, lr=0.021576655649313072
2023-12-07 20:34:37   INFO  epoch: 64/72, acc_iter=249168, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:54, time_cost(all): 2 days, 9:39:22/7:05:21, loss=0.30618186398177, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=0.9497396973032646, lr=0.021517158132025472
2023-12-07 20:35:19   INFO  epoch: 64/72, acc_iter=249218, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:26, time_cost(all): 2 days, 9:40:04/6:58:49, loss=0.306122666555829, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=0.8116527444203643, lr=0.021457660614737875
2023-12-07 20:36:01   INFO  epoch: 64/72, acc_iter=249268, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:12, time_cost(all): 2 days, 9:40:46/6:35:46, loss=0.306063469129888, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=0.7498142196043254, lr=0.02139816309745028
2023-12-07 20:36:43   INFO  epoch: 64/72, acc_iter=249318, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:48, time_cost(all): 2 days, 9:41:28/6:58:57, loss=0.306004271703947, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=4.190810525041587, lr=0.021338665580162678
2023-12-07 20:37:24   INFO  epoch: 64/72, acc_iter=249368, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:01, time_cost(all): 2 days, 9:42:09/6:59:15, loss=0.305945074278006, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=2.510703024336355, lr=0.021279168062875078
2023-12-07 20:38:06   INFO  epoch: 64/72, acc_iter=249418, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:17, time_cost(all): 2 days, 9:42:51/6:47:35, loss=0.305885876852066, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=1.8497186058071566, lr=0.021219670545587477
2023-12-07 20:38:48   INFO  epoch: 64/72, acc_iter=249468, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:03, time_cost(all): 2 days, 9:43:33/6:51:09, loss=0.305826679426125, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=1.130965313505642, lr=0.02116017302829988
2023-12-07 20:39:30   INFO  epoch: 64/72, acc_iter=249518, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:51, time_cost(all): 2 days, 9:44:15/6:39:22, loss=0.305767482000184, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=2.2253303872896177, lr=0.02110067551101228
2023-12-07 20:40:12   INFO  epoch: 64/72, acc_iter=249568, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:21:08, time_cost(all): 2 days, 9:44:57/6:44:26, loss=0.305708284574243, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=3.2693370472968497, lr=0.021041177993724687
2023-12-07 20:40:53   INFO  epoch: 64/72, acc_iter=249618, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:57, time_cost(all): 2 days, 9:45:38/6:30:12, loss=0.305649087148302, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=0.8692125147552847, lr=0.020981680476437087
2023-12-07 20:41:35   INFO  epoch: 64/72, acc_iter=249668, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:50, time_cost(all): 2 days, 9:46:20/6:51:42, loss=0.305589889722361, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=1.2318791699092608, lr=0.020922182959149487
2023-12-07 20:42:17   INFO  epoch: 64/72, acc_iter=249718, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:56, time_cost(all): 2 days, 9:47:02/6:30:03, loss=0.30553069229642, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=4.99795115432704, lr=0.020862685441861886
2023-12-07 20:42:59   INFO  epoch: 64/72, acc_iter=249768, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:47, time_cost(all): 2 days, 9:47:44/6:27:03, loss=0.305471494870479, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=2.154768620400962, lr=0.02080318792457429
2023-12-07 20:43:40   INFO  epoch: 64/72, acc_iter=249818, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:08, time_cost(all): 2 days, 9:48:25/6:24:47, loss=0.305412297444538, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=4.791890793045786, lr=0.02074369040728669
2023-12-07 20:44:22   INFO  epoch: 64/72, acc_iter=249868, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:18, time_cost(all): 2 days, 9:49:07/6:38:19, loss=0.305353100018597, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=1.6164597175292483, lr=0.020684192889999092
2023-12-07 20:45:04   INFO  epoch: 64/72, acc_iter=249918, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:10, time_cost(all): 2 days, 9:49:49/6:42:21, loss=0.305293902592656, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=1.2422825500213737, lr=0.020624695372711496
2023-12-07 20:45:46   INFO  epoch: 64/72, acc_iter=249968, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:57, time_cost(all): 2 days, 9:50:31/6:59:40, loss=0.305234705166715, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=1.8845452369607603, lr=0.020565197855423895
2023-12-07 20:46:28   INFO  epoch: 64/72, acc_iter=250018, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:30, time_cost(all): 2 days, 9:51:13/6:57:55, loss=0.305175507740774, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=3.0516076871351587, lr=0.020505700338136295
2023-12-07 20:47:09   INFO  epoch: 64/72, acc_iter=250068, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:14:02, time_cost(all): 2 days, 9:51:54/6:22:55, loss=0.305116310314833, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=3.031383190529548, lr=0.020446202820848695
2023-12-07 20:47:51   INFO  epoch: 64/72, acc_iter=250118, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:06, time_cost(all): 2 days, 9:52:36/6:23:15, loss=0.305057112888892, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=4.080374557133934, lr=0.020386705303561098
2023-12-07 20:48:33   INFO  epoch: 64/72, acc_iter=250168, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:55, time_cost(all): 2 days, 9:53:18/6:32:07, loss=0.304997915462951, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=3.946277310542794, lr=0.020327207786273498
2023-12-07 20:49:15   INFO  epoch: 64/72, acc_iter=250218, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:43, time_cost(all): 2 days, 9:54:00/6:29:35, loss=0.30493871803701, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=3.3785547010236963, lr=0.020267710268985904
2023-12-07 20:49:56   INFO  epoch: 64/72, acc_iter=250268, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:49, time_cost(all): 2 days, 9:54:41/6:17:59, loss=0.304879520611069, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=2.6003601630322093, lr=0.020208212751698304
2023-12-07 20:50:38   INFO  epoch: 64/72, acc_iter=250318, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:40, time_cost(all): 2 days, 9:55:23/6:49:20, loss=0.304820323185129, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=0.5136707381431712, lr=0.020148715234410704
2023-12-07 20:51:20   INFO  epoch: 64/72, acc_iter=250368, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:13, time_cost(all): 2 days, 9:56:05/6:53:34, loss=0.304761125759188, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=0.8448459318359084, lr=0.020089217717123103
2023-12-07 20:52:02   INFO  epoch: 64/72, acc_iter=250418, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:23, time_cost(all): 2 days, 9:56:47/6:51:23, loss=0.304701928333247, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=4.211555039564829, lr=0.020029720199835507
2023-12-07 20:52:44   INFO  epoch: 64/72, acc_iter=250468, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:39, time_cost(all): 2 days, 9:57:29/6:51:22, loss=0.304642730907306, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=1.9993296510053806, lr=0.019970222682547906
2023-12-07 20:53:25   INFO  epoch: 64/72, acc_iter=250518, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:23, time_cost(all): 2 days, 9:58:10/6:49:35, loss=0.304583533481365, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=4.217002403620964, lr=0.01991072516526031
2023-12-07 20:54:07   INFO  epoch: 64/72, acc_iter=250568, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:35, time_cost(all): 2 days, 9:58:52/6:53:10, loss=0.304524336055424, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=0.6127534297526591, lr=0.019851227647972713
2023-12-07 20:54:49   INFO  epoch: 64/72, acc_iter=250618, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:36, time_cost(all): 2 days, 9:59:34/6:38:00, loss=0.304465138629483, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=4.981131096757842, lr=0.019791730130685112
2023-12-07 20:55:31   INFO  epoch: 64/72, acc_iter=250668, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:50, time_cost(all): 2 days, 10:00:16/6:39:58, loss=0.304405941203542, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=0.5135731313807976, lr=0.019732232613397512
2023-12-07 20:56:12   INFO  epoch: 64/72, acc_iter=250718, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:13, time_cost(all): 2 days, 10:00:57/6:49:01, loss=0.304346743777601, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=4.183212277662939, lr=0.019672735096109912
2023-12-07 20:56:54   INFO  epoch: 64/72, acc_iter=250768, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:38, time_cost(all): 2 days, 10:01:39/6:43:08, loss=0.30428754635166, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=4.147458022507086, lr=0.019613237578822315
2023-12-07 20:57:36   INFO  epoch: 64/72, acc_iter=250818, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:58, time_cost(all): 2 days, 10:02:21/6:30:31, loss=0.304228348925719, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=1.6156444205689475, lr=0.01955374006153472
2023-12-07 20:58:18   INFO  epoch: 64/72, acc_iter=250868, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:11, time_cost(all): 2 days, 10:03:03/6:12:19, loss=0.304169151499778, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=0.5357806249142085, lr=0.01949424254424712
2023-12-07 20:59:00   INFO  epoch: 64/72, acc_iter=250918, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:34, time_cost(all): 2 days, 10:03:45/6:24:55, loss=0.304109954073837, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=3.7933166749937586, lr=0.01943474502695952
2023-12-07 20:59:41   INFO  epoch: 64/72, acc_iter=250968, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 2 days, 10:04:26/6:28:19, loss=0.304050756647896, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=4.278453211143852, lr=0.01937524750967192
2023-12-07 21:00:23   INFO  epoch: 64/72, acc_iter=251018, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 10:05:08/6:44:56, loss=0.303991559221955, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=4.277234169608503, lr=0.01931574999238432
2023-12-07 21:01:05   INFO  epoch: 65/72, acc_iter=251080, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:52:36, time_cost(all): 2 days, 10:05:50/6:31:33, loss=0.303918154413789, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=4.524316202774792, lr=0.019241973070947698
2023-12-07 21:01:47   INFO  epoch: 65/72, acc_iter=251130, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:48, time_cost(all): 2 days, 10:06:32/6:28:51, loss=0.303858956987848, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=3.0848501735905924, lr=0.019182475553660105
2023-12-07 21:02:28   INFO  epoch: 65/72, acc_iter=251180, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:04, time_cost(all): 2 days, 10:07:13/6:14:13, loss=0.303799759561907, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=1.040988349799848, lr=0.019122978036372505
2023-12-07 21:03:10   INFO  epoch: 65/72, acc_iter=251230, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:22, time_cost(all): 2 days, 10:07:55/6:43:10, loss=0.303740562135966, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=3.456074759041877, lr=0.019063480519084904
2023-12-07 21:03:52   INFO  epoch: 65/72, acc_iter=251280, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:46, time_cost(all): 2 days, 10:08:37/6:05:37, loss=0.303681364710025, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=4.995443588360841, lr=0.019003983001797304
2023-12-07 21:04:34   INFO  epoch: 65/72, acc_iter=251330, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:46, time_cost(all): 2 days, 10:09:19/6:38:26, loss=0.303622167284084, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=4.504148404767236, lr=0.018944485484509707
2023-12-07 21:05:16   INFO  epoch: 65/72, acc_iter=251380, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:30, time_cost(all): 2 days, 10:10:01/6:05:47, loss=0.303562969858143, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.23(1.03), norm=1.1985652312530812, lr=0.018884987967222107
2023-12-07 21:05:57   INFO  epoch: 65/72, acc_iter=251430, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:30, time_cost(all): 2 days, 10:10:42/6:29:36, loss=0.303503772432202, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=3.8164025313655827, lr=0.018825490449934507
2023-12-07 21:06:39   INFO  epoch: 65/72, acc_iter=251480, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:35, time_cost(all): 2 days, 10:11:24/6:30:18, loss=0.303444575006261, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=2.4712519144568224, lr=0.018765992932646913
2023-12-07 21:07:21   INFO  epoch: 65/72, acc_iter=251530, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:40, time_cost(all): 2 days, 10:12:06/6:38:49, loss=0.30338537758032, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=3.8403628405602994, lr=0.018706495415359313
2023-12-07 21:08:03   INFO  epoch: 65/72, acc_iter=251580, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:07, time_cost(all): 2 days, 10:12:48/6:12:48, loss=0.303326180154379, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=1.9648831248522853, lr=0.018646997898071713
2023-12-07 21:08:45   INFO  epoch: 65/72, acc_iter=251630, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:26, time_cost(all): 2 days, 10:13:30/6:12:24, loss=0.303266982728438, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=3.4782326421110294, lr=0.018587500380784112
2023-12-07 21:09:26   INFO  epoch: 65/72, acc_iter=251680, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:42, time_cost(all): 2 days, 10:14:11/6:10:02, loss=0.303207785302497, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=3.479359464770507, lr=0.018528002863496512
2023-12-07 21:10:08   INFO  epoch: 65/72, acc_iter=251730, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:08, time_cost(all): 2 days, 10:14:53/6:13:24, loss=0.303148587876556, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.23(1.03), norm=3.1511195312193845, lr=0.018468505346208912
2023-12-07 21:10:50   INFO  epoch: 65/72, acc_iter=251780, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:37, time_cost(all): 2 days, 10:15:35/6:07:57, loss=0.303089390450615, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=2.4389574490932375, lr=0.01840900782892132
2023-12-07 21:11:32   INFO  epoch: 65/72, acc_iter=251830, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:44:12, time_cost(all): 2 days, 10:16:17/6:22:05, loss=0.303030193024674, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=2.768184434186188, lr=0.018349510311633718
2023-12-07 21:12:13   INFO  epoch: 65/72, acc_iter=251880, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:18, time_cost(all): 2 days, 10:16:58/6:02:49, loss=0.302970995598734, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=3.6675952525390065, lr=0.018290012794346125
2023-12-07 21:12:55   INFO  epoch: 65/72, acc_iter=251930, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:39:34, time_cost(all): 2 days, 10:17:40/6:23:05, loss=0.302911798172793, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=0.7160451267478967, lr=0.018230515277058525
2023-12-07 21:13:37   INFO  epoch: 65/72, acc_iter=251980, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:55, time_cost(all): 2 days, 10:18:22/6:19:43, loss=0.302852600746852, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=4.379032891445335, lr=0.018171017759770924
2023-12-07 21:14:19   INFO  epoch: 65/72, acc_iter=252030, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:26, time_cost(all): 2 days, 10:19:04/6:10:36, loss=0.302793403320911, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=4.329028464912399, lr=0.018111520242483324
2023-12-07 21:15:01   INFO  epoch: 65/72, acc_iter=252080, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:37:25, time_cost(all): 2 days, 10:19:46/6:26:28, loss=0.30273420589497, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=1.684373272952756, lr=0.018052022725195724
2023-12-07 21:15:42   INFO  epoch: 65/72, acc_iter=252130, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:39:13, time_cost(all): 2 days, 10:20:27/6:04:16, loss=0.302675008469029, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=4.50344557452338, lr=0.01799252520790813
2023-12-07 21:16:24   INFO  epoch: 65/72, acc_iter=252180, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:59, time_cost(all): 2 days, 10:21:09/6:01:20, loss=0.302615811043088, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=2.2960590156785394, lr=0.01793302769062053
2023-12-07 21:17:06   INFO  epoch: 65/72, acc_iter=252230, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:49, time_cost(all): 2 days, 10:21:51/6:18:59, loss=0.302556613617147, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=1.9346732458792202, lr=0.01787353017333293
2023-12-07 21:17:48   INFO  epoch: 65/72, acc_iter=252280, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:52, time_cost(all): 2 days, 10:22:33/6:27:22, loss=0.302497416191206, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=1.4254170916135094, lr=0.01781403265604533
2023-12-07 21:18:29   INFO  epoch: 65/72, acc_iter=252330, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:12, time_cost(all): 2 days, 10:23:14/6:22:56, loss=0.302438218765265, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.3246958598421634, lr=0.01775453513875773
2023-12-07 21:19:11   INFO  epoch: 65/72, acc_iter=252380, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:33:30, time_cost(all): 2 days, 10:23:56/5:50:07, loss=0.302379021339324, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=1.8234924632962155, lr=0.01769503762147013
2023-12-07 21:19:53   INFO  epoch: 65/72, acc_iter=252430, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:33:28, time_cost(all): 2 days, 10:24:38/6:18:33, loss=0.302319823913383, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=4.603857025110131, lr=0.017635540104182536
2023-12-07 21:20:35   INFO  epoch: 65/72, acc_iter=252480, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:02, time_cost(all): 2 days, 10:25:20/6:18:22, loss=0.302260626487442, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=2.095459496084654, lr=0.017576042586894935
2023-12-07 21:21:17   INFO  epoch: 65/72, acc_iter=252530, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:12, time_cost(all): 2 days, 10:26:02/6:19:27, loss=0.302201429061501, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=3.0637421949693437, lr=0.017516545069607342
2023-12-07 21:21:58   INFO  epoch: 65/72, acc_iter=252580, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:19, time_cost(all): 2 days, 10:26:43/6:12:46, loss=0.30214223163556, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=4.977508596702077, lr=0.017457047552319742
2023-12-07 21:22:40   INFO  epoch: 65/72, acc_iter=252630, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:05, time_cost(all): 2 days, 10:27:25/6:20:47, loss=0.302083034209619, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=0.6070064680088596, lr=0.01739755003503214
2023-12-07 21:23:22   INFO  epoch: 65/72, acc_iter=252680, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:36, time_cost(all): 2 days, 10:28:07/6:00:02, loss=0.302023836783679, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=3.552205006839474, lr=0.01733805251774454
2023-12-07 21:24:04   INFO  epoch: 65/72, acc_iter=252730, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:36, time_cost(all): 2 days, 10:28:49/6:11:34, loss=0.301964639357738, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=4.6143758890068565, lr=0.01727855500045694
2023-12-07 21:24:45   INFO  epoch: 65/72, acc_iter=252780, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:06, time_cost(all): 2 days, 10:29:30/5:53:12, loss=0.301905441931797, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=0.864211411837837, lr=0.017219057483169348
2023-12-07 21:25:27   INFO  epoch: 65/72, acc_iter=252830, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:22, time_cost(all): 2 days, 10:30:12/5:44:58, loss=0.301846244505856, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=1.2361659890236951, lr=0.017159559965881747
2023-12-07 21:26:09   INFO  epoch: 65/72, acc_iter=252880, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:49, time_cost(all): 2 days, 10:30:54/5:50:42, loss=0.301787047079915, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=3.6944620094677645, lr=0.017100062448594147
2023-12-07 21:26:51   INFO  epoch: 65/72, acc_iter=252930, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:17, time_cost(all): 2 days, 10:31:36/5:50:11, loss=0.301727849653974, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=2.8603049176788904, lr=0.017040564931306547
2023-12-07 21:27:33   INFO  epoch: 65/72, acc_iter=252980, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:20, time_cost(all): 2 days, 10:32:18/5:48:10, loss=0.301668652228033, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=0.6589792487487192, lr=0.016981067414018947
2023-12-07 21:28:14   INFO  epoch: 65/72, acc_iter=253030, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:47, time_cost(all): 2 days, 10:32:59/6:00:32, loss=0.301609454802092, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=2.4757202611827873, lr=0.016921569896731346
2023-12-07 21:28:56   INFO  epoch: 65/72, acc_iter=253080, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:59, time_cost(all): 2 days, 10:33:41/5:55:41, loss=0.301550257376151, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=2.963694664861885, lr=0.016862072379443753
2023-12-07 21:29:38   INFO  epoch: 65/72, acc_iter=253130, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:38, time_cost(all): 2 days, 10:34:23/6:12:55, loss=0.30149105995021, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=4.826375363638121, lr=0.016802574862156153
2023-12-07 21:30:20   INFO  epoch: 65/72, acc_iter=253180, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:01, time_cost(all): 2 days, 10:35:05/5:56:22, loss=0.301431862524269, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=1.2593288474740794, lr=0.01674307734486856
2023-12-07 21:31:01   INFO  epoch: 65/72, acc_iter=253230, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:09, time_cost(all): 2 days, 10:35:46/5:49:42, loss=0.301372665098328, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=2.6018825687427287, lr=0.01668357982758096
2023-12-07 21:31:43   INFO  epoch: 65/72, acc_iter=253280, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:34, time_cost(all): 2 days, 10:36:28/5:38:48, loss=0.301313467672387, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=0.7587976686039171, lr=0.01662408231029336
2023-12-07 21:32:25   INFO  epoch: 65/72, acc_iter=253330, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:05, time_cost(all): 2 days, 10:37:10/6:02:23, loss=0.301254270246446, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=2.1043023060701556, lr=0.01656458479300576
2023-12-07 21:33:07   INFO  epoch: 65/72, acc_iter=253380, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:00, time_cost(all): 2 days, 10:37:52/6:11:15, loss=0.301195072820505, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=0.779938870533643, lr=0.016505087275718158
2023-12-07 21:33:49   INFO  epoch: 65/72, acc_iter=253430, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:53, time_cost(all): 2 days, 10:38:34/5:37:08, loss=0.301135875394564, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=3.225261401514249, lr=0.016445589758430565
2023-12-07 21:34:30   INFO  epoch: 65/72, acc_iter=253480, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:31, time_cost(all): 2 days, 10:39:15/6:00:14, loss=0.301076677968623, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=2.198121242722287, lr=0.016386092241142965
2023-12-07 21:35:12   INFO  epoch: 65/72, acc_iter=253530, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:03, time_cost(all): 2 days, 10:39:57/6:10:02, loss=0.301017480542683, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=2.786921119830847, lr=0.016326594723855364
2023-12-07 21:35:54   INFO  epoch: 65/72, acc_iter=253580, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:09, time_cost(all): 2 days, 10:40:39/5:58:13, loss=0.300958283116742, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=3.180330037832757, lr=0.016267097206567764
2023-12-07 21:36:36   INFO  epoch: 65/72, acc_iter=253630, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:58, time_cost(all): 2 days, 10:41:21/5:54:58, loss=0.300899085690801, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=1.8643890568507744, lr=0.016207599689280164
2023-12-07 21:37:17   INFO  epoch: 65/72, acc_iter=253680, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:14, time_cost(all): 2 days, 10:42:02/5:58:20, loss=0.30083988826486, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.879416748569389, lr=0.016148102171992564
2023-12-07 21:37:59   INFO  epoch: 65/72, acc_iter=253730, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:45, time_cost(all): 2 days, 10:42:44/5:58:15, loss=0.300780690838919, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=2.912597340327882, lr=0.01608860465470497
2023-12-07 21:38:41   INFO  epoch: 65/72, acc_iter=253780, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:11, time_cost(all): 2 days, 10:43:26/5:55:14, loss=0.300721493412978, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=3.599004699825118, lr=0.01602910713741737
2023-12-07 21:39:23   INFO  epoch: 65/72, acc_iter=253830, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:26, time_cost(all): 2 days, 10:44:08/6:01:22, loss=0.300662295987037, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=3.7420994788789628, lr=0.015969609620129777
2023-12-07 21:40:05   INFO  epoch: 65/72, acc_iter=253880, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:49, time_cost(all): 2 days, 10:44:50/5:35:04, loss=0.300603098561096, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.956254261419702, lr=0.015910112102842176
2023-12-07 21:40:46   INFO  epoch: 65/72, acc_iter=253930, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:12:46, time_cost(all): 2 days, 10:45:31/5:58:37, loss=0.300543901135155, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=4.587483285238651, lr=0.015850614585554576
2023-12-07 21:41:28   INFO  epoch: 65/72, acc_iter=253980, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:41, time_cost(all): 2 days, 10:46:13/5:45:53, loss=0.300484703709214, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=1.1740081034631125, lr=0.015791117068266976
2023-12-07 21:42:10   INFO  epoch: 65/72, acc_iter=254030, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:31, time_cost(all): 2 days, 10:46:55/5:53:02, loss=0.300425506283273, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=4.52195936888749, lr=0.015731619550979375
2023-12-07 21:42:52   INFO  epoch: 65/72, acc_iter=254080, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:41, time_cost(all): 2 days, 10:47:37/5:50:35, loss=0.300366308857332, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.019469615264091, lr=0.015672122033691782
2023-12-07 21:43:34   INFO  epoch: 65/72, acc_iter=254130, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:53, time_cost(all): 2 days, 10:48:19/5:57:29, loss=0.300307111431391, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=3.0770703886623756, lr=0.015612624516404182
2023-12-07 21:44:15   INFO  epoch: 65/72, acc_iter=254180, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:35, time_cost(all): 2 days, 10:49:00/5:37:35, loss=0.30024791400545, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=2.856286885728313, lr=0.015553126999116582
2023-12-07 21:44:57   INFO  epoch: 65/72, acc_iter=254230, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:32, time_cost(all): 2 days, 10:49:42/5:50:41, loss=0.300188716579509, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.7703942045775514, lr=0.015493629481828981
2023-12-07 21:45:39   INFO  epoch: 65/72, acc_iter=254280, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:12, time_cost(all): 2 days, 10:50:24/5:49:12, loss=0.300129519153568, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=4.304343381336686, lr=0.015434131964541381
2023-12-07 21:46:21   INFO  epoch: 65/72, acc_iter=254330, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:10, time_cost(all): 2 days, 10:51:06/5:25:01, loss=0.300070321727627, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=4.120641843176333, lr=0.01537463444725378
2023-12-07 21:47:02   INFO  epoch: 65/72, acc_iter=254380, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:09, time_cost(all): 2 days, 10:51:47/5:32:22, loss=0.300011124301687, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.92(1.03), norm=4.7298392817997845, lr=0.015315136929966187
2023-12-07 21:47:44   INFO  epoch: 65/72, acc_iter=254430, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:07, time_cost(all): 2 days, 10:52:29/5:30:23, loss=0.299951926875746, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=3.6250042337366453, lr=0.015255639412678587
2023-12-07 21:48:26   INFO  epoch: 65/72, acc_iter=254480, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:35, time_cost(all): 2 days, 10:53:11/5:29:08, loss=0.299892729449805, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=2.439943029025364, lr=0.015196141895390987
2023-12-07 21:49:08   INFO  epoch: 65/72, acc_iter=254530, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:51, time_cost(all): 2 days, 10:53:53/5:35:38, loss=0.299833532023864, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=1.8150243383254792, lr=0.015136644378103394
2023-12-07 21:49:50   INFO  epoch: 65/72, acc_iter=254580, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:19, time_cost(all): 2 days, 10:54:35/5:51:27, loss=0.299774334597923, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=2.4867535855867096, lr=0.015077146860815793
2023-12-07 21:50:31   INFO  epoch: 65/72, acc_iter=254630, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:47, time_cost(all): 2 days, 10:55:16/5:34:49, loss=0.299715137171982, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=1.3968799038717186, lr=0.015017649343528193
2023-12-07 21:51:13   INFO  epoch: 65/72, acc_iter=254680, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:04, time_cost(all): 2 days, 10:55:58/5:27:25, loss=0.299655939746041, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=2.4044159869169004, lr=0.014958151826240593
2023-12-07 21:51:55   INFO  epoch: 65/72, acc_iter=254730, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:09, time_cost(all): 2 days, 10:56:40/5:51:13, loss=0.2995967423201, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=3.4642395516907385, lr=0.014898654308953
2023-12-07 21:52:37   INFO  epoch: 65/72, acc_iter=254780, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:35, time_cost(all): 2 days, 10:57:22/5:41:50, loss=0.299537544894159, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=1.705581476208232, lr=0.014839156791665399
2023-12-07 21:53:18   INFO  epoch: 65/72, acc_iter=254830, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 2 days, 10:58:03/5:34:12, loss=0.299478347468218, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=4.272450678083861, lr=0.014779659274377799
2023-12-07 21:54:00   INFO  epoch: 65/72, acc_iter=254880, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 10:58:45/5:18:29, loss=0.299419150042277, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=3.9266560378209374, lr=0.014720161757090199
2023-12-07 21:54:42   INFO  epoch: 66/72, acc_iter=254942, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:11, time_cost(all): 2 days, 10:59:27/5:49:22, loss=0.29934574523411, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=1.2452941710132328, lr=0.01464638483565358
2023-12-07 21:55:24   INFO  epoch: 66/72, acc_iter=254992, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:01, time_cost(all): 2 days, 11:00:09/5:27:17, loss=0.299286547808169, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=1.1547257632790087, lr=0.01458688731836598
2023-12-07 21:56:06   INFO  epoch: 66/72, acc_iter=255042, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:41, time_cost(all): 2 days, 11:00:51/5:47:23, loss=0.299227350382228, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=2.530357487037768, lr=0.014527389801078379
2023-12-07 21:56:47   INFO  epoch: 66/72, acc_iter=255092, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:47, time_cost(all): 2 days, 11:01:32/5:19:19, loss=0.299168152956288, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=4.697123424308602, lr=0.014467892283790786
2023-12-07 21:57:29   INFO  epoch: 66/72, acc_iter=255142, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:50:31, time_cost(all): 2 days, 11:02:14/5:18:09, loss=0.299108955530347, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=1.491392684032914, lr=0.014408394766503185
2023-12-07 21:58:11   INFO  epoch: 66/72, acc_iter=255192, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:49:02, time_cost(all): 2 days, 11:02:56/5:26:11, loss=0.299049758104406, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=1.7240601692611441, lr=0.014348897249215585
2023-12-07 21:58:53   INFO  epoch: 66/72, acc_iter=255242, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:47:36, time_cost(all): 2 days, 11:03:38/5:38:01, loss=0.298990560678465, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=1.8014453975907072, lr=0.014289399731927985
2023-12-07 21:59:34   INFO  epoch: 66/72, acc_iter=255292, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:47:19, time_cost(all): 2 days, 11:04:19/5:39:56, loss=0.298931363252524, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.88(1.03), norm=0.5401059207603839, lr=0.014229902214640384
2023-12-07 22:00:16   INFO  epoch: 66/72, acc_iter=255342, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:47:49, time_cost(all): 2 days, 11:05:01/5:13:03, loss=0.298872165826583, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=3.801429470207804, lr=0.014170404697352791
2023-12-07 22:00:58   INFO  epoch: 66/72, acc_iter=255392, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:48, time_cost(all): 2 days, 11:05:43/5:18:11, loss=0.298812968400642, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=3.865777633888973, lr=0.01411090718006519
2023-12-07 22:01:40   INFO  epoch: 66/72, acc_iter=255442, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:39, time_cost(all): 2 days, 11:06:25/5:31:45, loss=0.298753770974701, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=2.7025551639123666, lr=0.01405140966277759
2023-12-07 22:02:22   INFO  epoch: 66/72, acc_iter=255492, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:06, time_cost(all): 2 days, 11:07:07/5:14:35, loss=0.29869457354876, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=4.904055261600934, lr=0.01399191214548999
2023-12-07 22:03:03   INFO  epoch: 66/72, acc_iter=255542, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:55, time_cost(all): 2 days, 11:07:48/5:14:05, loss=0.298635376122819, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=0.8370661926347944, lr=0.01393241462820239
2023-12-07 22:03:45   INFO  epoch: 66/72, acc_iter=255592, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:46:14, time_cost(all): 2 days, 11:08:30/5:12:31, loss=0.298576178696878, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=4.356482018514809, lr=0.013872917110914797
2023-12-07 22:04:27   INFO  epoch: 66/72, acc_iter=255642, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:04, time_cost(all): 2 days, 11:09:12/5:22:30, loss=0.298516981270937, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=3.9646961782572916, lr=0.013813419593627196
2023-12-07 22:05:09   INFO  epoch: 66/72, acc_iter=255692, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:41:50, time_cost(all): 2 days, 11:09:54/5:35:56, loss=0.298457783844996, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=0.9663658686767367, lr=0.013753922076339596
2023-12-07 22:05:50   INFO  epoch: 66/72, acc_iter=255742, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:39:54, time_cost(all): 2 days, 11:10:35/5:36:51, loss=0.298398586419055, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=3.6102700858716545, lr=0.013694424559052003
2023-12-07 22:06:32   INFO  epoch: 66/72, acc_iter=255792, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:13, time_cost(all): 2 days, 11:11:17/5:14:15, loss=0.298339388993114, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=2.0981887657195477, lr=0.013634927041764403
2023-12-07 22:07:14   INFO  epoch: 66/72, acc_iter=255842, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:40:43, time_cost(all): 2 days, 11:11:59/5:07:29, loss=0.298280191567173, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=2.8693695064848503, lr=0.013575429524476802
2023-12-07 22:07:56   INFO  epoch: 66/72, acc_iter=255892, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:30, time_cost(all): 2 days, 11:12:41/5:10:56, loss=0.298220994141232, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=4.518291150044278, lr=0.013515932007189202
2023-12-07 22:08:38   INFO  epoch: 66/72, acc_iter=255942, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:38:49, time_cost(all): 2 days, 11:13:23/5:32:03, loss=0.298161796715292, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=1.7915533125295706, lr=0.013456434489901602
2023-12-07 22:09:19   INFO  epoch: 66/72, acc_iter=255992, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:43, time_cost(all): 2 days, 11:14:04/5:20:04, loss=0.298102599289351, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=0.7017124535498308, lr=0.013396936972614008
2023-12-07 22:10:01   INFO  epoch: 66/72, acc_iter=256042, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:46, time_cost(all): 2 days, 11:14:46/5:01:53, loss=0.29804340186341, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=2.937777868295666, lr=0.013337439455326408
2023-12-07 22:10:43   INFO  epoch: 66/72, acc_iter=256092, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:36:49, time_cost(all): 2 days, 11:15:28/5:13:20, loss=0.297984204437469, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=2.4282697669935347, lr=0.013277941938038808
2023-12-07 22:11:25   INFO  epoch: 66/72, acc_iter=256142, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:36:40, time_cost(all): 2 days, 11:16:10/5:11:23, loss=0.297925007011528, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=1.02119877167621, lr=0.013218444420751208
2023-12-07 22:12:06   INFO  epoch: 66/72, acc_iter=256192, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:13, time_cost(all): 2 days, 11:16:51/5:26:46, loss=0.297865809585587, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=1.6122875256553515, lr=0.013158946903463607
2023-12-07 22:12:48   INFO  epoch: 66/72, acc_iter=256242, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:19, time_cost(all): 2 days, 11:17:33/5:30:10, loss=0.297806612159646, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=1.357817535019545, lr=0.013099449386176007
2023-12-07 22:13:30   INFO  epoch: 66/72, acc_iter=256292, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:06, time_cost(all): 2 days, 11:18:15/5:06:18, loss=0.297747414733705, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=1.7894960475438138, lr=0.013039951868888414
2023-12-07 22:14:12   INFO  epoch: 66/72, acc_iter=256342, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:33:59, time_cost(all): 2 days, 11:18:57/5:29:07, loss=0.297688217307764, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=2.561781275780997, lr=0.012980454351600813
2023-12-07 22:14:54   INFO  epoch: 66/72, acc_iter=256392, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:48, time_cost(all): 2 days, 11:19:39/5:06:12, loss=0.297629019881823, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=1.8956356110399295, lr=0.01292095683431322
2023-12-07 22:15:35   INFO  epoch: 66/72, acc_iter=256442, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:33:19, time_cost(all): 2 days, 11:20:20/5:18:47, loss=0.297569822455882, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=4.355762336472109, lr=0.01286145931702562
2023-12-07 22:16:17   INFO  epoch: 66/72, acc_iter=256492, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:41, time_cost(all): 2 days, 11:21:02/5:20:45, loss=0.297510625029941, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=4.816412048723456, lr=0.01280196179973802
2023-12-07 22:16:59   INFO  epoch: 66/72, acc_iter=256542, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:35, time_cost(all): 2 days, 11:21:44/5:24:33, loss=0.297451427604, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=1.0704856657425978, lr=0.01274246428245042
2023-12-07 22:17:41   INFO  epoch: 66/72, acc_iter=256592, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:57, time_cost(all): 2 days, 11:22:26/5:04:27, loss=0.297392230178059, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=0.6499199756191256, lr=0.012682966765162819
2023-12-07 22:18:23   INFO  epoch: 66/72, acc_iter=256642, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:35, time_cost(all): 2 days, 11:23:08/5:16:55, loss=0.297333032752118, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=0.755466549597841, lr=0.012623469247875226
2023-12-07 22:19:04   INFO  epoch: 66/72, acc_iter=256692, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:03, time_cost(all): 2 days, 11:23:49/5:14:45, loss=0.297273835326177, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=3.3911862914282067, lr=0.012563971730587625
2023-12-07 22:19:46   INFO  epoch: 66/72, acc_iter=256742, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:14, time_cost(all): 2 days, 11:24:31/4:54:15, loss=0.297214637900236, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=2.5169786487436734, lr=0.012504474213300025
2023-12-07 22:20:28   INFO  epoch: 66/72, acc_iter=256792, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:09, time_cost(all): 2 days, 11:25:13/5:18:06, loss=0.297155440474295, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=1.3368925883399874, lr=0.012444976696012425
2023-12-07 22:21:10   INFO  epoch: 66/72, acc_iter=256842, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:36, time_cost(all): 2 days, 11:25:55/5:00:39, loss=0.297096243048355, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=4.589193109952631, lr=0.012385479178724824
2023-12-07 22:21:51   INFO  epoch: 66/72, acc_iter=256892, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:27:11, time_cost(all): 2 days, 11:26:36/5:07:52, loss=0.297037045622414, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=4.0777409782731535, lr=0.012325981661437224
2023-12-07 22:22:33   INFO  epoch: 66/72, acc_iter=256942, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:05, time_cost(all): 2 days, 11:27:18/5:00:10, loss=0.296977848196473, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=4.741557563167817, lr=0.01226648414414963
2023-12-07 22:23:15   INFO  epoch: 66/72, acc_iter=256992, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:18, time_cost(all): 2 days, 11:28:00/5:06:18, loss=0.296918650770532, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=3.079104611209092, lr=0.01220698662686203
2023-12-07 22:23:57   INFO  epoch: 66/72, acc_iter=257042, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:22, time_cost(all): 2 days, 11:28:42/4:48:53, loss=0.296859453344591, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=1.9656259929994335, lr=0.012147489109574437
2023-12-07 22:24:39   INFO  epoch: 66/72, acc_iter=257092, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:24:02, time_cost(all): 2 days, 11:29:24/5:13:12, loss=0.29680025591865, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=2.9742031663491324, lr=0.012087991592286837
2023-12-07 22:25:20   INFO  epoch: 66/72, acc_iter=257142, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:24, time_cost(all): 2 days, 11:30:05/5:08:03, loss=0.296741058492709, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=2.4194980429771062, lr=0.012028494074999237
2023-12-07 22:26:02   INFO  epoch: 66/72, acc_iter=257192, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:13, time_cost(all): 2 days, 11:30:47/4:54:06, loss=0.296681861066768, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=3.4878851782223017, lr=0.011968996557711636
2023-12-07 22:26:44   INFO  epoch: 66/72, acc_iter=257242, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:59, time_cost(all): 2 days, 11:31:29/5:07:28, loss=0.296622663640827, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=2.9608050775060737, lr=0.011909499040424036
2023-12-07 22:27:26   INFO  epoch: 66/72, acc_iter=257292, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:51, time_cost(all): 2 days, 11:32:11/5:00:52, loss=0.296563466214886, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=1.0147010776857053, lr=0.011850001523136443
2023-12-07 22:28:07   INFO  epoch: 66/72, acc_iter=257342, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:27, time_cost(all): 2 days, 11:32:52/5:10:27, loss=0.296504268788945, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=4.470983162378289, lr=0.011790504005848843
2023-12-07 22:28:49   INFO  epoch: 66/72, acc_iter=257392, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:32, time_cost(all): 2 days, 11:33:34/4:51:02, loss=0.296445071363004, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=1.4217564730709036, lr=0.011731006488561242
2023-12-07 22:29:31   INFO  epoch: 66/72, acc_iter=257442, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:22, time_cost(all): 2 days, 11:34:16/4:43:50, loss=0.296385873937063, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=1.8322159676091543, lr=0.011671508971273642
2023-12-07 22:30:13   INFO  epoch: 66/72, acc_iter=257492, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:40, time_cost(all): 2 days, 11:34:58/4:44:06, loss=0.296326676511122, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=1.7241629470161037, lr=0.011612011453986042
2023-12-07 22:30:55   INFO  epoch: 66/72, acc_iter=257542, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:32, time_cost(all): 2 days, 11:35:40/5:08:36, loss=0.296267479085181, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=1.0599587293530086, lr=0.011552513936698441
2023-12-07 22:31:36   INFO  epoch: 66/72, acc_iter=257592, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:23, time_cost(all): 2 days, 11:36:21/5:04:55, loss=0.29620828165924, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=2.0949119527895474, lr=0.011493016419410848
2023-12-07 22:32:18   INFO  epoch: 66/72, acc_iter=257642, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:22, time_cost(all): 2 days, 11:37:03/5:05:17, loss=0.296149084233299, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=3.8501614509135353, lr=0.011433518902123248
2023-12-07 22:33:00   INFO  epoch: 66/72, acc_iter=257692, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:30, time_cost(all): 2 days, 11:37:45/5:06:09, loss=0.296089886807359, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=2.5721219826151644, lr=0.011374021384835654
2023-12-07 22:33:42   INFO  epoch: 66/72, acc_iter=257742, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:28, time_cost(all): 2 days, 11:38:27/4:44:16, loss=0.296030689381418, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=2.236067657288489, lr=0.011314523867548054
2023-12-07 22:34:23   INFO  epoch: 66/72, acc_iter=257792, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:06, time_cost(all): 2 days, 11:39:08/5:05:05, loss=0.295971491955477, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.234616821756266, lr=0.011255026350260454
2023-12-07 22:35:05   INFO  epoch: 66/72, acc_iter=257842, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:09, time_cost(all): 2 days, 11:39:50/4:56:56, loss=0.295912294529536, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=2.394253463031897, lr=0.011195528832972854
2023-12-07 22:35:47   INFO  epoch: 66/72, acc_iter=257892, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:37, time_cost(all): 2 days, 11:40:32/4:46:11, loss=0.295853097103595, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=1.8270677365729142, lr=0.011136031315685253
2023-12-07 22:36:29   INFO  epoch: 66/72, acc_iter=257942, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:26, time_cost(all): 2 days, 11:41:14/4:53:49, loss=0.295793899677654, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=3.7030202055318875, lr=0.01107653379839766
2023-12-07 22:37:11   INFO  epoch: 66/72, acc_iter=257992, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:22, time_cost(all): 2 days, 11:41:56/4:47:55, loss=0.295734702251713, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=0.7453228679477242, lr=0.01101703628111006
2023-12-07 22:37:52   INFO  epoch: 66/72, acc_iter=258042, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:28, time_cost(all): 2 days, 11:42:37/5:01:27, loss=0.295675504825772, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=4.906895361275131, lr=0.01095753876382246
2023-12-07 22:38:34   INFO  epoch: 66/72, acc_iter=258092, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:50, time_cost(all): 2 days, 11:43:19/5:00:26, loss=0.295616307399831, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=3.7750379136516146, lr=0.01089804124653486
2023-12-07 22:39:16   INFO  epoch: 66/72, acc_iter=258142, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:43, time_cost(all): 2 days, 11:44:01/4:52:16, loss=0.29555710997389, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=2.9160796837009055, lr=0.010838543729247259
2023-12-07 22:39:58   INFO  epoch: 66/72, acc_iter=258192, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:39, time_cost(all): 2 days, 11:44:43/4:46:57, loss=0.295497912547949, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=1.575458624322144, lr=0.010779046211959659
2023-12-07 22:40:39   INFO  epoch: 66/72, acc_iter=258242, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:49, time_cost(all): 2 days, 11:45:24/4:35:09, loss=0.295438715122008, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=0.8491882813622813, lr=0.010719548694672065
2023-12-07 22:41:21   INFO  epoch: 66/72, acc_iter=258292, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:40, time_cost(all): 2 days, 11:46:06/4:57:00, loss=0.295379517696067, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.1789398416119363, lr=0.010660051177384465
2023-12-07 22:42:03   INFO  epoch: 66/72, acc_iter=258342, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:28, time_cost(all): 2 days, 11:46:48/4:40:28, loss=0.295320320270126, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=4.110705287288246, lr=0.010600553660096872
2023-12-07 22:42:45   INFO  epoch: 66/72, acc_iter=258392, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:03, time_cost(all): 2 days, 11:47:30/4:52:28, loss=0.295261122844185, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=3.1396891523340074, lr=0.010541056142809271
2023-12-07 22:43:27   INFO  epoch: 66/72, acc_iter=258442, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:25, time_cost(all): 2 days, 11:48:12/4:40:49, loss=0.295201925418244, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=1.2479887822611488, lr=0.010481558625521671
2023-12-07 22:44:08   INFO  epoch: 66/72, acc_iter=258492, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:47, time_cost(all): 2 days, 11:48:53/4:40:18, loss=0.295142727992304, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=4.306278467425511, lr=0.01042206110823407
2023-12-07 22:44:50   INFO  epoch: 66/72, acc_iter=258542, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:54, time_cost(all): 2 days, 11:49:35/4:35:21, loss=0.295083530566363, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=4.601040388127675, lr=0.01036256359094647
2023-12-07 22:45:32   INFO  epoch: 66/72, acc_iter=258592, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:18, time_cost(all): 2 days, 11:50:17/4:34:37, loss=0.295024333140422, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=4.9701341642545565, lr=0.010303066073658877
2023-12-07 22:46:14   INFO  epoch: 66/72, acc_iter=258642, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 2 days, 11:50:59/4:36:50, loss=0.294965135714481, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=2.0928430551610973, lr=0.010243568556371277
2023-12-07 22:46:55   INFO  epoch: 66/72, acc_iter=258692, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 2 days, 11:51:40/4:54:33, loss=0.29490593828854, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=2.2132372105174536, lr=0.010184071039083677
2023-12-07 22:47:37   INFO  epoch: 66/72, acc_iter=258742, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 11:52:22/4:48:32, loss=0.294846740862599, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=1.7423424816822226, lr=0.010124573521796076
2023-12-07 22:48:19   INFO  epoch: 67/72, acc_iter=258804, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:51:37, time_cost(all): 2 days, 11:53:04/4:45:24, loss=0.294773336054432, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=2.7353835072649946, lr=0.010050796600359457
2023-12-07 22:49:01   INFO  epoch: 67/72, acc_iter=258854, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:50:48, time_cost(all): 2 days, 11:53:46/4:45:57, loss=0.294714138628491, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=0.5659592503221696, lr=0.009991299083071857
2023-12-07 22:49:43   INFO  epoch: 67/72, acc_iter=258904, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:02, time_cost(all): 2 days, 11:54:28/4:47:01, loss=0.29465494120255, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.384050333549266, lr=0.009931801565784264
2023-12-07 22:50:24   INFO  epoch: 67/72, acc_iter=258954, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:00, time_cost(all): 2 days, 11:55:09/4:24:08, loss=0.294595743776609, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=4.981538312126704, lr=0.009872304048496663
2023-12-07 22:51:06   INFO  epoch: 67/72, acc_iter=259004, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:40, time_cost(all): 2 days, 11:55:51/4:25:30, loss=0.294536546350668, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=4.737675862505512, lr=0.009812806531209063
2023-12-07 22:51:48   INFO  epoch: 67/72, acc_iter=259054, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:51:49, time_cost(all): 2 days, 11:56:33/4:35:38, loss=0.294477348924727, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=4.931754473820743, lr=0.009753309013921463
2023-12-07 22:52:30   INFO  epoch: 67/72, acc_iter=259104, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:50:20, time_cost(all): 2 days, 11:57:15/4:40:47, loss=0.294418151498786, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.802890247189327, lr=0.009693811496633863
2023-12-07 22:53:12   INFO  epoch: 67/72, acc_iter=259154, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:46:48, time_cost(all): 2 days, 11:57:57/4:36:20, loss=0.294358954072845, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=0.9249671466970499, lr=0.009634313979346262
2023-12-07 22:53:53   INFO  epoch: 67/72, acc_iter=259204, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:31, time_cost(all): 2 days, 11:58:38/4:21:20, loss=0.294299756646904, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=4.029507612884061, lr=0.009574816462058669
2023-12-07 22:54:35   INFO  epoch: 67/72, acc_iter=259254, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:36, time_cost(all): 2 days, 11:59:20/4:38:52, loss=0.294240559220964, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=2.7132596930197614, lr=0.009515318944771069
2023-12-07 22:55:17   INFO  epoch: 67/72, acc_iter=259304, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:47:46, time_cost(all): 2 days, 12:00:02/4:42:37, loss=0.294181361795023, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=3.249664172577616, lr=0.009455821427483468
2023-12-07 22:55:59   INFO  epoch: 67/72, acc_iter=259354, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:47:22, time_cost(all): 2 days, 12:00:44/4:31:10, loss=0.294122164369082, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=3.205923592981644, lr=0.009396323910195868
2023-12-07 22:56:40   INFO  epoch: 67/72, acc_iter=259404, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:38, time_cost(all): 2 days, 12:01:25/4:20:42, loss=0.294062966943141, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=2.156920726367339, lr=0.009336826392908268
2023-12-07 22:57:22   INFO  epoch: 67/72, acc_iter=259454, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:16, time_cost(all): 2 days, 12:02:07/4:40:05, loss=0.2940037695172, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=1.2502782859599337, lr=0.009277328875620675
2023-12-07 22:58:04   INFO  epoch: 67/72, acc_iter=259504, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:26, time_cost(all): 2 days, 12:02:49/4:33:07, loss=0.293944572091259, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=2.281802314390779, lr=0.009217831358333074
2023-12-07 22:58:46   INFO  epoch: 67/72, acc_iter=259554, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:48, time_cost(all): 2 days, 12:03:31/4:18:17, loss=0.293885374665318, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=1.08938502823007, lr=0.009158333841045481
2023-12-07 22:59:28   INFO  epoch: 67/72, acc_iter=259604, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:31, time_cost(all): 2 days, 12:04:13/4:21:39, loss=0.293826177239377, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=4.4016370120288055, lr=0.00909883632375788
2023-12-07 23:00:09   INFO  epoch: 67/72, acc_iter=259654, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:43:12, time_cost(all): 2 days, 12:04:54/4:31:30, loss=0.293766979813436, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=4.234818540623669, lr=0.00903933880647028
2023-12-07 23:00:51   INFO  epoch: 67/72, acc_iter=259704, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:01, time_cost(all): 2 days, 12:05:36/4:19:40, loss=0.293707782387495, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=2.788820454278346, lr=0.00897984128918268
2023-12-07 23:01:33   INFO  epoch: 67/72, acc_iter=259754, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:28, time_cost(all): 2 days, 12:06:18/4:30:11, loss=0.293648584961554, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=2.9206879500026592, lr=0.00892034377189508
2023-12-07 23:02:15   INFO  epoch: 67/72, acc_iter=259804, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:54, time_cost(all): 2 days, 12:07:00/4:14:04, loss=0.293589387535613, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=1.3904991551559698, lr=0.00886084625460748
2023-12-07 23:02:56   INFO  epoch: 67/72, acc_iter=259854, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:37:54, time_cost(all): 2 days, 12:07:41/4:23:19, loss=0.293530190109672, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=4.696917988078172, lr=0.008801348737319886
2023-12-07 23:03:38   INFO  epoch: 67/72, acc_iter=259904, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:05, time_cost(all): 2 days, 12:08:23/4:27:56, loss=0.293470992683731, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=4.993218579741423, lr=0.008741851220032286
2023-12-07 23:04:20   INFO  epoch: 67/72, acc_iter=259954, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:43, time_cost(all): 2 days, 12:09:05/4:28:43, loss=0.29341179525779, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=3.6380755392117967, lr=0.008682353702744686
2023-12-07 23:05:02   INFO  epoch: 67/72, acc_iter=260004, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:34:34, time_cost(all): 2 days, 12:09:47/4:11:28, loss=0.293352597831849, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=2.2988408325610736, lr=0.008622856185457085
2023-12-07 23:05:44   INFO  epoch: 67/72, acc_iter=260054, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:01, time_cost(all): 2 days, 12:10:29/4:26:59, loss=0.293293400405909, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=0.6424798095349195, lr=0.008563358668169485
2023-12-07 23:06:25   INFO  epoch: 67/72, acc_iter=260104, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:48, time_cost(all): 2 days, 12:11:10/4:29:32, loss=0.293234202979968, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=2.2121100795279105, lr=0.008503861150881892
2023-12-07 23:07:07   INFO  epoch: 67/72, acc_iter=260154, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:30, time_cost(all): 2 days, 12:11:52/4:21:49, loss=0.293175005554027, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=3.8781925520823277, lr=0.008444363633594291
2023-12-07 23:07:49   INFO  epoch: 67/72, acc_iter=260204, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:48, time_cost(all): 2 days, 12:12:34/4:28:37, loss=0.293115808128086, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=2.904773214769666, lr=0.008384866116306691
2023-12-07 23:08:31   INFO  epoch: 67/72, acc_iter=260254, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:45, time_cost(all): 2 days, 12:13:16/4:25:23, loss=0.293056610702145, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=0.5220181667117162, lr=0.008325368599019098
2023-12-07 23:09:12   INFO  epoch: 67/72, acc_iter=260304, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:55, time_cost(all): 2 days, 12:13:57/4:19:28, loss=0.292997413276204, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=1.1443915944851897, lr=0.008265871081731498
2023-12-07 23:09:54   INFO  epoch: 67/72, acc_iter=260354, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:18, time_cost(all): 2 days, 12:14:39/4:19:49, loss=0.292938215850263, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=2.4414339459268612, lr=0.008206373564443897
2023-12-07 23:10:36   INFO  epoch: 67/72, acc_iter=260404, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:57, time_cost(all): 2 days, 12:15:21/4:23:40, loss=0.292879018424322, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=2.5861065872175812, lr=0.008146876047156297
2023-12-07 23:11:18   INFO  epoch: 67/72, acc_iter=260454, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:29:04, time_cost(all): 2 days, 12:16:03/4:08:44, loss=0.292819820998381, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=4.730144443774793, lr=0.008087378529868697
2023-12-07 23:12:00   INFO  epoch: 67/72, acc_iter=260504, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:58, time_cost(all): 2 days, 12:16:45/4:05:52, loss=0.29276062357244, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=3.480886789234397, lr=0.008027881012581103
2023-12-07 23:12:41   INFO  epoch: 67/72, acc_iter=260554, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:27:49, time_cost(all): 2 days, 12:17:26/4:23:50, loss=0.292701426146499, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=2.135813124212955, lr=0.007968383495293503
2023-12-07 23:13:23   INFO  epoch: 67/72, acc_iter=260604, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:57, time_cost(all): 2 days, 12:18:08/4:05:59, loss=0.292642228720558, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=2.4256834999778807, lr=0.007908885978005903
2023-12-07 23:14:05   INFO  epoch: 67/72, acc_iter=260654, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:28:18, time_cost(all): 2 days, 12:18:50/4:15:31, loss=0.292583031294617, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=4.814308205778136, lr=0.007849388460718303
2023-12-07 23:14:47   INFO  epoch: 67/72, acc_iter=260704, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:27:55, time_cost(all): 2 days, 12:19:32/4:05:04, loss=0.292523833868676, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=1.819032333794812, lr=0.007789890943430702
2023-12-07 23:15:28   INFO  epoch: 67/72, acc_iter=260754, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:26:07, time_cost(all): 2 days, 12:20:13/4:00:52, loss=0.292464636442735, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=1.9482635258195318, lr=0.007730393426143109
2023-12-07 23:16:10   INFO  epoch: 67/72, acc_iter=260804, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:24:43, time_cost(all): 2 days, 12:20:55/4:04:13, loss=0.292405439016794, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=1.6720998042330921, lr=0.007670895908855509
2023-12-07 23:16:52   INFO  epoch: 67/72, acc_iter=260854, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:25:28, time_cost(all): 2 days, 12:21:37/4:01:28, loss=0.292346241590853, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=2.097347928686545, lr=0.007611398391567908
2023-12-07 23:17:34   INFO  epoch: 67/72, acc_iter=260904, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:24:05, time_cost(all): 2 days, 12:22:19/4:09:59, loss=0.292287044164913, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=2.084365566873916, lr=0.007551900874280315
2023-12-07 23:18:16   INFO  epoch: 67/72, acc_iter=260954, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:16, time_cost(all): 2 days, 12:23:01/4:01:27, loss=0.292227846738972, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=3.732065949774986, lr=0.007492403356992715
2023-12-07 23:18:57   INFO  epoch: 67/72, acc_iter=261004, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:20, time_cost(all): 2 days, 12:23:42/4:14:33, loss=0.292168649313031, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=1.4462229528793888, lr=0.007432905839705115
2023-12-07 23:19:39   INFO  epoch: 67/72, acc_iter=261054, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:48, time_cost(all): 2 days, 12:24:24/4:18:02, loss=0.29210945188709, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=1.4620159533446255, lr=0.007373408322417514
2023-12-07 23:20:21   INFO  epoch: 67/72, acc_iter=261104, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:15, time_cost(all): 2 days, 12:25:06/4:16:34, loss=0.292050254461149, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=2.9841025336976887, lr=0.007313910805129914
2023-12-07 23:21:03   INFO  epoch: 67/72, acc_iter=261154, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:45, time_cost(all): 2 days, 12:25:48/3:55:31, loss=0.291991057035208, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=4.930999317072855, lr=0.007254413287842321
2023-12-07 23:21:44   INFO  epoch: 67/72, acc_iter=261204, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:19:40, time_cost(all): 2 days, 12:26:29/4:09:57, loss=0.291931859609267, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=4.526305021841177, lr=0.00719491577055472
2023-12-07 23:22:26   INFO  epoch: 67/72, acc_iter=261254, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:38, time_cost(all): 2 days, 12:27:11/4:11:55, loss=0.291872662183326, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=1.3366944053919514, lr=0.00713541825326712
2023-12-07 23:23:08   INFO  epoch: 67/72, acc_iter=261304, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:38, time_cost(all): 2 days, 12:27:53/4:04:36, loss=0.291813464757385, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=0.8163235296286648, lr=0.00707592073597952
2023-12-07 23:23:50   INFO  epoch: 67/72, acc_iter=261354, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:49, time_cost(all): 2 days, 12:28:35/4:07:16, loss=0.291754267331444, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=4.836364077965715, lr=0.00701642321869192
2023-12-07 23:24:32   INFO  epoch: 67/72, acc_iter=261404, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:05, time_cost(all): 2 days, 12:29:17/3:53:56, loss=0.291695069905503, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=2.4275724834299197, lr=0.006956925701404326
2023-12-07 23:25:13   INFO  epoch: 67/72, acc_iter=261454, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:15:39, time_cost(all): 2 days, 12:29:58/4:06:54, loss=0.291635872479562, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=4.685611380597993, lr=0.006897428184116726
2023-12-07 23:25:55   INFO  epoch: 67/72, acc_iter=261504, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:45, time_cost(all): 2 days, 12:30:40/4:09:07, loss=0.291576675053621, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=2.8070640013062107, lr=0.006837930666829126
2023-12-07 23:26:37   INFO  epoch: 67/72, acc_iter=261554, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:16, time_cost(all): 2 days, 12:31:22/3:55:31, loss=0.29151747762768, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=0.7374974500544869, lr=0.006778433149541532
2023-12-07 23:27:19   INFO  epoch: 67/72, acc_iter=261604, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:41, time_cost(all): 2 days, 12:32:04/4:09:56, loss=0.291458280201739, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=0.8305920205456607, lr=0.006718935632253932
2023-12-07 23:28:00   INFO  epoch: 67/72, acc_iter=261654, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:10, time_cost(all): 2 days, 12:32:45/4:09:43, loss=0.291399082775798, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=3.2678190278408574, lr=0.006659438114966332
2023-12-07 23:28:42   INFO  epoch: 67/72, acc_iter=261704, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:40, time_cost(all): 2 days, 12:33:27/3:56:15, loss=0.291339885349857, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=3.0491272962799307, lr=0.006599940597678731
2023-12-07 23:29:24   INFO  epoch: 67/72, acc_iter=261754, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:32, time_cost(all): 2 days, 12:34:09/3:51:07, loss=0.291280687923917, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=2.8618918104373385, lr=0.006540443080391131
2023-12-07 23:30:06   INFO  epoch: 67/72, acc_iter=261804, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:41, time_cost(all): 2 days, 12:34:51/3:50:35, loss=0.291221490497976, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=0.7719590886904197, lr=0.006480945563103538
2023-12-07 23:30:48   INFO  epoch: 67/72, acc_iter=261854, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:42, time_cost(all): 2 days, 12:35:33/3:57:12, loss=0.291162293072035, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=1.2798991333198702, lr=0.006421448045815938
2023-12-07 23:31:29   INFO  epoch: 67/72, acc_iter=261904, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:57, time_cost(all): 2 days, 12:36:14/3:49:09, loss=0.291103095646094, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=2.2996886258656595, lr=0.006361950528528337
2023-12-07 23:32:11   INFO  epoch: 67/72, acc_iter=261954, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:25, time_cost(all): 2 days, 12:36:56/4:05:37, loss=0.291043898220153, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=4.906248775578742, lr=0.006302453011240737
2023-12-07 23:32:53   INFO  epoch: 67/72, acc_iter=262004, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:27, time_cost(all): 2 days, 12:37:38/4:04:28, loss=0.290984700794212, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=3.604673962329974, lr=0.006242955493953137
2023-12-07 23:33:35   INFO  epoch: 67/72, acc_iter=262054, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:11, time_cost(all): 2 days, 12:38:20/3:45:04, loss=0.290925503368271, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=4.12899631717217, lr=0.006183457976665536
2023-12-07 23:34:17   INFO  epoch: 67/72, acc_iter=262104, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:12, time_cost(all): 2 days, 12:39:02/3:55:17, loss=0.29086630594233, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.034683203207981, lr=0.006123960459377943
2023-12-07 23:34:58   INFO  epoch: 67/72, acc_iter=262154, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:10, time_cost(all): 2 days, 12:39:43/4:03:46, loss=0.290807108516389, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=2.345792000765359, lr=0.006064462942090343
2023-12-07 23:35:40   INFO  epoch: 67/72, acc_iter=262204, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:36, time_cost(all): 2 days, 12:40:25/3:49:46, loss=0.290747911090448, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=0.7664505644866824, lr=0.00600496542480275
2023-12-07 23:36:22   INFO  epoch: 67/72, acc_iter=262254, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:11, time_cost(all): 2 days, 12:41:07/3:40:56, loss=0.290688713664507, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=0.7962201313413639, lr=0.005945467907515149
2023-12-07 23:37:04   INFO  epoch: 67/72, acc_iter=262304, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:21, time_cost(all): 2 days, 12:41:49/3:49:44, loss=0.290629516238566, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=4.064126080560084, lr=0.005885970390227549
2023-12-07 23:37:45   INFO  epoch: 67/72, acc_iter=262354, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:42, time_cost(all): 2 days, 12:42:30/4:01:03, loss=0.290570318812625, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=4.722608571813021, lr=0.005826472872939949
2023-12-07 23:38:27   INFO  epoch: 67/72, acc_iter=262404, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:49, time_cost(all): 2 days, 12:43:12/3:53:07, loss=0.290511121386684, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=3.585626983146055, lr=0.005766975355652348
2023-12-07 23:39:09   INFO  epoch: 67/72, acc_iter=262454, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:21, time_cost(all): 2 days, 12:43:54/3:46:29, loss=0.290451923960743, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=3.260790012354005, lr=0.005707477838364755
2023-12-07 23:39:51   INFO  epoch: 67/72, acc_iter=262504, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:33, time_cost(all): 2 days, 12:44:36/3:48:41, loss=0.290392726534802, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=4.6395749209316826, lr=0.005647980321077155
2023-12-07 23:40:33   INFO  epoch: 67/72, acc_iter=262554, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 2 days, 12:45:18/3:52:17, loss=0.290333529108861, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=4.18652833720204, lr=0.005588482803789555
2023-12-07 23:41:14   INFO  epoch: 67/72, acc_iter=262604, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 12:45:59/3:37:54, loss=0.29027433168292, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=4.461742695293127, lr=0.005528985286501954
2023-12-07 23:41:56   INFO  epoch: 68/72, acc_iter=262666, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:01, time_cost(all): 2 days, 12:46:41/3:54:56, loss=0.290200926874754, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=4.328561377405701, lr=0.005455208365065335
2023-12-07 23:42:38   INFO  epoch: 68/72, acc_iter=262716, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:22, time_cost(all): 2 days, 12:47:23/3:34:12, loss=0.290141729448813, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=4.51149939658978, lr=0.005395710847777735
2023-12-07 23:43:20   INFO  epoch: 68/72, acc_iter=262766, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:17, time_cost(all): 2 days, 12:48:05/3:33:29, loss=0.290082532022872, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=3.1509782514439784, lr=0.005336213330490142
2023-12-07 23:44:01   INFO  epoch: 68/72, acc_iter=262816, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:49:17, time_cost(all): 2 days, 12:48:46/3:39:55, loss=0.290023334596931, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=1.7431162073002913, lr=0.005276715813202541
2023-12-07 23:44:43   INFO  epoch: 68/72, acc_iter=262866, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:49:27, time_cost(all): 2 days, 12:49:28/3:39:56, loss=0.28996413717099, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=2.121558739601226, lr=0.005217218295914941
2023-12-07 23:45:25   INFO  epoch: 68/72, acc_iter=262916, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:12, time_cost(all): 2 days, 12:50:10/3:52:37, loss=0.289904939745049, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=4.095086568360129, lr=0.005157720778627341
2023-12-07 23:46:07   INFO  epoch: 68/72, acc_iter=262966, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:39, time_cost(all): 2 days, 12:50:52/3:39:34, loss=0.289845742319108, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=2.2365684889315123, lr=0.00509822326133974
2023-12-07 23:46:49   INFO  epoch: 68/72, acc_iter=263016, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:40, time_cost(all): 2 days, 12:51:34/3:46:52, loss=0.289786544893167, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=1.5438937315536374, lr=0.00503872574405214
2023-12-07 23:47:30   INFO  epoch: 68/72, acc_iter=263066, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:46:03, time_cost(all): 2 days, 12:52:15/3:31:16, loss=0.289727347467226, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=3.5891292435170374, lr=0.004994187903983733
2023-12-07 23:48:12   INFO  epoch: 68/72, acc_iter=263116, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:44:54, time_cost(all): 2 days, 12:52:57/3:34:23, loss=0.289668150041285, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=1.8052002824943498, lr=0.004977540057924313
2023-12-07 23:48:54   INFO  epoch: 68/72, acc_iter=263166, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:00, time_cost(all): 2 days, 12:53:39/3:44:45, loss=0.289608952615344, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=1.3510645956085632, lr=0.004960892211864893
2023-12-07 23:49:36   INFO  epoch: 68/72, acc_iter=263216, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:59, time_cost(all): 2 days, 12:54:21/3:46:43, loss=0.289549755189403, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=1.404485642990005, lr=0.004944244365805472
2023-12-07 23:50:17   INFO  epoch: 68/72, acc_iter=263266, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:10, time_cost(all): 2 days, 12:55:02/3:39:16, loss=0.289490557763462, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=4.425914298029262, lr=0.004927596519746052
2023-12-07 23:50:59   INFO  epoch: 68/72, acc_iter=263316, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:43:11, time_cost(all): 2 days, 12:55:44/3:29:32, loss=0.289431360337522, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=2.462774966990932, lr=0.004910948673686632
2023-12-07 23:51:41   INFO  epoch: 68/72, acc_iter=263366, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:45:16, time_cost(all): 2 days, 12:56:26/3:38:14, loss=0.289372162911581, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=3.2929905198326344, lr=0.004894300827627211
2023-12-07 23:52:23   INFO  epoch: 68/72, acc_iter=263416, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:43:50, time_cost(all): 2 days, 12:57:08/3:42:22, loss=0.28931296548564, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=3.9809483548851423, lr=0.004877652981567791
2023-12-07 23:53:05   INFO  epoch: 68/72, acc_iter=263466, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:42:12, time_cost(all): 2 days, 12:57:50/3:26:57, loss=0.289253768059699, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=0.9005855424223494, lr=0.004861005135508371
2023-12-07 23:53:46   INFO  epoch: 68/72, acc_iter=263516, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:53, time_cost(all): 2 days, 12:58:31/3:43:37, loss=0.289194570633758, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=4.9827991539204515, lr=0.00484435728944895
2023-12-07 23:54:28   INFO  epoch: 68/72, acc_iter=263566, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:58, time_cost(all): 2 days, 12:59:13/3:30:30, loss=0.289135373207817, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=1.02161527763763, lr=0.004827709443389529
2023-12-07 23:55:10   INFO  epoch: 68/72, acc_iter=263616, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:40:33, time_cost(all): 2 days, 12:59:55/3:32:47, loss=0.289076175781876, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=3.6963288990694427, lr=0.004811061597330109
2023-12-07 23:55:52   INFO  epoch: 68/72, acc_iter=263666, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:12, time_cost(all): 2 days, 13:00:37/3:33:16, loss=0.289016978355935, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=2.477962609400815, lr=0.004794413751270689
2023-12-07 23:56:33   INFO  epoch: 68/72, acc_iter=263716, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:59, time_cost(all): 2 days, 13:01:18/3:30:55, loss=0.288957780929994, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=2.750260657982515, lr=0.004777765905211269
2023-12-07 23:57:15   INFO  epoch: 68/72, acc_iter=263766, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:39:28, time_cost(all): 2 days, 13:02:00/3:34:09, loss=0.288898583504053, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=2.461645743573368, lr=0.004761118059151848
2023-12-07 23:57:57   INFO  epoch: 68/72, acc_iter=263816, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:31, time_cost(all): 2 days, 13:02:42/3:20:48, loss=0.288839386078112, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=1.7555268069730081, lr=0.004744470213092428
2023-12-07 23:58:39   INFO  epoch: 68/72, acc_iter=263866, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:53, time_cost(all): 2 days, 13:03:24/3:35:15, loss=0.288780188652171, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=2.457696826642361, lr=0.004727822367033008
2023-12-07 23:59:21   INFO  epoch: 68/72, acc_iter=263916, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:33:54, time_cost(all): 2 days, 13:04:06/3:35:06, loss=0.28872099122623, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.3152127207627777, lr=0.004711174520973587
2023-12-08 00:00:02   INFO  epoch: 68/72, acc_iter=263966, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:14, time_cost(all): 2 days, 13:04:47/3:25:33, loss=0.288661793800289, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=3.0184380721908908, lr=0.004694526674914167
2023-12-08 00:00:44   INFO  epoch: 68/72, acc_iter=264016, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:09, time_cost(all): 2 days, 13:05:29/3:33:32, loss=0.288602596374348, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=0.88417369158065, lr=0.004677878828854746
2023-12-08 00:01:26   INFO  epoch: 68/72, acc_iter=264066, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:34:52, time_cost(all): 2 days, 13:06:11/3:16:25, loss=0.288543398948407, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=0.9688065995181936, lr=0.004661230982795326
2023-12-08 00:02:08   INFO  epoch: 68/72, acc_iter=264116, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:33:31, time_cost(all): 2 days, 13:06:53/3:27:02, loss=0.288484201522466, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=3.9398518552000534, lr=0.004644583136735905
2023-12-08 00:02:49   INFO  epoch: 68/72, acc_iter=264166, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:42, time_cost(all): 2 days, 13:07:34/3:24:20, loss=0.288425004096526, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=0.688920438919393, lr=0.004627935290676485
2023-12-08 00:03:31   INFO  epoch: 68/72, acc_iter=264216, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:31:16, time_cost(all): 2 days, 13:08:16/3:24:31, loss=0.288365806670585, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=4.335602549096167, lr=0.004611287444617065
2023-12-08 00:04:13   INFO  epoch: 68/72, acc_iter=264266, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:18, time_cost(all): 2 days, 13:08:58/3:24:19, loss=0.288306609244644, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.542082282666995, lr=0.004594639598557645
2023-12-08 00:04:55   INFO  epoch: 68/72, acc_iter=264316, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:31:23, time_cost(all): 2 days, 13:09:40/3:17:10, loss=0.288247411818703, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=1.6379383455362193, lr=0.004577991752498224
2023-12-08 00:05:37   INFO  epoch: 68/72, acc_iter=264366, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:03, time_cost(all): 2 days, 13:10:22/3:31:17, loss=0.288188214392762, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.83(1.03), norm=3.8781989280038216, lr=0.004561343906438804
2023-12-08 00:06:18   INFO  epoch: 68/72, acc_iter=264416, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:13, time_cost(all): 2 days, 13:11:03/3:14:24, loss=0.288129016966821, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.4935837210396707, lr=0.004544696060379384
2023-12-08 00:07:00   INFO  epoch: 68/72, acc_iter=264466, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:01, time_cost(all): 2 days, 13:11:45/3:17:46, loss=0.28806981954088, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=2.722331981651342, lr=0.004528048214319963
2023-12-08 00:07:42   INFO  epoch: 68/72, acc_iter=264516, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:09, time_cost(all): 2 days, 13:12:27/3:26:44, loss=0.288010622114939, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=1.1557148863687943, lr=0.004511400368260542
2023-12-08 00:08:24   INFO  epoch: 68/72, acc_iter=264566, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:07, time_cost(all): 2 days, 13:13:09/3:21:16, loss=0.287951424688998, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.419268904264083, lr=0.004494752522201122
2023-12-08 00:09:06   INFO  epoch: 68/72, acc_iter=264616, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:57, time_cost(all): 2 days, 13:13:51/3:18:54, loss=0.287892227263057, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=1.8564844019363316, lr=0.004478104676141702
2023-12-08 00:09:47   INFO  epoch: 68/72, acc_iter=264666, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:39, time_cost(all): 2 days, 13:14:32/3:26:14, loss=0.287833029837116, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=0.508024108009737, lr=0.004461456830082281
2023-12-08 00:10:29   INFO  epoch: 68/72, acc_iter=264716, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:07, time_cost(all): 2 days, 13:15:14/3:15:27, loss=0.287773832411175, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=0.8283669995115988, lr=0.004444808984022861
2023-12-08 00:11:11   INFO  epoch: 68/72, acc_iter=264766, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:54, time_cost(all): 2 days, 13:15:56/3:22:50, loss=0.287714634985234, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=2.9146936341750567, lr=0.004428161137963441
2023-12-08 00:11:53   INFO  epoch: 68/72, acc_iter=264816, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:12, time_cost(all): 2 days, 13:16:38/3:07:55, loss=0.287655437559293, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=4.427890511362519, lr=0.004411513291904021
2023-12-08 00:12:34   INFO  epoch: 68/72, acc_iter=264866, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:23:07, time_cost(all): 2 days, 13:17:19/3:10:12, loss=0.287596240133352, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=0.7365103210223019, lr=0.0043948654458446
2023-12-08 00:13:16   INFO  epoch: 68/72, acc_iter=264916, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:22:02, time_cost(all): 2 days, 13:18:01/3:14:14, loss=0.287537042707411, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=4.1721174304665025, lr=0.00437821759978518
2023-12-08 00:13:58   INFO  epoch: 68/72, acc_iter=264966, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:21:32, time_cost(all): 2 days, 13:18:43/3:11:08, loss=0.28747784528147, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=4.636802492498304, lr=0.00436156975372576
2023-12-08 00:14:40   INFO  epoch: 68/72, acc_iter=265016, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:02, time_cost(all): 2 days, 13:19:25/3:19:55, loss=0.28741864785553, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=0.691528977340076, lr=0.00434492190766634
2023-12-08 00:15:22   INFO  epoch: 68/72, acc_iter=265066, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:01, time_cost(all): 2 days, 13:20:07/3:07:04, loss=0.287359450429589, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=3.0674218927518893, lr=0.004328274061606918
2023-12-08 00:16:03   INFO  epoch: 68/72, acc_iter=265116, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:28, time_cost(all): 2 days, 13:20:48/3:18:25, loss=0.287300253003648, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=4.324919661284331, lr=0.004311626215547498
2023-12-08 00:16:45   INFO  epoch: 68/72, acc_iter=265166, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:45, time_cost(all): 2 days, 13:21:30/3:17:12, loss=0.287241055577707, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=0.9792191576932481, lr=0.004294978369488078
2023-12-08 00:17:27   INFO  epoch: 68/72, acc_iter=265216, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:18:09, time_cost(all): 2 days, 13:22:12/3:12:19, loss=0.287181858151766, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=1.3977727552544117, lr=0.004278330523428658
2023-12-08 00:18:09   INFO  epoch: 68/72, acc_iter=265266, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:17, time_cost(all): 2 days, 13:22:54/3:09:07, loss=0.287122660725825, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=3.362537864131062, lr=0.004261682677369237
2023-12-08 00:18:50   INFO  epoch: 68/72, acc_iter=265316, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:35, time_cost(all): 2 days, 13:23:35/3:10:47, loss=0.287063463299884, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=2.2702076219390137, lr=0.004245034831309817
2023-12-08 00:19:32   INFO  epoch: 68/72, acc_iter=265366, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:16:13, time_cost(all): 2 days, 13:24:17/3:00:52, loss=0.287004265873943, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=3.0655282716442924, lr=0.004228386985250397
2023-12-08 00:20:14   INFO  epoch: 68/72, acc_iter=265416, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:01, time_cost(all): 2 days, 13:24:59/3:02:03, loss=0.286945068448002, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=0.6097219740562823, lr=0.004211739139190976
2023-12-08 00:20:56   INFO  epoch: 68/72, acc_iter=265466, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:15, time_cost(all): 2 days, 13:25:41/2:57:49, loss=0.286885871022061, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=1.6885544031501603, lr=0.004195091293131556
2023-12-08 00:21:38   INFO  epoch: 68/72, acc_iter=265516, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:19, time_cost(all): 2 days, 13:26:23/3:06:35, loss=0.28682667359612, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=0.6309627622445273, lr=0.004178443447072135
2023-12-08 00:22:19   INFO  epoch: 68/72, acc_iter=265566, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:13:10, time_cost(all): 2 days, 13:27:04/3:09:07, loss=0.286767476170179, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=2.5180867558875546, lr=0.004161795601012715
2023-12-08 00:23:01   INFO  epoch: 68/72, acc_iter=265616, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:13, time_cost(all): 2 days, 13:27:46/3:13:55, loss=0.286708278744238, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=0.9958448749325866, lr=0.004145147754953294
2023-12-08 00:23:43   INFO  epoch: 68/72, acc_iter=265666, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:07, time_cost(all): 2 days, 13:28:28/3:04:17, loss=0.286649081318297, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=2.113045862654827, lr=0.004128499908893874
2023-12-08 00:24:25   INFO  epoch: 68/72, acc_iter=265716, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:27, time_cost(all): 2 days, 13:29:10/2:57:46, loss=0.286589883892356, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=2.528597725350601, lr=0.004111852062834454
2023-12-08 00:25:06   INFO  epoch: 68/72, acc_iter=265766, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:10:11, time_cost(all): 2 days, 13:29:51/3:08:29, loss=0.286530686466415, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=2.99622441301737, lr=0.004095204216775034
2023-12-08 00:25:48   INFO  epoch: 68/72, acc_iter=265816, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:53, time_cost(all): 2 days, 13:30:33/2:57:15, loss=0.286471489040474, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=3.849427428094847, lr=0.004078556370715613
2023-12-08 00:26:30   INFO  epoch: 68/72, acc_iter=265866, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:21, time_cost(all): 2 days, 13:31:15/3:02:56, loss=0.286412291614534, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=0.758732264464097, lr=0.004061908524656193
2023-12-08 00:27:12   INFO  epoch: 68/72, acc_iter=265916, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:40, time_cost(all): 2 days, 13:31:57/2:55:30, loss=0.286353094188593, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=4.592113883815475, lr=0.004045260678596773
2023-12-08 00:27:54   INFO  epoch: 68/72, acc_iter=265966, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:48, time_cost(all): 2 days, 13:32:39/3:01:40, loss=0.286293896762652, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=0.5873568744702093, lr=0.004028612832537352
2023-12-08 00:28:35   INFO  epoch: 68/72, acc_iter=266016, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:30, time_cost(all): 2 days, 13:33:20/3:05:59, loss=0.286234699336711, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=3.31066608985447, lr=0.004011964986477931
2023-12-08 00:29:17   INFO  epoch: 68/72, acc_iter=266066, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:39, time_cost(all): 2 days, 13:34:02/3:02:44, loss=0.28617550191077, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=1.5764509751110567, lr=0.003995317140418511
2023-12-08 00:29:59   INFO  epoch: 68/72, acc_iter=266116, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:04:58, time_cost(all): 2 days, 13:34:44/2:55:36, loss=0.286116304484829, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.1467098631661523, lr=0.003978669294359091
2023-12-08 00:30:41   INFO  epoch: 68/72, acc_iter=266166, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:10, time_cost(all): 2 days, 13:35:26/2:49:15, loss=0.286057107058888, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.95(1.03), norm=1.2882581867480587, lr=0.00396202144829967
2023-12-08 00:31:22   INFO  epoch: 68/72, acc_iter=266216, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:39, time_cost(all): 2 days, 13:36:07/3:04:19, loss=0.285997909632947, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=1.6064164598603188, lr=0.00394537360224025
2023-12-08 00:32:04   INFO  epoch: 68/72, acc_iter=266266, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:58, time_cost(all): 2 days, 13:36:49/2:56:42, loss=0.285938712207006, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=2.1166885949077914, lr=0.00392872575618083
2023-12-08 00:32:46   INFO  epoch: 68/72, acc_iter=266316, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:21, time_cost(all): 2 days, 13:37:31/2:58:11, loss=0.285879514781065, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=4.283417636338016, lr=0.00391207791012141
2023-12-08 00:33:28   INFO  epoch: 68/72, acc_iter=266366, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:37, time_cost(all): 2 days, 13:38:13/2:49:49, loss=0.285820317355124, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=4.192892826523584, lr=0.003895430064061989
2023-12-08 00:34:10   INFO  epoch: 68/72, acc_iter=266416, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:50, time_cost(all): 2 days, 13:38:55/2:57:28, loss=0.285761119929183, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.07(1.03), norm=2.069236418531352, lr=0.003878782218002569
2023-12-08 00:34:51   INFO  epoch: 68/72, acc_iter=266466, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 13:39:36/2:53:28, loss=0.285701922503242, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=1.5699761236822825, lr=0.003862134371943148
2023-12-08 00:35:33   INFO  epoch: 69/72, acc_iter=266528, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:55:36, time_cost(all): 2 days, 13:40:18/2:58:33, loss=0.285628517695075, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=4.639927117807888, lr=0.003841491042829467
2023-12-08 00:36:15   INFO  epoch: 69/72, acc_iter=266578, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:51:48, time_cost(all): 2 days, 13:41:00/2:58:31, loss=0.285569320269135, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=3.2908681534601145, lr=0.003824843196770047
2023-12-08 00:36:57   INFO  epoch: 69/72, acc_iter=266628, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:51:51, time_cost(all): 2 days, 13:41:42/2:58:14, loss=0.285510122843194, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=4.116624593328259, lr=0.003808195350710626
2023-12-08 00:37:38   INFO  epoch: 69/72, acc_iter=266678, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:53:20, time_cost(all): 2 days, 13:42:23/2:47:47, loss=0.285450925417253, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=0.990069653628767, lr=0.003791547504651206
2023-12-08 00:38:20   INFO  epoch: 69/72, acc_iter=266728, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:51:58, time_cost(all): 2 days, 13:43:05/2:46:54, loss=0.285391727991312, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=0.5914170592470898, lr=0.003774899658591785
2023-12-08 00:39:02   INFO  epoch: 69/72, acc_iter=266778, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:47:51, time_cost(all): 2 days, 13:43:47/2:53:34, loss=0.285332530565371, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=3.377773933512698, lr=0.003758251812532365
2023-12-08 00:39:44   INFO  epoch: 69/72, acc_iter=266828, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:46:49, time_cost(all): 2 days, 13:44:29/2:47:09, loss=0.28527333313943, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=4.434181345421186, lr=0.003741603966472945
2023-12-08 00:40:26   INFO  epoch: 69/72, acc_iter=266878, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:13, time_cost(all): 2 days, 13:45:11/2:49:29, loss=0.285214135713489, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=1.7805997205028004, lr=0.003724956120413524
2023-12-08 00:41:07   INFO  epoch: 69/72, acc_iter=266928, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:49:38, time_cost(all): 2 days, 13:45:52/2:53:24, loss=0.285154938287548, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=4.725753236988642, lr=0.003708308274354104
2023-12-08 00:41:49   INFO  epoch: 69/72, acc_iter=266978, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:48:35, time_cost(all): 2 days, 13:46:34/2:48:31, loss=0.285095740861607, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=1.811396922002932, lr=0.003691660428294684
2023-12-08 00:42:31   INFO  epoch: 69/72, acc_iter=267028, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:46:23, time_cost(all): 2 days, 13:47:16/2:39:49, loss=0.285036543435666, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=2.1281950420820546, lr=0.003675012582235264
2023-12-08 00:43:13   INFO  epoch: 69/72, acc_iter=267078, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:45:28, time_cost(all): 2 days, 13:47:58/2:46:59, loss=0.284977346009725, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=2.6333564057246988, lr=0.003658364736175843
2023-12-08 00:43:55   INFO  epoch: 69/72, acc_iter=267128, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:43:37, time_cost(all): 2 days, 13:48:40/2:39:13, loss=0.284918148583784, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=3.937289314473112, lr=0.003641716890116423
2023-12-08 00:44:36   INFO  epoch: 69/72, acc_iter=267178, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:44:14, time_cost(all): 2 days, 13:49:21/2:44:56, loss=0.284858951157843, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=2.004554316721773, lr=0.003625069044057002
2023-12-08 00:45:18   INFO  epoch: 69/72, acc_iter=267228, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:05, time_cost(all): 2 days, 13:50:03/2:43:29, loss=0.284799753731902, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=4.9498422905036845, lr=0.003608421197997582
2023-12-08 00:46:00   INFO  epoch: 69/72, acc_iter=267278, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:49, time_cost(all): 2 days, 13:50:45/2:35:32, loss=0.284740556305961, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=4.1428123830842, lr=0.003591773351938161
2023-12-08 00:46:42   INFO  epoch: 69/72, acc_iter=267328, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:41:34, time_cost(all): 2 days, 13:51:27/2:48:05, loss=0.28468135888002, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=1.8684633484752022, lr=0.003575125505878741
2023-12-08 00:47:23   INFO  epoch: 69/72, acc_iter=267378, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:40:46, time_cost(all): 2 days, 13:52:08/2:40:18, loss=0.284622161454079, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=3.5591034347694652, lr=0.003558477659819321
2023-12-08 00:48:05   INFO  epoch: 69/72, acc_iter=267428, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:41:06, time_cost(all): 2 days, 13:52:50/2:37:19, loss=0.284562964028139, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=1.774739194120616, lr=0.0035418298137599
2023-12-08 00:48:47   INFO  epoch: 69/72, acc_iter=267478, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:30, time_cost(all): 2 days, 13:53:32/2:41:06, loss=0.284503766602198, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=3.30929766490885, lr=0.00352518196770048
2023-12-08 00:49:29   INFO  epoch: 69/72, acc_iter=267528, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:14, time_cost(all): 2 days, 13:54:14/2:34:45, loss=0.284444569176257, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=4.078684117941419, lr=0.00350853412164106
2023-12-08 00:50:11   INFO  epoch: 69/72, acc_iter=267578, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:38, time_cost(all): 2 days, 13:54:56/2:36:14, loss=0.284385371750316, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.7307489999656354, lr=0.00349188627558164
2023-12-08 00:50:52   INFO  epoch: 69/72, acc_iter=267628, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:38:47, time_cost(all): 2 days, 13:55:37/2:38:00, loss=0.284326174324375, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=3.506280096064959, lr=0.003475238429522219
2023-12-08 00:51:34   INFO  epoch: 69/72, acc_iter=267678, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:38:06, time_cost(all): 2 days, 13:56:19/2:35:12, loss=0.284266976898434, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=2.266896786268095, lr=0.003458590583462799
2023-12-08 00:52:16   INFO  epoch: 69/72, acc_iter=267728, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:37:48, time_cost(all): 2 days, 13:57:01/2:43:03, loss=0.284207779472493, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=2.6722227738472535, lr=0.003441942737403378
2023-12-08 00:52:58   INFO  epoch: 69/72, acc_iter=267778, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:35:59, time_cost(all): 2 days, 13:57:43/2:34:40, loss=0.284148582046552, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=1.318653284537258, lr=0.003425294891343958
2023-12-08 00:53:39   INFO  epoch: 69/72, acc_iter=267828, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:35:32, time_cost(all): 2 days, 13:58:24/2:33:06, loss=0.284089384620611, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=0.7683690779869666, lr=0.003408647045284537
2023-12-08 00:54:21   INFO  epoch: 69/72, acc_iter=267878, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:34:53, time_cost(all): 2 days, 13:59:06/2:32:53, loss=0.28403018719467, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=2.5383315123577845, lr=0.003391999199225117
2023-12-08 00:55:03   INFO  epoch: 69/72, acc_iter=267928, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:33, time_cost(all): 2 days, 13:59:48/2:30:38, loss=0.283970989768729, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=3.6472275224102098, lr=0.003375351353165697
2023-12-08 00:55:45   INFO  epoch: 69/72, acc_iter=267978, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:34:14, time_cost(all): 2 days, 14:00:30/2:26:26, loss=0.283911792342788, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=2.317753749404095, lr=0.003358703507106276
2023-12-08 00:56:27   INFO  epoch: 69/72, acc_iter=268028, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:33, time_cost(all): 2 days, 14:01:12/2:27:13, loss=0.283852594916847, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=3.744075397608389, lr=0.003342055661046856
2023-12-08 00:57:08   INFO  epoch: 69/72, acc_iter=268078, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:18, time_cost(all): 2 days, 14:01:53/2:33:56, loss=0.283793397490906, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=3.875243383886749, lr=0.003325407814987436
2023-12-08 00:57:50   INFO  epoch: 69/72, acc_iter=268128, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:29:58, time_cost(all): 2 days, 14:02:35/2:31:33, loss=0.283734200064965, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=1.2735601615897965, lr=0.003308759968928016
2023-12-08 00:58:32   INFO  epoch: 69/72, acc_iter=268178, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:58, time_cost(all): 2 days, 14:03:17/2:28:42, loss=0.283675002639024, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=0.6390815717388625, lr=0.003292112122868595
2023-12-08 00:59:14   INFO  epoch: 69/72, acc_iter=268228, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:29:06, time_cost(all): 2 days, 14:03:59/2:21:48, loss=0.283615805213083, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.08(1.03), norm=1.089437918575794, lr=0.003275464276809175
2023-12-08 00:59:55   INFO  epoch: 69/72, acc_iter=268278, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:29:29, time_cost(all): 2 days, 14:04:40/2:34:03, loss=0.283556607787143, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=2.393694242803247, lr=0.003258816430749754
2023-12-08 01:00:37   INFO  epoch: 69/72, acc_iter=268328, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:28:33, time_cost(all): 2 days, 14:05:22/2:21:37, loss=0.283497410361202, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=1.8724392973201756, lr=0.003242168584690334
2023-12-08 01:01:19   INFO  epoch: 69/72, acc_iter=268378, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:26, time_cost(all): 2 days, 14:06:04/2:29:27, loss=0.283438212935261, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=4.049338017713234, lr=0.003225520738630913
2023-12-08 01:02:01   INFO  epoch: 69/72, acc_iter=268428, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:26:08, time_cost(all): 2 days, 14:06:46/2:31:43, loss=0.28337901550932, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=3.4719203287636184, lr=0.003208872892571493
2023-12-08 01:02:43   INFO  epoch: 69/72, acc_iter=268478, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:30, time_cost(all): 2 days, 14:07:28/2:28:24, loss=0.283319818083379, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=3.357089301369439, lr=0.003192225046512073
2023-12-08 01:03:24   INFO  epoch: 69/72, acc_iter=268528, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:46, time_cost(all): 2 days, 14:08:09/2:29:32, loss=0.283260620657438, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=4.9641837722971, lr=0.003175577200452652
2023-12-08 01:04:06   INFO  epoch: 69/72, acc_iter=268578, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:26, time_cost(all): 2 days, 14:08:51/2:22:10, loss=0.283201423231497, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=2.8159284917588003, lr=0.003158929354393232
2023-12-08 01:04:48   INFO  epoch: 69/72, acc_iter=268628, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:52, time_cost(all): 2 days, 14:09:33/2:28:40, loss=0.283142225805556, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=4.156655074329706, lr=0.003142281508333811
2023-12-08 01:05:30   INFO  epoch: 69/72, acc_iter=268678, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:16, time_cost(all): 2 days, 14:10:15/2:27:49, loss=0.283083028379615, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=1.9135706476203993, lr=0.003125633662274391
2023-12-08 01:06:11   INFO  epoch: 69/72, acc_iter=268728, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:53, time_cost(all): 2 days, 14:10:56/2:21:54, loss=0.283023830953674, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=3.145156213766768, lr=0.003108985816214971
2023-12-08 01:06:53   INFO  epoch: 69/72, acc_iter=268778, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:03, time_cost(all): 2 days, 14:11:38/2:23:56, loss=0.282964633527733, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=1.9519041300480573, lr=0.00309233797015555
2023-12-08 01:07:35   INFO  epoch: 69/72, acc_iter=268828, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:47, time_cost(all): 2 days, 14:12:20/2:25:21, loss=0.282905436101792, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=0.8694820160442305, lr=0.00307569012409613
2023-12-08 01:08:17   INFO  epoch: 69/72, acc_iter=268878, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:20:34, time_cost(all): 2 days, 14:13:02/2:16:56, loss=0.282846238675851, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=0.7957096474472078, lr=0.00305904227803671
2023-12-08 01:08:59   INFO  epoch: 69/72, acc_iter=268928, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:18:58, time_cost(all): 2 days, 14:13:44/2:14:07, loss=0.28278704124991, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=3.0963249259667003, lr=0.003042394431977289
2023-12-08 01:09:40   INFO  epoch: 69/72, acc_iter=268978, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:41, time_cost(all): 2 days, 14:14:25/2:17:05, loss=0.282727843823969, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=3.61525891444512, lr=0.003025746585917869
2023-12-08 01:10:22   INFO  epoch: 69/72, acc_iter=269028, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:04, time_cost(all): 2 days, 14:15:07/2:12:22, loss=0.282668646398028, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=3.833957763026814, lr=0.003009098739858449
2023-12-08 01:11:04   INFO  epoch: 69/72, acc_iter=269078, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:16:56, time_cost(all): 2 days, 14:15:49/2:19:56, loss=0.282609448972087, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.2634228720396887, lr=0.002992450893799028
2023-12-08 01:11:46   INFO  epoch: 69/72, acc_iter=269128, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:24, time_cost(all): 2 days, 14:16:31/2:19:56, loss=0.282550251546146, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=1.4909287928019594, lr=0.002975803047739608
2023-12-08 01:12:27   INFO  epoch: 69/72, acc_iter=269178, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:57, time_cost(all): 2 days, 14:17:12/2:11:55, loss=0.282491054120206, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=2.9055665295673014, lr=0.002959155201680188
2023-12-08 01:13:09   INFO  epoch: 69/72, acc_iter=269228, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:51, time_cost(all): 2 days, 14:17:54/2:08:10, loss=0.282431856694265, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=3.5130135630632786, lr=0.002942507355620767
2023-12-08 01:13:51   INFO  epoch: 69/72, acc_iter=269278, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:15:11, time_cost(all): 2 days, 14:18:36/2:16:11, loss=0.282372659268324, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=1.522030060718761, lr=0.002925859509561347
2023-12-08 01:14:33   INFO  epoch: 69/72, acc_iter=269328, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:25, time_cost(all): 2 days, 14:19:18/2:09:19, loss=0.282313461842383, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=3.8754864918670306, lr=0.002909211663501926
2023-12-08 01:15:15   INFO  epoch: 69/72, acc_iter=269378, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:52, time_cost(all): 2 days, 14:20:00/2:06:42, loss=0.282254264416442, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=3.023824504850021, lr=0.002892563817442506
2023-12-08 01:15:56   INFO  epoch: 69/72, acc_iter=269428, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:17, time_cost(all): 2 days, 14:20:41/2:09:38, loss=0.282195066990501, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=3.9749314925980874, lr=0.002875915971383086
2023-12-08 01:16:38   INFO  epoch: 69/72, acc_iter=269478, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:12:23, time_cost(all): 2 days, 14:21:23/2:11:35, loss=0.28213586956456, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=0.8502524383219903, lr=0.002859268125323665
2023-12-08 01:17:20   INFO  epoch: 69/72, acc_iter=269528, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:01, time_cost(all): 2 days, 14:22:05/2:09:40, loss=0.282076672138619, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=4.939844982493428, lr=0.002842620279264245
2023-12-08 01:18:02   INFO  epoch: 69/72, acc_iter=269578, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:11:02, time_cost(all): 2 days, 14:22:47/2:07:11, loss=0.282017474712678, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=3.6409544118057804, lr=0.002825972433204825
2023-12-08 01:18:44   INFO  epoch: 69/72, acc_iter=269628, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:36, time_cost(all): 2 days, 14:23:29/2:08:00, loss=0.281958277286737, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=1.8095335476589485, lr=0.002809324587145404
2023-12-08 01:19:25   INFO  epoch: 69/72, acc_iter=269678, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:08:57, time_cost(all): 2 days, 14:24:10/2:08:18, loss=0.281899079860796, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=0.7349004895817053, lr=0.002792676741085984
2023-12-08 01:20:07   INFO  epoch: 69/72, acc_iter=269728, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:17, time_cost(all): 2 days, 14:24:52/2:03:30, loss=0.281839882434855, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=1.9897951464225554, lr=0.002776028895026564
2023-12-08 01:20:49   INFO  epoch: 69/72, acc_iter=269778, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:40, time_cost(all): 2 days, 14:25:34/2:13:07, loss=0.281780685008914, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=3.5397878034966106, lr=0.002759381048967143
2023-12-08 01:21:31   INFO  epoch: 69/72, acc_iter=269828, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:58, time_cost(all): 2 days, 14:26:16/2:01:45, loss=0.281721487582973, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=4.8233988454348715, lr=0.002742733202907723
2023-12-08 01:22:12   INFO  epoch: 69/72, acc_iter=269878, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:07, time_cost(all): 2 days, 14:26:57/2:06:24, loss=0.281662290157032, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=2.633970978788108, lr=0.002726085356848302
2023-12-08 01:22:54   INFO  epoch: 69/72, acc_iter=269928, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:31, time_cost(all): 2 days, 14:27:39/1:59:55, loss=0.281603092731091, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=0.9606913035326488, lr=0.002709437510788882
2023-12-08 01:23:36   INFO  epoch: 69/72, acc_iter=269978, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:08, time_cost(all): 2 days, 14:28:21/2:09:21, loss=0.281543895305151, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=0.7449763378265402, lr=0.002692789664729462
2023-12-08 01:24:18   INFO  epoch: 69/72, acc_iter=270028, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:19, time_cost(all): 2 days, 14:29:03/2:02:33, loss=0.28148469787921, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=4.303784773319878, lr=0.002676141818670041
2023-12-08 01:25:00   INFO  epoch: 69/72, acc_iter=270078, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:47, time_cost(all): 2 days, 14:29:45/1:58:32, loss=0.281425500453269, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=2.419247185663254, lr=0.002659493972610621
2023-12-08 01:25:41   INFO  epoch: 69/72, acc_iter=270128, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:03:02, time_cost(all): 2 days, 14:30:26/2:06:29, loss=0.281366303027328, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=2.3040920290981024, lr=0.0026428461265512
2023-12-08 01:26:23   INFO  epoch: 69/72, acc_iter=270178, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:21, time_cost(all): 2 days, 14:31:08/2:00:07, loss=0.281307105601387, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=2.379538360205227, lr=0.00262619828049178
2023-12-08 01:27:05   INFO  epoch: 69/72, acc_iter=270228, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:34, time_cost(all): 2 days, 14:31:50/1:55:04, loss=0.281247908175446, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=1.3398525628050562, lr=0.00260955043443236
2023-12-08 01:27:47   INFO  epoch: 69/72, acc_iter=270278, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:53, time_cost(all): 2 days, 14:32:32/1:57:21, loss=0.281188710749505, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=2.2777921356519237, lr=0.00259290258837294
2023-12-08 01:28:28   INFO  epoch: 69/72, acc_iter=270328, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 14:33:13/1:56:29, loss=0.281129513323564, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=1.3248526724337528, lr=0.002576254742313519
2023-12-08 01:29:10   INFO  epoch: 70/72, acc_iter=270390, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:54:29, time_cost(all): 2 days, 14:33:55/1:55:48, loss=0.281056108515397, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.94(1.03), norm=2.6647508997601, lr=0.002555611413199838
2023-12-08 01:29:52   INFO  epoch: 70/72, acc_iter=270440, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:52:29, time_cost(all): 2 days, 14:34:37/1:59:35, loss=0.280996911089456, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=4.590440059835817, lr=0.002538963567140418
2023-12-08 01:30:34   INFO  epoch: 70/72, acc_iter=270490, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:51, time_cost(all): 2 days, 14:35:19/1:56:32, loss=0.280937713663515, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=3.8881691323750496, lr=0.002522315721080997
2023-12-08 01:31:16   INFO  epoch: 70/72, acc_iter=270540, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:50:11, time_cost(all): 2 days, 14:36:01/1:57:45, loss=0.280878516237574, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=1.2055701852526497, lr=0.002505667875021577
2023-12-08 01:31:57   INFO  epoch: 70/72, acc_iter=270590, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:47, time_cost(all): 2 days, 14:36:42/1:56:38, loss=0.280819318811633, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=3.949045539084365, lr=0.002489020028962156
2023-12-08 01:32:39   INFO  epoch: 70/72, acc_iter=270640, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:02, time_cost(all): 2 days, 14:37:24/1:55:32, loss=0.280760121385692, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=2.3661210552069374, lr=0.002472372182902736
2023-12-08 01:33:21   INFO  epoch: 70/72, acc_iter=270690, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:48:26, time_cost(all): 2 days, 14:38:06/1:58:53, loss=0.280700923959751, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=3.900628156631517, lr=0.002455724336843316
2023-12-08 01:34:03   INFO  epoch: 70/72, acc_iter=270740, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:49:16, time_cost(all): 2 days, 14:38:48/1:59:07, loss=0.280641726533811, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=2.8697704042195977, lr=0.002439076490783895
2023-12-08 01:34:44   INFO  epoch: 70/72, acc_iter=270790, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:48:10, time_cost(all): 2 days, 14:39:29/1:57:59, loss=0.28058252910787, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=3.99539861580526, lr=0.002422428644724475
2023-12-08 01:35:26   INFO  epoch: 70/72, acc_iter=270840, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:45:36, time_cost(all): 2 days, 14:40:11/1:54:30, loss=0.280523331681929, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=4.986441287889038, lr=0.002405780798665054
2023-12-08 01:36:08   INFO  epoch: 70/72, acc_iter=270890, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:44:50, time_cost(all): 2 days, 14:40:53/1:52:17, loss=0.280464134255988, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=3.713226661540327, lr=0.002389132952605634
2023-12-08 01:36:50   INFO  epoch: 70/72, acc_iter=270940, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:37, time_cost(all): 2 days, 14:41:35/1:53:33, loss=0.280404936830047, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=3.7458255465571524, lr=0.002372485106546214
2023-12-08 01:37:32   INFO  epoch: 70/72, acc_iter=270990, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:44:07, time_cost(all): 2 days, 14:42:17/1:47:26, loss=0.280345739404106, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=4.442372023836285, lr=0.002355837260486793
2023-12-08 01:38:13   INFO  epoch: 70/72, acc_iter=271040, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:01, time_cost(all): 2 days, 14:42:58/1:47:22, loss=0.280286541978165, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=2.0926469722971914, lr=0.002339189414427373
2023-12-08 01:38:55   INFO  epoch: 70/72, acc_iter=271090, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:42:48, time_cost(all): 2 days, 14:43:40/1:51:59, loss=0.280227344552224, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=4.94297573885055, lr=0.002322541568367953
2023-12-08 01:39:37   INFO  epoch: 70/72, acc_iter=271140, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:40:35, time_cost(all): 2 days, 14:44:22/1:49:30, loss=0.280168147126283, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=2.8372913300364124, lr=0.002305893722308532
2023-12-08 01:40:19   INFO  epoch: 70/72, acc_iter=271190, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:40:35, time_cost(all): 2 days, 14:45:04/1:44:40, loss=0.280108949700342, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=3.0868378603610847, lr=0.002289245876249112
2023-12-08 01:41:00   INFO  epoch: 70/72, acc_iter=271240, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:42:25, time_cost(all): 2 days, 14:45:45/1:42:59, loss=0.280049752274401, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=4.876648118697967, lr=0.002272598030189692
2023-12-08 01:41:42   INFO  epoch: 70/72, acc_iter=271290, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:55, time_cost(all): 2 days, 14:46:27/1:45:29, loss=0.27999055484846, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=2.7682407781959846, lr=0.002255950184130271
2023-12-08 01:42:24   INFO  epoch: 70/72, acc_iter=271340, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:39:22, time_cost(all): 2 days, 14:47:09/1:44:08, loss=0.279931357422519, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=2.2118633301020036, lr=0.002239302338070851
2023-12-08 01:43:06   INFO  epoch: 70/72, acc_iter=271390, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:40:35, time_cost(all): 2 days, 14:47:51/1:39:28, loss=0.279872159996578, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=0.9451892851806487, lr=0.00222265449201143
2023-12-08 01:43:48   INFO  epoch: 70/72, acc_iter=271440, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:39, time_cost(all): 2 days, 14:48:33/1:43:35, loss=0.279812962570637, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=4.873315626166382, lr=0.00220600664595201
2023-12-08 01:44:29   INFO  epoch: 70/72, acc_iter=271490, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:36:31, time_cost(all): 2 days, 14:49:14/1:41:08, loss=0.279753765144696, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=3.790941939420696, lr=0.002189358799892589
2023-12-08 01:45:11   INFO  epoch: 70/72, acc_iter=271540, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:35:36, time_cost(all): 2 days, 14:49:56/1:44:54, loss=0.279694567718756, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.119080311692119, lr=0.002172710953833169
2023-12-08 01:45:53   INFO  epoch: 70/72, acc_iter=271590, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:38:11, time_cost(all): 2 days, 14:50:38/1:42:27, loss=0.279635370292815, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=0.7259242136690065, lr=0.002156063107773749
2023-12-08 01:46:35   INFO  epoch: 70/72, acc_iter=271640, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:36:04, time_cost(all): 2 days, 14:51:20/1:36:09, loss=0.279576172866874, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=1.2805733673719861, lr=0.002139415261714329
2023-12-08 01:47:16   INFO  epoch: 70/72, acc_iter=271690, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:41, time_cost(all): 2 days, 14:52:01/1:35:39, loss=0.279516975440933, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=0.8640771901160895, lr=0.002122767415654908
2023-12-08 01:47:58   INFO  epoch: 70/72, acc_iter=271740, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:38, time_cost(all): 2 days, 14:52:43/1:38:35, loss=0.279457778014992, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=4.291615142854601, lr=0.002106119569595488
2023-12-08 01:48:40   INFO  epoch: 70/72, acc_iter=271790, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:46, time_cost(all): 2 days, 14:53:25/1:36:24, loss=0.279398580589051, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=2.6918189323367407, lr=0.002089471723536068
2023-12-08 01:49:22   INFO  epoch: 70/72, acc_iter=271840, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:44, time_cost(all): 2 days, 14:54:07/1:35:42, loss=0.27933938316311, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=1.906235005494294, lr=0.002072823877476647
2023-12-08 01:50:04   INFO  epoch: 70/72, acc_iter=271890, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:27, time_cost(all): 2 days, 14:54:49/1:41:39, loss=0.279280185737169, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=4.815550303463184, lr=0.002056176031417226
2023-12-08 01:50:45   INFO  epoch: 70/72, acc_iter=271940, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:32:08, time_cost(all): 2 days, 14:55:30/1:37:25, loss=0.279220988311228, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=2.728162515751239, lr=0.002039528185357806
2023-12-08 01:51:27   INFO  epoch: 70/72, acc_iter=271990, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:30:45, time_cost(all): 2 days, 14:56:12/1:32:19, loss=0.279161790885287, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=4.693703110619413, lr=0.002022880339298386
2023-12-08 01:52:09   INFO  epoch: 70/72, acc_iter=272040, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:28:41, time_cost(all): 2 days, 14:56:54/1:38:15, loss=0.279102593459346, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=2.256480396558827, lr=0.002006232493238965
2023-12-08 01:52:51   INFO  epoch: 70/72, acc_iter=272090, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:28:11, time_cost(all): 2 days, 14:57:36/1:34:55, loss=0.279043396033405, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=4.973566769732009, lr=0.001989584647179545
2023-12-08 01:53:33   INFO  epoch: 70/72, acc_iter=272140, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:22, time_cost(all): 2 days, 14:58:18/1:33:25, loss=0.278984198607464, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=1.4379857639356846, lr=0.001972936801120125
2023-12-08 01:54:14   INFO  epoch: 70/72, acc_iter=272190, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:29:03, time_cost(all): 2 days, 14:58:59/1:35:06, loss=0.278925001181523, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=4.2638014301842055, lr=0.001956288955060705
2023-12-08 01:54:56   INFO  epoch: 70/72, acc_iter=272240, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:27:11, time_cost(all): 2 days, 14:59:41/1:29:59, loss=0.278865803755582, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=3.122415136097013, lr=0.001939641109001284
2023-12-08 01:55:38   INFO  epoch: 70/72, acc_iter=272290, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:21, time_cost(all): 2 days, 15:00:23/1:31:48, loss=0.278806606329641, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=3.874279673400929, lr=0.001922993262941864
2023-12-08 01:56:20   INFO  epoch: 70/72, acc_iter=272340, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:24, time_cost(all): 2 days, 15:01:05/1:28:13, loss=0.2787474089037, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=1.042731623556099, lr=0.001906345416882444
2023-12-08 01:57:01   INFO  epoch: 70/72, acc_iter=272390, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:25:02, time_cost(all): 2 days, 15:01:46/1:27:22, loss=0.278688211477759, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=1.1826856232949385, lr=0.001889697570823023
2023-12-08 01:57:43   INFO  epoch: 70/72, acc_iter=272440, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:24:46, time_cost(all): 2 days, 15:02:28/1:26:56, loss=0.278629014051819, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=0.5742103541991841, lr=0.001873049724763602
2023-12-08 01:58:25   INFO  epoch: 70/72, acc_iter=272490, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:22:50, time_cost(all): 2 days, 15:03:10/1:26:15, loss=0.278569816625878, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=1.1273815155823355, lr=0.001856401878704182
2023-12-08 01:59:07   INFO  epoch: 70/72, acc_iter=272540, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:24:10, time_cost(all): 2 days, 15:03:52/1:30:56, loss=0.278510619199937, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=1.3150145085645988, lr=0.001839754032644762
2023-12-08 01:59:49   INFO  epoch: 70/72, acc_iter=272590, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:22:26, time_cost(all): 2 days, 15:04:34/1:27:31, loss=0.278451421773996, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=1.6548453644131234, lr=0.001823106186585341
2023-12-08 02:00:30   INFO  epoch: 70/72, acc_iter=272640, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:21:35, time_cost(all): 2 days, 15:05:15/1:29:26, loss=0.278392224348055, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=3.240809125119781, lr=0.001806458340525921
2023-12-08 02:01:12   INFO  epoch: 70/72, acc_iter=272690, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:15, time_cost(all): 2 days, 15:05:57/1:29:03, loss=0.278333026922114, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=1.9100803176207433, lr=0.001789810494466501
2023-12-08 02:01:54   INFO  epoch: 70/72, acc_iter=272740, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:50, time_cost(all): 2 days, 15:06:39/1:29:09, loss=0.278273829496173, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=4.138682509796219, lr=0.001773162648407081
2023-12-08 02:02:36   INFO  epoch: 70/72, acc_iter=272790, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:29, time_cost(all): 2 days, 15:07:21/1:24:23, loss=0.278214632070232, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=1.3763047434037257, lr=0.00175651480234766
2023-12-08 02:03:17   INFO  epoch: 70/72, acc_iter=272840, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:18:15, time_cost(all): 2 days, 15:08:02/1:23:20, loss=0.278155434644291, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=2.7384504506237106, lr=0.00173986695628824
2023-12-08 02:03:59   INFO  epoch: 70/72, acc_iter=272890, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:18:39, time_cost(all): 2 days, 15:08:44/1:23:05, loss=0.27809623721835, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=2.2817094202565333, lr=0.00172321911022882
2023-12-08 02:04:41   INFO  epoch: 70/72, acc_iter=272940, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:57, time_cost(all): 2 days, 15:09:26/1:25:28, loss=0.278037039792409, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=3.065789179672113, lr=0.001706571264169399
2023-12-08 02:05:23   INFO  epoch: 70/72, acc_iter=272990, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:16:30, time_cost(all): 2 days, 15:10:08/1:22:45, loss=0.277977842366468, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=4.849328077174207, lr=0.001689923418109978
2023-12-08 02:06:05   INFO  epoch: 70/72, acc_iter=273040, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:19, time_cost(all): 2 days, 15:10:50/1:18:18, loss=0.277918644940527, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=2.9944972019259777, lr=0.001673275572050558
2023-12-08 02:06:46   INFO  epoch: 70/72, acc_iter=273090, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:15:13, time_cost(all): 2 days, 15:11:31/1:19:45, loss=0.277859447514586, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=4.480193230552131, lr=0.001656627725991138
2023-12-08 02:07:28   INFO  epoch: 70/72, acc_iter=273140, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:22, time_cost(all): 2 days, 15:12:13/1:18:42, loss=0.277800250088645, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=2.3943082841591496, lr=0.001639979879931718
2023-12-08 02:08:10   INFO  epoch: 70/72, acc_iter=273190, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:14:07, time_cost(all): 2 days, 15:12:55/1:18:24, loss=0.277741052662704, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=3.228038489376653, lr=0.001623332033872297
2023-12-08 02:08:52   INFO  epoch: 70/72, acc_iter=273240, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:09, time_cost(all): 2 days, 15:13:37/1:15:19, loss=0.277681855236763, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=0.54750879276882, lr=0.001606684187812877
2023-12-08 02:09:33   INFO  epoch: 70/72, acc_iter=273290, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:17, time_cost(all): 2 days, 15:14:18/1:20:06, loss=0.277622657810823, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=3.561871375383975, lr=0.001590036341753457
2023-12-08 02:10:15   INFO  epoch: 70/72, acc_iter=273340, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:42, time_cost(all): 2 days, 15:15:00/1:19:55, loss=0.277563460384882, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=1.9649006405361111, lr=0.001573388495694036
2023-12-08 02:10:57   INFO  epoch: 70/72, acc_iter=273390, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:11:33, time_cost(all): 2 days, 15:15:42/1:13:08, loss=0.277504262958941, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=3.2542985468842134, lr=0.001556740649634616
2023-12-08 02:11:39   INFO  epoch: 70/72, acc_iter=273440, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:48, time_cost(all): 2 days, 15:16:24/1:19:39, loss=0.277445065533, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=2.0699533512416064, lr=0.001540092803575195
2023-12-08 02:12:21   INFO  epoch: 70/72, acc_iter=273490, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:54, time_cost(all): 2 days, 15:17:06/1:17:14, loss=0.277385868107059, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=3.7861517454960363, lr=0.001523444957515775
2023-12-08 02:13:02   INFO  epoch: 70/72, acc_iter=273540, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:36, time_cost(all): 2 days, 15:17:47/1:17:48, loss=0.277326670681118, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=3.882332691663751, lr=0.001506797111456354
2023-12-08 02:13:44   INFO  epoch: 70/72, acc_iter=273590, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:48, time_cost(all): 2 days, 15:18:29/1:15:07, loss=0.277267473255177, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=3.6438919630750477, lr=0.001490149265396934
2023-12-08 02:14:26   INFO  epoch: 70/72, acc_iter=273640, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:07:42, time_cost(all): 2 days, 15:19:11/1:14:53, loss=0.277208275829236, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=4.827444400747713, lr=0.001473501419337514
2023-12-08 02:15:08   INFO  epoch: 70/72, acc_iter=273690, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:07:22, time_cost(all): 2 days, 15:19:53/1:11:54, loss=0.277149078403295, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=3.1144989711589375, lr=0.001456853573278094
2023-12-08 02:15:49   INFO  epoch: 70/72, acc_iter=273740, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:31, time_cost(all): 2 days, 15:20:34/1:09:40, loss=0.277089880977354, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=4.753806176791203, lr=0.001440205727218673
2023-12-08 02:16:31   INFO  epoch: 70/72, acc_iter=273790, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:54, time_cost(all): 2 days, 15:21:16/1:12:24, loss=0.277030683551413, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=2.162462043694678, lr=0.001423557881159253
2023-12-08 02:17:13   INFO  epoch: 70/72, acc_iter=273840, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:12, time_cost(all): 2 days, 15:21:58/1:09:09, loss=0.276971486125472, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=4.695670106303978, lr=0.001406910035099832
2023-12-08 02:17:55   INFO  epoch: 70/72, acc_iter=273890, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:18, time_cost(all): 2 days, 15:22:40/1:11:07, loss=0.276912288699531, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.83(1.03), norm=4.569692926672915, lr=0.001390262189040412
2023-12-08 02:18:37   INFO  epoch: 70/72, acc_iter=273940, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:31, time_cost(all): 2 days, 15:23:22/1:11:01, loss=0.27685309127359, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=0.6594347634242324, lr=0.001373614342980992
2023-12-08 02:19:18   INFO  epoch: 70/72, acc_iter=273990, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:57, time_cost(all): 2 days, 15:24:03/1:07:17, loss=0.276793893847649, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=4.386541283465289, lr=0.001356966496921571
2023-12-08 02:20:00   INFO  epoch: 70/72, acc_iter=274040, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:21, time_cost(all): 2 days, 15:24:45/1:06:20, loss=0.276734696421708, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=4.254140908479496, lr=0.001340318650862151
2023-12-08 02:20:42   INFO  epoch: 70/72, acc_iter=274090, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:30, time_cost(all): 2 days, 15:25:27/1:09:07, loss=0.276675498995767, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=4.8925977630286726, lr=0.00132367080480273
2023-12-08 02:21:24   INFO  epoch: 70/72, acc_iter=274140, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:52, time_cost(all): 2 days, 15:26:09/1:05:01, loss=0.276616301569827, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=1.9941115790374189, lr=0.00130702295874331
2023-12-08 02:22:05   INFO  epoch: 70/72, acc_iter=274190, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:09, time_cost(all): 2 days, 15:26:50/1:07:23, loss=0.276557104143886, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=4.554873225731136, lr=0.00129037511268389
2023-12-08 02:22:47   INFO  epoch: 71/72, acc_iter=274252, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:41/0:53:28, time_cost(all): 2 days, 15:27:32/1:06:05, loss=0.276483699335719, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.6912699865438614, lr=0.001269731783570208
2023-12-08 02:23:29   INFO  epoch: 71/72, acc_iter=274302, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:23/0:54:05, time_cost(all): 2 days, 15:28:14/1:01:52, loss=0.276424501909778, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=1.157615934030567, lr=0.001253083937510788
2023-12-08 02:24:11   INFO  epoch: 71/72, acc_iter=274352, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:05/0:49:35, time_cost(all): 2 days, 15:28:56/1:05:21, loss=0.276365304483837, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=2.545274852817866, lr=0.001236436091451368
2023-12-08 02:24:53   INFO  epoch: 71/72, acc_iter=274402, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:02:47/0:51:36, time_cost(all): 2 days, 15:29:38/0:59:52, loss=0.276306107057896, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=0.9089681719148366, lr=0.001219788245391948
2023-12-08 02:25:34   INFO  epoch: 71/72, acc_iter=274452, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:03:28/0:52:29, time_cost(all): 2 days, 15:30:19/0:59:24, loss=0.276246909631955, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=4.239234567499048, lr=0.001203140399332527
2023-12-08 02:26:16   INFO  epoch: 71/72, acc_iter=274502, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:04:10/0:50:10, time_cost(all): 2 days, 15:31:01/1:04:16, loss=0.276187712206014, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=2.588602340489627, lr=0.001186492553273107
2023-12-08 02:26:58   INFO  epoch: 71/72, acc_iter=274552, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:04:52/0:49:20, time_cost(all): 2 days, 15:31:43/1:00:39, loss=0.276128514780073, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=1.468962983142016, lr=0.001169844707213687
2023-12-08 02:27:40   INFO  epoch: 71/72, acc_iter=274602, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:05:34/0:48:10, time_cost(all): 2 days, 15:32:25/1:01:43, loss=0.276069317354132, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=2.696868327455446, lr=0.001153196861154266
2023-12-08 02:28:22   INFO  epoch: 71/72, acc_iter=274652, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:06:16/0:45:47, time_cost(all): 2 days, 15:33:07/0:59:09, loss=0.276010119928191, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=4.607088071465362, lr=0.001136549015094845
2023-12-08 02:29:03   INFO  epoch: 71/72, acc_iter=274702, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:06:57/0:47:30, time_cost(all): 2 days, 15:33:48/1:01:15, loss=0.27595092250225, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=0.5802702439985836, lr=0.001119901169035425
2023-12-08 02:29:45   INFO  epoch: 71/72, acc_iter=274752, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:07:39/0:45:34, time_cost(all): 2 days, 15:34:30/0:57:27, loss=0.275891725076309, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=4.727535017372459, lr=0.001103253322976005
2023-12-08 02:30:27   INFO  epoch: 71/72, acc_iter=274802, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:08:21/0:43:16, time_cost(all): 2 days, 15:35:12/0:58:45, loss=0.275832527650368, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=3.1433500588222483, lr=0.001086605476916584
2023-12-08 02:31:09   INFO  epoch: 71/72, acc_iter=274852, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:09:03/0:46:18, time_cost(all): 2 days, 15:35:54/0:58:52, loss=0.275773330224428, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=4.931779199572146, lr=0.001069957630857164
2023-12-08 02:31:50   INFO  epoch: 71/72, acc_iter=274902, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:09:44/0:42:56, time_cost(all): 2 days, 15:36:35/0:54:35, loss=0.275714132798487, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=4.070730023570858, lr=0.001053309784797744
2023-12-08 02:32:32   INFO  epoch: 71/72, acc_iter=274952, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:10:26/0:44:40, time_cost(all): 2 days, 15:37:17/0:53:08, loss=0.275654935372546, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=0.8499767697509188, lr=0.001036661938738324
2023-12-08 02:33:14   INFO  epoch: 71/72, acc_iter=275002, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:11:08/0:42:52, time_cost(all): 2 days, 15:37:59/0:52:50, loss=0.275595737946605, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=2.8294960505935873, lr=0.001020014092678903
2023-12-08 02:33:56   INFO  epoch: 71/72, acc_iter=275052, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:11:50/0:43:21, time_cost(all): 2 days, 15:38:41/0:51:12, loss=0.275536540520664, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.91(1.03), norm=4.640785919412952, lr=0.001003366246619483
2023-12-08 02:34:38   INFO  epoch: 71/72, acc_iter=275102, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:12:32/0:41:28, time_cost(all): 2 days, 15:39:23/0:52:27, loss=0.275477343094723, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=1.0935539164217434, lr=0.000986718400560062
2023-12-08 02:35:19   INFO  epoch: 71/72, acc_iter=275152, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:13:13/0:39:46, time_cost(all): 2 days, 15:40:04/0:53:00, loss=0.275418145668782, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=4.449254223945005, lr=0.000970070554500642
2023-12-08 02:36:01   INFO  epoch: 71/72, acc_iter=275202, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:13:55/0:41:24, time_cost(all): 2 days, 15:40:46/0:52:52, loss=0.275358948242841, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=1.3876822001517708, lr=0.000953422708441221
2023-12-08 02:36:43   INFO  epoch: 71/72, acc_iter=275252, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:14:37/0:39:41, time_cost(all): 2 days, 15:41:28/0:49:28, loss=0.2752997508169, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=3.8654383303858135, lr=0.000936774862381801
2023-12-08 02:37:25   INFO  epoch: 71/72, acc_iter=275302, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:15:19/0:36:53, time_cost(all): 2 days, 15:42:10/0:49:40, loss=0.275240553390959, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=1.3048731225910764, lr=0.000920127016322381
2023-12-08 02:38:06   INFO  epoch: 71/72, acc_iter=275352, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:16:00/0:37:20, time_cost(all): 2 days, 15:42:51/0:51:31, loss=0.275181355965018, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=3.6395224063524068, lr=0.00090347917026296
2023-12-08 02:38:48   INFO  epoch: 71/72, acc_iter=275402, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:16:42/0:37:28, time_cost(all): 2 days, 15:43:33/0:47:49, loss=0.275122158539077, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=4.626990405755437, lr=0.00088683132420354
2023-12-08 02:39:30   INFO  epoch: 71/72, acc_iter=275452, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:17:24/0:35:42, time_cost(all): 2 days, 15:44:15/0:45:53, loss=0.275062961113136, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=1.2511195718892154, lr=0.00087018347814412
2023-12-08 02:40:12   INFO  epoch: 71/72, acc_iter=275502, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:18:06/0:34:38, time_cost(all): 2 days, 15:44:57/0:45:43, loss=0.275003763687195, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=0.8710495771785884, lr=0.0008535356320847
2023-12-08 02:40:54   INFO  epoch: 71/72, acc_iter=275552, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:18:48/0:34:32, time_cost(all): 2 days, 15:45:39/0:47:31, loss=0.274944566261254, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=0.9621110319841224, lr=0.000836887786025279
2023-12-08 02:41:35   INFO  epoch: 71/72, acc_iter=275602, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:19:29/0:35:16, time_cost(all): 2 days, 15:46:20/0:46:19, loss=0.274885368835313, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=2.3894683519217135, lr=0.000820239939965859
2023-12-08 02:42:17   INFO  epoch: 71/72, acc_iter=275652, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:20:11/0:32:39, time_cost(all): 2 days, 15:47:02/0:46:13, loss=0.274826171409372, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=4.507687745835196, lr=0.000803592093906438
2023-12-08 02:42:59   INFO  epoch: 71/72, acc_iter=275702, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:20:53/0:32:41, time_cost(all): 2 days, 15:47:44/0:42:33, loss=0.274766973983432, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=4.950312766280215, lr=0.000786944247847018
2023-12-08 02:43:41   INFO  epoch: 71/72, acc_iter=275752, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:21:35/0:31:27, time_cost(all): 2 days, 15:48:26/0:43:02, loss=0.274707776557491, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=0.5609148335108485, lr=0.000770296401787597
2023-12-08 02:44:22   INFO  epoch: 71/72, acc_iter=275802, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:22:16/0:30:45, time_cost(all): 2 days, 15:49:07/0:42:10, loss=0.27464857913155, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=0.6956650229262482, lr=0.000753648555728177
2023-12-08 02:45:04   INFO  epoch: 71/72, acc_iter=275852, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:22:58/0:31:40, time_cost(all): 2 days, 15:49:49/0:44:46, loss=0.274589381705609, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=4.088761291934092, lr=0.000737000709668757
2023-12-08 02:45:46   INFO  epoch: 71/72, acc_iter=275902, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:23:40/0:30:54, time_cost(all): 2 days, 15:50:31/0:42:48, loss=0.274530184279668, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.88(1.03), norm=2.5557981606924516, lr=0.000720352863609336
2023-12-08 02:46:28   INFO  epoch: 71/72, acc_iter=275952, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:24:22/0:30:52, time_cost(all): 2 days, 15:51:13/0:39:23, loss=0.274470986853727, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=4.896052047268262, lr=0.000703705017549916
2023-12-08 02:47:10   INFO  epoch: 71/72, acc_iter=276002, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:25:04/0:28:24, time_cost(all): 2 days, 15:51:55/0:39:17, loss=0.274411789427786, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=4.62556189103367, lr=0.000687057171490496
2023-12-08 02:47:51   INFO  epoch: 71/72, acc_iter=276052, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:25:45/0:27:40, time_cost(all): 2 days, 15:52:36/0:41:28, loss=0.274352592001845, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=2.149782670911895, lr=0.000670409325431076
2023-12-08 02:48:33   INFO  epoch: 71/72, acc_iter=276102, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:26:27/0:26:32, time_cost(all): 2 days, 15:53:18/0:37:19, loss=0.274293394575904, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=0.8214915477797992, lr=0.000653761479371655
2023-12-08 02:49:15   INFO  epoch: 71/72, acc_iter=276152, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:27:09/0:25:53, time_cost(all): 2 days, 15:54:00/0:38:57, loss=0.274234197149963, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=3.453580184030024, lr=0.000637113633312235
2023-12-08 02:49:57   INFO  epoch: 71/72, acc_iter=276202, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:27:51/0:25:30, time_cost(all): 2 days, 15:54:42/0:39:10, loss=0.274174999724022, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=4.715211018222414, lr=0.000620465787252814
2023-12-08 02:50:38   INFO  epoch: 71/72, acc_iter=276252, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:28:32/0:26:20, time_cost(all): 2 days, 15:55:23/0:36:02, loss=0.274115802298081, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=4.808391063026031, lr=0.000603817941193394
2023-12-08 02:51:20   INFO  epoch: 71/72, acc_iter=276302, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:29:14/0:23:19, time_cost(all): 2 days, 15:56:05/0:36:35, loss=0.27405660487214, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=3.836292575659351, lr=0.000587170095133973
2023-12-08 02:52:02   INFO  epoch: 71/72, acc_iter=276352, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:29:56/0:23:19, time_cost(all): 2 days, 15:56:47/0:37:26, loss=0.273997407446199, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=3.7075693128083005, lr=0.000570522249074553
2023-12-08 02:52:44   INFO  epoch: 71/72, acc_iter=276402, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:30:38/0:22:09, time_cost(all): 2 days, 15:57:29/0:33:41, loss=0.273938210020258, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=1.8777926937405, lr=0.000553874403015133
2023-12-08 02:53:26   INFO  epoch: 71/72, acc_iter=276452, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:31:20/0:21:59, time_cost(all): 2 days, 15:58:11/0:33:29, loss=0.273879012594317, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=4.050981758846714, lr=0.000537226556955712
2023-12-08 02:54:07   INFO  epoch: 71/72, acc_iter=276502, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:32:01/0:20:59, time_cost(all): 2 days, 15:58:52/0:33:43, loss=0.273819815168377, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=1.8444269850773192, lr=0.000520578710896292
2023-12-08 02:54:49   INFO  epoch: 71/72, acc_iter=276552, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:32:43/0:20:33, time_cost(all): 2 days, 15:59:34/0:33:12, loss=0.273760617742436, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=3.9358649274632427, lr=0.000503930864836872
2023-12-08 02:55:31   INFO  epoch: 71/72, acc_iter=276602, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:33:25/0:19:26, time_cost(all): 2 days, 16:00:16/0:32:49, loss=0.273701420316495, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=4.253372448970771, lr=0.000487283018777452
2023-12-08 02:56:13   INFO  epoch: 71/72, acc_iter=276652, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:34:07/0:20:27, time_cost(all): 2 days, 16:00:58/0:30:49, loss=0.273642222890554, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=4.733839589454258, lr=0.000470635172718031
2023-12-08 02:56:54   INFO  epoch: 71/72, acc_iter=276702, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:34:48/0:19:13, time_cost(all): 2 days, 16:01:39/0:30:22, loss=0.273583025464613, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=4.255465382518501, lr=0.000453987326658611
2023-12-08 02:57:36   INFO  epoch: 71/72, acc_iter=276752, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:35:30/0:17:21, time_cost(all): 2 days, 16:02:21/0:30:33, loss=0.273523828038672, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=4.08805140362754, lr=0.00043733948059919
2023-12-08 02:58:18   INFO  epoch: 71/72, acc_iter=276802, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:36:12/0:17:16, time_cost(all): 2 days, 16:03:03/0:29:00, loss=0.273464630612731, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=1.170293457520471, lr=0.00042069163453977
2023-12-08 02:59:00   INFO  epoch: 71/72, acc_iter=276852, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:36:54/0:17:34, time_cost(all): 2 days, 16:03:45/0:29:10, loss=0.27340543318679, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=3.6328149464468904, lr=0.000404043788480349
2023-12-08 02:59:42   INFO  epoch: 71/72, acc_iter=276902, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:37:36/0:16:33, time_cost(all): 2 days, 16:04:27/0:26:56, loss=0.273346235760849, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=1.1583562219718182, lr=0.000387395942420929
2023-12-08 03:00:23   INFO  epoch: 71/72, acc_iter=276952, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:38:17/0:14:48, time_cost(all): 2 days, 16:05:08/0:27:46, loss=0.273287038334908, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=2.913561703154354, lr=0.000370748096361509
2023-12-08 03:01:05   INFO  epoch: 71/72, acc_iter=277002, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:38:59/0:14:32, time_cost(all): 2 days, 16:05:50/0:26:55, loss=0.273227840908967, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=0.9945491936980773, lr=0.000354100250302089
2023-12-08 03:01:47   INFO  epoch: 71/72, acc_iter=277052, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:39:41/0:13:54, time_cost(all): 2 days, 16:06:32/0:25:16, loss=0.273168643483026, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=4.136621739271208, lr=0.000337452404242668
2023-12-08 03:02:29   INFO  epoch: 71/72, acc_iter=277102, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:40:23/0:13:18, time_cost(all): 2 days, 16:07:14/0:25:18, loss=0.273109446057085, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=0.726594724648555, lr=0.000320804558183248
2023-12-08 03:03:11   INFO  epoch: 71/72, acc_iter=277152, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:41:05/0:12:09, time_cost(all): 2 days, 16:07:56/0:25:39, loss=0.273050248631144, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=1.472642864967753, lr=0.000304156712123828
2023-12-08 03:03:52   INFO  epoch: 71/72, acc_iter=277202, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:41:46/0:11:57, time_cost(all): 2 days, 16:08:37/0:23:10, loss=0.272991051205203, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=2.6400767085836567, lr=0.000287508866064407
2023-12-08 03:04:34   INFO  epoch: 71/72, acc_iter=277252, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:42:28/0:10:54, time_cost(all): 2 days, 16:09:19/0:23:55, loss=0.272931853779262, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=3.5769515713460134, lr=0.000270861020004986
2023-12-08 03:05:16   INFO  epoch: 71/72, acc_iter=277302, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:43:10/0:10:08, time_cost(all): 2 days, 16:10:01/0:22:48, loss=0.272872656353321, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=3.218131770423581, lr=0.000254213173945566
2023-12-08 03:05:58   INFO  epoch: 71/72, acc_iter=277352, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:43:52/0:09:52, time_cost(all): 2 days, 16:10:43/0:22:51, loss=0.272813458927381, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=3.0760355676584434, lr=0.000237565327886146
2023-12-08 03:06:39   INFO  epoch: 71/72, acc_iter=277402, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:44:33/0:09:28, time_cost(all): 2 days, 16:11:24/0:21:02, loss=0.27275426150144, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=2.1507748667399733, lr=0.000220917481826725
2023-12-08 03:07:21   INFO  epoch: 71/72, acc_iter=277452, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:45:15/0:08:12, time_cost(all): 2 days, 16:12:06/0:19:42, loss=0.272695064075499, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=4.2161146867127135, lr=0.000204269635767305
2023-12-08 03:08:03   INFO  epoch: 71/72, acc_iter=277502, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 0:45:57/0:08:01, time_cost(all): 2 days, 16:12:48/0:20:08, loss=0.272635866649558, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=4.654609269869332, lr=0.000187621789707885
2023-12-08 03:08:45   INFO  epoch: 71/72, acc_iter=277552, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 0:46:39/0:06:58, time_cost(all): 2 days, 16:13:30/0:18:20, loss=0.272576669223617, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=3.7471424276330163, lr=0.000170973943648465
2023-12-08 03:09:27   INFO  epoch: 71/72, acc_iter=277602, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 0:47:21/0:06:25, time_cost(all): 2 days, 16:14:12/0:18:28, loss=0.272517471797676, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=3.4534493751076334, lr=0.000154326097589044
2023-12-08 03:10:08   INFO  epoch: 71/72, acc_iter=277652, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 0:48:02/0:05:29, time_cost(all): 2 days, 16:14:53/0:17:28, loss=0.272458274371735, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=0.9690950778822807, lr=0.000137678251529624
2023-12-08 03:10:50   INFO  epoch: 71/72, acc_iter=277702, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 0:48:44/0:05:09, time_cost(all): 2 days, 16:15:35/0:17:36, loss=0.272399076945794, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=3.008067511778857, lr=0.000121030405470204
2023-12-08 03:11:32   INFO  epoch: 71/72, acc_iter=277752, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 0:49:26/0:04:16, time_cost(all): 2 days, 16:16:17/0:16:44, loss=0.272339879519853, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=4.866073257354368, lr=0.000104382559410783
2023-12-08 03:12:14   INFO  epoch: 71/72, acc_iter=277802, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 0:50:08/0:03:36, time_cost(all): 2 days, 16:16:59/0:16:02, loss=0.272280682093912, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=1.2141570550805811, lr=8.7734713351362e-05
2023-12-08 03:12:55   INFO  epoch: 71/72, acc_iter=277852, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 0:50:49/0:02:57, time_cost(all): 2 days, 16:17:40/0:14:57, loss=0.272221484667971, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=4.068370339296535, lr=7.1086867291942e-05
2023-12-08 03:13:37   INFO  epoch: 71/72, acc_iter=277902, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 0:51:31/0:02:19, time_cost(all): 2 days, 16:18:22/0:13:48, loss=0.27216228724203, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.89(1.03), norm=3.600519559331334, lr=5.4439021232522e-05
2023-12-08 03:14:19   INFO  epoch: 71/72, acc_iter=277952, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 0:52:13/0:01:37, time_cost(all): 2 days, 16:19:04/0:13:22, loss=0.272103089816089, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=0.746398285420596, lr=3.7791175173101e-05
2023-12-08 03:15:01   INFO  epoch: 71/72, acc_iter=278002, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 0:52:55/0:00:51, time_cost(all): 2 days, 16:19:46/0:12:25, loss=0.272043892390148, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=2.484872547838111, lr=2.1143329113681e-05
2023-12-08 03:15:43   INFO  epoch: 71/72, acc_iter=278052, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 0:53:37/0:00:10, time_cost(all): 2 days, 16:20:28/0:11:46, loss=0.271984694964207, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=0.736776631535101, lr=4.495483054261e-06
2023-12-08 03:15:43   INFO  **********************End training picture_models/picture_nuscenes_ssl_seal_decoder_mask(offline_30e)**********************